
The LLM Architecture Gallery: Mapping the Evolution of Open-Weight Models
383
A comprehensive technical reference gallery documenting the architectural evolution and specifications of modern open-weight large language models.
Mixture of Experts (MoE) model architectures that route tokens to specialized expert subnetworks, enabling massive parameter counts with sparse activation for improved efficiency and scaling.

A comprehensive technical reference gallery documenting the architectural evolution and specifications of modern open-weight large language models.
Qwen3-Next matches larger models while slashing training cost and delivering order-of-magnitude faster long-context inference via a hybrid attention + ultra-sparse MoE design with native MTP.