LLM Architecture Gallery Changelog
May 17, 2026
Added Gemma 4, Laguna, ZAYA1, and DeepSeek V4 explainers
Added cross-layer KV-sharing and per-layer embedding explainers for Gemma 4 E2B and E4B, added Laguna attention-budgeting, ZAYA1 compressed convolutional attention, and DeepSeek V4 mHC and CSA/HCA explainers, and linked recent architecture cards to the May 16 article sections.
-
Added explainers for cross-layer KV sharing, per-layer embeddings, Laguna XS.2 attention budgeting, ZAYA1-8B compressed convolutional attention, DeepSeek V4 manifold-constrained hyper-connections, and DeepSeek V4 CSA/HCA compressed attention.
-
Added View in article links for Gemma 4 E2B, Gemma 4 E4B, Laguna XS.2, ZAYA1-8B, and DeepSeek V4.
Â
May 14, 2026
Added Xiaomi MiMo-V2.5-Pro
Added the Xiaomi MiMo-V2.5-Pro 1.02T architecture card (and updated the active-parameter ratio and attention mechanism distribution tables).
Â
May 14, 2026
Added active-parameter ratio meta analysis
Added a meta-analysis block below the architecture cards and linked standalone tables for active-parameter ratios and attention mechanism distribution.
-
Added the standalone active-parameter ratio table for sparse MoE and hybrid models.
-
Added the standalone attention mechanism distribution table for visible gallery cards.
Â
May 12, 2026
Updated gallery metadata and performance
Restored missing architecture cards, corrected gallery metadata, and reduced card rendering cost with generated thumbnails, offscreen rendering hints, and cheaper sticky controls.
-
Removed the incorrect Gemma 4 Artificial Analysis score and source link from the GPT-2 XL card.
-
Added the missing Llama 3.2 3B, Qwen3 0.6B, and Qwen3 30B-A3B figure cards.
-
Added smaller generated thumbnails for architecture cards while keeping full-size images for zoom.
-
Removed backdrop blur from the sticky search and sort controls to reduce scroll work.
Â
May 10, 2026
Added May 10 architecture gallery updates
Added ZAYA1-8B and LongCat-Flash-Lite 68.5B-A3B to the public architecture gallery.
-
Clarified that the gallery focuses on text-only LLMs and language-model backbones.
Â
May 3, 2026
Added May 3 architecture gallery updates
Added Laguna XS.2, Tencent Hy3-preview 295B-A21B, Granite 4.1 30B, and a Multi-Token Prediction explainer to the public architecture gallery.
Â
May 1, 2026
Added May architecture gallery updates
Added Xiaomi MiMo-V2.5 310B, MiniMax M2.7 230B, and Ling 2.6 1T to the public architecture gallery; added AA Index score sorting and refreshed AA profile data for newly covered models.
-
Added an AA Index score sort option that ranks models with numeric Artificial Analysis scores first.
Â
April 26, 2026
Added new April architecture cards
Added Kimi K2.6, Qwen3.6 35B-A3B, Qwen3.6 27B, DeepSeek V4-Pro, and DeepSeek V4-Flash to the public architecture gallery. Details on mHC and compressed attention will be added at a later time.
Â
April 10, 2026
Implemented Gemma 4 E2B and E4B from scratch
Implemented Gemma 4 E2B and E4B from scratch (link in the architecture cards).
Â
April 9, 2026
Expanded Gemma 4 gallery coverage
Added Gemma 4 E2B and E4B architecture cards and filled missing AA Intelligence Index data for the larger Gemma 4 models and GLM-5.1.
Â
April 7, 2026
Added the GLM-5.1 architecture card
Added GLM-5.1 to the public architecture gallery.
Â
April 4, 2026
Added GLM-4.5-Air and INTELLECT-3
Added standalone technical-report cards for GLM-4.5-Air and INTELLECT-3.
Â
April 2, 2026
Added Gemma 4 architecture cards
Added two new Gemma 4 architecture cards to the gallery.
Â
March 29, 2026
Added a digital poster purchase link
Added a Gumroad option alongside the existing Redbubble poster listing.
- Added a Gumroad link for the print-ready digital poster alongside the existing Redbubble physical poster option.
Â
March 27, 2026
Expanded gallery controls and benchmark metadata
Added Artificial Analysis Intelligence Index data and more flexible ways to browse dense cards.
-
Added Artificial Analysis Intelligence Index scores for each model where applicable.
-
Added a Detailed / Compact view switch for the main card grid.
-
Added per-card Show details / Show less toggles in compact view.
Â
March 26, 2026
Added architecture comparison and sorting tools
Turned the gallery into a more interactive comparison tool and expanded several card fact fields.
-
Added a side-by-side architecture diff with Model A / Model B selectors and per-card compare actions.
-
Added a Sort by control for Release date (newest first), Release date (oldest first), A-Z, and Size.
-
Added active-parameter percentages to MoE Scale fields where available.
-
Added KV cache / token (bf16) estimates to the fact sheets.
-
Added Layer mix to the fact sheets.
Â
March 25, 2026
Added the Phi-4 architecture card
Added Phi-4 to the public gallery.
Â
March 20, 2026
Added four new architecture cards
Added Nemotron 3 Nano 4B, Kimi K2.5, Mistral Small 4, and xLSTM 7B.
Â
March 17, 2026
Added license metadata
Added license info and links to license files where applicable.
- Added license info and links to license files where applicable.