Ling 2.5 1T

MoE decoder architecture with Lightning Attention plus MLA attention mechanism.

Lightning Attention plus MLA·MoE · 63B active

63B active / 1T total|256K context|Lightning Attention plus MLA|MoE

Architecture Specifications

Parameters63B active / 1T total

Context Window256K

Decoder TypeMoE

AttentionLightning Attention plus MLA

Active Parameters63B

Layers80

Hidden Size8,192

Vocabulary Size157K

Release Date2026-02

CategoryMixture of Experts

OrganizationUnknown

Multi-head Latent AttentionLayer mix: 10 MLA + 70 Lightning AttentionKV cache: 11.2 KiB/token

Enterprise AI platform

Colaberry AI provides architecture specifications, benchmark comparisons, and deployment guidance for enterprise AI teams.