Sarvam 30B

MoE decoder architecture with GQA + QK-Norm attention mechanism.

GQA + QK-Norm·MoE · 2.4B active

2.4B active / 30B total|131K context|GQA + QK-Norm|MoE

Architecture Specifications

Parameters2.4B active / 30B total

Context Window131K

Decoder TypeMoE

AttentionGQA + QK-Norm

Active Parameters2.4B

Layers19

Hidden Size4,096

Vocabulary Size262K

Release Date2026-03

CategoryMixture of Experts

OrganizationUnknown

Grouped Query AttentionQK normalizationExpert routingLayer mix: 19 GQAKV cache: 19 KiB/token

Enterprise AI platform

Colaberry AI provides architecture specifications, benchmark comparisons, and deployment guidance for enterprise AI teams.