Gemma 4 31B

Dense decoder architecture with GQA + QK-Norm + SWA attention mechanism.

GQA + QK-Norm + SWA·SwiGLU

30.7B|256K context|GQA + QK-Norm + SWA|Dense

Architecture Specifications

Parameters30.7B

Context Window256K

Decoder TypeDense

AttentionGQA + QK-Norm + SWA

Vocabulary Size262K

Release Date2026-04

CategoryLong Context

OrganizationGoogle

Grouped Query AttentionSliding Window AttentionQK normalizationLayer mix: 50 sliding-window + 10 globalKV cache: 840 KiB/token

Enterprise AI platform

Colaberry AI provides architecture specifications, benchmark comparisons, and deployment guidance for enterprise AI teams.