Llama 3.2 3B

Dense decoder architecture with GQA attention mechanism.

GQA·SwiGLU

3B|128K context|GQA|Dense

Architecture Specifications

Parameters3B

Context Window128K

Decoder TypeDense

AttentionGQA

Release DateUnknown

CategoryEfficient & Small

OrganizationMeta

Grouped Query AttentionLayer mix: 28 GQAKV cache: 112 KiB/token

Enterprise AI platform

Colaberry AI provides architecture specifications, benchmark comparisons, and deployment guidance for enterprise AI teams.