Back to Arena

Mamba

by CMU / Princeton (Gu, Dao)

System Card

OrganizationCMU / Princeton (Gu, Dao)
Released2023-12
Architectureexternal-memory-network / Selective state-space model (input-dependent SSM)
DetailsMakes SSM parameters input-dependent, allowing selective information propagation. Replaces attention/MLP blocks with a unified selective SSM block for linear-time sequence modeling and constant-size recurrent state.
Parameters
Domainlong-context
Open SourceYes
colm-2024ssmlinear-timeselectiverecurrent

Capability Profile

Benchmark Scores

6 of 14 benchmarks
Long-Context Retrieval
5/5
RULER
77.890p
NIAH
75.138p
LooGLE
81.191p
Multi-Turn Recall
0/2
LoCoMo
no data
MemoryBank
no data
Cross-Session Memory
0/1
LongMemEval
no data
Multi-Hop QA
1/3
BABILong
82.597p
MultiHop-RAG
no data
HotpotQA
no data
Agent Task Memory
0/1
AgentBench-Mem
no data
Personalization
0/1
PerLTQA
no data
Factuality / Grounding
0/1
RAGAS
no data