Back to Arena

Recurrent Memory Transformer

by MIPT / DeepPavlov (Bulatov, Kuratov, Burtsev)

System Card

OrganizationMIPT / DeepPavlov (Bulatov, Kuratov, Burtsev)
Released2022-07
Architectureexternal-memory-network / Memory tokens passed between segments recurrently
DetailsAdds special memory tokens to each segment that pass information recurrently across segments of a long sequence, with no architectural changes beyond token-level memory slots.
Parameters
Domainlong-context
Open SourceYes
neurips-2022aaai-2024recurrentmemory-tokens1m-tokens

Capability Profile

Benchmark Scores

6 of 14 benchmarks
Data Transparency:6 estimated
Long-Context Retrieval
5/5
RULER
79.594pEstimated
NIAH
75.962pEstimated
LooGLE
7977pEstimated
LongBench
603pEstimated
∞Bench
77.841pEstimated
Multi-Turn Recall
0/2
LoCoMo
no data
MemoryBank
no data
Cross-Session Memory
0/1
LongMemEval
no data
Multi-Hop QA
1/3
BABILong
80.390pEstimated
MultiHop-RAG
no data
HotpotQA
no data
Agent Task Memory
0/1
AgentBench-Mem
no data
Personalization
0/1
PerLTQA
no data
Factuality / Grounding
0/1
RAGAS
no data