Back to Arena
Recurrent Memory Transformer
by MIPT / DeepPavlov (Bulatov, Kuratov, Burtsev)
System Card
OrganizationMIPT / DeepPavlov (Bulatov, Kuratov, Burtsev)
Released2022-07
Architectureexternal-memory-network / Memory tokens passed between segments recurrently
DetailsAdds special memory tokens to each segment that pass information recurrently across segments of a long sequence, with no architectural changes beyond token-level memory slots.
Parameters—
Domainlong-context
Open SourceYes
PaperView Paper
CodeRepository
neurips-2022aaai-2024recurrentmemory-tokens1m-tokens
Capability Profile
Benchmark Scores
6 of 14 benchmarksData Transparency:6 estimated
Long-Context Retrieval5/5
Multi-Turn Recall0/2
LoCoMo
no dataMemoryBank
no dataCross-Session Memory0/1
LongMemEval
no dataMulti-Hop QA1/3
Agent Task Memory0/1
AgentBench-Mem
no dataPersonalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no data