Memorizing Transformer
by Google Research (Wu, Rabe, Hutchins, Szegedy)
System Card
Organization: Google Research (Wu, Rabe, Hutchins, Szegedy)
Released: 2022-03
Architecture: external-memory-network / Non-differentiable kNN lookup over (key, value) pairs
Details: Approximate kNN lookup into a non-differentiable cache of recent attention (key, value) pairs. Scales the effective attention context up to 262k tokens.
Parameters: not listed
Domain: long-context, lifelong-learning
Open Source: No
Tags: iclr-2022-spotlight, knn, non-differentiable, 262k
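
The lookup described under Details can be illustrated with a small sketch. This is not the authors' implementation: the class name `KNNMemory`, its methods, and the `capacity` parameter are hypothetical, the search is exact rather than approximate, and the step that combines retrieved values with the model's ordinary attention is left out. It only shows the idea of a bounded cache of recent (key, value) pairs queried by top-k similarity.

```python
import math
from collections import deque


class KNNMemory:
    """Hypothetical bounded cache of (key, value) pairs with top-k lookup."""

    def __init__(self, capacity):
        # deque(maxlen=...) drops the oldest pair once the cache is full,
        # so only the most recent `capacity` pairs are kept.
        self.pairs = deque(maxlen=capacity)

    def add(self, keys, values):
        # Pairs are stored as plain floats. Nothing is tracked for
        # gradients, which is the sense in which the cache is
        # non-differentiable.
        for key, value in zip(keys, values):
            self.pairs.append((list(key), list(value)))

    def lookup(self, query, k):
        # Assumption: exact dot-product search for clarity. The card
        # describes an approximate kNN search instead.
        scored = [
            (sum(q * x for q, x in zip(query, key)), value)
            for key, value in self.pairs
        ]
        scored.sort(key=lambda item: item[0], reverse=True)
        top = scored[:k]
        if not top:
            return None
        # Softmax over the k retrieved scores only, then a weighted
        # average of the corresponding values.
        best = max(score for score, _ in top)
        weights = [math.exp(score - best) for score, _ in top]
        total = sum(weights)
        dim = len(top[0][1])
        return [
            sum(w * value[i] for w, (_, value) in zip(weights, top)) / total
            for i in range(dim)
        ]


memory = KNNMemory(capacity=4)
memory.add(keys=[[1.0, 0.0], [0.0, 1.0]], values=[[10.0, 0.0], [0.0, 10.0]])
nearest = memory.lookup(query=[5.0, 0.0], k=1)  # value of the closest key
```

Because each query attends to only `k` retrieved pairs, the per-query attention cost depends on `k` rather than on the cache size, which is what lets a cache of this kind grow far beyond the local attention window.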
Capability Profile
Benchmark Scores
6 of 14 benchmarks
Data Transparency: 6 estimated
Long-Context Retrieval: 3/5
Multi-Turn Recall: 1/2
  MemoryBank: no data
Cross-Session Memory: 1/1
Multi-Hop QA: 1/3
Agent Task Memory: 0/1
  AgentBench-Mem: no data
Personalization: 0/1
  PerLTQA: no data
Factuality / Grounding: 0/1
  RAGAS: no data
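
As a quick consistency check, the per-category fractions above can be tallied against the "6 of 14 benchmarks" summary. The sketch assumes each fraction means benchmarks with a score over benchmarks in that category, which the card does not state explicitly; the dictionary name `coverage` is only illustrative.

```python
# (benchmarks with a score, benchmarks in the category), copied from the card.
coverage = {
    "Long-Context Retrieval": (3, 5),
    "Multi-Turn Recall": (1, 2),
    "Cross-Session Memory": (1, 1),
    "Multi-Hop QA": (1, 3),
    "Agent Task Memory": (0, 1),
    "Personalization": (0, 1),
    "Factuality / Grounding": (0, 1),
}

scored = sum(have for have, _ in coverage.values())
total = sum(count for _, count in coverage.values())
print(f"{scored} of {total} benchmarks")
```

Under that reading the category counts add up to the summary line.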