Compressive Transformer
by DeepMind (Rae et al.)
System Card
Organization: DeepMind (Rae et al.)
Released: 2019-11
Architecture: external-memory-network / Compacted past-activation memory + Transformer-XL short-term memory
Details: Maintains a Transformer-XL-style short-term memory of past activations, but instead of discarding the oldest activations it compresses them into a secondary, coarser memory. Introduces the PG-19 benchmark.
Parameters: not listed
Domain: long-context
Open Source: Partial
Tags: iclr-2020, deepmind, pg-19, compression
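
The memory scheme described under Details can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the sizes, and the choice of mean pooling as the compression function are assumptions made for illustration (the paper considers several compression functions, and the real model keeps such memories per layer and attends over both of them).

```python
import numpy as np


def mean_pool_compress(old, rate):
    """Compress `old` (n, d) to (n // rate, d) by averaging groups of `rate` rows.

    Mean pooling is an assumed stand-in for the compression function.
    """
    n, d = old.shape
    usable = (n // rate) * rate  # drop a ragged tail, if any
    return old[:usable].reshape(n // rate, rate, d).mean(axis=1)


def update_memories(memory, comp_memory, new_acts, mem_size, comp_size, rate):
    """Append new activations; compress evicted ones instead of discarding them."""
    memory = np.concatenate([memory, new_acts], axis=0)
    overflow = max(memory.shape[0] - mem_size, 0)
    evicted, memory = memory[:overflow], memory[overflow:]
    if overflow:
        compressed = mean_pool_compress(evicted, rate)
        # The compressed memory is itself a FIFO: keep only the newest slots.
        comp_memory = np.concatenate([comp_memory, compressed], axis=0)[-comp_size:]
    return memory, comp_memory


# Hypothetical sizes: 4-dim activations, 6 short-term slots, 4 compressed slots.
d, mem_size, comp_size, rate = 4, 6, 4, 3
memory = np.zeros((0, d))
comp_memory = np.zeros((0, d))
for step in range(4):
    segment = np.full((3, d), float(step))  # stand-in for one segment's activations
    memory, comp_memory = update_memories(
        memory, comp_memory, segment, mem_size, comp_size, rate
    )
```

With these sizes, the short-term memory fills after two segments; each later segment evicts the three oldest rows, which are averaged into a single compressed slot rather than dropped.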
Capability Profile
Benchmark Scores
6 of 14 benchmarks
Data Transparency: 6 estimated

Long-Context Retrieval: 5/5
Multi-Turn Recall: 0/2
  LoCoMo: no data
  MemoryBank: no data
Cross-Session Memory: 0/1
  LongMemEval: no data
Multi-Hop QA: 1/3
Agent Task Memory: 0/1
  AgentBench-Mem: no data
Personalization: 0/1
  PerLTQA: no data
Factuality / Grounding: 0/1
  RAGAS: no data