Back to Arena

Compressive Transformer

by DeepMind (Rae et al.)

System Card

OrganizationDeepMind (Rae et al.)
Released2019-11
Architectureexternal-memory-network / Compacted past-activation memory + TransformerXL short-term
DetailsMaintains a TransformerXL-style short-term memory of past activations, but compresses old activations into a compressed memory instead of discarding them. Introduces PG-19 benchmark.
Parameters
Domainlong-context
Open SourcePartial
WebsiteVisit
iclr-2020deepmindpg-19compression

Capability Profile

Benchmark Scores

6 of 14 benchmarks
Data Transparency:6 estimated
Long-Context Retrieval
5/5
RULER
75.483pEstimated
NIAH
75.138pEstimated
LooGLE
77.655pEstimated
LongBench
603pEstimated
∞Bench
82.878pEstimated
Multi-Turn Recall
0/2
LoCoMo
no data
MemoryBank
no data
Cross-Session Memory
0/1
LongMemEval
no data
Multi-Hop QA
1/3
BABILong
74.143pEstimated
MultiHop-RAG
no data
HotpotQA
no data
Agent Task Memory
0/1
AgentBench-Mem
no data
Personalization
0/1
PerLTQA
no data
Factuality / Grounding
0/1
RAGAS
no data