Landmark Attention
by EPFL (Mohtashami, Jaggi)
System Card
Organization: EPFL (Mohtashami, Jaggi)
Released: 2023-05
Architecture: kv-cache-extension / Block-level landmark tokens with direct attention retrieval
Details: Inserts a landmark token representing each input block and trains attention to use these tokens for selecting relevant blocks. Retrieval flows through the model's own attention mechanism, preserving random access to the full context.
Parameters: n/a
Domain: long-context
Open Source: Yes
Paper: View Paper
Code: Repository
Tags: neurips-2023, random-access, block, retrieval-by-attention
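The block-selection idea described under Details can be illustrated with a small sketch. This is a simplified, inference-style illustration and not the authors' implementation: the function name `landmark_retrieve`, its parameters, and the hard top-k selection are assumptions made for this example, and the landmark keys are passed in by the caller rather than produced by trained landmark tokens.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def landmark_retrieve(query, keys, values, landmark_keys, block_size, top_k):
    """Attend only over the top_k blocks whose landmark key best matches the query.

    landmark_keys[i] stands in for the key of the landmark token of block i.
    Hypothetical helper: in a trained model these keys would come from the
    landmark tokens themselves; here the caller supplies them.
    """
    # Score every block through its landmark key.
    block_scores = [dot(query, lk) for lk in landmark_keys]
    # Pick the top_k highest-scoring blocks, then restore original order.
    ranked = sorted(range(len(block_scores)),
                    key=lambda i: block_scores[i], reverse=True)
    chosen = sorted(ranked[:top_k])
    # Collect the token positions that belong to the chosen blocks.
    positions = [p for b in chosen
                 for p in range(b * block_size,
                                min((b + 1) * block_size, len(keys)))]
    # Ordinary scaled dot-product attention restricted to those positions.
    scale = 1.0 / math.sqrt(len(query))
    weights = softmax([dot(query, keys[p]) * scale for p in positions])
    dim = len(values[0])
    output = [sum(w * values[p][d] for w, p in zip(weights, positions))
              for d in range(dim)]
    return chosen, output


# Two blocks of two tokens each; the query matches the landmark of block 1.
query = [1.0, 0.0]
keys = [[0.0, 1.0], [0.0, 1.0], [1.0, 0.0], [1.0, 0.0]]
values = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
landmark_keys = [[0.0, 1.0], [1.0, 0.0]]
chosen, output = landmark_retrieve(query, keys, values, landmark_keys,
                                   block_size=2, top_k=1)
```

Because the block choice is driven by the query's own scores against the landmark keys, any block in the context remains reachable, which is the random-access property the card refers to.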
Capability Profile
Benchmark Scores
6 of 14 benchmarks. Data Transparency: 6 estimated.
Long-Context Retrieval: 5/5
Multi-Turn Recall: 0/2
  LoCoMo: no data
  MemoryBank: no data
Cross-Session Memory: 0/1
  LongMemEval: no data
Multi-Hop QA: 1/3
Agent Task Memory: 0/1
  AgentBench-Mem: no data
Personalization: 0/1
  PerLTQA: no data
Factuality / Grounding: 0/1
  RAGAS: no data