Back to Arena
RAPTOR
by Stanford (Sarthi, Abdullah et al.)
System Card
OrganizationStanford (Sarthi, Abdullah et al.)
Released2024-01
Architecturehierarchical-summary / Recursive bottom-up clustering + summarization tree
DetailsRecursively embeds, clusters, and summarizes chunks to build a multi-level tree. Inference retrieves across tree levels, integrating information at multiple abstraction granularities.
Parameters—
Domainrag-retrievallong-context
Open SourceYes
PaperView Paper
CodeRepository
iclr-2024treerecursive-summarymulti-hop
Capability Profile
Benchmark Scores
6 of 14 benchmarksMulti-Turn Recall0/2
LoCoMo
no dataMemoryBank
no dataCross-Session Memory0/1
LongMemEval
no dataAgent Task Memory0/1
AgentBench-Mem
no dataPersonalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no dataSources:RAPTOR paper (arXiv:2401.18059); evaluated on LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding (Tsinghua KEG, 2308)RAPTOR paper (arXiv:2401.18059); evaluated on RULER: What's the Real Context Size of Your Long-Context Language Models (NVIDIA, 2404)RAPTOR paper (arXiv:2401.18059); evaluated on BABILong: Testing the Limits of LLMs with Long-Context Reasoning-in-a-Haystack (AIRI, 2406)RAPTOR paper (arXiv:2401.18059); evaluated on HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering (Stanford / CMU, 1809)RAPTOR paper (arXiv:2401.18059); evaluated on InfiniteBench: Extending Long Context Evaluation Beyond 100K Tokens (Tsinghua / OpenBMB, 2402)RAPTOR paper (arXiv:2401.18059); evaluated on LooGLE: Can Long-Context Language Models Understand Long Contexts? (Peking University, 2311)