Scissorhands
by Rice / Stanford / Meta (Liu et al.)
System Card
Organization: Rice / Stanford / Meta (Liu et al.)
Released: 2023-05
Architecture: kv-cache-extension / Persistence-of-importance KV pruning
Details: Based on the "persistence of importance" hypothesis: tokens that had a high attention impact in past steps tend to remain pivotal in future generation steps. The method keeps the KV cache within a fixed budget by retaining pivotal tokens with higher probability than other tokens.
Parameters: (none given)
Domain: long-context
Open Source: No
Tags: neurips-2023, pruning, kv, quantization-compatible
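
The fixed-budget idea in the Details field can be illustrated with a small toy cache. This is a minimal sketch under stated assumptions, not the authors' implementation: the class `BudgetedKVCache`, its `recent` window, and the use of accumulated attention as the importance score are all hypothetical choices made for illustration. The card describes retention as probabilistic; for clarity this sketch swaps in a deterministic rule that always evicts the least-attended older token.

```python
from dataclasses import dataclass, field


@dataclass
class BudgetedKVCache:
    """Toy fixed-budget KV cache (hypothetical helper, not the paper's code)."""
    budget: int                 # maximum number of cached tokens
    recent: int = 1             # newest tokens that are never evicted (assumption)
    entries: dict = field(default_factory=dict)     # position -> (key, value)
    importance: dict = field(default_factory=dict)  # position -> accumulated attention

    def __post_init__(self) -> None:
        if self.budget <= self.recent:
            raise ValueError("budget must be larger than the recent window")

    def observe(self, attention: dict) -> None:
        """Accumulate the attention weight each cached position received this step."""
        for pos, weight in attention.items():
            if pos in self.importance:
                self.importance[pos] += weight

    def append(self, pos: int, key, value) -> None:
        """Cache a new token; evict one older token if the budget is exceeded."""
        self.entries[pos] = (key, value)
        self.importance[pos] = 0.0
        if len(self.entries) > self.budget:
            self._evict()

    def _evict(self) -> None:
        positions = sorted(self.entries)
        # Only tokens outside the recent window are eviction candidates.
        candidates = positions[:-self.recent] if self.recent else positions
        # Deterministic simplification: drop the least-attended candidate.
        victim = min(candidates, key=lambda p: self.importance[p])
        del self.entries[victim]
        del self.importance[victim]


cache = BudgetedKVCache(budget=4, recent=1)
for pos in range(4):
    cache.append(pos, key=f"k{pos}", value=f"v{pos}")
# Made-up attention weights for one decoding step.
cache.observe({0: 0.6, 1: 0.02, 2: 0.3, 3: 0.08})
cache.append(4, key="k4", value="v4")
print(sorted(cache.entries))  # [0, 2, 3, 4]
```

Position 1 received the least attention, so it is the one dropped when the fifth token arrives, while the heavily attended position 0 survives. A probabilistic variant would instead sample the victim with probability decreasing in its importance score.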
Capability Profile
Benchmark Scores
6 of 14 benchmarks
Data Transparency: 6 estimated
Long-Context Retrieval: 5/5
Multi-Turn Recall: 0/2
  LoCoMo: no data
  MemoryBank: no data
Cross-Session Memory: 0/1
  LongMemEval: no data
Multi-Hop QA: 1/3
Agent Task Memory: 0/1
  AgentBench-Mem: no data
Personalization: 0/1
  PerLTQA: no data
Factuality / Grounding: 0/1
  RAGAS: no data