ICAE
by Microsoft Research (Ge et al.)
System Card
Organization: Microsoft Research (Ge et al.)
Released: 2023-07
Architecture: kv-cache-extension / LoRA encoder + frozen decoder, compressed memory slots
Details: A LoRA-adapted encoder compresses long contexts into a few memory-slot tokens that the frozen base LLM can condition on. Pretrained with autoencoding + LM objectives, then instruction-tuned.
Parameters: not reported
Domain: long-context
Open Source: Yes
Paper: View Paper
Code: Repository
Tags: iclr-2024, autoencoder, 4x-compression, lora
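The compression flow described under Details can be sketched in a few lines. This is a minimal, dependency-free illustration, not the released implementation: it assumes the encoder sees the context followed by a block of memory tokens and that only the outputs at the memory positions are kept as memory slots. The names `num_memory_slots`, `compress`, `build_decoder_input`, and `toy_encoder` are hypothetical, and `toy_encoder` (a causal running mean) is only a stand-in for the LoRA-adapted LLM. The ratio of 4 is taken from the 4x-compression tag.

```python
import math
from typing import Callable, List

Vector = List[float]
Encoder = Callable[[List[Vector]], List[Vector]]


def num_memory_slots(context_len: int, ratio: int = 4) -> int:
    """Number of memory slots for a context at the given compression ratio."""
    return math.ceil(context_len / ratio)


def toy_encoder(seq: List[Vector]) -> List[Vector]:
    """Stand-in for the LoRA-adapted encoder (assumption): causal running mean.

    Each output position only depends on inputs at or before that position.
    """
    if not seq:
        return []
    running = [0.0] * len(seq[0])
    out = []
    for i, vec in enumerate(seq, start=1):
        running = [r + v for r, v in zip(running, vec)]
        out.append([r / i for r in running])
    return out


def compress(context: List[Vector], memory_tokens: List[Vector],
             encoder: Encoder) -> List[Vector]:
    """Run the encoder on context + memory tokens; keep only memory positions."""
    hidden = encoder(context + memory_tokens)
    return hidden[len(context):]


def build_decoder_input(memory_slots: List[Vector],
                        prompt: List[Vector]) -> List[Vector]:
    """The frozen decoder conditions on the memory slots followed by the prompt."""
    return memory_slots + prompt


# Toy usage: 512 context embeddings compressed at 4x into 128 slots.
context = [[float(i), 1.0] for i in range(512)]
k = num_memory_slots(len(context))
memory_tokens = [[0.0, 0.0] for _ in range(k)]
slots = compress(context, memory_tokens, toy_encoder)
prompt = [[1.0, 1.0]] * 3
decoder_in = build_decoder_input(slots, prompt)
print(len(context), "->", len(slots), "slots;", len(decoder_in), "decoder positions")
# prints: 512 -> 128 slots; 131 decoder positions
```

The point of the sketch is the bookkeeping: the decoder never sees the 512 original positions, only the 128 slots plus the prompt. In a real setup the encoder and decoder would share base weights, with only the LoRA adapter and memory-token embeddings trained.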
Capability Profile
Benchmark Scores
6 of 14 benchmarks
Data Transparency: 6 estimated
Long-Context Retrieval: 5/5
Multi-Turn Recall: 0/2
  LoCoMo: no data
  MemoryBank: no data
Cross-Session Memory: 0/1
  LongMemEval: no data
Multi-Hop QA: 1/3
Agent Task Memory: 0/1
  AgentBench-Mem: no data
Personalization: 0/1
  PerLTQA: no data
Factuality / Grounding: 0/1
  RAGAS: no data
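As a quick consistency check, the per-category counts above can be summed to confirm the "6 of 14 benchmarks" headline. This is a small sketch: the dictionary simply transcribes the (scored, total) pairs from this card, and the variable names are illustrative.

```python
# (benchmarks with a score, benchmarks in the category), copied from the card.
coverage = {
    "Long-Context Retrieval": (5, 5),
    "Multi-Turn Recall": (0, 2),
    "Cross-Session Memory": (0, 1),
    "Multi-Hop QA": (1, 3),
    "Agent Task Memory": (0, 1),
    "Personalization": (0, 1),
    "Factuality / Grounding": (0, 1),
}

scored = sum(s for s, _ in coverage.values())
total = sum(t for _, t in coverage.values())
# Categories where no benchmark has a score at all.
uncovered = [name for name, (s, _) in coverage.items() if s == 0]

print(f"{scored} of {total} benchmarks")  # prints: 6 of 14 benchmarks
print(f"{len(uncovered)} of {len(coverage)} categories have no data")
```

The totals match the headline, and five of the seven categories have no scored benchmark, so the profile mostly reflects long-context retrieval.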