Back to Arena
LongMem
by UCSB / Microsoft Research
System Card
OrganizationUCSB / Microsoft Research
Released2023-06
Architectureexternal-memory-network / Decoupled frozen encoder + residual side network
DetailsDecoupled network with a frozen LLM backbone as the memory encoder and a trainable residual side network as retriever/reader. Caches up to 65k tokens in a non-differentiable memory bank, mitigating memory staleness.
Parameters—
Domainlong-contextlifelong-learning
Open SourceYes
PaperView Paper
CodeRepository
neurips-2023side-networkin-context-learningmemory-bank
Capability Profile
Benchmark Scores
6 of 14 benchmarksMulti-Turn Recall1/2
MemoryBank
no dataCross-Session Memory1/1
Multi-Hop QA1/3
Agent Task Memory0/1
AgentBench-Mem
no dataPersonalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no dataSources:LongMem paper (arXiv:2306.07174); evaluated on BABILong: Testing the Limits of LLMs with Long-Context Reasoning-in-a-Haystack (AIRI, 2406)LongMem paper (arXiv:2306.07174); evaluated on InfiniteBench: Extending Long Context Evaluation Beyond 100K Tokens (Tsinghua / OpenBMB, 2402)LongMem paper (arXiv:2306.07174); evaluated on LoCoMo: Long-Term Conversational Memory Benchmark (Snap Research, 2402)LongMem paper (arXiv:2306.07174); evaluated on LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding (Tsinghua KEG, 2308)LongMem paper (arXiv:2306.07174); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410)LongMem paper (arXiv:2306.07174); evaluated on LooGLE: Can Long-Context Language Models Understand Long Contexts? (Peking University, 2311)