Back to Arena
ReMe
by ModelScope (Alibaba)
System Card
OrganizationModelScope (Alibaba)
Released2025-01
Architecturehybrid / File-based + vector memory with ReAct compactor
DetailsDual-mode memory: ReMeLight writes Markdown/JSONL files (MEMORY.md, daily journals, dialog logs, tool-result cache), while vector store uses hybrid BM25 + embeddings. A ReAct compactor summarizes into Goal/Progress/Decisions/Next-Steps.
Parameters—
Domainagent-memorylong-context
Open SourceYes
CodeRepository
file-memoryreact-compactorbm25-hybridalibabalocomo
Capability Profile
Benchmark Scores
6 of 14 benchmarksMulti-Turn Recall1/2
MemoryBank
no dataCross-Session Memory0/1
LongMemEval
no dataAgent Task Memory1/1
Personalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no dataSources:ReMe (modelscope/ReMe); evaluated on AgentBench Memory Track (Tsinghua KEG, 2308)ReMe (modelscope/ReMe); evaluated on BABILong: Testing the Limits of LLMs with Long-Context Reasoning-in-a-Haystack (AIRI, 2406)ReMe (modelscope/ReMe); evaluated on HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering (Stanford / CMU, 1809)ReMe (modelscope/ReMe); evaluated on InfiniteBench: Extending Long Context Evaluation Beyond 100K Tokens (Tsinghua / OpenBMB, 2402)ReMe (modelscope/ReMe); evaluated on LoCoMo: Long-Term Conversational Memory Benchmark (Snap Research, 2402)ReMe (modelscope/ReMe); evaluated on LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding (Tsinghua KEG, 2308)