MemR3

by 2025 (December submission)

System Card

Organization2025 (December submission)

Released2025-12

Architectureagentic-workflow / LangGraph closed-loop retrieve-reflect-answer router

DetailsAutonomous memory-retrieval controller built on LangGraph. Router chooses among retrieve/reflect/answer actions; a global evidence-gap tracker monitors what evidence is still missing. Agnostic to backend retrievers (vector, graph, hybrid).

Parameters—

Domainagent-memoryrag-retrieval

Open SourceNo

PaperView Paper

langgraphclosed-loopevidence-gaplocomo

Capability Profile

Benchmark Scores

6 of 14 benchmarks

Data Transparency:1 self-reported5 estimated

Long-Context Retrieval

1/5

RULER

no data

NIAH

no data

LooGLE

no data

LongBench

603pEstimated

∞Bench

no data

Multi-Turn Recall

1/2

LoCoMo

86.898pSelf-Reported

MemoryBank

no data

Cross-Session Memory

1/1

LongMemEval

83.994pEstimated

Multi-Hop QA

2/3

BABILong

no data

MultiHop-RAG

7579pEstimated

HotpotQA

8298pEstimated

Agent Task Memory

1/1

AgentBench-Mem

7226pEstimated

Personalization

0/1

PerLTQA

no data

Factuality / Grounding

0/1

RAGAS

no data

Sources:arXiv:2512.20237 Table 1 — GPT-4.1-mini + RAG backbone, LLM-as-Judge overall Arena estimate — derived from capability profile, not independently verified