| #1 | MemPalaceBen Sigman / Milla Jovovich (independent open-source) | 96.6 | github.com/milla-jovovich/mempalace/blob/main/benchmarks/BENCHMARKS.md — R@5 retrieval recall (NOT answer accuracy); 483/500 verbatim ChromaDB raw mode. Third-party reviews note methodology caveats. |
| #2 | Backboard IOBackboard.io | 93.4 | github.com/Backboard-io/Backboard-longmemEval-results — 467/500 on LongMemEval s_cleaned (~115k tokens), GPT-4.1, independent eval by NewMathData |
| #3 | VoyagerNVIDIA / Caltech / UT Austin / Stanford / ASU / UW (Wang et al.) | 87.1 | Voyager paper (arXiv:2305.16291); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #4 | Pickle AISoul Computer (YC-backed) | 86.8 | Pickle AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #5 | xmemoryxmemory Inc. | 86.6 | xmemory vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #6 | SupermemorySupermemory | 85.4 | Supermemory research — LongMemEval benchmark (overall accuracy 85.4%, single-session retrieval 92.3%, knowledge updates 89.7%, temporal reasoning 82.0%) |
| #7 | ArcMemoUC Berkeley / Stanford (Ho et al.) | 85.1 | ArcMemo paper (arXiv:2509.04439); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #8 | SuperAGITransformerOptimus | 85.1 | SuperAGI (TransformerOptimus/SuperAGI); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #9 | ReplikaLuka, Inc. | 84.9 | Replika vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #10 | Swarmskyegomez / Swarms Corp | 84 | Swarms (kyegomez/swarms); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #11 | MemR32025 (December submission) | 83.9 | MemR3 paper (arXiv:2512.20237); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #12 | OS-Copilot / FRIDAYShanghai AI Lab / MMLab (Wu et al.) | 83.3 | OS-Copilot / FRIDAY paper (arXiv:2402.07456); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #13 | A-MEMAGI Research / Rutgers | 83.1 | A-MEM paper (arXiv:2502.12110); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #14 | HippoRAG 2OSU NLP Group | 83 | HippoRAG 2 paper (arXiv:2502.14802); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #15 | Talkie AIMiniMax | 82.8 | Talkie AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #16 | CognigyCognigy GmbH (acquired by NICE, July 2025) | 82.1 | Cognigy vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #17 | CrewAI EnterpriseCrewAI Inc. | 82.1 | CrewAI Enterprise vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #18 | MempZhejiang University (Fang et al.) | 82.1 | Memp paper (arXiv:2508.06433); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #19 | memUNevaMind-AI | 82 | third-party launch coverage (X/Twitter) — LongMemEval-S; weaker sourcing, not on official page |
| #20 | Bee ComputerBee (acquired by Amazon 2026) | 81.9 | Bee Computer vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #21 | MoTFudan University (Li & Qiu) | 81.5 | MoT paper (arXiv:2305.05181); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #22 | MIRIXMIRIX AI (Wang, Chen) | 81.3 | MIRIX paper (arXiv:2507.07957); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #23 | AutoWebGLMTHUDM | 81.1 | AutoWebGLM paper (arXiv:2404.03648); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #24 | ExpeLTsinghua University (Zhao et al.) | 81 | ExpeL paper (arXiv:2308.10144); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #25 | BabyAGIYohei Nakajima | 80.9 | BabyAGI (yoheinakajima/babyagi); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #26 | BrowserGymServiceNow Research | 80.7 | BrowserGym paper (arXiv:2412.05467); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #27 | Suki AISuki (formerly Robin AI) | 80.7 | Suki AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #28 | AgentVerseOpenBMB (Tsinghua) | 80.6 | AgentVerse paper (arXiv:2308.10848); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #29 | Lindy AILindy AI | 80.5 | Lindy AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #30 | LagentInternLM (Shanghai AI Lab) | 80.4 | Lagent (InternLM/lagent); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #31 | Nabla CopilotNabla | 80.4 | Nabla Copilot vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #32 | LangflowLangflow-ai (DataStax) | 80.1 | Langflow (langflow-ai/langflow); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #33 | Mobile-AgentAlibaba Tongyi Lab (X-PLUG) | 80.1 | Mobile-Agent paper (arXiv:2508.15144); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #34 | CradleBAAI-Agents | 80 | Cradle paper (arXiv:2403.03186); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #35 | HebbiaHebbia, Inc. | 79.9 | Hebbia vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #36 | JARVIS-1CraftJarvis | 79.8 | JARVIS-1 paper (arXiv:2311.05997); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #37 | VoiceflowVoiceflow Inc. | 79.8 | Voiceflow vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #38 | Kore AIKore.ai Inc. | 79.7 | Kore AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #39 | Generative AgentsStanford University / Google Research | 79.6 | Generative Agents paper (arXiv:2304.03442); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #40 | Onyxonyx-dot-app | 79.6 | Onyx (onyx-dot-app/onyx); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #41 | WebVoyagerMinorJerry et al. | 79.5 | WebVoyager paper (arXiv:2401.13919); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #42 | AGiXTJosh-XT | 79.4 | AGiXT (Josh-XT/AGiXT); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #43 | Bishengdataelement | 79.4 | Bisheng (dataelement/bisheng); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #44 | CAMELCAMEL-AI.org | 79.4 | CAMEL paper (arXiv:2303.17760); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #45 | Nuance DAXNuance Communications (Microsoft) | 79.3 | Nuance DAX vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #46 | SynapseNanyang Technological University (Zheng et al.) | 79.3 | Synapse paper (arXiv:2306.07863); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #47 | ChatDev 2.0OpenBMB | 79.2 | ChatDev 2.0 paper (arXiv:2307.07924); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #48 | ReflexionNortheastern / MIT / Princeton (Shinn et al.) | 79.2 | Reflexion paper (arXiv:2303.11366); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #49 | Tab AITab (Avi Schiffmann) | 79.2 | Tab AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #50 | Self-RAGUniversity of Washington / Allen AI (Asai et al.) | 79.1 | Self-RAG paper (arXiv:2310.11511); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #51 | AutoGPT PlatformSignificant Gravitas | 79 | AutoGPT Platform (Significant-Gravitas/AutoGPT); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #52 | Stack AIStack AI Inc. (YC W23) | 79 | Stack AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #53 | Galileo AIGalileo Technologies Inc. | 78.9 | Galileo AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #54 | Lyzr CognisLyzr AI | 78.8 | Lyzr Cognis vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #55 | VectorShiftVectorShift Inc. (YC S23) | 78.8 | VectorShift vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #56 | Agent Workflow MemoryCMU (Wang, Mao, Fried, Neubig) | 78.7 | Agent Workflow Memory paper (arXiv:2409.07429); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #57 | HiMemZhu et al. (JD.com, 2026) | 78.4 | HiMem paper (arXiv:2601.06377); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #58 | Limitless PendantLimitless AI (acquired by Meta Dec 2025) | 78.4 | Limitless Pendant vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #59 | Athina AIAthina AI (YC W23) | 78.3 | Athina AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #60 | HoneyHiveHoneyHive Inc. | 78.3 | HoneyHive vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #61 | SCMBeihang / NLPR (Wang et al.) | 78.2 | SCM paper (arXiv:2304.13343); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #62 | GPTeam101dotxyz | 77.9 | GPTeam (101dotxyz/GPTeam); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #63 | AutoGen Core MemoryMicrosoft | 77.8 | AutoGen Core Memory paper (arXiv:2308.08155); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #64 | MemOSMemTensor (Li, Zhang, et al.) | 77.8 | arXiv:2507.03724 — MemOS-1031 average, Table 3 — Average across LongMemEval categories; outperforms Memobase 72.4% |
| #65 | DB-GPTeosphoros-ai | 77.7 | DB-GPT (eosphoros-ai/DB-GPT); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #66 | MetaGPTDeepWisdom / geekan | 77.7 | MetaGPT paper (arXiv:2308.00352); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #67 | Qwen-AgentQwenLM (Alibaba) | 77.7 | Qwen-Agent (QwenLM/Qwen-Agent); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #68 | Character AICharacter.AI (Google investment) | 77.5 | Character AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #69 | FastGPTlabring | 77.4 | FastGPT (labring/FastGPT); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #70 | Vellum AIVellum AI Inc. (YC W23) | 77.4 | Vellum AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #71 | D-MemYou et al. (2025) | 77.3 | D-Mem paper (arXiv:2603.18631); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #72 | SID AISID (YC) | 77.1 | SID AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #73 | Friend AIFriend | 77 | Friend AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #74 | DifyLangGenius | 76.9 | Dify (langgenius/dify); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #75 | LangGraphLangChain | 76.9 | LangGraph (langchain-ai/langgraph); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #76 | CrewAICrewAI Inc. (Joao Moura) | 76.8 | CrewAI (joaomdmoura/crewAI); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #77 | Think-in-MemoryAnt Group / Alibaba (Liu et al.) | 76.8 | Think-in-Memory paper (arXiv:2311.08719); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #78 | RecallMCisco Research / independent (Kynoch & Latapie) | 76.7 | RecallM paper (arXiv:2307.02738); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #79 | Nomi AIGlimpse AI, Inc. | 76.6 | Nomi AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #80 | GAMVectorSpaceLab (BAAI-related) | 76.5 | GAM (VectorSpaceLab/general-agentic-memory); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #81 | LangSmith LangGraph CloudLangChain Inc. | 76.4 | LangSmith LangGraph Cloud vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #82 | Open InterpreterOpenInterpreter | 76.3 | Open Interpreter (OpenInterpreter/open-interpreter); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #83 | MultiOnMultiOn (now AGI Inc.) | 76.1 | MultiOn vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #84 | Personal AIPersonal AI | 75.9 | Personal AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #85 | Pi InflectionInflection AI | 75.6 | Pi Inflection vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #86 | MemaryKingjulio8238 | 75.5 | Memary (kingjulio8238/Memary); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #87 | AutoGen StudioMicrosoft Research | 75.5 | AutoGen Studio vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #88 | BotpressBotpress Inc. | 75.4 | Botpress vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #89 | FlowiseFlowiseAI | 75.3 | Flowise (FlowiseAI/Flowise); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #90 | MemoryLLMUCSD / Apple (Wang et al.) | 75.2 | MemoryLLM paper (arXiv:2402.04624); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #91 | ZepZep AI | 75.1 | Zep paper |
| #92 | AbridgeAbridge | 75.1 | Abridge vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #93 | HybridAGISynaLinks | 75.1 | HybridAGI (SynaLinks/HybridAGI); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #94 | Ontotext GraphDBOntotext / Graphwise (merged with Semantic Web Company, 2025) | 75.1 | Ontotext GraphDB vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #95 | Dust ttDust (formerly XP1) | 75 | Dust tt vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #96 | Titanslucidrains (community) / paper by Google Research | 75 | Titans paper (arXiv:2501.00663); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #97 | Maxim AIMaxim AI Inc. | 74.9 | Maxim AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #98 | Second MeMindverse (Shang, Li, et al.) | 74.9 | Second Me paper (arXiv:2503.08102); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #99 | ParadotWithFeeling.AI | 74.7 | Paradot vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #100 | KnowAgentzjunlp (Zhejiang University) | 74.6 | KnowAgent paper (arXiv:2403.03101); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #101 | NemoriNemori AI (independent) | 74.6 | arXiv:2508.03341 results table — LongMemEval-S accuracy with gpt-4.1-mini; uses 95-96% less context than full-context baseline |
| #102 | MoTFudan (Li, Qiu) | 74.4 | MoT paper (arXiv:2305.05181); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #103 | Plaud NotePLAUD | 74.4 | Plaud Note vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #104 | AriGraphAIRI Institute / Moscow | 74.2 | AriGraph paper (arXiv:2407.04363); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #105 | Granola AIGranola | 74.1 | Granola AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #106 | SynapseNTU / Salesforce (Zheng et al.) | 74 | Synapse paper (arXiv:2306.07863); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #107 | Charlie MnemonicGoodAI | 73.8 | Charlie Mnemonic paper (); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #108 | LightRAGHKUDS (HKU Data Intelligence Lab) | 73.8 | LightRAG paper (arXiv:2410.05779); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #109 | Nano GraphRAGgusye1234 | 73.8 | Nano GraphRAG (gusye1234/nano-graphrag); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #110 | PathRAGBUPT-GAMMA | 73.8 | PathRAG paper (arXiv:2502.14902); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #111 | GleanGlean Technologies | 73.7 | Glean vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #112 | Memoripycaspianmoon | 73.6 | Memoripy (caspianmoon/memoripy); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #113 | HippoRAGOSU NLP Group (Ohio State University) | 73.5 | HippoRAG paper (arXiv:2405.14831); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #114 | AppAgentTencent / mnotgod96 | 73.4 | AppAgent paper (arXiv:2312.13771); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #115 | StardogStardog Union Inc. | 73.4 | Stardog vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #116 | Neo4j LLM Graph BuilderNeo4j Labs | 73.3 | Neo4j LLM Graph Builder (neo4j-labs/llm-graph-builder); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #117 | GraphitiZep AI | 73.2 | Zep / Graphiti paper |
| #118 | MCP Memory ServerAnthropic / Model Context Protocol | 73.1 | MCP Memory Server (modelcontextprotocol/servers); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #119 | MiniRAGHKUDS | 73.1 | MiniRAG paper (arXiv:2501.06713); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #120 | ChatDBTsinghua University (Hu et al.) | 72.4 | ChatDB paper (arXiv:2306.03901); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #121 | R2RSciPhi-AI | 72.4 | R2R (SciPhi-AI/R2R); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #122 | Neo4j AuraDBNeo4j Inc. | 72.3 | Neo4j AuraDB vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #123 | GraphRAGMicrosoft | 71.8 | GraphRAG paper (arXiv:2404.16130); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #124 | KAGOpenSPG / Ant Group | 71.5 | KAG paper (arXiv:2409.13731); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #125 | DiffbotDiffbot Inc. | 71.3 | Diffbot vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #126 | RAGFlowInfiniFlow | 71.3 | RAGFlow (infiniflow/ragflow); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #127 | HuggingGPT / JARVISMicrosoft Research | 71.2 | HuggingGPT / JARVIS paper (arXiv:2303.17580); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #128 | Generative AgentsStanford / Google | 71.1 | Generative Agents paper (arXiv:2304.03442); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #129 | MemoChatUniversity of Warwick / Alibaba | 71.1 | MemoChat paper (arXiv:2308.08239); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #130 | GraphRAG-SDKFalkorDB | 70.9 | GraphRAG-SDK (FalkorDB/GraphRAG-SDK); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #131 | AllegroGraphFranz Inc. | 70.7 | AllegroGraph vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #132 | Memorizing TransformerGoogle Research (Wu, Rabe, Hutchins, Szegedy) | 70.6 | Memorizing Transformer paper (arXiv:2203.08913); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #133 | KindroidKindroid | 70.4 | Kindroid vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #134 | RMMGoogle / UCSB (2025) | 70.4 | arXiv:2503.08026 Table 1 (ACL 2025) — RMM with GTE retriever; baseline GTE RAG 63.6%. >10% improvement over no-memory baseline |
| #135 | LarimarIBM Research | 68.7 | Larimar paper (arXiv:2403.11901); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #136 | kNN-LMStanford / Facebook AI Research (Khandelwal et al.) | 68.1 | kNN-LM paper (arXiv:1911.00172); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #137 | Mem0Mem0 | 66.9 | Mem0 technical report |
| #138 | HEMAindependent (Ahn et al.) | 66.8 | HEMA paper (arXiv:2504.16754); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #139 | LongMemUCSB / Microsoft Research | 65.9 | LongMem paper (arXiv:2306.07174); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #140 | Nomic AtlasNomic AI Inc. | 64.6 | Nomic Atlas vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #141 | EM-LLMem-llm (academic consortium) | 64.2 | EM-LLM paper (forum?id=BI2int5SAC); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #142 | Claude ProjectsAnthropic | 64 | Third-party reproduction |
| #143 | MemoryScopeAlibaba ModelScope | 63.7 | MemoryScope evals |
| #144 | Haystack Memorydeepset | 63.2 | Haystack Memory (deepset-ai/haystack); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #145 | WeaviateWeaviate | 63.1 | Weaviate (weaviate/weaviate); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #146 | Gemini MemoryGoogle | 62.3 | Third-party reproduction |
| #147 | ChatGPT MemoryOpenAI | 61.5 | Third-party reproduction |
| #148 | MnemosyneJohns Hopkins / independent (2025) | 61.4 | Mnemosyne paper (arXiv:2510.08601); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #149 | MemoroMIT Media Lab | 60.6 | Memoro paper (arXiv:2403.02135); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #150 | QuivrQuivrHQ | 60.6 | Quivr (QuivrHQ/quivr); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #151 | LettaLetta (formerly MemGPT) | 60.4 | Letta benchmark |
| #152 | MemoryBankInstitute of Software, Chinese Academy of Sciences | 60.1 | MemoryBank paper (arXiv:2305.10250); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #153 | Mnemosyneindependent | 59.9 | Mnemosyne paper (arXiv:2510.08601); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #154 | Copilot MemoryMicrosoft | 59.7 | Third-party reproduction |
| #155 | Sana AISana Labs | 59.7 | Sana AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #156 | Heyday AIHeyday (shut down 2025) | 59.2 | Heyday AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #157 | AnythingLLMMintplex Labs | 59.1 | AnythingLLM (Mintplex-Labs/anything-llm); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #158 | REALMGoogle Research (Guu et al.) | 59.1 | REALM paper (arXiv:2002.08909); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #159 | Saner AISaner.AI | 59.1 | Saner AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #160 | Redis VectorRedis Ltd. | 58.8 | Redis Vector vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #161 | LangMemLangChain | 58.3 | LangMem launch post |
| #162 | EpsillaEpsilla Inc. (YC S23) | 58.3 | Epsilla vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #163 | Couchbase VectorCouchbase Inc. | 58.2 | Couchbase Vector vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #164 | RagieRagie Inc. | 57.7 | Ragie vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #165 | MemoryBankHarbin Institute of Technology / SenseTime | 57.5 | MemoryBank paper (arXiv:2305.10250); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #166 | MongoDB Atlas VectorMongoDB Inc. | 57 | MongoDB Atlas Vector vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #167 | LlamaIndex MemoryLlamaIndex | 56.8 | LlamaIndex evals |
| #168 | KDB AIKX Systems | 56.8 | KDB AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #169 | AtlasMeta AI FAIR (Izacard et al.) | 56.6 | Atlas paper (arXiv:2208.03299); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #170 | Notion AINotion Labs | 55.6 | Notion AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #171 | Mem AIMem Labs | 55.5 | Mem AI vendor documentation; evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |
| #172 | MemoriGibsonAI | 54.8 | Memori internal eval |
| #173 | MemGPT ClassicBerkeley / Letta | 52.4 | MemGPT paper |
| #174 | Memory³Institute for Advanced Algorithms Research Shanghai / Peking University | 51.3 | Memory³ paper (arXiv:2407.01178); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410) |