Back to Arena
AnythingLLM
by Mintplex Labs
System Card
OrganizationMintplex Labs
Released2023-06
Architecturevector-rag / Full-stack ChatGPT clone with workspace memory
DetailsMonorepo (Vite/React + Node Express + collector + embeddable widget). Workspaces encapsulate document sets and chat history; supports many LLMs + vector DBs out of the box.
Parameters—
Domainrag-retrievalagent-memory
Open SourceYes
WebsiteVisit
CodeRepository
workspacesbrowser-extensionmulti-useragents
Capability Profile
Benchmark Scores
6 of 14 benchmarksLong-Context Retrieval1/5
Multi-Turn Recall1/2
MemoryBank
no dataCross-Session Memory1/1
Multi-Hop QA2/3
Agent Task Memory1/1
Personalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no dataSources:AnythingLLM (Mintplex-Labs/anything-llm); evaluated on HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering (Stanford / CMU, 1809)AnythingLLM (Mintplex-Labs/anything-llm); evaluated on MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries (HKUST, 2401)AnythingLLM (Mintplex-Labs/anything-llm); evaluated on AgentBench Memory Track (Tsinghua KEG, 2308)AnythingLLM (Mintplex-Labs/anything-llm); evaluated on LoCoMo: Long-Term Conversational Memory Benchmark (Snap Research, 2402)AnythingLLM (Mintplex-Labs/anything-llm); evaluated on LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding (Tsinghua KEG, 2308)AnythingLLM (Mintplex-Labs/anything-llm); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410)