AgentScope
by ModelScope (Alibaba)
System Card
Organization: ModelScope (Alibaba)
Released: 2024-01
Architecture: agentic-workflow / Production agent framework with pluggable memory
Details: Production-ready agent framework offering InMemoryMemory, database-backed memory, memory compression, long-term memory via ReMe integration, SQLite session persistence, and OpenTelemetry support. It supports ReAct agents and is MCP-native.
Parameters: n/a
Domain: agent-memory, long-context
Open Source: Yes
Paper: arXiv:2402.14034
Tags: alibaba, production, reme-integration, memory-compression
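To make the "memory compression" entry above concrete, here is a minimal sketch of one common approach: once a buffer exceeds a size limit, older messages are folded into a single summary message while the most recent ones are kept verbatim. This is an illustration only, not AgentScope's API. The names `Message`, `SketchMemory`, `truncate_summary`, `max_messages`, and `keep_recent` are all hypothetical, and the summarizer is a stand-in for what would likely be an LLM call in a real system.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Message:
    role: str
    content: str


def truncate_summary(messages: List[Message]) -> str:
    """Placeholder summarizer (assumption): joins a short prefix of each message."""
    return " | ".join(f"{m.role}: {m.content[:20]}" for m in messages)


class SketchMemory:
    """Hypothetical in-memory buffer with compression; not AgentScope's InMemoryMemory."""

    def __init__(
        self,
        max_messages: int = 4,
        keep_recent: int = 2,
        summarize: Callable[[List[Message]], str] = truncate_summary,
    ) -> None:
        if not 1 <= keep_recent < max_messages:
            raise ValueError("need 1 <= keep_recent < max_messages")
        self.max_messages = max_messages
        self.keep_recent = keep_recent
        self.summarize = summarize
        self.messages: List[Message] = []

    def add(self, msg: Message) -> None:
        """Append a message and compress once the buffer exceeds its limit."""
        self.messages.append(msg)
        if len(self.messages) > self.max_messages:
            self._compress()

    def _compress(self) -> None:
        """Replace everything except the most recent messages with one summary."""
        old = self.messages[: -self.keep_recent]
        recent = self.messages[-self.keep_recent:]
        summary = Message("system", "Summary: " + self.summarize(old))
        self.messages = [summary] + recent


# Usage: the fifth message pushes the buffer over its limit and triggers compression.
mem = SketchMemory(max_messages=4, keep_recent=2)
for i in range(5):
    mem.add(Message("user", f"msg {i}"))
```

Passing the summarizer as a parameter mirrors the "pluggable" idea in the card: the storage policy stays fixed while the compression strategy can be swapped.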
Capability Profile
Benchmark Scores
6 of 14 benchmarks

Multi-Turn Recall: 1/2
  MemoryBank: no data
Cross-Session Memory: 0/1
  LongMemEval: no data
Agent Task Memory: 1/1
Personalization: 0/1
  PerLTQA: no data
Factuality / Grounding: 0/1
  RAGAS: no data

Sources: AgentScope paper (arXiv:2402.14034), evaluated on:
  AgentBench Memory Track (Tsinghua KEG, 2308)
  BABILong: Testing the Limits of LLMs with Long-Context Reasoning-in-a-Haystack (AIRI, 2406)
  HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering (Stanford / CMU, 1809)
  InfiniteBench: Extending Long Context Evaluation Beyond 100K Tokens (Tsinghua / OpenBMB, 2402)
  LoCoMo: Long-Term Conversational Memory Benchmark (Snap Research, 2402)
  LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding (Tsinghua KEG, 2308)