Back to Arena
Mobile-Agent
by Alibaba Tongyi Lab (X-PLUG)
System Card
OrganizationAlibaba Tongyi Lab (X-PLUG)
Released2024-01
Architectureagentic-workflow / Multi-platform GUI agent with long-horizon memory
DetailsFamily of GUI agents (GUI-Owl-1.5 models 2B-235B) for desktop, mobile, and browser automation. v3.5 emphasizes end-to-end task execution and long-horizon memory across cross-app workflows.
Parameters—
Domainagent-memory
Open SourceYes
PaperView Paper
CodeRepository
gui-agentgui-owllong-horizonalibaba
Capability Profile
Benchmark Scores
5 of 14 benchmarksLong-Context Retrieval0/5
RULER
no dataNIAH
no dataLooGLE
no dataLongBench
no data∞Bench
no dataMulti-Turn Recall1/2
MemoryBank
no dataCross-Session Memory1/1
Multi-Hop QA2/3
Agent Task Memory1/1
Personalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no dataSources:Mobile-Agent paper (arXiv:2508.15144); evaluated on AgentBench Memory Track (Tsinghua KEG, 2308)Mobile-Agent paper (arXiv:2508.15144); evaluated on HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering (Stanford / CMU, 1809)Mobile-Agent paper (arXiv:2508.15144); evaluated on LoCoMo: Long-Term Conversational Memory Benchmark (Snap Research, 2402)Mobile-Agent paper (arXiv:2508.15144); evaluated on LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Salesforce AI Research, 2410)Mobile-Agent paper (arXiv:2508.15144); evaluated on MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries (HKUST, 2401)