Back to Arena

AppAgent

by Tencent / mnotgod96

System Card

OrganizationTencent / mnotgod96
Released2023-12
Architectureknowledge-base / UI documentation knowledge base from exploration
DetailsMultimodal LLM agent that explores mobile apps (or learns from demos), generating UI documentation stored as a knowledge base used for later task execution via touchscreen actions. Works with GPT-4V / Qwen-VL.
Parameters
Domainagent-memoryknowledge-graph
Open SourceYes
mobile-agentui-docgpt-4vexploration

Capability Profile

Benchmark Scores

6 of 14 benchmarks
Data Transparency:6 estimated
Long-Context Retrieval
0/5
RULER
no data
NIAH
no data
LooGLE
no data
LongBench
no data
∞Bench
no data
Multi-Turn Recall
1/2
LoCoMo
73.137pEstimated
MemoryBank
no data
Cross-Session Memory
1/1
LongMemEval
73.434pEstimated
Multi-Hop QA
2/3
BABILong
no data
MultiHop-RAG
7579pEstimated
HotpotQA
76.480pEstimated
Agent Task Memory
1/1
AgentBench-Mem
7226pEstimated
Personalization
0/1
PerLTQA
no data
Factuality / Grounding
1/1
RAGAS
7594pEstimated