AppAgent

by Tencent / mnotgod96

System Card

Organization: Tencent / mnotgod96
Released: 2023-12
Architecture: knowledge-base / UI documentation knowledge base from exploration
Details: Multimodal LLM agent that explores mobile apps (or learns from human demonstrations) and generates UI documentation, which is stored as a knowledge base and consulted later when executing tasks through touchscreen actions. Works with GPT-4V or Qwen-VL.
Parameters: not listed
Domain: agent-memory, knowledge-graph
Open Source: Yes
Tags: mobile-agent, ui-doc, gpt-4v, exploration
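
The two phases described in the details (exploration that writes UI documentation, then task execution that reads it back) can be illustrated with a minimal sketch. This is not AppAgent's actual code or API: `UIKnowledgeBase`, `explore`, `build_prompt`, and the element ids are hypothetical names chosen for illustration, and the multimodal LLM call is replaced by a hand-supplied effect string.

```python
from dataclasses import dataclass, field


@dataclass
class UIKnowledgeBase:
    """Hypothetical store mapping a UI element id to its documentation."""
    docs: dict = field(default_factory=dict)

    def record(self, element_id: str, observation: str) -> None:
        # Append rather than overwrite, so repeated interactions
        # accumulate documentation for the same element.
        existing = self.docs.get(element_id)
        self.docs[element_id] = (
            f"{existing} {observation}" if existing else observation
        )

    def lookup(self, element_id: str) -> str:
        return self.docs.get(element_id, "(undocumented)")


def explore(kb: UIKnowledgeBase, interactions) -> None:
    """Exploration phase: each interaction is (element_id, action, effect).

    Assumption: a real agent would obtain `effect` by asking a multimodal
    LLM to compare screenshots; here it is supplied directly.
    """
    for element_id, action, effect in interactions:
        kb.record(element_id, f"{action.capitalize()} {effect}.")


def build_prompt(kb: UIKnowledgeBase, task: str, visible_elements) -> str:
    """Execution phase: attach stored docs for on-screen elements to the task."""
    lines = [f"Task: {task}", "Known UI elements:"]
    lines += [f"- {e}: {kb.lookup(e)}" for e in visible_elements]
    return "\n".join(lines)


kb = UIKnowledgeBase()
explore(kb, [("btn_compose", "tap", "opens the compose screen")])
prompt = build_prompt(kb, "send an email", ["btn_compose", "btn_search"])
print(prompt)
```

Elements never touched during exploration fall back to `(undocumented)`, which is one simple way to signal to the model that it has no prior knowledge of that control.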

Capability Profile

Benchmark Scores

6 of 14 benchmarks

Long-Context Retrieval (0/5)
  RULER: no data
  NIAH: no data
  LooGLE: no data
  LongBench: no data
  ∞Bench: no data

Multi-Turn Recall (1/2)
  LoCoMo: 73.137p
  MemoryBank: no data

Cross-Session Memory (1/1)

Multi-Hop QA (2/3)
  BABILong: no data
  HotpotQA: 76.480p

Agent Task Memory (1/1)

Personalization (0/1)
  PerLTQA: no data

Factuality / Grounding (1/1)
  RAGAS: 7594p
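
The headline figure of 6 of 14 benchmarks follows from the per-category coverage counts listed above. A short sketch of that tally, with the counts copied from this card into a hypothetical `coverage` dictionary:

```python
# (benchmarks with data, benchmarks in category), as listed on this card
coverage = {
    "Long-Context Retrieval": (0, 5),
    "Multi-Turn Recall": (1, 2),
    "Cross-Session Memory": (1, 1),
    "Multi-Hop QA": (2, 3),
    "Agent Task Memory": (1, 1),
    "Personalization": (0, 1),
    "Factuality / Grounding": (1, 1),
}

covered = sum(have for have, _ in coverage.values())
total = sum(count for _, count in coverage.values())
print(f"{covered} of {total} benchmarks")  # prints "6 of 14 benchmarks"
```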