AppAgent

by Tencent / mnotgod96

System Card

OrganizationTencent / mnotgod96

Released2023-12

Architectureknowledge-base / UI documentation knowledge base from exploration

DetailsMultimodal LLM agent that explores mobile apps (or learns from demos), generating UI documentation stored as a knowledge base used for later task execution via touchscreen actions. Works with GPT-4V / Qwen-VL.

Parameters—

Domainagent-memoryknowledge-graph

Open SourceYes

PaperView Paper

CodeRepository

mobile-agentui-docgpt-4vexploration

Capability Profile

Benchmark Scores

6 of 14 benchmarks

Data Transparency:6 estimated

Long-Context Retrieval

0/5

RULER

no data

NIAH

no data

LooGLE

no data

LongBench

no data

∞Bench

no data

Multi-Turn Recall

1/2

LoCoMo

73.137pEstimated

MemoryBank

no data

Cross-Session Memory

1/1

LongMemEval

73.434pEstimated

Multi-Hop QA

2/3

BABILong

no data

MultiHop-RAG

7579pEstimated

HotpotQA

76.480pEstimated

Agent Task Memory

1/1

AgentBench-Mem

7226pEstimated

Personalization

0/1

PerLTQA

no data

Factuality / Grounding

1/1

RAGAS

7594pEstimated

Sources:Arena estimate — derived from capability profile, not independently verified