Back to Arena
WebVoyager
by MinorJerry et al.
System Card
OrganizationMinorJerry et al.
Released2024-01
Architectureagentic-workflow / Multimodal web agent with action/state history
DetailsLMM-powered web agent on Selenium that navigates real websites, uses screenshots + DOM, and maintains trajectory memory across 643 test tasks plus GAIA tasks.
Parameters—
Domainagent-memory
Open SourceYes
PaperView Paper
CodeRepository
lmmseleniumgaiaweb-benchmark
Capability Profile
Benchmark Scores
5 of 14 benchmarksData Transparency:5 estimated
Long-Context Retrieval0/5
RULER
no dataNIAH
no dataLooGLE
no dataLongBench
no data∞Bench
no dataMulti-Turn Recall1/2
MemoryBank
no dataCross-Session Memory1/1
Multi-Hop QA2/3
Agent Task Memory1/1
Personalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no data