BrowserGym

by ServiceNow Research

System Card

Organization: ServiceNow Research
Released: 2024-02
Architecture: agentic-workflow / standardized Gym environment for web agents (memory evaluation)
Details: Gym-style framework for web-agent research supporting MiniWoB, WebArena, WorkArena, and VisualWebArena. It provides a standard API for state, action, and reward, memory across steps, and multi-benchmark integration.
Parameters: not listed
Domain: agent-memory
Open Source: Yes
Tags: servicenow, gym, webarena, benchmark-harness
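
The "standard API for state, action, and reward" and "memory across steps" mentioned in the details can be illustrated with a small sketch. This is not BrowserGym's actual API: `ToyFormEnv`, `MemoryAgent`, and `run_episode` are hypothetical names invented here, and the environment is a toy stand-in for a web task. It only assumes the common Gym-style convention in which `reset()` returns `(observation, info)` and `step(action)` returns `(observation, reward, terminated, truncated, info)`.

```python
from dataclasses import dataclass, field
from typing import Any

Obs = dict[str, Any]


class ToyFormEnv:
    """Hypothetical stand-in for a web task: fill a field, then submit.

    The observation never reveals whether the field was filled, so the
    agent has to remember its own earlier actions.
    """

    def reset(self) -> tuple[Obs, dict[str, Any]]:
        self._filled = False
        self._steps = 0
        return {"page": "form"}, {}

    def step(self, action: str) -> tuple[Obs, float, bool, bool, dict[str, Any]]:
        self._steps += 1
        reward, terminated = 0.0, False
        if action == "fill":
            self._filled = True
        elif action == "submit" and self._filled:
            reward, terminated = 1.0, True
        # Cut the episode off after 5 steps if the task is still unsolved.
        truncated = self._steps >= 5 and not terminated
        obs = {"page": "done" if terminated else "form"}
        return obs, reward, terminated, truncated, {}


@dataclass
class MemoryAgent:
    """Keeps (observation, action) pairs across steps and consults them."""

    history: list[tuple[Obs, str]] = field(default_factory=list)

    def act(self, obs: Obs) -> str:
        already_filled = any(action == "fill" for _, action in self.history)
        action = "submit" if already_filled else "fill"
        self.history.append((obs, action))
        return action


def run_episode(env: ToyFormEnv, agent: MemoryAgent) -> float:
    """Run one episode and return the total reward."""
    obs, _info = env.reset()
    total, done = 0.0, False
    while not done:
        action = agent.act(obs)
        obs, reward, terminated, truncated, _info = env.step(action)
        total += reward
        done = terminated or truncated
    return total


agent = MemoryAgent()
total_reward = run_episode(ToyFormEnv(), agent)
```

Because the toy observation hides the form state, the agent can only solve the task by consulting its own history, which is the kind of cross-step memory the card refers to.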

Capability Profile

Benchmark Scores

5 of 14 benchmarks

Long-Context Retrieval: 0/5
  RULER: no data
  NIAH: no data
  LooGLE: no data
  LongBench: no data
  ∞Bench: no data

Multi-Turn Recall: 1/2
  LoCoMo: 80.388p
  MemoryBank: no data

Cross-Session Memory: 1/1

Multi-Hop QA: 2/3
  BABILong: no data
  HotpotQA: 75.371p

Agent Task Memory: 1/1

Personalization: 0/1
  PerLTQA: no data

Factuality / Grounding: 0/1
  RAGAS: no data