Back to Arena

AutoWebGLM

by THUDM

System Card

OrganizationTHUDM
Released2024-04
Architectureagentic-workflow / HTML-simplification web agent + curriculum
DetailsChatGLM3-6B web agent with HTML simplification, RL + rejection sampling, curriculum learning. Includes AutoWebBench bilingual benchmark.
Parameters
Domainagent-memory
Open SourceYes
chatglmweb-agentkdd-2024bilingual

Capability Profile

Benchmark Scores

5 of 14 benchmarks
Data Transparency:5 estimated
Long-Context Retrieval
0/5
RULER
no data
NIAH
no data
LooGLE
no data
LongBench
no data
∞Bench
no data
Multi-Turn Recall
1/2
LoCoMo
74.442pEstimated
MemoryBank
no data
Cross-Session Memory
1/1
LongMemEval
81.186pEstimated
Multi-Hop QA
2/3
BABILong
no data
MultiHop-RAG
7579pEstimated
HotpotQA
74.466pEstimated
Agent Task Memory
1/1
AgentBench-Mem
7226pEstimated
Personalization
0/1
PerLTQA
no data
Factuality / Grounding
0/1
RAGAS
no data