AutoWebGLM

by THUDM

System Card

OrganizationTHUDM

Released2024-04

Architectureagentic-workflow / HTML-simplification web agent + curriculum

DetailsChatGLM3-6B web agent with HTML simplification, RL + rejection sampling, curriculum learning. Includes AutoWebBench bilingual benchmark.

Parameters—

Domainagent-memory

Open SourceYes

PaperView Paper

CodeRepository

chatglmweb-agentkdd-2024bilingual

Capability Profile

Benchmark Scores

5 of 14 benchmarks

Data Transparency:5 estimated

Long-Context Retrieval

0/5

RULER

no data

NIAH

no data

LooGLE

no data

LongBench

no data

∞Bench

no data

Multi-Turn Recall

1/2

LoCoMo

74.442pEstimated

MemoryBank

no data

Cross-Session Memory

1/1

LongMemEval

81.186pEstimated

Multi-Hop QA

2/3

BABILong

no data

MultiHop-RAG

7579pEstimated

HotpotQA

74.466pEstimated

Agent Task Memory

1/1

AgentBench-Mem

7226pEstimated

Personalization

0/1

PerLTQA

no data

Factuality / Grounding

0/1

RAGAS

no data

Sources:Arena estimate — derived from capability profile, not independently verified