Back to Arena

Synapse

by NTU / Salesforce (Zheng et al.)

System Card

OrganizationNTU / Salesforce (Zheng et al.)
Released2023-06
Architectureepisodic-buffer / Trajectory-as-exemplar prompting with exemplar memory
DetailsState abstraction compresses raw HTML into task-relevant observations; trajectory-as-exemplar prompting uses full abstracted action sequences retrieved from an exemplar memory via similarity search.
Parameters
Domainagent-memoryepisodic-session
Open SourceYes
WebsiteVisit
iclr-2024web-agentsminiwobtrajectory

Capability Profile

Benchmark Scores

6 of 14 benchmarks
Long-Context Retrieval
0/5
RULER
no data
NIAH
no data
LooGLE
no data
LongBench
no data
∞Bench
no data
Multi-Turn Recall
2/2
LoCoMo
71.830p
Cross-Session Memory
1/1
Multi-Hop QA
2/3
BABILong
no data
HotpotQA
67.436p
Agent Task Memory
1/1
Personalization
0/1
PerLTQA
no data
Factuality / Grounding
0/1
RAGAS
no data