Back to Arena
Reflexion
by Northeastern / MIT / Princeton (Shinn et al.)
System Card
OrganizationNortheastern / MIT / Princeton (Shinn et al.)
Released2023-03
Architectureagentic-workflow / Verbal reinforcement via episodic reflection buffer
DetailsAgents verbally reflect on task feedback signals, maintaining their own reflective text in an episodic memory buffer to induce better decisions in subsequent trials. Avoids weight updates by using language as a policy encoding.
Parameters—
Domainagent-memoryepisodic-sessionlifelong-learning
Open SourceYes
PaperView Paper
CodeRepository
verbal-rlself-reflectionepisodic-bufferneurips-2023
Capability Profile
Benchmark Scores
6 of 14 benchmarksData Transparency:1 self-reported5 estimated
Long-Context Retrieval0/5
RULER
no dataNIAH
no dataLooGLE
no dataLongBench
no data∞Bench
no dataMulti-Turn Recall2/2
Cross-Session Memory1/1
Agent Task Memory1/1
Personalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no data