PaperQA2
by FutureHouse
System Card
Organization: FutureHouse
Released: 2024-01
Architecture: agentic-workflow / Agentic RAG for scientific papers (3-phase)
Details: Three-phase agent (search -> evidence gathering with embedding + LLM re-scoring -> answer). Metadata-aware embeddings, automatic paper metadata plus citation/retraction checks, and multimodal handling of tables, figures, and equations.
Parameters: not reported
Domain: rag-retrieval
Open Source: Yes
Paper: View Paper
Website: Visit
Code: Repository
Tags: science, agentic-rag, citations, multimodal
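The three phases named under Details (search, evidence gathering with embedding plus LLM re-scoring, answer) can be illustrated with a minimal, self-contained sketch. This is not PaperQA2's actual code or API: every name below (Chunk, embed, search, llm_relevance, gather_evidence, answer, DEMO_CORPUS) is hypothetical, the "embedding" is a toy bag-of-words vector, and the "LLM" scorer is a term-overlap stub standing in for a real model call.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    paper: str  # citation key of the source paper (made-up demo data)
    text: str   # passage text


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def search(query: str, corpus: list[Chunk], k: int) -> list[Chunk]:
    """Phase 1: retrieve the top-k candidate chunks by embedding similarity."""
    q = embed(query)
    ranked = sorted(corpus, key=lambda c: cosine(q, embed(c.text)), reverse=True)
    return ranked[:k]


def llm_relevance(query: str, chunk: Chunk) -> int:
    """Phase 2 stub: stands in for an LLM call rating relevance from 0 to 10.

    Assumption: approximated here by the share of query terms found in the chunk.
    """
    q_terms = set(embed(query))
    hits = len(q_terms & set(embed(chunk.text)))
    return round(10 * hits / len(q_terms)) if q_terms else 0


def gather_evidence(query: str, candidates: list[Chunk], keep: int):
    """Phase 2: re-score candidates and keep the best non-zero ones."""
    scored = [(llm_relevance(query, c), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(s, c) for s, c in scored[:keep] if s > 0]


def answer(query: str, evidence) -> str:
    """Phase 3: compose an answer that cites each piece of kept evidence."""
    if not evidence:
        return "Insufficient evidence to answer."
    cited = "; ".join(f"{c.text} [{c.paper}]" for _, c in evidence)
    return f"Q: {query}\nEvidence: {cited}"


# Made-up demo corpus; the citation keys and passages are illustrative only.
DEMO_CORPUS = [
    Chunk("smith2020", "Protein folding is guided by hydrophobic collapse."),
    Chunk("lee2021", "Transformer models predict protein structure from sequence."),
    Chunk("kim2019", "Soil bacteria fix nitrogen in legume roots."),
]
```

Calling search, then gather_evidence, then answer on DEMO_CORPUS walks through the three phases in order. In a real system the stubbed scorer would be replaced by a model call, and the final step would generate prose rather than concatenate passages.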
Capability Profile
Benchmark Scores
5 of 14 benchmarks
Data Transparency: 5 estimated
Long-Context Retrieval: 2/5
Multi-Turn Recall: 0/2
    LoCoMo: no data
    MemoryBank: no data
Cross-Session Memory: 0/1
    LongMemEval: no data
Multi-Hop QA: 2/3
Agent Task Memory: 0/1
    AgentBench-Mem: no data
Personalization: 0/1
    PerLTQA: no data
Factuality / Grounding: 1/1