JARVIS-1

by CraftJarvis

System Card

Organization: CraftJarvis
Released: 2023-10
Architecture: episodic-buffer / Multimodal memory for open-world planning
Details: Minecraft agent combining a multimodal LM with embodied control and a multimodal memory store. Plans using both pre-trained knowledge and retrieved game-experience memories across 200+ tasks.
Parameters:
Domain: agent-memory, lifelong-learning
Open Source: Yes
minecraft, multimodal-memory, open-world, paper-repo
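
The Details field describes planning that draws on retrieved game-experience memories. A minimal sketch of that retrieve-then-plan idea follows. It is an illustration under stated assumptions, not the JARVIS-1 implementation: the names `Experience`, `MemoryStore`, and `build_prompt` are hypothetical, and a bag-of-words vector stands in for whatever multimodal encoder the real system uses.

```python
import math
from collections import Counter
from dataclasses import dataclass
from typing import List


@dataclass
class Experience:
    """A stored memory entry (hypothetical schema, for illustration only)."""
    task: str        # description of the task that was attempted
    plan: List[str]  # sequence of sub-goals that succeeded


def embed(text: str) -> Counter:
    # Assumption: a bag-of-words count vector stands in for a real
    # multimodal encoder, which this sketch does not model.
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[k] * b[k] for k in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class MemoryStore:
    """Keeps past experiences and returns those most similar to a query."""

    def __init__(self) -> None:
        self.items: List[Experience] = []

    def add(self, exp: Experience) -> None:
        self.items.append(exp)

    def retrieve(self, query: str, k: int = 1) -> List[Experience]:
        q = embed(query)
        ranked = sorted(
            self.items,
            key=lambda e: cosine(q, embed(e.task)),
            reverse=True,
        )
        return ranked[:k]


def build_prompt(task: str, memory: MemoryStore, k: int = 1) -> str:
    # Combine the new task with retrieved successes so that a planner
    # (not modeled here) could condition on both.
    lines = [f"Task: {task}"]
    for exp in memory.retrieve(task, k):
        lines.append(f"Past success on '{exp.task}': " + " -> ".join(exp.plan))
    lines.append("Plan:")
    return "\n".join(lines)


if __name__ == "__main__":
    store = MemoryStore()
    store.add(Experience("craft wooden pickaxe",
                         ["collect logs", "craft planks", "craft wooden pickaxe"]))
    store.add(Experience("cook chicken",
                         ["kill chicken", "craft furnace", "smelt chicken"]))
    print(build_prompt("craft stone pickaxe", store))
```

In this sketch, the query "craft stone pickaxe" shares two words with the stored pickaxe task and none with the cooking task, so the pickaxe plan is the one placed in the prompt.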

Capability Profile

Benchmark Scores

6 of 14 benchmarks
Data Transparency: 6 estimated

Long-Context Retrieval (0/5)
RULER: no data
NIAH: no data
LooGLE: no data
LongBench: no data
∞Bench: no data

Multi-Turn Recall (2/2)
LoCoMo: 71.729p (Estimated)
MemoryBank: 79.877p (Estimated)

Cross-Session Memory (1/1)
LongMemEval: 79.878p (Estimated)

Multi-Hop QA (2/3)
BABILong: 71.616p (Estimated)
MultiHop-RAG: no data
HotpotQA: 70.648p (Estimated)

Agent Task Memory (1/1)
AgentBench-Mem: 7226p (Estimated)

Personalization (0/1)
PerLTQA: no data

Factuality / Grounding (0/1)
RAGAS: no data