Back to Arena

OS-Copilot / FRIDAY

by Shanghai AI Lab / MMLab (Wu et al.)

System Card

OrganizationShanghai AI Lab / MMLab (Wu et al.)
Released2024-02
Architectureagentic-workflow / Skill library + long-term memory + tool discovery
DetailsGeneralist OS agent with a Configuration Tracker using dense retrieval over long-term memory to recall tools, user profiles, and working directory state. Self-improves via accumulated skills across Excel, PowerPoint, web, code, and multimedia applications.
Parameters
Domainagent-memorylifelong-learning
Open SourceYes
WebsiteVisit
gaia-benchmarkos-agentskill-libraryself-improve

Capability Profile

Benchmark Scores

6 of 14 benchmarks
Long-Context Retrieval
0/5
RULER
no data
NIAH
no data
LooGLE
no data
LongBench
no data
∞Bench
no data
Multi-Turn Recall
2/2
LoCoMo
8086p
MemoryBank
75.349p
Cross-Session Memory
1/1
Multi-Hop QA
2/3
BABILong
74.243p
MultiHop-RAG
no data
HotpotQA
75.775p
Agent Task Memory
1/1
Personalization
0/1
PerLTQA
no data
Factuality / Grounding
0/1
RAGAS
no data