Back to Arena
RWKV
by RWKV Foundation / BlinkDL community
System Card
OrganizationRWKV Foundation / BlinkDL community
Released2023-05
Architectureexternal-memory-network / Linear-attention RNN with receptance-weighted key-value
DetailsCombines Transformer-style parallelizable training with RNN-style linear-time inference through a receptance-weighted key-value (RWKV) attention. Constant memory, no KV cache, unbounded context length.
Parameters—
Domainlong-context
Open SourceYes
PaperView Paper
WebsiteVisit
CodeRepository
rnnlinear-attentionconstant-memoryefficient
Capability Profile
Benchmark Scores
6 of 14 benchmarksData Transparency:6 estimated
Long-Context Retrieval5/5
Multi-Turn Recall0/2
LoCoMo
no dataMemoryBank
no dataCross-Session Memory0/1
LongMemEval
no dataMulti-Hop QA1/3
Agent Task Memory0/1
AgentBench-Mem
no dataPersonalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no data