Back to Benchmarks

RULER

RULER: What's the Real Context Size of Your Long-Context Language Models

Benchmark Metadata

PublisherNVIDIA
VenueCOLM 2024
Evaluation Typeautomatic
Dimensions13
Test Prompts4,000
ScoringHigher is better
Update Frequencyannual
LeaderboardView Leaderboard

What It Measures

  • Single and multi-key needle retrieval
  • Variable tracking
  • Common and frequent word extraction
  • Question answering with long contexts
  • Effective context length

What It Does Not Measure

  • Multi-session consistency
  • Personalization
  • Generation quality

All Systems Evaluated(71 systems)

RankSystemScore
#1Titanslucidrains (community) / paper by Google Research97.9
#2PineconePinecone Systems82.5
#3WeaviateWeaviate81.5
#4Recurrent Memory TransformerMIPT / DeepPavlov (Bulatov, Kuratov, Burtsev)79.5
#5MilvusZilliz78.2
#6QdrantQdrant77.9
#7MambaCMU / Princeton (Gu, Dao)77.8
#8Jina AI EmbeddingsJina AI GmbH76.1
#9RWKVRWKV Foundation / BlinkDL community75.8
#10txtaiNeuML75.7
#11Landmark AttentionEPFL (Mohtashami, Jaggi)75.6
#12Compressive TransformerDeepMind (Rae et al.)75.4
#13Cohere EmbedCohere Inc.75
#14MiniRAGHKUDS74.8
#15LM-InfiniteIllinois / Meta (Han et al.)74.5
#16H2OUT Austin / Rice / CMU / Stanford / Meta (Zhang et al.)74.4
#17ChromaChroma74.2
#18MarkerDatalab (datalab-to)74.1
#19AllegroGraphFranz Inc.74
#20PathRAGBUPT-GAMMA73.6
#21TRIMEPrinceton NLP (Zhong, Lei, Chen)73.6
#22LanceDBLanceDB Inc. (YC S22)73.3
#23R2RSciPhi-AI73.3
#24ScissorhandsRice / Stanford / Meta (Liu et al.)73.2
#25∞ FormerInstituto de Telecomunicações / DeepMind / IST (Martins, Marinho, Martins)72.7
#26KAGOpenSPG / Ant Group72.5
#27GraphRAGMicrosoft72.4
#28ICAEMicrosoft Research (Ge et al.)72.3
#29Neo4j LLM Graph BuilderNeo4j Labs72.3
#30Mixedbread AIMixedbread AI71.9
#31Voyage AIVoyage AI (acquired by MongoDB, Feb 2025)71.9
#32Ontotext GraphDBOntotext / Graphwise (merged with Semantic Web Company, 2025)71.6
#33Supabase VectorSupabase Inc.71.4
#34Nano GraphRAGgusye123471.3
#35DiffbotDiffbot Inc.71.1
#36SingleStore VectorSingleStore Inc.71
#37Activeloop Deep LakeActiveloop Inc.70.7
#38LightRAGHKUDS (HKU Data Intelligence Lab)70.7
#39RAPTORStanford (Sarthi, Abdullah et al.)70.1
#40AtlasMeta AI FAIR (Izacard et al.)69.9
#41StardogStardog Union Inc.69.9
#42Unstructured IOUnstructured Technologies Inc.69.9
#43MyScaleMyScale Inc.69.5
#44Elasticsearch VectorElastic N.V.69.4
#45ValdYahoo Japan69.4
#46pgvector Supabase Neonpgvector OSS / Supabase Inc. / Neon Inc.69.3
#47PrivateGPTZylon AI69.3
#48OpenSearch VectorOpenSearch Project (AWS-led)69.2
#49PaperQA2FutureHouse69.1
#50RETRODeepMind (Borgeaud et al.)68.9
#51Neon VectorNeon Inc.68.8
#52TrustRAGGoMate Community68.7
#53REALMGoogle Research (Guu et al.)68.6
#54SelfmemTsinghua / Microsoft (Cheng et al.)68.4
#55Carbon AICarbon (acquired by Perplexity, Dec 2024)68.3
#56GraphRAG-SDKFalkorDB68.3
#57LlamaCloudLlamaIndex Inc.67.7
#58MarqoMarqo Pty Ltd67.6
#59Astra DBDataStax67.1
#60Activation BeaconBAAI / Renmin University (Zhang et al.)66.7
#61Vespa AIYahoo / Vespa.ai (independent OSS project)66.6
#62vectorizeVectorize Inc.66.5
#63Manticore SearchManticore Software Ltd.66.4
#64VectaraVectara Inc.66.4
#65ColPaliilluin-tech66.2
#66ParadeDBParadeDB Inc. (YC S23)66.1
#67CognitaTrueFoundry66
#68VerbaWeaviate66
#69TurboPufferTurboPuffer Inc.65.6
#70MendableMendable (YC-backed)64.5
#71StreamingLLMMIT Han Lab / Meta AI (Xiao et al.)57.2