Back to Benchmarks

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Benchmark Metadata

PublisherTsinghua KEG
VenueACL 2024
Evaluation Typeautomatic
Dimensions21
Test Prompts4,750
ScoringHigher is better
Update Frequencyannual
LeaderboardView Leaderboard

What It Measures

  • Single- and multi-document QA
  • Summarization
  • Few-shot in-context learning
  • Synthetic retrieval
  • Code completion over long contexts

What It Does Not Measure

  • Cross-session memory
  • Personalization
  • Latency

All Systems Evaluated(121 systems)

RankSystemScore
#1txtaiNeuML60
#2LlamaIndex MemoryLlamaIndex60
#3Haystack Memorydeepset60
#4Claude ProjectsAnthropic60
#5PineconePinecone Systems60
#6WeaviateWeaviate60
#7QdrantQdrant60
#8ChromaChroma60
#9MilvusZilliz60
#10Activeloop Deep LakeActiveloop Inc.60
#11Adept AIAdept AI Labs (acquired by Amazon 2024)60
#12AgentScopeModelScope (Alibaba)60
#13AllegroGraphFranz Inc.60
#14AnythingLLMMintplex Labs60
#15Astra DBDataStax60
#16Athina AIAthina AI (YC W23)60
#17AtlasMeta AI FAIR (Izacard et al.)60
#18Bishengdataelement60
#19Carbon AICarbon (acquired by Perplexity, Dec 2024)60
#20CognitaTrueFoundry60
#21Cohere EmbedCohere Inc.60
#22ColPaliilluin-tech60
#23Compressive TransformerDeepMind (Rae et al.)60
#24Couchbase VectorCouchbase Inc.60
#25DiffbotDiffbot Inc.60
#26DifyLangGenius60
#27Dust ttDust (formerly XP1)60
#28Elasticsearch VectorElastic N.V.60
#29EpsillaEpsilla Inc. (YC S23)60
#30FastGPTlabring60
#31FlowiseFlowiseAI60
#32Galileo AIGalileo Technologies Inc.60
#33Granola AIGranola60
#34GraphRAGMicrosoft60
#35GraphRAG-SDKFalkorDB60
#36H2OUT Austin / Rice / CMU / Stanford / Meta (Zhang et al.)60
#37HebbiaHebbia, Inc.60
#38HoneyHiveHoneyHive Inc.60
#39ICAEMicrosoft Research (Ge et al.)60
#40∞ FormerInstituto de Telecomunicações / DeepMind / IST (Martins, Marinho, Martins)60
#41Jina AI EmbeddingsJina AI GmbH60
#42KAGOpenSPG / Ant Group60
#43KDB AIKX Systems60
#44kNN-LMStanford / Facebook AI Research (Khandelwal et al.)60
#45LanceDBLanceDB Inc. (YC S22)60
#46Landmark AttentionEPFL (Mohtashami, Jaggi)60
#47LangflowLangflow-ai (DataStax)60
#48LangSmith LangGraph CloudLangChain Inc.60
#49LightRAGHKUDS (HKU Data Intelligence Lab)60
#50LlamaCloudLlamaIndex Inc.60
#51LM-InfiniteIllinois / Meta (Han et al.)60
#52LongMemUCSB / Microsoft Research60
#53MambaCMU / Princeton (Gu, Dao)60
#54Manticore SearchManticore Software Ltd.60
#55MarkerDatalab (datalab-to)60
#56MarqoMarqo Pty Ltd60
#57Maxim AIMaxim AI Inc.60
#58Mem AIMem Labs60
#59MemformerUC Santa Barbara / Amazon (Wu, Lan, Liu, et al.)60
#60Memorizing TransformerGoogle Research (Wu, Rabe, Hutchins, Szegedy)60
#61Memory³Institute for Advanced Algorithms Research Shanghai / Peking University60
#62MemoryLLMUCSD / Apple (Wang et al.)60
#63MemR32025 (December submission)60
#64MendableMendable (YC-backed)60
#65MiniRAGHKUDS60
#66Mixedbread AIMixedbread AI60
#67MongoDB Atlas VectorMongoDB Inc.60
#68MyScaleMyScale Inc.60
#69Nano GraphRAGgusye123460
#70Neo4j LLM Graph BuilderNeo4j Labs60
#71Neon VectorNeon Inc.60
#72Nomic AtlasNomic AI Inc.60
#73Notion AINotion Labs60
#74Ontotext GraphDBOntotext / Graphwise (merged with Semantic Web Company, 2025)60
#75Onyxonyx-dot-app60
#76OpenSearch VectorOpenSearch Project (AWS-led)60
#77PaperQA2FutureHouse60
#78ParadeDBParadeDB Inc. (YC S23)60
#79PathRAGBUPT-GAMMA60
#80pgvector Supabase Neonpgvector OSS / Supabase Inc. / Neon Inc.60
#81PrivateGPTZylon AI60
#82QuivrQuivrHQ60
#83Qwen-AgentQwenLM (Alibaba)60
#84R2RSciPhi-AI60
#85R3MemHKUST (2025)60
#86RAGFlowInfiniFlow60
#87RagieRagie Inc.60
#88RAPTORStanford (Sarthi, Abdullah et al.)60
#89REALMGoogle Research (Guu et al.)60
#90Recurrent Memory TransformerMIPT / DeepPavlov (Bulatov, Kuratov, Burtsev)60
#91Redis VectorRedis Ltd.60
#92ReMeModelScope (Alibaba)60
#93RETRODeepMind (Borgeaud et al.)60
#94RWKVRWKV Foundation / BlinkDL community60
#95Sana AISana Labs60
#96Saner AISaner.AI60
#97ScissorhandsRice / Stanford / Meta (Liu et al.)60
#98Self-RAGUniversity of Washington / Allen AI (Asai et al.)60
#99SelfmemTsinghua / Microsoft (Cheng et al.)60
#100SID AISID (YC)60
#101SingleStore VectorSingleStore Inc.60
#102Stack AIStack AI Inc. (YC W23)60
#103StardogStardog Union Inc.60
#104Supabase VectorSupabase Inc.60
#105Titanslucidrains (community) / paper by Google Research60
#106TRIMEPrinceton NLP (Zhong, Lei, Chen)60
#107TrustRAGGoMate Community60
#108TurboPufferTurboPuffer Inc.60
#109Unstructured IOUnstructured Technologies Inc.60
#110ValdYahoo Japan60
#111VectaraVectara Inc.60
#112vectorizeVectorize Inc.60
#113VectorShiftVectorShift Inc. (YC S23)60
#114Vellum AIVellum AI Inc. (YC W23)60
#115VerbaWeaviate60
#116Vespa AIYahoo / Vespa.ai (independent OSS project)60
#117Voyage AIVoyage AI (acquired by MongoDB, Feb 2025)60
#118EM-LLMem-llm (academic consortium)51.3
#119MemoRAGBAAI / Qhjqhj0044.4
#120Activation BeaconBAAI / Renmin University (Zhang et al.)39.8
#121StreamingLLMMIT Han Lab / Meta AI (Xiao et al.)24.5