TurboPuffer

by TurboPuffer Inc.

System Card

OrganizationTurboPuffer Inc.

Released2023-10

Architecturevector-rag / serverless object-storage vector engine

DetailsTurboPuffer stores all vector data on object storage (S3-compatible) and builds HNSW indexes on demand, enabling truly serverless scaling with zero namespace limits. Hybrid BM25 + vector search is natively supported. The system powers production deployments at Cursor, Notion, and Linear. Pricing is radically cheaper than in-memory stores: $1/month per million vectors.

Parameters—

Domainrag-retrieval

Open SourceNo

WebsiteVisit

serverlessobject-storagehybrid-searchcost-efficientmulti-tenant

Capability Profile

Benchmark Scores

5 of 14 benchmarks

Data Transparency:5 estimated

Long-Context Retrieval

2/5

RULER

65.63pEstimated

NIAH

no data

LooGLE

no data

LongBench

603pEstimated

∞Bench

no data

Multi-Turn Recall

0/2

LoCoMo

no data

MemoryBank

no data

Cross-Session Memory

0/1

LongMemEval

no data

Multi-Hop QA

2/3

BABILong

no data

MultiHop-RAG

60.97pEstimated

HotpotQA

59.18pEstimated

Agent Task Memory

0/1

AgentBench-Mem

no data

Personalization

0/1

PerLTQA

no data

Factuality / Grounding

1/1

RAGAS

64.516pEstimated

Sources:Arena estimate — derived from capability profile, not independently verified