Back to Arena
Vellum AI
by Vellum AI Inc. (YC W23)
System Card
OrganizationVellum AI Inc. (YC W23)
Released2023-02
Architectureagentic-workflow / LLM workflow and evaluation platform
DetailsVellum provides an enterprise platform for building, evaluating, and deploying LLM workflows (including RAG pipelines). Features include prompt versioning, A/B testing, workflow orchestration with a visual editor, regression testing suites, and production monitoring with human-in-the-loop feedback. Raised $25.5M total ($5M seed, $20M Series A). YC W23 company.
Parameters—
Domainrag-retrievalagent-memory
Open SourceNo
WebsiteVisit
prompt-managementworkflow-orchestrationevaluationA/B-testingenterprise
Capability Profile
Benchmark Scores
6 of 14 benchmarksData Transparency:6 estimated
Long-Context Retrieval1/5
Multi-Turn Recall1/2
MemoryBank
no dataCross-Session Memory1/1
Multi-Hop QA2/3
Agent Task Memory1/1
Personalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no data