Back to Arena
Athina AI
by Athina AI (YC W23)
System Card
OrganizationAthina AI (YC W23)
Released2023-01
Architectureagentic-workflow / LLM evaluation and production monitoring
DetailsAthina AI is an evaluation and monitoring platform offering 50+ preset evaluations (hallucination, context relevance, groundedness, toxicity) from multiple providers (OpenAI, Ragas, Guardrails). It supports SOC-2-compliant VPC deployment for enterprise customers. Provides both pre-deployment testing and production monitoring with real-time alerting on regression. Raised $4.1M total.
Parameters—
Domainrag-retrievalagent-memory
Open SourceNo
WebsiteVisit
eval-frameworkhallucination-detectionSOC2production-monitoring50-evals
Capability Profile
Benchmark Scores
6 of 14 benchmarksData Transparency:6 estimated
Long-Context Retrieval1/5
Multi-Turn Recall1/2
MemoryBank
no dataCross-Session Memory1/1
Multi-Hop QA2/3
Agent Task Memory1/1
Personalization0/1
PerLTQA
no dataFactuality / Grounding0/1
RAGAS
no data