PerLTQA

Name: PerLTQA: A Personal Long-Term Memory Question Answering Dataset
Creator: PolyU
Keywords: personalization-eval, personalization, agent-memory

PerLTQA: A Personal Long-Term Memory Question Answering Dataset

Benchmark Metadata

PublisherPolyU

VenuearXiv preprint

Evaluation Typeautomatic

Dimensions3

Test Prompts8,593

ScoringHigher is better

Update Frequencyannual

PaperView Paper

LeaderboardView Leaderboard

What It Measures

Personal semantic-memory recall
Personal episodic-memory recall
Memory-grounded answer accuracy

What It Does Not Measure

Generic factual QA
Code or math reasoning
Latency

All Systems Evaluated(22 systems)

22 estimated

Rank	System	Score	Provenance	Source
#1	Tab AITab (Avi Schiffmann)	87.8	Estimated	Arena estimate — derived from capability profile, not independently verified
#2	ReplikaLuka, Inc.	86.8	Estimated	Arena estimate — derived from capability profile, not independently verified
#3	Pi InflectionInflection AI	85.7	Estimated	Arena estimate — derived from capability profile, not independently verified
#4	Limitless PendantLimitless AI (acquired by Meta Dec 2025)	85.5	Estimated	Arena estimate — derived from capability profile, not independently verified
#5	Talkie AIMiniMax	84.8	Estimated	Arena estimate — derived from capability profile, not independently verified
#6	Second MeMindverse (Shang, Li, et al.)	84.6	Estimated	Arena estimate — derived from capability profile, not independently verified
#7	Friend AIFriend	84	Estimated	Arena estimate — derived from capability profile, not independently verified
#8	Character AICharacter.AI (Google investment)	82.8	Estimated	Arena estimate — derived from capability profile, not independently verified
#9	Bee ComputerBee (acquired by Amazon 2026)	82.5	Estimated	Arena estimate — derived from capability profile, not independently verified
#10	Charlie MnemonicGoodAI	81.9	Estimated	Arena estimate — derived from capability profile, not independently verified
#11	Pickle AISoul Computer (YC-backed)	80.4	Estimated	Arena estimate — derived from capability profile, not independently verified
#12	ParadotWithFeeling.AI	76.6	Estimated	Arena estimate — derived from capability profile, not independently verified
#13	Nomi AIGlimpse AI, Inc.	75.7	Estimated	Arena estimate — derived from capability profile, not independently verified
#14	KindroidKindroid	74.6	Estimated	Arena estimate — derived from capability profile, not independently verified
#15	Personal AIPersonal AI	74	Estimated	Arena estimate — derived from capability profile, not independently verified
#16	memUNevaMind-AI	73.2	Estimated	Arena estimate — derived from capability profile, not independently verified
#17	MemoryBankInstitute of Software, Chinese Academy of Sciences	72.6	Estimated	Arena estimate — derived from capability profile, not independently verified
#18	Copilot MemoryMicrosoft	67.9	Estimated	Arena estimate — derived from capability profile, not independently verified
#19	ChatGPT MemoryOpenAI	67.7	Estimated	Arena estimate — derived from capability profile, not independently verified
#20	MnemosyneJohns Hopkins / independent (2025)	67.7	Estimated	Arena estimate — derived from capability profile, not independently verified
#21	Gemini MemoryGoogle	67.4	Estimated	Arena estimate — derived from capability profile, not independently verified
#22	Heyday AIHeyday (shut down 2025)	62.9	Estimated	Arena estimate — derived from capability profile, not independently verified