Agent Memory Is Just a Vector DB. That’s the Problem.

The Benchmark Numbers Full-context injection into an LLM prompt scores 72.9% accuracy on the LoCoMo benchmark at 17.12 seconds p95 latency. Flat vector retrieval drops to 66.9% accuracy — but cuts latency to 1.44 seconds. That is a 6-point accuracy gap buying a 91% speed gain and a 90% reduction …