Tencent का 4-tier agent memory: SWE-bench 58.4 से 64.2%, SQLite local, MIT

Tencent ने इस हफ़्ते MIT के तहत एक serious agent-memory release drop किया — TencentDB Agent Memory, एक 4-tier pyramid जो long-horizon context-bloat problem solve करता है, जिसे ज़्यादातर agent shops अभी भी हाथ से workaround कर रहे हैं। Shape: base पर L0 raw conversation logs, L1 atomic facts JSONL में, L2 scenario blocks Markdown में, top पर L3 user persona Markdown में। ऊपरी tiers structure preserve करते हैं, निचले evidence preserve करते हैं, और हर retrieval `node_id` + `result_ref` के साथ आता है ताकि persona-level fact काफ़ी न हो तो agent deterministically drill-down कर सके। जो लोग कुछ ज़्यादा turns चलने वाले agents ship करते हैं, उनके लिए यह memory problem के लिए अब तक की सबसे साफ़ published architecture है।

Continuous long-horizon sessions पर numbers (single-turn lookups नहीं, यही सही benchmark है): plugin enable होने पर SWE-bench 58.4% → 64.2% (+9.9% relative), token use 33% down। WideSearch 33% → 50% (+51.5%), tokens 61% down। AA-LCR 44.0% → 47.5%, tokens 31% down। PersonaMem 48% → 76% (+59%)। Defaults: sqlite-vec extension के साथ SQLite, zero external API dependency, Markdown files `~/.openclaw/memory-tdai/` पर। Recall का 5-second timeout है और timeout पर system block करने के बजाय injection skip कर देता है — यानी slow retrieval agent loop को stall नहीं कर सकता। Hybrid BM25 + vector via Reciprocal Rank Fusion, default top-5। L1 atomic-fact extraction हर 5 turns में; persona regeneration हर 50 नई memories पर।

Ecosystem read: Mem0, Letta, MemGPT और Zep दो साल से agent-memory को बाँट रहे हैं, पर 4-tier breakdown architectural delta है। ज़्यादातर existing systems या तो सब कुछ vector index पर flatten करते हैं (Mem0, Zep) या hot/cold split maintain करते हैं (MemGPT)। Pyramid approach तुम्हें Persona-as-Markdown (user-auditable, human-readable, easy to edit), Atomic-as-JSONL (structured, parseable, drill-down keys), और floor पर raw logs देता है। यह एक white-box memory system है जिसे `grep` से debug किया जा सकता है। Tencent ने Mem0/Letta/MemGPT/Zep के against head-to-head benchmarks release से बाहर रखे — asterisk flag करो — लेकिन 33% token cut के साथ SWE-bench delta वो number है जो reproduction में टिकता है। Repository: github.com/Tencent/TencentDB-Agent-Memory।

Monday सुबह: integration story अभी Tencent के OpenClaw (single npm package `@tencentdb-agent-memory/memory-tencentdb`, Node.js 22.16+ चाहिए) या Hermes Agent (Docker only) तक locked है। LangChain और LlamaIndex bindings v1 में नहीं हैं — अगर तुम existing harness के नीचे 4-tier architecture use करना चाहते हो तो वो obvious community gap है। अगर तुम्हारा agent हर turn पर conversation history replay करते हुए tokens जला रहा है और तुम अपना memory system बनाने से पहले एक published baseline का इंतज़ार कर रहे थे, तो repo clone करो, L0→L3 schema पढ़ो, और decide करो कि wrap करना है या अपनी stack के against architecture reimplement करना है। Benchmark numbers credible हैं; integration cost ही trade है।

Tencent का 4-tier agent memory: SWE-bench 58.4 से 64.2%, SQLite local, MIT

और समाचार