<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Agent-Memory on Yonk-Labs</title><link>https://yonk.dev/tags/agent-memory/</link><description>Recent content in Agent-Memory on Yonk-Labs</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026 Yonk-Labs</copyright><lastBuildDate>Thu, 04 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://yonk.dev/tags/agent-memory/index.xml" rel="self" type="application/rss+xml"/><item><title>We Gave Agent Memory Semantic Search. It Still Lost to Boring Old RAG.</title><link>https://yonk.dev/blog/your-agent-cant-find-what-it-remembers/</link><pubDate>Thu, 04 Jun 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/your-agent-cant-find-what-it-remembers/</guid><description>We added semantic search to agent memory, then benchmarked it against plain document RAG on the same questions. The boring baseline won by 6x. Here is why that is the point.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/your-agent-cant-find-what-it-remembers/feature.jpg"/></item><item><title>Your Agent Doesn't Need Memory. It Needs Six of Them.</title><link>https://yonk.dev/blog/six-memories-why-memory-is-hard/</link><pubDate>Wed, 03 Jun 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/six-memories-why-memory-is-hard/</guid><description>&amp;ldquo;Add memory to the agent&amp;rdquo; sounds like one feature. It is six different jobs that need three different mechanisms. Here is the map, with a concrete example for each.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/six-memories-why-memory-is-hard/feature.jpg"/></item><item><title>Reading the Big-Ass Grid: A Field Guide to Our RAG Bake-Off</title><link>https://yonk.dev/blog/reading-the-grid/</link><pubDate>Tue, 02 Jun 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/reading-the-grid/</guid><description>A 150-row benchmark grid looks like the output of a robot having a stroke — until you know the three things each row tells you. A field guide to reading our RAG bake-off: read the parametric floor first, decode the system and lane columns, and ask the only two questions that matter — is it right, and what did it cost?</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/reading-the-grid/feature.jpg"/></item><item><title>What Actually Moves RAG Accuracy (And What I Spent A Week Measuring Wrong)</title><link>https://yonk.dev/blog/what-actually-moves-rag-accuracy/</link><pubDate>Mon, 01 Jun 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/what-actually-moves-rag-accuracy/</guid><description>One failing LoCoMo question turned into a cross-corpus, multi-system benchmark — and a pile of retracted conclusions. Small-N runs lie, cross-vendor numbers are rarely apples-to-apples, and a correctness bug will impersonate an architecture win every time. Run the no-context baseline, 6x your sample, and diff the bytes that reach the model before you trust any RAG number.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/what-actually-moves-rag-accuracy/feature.jpg"/></item><item><title>Wire Real Memory Into Your Agent In An Afternoon</title><link>https://yonk.dev/blog/wire-memory-afternoon/</link><pubDate>Wed, 27 May 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/wire-memory-afternoon/</guid><description>The practical follow-up to the goldfish-memory post. Bring a Postgres database with pgvector and an agent that talks to users; an hour later you&amp;rsquo;ve got two-tier memory bolted on. Staging, realtime and consolidate cells, three scheduling options, three reader patterns, and an LLM fact extractor — Python and Rust both.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/wire-memory-afternoon/feature.jpg"/></item><item><title>Your Agent Has Goldfish Memory (And Your Vector Store Won't Fix It)</title><link>https://yonk.dev/blog/goldfish-memory/</link><pubDate>Sun, 24 May 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/goldfish-memory/</guid><description>Agent memory has two completely different jobs — fast context for the next reply, and curated truth three weeks later — and most people try to do both with one tool. Here&amp;rsquo;s the two-tier pattern I built chunkshop&amp;rsquo;s memory layer around, the late-event bug that silently eats conversations, and why &amp;lsquo;just use pgvector&amp;rsquo; isn&amp;rsquo;t the whole answer.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/goldfish-memory/feature.jpg"/></item><item><title>Stele Tutorial: Five Minutes, Then Your Agent Remembers</title><link>https://yonk.dev/blog/stele-tutorial-five-minutes-then-your-agent-remembers/</link><pubDate>Sat, 23 May 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/stele-tutorial-five-minutes-then-your-agent-remembers/</guid><description>The hands-on follow-up to the why-I-built-it post. Real commands, real outputs: install Stele, wire it into your agent, store artifacts with citations, supersede facts, time-travel with as_of, stash oversized tool output, and run recall through two strategies. Five minutes to install, the rest is just typing.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/stele-tutorial-five-minutes-then-your-agent-remembers/feature.jpg"/></item><item><title>Stele: The Memory Layer I Couldn't Stop Building</title><link>https://yonk.dev/blog/stele-the-memory-layer-i-couldnt-stop-building/</link><pubDate>Thu, 21 May 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/stele-the-memory-layer-i-couldnt-stop-building/</guid><description>I said the implementation needed another quarter. Three weeks later I&amp;rsquo;d shipped Stele — source-backed, time-traveling, sovereign agent memory that plugs into seven coding assistants. What it does, the three goals driving it, what&amp;rsquo;s solid on main, and what&amp;rsquo;s still wobbly. The honest version, including the parts that aren&amp;rsquo;t built yet.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/stele-the-memory-layer-i-couldnt-stop-building/feature.jpg"/></item><item><title>stele</title><link>https://yonk.dev/projects/stele/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/projects/stele/</guid><description>Agentic memory implemented natively in PostgreSQL — the episodic, relational, time-anchored memory layer agents actually forget, kept in the database you already run.</description></item><item><title>Your agent forgets things. pg-raggraph might be how you fix that.</title><link>https://yonk.dev/blog/pg-raggraph-as-agent-memory/</link><pubDate>Tue, 12 May 2026 00:00:00 +0000</pubDate><guid>https://yonk.dev/blog/pg-raggraph-as-agent-memory/</guid><description>Modern AI agents need three different kinds of memory and only one of them is RAG. The episodic, relational, time-anchored kind needs a graph — and pg-raggraph happens to be shaped exactly right. Tier 1 evolution awareness, retraction-aware retrieval, namespace isolation. What&amp;rsquo;s built, what&amp;rsquo;s still gap.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://yonk.dev/blog/pg-raggraph-as-agent-memory/feature.jpg"/></item></channel></rss>