Llm

The Process Is the Memory: Why Agentic Memory Should Cache Reasoning, Not Facts

17 July 2026·29 mins

Agent-Memory Stele Ai-Infrastructure Llm

Long-running agents are usually given a memory system that stores facts and retrieves them by similarity, borrowed straight from RAG. We argue that’s the wrong default: cheaply re-derivable facts are net-negative to cache, while the expensive thing an agent accumulates and cannot re-derive, its process, is exactly what the field under-builds. We present the Agentic Context and Protocol Ledger, validate it with two policy simulations and a four-model adversarial review, and report four empirical results on staleness, dependency-checked reuse, and the mechanisms that make imperfect dependency declaration safe.

Why on Earth Would You Run AI Inside the Database?

16 July 2026·11 mins

Postgres Llm Ai-Infrastructure Data-Gravity Operations

A defense of in-database AI agents: data gravity, security, transactional isolation, backups, replication, and why pg-synapse lives inside Postgres on purpose.

How to Actually Build an Agent on Postgres (a pg-synapse Tutorial)

15 July 2026·12 mins

Postgres Llm Agents Tutorial Rust

A working tutorial: install pg-synapse, write a tool, register an LLM profile, create an agent as a row, run it from SQL, attach a reactive trigger. All copy-pasteable.

pg-synapse: Run AI Agents From SQL, Like a Stored Procedure

14 July 2026·7 mins

Postgres Llm Agents Rust Open-Source

pg-synapse is a Postgres-native agent loop runtime in Rust. Invoke LLM agents from SQL, with tools that read and write your database under the caller’s grants.

pg-synapse

1 July 2026·2 mins

Rust Postgresql Pgrx Agents Mcp Llm

Postgres-native agent-loop runtime in Rust. Invoke an LLM agent and its tool dispatch from SQL like a stored procedure — the agent reads and writes your tables directly, with a small trait-based kernel and everything else as a plugin.

llm-judge

1 July 2026·2 mins

Python Llm Rag Benchmarking Evaluation Cli

Portable CLI for judging RAG and LLM benchmark runs across local, OpenAI-compatible, and cloud providers — a deterministic quick mode, a paraphrase-tolerant LLM-as-judge mode, and a full per-case audit trail for every verdict.

abe

1 July 2026·3 mins

Rust Agents Llm Debate Code-Review Mcp

Multi-model LLM debate and second-opinion validation. Broadcasts a prompt to several models — HTTP providers or local CLIs — has them argue over N rounds, and returns a synthesized answer plus an agreement/disagreement report.

Vibe Coding Isn't the Problem. Stopping at Vibe Coding Is.

17 May 2026·7 mins

Ai Agents Vibe-Coding Engineering Llm Software Workflow

Vibe coding is a real and useful phase; the problem is that people stop there. The space between ‘I had an idea on a plane’ and ‘this runs in an air-gapped Kubernetes cluster’ is where the actual work happens. A generalizable playbook for the middle, starting with: treat the LLM like a very literal child.

The Dungeon Master Era: Why Product and Engineering Are Becoming the Same Job

16 May 2026·6 mins

Ai Agents Product Engineering Career Llm Workflow

Agents got good, code became the cheapest thing in the room, and the gap between product and engineering is closing fast. The people who internalize that — who spend their time deciding what should exist and ripping into what the agents hand back — are going to run circles around everyone else.

Let an LLM Tune Your Postgres (With a Safety Net That Actually Works)

24 April 2026·22 mins

Pg-Retest Postgres Ai Llm Tuning Performance

A full LLM-driven tuning loop with four real outcomes: a successful apply, an automatic rollback on regression, a safety-layer rejection, and a hint-driven redirect. No recommendations without measurement.
