Yonk-Labs/
Blog/

Blog

2026

I made pg-raggraph 7× faster by deleting code I wrote. Here's where it goes next.

11 May 2026·9 mins

Pg-Raggraph Postgres Performance Rust Pgrx Benchmarking Rag

A 17× perf gap between pg-raggraph and Apache AGE turned out to be 5 lines of glue code in the bakeoff adapter, not an architectural problem. The fix, the four library-side wins still on the floor, and the three architectural directions ahead — pg_net sidecar, pgrx Rust extension, hybrid embedding tiers.

I keep picking the wrong chunker. Bakeoff fixed it.

8 May 2026·8 mins

Chunkshop Pg-Raggraph Rag Chunking Embeddings Benchmarking Postgres

Three corpora, three different winners, none of them the chunker the README recommended. Why nobody can tell you in advance which chunker to use, and the 30-minute primitive that does the work for you.

pg-raggraph and Apache AGE solve different problems. Stop comparing them on the wrong axis.

7 May 2026·8 mins

Pg-Raggraph Apache-Age Graphrag Postgres Rag Ai

AGE is a property-graph engine; pg-raggraph is read-mostly retrieval that combines vector + BM25 + shallow graph traversal in one query plan. Where each wins, where neither fits, and the deployment story that closes off most of the postgres install base.

Chunkshop, end to end: sales notes → bakeoff → LangGraph agent

6 May 2026·16 mins

Chunkshop Rag Postgres Pgvector Langgraph Tutorial Ingest Hybrid-Search

Real OLTP corpus, twelve-combo bakeoff with three baked-in models plus Snowflake Arctic via BYO YAML, hybrid search via promoted metadata, then wired into a LangGraph agent through inline mode. Every command actually run.

GraphRAG without a graph database. Yes, in postgres.

5 May 2026·9 mins

Pg-Raggraph Graphrag Postgres Pgvector Rag Ai

Most teams reach for Neo4j or Apache AGE the moment they read the Microsoft GraphRAG paper. The honest answer is most GraphRAG workloads don’t need a graph database — pgvector + recursive CTEs + tsvector handle 1-3 hop traversal in one ACID database.

A field guide to the seven chunkers, and where each one falls over

4 May 2026·12 mins

Chunkshop Rag Chunking Embeddings Pgvector Postgres

Seven walkthroughs with opinions — what each chunker is good at, where it falls over, and the corpus shape that flips the leaderboard between them. A field guide, not a recommendation. Bakeoff first.

Meet chunkshop. Yes, the name is a mistake. No, I'm not changing it.

1 May 2026·7 mins

Chunkshop Rag Postgres Pgvector Ingest Chunking Embeddings

An illegal chop shop for your data — the YAML-driven RAG ingest tool that ships a bakeoff primitive so you measure chunker × embedder × your corpus instead of vibe-picking from somebody else’s blog post.

Three Signals Beat One: Hybrid RAG with lede and Postgres

29 April 2026·16 mins

Lede Rag Postgres Pgvector Hybrid-Search Summarization

Most production RAG pipelines run on one signal: chunks. Add doc-level summaries plus structured metadata in Postgres and you get three signals — with working SQL at the bottom of the post.

Meet lede: The Thing You Reach for Before the LLM Call

28 April 2026·11 mins

Lede Summarization Rag Python Rust Preprocessing

Sub-millisecond extractive summarization with byte-identical Python and Rust implementations. The preprocessor that sits in front of the LLM call and cuts tokens 40-94 percent.

Moving MySQL to Postgres Without Praying on Cutover Day

27 April 2026·15 mins

Pg-Retest Postgres Mysql Migration

Capture your real MySQL slow log, push it through a MySQL→Postgres transform pipeline, and replay against Postgres. Every failure in replay is one you don’t find in production.

↑