Part I

Pragmatic AI for Founders

Founders
Part II

Agent Design Fieldbook

Technical
01
The Foundation: Why Most AI Agents Fail in Production 12 min
Most AI agents are wrappers that crash in production; production agents are systems with 8 deterministic layers where the LLM is constrained, not trusted.
02
The Data Layer: Your Agent Is Only As Good As Your Data 11 min
LLMs will hallucinate field names and values unless you explicitly define what's real through schemas, field registries, and validation boundaries.
03
The Ingestion Layer: Getting Messy Data Into Your Clean Schema 12 min
Vendor data is chaos -- different units, formats, nulls, duplicates; ingestion is 80% of the work, and scripts beat pipelines for flexibility.
04
The Intent Layer: Classify Before You Act 12 min
The first LLM call should classify what the user wants (search, compare, select), not execute it -- then route to specialized handlers that do one job well.
05
The Filter Extraction Layer: From Natural Language to Query 15 min
LLMs are excellent at extracting filters from natural language, but terrible at enforcing boundaries -- inject your field registry, validate everything, and relax constraints when needed.
06
The Memory Layer: Conversations That Persist 16 min
Agents need memory -- not just chat history, but structured context (what was fetched, what was selected, what tokens were used) persisted to database for multi-turn conversations.
07
The Sort & Rank Layer: Ordering Results Intelligently 12 min
Sorting isn't just ORDER BY -- define what 'best' means for each field, handle JSONB with SQL expressions, and let the LLM infer user intent from keywords like 'cheapest' or 'best'.
08
The Product Deep-Dive Layer: When Users Want More Than Specs 17 min
When users want to go deep on a product -- reading reviews, understanding thermal performance, checking compatibility -- RAG lets you search unstructured knowledge that doesn't fit in structured columns.
Part III

Building Effective Tools for AI

Tools
01
What Makes a Good Tool 17 min
Five properties every production MCP tool must have — and why most demo tools satisfy only one of them.
02
MCP Architecture In Depth 18 min
Three transports, three primitives, and the trust boundary that determines where auth and secrets belong in an MCP server.
03
Building Your First Production MCP Server 22 min
How to wire auth, timeout, and logging middleware before your first tool — and the async-acknowledge pattern that prevents hanging tool calls.
04
Tool Design in the Real World 21 min
How Recall's 8-tool design collapsed to 5 tools — and why designing for the LLM's decision surface, not your backend's capability, is the key to lower planning error rates.
05
A2A — When Agents Need to Talk to Each Other 21 min
A2A gives multi-agent systems a task lifecycle that makes every state in a sub-agent's execution visible, pausable, and recoverable — solving the coordination failures that async function calls cannot.
06
Tool Observability 22 min
Every tool call produces one record that answers three questions — did it succeed, how long did it take, and how much did it cost — and those three questions, asked consistently, are the foundation of everything useful you'll ever know about your tool layer in production.
07
The Tool Ecosystem in 2026 21 min
MCP is no longer an emerging standard — it's infrastructure, with real security threats, five open problems, and a clear picture of what teams can solve today versus what requires ecosystem-level coordination.
Part IV

Memory in AI Systems

Memory