AI Agent Memory Degradation: Why Multi-Turn LLMs Collapse
LLMs lose 39% accuracy across 200K+ multi-turn sessions. Three mechanisms drive collapse and longer context windows fix none of them.
AI & TechnologyThoughts on design, development, AI infrastructure, and building products.
LLMs lose 39% accuracy across 200K+ multi-turn sessions. Three mechanisms drive collapse and longer context windows fix none of them.
AI & Technology15,800 Obsidian notes in embedding space reveal three knowledge topologies. Each has failure modes you can diagnose and reshape with bridge notes.
AI & TechnologyRuntime constitutions enforce AI agent governance where training-phase alignment fails. Competence checks, output gates, and four subsystems keep agents safe.
AI & TechnologyFive research groups published about the same problem this week: AI agents produce code faster than developers can understand it. The debt is in your head.
AI & TechnologyA practical GLSL playground with live controls for learning shader intuition fast. Presets, uniform manipulation, and zero-framework WebGL in the browser.
Interactive ExplorationsCraig Reynolds' 1986 boids algorithm produces flocking from three local rules. The same principles and failure modes appear in multi-agent AI systems.
Interactive ExplorationsMost agent instructions define behavior. The missing layer teaches self-evaluation. False evidence gates, seven named failure modes, and hedging detection.
AI & TechnologyFastAPI + HTMX + plain CSS with zero build tools and perfect Lighthouse scores. Real production numbers, honest tradeoffs, and a clear decision framework.
Engineering PracticeContext engineering for AI agents across a 650-file, seven-layer hierarchy. Three production failures, real token budgets, and the system that survived.
AI & TechnologyA 733-line Python pipeline that scores notes across four dimensions and routes 7,700+ items deterministically. The algorithm, weights, and honest results.
Engineering PracticeNegative space is infrastructure, not absence. How emptiness, silence, and whitespace create structure in physics, music, information theory, and design.
Design & UXMulti-agent deliberation catches failures that single-agent systems miss. Here is the architecture, the dead ends, and what is actually worth building.
AI & TechnologyTechnical writing at Introl
Comprehensive hardware recommendations and cost analysis for running large language models locally.
GPU selection guide comparing NVIDIA's latest datacenter accelerators for different AI workloads.
Deep technical dive into Google's Tensor Processing Unit evolution from TPUv1 to TPUv5.
Resource sharing strategies for GPU clusters in containerized environments.
Guide to building and managing distributed AI computing with Ray framework.
Analysis of open source LLM economics and DeepSeek's competitive positioning.
Future datacenter power requirements and NVIDIA's next-generation GPU roadmap.
Small modular reactor solutions for powering next-generation AI infrastructure.
Technical analysis of DeepSeek's Multi-Head Compression architecture innovations.