Silent Egress: The Attack Surface You Didn't Build
A malicious web page injected instructions into URL metadata. The agent fetched it, read the poison, and exfiltrated the API key. No error. No log.
AI & TechnologyThoughts on design, development, AI infrastructure, and building products.
A malicious web page injected instructions into URL metadata. The agent fetched it, read the poison, and exfiltrated the API key. No error. No log.
AI & TechnologyAI agents consume disk, CPU, and network with zero operator visibility. Three observability layers close the gap before damage is irreversible.
AI & TechnologyGit captures what changed. Agent sessions capture why. When agents write code, the session transcript is the real design document — and we discard it.
AI & Technology49,746 chunks, 83 MB, zero API calls. How BM25 + vector search + RRF fusion in one SQLite file turns 16,894 Obsidian files into a queryable knowledge base.
AI EngineeringBuild custom Claude Code skills that auto-activate based on context. Step-by-step tutorial covering SKILL.md structure, frontmatter, LLM-based matching, and team sharing via git.
AI Development118 functions with slowdowns from 3x to 446x in two Claude Code PRs. AI agents optimize for correctness, not performance — here's the data.
AI EngineeringWhich AGENTS.md patterns actually change agent behavior? Anti-patterns to avoid, patterns that work, and a cross-tool compatibility matrix for 8 tools.
AI DevelopmentContext engineering is the highest-impact skill in agent development. Three compression layers turn a 200K token window from liability into advantage.
AI & TechnologyThree top HN Claude Code threads converge on one conclusion: CLI-first architecture is cheaper, faster, and more composable than IDE agent workflows.
AI & TechnologySeven named failure modes from 500+ autonomous agent sessions. Each has a detection signal, a real example, and a concrete fix. The taxonomy HN asked for.
AI EngineeringA 7B model with sparse expert access matches agents 50x its size. Route routine work to small models and judgment calls to frontier models.
AI & TechnologyClaude Code vs Codex CLI, current to June 2026: Opus 4.8 vs GPT-5.5, hooks vs kernel sandboxing, AGENTS.md portability, and 36 blind duel results.
AI DevelopmentTechnical writing at Introl
Comprehensive hardware recommendations and cost analysis for running large language models locally.
GPU selection guide comparing NVIDIA's latest datacenter accelerators for different AI workloads.
Deep technical dive into Google's Tensor Processing Unit evolution from TPUv1 to TPUv5.
Resource sharing strategies for GPU clusters in containerized environments.
Guide to building and managing distributed AI computing with Ray framework.
Analysis of open source LLM economics and DeepSeek's competitive positioning.
Future datacenter power requirements and NVIDIA's next-generation GPU roadmap.
Small modular reactor solutions for powering next-generation AI infrastructure.
Technical analysis of DeepSeek's Multi-Head Compression architecture innovations.