Foundation Models On-Device LLM: The Tool Protocol
iOS 26's Foundation Models framework puts a 3B-parameter LLM on every Apple Intelligence device. The Tool protocol is the surface that makes the model useful.
AI & TechnologyThoughts on design, development, AI infrastructure, and building products.
iOS 26's Foundation Models framework puts a 3B-parameter LLM on every Apple Intelligence device. The Tool protocol is the surface that makes the model useful.
AI & TechnologywatchOS has no iOS-style background. WKExtendedRuntimeSession is the contract; without it, the app suspends on wrist drop. Pattern shipped in Return.
AI & TechnologyReal production patterns from Water (water tracking, HKQuantitySample) and Return (mindful sessions, HKCategorySample). Permission UX, async wrappers, watchOS variants, and the trap to avoid.
AI & TechnologyApple's Liquid Glass is a one-line SwiftUI API. Three patterns from Return go beyond .glassEffect(): glass on text via Core Text glyph paths, mirror reflection, and HUD overlays.
AI & TechnologyReturn runs on iPhone, iPad, Mac, Apple Watch, and Apple TV. Three Swift files are shared across all five targets out of 40 total. The rest is duplicated, and that is on purpose.
AI & TechnologyWhat changes when you move from `claude` in a terminal to the Mac desktop app, and how /remote-control lets you steer a local session from your phone without losing Xcode, MCP, or files.
AI EngineeringGet Bananas runs on iOS, macOS, watchOS, and visionOS. It also lives inside Claude Desktop as an MCP server. Bridge: iCloud Drive plus a JSON file.
AI & TechnologyI shipped an App Intent in Water on Feb 8, 2026. Here's what Apple Intelligence wants from third-party apps in iOS 26, and why App Intents are the contract that matters.
AI & TechnologyCharlie Labs pivoted from building agents to cleaning up after them. The AI agent market is moving from generation to proof. Cleanup is the durable layer.
AI & TechnologyTwo Claude Code trust dialog bypass CVEs in 37 days reveal a load-order failure. One invariant fixes it: interpret no workspace byte until the path is trusted.
AI & TechnologyAI agents fail when answers claim tool work that never happened. Four failure modes and the rule that catches them, with a tool-supervised RL parallel.
AI & TechnologySteve Test vs Jiro Test: a product framework for deciding whether work deserves to exist, holds user trust, and belongs in the whole you are building.
Career & GrowthTechnical writing at Introl
Comprehensive hardware recommendations and cost analysis for running large language models locally.
GPU selection guide comparing NVIDIA's latest datacenter accelerators for different AI workloads.
Deep technical dive into Google's Tensor Processing Unit evolution from TPUv1 to TPUv5.
Resource sharing strategies for GPU clusters in containerized environments.
Guide to building and managing distributed AI computing with Ray framework.
Analysis of open source LLM economics and DeepSeek's competitive positioning.
Future datacenter power requirements and NVIDIA's next-generation GPU roadmap.
Small modular reactor solutions for powering next-generation AI infrastructure.
Technical analysis of DeepSeek's Multi-Head Compression architecture innovations.