Local LLMs on macOS in 2026: when they're worth the GPU
Local LLMs got dramatically better in 2025-2026. They're competitive with frontier APIs for some workflows; not all. Here's the honest picture.
Practical guides for Claude Code, Cursor, Codex, cmux, and the rest of the AI dev stack.
Local LLMs got dramatically better in 2025-2026. They're competitive with frontier APIs for some workflows; not all. Here's the honest picture.
Claude Code's memory feature is powerful but easy to misuse. The pattern that scales — what to put in global memory, what to put per-project, what to never persist.
When an AI agent has access to your whole repo, it doesn't read your whole repo. Here's how to choose what enters context, what stays out, and how that decision affects output quality.
AI agents fail in specific ways. The right debugging response depends on the failure mode. Here's the structured protocol that catches drift fast.
cmux turns one terminal window into a tile of independent agent sessions. Here's how to wire it up, name your sessions, and integrate it with a file manager so context never gets lost.
Prompt engineering peaked. Context engineering — what's in the agent's working memory, what's not — is the 2026 leverage point. Here's the practical playbook.
Both tools edit files. They have very different opinions about which files, how many at once, and where their working memory lives. Here's the honest comparison from someone who runs both daily.
OpenAI's Codex CLI doesn't get the attention Claude Code or Cursor do, but it's surprisingly capable for terminal-native workflows. The honest review.
Prompt caching is the highest-ROI optimization for heavy Claude API users. Done right, it cuts costs 50-80%. Here's how — with concrete patterns.
Claude skills are reusable agent capabilities. They're powerful — but writing one for the wrong workflow is wasted effort. Here's the practical guide.
Cursor's Composer and Agent mode look similar but optimize for different work. Composer is for in-flow edits; Agent is for delegated multi-step. The decision tree.
Aider and Claude Code both do agent-style coding from a terminal. They diverge on git workflow, model flexibility, and edit precision. The honest comparison for refactor work.