TOPIC

Simon Willison

topictopic-notepractitioner

Overview

Simon Willison is a long-running practitioner-voice blogger whose daily LLM digest at simonwillison.net the AI Digest tracks as a primary-source stream alongside Andrej Karpathy’s X/YouTube output. Willison’s posts tend to crystallise practitioner consensus a beat ahead of mainstream coverage — the “best model crown changed hands five times in six months” framing, the “Claws” product-category naming, and the “coding agents have crossed the daily-driver reliability bar” claim all originated as Willison observations before being picked up elsewhere.

Timeline

  • 2026-05-02-AI-Digest — Willison publishes an end-to-end iNaturalist sightings explorer written entirely on a phone via Claude Code for web; the “build it in an afternoon, on a phone, while waiting” framing is the corpus’s load-bearing data point that individual-developer productivity ceiling has moved further than headline model-capability releases suggest.
  • 2026-05-10-AI-Digest — Willison amplifies Thariq Shihipar’s argument that asking Claude to emit HTML — not Markdown — unlocks SVG diagrams, interactive widgets, in-page navigation, and other rendering the Markdown surface cannot carry. Developer-tooling-affordance discovery, not a new model capability.
  • 2026-05-11-AI-Digest — Willison publishes a piece arguing “vibe coding” and “agentic engineering” are converging on the same practice; the gap between casual prototypers and professional agentic engineers is narrowing faster than either community acknowledges.
  • 2026-05-19-AI-Digest — PyCon US 2026 lightning talk’s annotated slides publish: the “best model crown changed hands five times” framing across Anthropic, OpenAI, and Google in six months (Willison’s own “depending mostly on vibes” hedge), with Claude Opus 4.5 holding longest; coding agents moved from “often-work to mostly-work”; the “Claws” category has consolidated; Chinese open-weights (GLM-5.1, Qwen 3.6-35B-A3B) “wildly outperforming expectations” on laptop-local inference.
  • 2026-05-20-AI-Digest — Willison publishes the annotated slides from his PyCon US 2026 lightning talk as a five-minute compressed retrospective covering Nov 2025 through May 2026 — the corpus is going to lean on this synthesis for the next several weeks. Two load-bearing claims: coding agents have crossed the “daily-driver reliability” bar via late-2025 RL work, and ~20GB open-weight models running locally on laptops now compete with proprietary frontier models on practical workloads (GLM-5.1 and Qwen 3.6-35B-A3B at 20.9GB quantised are his cited reference points). The “best-model crown changed hands five times in six months” framing extends into the new post unchanged.

Key Developments

  1. Practitioner Synthesis as Corpus Working Frame: Willison’s “last six months” post is the synthesis the back half of 2026 will be read against — five frontier-crown handovers, coding agents at daily-driver reliability, and 20GB local-laptop models within reach of proprietary frontier are the three currents the corpus is now anchoring to.

  2. Local-Inference Floor Naming: Willison’s reference points (GLM-5.1, Qwen 3.6-35B-A3B at 20.9GB quantised) are what the corpus now uses to describe the “frontier-on-a-laptop” floor. The pelican-on-bicycle SVG benchmark continues to climb as the practitioner-flavored capability marker.

  3. Vibe-Coding / Agentic-Engineering Convergence: Willison’s framing that the two practices are collapsing into one — at different velocity-and-oversight settings — is the conceptual coda to the Airbnb/Snap/Google ”% AI-authored code” CEO disclosures, recasting the question from “which tool serious engineers use” to “which tool serves the full velocity spectrum.”