TD Stuff

Letta Code: Stateful Coding Agents That Learn and Lead on Terminal-Bench

Dec 17, 2025

A memory-first, stateful coding agent that learns from experience and matches provider-specific harness performance across models.

AI Coding Agents LLM Context Management AI Benchmarks Open Source

Products & Announcements

OpenAI Quietly Ships Skills in ChatGPT and Codex CLI

Dec 13, 2025587

OpenAI has quietly adopted Anthropic-style skills in ChatGPT and Codex CLI, proving the simple folder-based pattern works and should be standardized.

AI Coding Agents AI Agents Developer Tooling OpenAI LLM Context Management

Programming

When AI ‘Improves’ Code: 200 Runs, 84k LOC, and Little Real Quality

Dec 11, 2025641

Unconstrained AI optimized for the wrong signals, turning ‘quality’ into bloat and busywork rather than real improvements.

AI Coding Agents Software Craftsmanship Human-AI Collaboration Technical Debt

Programming

Pixel-Perfect, Not Intent-Perfect: Rebuilding Space Jam ’96 with Tests and Nori

Dec 8, 2025116

Good tests and tailored configs let Claude rebuild Space Jam ’96, but the ‘pixel-perfect’ target nudged it to game the metric—showing why objective design matters more than prompts.

AI Coding Agents Developer Tooling Software Craftsmanship Automated Testing

Programming

Write a Minimal, High-Leverage CLAUDE.md

Dec 1, 2025748

Keep CLAUDE.md minimal, universal, and handcrafted—push specifics to on-demand docs and use deterministic tools for everything else.

AI Coding Agents LLM Context Management Prompt Engineering Developer Tooling

Agentic Systems

From Chatbot to Coworker: Gemini 3 Ushers in the Agent Era

Nov 24, 2025352

AI has moved from chatting to doing—Gemini 3 acts like a capable digital coworker that plans and builds while you manage.

AI Agents Human-AI Collaboration AI Coding Agents AI for Science

Products & Announcements

Claude Opus 4.5 Launches: Safer SOTA Coding and Agents, Now Cheaper and More Efficient

Nov 24, 20251113

Claude Opus 4.5 debuts as a safer, cheaper, and more efficient SOTA model for coding and agentic workflows, backed by platform and product updates that turn frontier reasoning into practical, long-running work.

AI Coding Agents AI Agents AI Safety AI Benchmarks

Products & Announcements

GPT‑5.1‑Codex‑Max: Long‑Horizon Agentic Coding with Compaction and Fewer Tokens

Nov 19, 2025483

GPT-5.1-Codex-Max brings compaction-powered, long-running agentic coding with better accuracy and far fewer tokens, and is now the default Codex model with enhanced safeguards.

AI Coding Agents LLM Context Management AI Benchmarks OpenAI

Programming

Google Antigravity: An Agent-First IDE for Autonomous, Trustworthy Coding

Nov 18, 2025193

Antigravity is Google’s agent-first IDE and manager that enables autonomous, trustworthy, and asynchronous software development with built-in feedback and learning.

AI Coding Agents Developer Tooling Human-AI Collaboration Corporate AI Strategy Task Orchestration

Products & Announcements

Google Antigravity: An Agent‑First AI IDE for Trusted, Cross‑Surface Development

Nov 18, 20251088

An AI, agent-first IDE that coordinates trusted, cross-surface development workflows and multi-agent management, free to download.

AI Coding Agents Developer Tooling Human-AI Collaboration Task Orchestration

Products & Announcements

Gemini 3: Google’s most intelligent, widely deployed AI arrives

Nov 18, 20251735

Gemini 3 launches as Google’s most intelligent, widely deployed, and safety-hardened AI—advancing reasoning, multimodality, agentic coding, and long-horizon planning across products and platforms.

AI Benchmarks AI Coding Agents Multimodal AI AI Safety Corporate AI Strategy

Products & Announcements

Gemini 3 Pro Comes to Gemini CLI: 5 Ways to Supercharge Your Terminal

Nov 18, 2025104

Gemini 3 Pro now powers the Gemini CLI, turning natural-language ideas into end-to-end terminal workflows—from coding to cloud ops.

AI Coding Agents Developer Tooling Multimodal AI Human-AI Collaboration

Products & Announcements

Gemini 3 Pro launches: agentic coding meets multimodal app building

Nov 18, 20251735

Google’s Gemini 3 Pro ushers in agentic, multimodal app building—turning natural-language ideas into production-ready software across an integrated developer stack.

AI Coding Agents Multimodal AI Vibe Coding Developer Tooling

Agentic Systems

Skip MCP: Tiny Bash + Puppeteer Tools Beat Bloated Browser Dev Servers

Nov 17, 2025237

Skip MCP: use a tiny, composable Bash + Puppeteer toolset with a short README to drive browser work more efficiently.

Model Context Protocol AI Coding Agents Developer Tooling Browser Automation LLM Context Management

Programming

Codemaps: Just-in-time AI maps for understanding and navigating your codebase

Nov 4, 2025315

Windsurf Codemaps gives humans and AI a shared, just-in-time map of your code so you can understand, navigate, and safely ship faster.

AI Coding Agents Developer Tooling LLM Context Management Human-AI Collaboration Vibe Coding

Programming

Operationalizing Claude Code: Guardrails, Context Hygiene, Skills, and CI

Nov 2, 2025534

Treat Claude Code as an operational system—guardrails in CLAUDE.md, explicit context hygiene, scripting-first Skills, and CI integration—then let the agent orchestrate itself.

AI Coding Agents Developer Tooling LLM Context Management CI/CD Task Orchestration

Products & Announcements

Composer: A Fast, RL-Trained Coding Agent for Real-World Software Development

Oct 29, 2025215

A fast, RL-trained MoE coding agent that brings frontier-level usefulness to real-world development with tools, long context, and production-grade infrastructure.

AI Coding Agents Reinforcement Learning AI Benchmarks AI Infrastructure Developer Tooling

Products & Announcements

Claude Code on the Web: A Solid v1 That Outshines Cursor

Oct 28, 2025161

A solid, dependable v1 of Claude Code on the web makes async coding tasks easy and outshines Cursor’s more finicky version.

AI Coding Agents Developer Tooling Vibe Coding AI & Productivity

Products & Announcements

Claude Code on the Web: Secure Parallel Coding Tasks in Your Browser

Oct 20, 2025578

Delegate and parallelize secure, cloud-run coding tasks from your browser (and iOS) with Claude Code on the web.

AI Coding Agents Developer Tooling Cloud Infrastructure Sandboxing

Programming

Reddit Sentiment: Codex Beats Claude Code, but Claude Wins on Speed and UX

Oct 18, 2025141

Codex wins on perceived capability, Claude Code wins on speed and UX, and Reddit talks far more about Claude—choose based on your priorities.

AI Coding Agents Developer Tooling AI Benchmarks Sentiment Analysis

Products & Announcements

Claude Skills: Simple Files, Big Agent Power

Oct 17, 2025738

A simple, token-efficient “skills as Markdown” approach turns Claude Code into a powerful general agent, likely outpacing MCP in practicality and adoption.

AI Coding Agents Model Context Protocol LLM Context Management Developer Tooling

Products & Announcements

Claude Skills: Portable, Composable Expertise Across Apps, Code, and API

Oct 16, 2025816

Claude Skills let you package and auto-load expertise—plus code—so Claude can perform specialized tasks reliably across apps, code, and API.

AI Coding Agents Developer Tooling LLM Context Management Prompt Engineering

Products & Announcements

Claude Haiku 4.5: Near-Frontier Coding at 1/3 Cost and 2x+ Speed

Oct 15, 2025730

Anthropic’s Claude Haiku 4.5 brings near-frontier coding capability at a fraction of the cost and latency, with strong safety and immediate, broad availability.

AI Coding Agents AI Benchmarks Technology Economics AI Safety Task Orchestration

Programming

Superpowers: Enforceable Skills for Reliable Coding Agents

Oct 11, 2025435

A Claude Code plugin that turns skills into enforceable procedures, delivering a disciplined, self-improving coding agent workflow powered by TDD, subagents, and persuasion-aware testing.

AI Coding Agents Prompt Engineering Task Orchestration Developer Tooling

Programming

Vibe Code Hell: When AI Builds Your App but Not Your Understanding

Oct 10, 2025283

Turn off the copilot, do the hard work yourself, and use AI only as a Socratic tutor if you actually want to learn.

Vibe Coding AI in Education AI Coding Agents AI & Productivity

Programming

We Normalized Broken Software—and Physics Won’t Bail Us Out

Oct 9, 2025314

We normalized broken software and tried to paper it over with AI and hardware, but physics and fundamentals are catching up.

Software Craftsmanship Technical Debt Technology Economics Service Reliability AI Coding Agents

Programming

Two Reasons LLM Coding Agents Still Miss the Mark

Oct 9, 2025345

LLM coding agents still mishandle code movement and avoid clarifying questions, making them unreliable, overconfident interns rather than developer replacements.

AI Coding Agents Human-AI Collaboration Software Craftsmanship Vibe Coding

Products & Announcements

Gemini CLI extensions: Build a personalized, AI-powered terminal with an open ecosystem

Oct 8, 2025158

Gemini CLI extensions let you turn the terminal into a personalized, AI-powered hub by installing intelligent tool integrations from an open ecosystem.

Developer Tooling Model Context Protocol AI Coding Agents Open Source

Products & Announcements

Recall: Persistent Redis Memory for Claude (v1.5)

Oct 8, 2025171

A Redis‑backed MCP server that gives Claude persistent, secure, cross‑session memory with powerful organization, search, and governance features.

Model Context Protocol LLM Context Management Developer Tooling AI Coding Agents

Damage Control

After the GenAI Bubble: Fewer Layoffs, Persistent Hallucinations, and Pragmatic Code Gen

Oct 1, 2025

GenAI’s hype will pop: hallucinations persist, mass layoffs won’t happen, code-gen becomes a practical tool, and after the bubble bursts we’ll avoid the grifters’ future.

AI Hype AI Hallucinations AI Coding Agents Technology Economics Labor Economics

Agentic Systems

Designing Safe, Effective Agentic Loops for Coding Work

Sep 30, 2025284

Safely empower coding agents to iterate autonomously by sandboxing YOLO mode, exposing simple shell tools, tightly scoping credentials, and relying on tests to guide trial-and-error.

AI Coding Agents Sandboxing AI Safety Developer Tooling

Damage Control

Comprehension Debt: The Hidden Cost of Fast AI Code

Sep 30, 2025532

Rapidly shipping unread LLM-generated code creates a mounting comprehension debt that will slow teams down when real changes are needed.

AI Coding Agents Technical Debt Software Craftsmanship Code Review

Products & Announcements

Claude Code: Terminal-Native AI Coding Agent with Easy Install and Privacy Safeguards

Sep 29, 2025842

A terminal-native coding agent that accelerates development via natural language, easy to install and backed by clear privacy safeguards.

AI Coding Agents Developer Tooling Data Privacy Human-AI Collaboration

Products & Announcements

Claude Sonnet 4.5 Launches: SOTA Coding & Agent Model With SDK and Major Product Upgrades

Sep 29, 20251585

Anthropic unveils Claude Sonnet 4.5—its state-of-the-art, most aligned coding and agent model—alongside major product upgrades and a new Agent SDK, available now at the same price.

AI Coding Agents AI Agents Developer Tooling AI Safety AI Benchmarks

Agentic Systems

Avoiding the AI Coding Trap: Treat LLMs Like Fast Juniors with Real Engineering Discipline

Sep 28, 2025685

Use AI’s speed within disciplined engineering practices—treat LLMs like fast juniors—to ship sustainable, high-quality software instead of quick but brittle code.

AI Coding Agents Software Craftsmanship Vibe Coding Human-AI Collaboration Technical Debt

Agentic Systems

Coding Agents Don’t Lack IQ—They Lack Context

Sep 26, 2025196

The bottleneck for autonomous coding isn’t IQ—it’s missing, implicit context that agents must access, synthesize, and query humans about.

AI Coding Agents LLM Context Management Human-AI Collaboration AI Benchmarks

Agentic Systems

Rebuilding a Startup Site with Claude: Fast, Powerful—But Human-Guided

Sep 26, 2025178

AI can help non-engineers ship real, high-fidelity code fast—so long as humans stay in the loop to guide, review, and correct.

AI Coding Agents Human-AI Collaboration Developer Tooling AI & Productivity

Agentic Systems

How HubSpot Scaled AI Coding: Context, Central Teams, and Data-Driven Rollout

Sep 24, 2025

Treat AI coding as a platform capability: measure it, centralize enablement, hardwire context, remove friction—and adoption will safely scale to unlock agents and bigger wins.

AI Coding Agents Developer Tooling AI & Productivity Enterprise AI Adoption Model Context Protocol

Products & Announcements

Zed Switches to Token-Based AI Billing, Cuts Pro to $10, Adds GPT‑5 and Gemini

Sep 24, 2025182

Zed switches to token-based AI billing, cuts Pro to $10 with credits, adds top models, and offers flexible BYO/local options with a staged migration.

Developer Tooling AI Coding Agents AI Business Models Technology Economics

Agentic Systems

Make AI Work in Big Repos: Spec-First Workflow and Frequent Intentional Compaction

Sep 23, 2025517

Make AI work in big, messy repos by compacting context and reviewing specs, not just code: research → plan → implement, with humans focused upstream.

AI Coding Agents LLM Context Management Code Review AI & Productivity

Agentic Systems

Faster LLMs, Bigger Demands: Why Coding Agents Won’t Stabilize Soon

Sep 22, 2025137

Faster LLMs will reshape coding workflows and productivity, but escalating demand, hardware limits, and pricing pressures mean a bumpy, fast-changing road ahead.

AI Coding Agents AI & Productivity AI Infrastructure LLM Inference AI Business Models

Agentic Systems

AI Amplifies Seniors, Not Juniors

Sep 21, 2025461

Today, AI amplifies senior engineers’ impact instead of democratizing coding for juniors.

AI Coding Agents Human-AI Collaboration AI & Productivity Software Craftsmanship

Agentic Systems

AI Agents Can Already Do Lean Proofs—But They Still Need a Project Manager

Sep 20, 2025219

A general-purpose AI coding agent can already do real Lean proof engineering with guidance, hinting that theorem proving may soon be cheap and automated despite today’s rough edges.

AI Coding Agents Human-AI Collaboration Formal Verification

Programming

Small, Business-Value Units Make AI Coding Work

Sep 18, 2025170

Make AI coding reliable by breaking work into small, business-valued, human-verifiable units and rigorously engineering the context for each.

AI Coding Agents LLM Context Management Human-AI Collaboration Software Craftsmanship

Programming

Microsoft steers VS Code to Claude 4, signaling an Anthropic tilt

Sep 16, 2025213

Microsoft is steering VS Code and parts of Microsoft 365 toward Anthropic’s Claude where it performs best, even as it builds its own models and keeps working with OpenAI.

AI Coding Agents Developer Tooling Corporate AI Strategy OpenAI

Programming

GPT‑5-Codex Lands in Codex Tools, API Coming Soon, With Big Push on Code Review

Sep 16, 2025

OpenAI’s GPT‑5-Codex is a tooling-first, code-focused upgrade that boosts review and refactoring while the API and polish catch up.

AI Coding Agents Code Review OpenAI Developer Tooling

Programming

When Code Gets Cheap: Value Shifts to Judgment and Systems

Sep 15, 2025115

As code gets cheap, the scarce—and valuable—skills become judgment, integration, and systems thinking, not typing more code.

AI Coding Agents Technology Economics Human-AI Collaboration Future of Work

Products & Announcements

GPT-5-Codex: Agentic Coding with Layered Safety

Sep 15, 2025250

A safety-focused addendum introduces GPT-5-Codex, an agentic coding model trained on real tasks, widely available, and protected by layered mitigations.

AI Coding Agents AI Safety OpenAI Reinforcement Learning

Programming

LLMs Don’t Code—They Compile Your Prompts

Sep 13, 2025410

LLMs don’t write code—they compile your prompts; treat them as tools and fix our languages and tooling instead of buying the hype.

AI Coding Agents Vibe Coding AI Hype AI & Productivity Prompt Engineering

Programming

Async Programming as a Workflow: Specify, Automate, Review

Sep 11, 2025123

Define problems clearly, automate verification, and review thoroughly so AI can build in the background while you focus on higher-leverage engineering work.

AI Coding Agents Human-AI Collaboration Code Review AI & Productivity

Programming

Reviving a 1990s Linux Tape Driver with AI in Two Evenings

Sep 8, 2025929

With careful guidance, an AI coding agent helped revive a 1990s Linux tape driver to run on modern kernels, proving AI as a strong force multiplier for legacy code.

AI Coding Agents Human-AI Collaboration Computing History Linux Kernel Development

Programming

AI Gatekeeper Slashes E2E CI Time by 84%

Sep 6, 2025105

Let Claude Code act as an AI gatekeeper that inspects your PR and runs only the relevant E2E tests—cutting CI time by ~84% without losing coverage.

AI Coding Agents CI/CD Automated Testing Developer Tooling

Programming

Disciplined AI Collaboration: Plan, Measure, and Ship in Small, Reliable Modules

Sep 6, 2025

Constrain AI with small, testable modules and continuous measurement to turn planning into reliable, data-driven delivery.

AI Coding Agents Human-AI Collaboration Software Craftsmanship Software Architecture

Programming

Today’s Tally: 16 “Absolutely Right” + 5 “Right”

Sep 5, 2025651

A lighthearted dashboard counts how often Claude Code says he’s right—16 times "absolutely right" today plus 5 times "right."

AI Sycophancy AI Coding Agents AI UX LLM Training

Programming

Ship Faster by Orchestrating Parallel AI Coding Agents

Sep 2, 2025

Run many AI coding agents in parallel, orchestrate and review their work, and you’ll ship more by trading precision for throughput.

AI Coding Agents Task Orchestration Human-AI Collaboration Code Review Developer Tooling

Programming

Ship Faster by Treating AI as a Forgetful Junior Dev

Sep 2, 2025550

Use AI as a forgetful junior dev: provide rich context, expect three iterations, and enforce rigorous review to ship faster with better focus.

AI Coding Agents Human-AI Collaboration LLM Context Management AI & Productivity

Programming

Senior Devs Ship More AI Code, Feel Faster—But Real Gains Are Mixed

Sep 1, 2025215

Senior devs ship more AI code and feel faster, but real productivity gains are uneven and often offset by rework, even as enjoyment rises and sustainability concerns grow.

AI Coding Agents AI & Productivity Human-AI Collaboration Developer Experience

Programming

Vibe Coding, Not Replacement: A 40-Hour Test of AI as a Powerful but Unforgiving Pair Programmer

Aug 30, 2025181

AI coding assistants dramatically accelerate development but demand expert oversight—vibe coding is a collaboration, not a replacement.

Vibe Coding AI Coding Agents Human-AI Collaboration AI & Productivity