Microsoft steers VS Code to Claude 4, signaling an Anthropic tilt

Microsoft added auto model selection to VS Code that, for paid Copilot users, defaults to Anthropic’s Claude Sonnet 4 over OpenAI’s GPT-5. Internal benchmarks and guidance back the choice, and Microsoft plans to use Anthropic in Microsoft 365 where it tests better. Meanwhile, the company is scaling its own model training efforts and still deepening ties with OpenAI.
Key Points
- VS Code gains an auto model selector; paid Copilot users will primarily use Anthropic’s Claude Sonnet 4 over OpenAI’s GPT-5.
- Microsoft’s internal benchmarks recommend Claude Sonnet 4 for Copilot, and that guidance persists after GPT-5’s release.
- Microsoft 365 Copilot is expected to be partly powered by Anthropic models where they outperform OpenAI (notably in Excel and PowerPoint).
- Microsoft is ramping up in-house model training and compute, with plans to scale beyond the 15,000-H100 cluster used for MAI-1-preview.
- Microsoft maintains its partnership with OpenAI, recently striking a new deal that could aid OpenAI’s IPO.
Sentiment
The discussion is largely sympathetic to the article's premise that Claude is strong for coding and Microsoft's move makes strategic sense. Many commenters validate the choice from personal experience. However, there is a meaningful counter-current arguing GPT-5 and Codex are catching up or have surpassed Claude, and a general sense that model leadership is fleeting. The overall tone is moderately agreeing with the article but with significant caveats about durability.
In Agreement
- Claude is genuinely superior for coding tasks — it produces practical, actionable code while ChatGPT tends to describe what it would do rather than doing it
- Microsoft is making a pragmatic choice based on technical merit, treating AI models as commodities and positioning as a model-agnostic tool-maker
- It is notable that Microsoft would choose on merit over existing business relationships, suggesting a significant quality gap between Claude and OpenAI for coding
- Competitive pressure from Cursor likely forced Microsoft to offer the best available model in VS Code to protect its developer tool market share
- Claude also has an edge in creativity and style emulation beyond just code generation
Opposed
- GPT-5 and the new Codex models have matched or surpassed Claude for coding, making this preference potentially outdated
- The 'auto' model selector is really a mechanism to push users toward paid tiers rather than a genuine endorsement of Claude
- Anthropic has concerning practices including aggressive rate limiting, credit expiration, and data usage policies that make it a questionable partner
- Which model is 'best' changes every few months, so any default choice is temporary and benchmarks are not particularly helpful
- Claude Sonnet 4 has known weaknesses in math and formal proofs compared to GPT-5 Thinking