Anthropic Defends Claude Code Cache TTL Reduction as Cost Optimization

A user-led data analysis revealed that Claude Code's cache TTL was reduced from 1 hour to 5 minutes in March 2026, sparking complaints about increased costs for long-running sessions. Anthropic responded that the change was an intentional optimization because 1-hour cache writes are more expensive than 5-minute writes. The company has declined to revert the change, opting instead for a dynamic TTL selection based on request patterns.
Key Points
- Data analysis of 119,866 API calls showed a shift from 1-hour to 5-minute cache TTL starting around March 6, 2026.
- Users reported that the shorter TTL leads to frequent cache expirations in long sessions, resulting in expensive re-uploads and rapid quota exhaustion.
- Anthropic clarified that 1-hour cache writes cost nearly double the rate of 5-minute writes, making 1-hour TTL less efficient for one-shot requests.
- The change was an intentional server-side optimization designed to lower total costs across a diverse mix of request types.
- Anthropic declined to restore a global 1-hour TTL default but fixed a client-side bug in v2.1.90 that affected quota management.
Sentiment
The overall sentiment is strongly negative toward Anthropic. While a few commenters defend specific technical decisions and acknowledge that third-party harnesses can waste compute, the overwhelming majority express frustration with Anthropic's lack of transparency, fear-inducing enforcement policies, and pattern of silent service degradation. The discussion reveals a community that was once enthusiastic about Claude but is now increasingly disillusioned and actively considering alternatives.
In Agreement
- The cache TTL change is technically justified because 1-hour TTL writes are significantly more expensive, and dynamic selection based on expected reuse makes economic sense
- Third-party harnesses often have poor cache management, with specific evidence that some break prompt caching entirely by aggressively trimming context on every turn
- Anthropic handled the OpenClaw situation reasonably by transitioning to extra credits billing with a discount rather than outright banning
- Claude Code is optimized for token efficiency in ways that third-party harnesses cannot match, justifying the subscription pricing model
Opposed
- Anthropic is silently degrading service quality across multiple dimensions including cache TTL, reasoning effort, usage limits, and response length in what appears to be desperate cost-cutting
- The enforcement of third-party harness bans creates a chilling effect where users fear using legitimate features like claude -p for automation
- Users paying $200/month deserve transparency and premium service, not silent downgrades and mysterious account bans with no customer support recourse
- Anthropic should raise prices honestly instead of enshittifying the product — the current approach erodes trust more than a price increase would
- Competition from Codex is catching up while Claude Code becomes more restrictive and less reliable
- Customer support is effectively non-existent, with banned users having no meaningful appeal process