Anthropic Defends Claude Code Cache TTL Reduction as Cost Optimization

Article: Negative · Community: Negative · Divisive

A user-led data analysis revealed that Claude Code's cache TTL was reduced from 1 hour to 5 minutes in March 2026, sparking complaints about increased costs for long-running sessions. Anthropic responded that the change was an intentional optimization because 1-hour cache writes are more expensive than 5-minute writes. The company has declined to revert the change, opting instead for dynamic TTL selection based on request patterns.

Key Points

  • Data analysis of 119,866 API calls showed a shift from 1-hour to 5-minute cache TTL starting around March 6, 2026.
  • Users reported that the shorter TTL leads to frequent cache expirations in long sessions, resulting in expensive re-uploads and rapid quota exhaustion.
  • Anthropic clarified that 1-hour cache writes cost nearly double the rate of 5-minute writes, making 1-hour TTL less efficient for one-shot requests.
  • The change was an intentional server-side optimization designed to lower total costs across a diverse mix of request types.
  • Anthropic declined to restore a global 1-hour TTL default but fixed a client-side bug in v2.1.90 that affected quota management.
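
The trade-off behind these points can be sketched with a small cost model. This is a minimal illustration, not Anthropic's billing logic or its dynamic selection rule: the function name session_cost and the price multipliers (a 5-minute write at 1.25x, a 1-hour write at 2.0x, and a cache read at 0.1x the base input price) are assumptions chosen for the example and may not match the rates behind the "nearly double" figure. It also assumes a fixed-size cached prefix and that every cache hit refreshes the TTL.

```python
from typing import Sequence

# ASSUMED multipliers relative to the base input-token price (illustrative only).
WRITE_MULT = {"5m": 1.25, "1h": 2.0}
READ_MULT = 0.1
TTL_SECONDS = {"5m": 300, "1h": 3600}


def session_cost(prefix_tokens: int, gaps_s: Sequence[float], ttl: str) -> float:
    """Cost of one session, in base-input-token units.

    prefix_tokens: size of the cached prompt prefix (held constant here,
                   which is a simplification; real contexts grow per turn).
    gaps_s:        idle seconds before each follow-up request.
    ttl:           "5m" or "1h".
    """
    write = WRITE_MULT[ttl] * prefix_tokens
    read = READ_MULT * prefix_tokens
    cost = write  # the first request always writes the cache
    for gap in gaps_s:
        # Hit if the follow-up arrives within the TTL; otherwise the
        # prefix has expired and must be written again at the write rate.
        cost += read if gap <= TTL_SECONDS[ttl] else write
    return cost


# One-shot request: no follow-ups, so the cheaper write wins.
one_shot = {ttl: session_cost(100_000, [], ttl) for ttl in ("5m", "1h")}

# Long session: five follow-ups, each after a 10-minute pause.
long_session = {ttl: session_cost(100_000, [600] * 5, ttl) for ttl in ("5m", "1h")}
```

Under these assumptions the one-shot request is cheaper with the 5-minute TTL (1.25 versus 2.0 prefix-units), while the session with 10-minute pauses is cheaper with the 1-hour TTL (about 2.5 versus 7.5 prefix-units, because the 5-minute cache expires before every follow-up). That asymmetry is the reason a single global default cannot be cheapest for every request pattern.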

Sentiment

The overall sentiment is strongly negative toward Anthropic. While a few commenters defend specific technical decisions and acknowledge that third-party harnesses can waste compute, the overwhelming majority express frustration with Anthropic's lack of transparency, fear-inducing enforcement policies, and pattern of silent service degradation. The discussion reveals a community that was once enthusiastic about Claude but is now increasingly disillusioned and actively considering alternatives.

In Agreement

  • The cache TTL change is technically justified because 1-hour TTL writes are significantly more expensive, and dynamic selection based on expected reuse makes economic sense.
  • Third-party harnesses often have poor cache management, with specific evidence that some break prompt caching entirely by aggressively trimming context on every turn.
  • Anthropic handled the OpenClaw situation reasonably by transitioning to extra credits billing with a discount rather than outright banning.
  • Claude Code is optimized for token efficiency in ways that third-party harnesses cannot match, justifying the subscription pricing model.

Opposed

  • Anthropic is silently degrading service quality across multiple dimensions, including cache TTL, reasoning effort, usage limits, and response length, in what appears to be desperate cost-cutting.
  • The enforcement of third-party harness bans creates a chilling effect in which users fear using legitimate features such as claude -p for automation.
  • Users paying $200/month deserve transparency and premium service, not silent downgrades and mysterious account bans with no customer support recourse.
  • Anthropic should raise prices honestly instead of enshittifying the product; the current approach erodes trust more than a price increase would.
  • Competition from Codex is catching up while Claude Code becomes more restrictive and less reliable.
  • Customer support is effectively non-existent, and banned users have no meaningful appeal process.