Live
OpenAI announces GPT-5 with unprecedented reasoning capabilitiesGoogle DeepMind achieves breakthrough in protein folding for rare diseasesEU passes landmark AI Safety Act with global implicationsAnthropic raises $7B as enterprise demand for Claude surgesMeta open-sources Llama 4 with 1T parameter modelNVIDIA unveils next-gen Blackwell Ultra chips for AI data centersApple integrates on-device AI across entire product lineupSam Altman testifies before Congress on AI regulation frameworkMistral AI reaches $10B valuation after Series C funding roundStability AI launches video generation model rivaling SoraOpenAI announces GPT-5 with unprecedented reasoning capabilitiesGoogle DeepMind achieves breakthrough in protein folding for rare diseasesEU passes landmark AI Safety Act with global implicationsAnthropic raises $7B as enterprise demand for Claude surgesMeta open-sources Llama 4 with 1T parameter modelNVIDIA unveils next-gen Blackwell Ultra chips for AI data centersApple integrates on-device AI across entire product lineupSam Altman testifies before Congress on AI regulation frameworkMistral AI reaches $10B valuation after Series C funding roundStability AI launches video generation model rivaling Sora
Tools

Anthropic Explains Why Claude Code Is Burning Through Your Limits So Fast

Anthropic has addressed widespread complaints about Claude Code's rapid usage drain — citing peak-hour rate caps and sessions with contexts exceeding one million tokens as the primary culprits, alongside a recommendation to switch from Opus to Sonnet 4.6.

D.O.T.S AI Newsroom

D.O.T.S AI Newsroom

AI News Desk

2 min read
Anthropic Explains Why Claude Code Is Burning Through Your Limits So Fast

Anthropic has publicly addressed one of the most common complaints from Claude Code subscribers: usage limits that appear to drain far faster than expected. The explanation, delivered via Anthropic engineer Lydia Hallie, names two primary causes — and it's worth understanding both if you're a Claude Code power user.

Cause 1: Peak-Hour Rate Caps

Anthropic applies tighter rate limits during peak usage hours to maintain service quality across its user base. This means that the same Claude Code session that runs smoothly at 10 PM may hit limits more quickly during afternoon US working hours, when demand on the infrastructure is at its highest.

This is a standard cloud infrastructure practice, but it creates a frustrating user experience when the limits aren't clearly surfaced in real time. If your Claude Code sessions feel slower or more throttled at certain times of day, this is the mechanism behind that pattern.

Cause 2: Context Windows That Grow to 1M+ Tokens

The second cause is more directly user-controllable: sessions where the context window grows to one million tokens or larger. Claude Code accumulates context across a session — every file read, every tool call, every prior turn gets appended to the running context. In long coding sessions with large codebases, this context can balloon to sizes that consume disproportionate compute resources, even when the visible conversation appears modest.

Hallie's recommendations are concrete: use Sonnet 4.6 instead of Opus where possible (Opus burns through limits roughly twice as fast), disable Extended Thinking when it's not needed for the specific task, start fresh sessions instead of extending old ones, and explicitly limit the context window size.

Bug Fixes and What They Didn't Cover

Anthropic also reports shipping several bug fixes related to the usage tracking system. Importantly, Hallie specified that none of the bugs identified led to incorrect billing — they affected how usage was surfaced to users, not what users were actually charged.

In-product pop-ups have been added to give users earlier warning before hitting limits. Users still experiencing anomalous usage drain after applying the recommended settings are directed to submit feedback through Claude Code's built-in feedback function.

The Underlying Tension

This episode highlights a structural tension in agentic AI products: the most powerful use cases (long multi-file coding sessions, complex reasoning chains) are precisely the ones that consume the most resources. Flat-rate subscription pricing becomes increasingly difficult to sustain as usage patterns shift toward compute-intensive agentic workflows. Anthropic hasn't changed its pricing model, but the pressure is visible.

Back to Home

Related Stories

Astropad's Workbench Turns a Mac Mini Into an AI Agent Server You Control From Your Phone
Tools

Astropad's Workbench Turns a Mac Mini Into an AI Agent Server You Control From Your Phone

Astropad, the company behind the Luna Display hardware that lets iPads function as Mac monitors, has built a new product for a new era: Workbench lets users remotely monitor and control AI agents running on Mac Minis from an iPhone or iPad. It is remote desktop software reimagined not for IT support but for the AI agent operator — the person who needs to check on autonomous workflows without being at their desk.

D.O.T.S AI Newsroom
Microsoft's Bing Team Open-Sources Harrier, a Multilingual Embedding Model That Tops the MTEB v2 Benchmark
Tools

Microsoft's Bing Team Open-Sources Harrier, a Multilingual Embedding Model That Tops the MTEB v2 Benchmark

Microsoft's Bing search team has released Harrier as an open-source embedding model, and it tops the multilingual MTEB v2 benchmark while supporting over 100 languages. The release is significant not just for the benchmark numbers but for the source: a search team that has spent decades optimizing retrieval systems has built an embedding model for the exact use case — semantic search and retrieval — that underpins most production RAG applications.

D.O.T.S AI Newsroom
Stability AI Pivots to Enterprise With Brand Studio — a Platform for Brand-Consistent AI Image Generation
Tools

Stability AI Pivots to Enterprise With Brand Studio — a Platform for Brand-Consistent AI Image Generation

Stability AI, the company that made open-source image generation mainstream with Stable Diffusion, is repositioning for enterprise with Brand Studio. The platform lets creative teams train brand-specific image models, automate visual production workflows, and route tasks to the best-suited AI model — a commercial play from a company that built its name on open access.

D.O.T.S AI Newsroom