Claude Rate Limit – Why It Happens and How to Fix It

Quick Answer

Claude Pro enforces a 5-hour rolling usage window — not a daily reset. When you exhaust that window, you must wait until the oldest messages age out before the quota refreshes. Free users face stricter caps with no fixed window. As of May 6, 2026, Anthropic removed peak-hour throttling for Pro and Max subscribers, so you no longer get slower responses during busy periods (5am–11am PT). To continue working sooner: upgrade to Max ($100–$200/month for 5x–20x more headroom), batch your messages, or switch to shorter conversations.

What Causes Claude Rate Limits

Claude enforces rate limits at two levels:

  1. Per-user usage window — a cap on how much compute you can consume within a rolling 5-hour window (Pro) or similar rolling period (Free)
  2. Server-side capacity limits — triggered when Claude's infrastructure is under high global load, affecting all users temporarily

The error messages you see differ between these two cases:

| Error | What It Means |
|-------|---------------|
| "You've reached your usage limit" | You personally exhausted your plan's rolling window |
| "Claude is at capacity right now" | Global server load, not your personal limit |
| "Too many requests" (API) | API rate limit exceeded; slow down requests |
| "Rate limit — try again later" | Rolling window exhausted; wait for the oldest requests to cycle out |


How the 5-Hour Rolling Window Works

Claude Pro does not reset at midnight or at a fixed time each day. Instead, it uses a 5-hour rolling window:

  • Every message you send enters the window
  • 5 hours after each message, it exits the window and the quota it used is freed
  • When the window fills up, you're rate limited until old messages expire

Practical example:

  • 9:00am — you send 30 long messages; window fills up
  • You see "usage limit reached"
  • 2:00pm — the 9:00am messages start expiring; quota frees up gradually
  • By 2:30pm — enough quota has freed that you can send new messages

This is why the rate limit notification shows a specific time estimate rather than "try again tomorrow."
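
The mechanics above can be sketched as a small simulation. This is an illustration only: the `RollingWindow` class, the capacity of 30 units, and the one-unit-per-message cost are assumptions made up for the example, and Anthropic does not publish how usage is actually metered.

```python
from collections import deque


class RollingWindow:
    """Tracks usage inside a fixed-length rolling window (times in minutes)."""

    def __init__(self, capacity, window_minutes=300):
        self.capacity = capacity        # hypothetical quota units per window
        self.window = window_minutes    # 5 hours = 300 minutes
        self.events = deque()           # (timestamp, cost) pairs, oldest first
        self.used = 0

    def _expire(self, now):
        # Drop entries that are at least one full window old and free their quota.
        while self.events and now - self.events[0][0] >= self.window:
            _, cost = self.events.popleft()
            self.used -= cost

    def try_send(self, now, cost=1):
        """Record a message if quota allows; return True when it was accepted."""
        self._expire(now)
        if self.used + cost > self.capacity:
            return False
        self.events.append((now, cost))
        self.used += cost
        return True

    def next_free_at(self):
        """Time at which the oldest entry leaves the window, or None if empty."""
        if not self.events:
            return None
        return self.events[0][0] + self.window


# Times are minutes since midnight: 9:00am = 540, 2:00pm = 840.
w = RollingWindow(capacity=30)
accepted = [w.try_send(540 + i) for i in range(30)]  # 30 messages, 9:00 to 9:29
blocked = w.try_send(600)     # 10:00am: window is full, so this is rejected
free_at = w.next_free_at()    # 840, i.e. 2:00pm, when the 9:00am message expires
after = w.try_send(840)       # 2:00pm: one slot has freed, so this is accepted
```

Because each entry expires individually, quota returns gradually from 2:00pm onward rather than all at once, which matches the timeline in the example.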

As of May 6, 2026, Anthropic removed peak-hour throttling for Pro and Max subscribers, so responses no longer slow down during the 5am–11am PT busy period. The rolling window limit still applies.


Claude Rate Limits by Plan

| Plan | Usage Window | Monthly Cost |
|------|--------------|--------------|
| Free | Small rolling window, no 5-hour structure | $0 |
| Claude Pro | 5-hour rolling window (~5x free) | $20 |
| Claude Max | 5-hour rolling window (5x Pro) | $100 |
| Claude Max | 5-hour rolling window (20x Pro) | $200 |
| API (Tier 1) | Tokens per minute + requests per day | Pay per use |
| API (Higher Tiers) | Higher TPM/RPD unlocked by spend history | Pay per use |

Free plan limits are not published officially but are significantly lower than Pro. Claude Pro gives priority access; as of May 2026, this also means no peak-hour throttling.


Step-by-Step Fix

1. Check the Reset Time in the Error Message

When Claude shows a rate limit notification, it typically includes an estimated wait time. Note that time and stop sending requests. Every additional attempt wastes quota without getting you closer to a response.

2. Understand What You Can Do Right Now

While waiting, you have a few options:

  • Work on tasks that do not require Claude (local tools, other work)
  • Use a different AI tool temporarily (ChatGPT, Gemini, Perplexity)
  • Plan your next Claude session so you use it more efficiently when quota resets

3. Slow Down Your Request Pace

If you hit limits regularly, batch multiple questions into a single message instead of rapid-fire follow-ups.

Instead of:

Message 1: "Summarize this article"
Message 2: "Now translate it to Spanish"
Message 3: "Make the Spanish version shorter"

Do this:

"Summarize this article, then translate the summary to Spanish, then shorten the Spanish version to 100 words."

Each follow-up makes Claude re-process the conversation so far, so three separate messages draw more from your rolling window than one combined request. Combining tasks extends how long your quota lasts.

4. Keep Conversations Shorter

Long conversation threads consume more quota per message because Claude re-processes the entire history each time. Start fresh conversations for new topics rather than extending one long thread.

Rule of thumb: When you switch topics, start a new conversation. Each new chat starts with zero accumulated context, making each message cheaper.
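
A rough calculation shows why long threads get expensive. The sketch below assumes that usage tracks the tokens Claude processes and that every turn re-reads the full history. The 200-token questions and 500-token replies are invented figures, and real accounting (caching, for instance) may differ.

```python
def total_input_tokens(turn_tokens, reply_tokens):
    """Sum the input tokens processed across a thread in which every turn
    re-sends all earlier messages and replies."""
    history = 0
    total = 0
    for turn, reply in zip(turn_tokens, reply_tokens):
        total += history + turn   # the model reads the history plus the new message
        history += turn + reply   # both sides of the exchange join the history
    return total


# Hypothetical sizes: 10 questions of 200 tokens, each answered in 500 tokens.
one_long_thread = total_input_tokens([200] * 10, [500] * 10)        # 33500
# The same 10 questions split across two fresh 5-turn conversations.
two_short_threads = 2 * total_input_tokens([200] * 5, [500] * 5)    # 16000
```

Under these assumptions, splitting the work into two conversations roughly halves the tokens processed, because the cost of a thread grows with the square of its length rather than linearly.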

5. Request Shorter Outputs

When you do not need a full essay, say so:

  • "In 2-3 sentences..."
  • "Just the code, no explanation"
  • "Give me the bullet points only"

Shorter responses consume less of your rolling window.

6. Upgrade Your Plan

If you regularly hit the Pro rate limit, consider Claude Max:

  • Claude Max $100/month — 5x the Pro window (effectively 25x free)
  • Claude Max $200/month — 20x the Pro window (effectively 100x free)

Upgrade at claude.ai/upgrade.
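
To compare the tiers, it can help to work out the price per unit of Pro-sized headroom. The snippet below uses only the prices and multipliers listed in this guide; the labels "Max 5x" and "Max 20x" are shorthand chosen for the example.

```python
# (monthly price in USD, usage relative to Pro), taken from the plan table above.
plans = {"Pro": (20, 1), "Max 5x": (100, 5), "Max 20x": (200, 20)}

# Dollars paid per one Pro-sized usage window.
cost_per_pro_unit = {name: price / mult for name, (price, mult) in plans.items()}
# {'Pro': 20.0, 'Max 5x': 20.0, 'Max 20x': 10.0}
```

On these figures the $100 tier costs the same per unit as Pro, while the $200 tier halves the unit price, so the higher tier only pays off if you would actually use the extra headroom.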

7. Switch to the Claude API for Heavy Workloads

For automated or high-volume use, the Claude API is more appropriate than the web app:

  • Limits are token-based (tokens per minute), not conversation-based
  • Higher throughput available at higher API tiers
  • Supports exponential backoff and programmatic retry logic

Set up access at console.anthropic.com.
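
Because API limits are expressed in tokens per minute, a client can pace itself before the server has to reject anything. The following is a minimal sketch of a client-side token bucket. `TokenBudget` and the 30,000 tokens-per-minute figure are hypothetical, and your real limit depends on your tier and model.

```python
class TokenBudget:
    """Client-side pacing: a bucket that refills at `tokens_per_minute`."""

    def __init__(self, tokens_per_minute):
        self.rate = tokens_per_minute / 60.0      # tokens regained per second
        self.capacity = float(tokens_per_minute)  # bucket never exceeds one minute's worth
        self.available = float(tokens_per_minute)
        self.last = 0.0                           # time of the last update, in seconds

    def wait_needed(self, now, tokens):
        """Return the seconds to wait before a request of `tokens` fits.
        Returns 0.0 and deducts the tokens when the request fits right now."""
        if tokens > self.capacity:
            raise ValueError("request is larger than the per-minute budget")
        # Refill for the time elapsed since the last call, up to capacity.
        elapsed = now - self.last
        self.available = min(self.capacity, self.available + elapsed * self.rate)
        self.last = now
        if tokens <= self.available:
            self.available -= tokens
            return 0.0
        return (tokens - self.available) / self.rate


budget = TokenBudget(tokens_per_minute=30_000)   # hypothetical limit
first = budget.wait_needed(0.0, 20_000)    # fits immediately
second = budget.wait_needed(0.0, 20_000)   # only 10,000 left, so wait 20 seconds
third = budget.wait_needed(20.0, 20_000)   # after waiting, the request fits
```

In real use, `now` would come from `time.monotonic()` and the caller would sleep for the returned duration before sending.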


API Rate Limit Errors (429 Too Many Requests)

If you're using the Claude API and seeing HTTP 429 errors:

  1. Check your tier limits at console.anthropic.com under Usage and Limits
  2. Implement exponential backoff — start with a 1s delay, double on each retry, cap at 60s
  3. Reduce tokens per minute — large prompts consume TPM faster than short ones
  4. Batch requests — combine multiple short requests into one where possible
  5. Request a tier upgrade — API tiers unlock automatically based on cumulative account spend
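
Step 2 can be sketched as follows, using only the Python standard library. `RateLimited`, `call_with_backoff`, and the `send` callable are hypothetical names for this example and are not part of any Anthropic SDK. The `retry_after` field assumes your HTTP layer passes along a Retry-After value when the response includes one.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical exception the caller raises on an HTTP 429 response."""

    def __init__(self, retry_after=None):
        super().__init__("rate limited")
        self.retry_after = retry_after  # seconds from a Retry-After header, if any


def backoff_delays(base=1.0, cap=60.0):
    """Yield 1s, 2s, 4s, ... doubling each time and never exceeding `cap`."""
    delay = base
    while True:
        yield min(delay, cap)
        delay *= 2


def call_with_backoff(send, max_retries=8, sleep=time.sleep, jitter=0.0):
    """Call `send()` until it succeeds or the retries run out."""
    delays = backoff_delays()
    for attempt in range(max_retries + 1):
        try:
            return send()
        except RateLimited as err:
            if attempt == max_retries:
                raise  # give up and surface the error to the caller
            delay = next(delays)
            if err.retry_after is not None:
                # Never retry sooner than the server asked us to.
                delay = max(delay, err.retry_after)
            # Optional random jitter spreads out retries from parallel clients.
            sleep(delay + random.uniform(0, jitter))
```

A caller would wrap its own request function, for example `call_with_backoff(lambda: post_message(payload), jitter=0.5)`, where `post_message` is whatever function issues the request and raises `RateLimited` on a 429.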

Why This Happens: The Technical Reason

Claude is computationally expensive to run. Each response requires significant GPU processing, and the cost scales with the length and complexity of both your input and Claude's output. The 5-hour rolling window exists to keep a small number of power users from consuming so much capacity that response times degrade for everyone else. The rolling structure (rather than a midnight reset) distributes load more evenly across the day.


Common Mistakes to Avoid

  • Retrying immediately after a limit: this wastes remaining quota and can worsen throttling; wait for the reset time shown
  • Running many parallel conversations: every browser tab or device signed in to the same account draws on the same rolling window
  • Sending very long prompts repeatedly: long inputs consume the rolling window faster; be concise with context
  • Confusing server capacity errors with personal limits: "Claude is at capacity" is global and clears on its own; your personal limit requires waiting out the 5-hour window
  • Expecting a midnight reset: Claude Pro uses a rolling window, not a daily cycle; you may hit a limit at 3pm and not get a reset until 8pm
  • Uploading large files repeatedly: file uploads count toward the rolling window; upload once and reference the file in subsequent messages

When to Contact Support

If you believe you're being rate limited incorrectly — for example, you've barely used Claude that day and still see a limit message — contact Anthropic support at support.anthropic.com. Include the exact error message text, the timestamp, and your plan type.



Frequently Asked Questions

Claude Pro uses a 5-hour rolling window, not a fixed reset time. When you hit the limit, you must wait until messages sent 5 hours ago cycle out of the window. In practice, if you sent a burst of long requests at 9am, quota starts freeing up around 2pm. Claude displays an estimated wait time in the rate limit notification — follow that timer. Free users face a shorter rolling window with lower caps and no published reset schedule.
