What is Perplexity's rate limit for Pro users?

Perplexity Pro users are limited to 200 Pro searches per week as of May 2026. This quota covers searches that use advanced models like GPT-4o, Claude 3.5, or Sonar Large. The 200-search weekly limit resets every Monday at 00:00 UTC, regardless of when you started your subscription. Standard searches (using the default Sonar model) remain unlimited and do not count toward this cap.

What is the Perplexity API rate limit?

The Perplexity Sonar API (used by developers) has a default rate limit of 50 requests per minute (RPM) for new accounts. This is separate from the web interface quota entirely. API tier upgrades are available for higher throughput needs. If your application exceeds 50 RPM, you will receive a 429 Too Many Requests HTTP error. You can request a tier increase through the Perplexity API dashboard once your usage patterns are established.

How do I know if I've hit the rate limit vs. a server outage?

A rate limit shows a specific message like 'Rate limit exceeded' or 'You have reached your Pro search limit' inside the Perplexity interface, or returns HTTP 429 in the API. A server outage typically shows a generic error, a blank page, or affects all search types including Standard. Check perplexity.ai/status to confirm whether there is a platform-wide incident. If Standard search still works normally, you have hit a rate limit, not an outage.

Does switching to Standard search bypass the Pro rate limit?

Yes. Standard search on Perplexity uses the base Sonar model and is unlimited for all users, including free tier. When you exhaust your 200 weekly Pro searches, you can immediately switch the search mode selector to 'Default' or 'Standard' and continue searching without interruption. You will lose access to advanced models like GPT-4o and Claude 3.5 Sonnet until your Pro quota resets on Monday at 00:00 UTC, but core search functionality remains fully available.

Can I increase my Perplexity API rate limit above 50 RPM?

Yes, but it requires contacting Perplexity or requesting a tier upgrade through your API dashboard. The 50 RPM default applies to new Sonar API accounts. Higher tiers exist for production applications with established usage histories. As a short-term workaround, implement exponential backoff in your code: retry failed requests after 1 second, then 2 seconds, then 4 seconds, rather than hammering the endpoint repeatedly, which can result in longer throttle windows.

Does Perplexity's rate limit reset at midnight my local time?

No. Perplexity's weekly Pro search quota resets every Monday at 00:00 UTC, not at your local midnight. If you are in New York (UTC-4 during EDT), that reset happens Sunday evening at 8:00 PM your time. If you are in London (UTC+1 during BST), it resets Monday at 1:00 AM your time. The monthly Deep Research limit resets on the 1st of each calendar month at 00:00 UTC. Always convert to UTC when planning around your quota.

How do I check how many Pro searches I have left?

Go to perplexity.ai/settings/account and look at the Usage section. It shows your remaining Pro searches for the current week, your remaining Deep Research sessions for the current month, and the next reset date for each counter. There is no real-time indicator inside the search interface itself, so you need to visit settings proactively before large research sessions. If you are close to your limit, consider switching to Standard mode for lower-priority queries to preserve your remaining Pro searches.

Perplexity AI Rate Limit: What It Means and How to Fix It

Step-by-Step Fix

1. Identify Which Rate Limit You Have Hit

Perplexity has two separate rate-limiting systems that behave very differently:

Web interface (Pro plan users):

200 Pro searches per week — resets every Monday at 00:00 UTC
20 Deep Research sessions per month — resets on the 1st at 00:00 UTC
Free plan: approximately 5 Pro searches per day; Standard searches are unlimited

API users (Sonar):

Default cap of 50 requests per minute (RPM) for new accounts
Exceeding this returns HTTP status 429 with a rate_limit_exceeded error code
API limits are entirely separate from web interface quotas

Knowing which limit you hit determines your next step.

2. Check Your Current Usage and Reset Time

For web users:

Go to perplexity.ai/settings/account
Look for the Usage section showing your remaining Pro searches and Deep Research sessions
Note the reset date shown next to each counter
If the reset is close (within a few hours), waiting is the fastest solution

For API users:

Check the response headers on your 429 error — look for Retry-After or X-RateLimit-Reset
The value will tell you exactly how many seconds until your rate limit window clears
Log into your API dashboard to see your current tier and remaining quota

3. Switch to Standard Search (Web Users — Immediate Fix)

If you need to keep searching right now:

Open any Perplexity search page
Click the model/mode selector (usually labeled with the current model name like "Pro" or "GPT-4o")
Select Default or Standard mode
Continue searching — Standard searches are unlimited and do not count against your weekly Pro quota

Standard mode uses the Sonar model, which is fast and accurate for most everyday queries. You only lose access to advanced reasoning and deep analysis that Pro models provide.

4. Wait for the Reset Window (Web Users)

If you need Pro-quality results and cannot wait long:

Pro search quota: resets every Monday at 00:00 UTC
Deep Research quota: resets on the 1st of each month at 00:00 UTC
Free plan daily limit: resets at 00:00 UTC each day

Calculate how long until your specific reset:

Current UTC time: check time.is/UTC
Days until Monday: plan accordingly

For US East Coast users (UTC-4 in summer), Monday 00:00 UTC = Sunday 8:00 PM EDT. Your weekly Pro searches restore on Sunday evening, not Monday morning.

5. Implement Exponential Backoff (API Users)

If you are hitting the 50 RPM API limit, do not retry immediately. Use exponential backoff:

import time
import requests

def query_perplexity(payload, max_retries=5):
    for attempt in range(max_retries):
        response = requests.post(
            "https://api.perplexity.ai/chat/completions",
            headers={"Authorization": f"Bearer {API_KEY}"},
            json=payload
        )
        if response.status_code == 200:
            return response.json()
        elif response.status_code == 429:
            wait_time = (2 ** attempt)  # 1s, 2s, 4s, 8s, 16s
            print(f"Rate limited. Waiting {wait_time}s before retry {attempt + 1}")
            time.sleep(wait_time)
        else:
            response.raise_for_status()
    raise Exception("Max retries exceeded")

This approach respects the rate limit window without triggering extended throttling from aggressive retry behavior.

6. Throttle Your API Request Rate

If you are running parallel API calls:

Limit concurrency to fewer than 10 simultaneous requests
Add a minimum 1.2-second delay between sequential requests (60 seconds ÷ 50 RPM = 1.2 seconds per request)
Use a request queue rather than firing all requests simultaneously
Monitor the X-RateLimit-Remaining response header to track how close you are to the limit before hitting it

7. Request a Rate Limit Increase (API Users)

If your production application consistently needs more than 50 RPM:

Log into your Perplexity API dashboard
Navigate to the usage/tier section
Submit a tier upgrade request with your expected usage volume
Alternatively, email Perplexity support with your use case details

Tier increases are granted based on demonstrated usage patterns and account standing. Include details like average daily request volume and your application's purpose to speed up the review.

8. Verify the Issue Is Not a Platform Outage

Before spending time troubleshooting rate limits, rule out a service disruption:

Visit perplexity.ai/status
Check for any active incidents or degraded performance notices
Try a Standard search — if Standard also fails, the issue is platform-wide, not your quota

Rate limit problems and outages produce different symptoms. A rate limit blocks Pro searches while Standard continues to work. An outage blocks everything.

Why This Happens

Perplexity applies rate limits because Pro searches use computationally expensive large language models — GPT-4o, Claude 3.5 Sonnet, and Sonar Large — which cost significantly more per query than Standard search. The weekly 200-search cap (as of May 2026, reduced from 600) reflects the cost balance between the $20/month Pro subscription price and underlying model API costs.

The API's 50 RPM default exists to protect infrastructure stability for all users. A single client sending hundreds of requests per minute could degrade response quality for everyone else on the platform. Rate limiting is a standard practice across all major AI API providers for exactly this reason.

Common Mistakes to Avoid

Retrying immediately after a 429 error. Rapid retries can extend the throttle window on the API side. Always wait at least 1–2 seconds before the first retry, then increase from there using exponential backoff.
Assuming the limit resets at midnight local time. Perplexity resets quotas at 00:00 UTC on fixed calendar intervals. Midnight in your timezone is almost certainly a different moment — sometimes an entire day off for users in UTC+12 or UTC-12.
Using Pro mode for routine queries. Save your 200 weekly Pro searches for tasks that genuinely need advanced reasoning. Simple factual lookups, quick definitions, and basic summaries work perfectly well in Standard mode.
Ignoring the Retry-After header. The API returns this header on 429 responses. Reading it gives you the exact wait time instead of guessing — always parse it in your application code.
Confusing a rate limit with a platform outage. Check perplexity.ai/status before troubleshooting. If Standard search still works, it is a quota issue, not a service disruption.
Not checking usage before a major research session. If you have only 5 Pro searches left and need to run 20 queries for a project, you will hit the wall mid-session. Check perplexity.ai/settings/account first.

View all Perplexity guides

Perplexity AI Rate Limit: What It Means and How to Fix It

Step-by-Step Fix

1. Identify Which Rate Limit You Have Hit

2. Check Your Current Usage and Reset Time

3. Switch to Standard Search (Web Users — Immediate Fix)

4. Wait for the Reset Window (Web Users)

5. Implement Exponential Backoff (API Users)

6. Throttle Your API Request Rate

7. Request a Rate Limit Increase (API Users)

8. Verify the Issue Is Not a Platform Outage

Why This Happens

Common Mistakes to Avoid

More Perplexity usage limits & restrictions guides

Frequently Asked Questions

Related Guides

Perplexity Pro Usage Limits Explained: What's Included and What to Do When You Hit a Cap

Perplexity file upload limits — supported formats, size limits, and weekly caps

Perplexity Labs Rate Limit: What It Is and How to Work Around It

Perplexity Limit Exceeded: 3 Causes and How to Fix Each

How to avoid Perplexity temporary restrictions and suspicious activity flags

Perplexity Message Limit Reached: What Changed and How to Fix It