What is the default rate limit for Plaid's Sandbox environment?

Plaid does not publish exact numerical limits as they fluctuate, but the Sandbox environment has aggressively lower rate limits compared to Development or Production. It is designed purely for functional testing, not load testing. Hitting limits in Sandbox is normal if making more than a dozen requests per minute.

How long does a Plaid rate limit last?

Plaid rate limits typically operate on a rolling window. Depending on the specific endpoint and the severity of the limit breached, the restriction might lift in as little as 60 seconds, or it could last for an hour. It is best practice to back off for at least 1-2 minutes before retrying.

Does the Plaid API return a Retry-After HTTP header?

No, unlike some other modern APIs, Plaid does not consistently utilize the standard `Retry-After` HTTP header to dictate backoff times. Developers are entirely responsible for implementing their own exponential backoff logic based on the `RATE_LIMIT_EXCEEDED` error payload.

Why am I getting rate limited on `/link/token/create`?

This is almost always a frontend architectural flaw. If a React component re-renders endlessly, or if you are pre-fetching link tokens for all users on login rather than waiting for them to explicitly click a 'Connect Bank' button, you will exhaust your endpoint limits rapidly.

Can I request a rate limit increase from Plaid?

Yes, but only for Production environments. If you have already implemented webhooks, caching, and backoff logic, and your application's legitimate scale is still hitting limits, you can submit a rate limit increase request via a support ticket in the Plaid Dashboard. Be prepared to provide architecture diagrams.

How to Fix Plaid Rate Limit Errors: Troubleshooting RATE_LIMIT_EXCEEDED

Resolve Plaid API RATE_LIMIT_EXCEEDED errors by implementing exponential backoff, utilizing webhooks over polling, and optimizing request caching strategies.

Last updated: February 23, 2026

Last verified: February 23, 2026

1,566 words

Key Takeaways

Plaid enforces strict, rolling-window rate limits per endpoint, per Item, and globally across your developer account to ensure system stability.
The most common root cause is aggressive polling of endpoints like `/transactions/get` or `/accounts/balance/get` instead of relying on Plaid's Webhook infrastructure.
Another frequent trigger is infinite loops or frontend bugs causing repeated calls to `/link/token/create`.
Quick Fix: Implement a jittered exponential backoff strategy for API requests returning the `RATE_LIMIT_ERROR` error type.
Long-term Fix: Migrate from a polling architecture to an event-driven webhook architecture for transaction and balance updates.

Fix Approaches Compared
Method	When to Use	Implementation Time	Risk Level
Exponential Backoff	When experiencing intermittent or burst-related `RATE_LIMIT_EXCEEDED` errors on valid requests.	1-2 Hours	Low
Webhook Migration	Replacing aggressive polling for transactions, auth, or balance updates.	1-3 Days	Medium
Data Caching (Redis/Memcached)	Frequent read-only calls for static or semi-static account/institution data.	1-2 Days	Low
Limit Increase Request	Consistent hitting of production limits despite full optimization of caching and webhooks.	1-2 Weeks	Low

Understanding the Plaid Rate Limit Error

When building financial applications integrated with the Plaid API, encountering a RATE_LIMIT_EXCEEDED error is a rite of passage for many developers. Plaid processes massive amounts of highly sensitive financial data, and to protect both their infrastructure and the downstream financial institutions, they impose strict rate limits.

Unlike traditional APIs that might simply throttle you globally based on IP, Plaid's rate limiting is multi-dimensional. Limits are applied at the global environment level (Sandbox, Development, Production), at the specific endpoint level (e.g., /transactions/get), and critically, at the Item level (a single set of credentials at a single institution).

When you hit a rate limit, the Plaid API will respond with an HTTP 429 Too Many Requests status code. The JSON response payload will look exactly like this:

{
  "display_message": "The rate limit was exceeded. Please try again later.",
  "error_code": "RATE_LIMIT_EXCEEDED",
  "error_message": "Too many requests to /transactions/get",
  "error_type": "RATE_LIMIT_ERROR",
  "request_id": "m8Ykw7XbcT3A1aC"
}

Root Causes of RATE_LIMIT_EXCEEDED

Before diving into the fixes, it is crucial to identify why your application is triggering these limits. The most common culprits include:

Aggressive Polling: Continuously calling /transactions/get or /accounts/balance/get every few minutes to check for new data. Financial institutions do not update data this frequently, making polling both inefficient and guaranteed to trigger rate limits.
Runaway Link Token Generation: A bug in your frontend React/Vue code that repeatedly calls your backend to generate a new Link token via /link/token/create without user interaction.
Sandbox Constraints: Plaid's Sandbox environment has significantly lower rate limits than Production. Load testing in Sandbox will almost always result in RATE_LIMIT_ERROR.
Historical Data Extraction: Attempting to pull years of transaction history in parallel across hundreds of Items simultaneously without staggering the requests.

Step 1: Diagnose the Request Pattern

Your first step is to isolate the offending endpoint and trace the request volume. If you are using Datadog, New Relic, or even basic structured logging, query your logs for error_type: RATE_LIMIT_ERROR.

Pay close attention to the request_id and the error_message. The error_message will explicitly state which endpoint was throttled (e.g., Too many requests to /link/token/create).

Once you have the endpoint, check your application's request volume to that specific path over the last 15 minutes. If you see sharp spikes, you have a burst issue. If you see a high, sustained baseline, you have an architectural polling issue.

Step 2: Implement Exponential Backoff (The Immediate Fix)

Because network volatility and minor traffic spikes are inevitable, your Plaid client must gracefully handle 429 responses. Plaid does not guarantee a standard Retry-After HTTP header, so you must implement a client-side exponential backoff strategy.

Exponential backoff works by progressively increasing the wait time between retry attempts. If the first retry happens after 1 second, the next might happen after 2 seconds, then 4 seconds, up to a maximum threshold. Adding "jitter" (a random degree of variance to the backoff time) prevents the "thundering herd" problem where multiple throttled workers all retry at the exact same millisecond.

For example, if you are calling /accounts/balance/get and receive a RATE_LIMIT_EXCEEDED, your code should catch this specific error_code, sleep for 1s + jitter, and retry. If it fails again, sleep for 2s + jitter, and so on.

Step 3: Shift from Polling to Webhooks (The Architectural Fix)

The most robust solution to Plaid rate limits is completely abandoning the polling model. Plaid offers a comprehensive Webhook system that pushes notifications to your servers exactly when new data is available.

If you are currently polling /transactions/get to see if a user has new purchases, stop. Instead:

Register a webhook URL in your Plaid Dashboard or during the Link token creation phase.
Listen for the TRANSACTIONS webhook with the DEFAULT_UPDATE or INITIAL_UPDATE code.
When your server receives this webhook, it indicates that Plaid has already fetched new data from the bank.
Now you make a single API call to /transactions/sync or /transactions/get to retrieve the data safely, knowing it is available and you will not hit a rate limit.

This event-driven architecture reduces your API request volume by up to 99% for active users, completely eliminating polling-induced rate limits.

Step 4: Implement Intelligent Caching

For endpoints that return semi-static data, such as /institutions/get or /institutions/get_by_id, repeated calls are wasteful. Institution data (like bank names, logos, and routing numbers) changes rarely.

Implement a Redis or Memcached layer in your infrastructure. When a user needs an institution's details, check the cache first. If it's a cache miss, call the Plaid API, store the result in Redis with a Time-To-Live (TTL) of 24 to 48 hours, and return the data. This heavily shields your Plaid API quota from unnecessary hits.

Step 5: Handling Item-Specific Limits

Sometimes, a rate limit isn't about your overall application volume, but a specific user's bank account (an "Item"). Banks often throttle Plaid, and Plaid passes that throttle down to you. If a specific access_token is repeatedly getting RATE_LIMIT_EXCEEDED on /accounts/balance/get, you must stop querying that specific Item for at least an hour. Continuing to hammer a throttled Item can result in the financial institution temporarily locking the user's account for security reasons.

Implement a database flag (e.g., is_throttled_until) for individual Items in your system. If an Item hits a limit, set this timestamp for 1 hour in the future, and have your application bypass Plaid API calls for that specific user until the timestamp expires.

Frequently Asked Questions

python

import time
import random
import plaid
from plaid.api import plaid_api
from plaid.model.transactions_get_request import TransactionsGetRequest

# Advanced Exponential Backoff Wrapper for Plaid API Calls
def call_plaid_with_backoff(api_func, max_retries=5, base_delay=1):
    retries = 0
    while retries < max_retries:
        try:
            # Attempt the Plaid API call
            return api_func()
            
        except plaid.ApiException as e:
            # Parse the JSON error response from Plaid
            error_response = e.body
            
            if 'RATE_LIMIT_EXCEEDED' in str(error_response):
                retries += 1
                if retries == max_retries:
                    print(f"Max retries reached. Failing after {max_retries} attempts.")
                    raise e
                    
                # Calculate exponential backoff with jitter
                # e.g., 1s, 2s, 4s, 8s plus random milliseconds
                sleep_time = (base_delay * (2 ** (retries - 1))) + (random.randint(0, 1000) / 1000.0)
                print(f"Rate limit exceeded. Backing off for {sleep_time:.2f} seconds (Attempt {retries}/{max_retries}).")
                time.sleep(sleep_time)
            else:
                # If it's a different ApiException (e.g., ITEM_LOGIN_REQUIRED), raise immediately
                raise e

# Example Usage
# Assuming 'client' is an authenticated plaid_api.PlaidApi instance
# and 'access_token' is a valid token for an Item.

request = TransactionsGetRequest(
    access_token="access-production-12345",
    start_date=datetime.strptime("2023-01-01", "%Y-%m-%d").date(),
    end_date=datetime.strptime("2023-01-31", "%Y-%m-%d").date()
)

# Wrap the specific API call in our backoff function
try:
    response = call_plaid_with_backoff(
        lambda: client.transactions_get(request)
    )
    print("Successfully retrieved transactions!")
except Exception as final_err:
    print("Failed to retrieve data after backoff.")

Error Medic Editorial

Error Medic Editorial is composed of senior DevOps, SRE, and platform engineers dedicated to documenting the hardest, most obscure infrastructure and API errors so you don't have to debug them from scratch.

Sources

Fix Plaid rate limit errors fast. Learn why RATE_LIMIT_EXCEEDED happens, how to implement exponential backoff, and optimize API calls to stay within Plaid limit

Plaid Rate Limit Error: How to Fix RATE_LIMIT_EXCEEDED and 429 Responses

Fix Plaid rate limit errors (HTTP 429, RATE_LIMIT_EXCEEDED) with exponential backoff, request batching, and token caching strategies. Step-by-step guide.

Plaid Rate Limit Error: RATE_LIMIT_EXCEEDED Troubleshooting Guide

Fix Plaid RATE_LIMIT_EXCEEDED (HTTP 429) errors with exponential backoff, request queuing, and caching strategies. Step-by-step guide for devs.

Plaid Rate Limit Exceeded: How to Fix RATE_LIMIT_EXCEEDED Errors (HTTP 429)

Hitting Plaid's RATE_LIMIT_EXCEEDED error? Learn how to diagnose and fix rate limiting with exponential backoff, request queuing, and caching in under 30 minute