Resolving Twilio Error 20429 & 503: The Complete Rate Limit Troubleshooting Guide
Fix Twilio rate limit (Error 20429) and 503 errors. Learn to implement exponential backoff, Redis message queues, and configure A2P 10DLC to scale APIs.
- Twilio Error 20429 (Too Many Requests) occurs when you exceed your Account's concurrent connection limit or carrier-specific Messages Per Second (MPS) limits.
- HTTP 503 (Service Unavailable) errors typically indicate downstream carrier congestion or that your own webhook endpoints are timing out under heavy Twilio payload load.
- Quick Fix: Wrap all Twilio API calls in an exponential backoff retry mechanism to gracefully handle intermittent 429 spikes.
- Long-term Fix: Implement an asynchronous message queue (e.g., Redis + Celery) to enforce strict client-side rate limiting matching your A2P 10DLC Trust Score limits.
- Use Twilio Messaging Services to automatically handle number pooling and internal queueing for up to 4 hours.
| Method | When to Use | Implementation Time | Risk / Drawbacks |
|---|---|---|---|
| Exponential Backoff | Intermittent spikes, minor bursting over limits. | Minutes (Library integration) | High latency during retries; memory exhaustion if bursts are sustained. |
| Asynchronous Queue (Redis/SQS) | Sustained high-volume traffic exceeding MPS limits. | Days (Infrastructure change) | Requires maintaining additional infrastructure and worker processes. |
| Twilio Messaging Services | Managing multiple sender numbers, US A2P routing. | Hours (Console config & code update) | Messages queued internally by Twilio may arrive late; limited to 4-hour queue window. |
| Upgrade Short Code / Toll-Free | Consistent high throughput needed (100+ MPS). | Weeks (Carrier approval process) | High financial cost; rigorous carrier vetting required. |
Understanding Twilio Rate Limits and Errors
When scaling communication applications, developers inevitably encounter performance boundaries. In the Twilio ecosystem, pushing too much traffic too quickly results in being 'Twilio rate limited'. This scenario primarily manifests as Error 20429 (Too Many Requests) or HTTP 429 status codes. In more complex or overloaded architectures, you might also encounter HTTP 503 (Service Unavailable) errors.
Being rate limited is a protective mechanism. It prevents a single user from degrading the multi-tenant Twilio platform and ensures compliance with strict telecom carrier regulations, such as A2P 10DLC (Application-to-Person 10-Digit Long Code) in the United States.
The Anatomy of Twilio Throttling
Twilio enforces rate limits across several different vectors:
- API Concurrency Limits: Twilio allows a maximum of 100 concurrent requests per account across all REST API endpoints. Exceeding this triggers an immediate 429 Too Many Requests.
- Messages Per Second (MPS) Limits: This is the physical speed at which Twilio can hand off messages to carrier networks.
- Standard Long Codes: Historically 1 MPS. Under A2P 10DLC, this is dictated by your campaign's Trust Score (ranging from 0.75 MPS to 75+ MPS).
- Toll-Free Numbers: Default to 3 MPS, upgradeable upon request.
- Short Codes: Default to 100 MPS; heavily vetted and optimized for bulk messaging.
- Queueing Limits: If you send messages faster than your MPS limit, Twilio will queue them internally. However, if the queue exceeds 4 hours' worth of traffic, Twilio will reject new requests with an error.
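To get a feel for the 4-hour queueing window, you can estimate how many messages Twilio will accept into the queue before rejecting new requests. This is a rough back-of-the-envelope sketch (the helper function is illustrative, not a Twilio API):

```python
def max_queued_messages(mps: float, window_hours: float = 4.0) -> int:
    """Approximate queue capacity: messages drained per second times the window."""
    return int(mps * window_hours * 3600)

# A long code throttled to 1 MPS drains 14,400 messages in 4 hours;
# a short code at 100 MPS drains 1,440,000 in the same window.
print(max_queued_messages(1))    # 14400
print(max_queued_messages(100))  # 1440000
```

If your burst exceeds this capacity, expect 429s on new requests until the queue drains.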
Decoding the Error Codes
- Error 20429 (429 Too Many Requests): You have exceeded either the API concurrency limit or an absolute rate limit for a specific API resource.
- Error 30022: A specialized rate limit error for US A2P 10DLC. It indicates you've hit the message throughput limit assigned specifically to your registered campaign.
- Error 503 (Service Unavailable): While sometimes indicating a Twilio platform degradation, a Twilio 503 often occurs on inbound webhooks. If your server takes too long to respond to Twilio's webhook (exceeding the 15-second timeout), or if your server queue is full, Twilio drops the request and logs a 503. It can also occur outbound if Twilio's downstream SMSC (Short Message Service Center) links are saturated.
Step 1: Diagnose the Bottleneck
Before writing code, you must identify exactly where the rate limit is occurring.
1. Analyze the Twilio Error Logs: Navigate to the Twilio Console > Monitor > Errors. Look for spikes in Error 20429 or 30022. The error details will specify whether the rejection was due to API concurrency or carrier MPS limits.
2. Monitor Webhook Latency (For 503s): If you are experiencing 503s, check your own server's APM (Application Performance Monitoring) tools like Datadog, New Relic, or AWS CloudWatch. Look at the p95 and p99 response times for the endpoints Twilio is hitting. If they are creeping above 10 seconds, your server is the bottleneck, causing Twilio to record a 503.
3. Check A2P 10DLC Status: If sending to US numbers, verify your Trust Score in the Twilio Trust Hub. A low score drastically throttles your MPS, causing immediate queuing and subsequent 20429 errors during moderate traffic bursts.
Step 2: Immediate Mitigation with Exponential Backoff
The most immediate, low-effort fix for intermittent 429 errors is wrapping your Twilio API calls in an exponential backoff algorithm.
When a 429 is received, the client should pause for a short duration (e.g., 500ms) before retrying. If the retry fails, the pause duration doubles (1s, 2s, 4s), introducing 'jitter' (randomness) to prevent the thundering herd problem where hundreds of retries hit the API at the exact same millisecond.
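The delay schedule described above can be sketched in a few lines. The base and cap values here are illustrative; tune them to your traffic profile:

```python
import random

def backoff_delay(attempt: int, base: float = 0.5, cap: float = 30.0) -> float:
    """Exponential backoff: 0.5s, 1s, 2s, 4s... capped at `cap`, with full jitter."""
    exp = min(cap, base * (2 ** attempt))
    # Full jitter: pick uniformly in [0, exp] so hundreds of clients
    # retrying at once don't hit the API at the same millisecond.
    return random.uniform(0, exp)

for attempt in range(5):
    ceiling = min(30.0, 0.5 * 2 ** attempt)
    print(f"attempt {attempt}: sleep up to {ceiling}s")
```

Libraries like tenacity (Python) or retry middleware in most HTTP clients implement this pattern for you, as shown in the complete example at the end of this guide.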
While effective for intermittent spikes, backoff is not a solution for sustained traffic that consistently exceeds your MPS. Sustained looping will quickly consume your application's worker threads or memory.
Step 3: Implement a Message Queue (Architecture for Scale)
For sustained high-volume messaging, you must decouple message generation from message dispatch. This is the definitive fix for 'twilio rate limited' scenarios.
Instead of calling the Twilio API synchronously when an event occurs in your application, you push a payload to a message broker (like Redis, RabbitMQ, or AWS SQS). A dedicated worker pool then consumes these messages at a strictly controlled rate that exactly matches your Twilio MPS limit.
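A minimal producer-side sketch of this decoupling follows. An in-process thread-safe queue stands in for the broker here so the example is self-contained; in production you would swap it for a Redis `LPUSH`, a RabbitMQ publish, or an SQS `SendMessage` (the queue name and payload shape are illustrative):

```python
import json
import queue

# Stand-in for the message broker (Redis, RabbitMQ, SQS, ...).
sms_outbox: "queue.Queue[str]" = queue.Queue()

def enqueue_sms(to: str, body: str) -> None:
    """Serialize the message and hand it to the broker instead of
    calling the Twilio API synchronously in the request path."""
    payload = json.dumps({"to": to, "body": body})
    sms_outbox.put(payload)

enqueue_sms("+15551234567", "Your order has shipped")
print(sms_outbox.qsize())  # 1
```

The application thread returns immediately; only the worker pool ever talks to Twilio, so only the workers need to know about rate limits.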
The Token Bucket Algorithm: To enforce client-side rate limiting, implement a Token Bucket algorithm in your workers.
- Imagine a bucket that holds a maximum of N tokens (your burst capacity).
- Tokens are added to the bucket at a rate of R per second (your MPS limit).
- Before a worker makes a Twilio API call, it must remove a token from the bucket.
- If the bucket is empty, the worker sleeps until a new token is available.
This completely shields Twilio from your internal traffic bursts, virtually eliminating 20429 errors.
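A minimal single-process token bucket sketch follows; the rate values are illustrative, and a production deployment with multiple workers would need shared state (e.g., a Redis-backed bucket) rather than in-process counters:

```python
import time

class TokenBucket:
    """Refill `rate` tokens per second up to `capacity`; callers take one per call."""
    def __init__(self, rate: float, capacity: float):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity
        self.last = time.monotonic()

    def acquire(self) -> None:
        while True:
            now = time.monotonic()
            # Refill based on elapsed time, never exceeding capacity.
            self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
            self.last = now
            if self.tokens >= 1:
                self.tokens -= 1
                return
            # Sleep just long enough for one token to accumulate.
            time.sleep((1 - self.tokens) / self.rate)

# Throttle workers to 10 MPS with a burst allowance of 10.
bucket = TokenBucket(rate=10, capacity=10)
start = time.monotonic()
for _ in range(20):
    bucket.acquire()  # blocks once the initial burst is spent
elapsed = time.monotonic() - start
print(f"20 sends took {elapsed:.2f}s")  # ~1s: 10 burst, then 10 at 10/s
```

Set `rate` to your A2P 10DLC or Messaging Service MPS limit and `capacity` to the burst size you are comfortable with; the workers then self-throttle regardless of how fast messages are enqueued.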
Step 4: Leverage Twilio Messaging Services
If you prefer not to build complex queuing infrastructure, you can offload this to Twilio using Messaging Services.
A Messaging Service acts as a smart, internal queue and routing engine. When you send a message via a Messaging Service SID (instead of a specific 'From' phone number):
- Number Pooling: Twilio automatically distributes the outbound traffic across multiple phone numbers in your pool, effectively multiplying your standard long-code MPS limits.
- Smart Queueing: If you burst over your aggregate limit, Twilio queues the messages internally. They will hold the messages for up to 4 hours, dripping them to the carrier networks at the maximum allowed compliance rate.
- Geomatch: Twilio automatically selects a local number to improve delivery rates.
Note: Even with Messaging Services, if you enqueue more than 4 hours' worth of messages, the API will start returning 429s. Monitor your queue length via the Twilio Insights dashboard.
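Switching to a Messaging Service changes only one parameter on the send call: you pass `messaging_service_sid` instead of `from_`. The helper below just builds the keyword arguments for `client.messages.create` so the shape is clear; the SID shown is a placeholder:

```python
def build_message_params(to: str, body: str, messaging_service_sid: str) -> dict:
    """Keyword arguments for client.messages.create when using a
    Messaging Service: messaging_service_sid replaces from_."""
    return {
        "to": to,
        "body": body,
        # Messaging Service SIDs start with "MG" (placeholder value below).
        "messaging_service_sid": messaging_service_sid,
    }

params = build_message_params(
    "+15551234567", "Hello", "MGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX"
)
# client.messages.create(**params)  # uncomment with real credentials
print(sorted(params))  # ['body', 'messaging_service_sid', 'to']
```

Twilio then picks the sender from the number pool, applies Geomatch, and queues bursts internally up to the 4-hour window.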
Step 5: Fixing Webhook 503 Service Unavailable Errors
If your search for "twilio 503" led you here, the problem is almost certainly your server's inability to process incoming webhooks (like incoming SMS, call status updates, or recording ready callbacks) quickly enough.
Twilio expects your server to respond with an HTTP 2xx status code within 15 seconds.
The Solution: Asynchronous Processing
Never perform long-running synchronous tasks (like database writes, external API calls, or heavy computation) directly within the webhook HTTP request cycle.
- Receive the Request: Your endpoint receives the webhook payload from Twilio.
- Acknowledge Immediately: Enqueue the payload into your local database or Redis queue, and immediately return an HTTP 200 OK or HTTP 204 No Content to Twilio. This should take less than 100 milliseconds.
- Process Asynchronously: A background worker picks up the payload from your queue and performs the heavy lifting (e.g., generating an AI response, saving audio files, updating CRM records).
By decoupling the HTTP response from the business logic, you will permanently eliminate Twilio webhook 503 errors.
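The three steps above can be sketched in a framework-agnostic way. An in-process queue stands in for Redis or a database table, and the handler returns the (status, body) pair your web framework (Flask, FastAPI, Express, etc.) would send; function names are illustrative:

```python
import queue

# Stand-in broker; in production this would be Redis or a jobs table.
webhook_jobs: "queue.Queue[dict]" = queue.Queue()

def handle_twilio_webhook(form: dict) -> tuple:
    """Fast path: persist the payload and acknowledge within milliseconds.
    Returns the (http_status, body) pair the web framework would send."""
    webhook_jobs.put(form)  # enqueue only; no DB writes or API calls here
    return 204, ""          # Twilio just needs a 2xx, quickly

def worker_loop_once() -> dict:
    """Background worker: pull one job and do the heavy lifting
    (CRM updates, AI responses, media downloads, ...)."""
    job = webhook_jobs.get()
    return job

status, _ = handle_twilio_webhook({"MessageSid": "SMxxxx", "Body": "Hi"})
print(status)                      # 204
print(worker_loop_once()["Body"])  # Hi
```

The handler's only job is to get the payload somewhere durable and respond; everything slow moves behind the queue.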
Complete Code Example: Exponential Backoff with Tenacity
```python
import logging
import os

from twilio.rest import Client
from twilio.base.exceptions import TwilioRestException
from tenacity import retry, wait_exponential, stop_after_attempt, retry_if_exception

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

# Twilio Client Setup (credentials read from the environment)
client = Client(os.environ["TWILIO_ACCOUNT_SID"], os.environ["TWILIO_AUTH_TOKEN"])

def is_rate_limit_error(exception):
    """Check if the exception is a Twilio 429 / 20429 error."""
    return isinstance(exception, TwilioRestException) and exception.status == 429

# Apply exponential backoff using Tenacity.
# Waits 2^x * 1 seconds between retries (2s minimum, 10s cap), max 5 attempts.
# retry_if_exception(is_rate_limit_error) ensures only 429s are retried;
# other Twilio errors (e.g., invalid number) fail immediately.
@retry(
    wait=wait_exponential(multiplier=1, min=2, max=10),
    stop=stop_after_attempt(5),
    retry=retry_if_exception(is_rate_limit_error),
    before_sleep=lambda retry_state: logger.warning(
        f"Rate limited. Retrying in {retry_state.next_action.sleep} seconds..."
    ),
)
def send_sms_with_backoff(to_number, from_number, body_text):
    try:
        message = client.messages.create(
            body=body_text,
            from_=from_number,
            to=to_number,
        )
        logger.info(f"Message sent successfully: {message.sid}")
        return message
    except TwilioRestException as e:
        if e.status == 429:
            # Re-raise to trigger the tenacity retry mechanism
            raise
        # Log and re-raise other Twilio errors without retrying
        logger.error(f"Failed to send message: {e.msg}")
        raise

# Example usage
# send_sms_with_backoff("+1234567890", "+0987654321", "Important alert payload")
```

Error Medic Editorial
Written by Senior Site Reliability Engineers specializing in high-throughput API integrations, distributed systems, and telecom infrastructure.