Route every AI model through a single API
Intelligent routing, streaming, automatic failover, and real‑time cost tracking — across OpenAI, Claude, Gemini, and 30+ models.
Integration
One line of code. Any model.
Official SDKs for Python and Node.js — or use the OpenAI SDK you already know. Access 30+ models across 3 providers with built‑in routing, cost tracking, and failover.
- Python & Node.js SDKs with zero dependencies
- Also OpenAI SDK compatible — change one line
- Per-request cost breakdown in every response
from routeplex import RoutePlex
client = RoutePlex(api_key="rp_live_YOUR_KEY")
# Auto-routing — picks the best model
response = client.chat("Explain quantum computing")
print(response.output)
# Strategy routing — override with a priority
fast = client.chat("Summarize this", strategy="speed")Routing Modes
Three ways to route your requests
From fully automatic AI routing to precise model targeting and automatic failover.
Auto-Routing
RecommendedAnalyzes your prompt and picks the best model. Simple questions get fast, cheap models; complex tasks get powerful ones. Override with a strategy (cost, speed, quality, balanced) when you need to.
mode="routeplex-ai"Manual Mode
PrecisePick a specific model when you know exactly what you need. Full control over provider and model selection, with automatic fallbacks if it fails.
model="gpt-4o"Automatic Fallback
ResilientIf a provider fails, RoutePlex silently retries with the next-best model — giving you near-zero downtime across providers.
Provider A → B → C ✓How it Works
From zero to production in four steps
Create your API key
Sign up, create an account, and generate your RoutePlex API key from the dashboard in seconds.
Send your first request
Install the Python or Node.js SDK and send a request. Or use the OpenAI SDK with a one-line base URL change.
Choose your routing mode
Let RoutePlex AI pick the best model, target a specific one, or set up a fallback chain for resilience.
Monitor & optimize
Track every request, token, and dollar in real time. Set cost caps, rate limits, and budget alerts.
Capabilities
AI gateway features built for production
Intelligent routing, streaming, real-time web context, built-in safety, and cost governance. All through one API.
Smart Routing & Failover
Auto-routing analyzes your prompt and picks the best model. Four strategies (cost, speed, quality, balanced) for when you want to override. Automatic multi-provider retry if one goes down.
Web-Augmented AI
Prompts are auto-analyzed for search intent and URLs. Real-time web results and page content are fetched and injected into context. Zero config.
Built-in Safety
Three-layer moderation pipeline: pattern detection, AI classification, and URL blocklist. Every request is screened before it reaches any model.
Cost Governance
Real-time cost tracking with micro-cent precision, daily spending caps, budget alerts, and a free estimation endpoint.
Official SDKs & OpenAI Compatible
Python and Node.js SDKs with typed responses, error classes, and all features built in. Also works as a drop-in OpenAI SDK replacement — just change the base URL.
Real-Time Analytics
Live dashboards for requests, tokens, latency, and error rates. Per-key usage breakdowns and exportable analytics.
Try the AI playground in your browser
Send real requests, compare models side-by-side, and preview costs. No sign-up required.
Trust & Reliability
Reliable AI infrastructure you can depend on
Every layer is designed for uptime, security, and visibility.
99.9%+ Effective Uptime
Multi-provider routing means if one provider fails, your request is silently retried. Your users never see an error.
Automatic Failover
Built-in retry logic with configurable fallback chains. Requests reroute across providers in milliseconds.
Stateless by Design
Prompts and responses are never stored, logged, or written to disk. Data flows through in-memory and is immediately discarded.
OpenAI SDK Compatible
Use the OpenAI SDK with any model. Just change the base URL. Same code, same auth pattern, 30+ models.
Encrypted End-to-End
All traffic TLS 1.3 encrypted. API keys hashed with bcrypt. Fully GDPR-aligned with clear data handling policies.
Granular Rate Limits
Account-level rate limiting, daily cost caps, and configurable token quotas protect your budget.
Pricing
Transparent, usage-based pricing
Start free with $5 in credits. Pay only for the tokens you use. No surprises.
Free Trial
Explore the platform
- $5 free credits
- All standard models
- RoutePlex AI routing
- 1,000 RPM
- Basic analytics
- Premium models
- Priority support
Pay As You Go
Scale with zero commitment
- Pay only for what you use
- All standard models
- Premium models available
- 10,000 RPM
- Advanced analytics
- Cost controls & alerts
- Email support
Enterprise
Large-scale deployments
- Volume discounts
- All models included
- Unlimited RPM
- Custom rate limits
- SLA guarantee
- Dedicated support
- Custom integrations
Estimate endpoint is always free. View full pricing & model costs →
Frequently asked questions
Quick answers to what developers ask most.
Ready to simplify your AI stack?
$5 in free credits. No card required. Go from zero to production in under five minutes.