Question 1

Do you have official SDKs?

Accepted Answer

Yes! Official SDKs for Python (pip install routeplex) and Node.js (npm install @routeplex/node). Both include typed responses, error classes, and full API coverage. Also fully OpenAI SDK compatible — just change the base URL and API key.

Question 2

How does auto-routing work?

Accepted Answer

When you use the routeplex-ai model, RoutePlex analyzes your prompt — detecting intent, complexity, and domain — and picks the best model automatically. Simple questions get fast, cost-effective models. Complex reasoning tasks get powerful ones. Override with a strategy (cost, speed, quality, balanced) or pick a specific model for full control.

Question 3

Do you support streaming?

Accepted Answer

Yes. Pass stream=true and responses are delivered as Server-Sent Events in real time. Choose buffered mode for smooth, sentence-aware chunks, or realtime mode for minimal-latency character-level delivery. Streaming works with both the native API and the OpenAI SDK.

Question 4

What happens if a provider goes down?

Accepted Answer

RoutePlex automatically retries with the next-best model from a different provider — including mid-request for streaming. In auto-routing, unhealthy providers are detected and avoided. Near-zero downtime across OpenAI, Anthropic, and Google.

Question 5

How is pricing calculated?

Accepted Answer

You pay per token at each model's standard rate. Every response includes a cost breakdown so there are no surprises. Free endpoints like cost estimation and model listing have no charge. Set daily spending caps and budget alerts in the dashboard.

Question 6

Do you store my prompts or responses?

Accepted Answer

No. RoutePlex is a stateless gateway — prompts and model responses are processed in memory and immediately discarded. Only operational metadata (timestamps, token counts, costs) is retained for billing and analytics.

Route every AI model through a single API

One line of code. Any model.

Three ways to route your requests

Auto-Routing

Manual Mode

Automatic Fallback

From zero to production in four steps

Create your API key

Send your first request

Choose your routing mode

Monitor & optimize

AI gateway features built for production

Smart Routing & Failover

Web-Augmented AI

Built-in Safety

Cost Governance

Official SDKs & OpenAI Compatible

Real-Time Analytics

Try the AI playground in your browser

Reliable AI infrastructure you can depend on

99.9%+ Effective Uptime

Automatic Failover

Stateless by Design

OpenAI SDK Compatible

Encrypted End-to-End

Granular Rate Limits

Transparent, usage-based pricing

Free Trial

Pay As You Go

Enterprise

Frequently asked questions

Ready to simplify your AI stack?