Models & Routing
How Tangle Router routes your requests across operators.
Model Selection
When you specify a model in your API request (e.g., gpt-4o-mini), the gateway finds all operators serving that model and routes your request to the best available one.
Routing Strategy
By default, requests are routed based on a composite score of operator latency, uptime, and price. You can override this with the X-Tangle-Operator header to pin a specific operator.
Fallbacks
If the selected operator fails or times out, the gateway automatically retries with the next-best operator. This happens transparently — your application sees a single successful response.
Operator Selection Headers
# Pin to a specific operator X-Tangle-Operator: tangle-core # Prefer operators in a region X-Tangle-Region: eu-west # Set a max price (per million tokens) X-Tangle-Max-Price: 5.00