Models/NVIDIA/nvidia/nemotron-3-ultra-550b-a55b
nvidia

nvidia/nemotron-3-ultra-550b-a55b

1M context$0.50/M input$2.20/M output
nvidia/nemotron-3-ultra-550b-a55b

Description

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Quick Start

curl https://router.tangle.tools/v1/chat/completions \
  -H "Authorization: Bearer $TANGLE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nvidia/nemotron-3-ultra-550b-a55b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Modalities

Input: TextOutput: Text

Pricing

Input$0.50/M
Output$2.20/M
Context1M tokens

Model Info

ProviderNVIDIA
IDnvidia/nemotron-3-ultra-550b-a55b