NVIDIA: Nemotron 3 Ultra (free)

1M context; input and output prices not listed
nvidia/nemotron-3-ultra-550b-a55b:free

Description

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
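As a rough illustration of what the "55B active out of 550B total" figure means, the sketch below computes the share of parameters that are active at once in the mixture-of-experts setup. The constant and function names are hypothetical, and the only inputs are the two counts quoted in the description.

```python
# Parameter counts quoted in the description, in billions.
TOTAL_PARAMS_B = 550
ACTIVE_PARAMS_B = 55


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of a model's parameters that are active at once.

    Hypothetical helper for illustration; both arguments are in billions.
    """
    if total_b <= 0:
        raise ValueError("total parameter count must be positive")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"{fraction:.0%} of parameters active")
```

The ratio says nothing about memory footprint: all 550B parameters still have to be stored, even though only a fraction is used at a time.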

Availability

Pricing for this model is operator-dependent. Open the chat to route a request; the response includes the resolved per-token cost.

Modalities

Input: Text
Output: Text

Pricing

Input: not listed (operator-dependent)
Output: not listed (operator-dependent)
Context: 1M tokens

Model Info

Provider: NVIDIA
ID: nvidia/nemotron-3-ultra-550b-a55b:free
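The model ID appears to follow a `provider/name:variant` pattern. That reading is an assumption drawn from this single ID, not a documented format. A minimal sketch that splits an ID along those lines, with `ModelId` and `parse_model_id` as hypothetical names:

```python
from typing import NamedTuple, Optional


class ModelId(NamedTuple):
    provider: str
    name: str
    variant: Optional[str]  # e.g. "free"; None when no ":variant" suffix


def parse_model_id(model_id: str) -> ModelId:
    """Split an ID assumed to look like 'provider/name[:variant]'."""
    provider, sep, rest = model_id.partition("/")
    if not sep or not provider or not rest:
        raise ValueError(f"expected 'provider/name[:variant]', got {model_id!r}")
    # Everything after the first ':' is treated as the variant tag.
    name, _, variant = rest.partition(":")
    return ModelId(provider, name, variant or None)


parsed = parse_model_id("nvidia/nemotron-3-ultra-550b-a55b:free")
print(parsed.provider, parsed.name, parsed.variant)
```

Keeping the variant separate makes it easy to tell the free listing apart from any other listing of the same model, if one exists.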