nvidia/llama-nemotron-embed-1b-v2

33K context · input and output pricing: operator-dependent
Availability

Pricing for this model depends on the operator. Open the chat to route a request; the response includes the resolved per-token cost.

Modalities

Input: Text
Output: Embeddings
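
Since the model takes text in and returns embeddings, a common next step is to compare two embedding vectors by cosine similarity. The sketch below is illustrative only: it does not call the model, the function name `cosine_similarity` is hypothetical, and the three-dimensional vectors are toy stand-ins (this page does not state the model's real output dimension).

```python
import math

def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)

# Toy vectors standing in for embeddings (assumed values, not model output).
query = [1.0, 0.0, 1.0]
doc_a = [2.0, 0.0, 2.0]   # same direction as the query
doc_b = [0.0, 1.0, 0.0]   # orthogonal to the query

scores = {
    "doc_a": cosine_similarity(query, doc_a),
    "doc_b": cosine_similarity(query, doc_b),
}
# Pick the document whose vector points most nearly in the query's direction.
best = max(scores, key=scores.get)
```

With real embeddings, the same ranking step would typically run over every candidate document vector, with the highest score treated as the closest match.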

Pricing

Input: operator-dependent
Output: operator-dependent
Context: 33K tokens

Model Info

Provider: NVIDIA
ID: nvidia/llama-nemotron-embed-1b-v2