nvidia/llama-3.2-nemoretriever-1b-vlm-embed-v1
33K context— input— output
nvidia/llama-3.2-nemoretriever-1b-vlm-embed-v1Availability
Pricing for this model is operator-dependent. Open the chat to route a request — the response includes the resolved per-token cost.
Modalities
Input: TextOutput: Embeddings
Pricing
Input—
Output—
Context33K tokens
Model Info
ProviderNVIDIA
IDnvidia/llama-3.2-nemoretriever-1b-vlm-embed-v1