nvidia/llama-3.2-nemoretriever-1b-vlm-embed-v1

33K context, input price not listed, output price not listed

Availability

Pricing for this model is operator-dependent. Open the chat to route a request; the response includes the resolved per-token cost.
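As a rough illustration of how a resolved per-token cost translates into a request cost, here is a minimal sketch. It assumes the cost is quoted per single input token. The function name `estimate_cost` and the example price are hypothetical and are not taken from any operator's actual pricing.

```python
from decimal import Decimal


def estimate_cost(input_tokens: int, per_token_cost: str) -> Decimal:
    """Estimate the cost of one embedding request.

    per_token_cost is passed as a string so Decimal can parse it exactly;
    the value is whatever the operator reports, not a known price.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return Decimal(per_token_cost) * input_tokens


# Hypothetical example: 1,000 input tokens at an assumed 0.000001 per token.
example_cost = estimate_cost(1000, "0.000001")
print(example_cost)
```

Using `Decimal` rather than `float` avoids rounding drift when very small per-token prices are multiplied by large token counts.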

Input: Text

Output: Embeddings

Input price: not listed

Output price: not listed

Context: 33K tokens

Provider: NVIDIA

ID: nvidia/llama-3.2-nemoretriever-1b-vlm-embed-v1