nvidia/llama-3.2-nv-embedqa-1b-v1
33K context— input— output
nvidia/llama-3.2-nv-embedqa-1b-v1Availability
Pricing for this model is operator-dependent. Open the chat to route a request — the response includes the resolved per-token cost.
Modalities
Input: TextOutput: Embeddings
Pricing
Input—
Output—
Context33K tokens
Model Info
ProviderNVIDIA
IDnvidia/llama-3.2-nv-embedqa-1b-v1