Models/Meta/Meta: Llama 3.2 11B Vision Instruct

Meta: Llama 3.2 11B Vision Instruct

131K context$0.34/M input$0.34/M output

meta-llama/llama-3.2-11b-vision-instruct

Description

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

Quick Start

curl https://router.tangle.tools/v1/chat/completions \
  -H "Authorization: Bearer $TANGLE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "meta-llama/llama-3.2-11b-vision-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Modalities

Input: TextInput: ImageOutput: Text

Pricing

Input$0.34/M

Output$0.34/M

Context131K tokens

Model Info

ProviderMeta

IDmeta-llama/llama-3.2-11b-vision-instruct

Explore

Browse all models Model rankings