XI
Xiaomi: MiMo-V2-Omni
262K context$0.40/M input$2.00/M output
xiaomi/mimo-v2-omniDescription
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...
Quick Start
curl https://router.tangle.tools/v1/chat/completions \
-H "Authorization: Bearer $TANGLE_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "xiaomi/mimo-v2-omni",
"messages": [{"role": "user", "content": "Hello!"}]
}'Modalities
Input: TextInput: AudioInput: ImageInput: VideoOutput: Text
Pricing
Input$0.40/M
Output$2.00/M
Context262K tokens
Model Info
Providerxiaomi
IDxiaomi/mimo-v2-omni