MiMo V2 Omni

Xiaomi 🧠 Language Model

Xiaomi's multimodal model with vision and reasoning capabilities

Specifications

Context Window: 262,144 tokens
Max Output: 65,536 tokens
Speed: ●●●●● Fast
Modalities: Input: text, image · Output: text
Features: Vision: Yes, Tools: Yes, Streaming: Yes
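
The two token limits above interact when sizing a request. The following is a minimal sketch of that arithmetic, assuming the context window is shared between the prompt and the generated output (this page does not state that, so treat it as an assumption). The names `max_input_tokens` and `fits` are hypothetical helpers for illustration, not part of any Xiaomi or Zubnet API.

```python
# Limits taken from the Specifications list.
CONTEXT_WINDOW = 262_144  # total tokens
MAX_OUTPUT = 65_536       # maximum generated tokens


def max_input_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Prompt budget left after reserving room for the output.

    Assumption: prompt and output share one context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT")
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Check whether a prompt of the given size leaves the reserved output room."""
    return 0 <= prompt_tokens <= max_input_tokens(reserved_output)


if __name__ == "__main__":
    print(max_input_tokens())      # budget with the full output reserved
    print(max_input_tokens(1024))  # budget with a short reply reserved
```

Under that assumption, reserving the full 65,536 output tokens leaves 196,608 tokens for the prompt, and reserving less output frees a correspondingly larger prompt budget.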

Pricing

Included with plan

Capabilities

chat, tools, reasoning, vision