Qwen Omni Turbo

Alibaba 🧠 Language Model

Multimodal model that accepts text, image, and audio input and produces text or audio output. Supports 49 voices for speech synthesis.

Specifications

Context Window: 32,768 tokens
Max Output: 8,192 tokens
Speed: ●●●●● Fast
Modalities: Input: text, image, audio · Output: text, audio
Features: Vision: Yes · Tools: No · Streaming: Yes
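
As a rough illustration of how the two token limits above interact, the sketch below checks whether a request fits. It assumes the 32,768-token context window is shared between the prompt and the generated output, which is common but not stated on this page. The names `fits_budget` and `max_prompt_tokens` are hypothetical helpers, not part of any Zubnet or Qwen API.

```python
CONTEXT_WINDOW = 32_768  # tokens, from the specifications above
MAX_OUTPUT = 8_192       # tokens, from the specifications above


def fits_budget(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if the request respects both limits.

    Assumption: prompt and output tokens share the context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output <= CONTEXT_WINDOW


def max_prompt_tokens(requested_output: int = MAX_OUTPUT) -> int:
    """Largest prompt that still leaves room for the requested output."""
    return CONTEXT_WINDOW - min(requested_output, MAX_OUTPUT)
```

Under that assumption, reserving the full 8,192 output tokens leaves 24,576 tokens for the prompt; requesting a shorter output frees up correspondingly more room for input.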

Pricing

Included with plan

Use Qwen Omni Turbo on Zubnet →