Llama 4 Scout
Meta
🧠 Language Model
Fast multimodal model. 109B total parameters (17B active, 16-expert MoE). Vision, tool use, 327K-token context. Fits on a single GPU.
Specifications
Context Window: 327,680 tokens
Max Output: 8,192 tokens
Speed: Very Fast
Modalities: Input: text, image · Output: text
Features: Vision: Yes · Tools: Yes · Streaming: Yes
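As a rough illustration of how the two token limits interact, the sketch below checks whether a request fits. It assumes prompt and output tokens share the same context window, which this page does not state; the function name `fits_in_context` and the variable `max_prompt_tokens` are hypothetical and not part of any API.

```python
# Limits copied from the specification list on this page.
CONTEXT_WINDOW = 327_680  # tokens
MAX_OUTPUT = 8_192        # tokens


def fits_in_context(prompt_tokens: int, requested_output: int = MAX_OUTPUT) -> bool:
    """Return True if the request stays within the listed limits.

    Assumption (not confirmed by the page): prompt tokens and output
    tokens are counted against the same context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output <= CONTEXT_WINDOW


# Largest prompt that still leaves room for a full-length reply,
# under the shared-window assumption above.
max_prompt_tokens = CONTEXT_WINDOW - MAX_OUTPUT
```

Under that assumption, a prompt of up to 319,488 tokens would still leave room for the full 8,192-token output.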
Pricing
Included with plan