Llama 4 Scout

Meta 🧠 Language Model

Fast multimodal model. 109B total parameters (17B active, 16-expert MoE). Vision, tool use, 327,680-token context. Fits on a single GPU.

Specifications

Context Window: 327,680 tokens
Max Output: 8,192 tokens
Speed: ●●●●● Very Fast
Modalities: Input: text, image · Output: text
Features: Vision: Yes · Tools: Yes · Streaming: Yes
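
As a rough illustration of how the two token limits above interact, the sketch below computes how large a prompt can be while still leaving room for a full-length reply. It assumes that prompt and output tokens share the same context window, which is a common convention but is not stated on this page; the function names `max_prompt_tokens` and `fits` are hypothetical helpers, not part of any Zubnet or Meta API.

```python
# Limits taken from the specification list above.
CONTEXT_WINDOW = 327_680  # tokens
MAX_OUTPUT = 8_192        # tokens


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Return the largest prompt size that still leaves room for the reply.

    Assumption (not confirmed by the source): prompt and output tokens
    are counted against the same context window.
    """
    if not 0 < reserved_output <= MAX_OUTPUT:
        raise ValueError(f"reserved_output must be in 1..{MAX_OUTPUT}")
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Check whether a prompt of the given size fits under the assumption above."""
    return 0 <= prompt_tokens <= max_prompt_tokens(reserved_output)


if __name__ == "__main__":
    print(max_prompt_tokens())  # 319488
    print(fits(300_000))        # True
    print(fits(325_000))        # False
```

Reserving fewer output tokens, for example `max_prompt_tokens(1024)`, leaves correspondingly more room for the prompt under the same assumption.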

Pricing

Included with plan

Capabilities

chat, tools