Nemotron Nano 9B V2

NVIDIA ๐Ÿง  Language Model

Hybrid Mamba-2 + Transformer architecture. 128K context, multi-language, very fast inference.

Specifications

Context Window128,000 tokens
Max Output8,192 tokens
Speedโ—โ—โ—โ—โ— Very Fast
ModalitiesInput: text  ยท  Output: text
FeaturesTools: Yes Streaming: Yes

Pricing

Included with plan

Capabilities

tools
Use Nemotron Nano 9B V2 on Zubnet →