Nemotron Nano 9B V2
NVIDIA
๐ง Language Model
Hybrid Mamba-2 + Transformer architecture. 128K context, multi-language, very fast inference.
Specifications
Context Window128,000 tokens
Max Output8,192 tokens
Speedโโโโโ Very Fast
ModalitiesInput: text ยท Output: text
FeaturesTools: Yes Streaming: Yes
Pricing
Included with plan