Jina CLIP v2

Jina AI 📐 Embedding

Jina AI's CLIP model — cross-modal text-image embeddings supporting 89 languages with Matryoshka representations. 865M params.

Specifications

Context Window8,192 tokens
Speed Fast
ModalitiesInput: text, image  ·  Output: embedding
FeaturesVision: Yes Tools: No Streaming: No

Pricing

Included with plan

Use Jina CLIP v2 on Zubnet →