M
Marengo 3.0
Twelve Labs
๐ Embedding
Twelve Labs multimodal video embedding model. Converts video, audio, and text into a shared vector space for semantic search across 36 languages.
Specifications
ModalitiesInput: video, audio, text, image ยท Output: embedding
FeaturesTools: No Streaming: No
Pricing
Included with plan