Google Gemini Embedding 2 Unifies Text, Image, Video and Audio in a Single Vector Space
Google has released Gemini Embedding 2, its first natively multimodal embedding model. It maps text, images, video, audio, and documents into one shared vector space, which removes the need for a separate embedding model per modality and lets a query in one modality retrieve content in another.
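To make the idea of a shared vector space concrete, here is a minimal sketch of cross-modal retrieval. It does not call Google's model or any real API: the three-dimensional vectors, the file names, and the `search` helper are all hypothetical stand-ins chosen for illustration. The only point is that once every item lives in the same space, one similarity function can rank text, images, audio, and video together.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


# Hypothetical, hand-written vectors standing in for model output.
# A real embedding model would produce far higher-dimensional vectors.
INDEX = [
    ("text", "caption: a dog catching a frisbee", [0.9, 0.1, 0.0]),
    ("image", "dog_park.jpg", [0.8, 0.2, 0.1]),
    ("audio", "rainstorm.wav", [0.0, 0.1, 0.9]),
    ("video", "city_traffic.mp4", [0.1, 0.9, 0.2]),
]


def search(query_vec, index, k=2):
    """Return the k items closest to query_vec, regardless of modality."""
    scored = [(cosine(query_vec, vec), modality, name)
              for modality, name, vec in index]
    scored.sort(reverse=True)  # highest similarity first
    return [(modality, name) for _, modality, name in scored[:k]]


# Assumed embedding of a text query such as "dog playing outside".
query = [1.0, 0.0, 0.0]
results = search(query, INDEX)
print(results)
```

With these toy vectors, a single text query ranks both a caption and an image above the unrelated audio and video items, with no per-modality model or index involved.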