Webvar
voyage-multimodal-3 Embedding Model - logo

voyage-multimodal-3 Embedding Model

Rich multimodal embedding model that can vectorize interleaved text and content-rich images. 32K context length.
awsPurchase this listing from Webvar in AWS Marketplace using your AWS account. In AWS Marketplace, you can quickly launch pre-configured software with just a few clicks. AWS handles billing and payments, and charges on your AWS bill.

About

Multimodal embedding models are neural networks that transform multiple modalities, such as text and images, into numerical vectors. They are a crucial building block for semantic search/retrieval systems and retrieval-augmented generation (RAG) and are responsible for the retrieval quality. voyage-multimodal-3 is a state-of-the-art multimodal embedding model that uniquely vectorizes interleaved texts + images while capturing visual features from PDFs, slides, tables, figures, and more, eliminating complex document parsing. It improves retrieval accuracy by an average of 19.63% over the next best-performing multimodal embedding model when evaluated across 3 multimodal retrieval tasks (20 total datasets). Latency is 75 ms for a single query with at most 200 tokens, and throughput is 57M tokens per hour at $0.06 per 1M tokens on an ml.g6.xlarge. Learn more about voyage-multimodal-3 here: https://blog.voyageai.com/2024/11/12/voyage-multimodal-3/

Related Products

How it works?

Search

Search 25000+ products and services vetted by AWS.

Request private offer

Our team will send you an offer link to view.

Purchase

Accept the offer in your AWS account, and start using the software.

Manage

All your transactions will be consolidated into one bill in AWS.

Create Your Marketplace with Webvar!

Launch your marketplace effortlessly with our solutions. Optimize sales processes and expand your reach with our platform.