2038 and Artificial Intelligence - Artificial Intelligence Zone

Ovis-1.6: An Open-Source Multimodal Large Language Model (MLLM) Architecture Designed to Structurally Align Visual and Textual Embeddings

Marktechpost

SEPTEMBER 29, 2024

Artificial intelligence (AI) is transforming rapidly, particularly in multimodal learning. Similarly, in the RealWorldQA benchmark, Ovis outperformed leading proprietary models such as GPT4V and Qwen-VL-Plus, scoring 2230, compared to GPT4V’s 2038. to 14.1%, depending on the specific benchmark.

Large Language Models

Large Language Models Artificial Intelligence Artificial Intelligence Data Integration

Artificial Intelligence Zone

Ovis-1.6: An Open-Source Multimodal Large Language Model (MLLM) Architecture Designed to Structurally Align Visual and Textual Embeddings

Webinars

Stay Connected