The Zero-One-Everything Yi-VL multimodal language model is the latest addition to the Yi family of models, designed for both visual comprehension and conversational generation. Yi-VL achieved leading results on the English benchmark MMMU and the Chinese benchmark CMMMU; in particular, Yi-VL-34B reached 41.6% accuracy on MMMU, surpassing other large multimodal models and demonstrating strong interdisciplinary knowledge comprehension and application.

Yi-VL is built on the open-source LLaVA architecture and consists of three components: a Vision Transformer (ViT), a projection module, and a large language model (Yi-34B-Chat or Yi-6B-Chat). The ViT encodes the input image, the projection module aligns the image features with the text feature space, and the language model provides language comprehension and generation.
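To make the three-stage design concrete, the sketch below shows how a LLaVA-style pipeline wires the pieces together: ViT patch features are passed through a small projection MLP into the language model's embedding space, then concatenated with the text token embeddings. This is a minimal illustration, not the actual Yi-VL code; the class name, dimensions, and two-layer MLP shape are assumptions chosen for clarity.

```python
import torch
import torch.nn as nn

class VisionLanguageConnector(nn.Module):
    """Illustrative LLaVA-style projection: ViT features -> LLM embedding space.

    Dimensions are placeholders, not the real Yi-VL configuration.
    """

    def __init__(self, vit_dim: int = 1024, llm_dim: int = 4096):
        super().__init__()
        # A small MLP maps each image patch feature into the same
        # vector space as the language model's text token embeddings.
        self.projection = nn.Sequential(
            nn.Linear(vit_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, image_patch_features: torch.Tensor) -> torch.Tensor:
        # image_patch_features: (batch, num_patches, vit_dim) from the ViT encoder
        return self.projection(image_patch_features)


# Usage: project dummy ViT outputs, then concatenate them with text embeddings
# so the combined sequence can be fed to the language model.
vit_features = torch.randn(1, 576, 1024)       # e.g. 24x24 patches from a ViT
connector = VisionLanguageConnector()
visual_tokens = connector(vit_features)        # (1, 576, 4096), aligned with text space
text_embeddings = torch.randn(1, 32, 4096)     # embeddings of the text prompt
llm_input = torch.cat([visual_tokens, text_embeddings], dim=1)
print(llm_input.shape)                         # torch.Size([1, 608, 4096])
```

The key design point this illustrates is that the language model itself is unchanged: only the projection module has to learn how to translate visual features into "tokens" the LLM already knows how to read.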