Chinese researchers have introduced CogVLM, a powerful open-source vision-language foundation model that advances cross-modal tasks through deep fusion of visual and language information. Rather than relying on shallow alignment, which simply maps image features into a language model's input space, CogVLM adds a trainable visual expert module to the attention and feed-forward layers of a pretrained language model, enhancing visual understanding without sacrificing the model's original language capabilities. It demonstrates exceptional performance on tasks such as image captioning and visual question answering. The open-source CogVLM-28B-zh supports mixed Chinese-English commercial applications, making the release significant for both research and practical deployment.
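
To make the architecture concrete, here is a minimal PyTorch sketch of the visual-expert idea, assuming a simplified attention layer in which image tokens are routed through a parallel, trainable QKV projection while text tokens keep the original language-model weights. The class name, shapes, and the frozen-weight setup are illustrative assumptions, not CogVLM's actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VisualExpertAttention(nn.Module):
    """Sketch of a visual-expert attention layer: image tokens use a
    parallel trainable QKV projection, text tokens use the original
    language-model projection. Illustrative only, not CogVLM's code."""

    def __init__(self, hidden: int, n_heads: int):
        super().__init__()
        self.n_heads, self.head_dim = n_heads, hidden // n_heads
        # Original LM projection; frozen here to mirror a setup where
        # only the visual expert is trained (an assumption of this sketch).
        self.qkv_text = nn.Linear(hidden, 3 * hidden)
        self.qkv_text.requires_grad_(False)
        # Visual expert: a parallel, trainable copy for image positions.
        self.qkv_image = nn.Linear(hidden, 3 * hidden)
        self.out = nn.Linear(hidden, hidden)

    def forward(self, x: torch.Tensor, image_mask: torch.Tensor):
        # x: (batch, seq, hidden); image_mask: (batch, seq) bool,
        # True where the token comes from the image.
        qkv = torch.where(
            image_mask.unsqueeze(-1),   # broadcast over the hidden dim
            self.qkv_image(x),          # expert path for image tokens
            self.qkv_text(x),           # frozen LM path for text tokens
        )
        q, k, v = qkv.chunk(3, dim=-1)

        def split(t: torch.Tensor) -> torch.Tensor:
            b, s, _ = t.shape
            return t.view(b, s, self.n_heads, self.head_dim).transpose(1, 2)

        # Image and text tokens attend jointly in every layer ("deep
        # fusion"); causal masking is omitted for brevity.
        attn = F.scaled_dot_product_attention(split(q), split(k), split(v))
        b, h, s, d = attn.shape
        return self.out(attn.transpose(1, 2).reshape(b, s, h * d))
```

In the paper, the same image/text split is also applied to each layer's feed-forward weights, so the visual expert deepens fusion at every layer while leaving pure-text inference effectively unchanged.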