The National University of Singapore has released NExT-GPT, a multimodal large language model that can both understand and generate text, images, video, and audio, opening the door to end-to-end multimedia AI applications. The model uses a three-tier architecture (multimodal encoders, an LLM core, and modality-specific diffusion decoders), trains only the lightweight projection layers between these tiers, and is instruction-tuned on the MosIT (Modality-switching Instruction Tuning) dataset; its open-source release gives researchers and developers a starting point for building systems that integrate multimodal inputs and outputs. NExT-GPT's distinctive feature is that its LLM emits special modality signal tokens that tell the downstream decoders which modality to produce, with potential applications in content generation and multimedia analysis.
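To make the signal-token idea concrete, here is a minimal, hypothetical Python sketch of the routing pattern the article describes: the LLM's output stream carries special tokens that mark spans to be handed to modality-specific decoders. The tag names (`IMG`, `AUD`) and the decoder functions are illustrative placeholders, not NExT-GPT's actual tokens or API; in the real system the decoders are diffusion models conditioned on learned representations rather than on raw text.

```python
import re

def decode_image(prompt: str) -> str:
    # Stand-in for a diffusion image decoder (e.g. Stable Diffusion).
    return f"[image: {prompt}]"

def decode_audio(prompt: str) -> str:
    # Stand-in for an audio decoder (e.g. AudioLDM).
    return f"[audio: {prompt}]"

# Map each hypothetical signal token to its decoder.
DECODERS = {"IMG": decode_image, "AUD": decode_audio}

def render(llm_output: str) -> str:
    """Replace each <TAG>...</TAG> span with the output of the
    matching modality decoder; plain text passes through unchanged."""
    def dispatch(match: re.Match) -> str:
        tag, content = match.group(1), match.group(2)
        return DECODERS[tag](content)
    return re.sub(r"<(IMG|AUD)>(.*?)</\1>", dispatch, llm_output)

print(render("Here is a cat: <IMG>a fluffy cat on a sofa</IMG> "
             "and its purr: <AUD>soft cat purring</AUD>."))
```

Running the example prints the plain text with each tagged span replaced by its decoder's output, which is the essence of how signal tokens let a single text-generating model drive multimodal generation.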