Shanghai AI Laboratory Open-Sources Its First Text-Image Creation Model, 'Puyu Lingbi'

Shanghai Artificial Intelligence Laboratory

Published in AI News · 2 min read · Oct 10, 2023

Recently, the Shanghai Artificial Intelligence Laboratory (Shanghai AI Lab) released its first large model for combined text-image creation, "Shusheng Puyu Lingbi" (InternLM-XComposer). Puyu Lingbi can hold fluent text-image dialogues in Chinese and English, accurately interpret image content, and generate an illustrated article "with a single click": given only a topic, it produces an article that combines text and images. Article creation follows a "three-step" algorithmic process: generating the text, planning the illustrations, and intelligently selecting the images. In several mainstream multimodal model evaluations, Puyu Lingbi ranks among the leading models, and it is especially strong in Chinese multimodal understanding. Puyu Lingbi has been open-sourced on GitHub and other platforms, and developers are welcome to try it out and build new applications.
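
To make the "three-step" process above more concrete, here is a minimal Python sketch of how such a pipeline could be wired together. It is not the InternLM-XComposer implementation or API: every name in it (`Image`, `generate_text`, `plan_illustrations`, `select_images`, `overlap`, `compose`) is hypothetical, the text generator is a fixed stub, and image selection uses plain word overlap as a stand-in for whatever learned matching the real model performs.

```python
from dataclasses import dataclass


@dataclass
class Image:
    url: str          # hypothetical location of a candidate image
    description: str  # short text describing the image


def generate_text(topic: str) -> list[str]:
    """Step 1 (stub): a real system would call the language model here."""
    return [
        f"An introduction to {topic}.",
        f"Where to find {topic} in the wild.",
        f"Closing thoughts on {topic}.",
    ]


def plan_illustrations(paragraphs: list[str], every: int = 2) -> dict[int, str]:
    """Step 2 (stub): decide which paragraphs get an image and what it shows.

    Every `every`-th paragraph is illustrated and the paragraph itself is
    reused as the caption; this rule is an assumption made for illustration.
    """
    return {i: p for i, p in enumerate(paragraphs) if i % every == 0}


def overlap(a: str, b: str) -> int:
    """Count shared lowercase words: a crude stand-in for a similarity score."""
    return len(set(a.lower().split()) & set(b.lower().split()))


def select_images(plan: dict[int, str], pool: list[Image]) -> dict[int, Image]:
    """Step 3 (stub): pick the best-matching candidate for each planned slot."""
    return {
        i: max(pool, key=lambda img: overlap(caption, img.description))
        for i, caption in plan.items()
    }


def compose(topic: str, pool: list[Image]) -> list[str]:
    """Run the three steps and interleave text with the chosen images."""
    paragraphs = generate_text(topic)
    chosen = select_images(plan_illustrations(paragraphs), pool)
    article = []
    for i, paragraph in enumerate(paragraphs):
        article.append(paragraph)
        if i in chosen:
            article.append(f"[image: {chosen[i].url}]")
    return article
```

Calling `compose("pandas", pool)` with a small list of `Image` candidates returns the paragraphs in order, with an image placeholder after each paragraph that the planning step marked for illustration. Keeping the three steps as separate functions mirrors the description in the article and lets each stub be replaced independently.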

Large Model · Multimodal · Open Source

This article is from AIbase Daily

—— Created by the AIbase Daily Team
