ZhipuAI recently released its latest foundation model, GLM-4-Plus, alongside new visual models, showcasing capabilities comparable to OpenAI's GPT-4, and announced availability starting August 30th. The release marks a significant leap for Chinese AI technology and brings users a new level of intelligent experience.

Key Update Highlights:

Language foundation model GLM-4-Plus: a qualitative leap in language understanding, instruction following, and long-text processing, maintaining a leading position among international peers.

Text-to-image model CogView-3-Plus: performance on par with top-tier industry models such as MJ-V6 (Midjourney V6) and FLUX.

Image/video understanding model GLM-4V-Plus: excels not only at image understanding but also at video understanding based on temporal-sequence analysis. The model will be launched on the open platform bigmodel.cn, becoming the first general-purpose video understanding model API in China.

Video generation model CogVideoX: following the release and open-sourcing of the 2B version, the 5B version has now been officially open-sourced as well, with significantly enhanced performance, making it a leading choice among open-source video generation models.

Zhipu's open-source models have now been downloaded more than 20 million times in total, a significant contribution to the flourishing of the open-source community.

GLM-4-Plus excels in several key areas. On language, the model has reached an internationally leading level in understanding, instruction following, and long-text processing, comparable to GPT-4 and the 405B-parameter Llama 3.1. Notably, GLM-4-Plus significantly strengthens long-text inference through a carefully calibrated mix of short- and long-text training data.

In visual intelligence, GLM-4V-Plus demonstrates excellent image and video understanding. It not only has temporal awareness but can also process and understand complex video content. Notably, the model will be launched on Zhipu's open platform as the first general-purpose video understanding model API in China, giving developers and researchers a powerful tool.

For example, given a video like the one below, if you ask what the player in the green jersey did throughout the clip, the model can accurately describe the player's actions and even pinpoint the second at which the highlight moments occur:

(Screenshot from the official announcement)
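As a rough sketch of what querying this model could look like once the API is live on bigmodel.cn: the model identifier, the `video_url` content type, and the exact request shape below are assumptions modeled on Zhipu's existing `zhipuai` Python SDK, not confirmed details of the release.

```python
# Hypothetical sketch: asking GLM-4V-Plus about a video via Zhipu's open
# platform. The model ID and "video_url" content type are assumptions.
from zhipuai import ZhipuAI  # pip install zhipuai

client = ZhipuAI(api_key="your-api-key")  # key issued on bigmodel.cn

response = client.chat.completions.create(
    model="glm-4v-plus",  # assumed model identifier
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "video_url",
                 "video_url": {"url": "https://example.com/match.mp4"}},
                {"type": "text",
                 "text": "What does the player in the green jersey do, "
                         "and at which second is the highlight moment?"},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```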

ZhipuAI has also made breakthroughs on the generative side. In text-to-image performance, CogView-3-Plus approaches the best models such as MJ-V6 and FLUX. Meanwhile, the video generation model CogVideoX has launched a more powerful 5B version, regarded as the strongest choice among current open-source video generation models.
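For the text-to-image side, a minimal sketch of generating an image through the same open-platform SDK might look like the following; the `cogview-3-plus` model identifier is an assumption, since the SDK's documented image call predates this release.

```python
# Hypothetical sketch: text-to-image with CogView-3-Plus through Zhipu's
# open platform SDK. The model identifier is an assumption.
from zhipuai import ZhipuAI  # pip install zhipuai

client = ZhipuAI(api_key="your-api-key")

response = client.images.generations(
    model="cogview-3-plus",  # assumed model identifier
    prompt="A watercolor painting of a lighthouse at dawn",
)
print(response.data[0].url)  # URL of the generated image
```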

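Because CogVideoX-5B is open-sourced, it can also be run locally. Below is a minimal sketch using Hugging Face diffusers, which ships a `CogVideoXPipeline`; the `THUDM/CogVideoX-5b` weights ID and the generation parameters here are assumptions that may need tuning for your hardware.

```python
# Minimal sketch: text-to-video with the open-source CogVideoX-5B weights
# via Hugging Face diffusers (requires a release that includes
# CogVideoXPipeline). Parameters are illustrative, not tuned.
import torch
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video

pipe = CogVideoXPipeline.from_pretrained(
    "THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # offload submodules to CPU to save VRAM

video = pipe(
    prompt="A panda playing guitar in a bamboo forest, cinematic lighting",
    num_inference_steps=50,
    num_frames=49,
).frames[0]

export_to_video(video, "output.mp4", fps=8)
```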
Most anticipated is the upcoming "video call" feature in Zhipu's Qingyan app, the first AI video call feature open to consumer (C-end) users in China. It spans text, audio, and video modalities and supports real-time inference: users can hold smooth conversations with the AI, and even with frequent interruptions the AI reacts quickly.

Even more striking, simply turning on the camera lets the AI see and understand what the user sees and accurately execute voice commands.

This revolutionary video call feature will launch on August 30th, initially available to a subset of Qingyan users, with external applications accepted. The innovation not only showcases ZhipuAI's technical strength but also opens new possibilities for deeply integrating artificial intelligence into daily life.

Reference: https://mp.weixin.qq.com/s/Ww8njI4NiyH7arxML0nh8w