Today, the Zhipu Technology team released and open-sourced its latest video generation model, CogVideoX v1.5. This version is the latest major update to the CogVideoX series, which the team first launched in August. The update substantially improves video generation: it supports 5-second and 10-second videos, 768P resolution, and output at 16 frames per second. In addition, the I2V (Image-to-Video) model now supports arbitrary aspect ratios, and the release improves the model's understanding of complex semantics.
The Zhipu Technology team recently launched AutoGLM, a new product built on the research of the GLM technology team. AutoGLM is an agent that can simulate a person operating a mobile phone and carry out a variety of tasks on the user's behalf. Its launch marks a significant step for artificial intelligence in the "Phone Use" domain and brings AI applications closer to people's daily lives.
Zhipu AI announced that its end-to-end emotional voice technology has officially launched on the Zhipu Qingyan platform and is now available to all users. The technology overcomes the limitations of traditional text-to-speech (TTS) systems: it can understand contextual nuance and generate natural, emotionally rich dialogue. This marks the evolution of Zhipu AI's speech synthesis from simple text reading to speech that conveys emotion.