Beijing Zhipu Huazhang Technology Co., Ltd. recently announced that its Zhipu GLM-PC intelligent agent has been upgraded and is now officially open for public experience. As the world's first multimodal intelligent agent capable of independently operating a computer, the technology behind GLM-PC is based on Zhipu's multimodal large model, CogAgent. Users can experience this revolutionary computer assistant with just a simple press of the enter key.
Since the release of GLM-PC v1.0 on November 29, 2024, it has been in a beta testing phase. This version introduced the "Deep Thinking" mode, added logical reasoning and code generation capabilities, and also supports the Windows operating system. GLM-PC's capabilities encompass code generation, logical execution, graphical user interface (GUI) understanding, and more, showcasing its strong potential in intelligent operations.
In terms of code generation and logical execution, GLM-PC has the ability to comprehensively analyze goals and resources, generating execution roadmaps that break down large tasks into smaller, manageable sub-tasks for efficient task planning. Once the task planning is complete, the intelligent agent can activate the code generation module for iterative execution, ensuring precise task completion. Additionally, GLM-PC possesses long-thinking capabilities, allowing it to adjust and reflect in real-time, interact with users, and optimize solutions.
In the realm of image and GUI cognition, GLM-PC can accurately identify and understand elements within graphical interfaces, such as buttons and icons, and provide intelligent recommendations based on users' historical operation information. Its image semantic analysis feature can delve into complex images to extract key information, such as trends and indicators. Moreover, GLM-PC can integrate image and text information to provide users with comprehensive perceptual results, assisting them in formulating precise operational plans.
With the continuous development of artificial intelligence technology, the launch of Zhipu GLM-PC undoubtedly offers users a more efficient and intelligent computer experience, marking an important advancement in human-computer interaction.