2024-12-27 09:16:39, AIbase
Zhipu AI Open-Sources Agent Task Model CogAgent-9B: Predicting Actions from Screenshots
Zhipu AI has open-sourced CogAgent-9B, the base model behind GLM-PC, to promote the development of the large-model Agent ecosystem. CogAgent-9B is a specialized Agent task model trained from GLM-4V-9B. Using a screenshot as its only visual input, together with the user-specified task and the history of previous operations, it predicts the next GUI action. This generality makes it applicable across GUI interaction scenarios, including personal computers, smartphones, and in-vehicle systems.
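The prediction loop described above can be illustrated with a minimal sketch. It is not CogAgent's actual API: the `predict` and `capture` callables, the `run_agent` helper, the `"END"` stop signal, and the action strings are all hypothetical stand-ins, chosen only to show how a screenshot, the task, and the operation history might combine into one next-action prediction per step.

```python
from typing import Callable, List

# Hypothetical signature: (screenshot bytes, task text, action history) -> next action.
Predictor = Callable[[bytes, str, List[str]], str]


def run_agent(predict: Predictor, capture: Callable[[], bytes],
              task: str, max_steps: int = 5) -> List[str]:
    """Repeatedly capture the screen and ask the model for the next GUI action."""
    history: List[str] = []
    for _ in range(max_steps):
        screenshot = capture()  # the only visual input at each step
        action = predict(screenshot, task, list(history))
        if action == "END":  # assumed stop signal, not a documented token
            break
        history.append(action)  # executed actions become context for the next step
    return history


def stub_capture() -> bytes:
    """Stand-in for a real screen grab."""
    return b"fake-screenshot-bytes"


def stub_predict(screenshot: bytes, task: str, history: List[str]) -> str:
    """Stand-in for the model: replays a fixed script based on history length."""
    scripted = ["CLICK(search_box)", "TYPE('weather')", "PRESS(enter)"]
    return scripted[len(history)] if len(history) < len(scripted) else "END"


if __name__ == "__main__":
    print(run_agent(stub_predict, stub_capture, "search for the weather"))
```

In a real deployment, the stub predictor would be replaced by a call to the model and each returned action would be executed on the device before the next screenshot is taken.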