"Deep Breathing" Enhances Large Model Performance! Google DeepMind Uses Large Language Models to Generate Prompts, as AI Understands AI Better

At the recent 'Invest Karnataka 2025' conference held in Bangalore, numerous technology industry leaders gathered to discuss the transformative potential of artificial intelligence in India and its impact. Manish Gupta, Senior Director at Google DeepMind, delivered a keynote speech at the conference, emphasizing the need to establish corresponding regulations alongside technological innovation to ensure responsible development.
Recently, Microsoft launched OmniParser V2.0, a new parsing tool designed to convert user interface (UI) screenshots into structured formats. OmniParser enhances the performance of UI agents based on large language models (LLMs), helping users better understand and interact with the information on their screens. The tool's training data includes an interactive icon detection dataset, curated and automatically annotated from popular websites to highlight clickable and actionable areas.
Recently, Tencent Technology (Shenzhen) Co., Ltd. published a patent for a training method and related equipment for large language models, as listed on the Tianyancha app. The patent is titled 'Training Method, Device, Computer Equipment, and Storage Medium for Large Language Models' and aims to enhance the learning capacity and accuracy of large language models through innovative training methods. In training large language models, traditional methods often rely on a single text summary, which may lead to overfitting and reduce the accuracy and diversity of generated content. However, Tencent's new...