Before You Even Speak, We Know What You're Going to Do! Tsinghua and Mianbi Intelligence Join Forces to Create a 'More Understanding' AI Entity!

AIbase基地

Published inAI News · 5 min read · Dec 2, 2024

241

In recent years, large language models represented by ChatGPT have sparked a new wave in the field of AI. These powerful language models can not only understand human instructions but also plan, explore environments, and utilize tools to solve complex tasks, showcasing immense potential in areas such as robotics, personal assistants, and process automation.

However, most existing AI systems are passive and require explicit human instructions to perform tasks. For example, to schedule a meeting, you still have to manually input the time, location, and even list all the attendees, which can be more cumbersome than doing it yourself!

Imagine receiving an email from a colleague suggesting a meeting. A passive AI would wait for you to explicitly instruct it to arrange the meeting. In contrast, a proactive AI would notice the email and take the initiative to suggest scheduling the meeting. This proactivity not only significantly reduces the cognitive load on users but also helps identify potential needs that humans may not have explicitly expressed.

To address the issue of overly passive AI assistants, Tsinghua University and Mianbi Intelligence have teamed up to propose a brand-new AI agent that is no longer a "machine that merely obeys commands," but rather one that can "predict needs" and proactively help you organize tasks before you even say a word!

How does this "magical" AI agent achieve this? The secret weapon is the ProactiveBench dataset! This dataset is like an "encyclopedia" documenting various human activities, containing every letter you type at your computer, every link you click, and even the content you copy and paste!

Using this dataset, researchers trained a reward model, which acts like a "supercomputer simulating the human brain," capable of determining whether the AI agent's behavior aligns with human expectations. If the AI agent performs well, it receives rewards; otherwise, it gets penalized. After repeated training, the AI agent can predict your needs based on your behavior and offer assistance proactively when you need it.

For instance, when you receive an email from a colleague suggesting a meeting, this "predictive" AI agent will automatically recognize the email content and proactively ask if you would like to schedule the meeting. If you agree, it will automatically arrange the time and location and even send out meeting invitations! Isn't it much "smarter" than current AI assistants?

Experimental results show that AI agents trained with the ProactiveBench dataset perform exceptionally well; for example, the Qwen2-7B-Instruct model achieved an F1 score of 66.47% in providing proactive assistance, surpassing all open-source and closed-source models!

Although this "predictive" AI agent is still in the research phase, it brings new hope for the future of human-machine collaboration. We believe that in the near future, we will have a truly "understanding" AI assistant that not only "obeys commands" but also proactively helps solve various problems, making your life easier and more convenient!

Paper link: https://arxiv.org/pdf/2410.12361

AI Daily: 12306 MCP Server Launches; Baidu Launches AI Search Assistant Tizzy.ai; ChatGPT Voice Recording Mode Opens to Plus Users

AI Highlights: Baidu launches ad-free search assistant Tizzy.ai; 12306 open-sources train ticket engine; ChatGPT voice mode for Plus users; FireGEO SaaS template; ReadMeX docs tool; Baidu AI adds video calls; Jackywine's AI companion 'Bella'; OpenAI's Agent Mode; MidJourney enterprise API; MiniMax e-commerce feature; Claude Sonnet4 relaunch.....

AI Unicorn MiniMax Secretly Submitted IPO Application for Hong Kong Stock Market, Targeting Valuation Over 4 Billion USD

Chinese AI unicorn MiniMax accelerates IPO plans, secretly files for Hong Kong listing targeting $4B valuation, eyes A-share market. Raised $300M led by Shanghai state fund, backed by Alibaba, Tencent. Founded by ex-SenseTime execs, focuses on general AI platforms, recently launched new inference and video models.....

Windsurf Re-launches Claude Sonnet 4 Model

AI coding tool Windsurf announced the re-launch of Anthropic's Claude Sonnet 4 model, offering Pro users a monthly quota of 250 API calls (2x credit consumption). The model is known for its 72.7% performance on the SWE-bench test, supports a 200K token context window, and enables code generation and complex refactoring features. Previously, due to Anthropic's restrictions on direct access, Windsurf introduced a BYOK solution. This partnership restoration is being

Google DeepMind Launches MoR Architecture: Expected to Significantly Improve the Efficiency of Large Language Models

DeepMind's Mixture-of-Recursions (MoR) enhances model efficiency via dynamic token routing and recursive depth allocation, outperforming Transformers with fewer parameters. Its selective caching reduces memory pressure, proving especially effective above 360M scale, offering optimized AI deployment solutions.....

ChatGPT Adds Audio Transcription Feature! A Powerful Tool to Easily Record Meeting Highlights

OpenAI launches ChatGPT audio transcription for macOS users, supporting 120-minute recordings with auto-generated transcripts and summaries. Available only to GPT-4o subscribers, it records system audio and microphone input, deleting recordings post-transcription unless users opt in for model training. Enterprise/education users are excluded by default. Not available on Windows/Android/web yet.....

New Breakthrough in Medical AI: OpenMed Launches Over 380 Open-Source Models to Revolutionize Global Medical Technology

OpenMed released 380+ free medical NER models on Hugging Face (Apache2.0). These models (109M-568M params) rival paid alternatives, integrated into major AI ecosystems. Addressing global healthcare shortages, the project allows free use/modification, with COVID screening API previously developed in 5 days. Team plans to expand model library for open-source medical AI.....

Product Finder

Product Submit

AI Models Finder

MCP Servers

MCP Client

MCP Inspector

Case Tutorials

Latest AI News

AI Daily Brief

Before You Even Speak, We Know What You're Going to Do! Tsinghua and Mianbi Intelligence Join Forces to Create a 'More Understanding' AI Entity!

AIbase基地

This article is from AIbase Daily

AI News Recommendations

ByteDance AI Core Personnel Changes: Visual Multimodal Leader Yang Jianchao Announces Temporary Leave

Anthropic Valuation Doubles Beyond Trillion, AI Revenue Surges Fourfold