OpenAI has launched "Operator," an AI agent that carries out tasks on the web on a user's behalf. In its blog post, OpenAI said Operator is in "research preview" and is initially available to ChatGPT Pro subscribers in the United States. The Pro plan costs $200 per month.
Operator is powered by a model called "Computer-Using Agent" (CUA), which combines the vision capabilities of GPT-4o with advanced reasoning trained through reinforcement learning, allowing it to interact with graphical user interfaces (GUIs). OpenAI explained that Operator views web pages through its own built-in browser and interacts with them by typing, clicking, and scrolling. Because it works with the same interface a person would use, Operator can act on websites autonomously without custom API integrations.
While working, Operator can use its reasoning capabilities to self-correct, and it hands control back to the user when it gets stuck. When a website requests sensitive information, such as login credentials, Operator asks the user to take over. It also seeks user confirmation before consequential actions such as sending an email. OpenAI emphasizes that Operator is designed with a strong focus on safety and is meant to refuse harmful requests and block prohibited content.
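The handoff behavior described above can be pictured as a simple gate that sits between the agent's proposed action and the browser. The following is a minimal sketch under stated assumptions, not OpenAI's implementation: the `Action` type, the `gate` and `run` functions, and the keyword lists are all hypothetical names invented for illustration, and a real system would not rely on string matching.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum, auto


class Decision(Enum):
    PROCEED = auto()           # agent acts on its own
    ASK_CONFIRMATION = auto()  # agent pauses and asks the user to approve
    HAND_OVER = auto()         # user takes control of the browser


@dataclass(frozen=True)
class Action:
    kind: str       # "click", "type", or "scroll"
    target: str     # hypothetical label for the on-screen element
    text: str = ""  # text to enter, used only for "type" actions


# Hypothetical keyword lists, assumed purely for this illustration.
SENSITIVE_FIELDS = ("password", "login", "credit card")
CONSEQUENTIAL_TARGETS = ("send email", "place order")


def gate(action: Action) -> Decision:
    """Decide whether a proposed GUI action needs the user's involvement."""
    target = action.target.lower()
    if action.kind == "type" and any(w in target for w in SENSITIVE_FIELDS):
        return Decision.HAND_OVER
    if action.kind == "click" and any(w in target for w in CONSEQUENTIAL_TARGETS):
        return Decision.ASK_CONFIRMATION
    return Decision.PROCEED


def run(plan: list[Action]) -> list[tuple[str, Decision]]:
    """Walk a planned action sequence, stopping once the user must take over."""
    log = []
    for action in plan:
        decision = gate(action)
        log.append((action.target, decision))
        if decision is Decision.HAND_OVER:
            break  # stop acting until the user completes the sensitive step
    return log
```

In this sketch, a plan that scrolls a results list and then reaches a password field would stop at the password field and return control to the user, while a click on a "Send email" button would pause for confirmation rather than proceed automatically.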
OpenAI also said it is collaborating with several well-known companies, including DoorDash, Instacart, OpenTable, Priceline, StubHub, Thumbtack, and Uber, to ensure Operator meets real-world needs and adheres to established industry standards. However, OpenAI cautioned that the tool may struggle with complex interfaces, for example when creating slideshows or managing calendars.
OpenAI plans to expand Operator to Plus, Team, and Enterprise users and to integrate its capabilities into ChatGPT, which would make the agent available to a much wider audience.
Official announcement: https://openai.com/index/introducing-operator/
Key Points:
🌐 OpenAI launches the "Operator" AI agent to assist users in online tasks, initially targeting ChatGPT Pro users.
🖱️ Operator can interact with web pages through a browser, featuring self-correction and user control functions to ensure safety.
🤝 OpenAI collaborates with several well-known companies to meet real-world demands while planning future expansion to more users.