Recently, the Opera team announced a groundbreaking upgrade to browser functionality with the launch of its AI agent, "Browser Operator."
Opera's Executive Vice President, Krystian Kolondra, stated: "For over 30 years, browsers have allowed users to access the internet, but they've never actually done things *for* users. Now, they can. This is unlike anything we've seen or launched before."
Key features of the Browser Operator include autonomy, perception, decision-making, action execution, and adaptive learning. These intelligent capabilities aim to significantly boost user productivity. Users simply describe their tasks using natural language, and the Browser Operator automatically executes them.
For example, a user could ask the Browser Operator to buy a pair of size 8.5 pink Nike running shoes. The Browser Operator will show the user progress updates throughout the task, ensuring the user remains in control.
Opera is transforming the browser into a more user-centric ecosystem, utilizing native client-side solutions to accomplish tasks while protecting user privacy. The Browser Operator runs natively within the browser, leveraging the DOM tree and browser layout data for contextual information. This approach allows for faster response times as it doesn't need to understand screen content through pixels or navigate using mouse pointers. The Browser Operator can access the entire page at once without scrolling, reducing the time and computational resources required. Furthermore, because all operations occur within the browser, it doesn't rely on virtual machines or cloud servers.
Currently, the Browser Operator is in a feature preview stage and is expected to roll out soon through Opera's feature update program. Users can watch a demonstration of the Browser Operator on Opera's official YouTube channel. We anticipate its official release in an upcoming feature update.
Key Highlights:
🌟 Opera launches the "Browser Operator," an AI agent redefining browser capabilities.
🛒 Users can easily accomplish various tasks using natural language commands.
🔒 All operations are performed within the browser, protecting user privacy and enhancing efficiency.