Post Thumbnail

OpenAI Introduces AI Agent Operator OpenAI introduced

Operator – a GPT-4 o-based agent capable of performing online browser tasks. The agent works through a special interface where users can see the browser window and control the assistant’s actions.

Operator uses Computer-Using Agent, combining GPT-4 o’s visual capabilities with advanced thinking through reinforcement learning. Computer-Using Agent achieved 38.1% success on the OSWorld test and 87% on WebVoyager, surpassing previous models.

The agent operates on a remote server via encrypted connection. Users can take control for CAPTCHA input or payment data. Operator has instruction sets for storing user preferences. You can input any request, even with photos, and the assistant will start browsing – you can delegate food ordering, table reservations, ticket purchases, taxi calls, and more. Operator also shows a mini-screen with everything it does in real-time.

OpenAI heavily emphasizes system security and attack resistance. The entire process is monitored by a separate model that can trigger execution stops if something’s wrong. Additionally, suspicious situations will be sent for manual review.

The service is available to Pro users in the US, will be added to Plus subscription in few weeks, and API for developers. Although Anthropic and Google showed similar demonstrations earlier, OpenAI first launched a consumer product, despite Pro subscription unprofitability. Let’s hope that when Operator learns to make purchases independently, it won’t start ordering gifts for itself on its activation day.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
Latest News
Chinese humanoid Bumi costs like iPhone 17 Pro Max

Chinese startup Noetix Robotics introduced the humanoid robot Bumi, which costs as much as an iPhone 17 Pro Max in China. Price - 9998 yuan. That's about 1370 dollars.

Reddit caught Perplexity stealing content

I told earlier that Reddit filed a lawsuit against AI search engine Perplexity. Reddit accuses Perplexity of "industrial" content scraping. But now there are facts and Reddit showed how they caught the defendant in a trap.

OpenAI is developing music generation tool

OpenAI is developing a tool for music generation based on text and audio prompts. This is reported by The Information citing sources. Such a tool could be used to add music to existing videos or to add guitar accompaniment to a vocal track.

Amazon turns couriers into cyborgs with AI smart glasses

Amazon decided to turn its couriers into cyborgs. No, seriously - the company announced smart glasses with AI for delivery workers. The idea, according to the e-commerce giant, is to free up drivers' hands. And spare them from constantly switching gaze between phone, package and surroundings.

OpenAI will add character cameos to Sora

OpenAI published the development roadmap for Sora, and you know what? It seems the company finally realized that video generation isn't just a technological demonstration. But a tool that people need to actually use. Bill Peebles, project head, announced a whole set of updates, and some of them are really interesting.