OpenAI has been laying groundwork. While most users were just starting to really explore ChatGPT Tasks – a new feature that ...
The model underpinning Operator is a Computer-Using Agent (CUA) that combines GPT-4o's vision mode to "see" what's on the user's screen through screenshots with graphical user interfaces (GUIs) that ...
It can also ask follow-up questions to further personalize the tasks it completes, such as login information for other websites. Users can take control of the screen at any time.
OpenAI plans to expand access to Operator across more user tiers and integrate its capabilities into ChatGPT, broadening its ...
OpenAI announced that it is launching a research preview of Operator, an AI agent that can take control of a browser and perform tasks.