A smarter ChatGPT is here — it can now perform tasks using its own system

A smarter ChatGPT is here — it can now perform tasks using its own system

OpenAI has taken a major leap forward with the launch of the ChatGPT Agent , a new AI assistant that combines the best features of its previous tools — Operator and Deep Research — while adding powerful new capabilities. This agent is designed to perform complex tasks independently using its own virtual environment, making it one of the most advanced AI assistants available today.

 

🔹 What Makes the ChatGPT Agent Unique?

Unlike previous versions, the ChatGPT Agent can now carry out tasks using a built-in virtual computer . Based on user instructions, it can:

  • Browse the web
  • Filter and analyze search results
  • Prompt for login when needed
  • Run code and perform data analysis
  • Generate spreadsheets and presentations

All of this is done within a self-contained system, allowing the agent to maintain full context throughout the task.

 

🔹 Tools at Its Disposal

To handle a wide variety of tasks efficiently, the ChatGPT Agent has access to several advanced tools:

  • Visual Web Browser : Navigates websites using a graphical interface
  • Text-Based Browser : Handles simpler, logic-driven queries
  • Built-in Terminal : Executes commands and scripts
  • Direct API Access : Connects with external services
  • ChatGPT Connectors Integration : Enables seamless interaction with third-party tools

This combination allows the agent to perform tasks like downloading files, manipulating data in the terminal, and viewing the results in the browser — all in one smooth workflow.

 

🔹 Impressive Performance on Real-World Tasks

OpenAI reports that the ChatGPT Agent has achieved state-of-the-art results across several benchmark tests. Here are some of its top performances:

  • Humanity’s Last Exam : Scored 41.6 pass@1 (44.4 when running multiple attempts)
  • FrontierMath : Reached 27.4% accuracy
  • Internal Benchmark for Knowledge Work : Matched or exceeded human performance in about half the cases
  • DSBench : Outperformed humans in data science tasks
  • SpreadsheetBench : Scored 45.5% vs. Excel Copilot’s 20.0%
  • BrowseComp : Set a new SOTA with 68.9%
  • WebArena : Achieved a score of 65.4

These results show the agent’s growing ability to handle real-world, complex tasks with high accuracy.

 

🔹 How to Access the ChatGPT Agent

The ChatGPT Agent is now available in the Tools menu of ChatGPT under the new “Agent Mode.” While the agent works, users can follow along with a live narration and even take over the browser manually if needed.

Access will be rolled out gradually:

  • ChatGPT Pro users : Available today
  • Plus and Team subscribers : Access within the next few days
  • Enterprise and Education users : Coming in the coming weeks

Pro users get 400 messages per month with the agent, while other paid plans include 40 messages monthly . Additional usage can be purchased via flexible credit-based options.

 

Similar Posts