How Open AI’s red team made Chat GPT agent into an AI fortress

In a significant update, OpenAI has introduced a robust new feature for ChatGPT called the 'ChatGPT Agent,' which is now available to paying subscribers. This innovative mode allows users to engage the AI in tasks such as logging into email accounts, composing and responding to messages, and managing files autonomously. However, this capability raises important questions about user trust and data security, as it requires a high level of confidence that the AI will not act inappropriately or compromise sensitive information. Keren Gu, a member of OpenAI's Safety Research team, emphasized on X that they have implemented extensive safeguards for the ChatGPT Agent. This model has been classified as 'High capability' under their Preparedness Framework, particularly in the areas of biology and chemistry, underscoring the importance of security in its operations. To address potential vulnerabilities, OpenAI enlisted a specialized 'red team' of 16 PhD-level security researchers, who dedicated 40 hours to rigorously test the ChatGPT Agent. Their efforts uncovered seven critical exploits that could jeopardize the system's integrity during real-world interactions. The findings prompted a series of extensive security enhancements, leading to the submission of 110 different attack simulations, of which 16 surpassed OpenAI's internal risk thresholds. The results of this proactive testing initiative have yielded remarkable security improvements for the ChatGPT Agent. The system now boasts a 95% success rate against visual browser instruction attacks, alongside robust measures to protect against biological and chemical risks. These upgrades were made possible through the insights gained from the red team's findings, which included the identification of fundamental weaknesses in the AI's handling of various tasks. Moreover, OpenAI's collaboration with the UK AISI provided unprecedented access to the internal logic and policy frameworks of the ChatGPT Agent. This partnership revealed that conventional security boundaries are increasingly inadequate when an AI can access shared drives, browse the internet, and execute commands autonomously. In response to the vulnerabilities identified, OpenAI has instituted significant architectural changes, including a dual-layer inspection system to monitor all production traffic in real-time. This approach ensures that critical vulnerabilities are identified and rectified promptly, with the capability to patch security weaknesses within hours rather than weeks. The ongoing evolution of security measures reflects a broader shift in OpenAI's philosophy. The insights from the red team have established a new standard for enterprise AI deployment, emphasizing the importance of monitoring and rapid remediation in maintaining system integrity. As AI continues to advance, the lessons learned from this rigorous testing process will shape the future of AI security, ensuring that safety remains at the core of technological progress.

Sources : VentureBeat

Published On : Jul 21, 2025, 24:40

Startups

Quantum Systems Secures $1.2 Billion in Funding to Revolutionize Defense Technology

Quantum Systems, a pioneering startup focused on autonomous defense technology, has successfully raised $1.2 billion in ...

CNBC | Jul 02, 2026, 14:40

Quantum Systems Secures $1.2 Billion in Funding to Revolutionize Defense Technology

The Hidden Environmental Cost of Big Tech's Pursuit of AI

In recent sustainability reports released by Google and Amazon, alarming figures reveal the environmental toll of the te...

TechCrunch | Jul 02, 2026, 19:20

The Hidden Environmental Cost of Big Tech's Pursuit of AI

Aerospace

Safety Whistleblower Allegations Rock Boeing's Wisk Aero

Wisk Aero, the electric air taxi enterprise under the Boeing umbrella, is facing serious allegations following a lawsuit...

TechCrunch | Jul 02, 2026, 17:40

Safety Whistleblower Allegations Rock Boeing's Wisk Aero

Streaming

Amazon Gears Up for Leo Satellite Internet Launch with New Mission Success

Amazon has announced a significant advancement in its plans to launch the Leo internet-from-space service, revealing tha...

CNBC | Jul 02, 2026, 18:35

Amazon Gears Up for Leo Satellite Internet Launch with New Mission Success

Gadgets

The Rise of Home Robots: Weave Robotics Launches Affordable Isaac 1

In a significant leap toward automating household chores, Weave Robotics, backed by Y Combinator, has introduced its inn...

Business Insider | Jul 02, 2026, 15:25

The Rise of Home Robots: Weave Robotics Launches Affordable Isaac 1

View All News

High-quality, Cost-effective IT Outsourcing

let’s grow together!

portfolio

case study

follow us on

follow us on

How Open AI’s red team made Chat GPT agent into an AI fortress

Quantum Systems Secures $1.2 Billion in Funding to Revolutionize Defense Technology

The Hidden Environmental Cost of Big Tech's Pursuit of AI

Safety Whistleblower Allegations Rock Boeing's Wisk Aero

Amazon Gears Up for Leo Satellite Internet Launch with New Mission Success

The Rise of Home Robots: Weave Robotics Launches Affordable Isaac 1

Collaborate with Benzatine Infotech

High-quality, Cost-effective IT Outsourcing

let’s grow together!

portfolios

case study

follow us on

follow us on

portfolio

case study

follow us on

follow us on

How Open AI’s red team made Chat GPT agent into an AI fortress

Quantum Systems Secures $1.2 Billion in Funding to Revolutionize Defense Technology

The Hidden Environmental Cost of Big Tech's Pursuit of AI

Safety Whistleblower Allegations Rock Boeing's Wisk Aero

Amazon Gears Up for Leo Satellite Internet Launch with New Mission Success

The Rise of Home Robots: Weave Robotics Launches Affordable Isaac 1

Collaborate with Benzatine Infotech