OpenAI says AI browsers may always be vulnerable to prompt injection attacks

OpenAI says AI browsers may always be vulnerable to prompt injection attacks

As OpenAI enhances security measures for its Atlas AI browser, the tech giant acknowledges that the threat of prompt injections—attacks that manipulate AI systems to execute harmful commands embedded within web pages or emails—remains a significant concern. In a recent blog post, OpenAI emphasized that similar to online scams and social engineering, prompt injection issues are unlikely to be fully resolved. Launched in October, the ChatGPT Atlas browser has already faced scrutiny from security researchers who demonstrated that simple phrases in Google Docs could alter its behavior. On the same day, Brave released a statement indicating that indirect prompt injection poses a systematic threat to AI-driven browsers, including Perplexity’s Comet. The U.K. National Cyber Security Centre recently echoed these concerns, warning that prompt injection attacks on generative AI platforms may never be completely mitigated, which could expose websites to potential data breaches. The agency advised cybersecurity professionals to focus on minimizing the risks and impacts rather than assuming these attacks can be entirely eliminated. OpenAI has adopted a proactive approach to address these persistent security challenges, implementing a rapid-response cycle that aims to identify new attack strategies before they can be exploited. This method aligns with the strategies of competitors like Anthropic and Google, who emphasize the need for layered defenses that undergo continuous stress testing. Where OpenAI is innovating is through the development of a reinforcement learning-based automated attacker. This bot simulates a hacker's attempts to embed malicious instructions into an AI agent, allowing it to analyze and refine its approach based on the target AI's internal logic. OpenAI has indicated that this method has revealed novel attack strategies that were not evident during conventional red teaming efforts. In a demonstration, the automated attacker was able to insert a harmful email into a user's inbox. When the AI agent processed this email, it followed the hidden commands, resulting in a resignation message instead of a simple out-of-office reply. Fortunately, after a security update, the Atlas browser's 'agent mode' successfully detected this attempt and alerted the user. Despite the ongoing battle against prompt injection, OpenAI is committed to rigorous testing and rapid updates to strengthen its defenses. However, a spokesperson did not provide specific data on whether recent security enhancements have significantly reduced successful injection attempts. Experts like Rami McCarthy from cybersecurity firm Wiz advocate for a balanced view on the risks associated with AI browsers, highlighting that while reinforcement learning is effective for adapting to threats, it is only part of the solution. McCarthy noted that agentic browsers exist in a complex space of autonomy and access, posing unique risks due to their potential exposure to sensitive data. To mitigate these risks, OpenAI recommends that users limit the access of their agents and clearly instruct them on tasks, rather than granting broad permissions. They stress that vague instructions can lead to unintended influences from malicious content, even with safeguards in place. While OpenAI prioritizes user protection against prompt injections, McCarthy raises questions about the overall value and risk balance of agentic browsers, suggesting that their current risk profile may not justify their use in everyday scenarios.

Sources : TechCrunch

Published On : Dec 22, 2025, 22:15

Automotive
Lucid Motors Unveils Ambitious Plans for Affordable Electric SUVs

Lucid Motors is setting its sights on the bustling midsize SUV market, a move that could prove pivotal for the company's...

Ars Technica | Mar 12, 2026, 17:55
Lucid Motors Unveils Ambitious Plans for Affordable Electric SUVs
Automotive
Rivian Delays Launch of Affordable R2 SUV Until Late 2027

Rivian has unveiled the specifications and pricing details for its highly anticipated R2 SUV, but customers eager to pur...

TechCrunch | Mar 12, 2026, 21:00
Rivian Delays Launch of Affordable R2 SUV Until Late 2027
Streaming
Substack Unveils Innovative Recording Studio for Creators

Substack is making significant strides in the realm of video content with the introduction of its new Substack Recording...

TechCrunch | Mar 12, 2026, 18:45
Substack Unveils Innovative Recording Studio for Creators
AI
AI Boosts U.S. Military Edge, Says Palantir CEO Amid Rising Tensions

During an interview with CNBC, Palantir's CEO Alex Karp emphasized the significant advantage that artificial intelligenc...

CNBC | Mar 12, 2026, 22:05
AI Boosts U.S. Military Edge, Says Palantir CEO Amid Rising Tensions
Computing
HP Faces Pressure Over Firmware Updates Impacting Third-Party Ink Compatibility

The International Imaging Technology Council (Int’l ITC) has raised concerns against HP regarding recent firmware update...

Ars Technica | Mar 12, 2026, 20:35
HP Faces Pressure Over Firmware Updates Impacting Third-Party Ink Compatibility
View All News