Anthropic says its new AI model “maintained focus” for 30 hours on multistep tasks

On Monday, Anthropic introduced Claude Sonnet 4.5, which the company touts as its most advanced AI language model to date, with improved capabilities in coding and computer use. Alongside this release, Anthropic also launched Claude Code 2.0, an AI command-line agent designed for developers, and the Claude Agent SDK, a toolkit that lets developers create their own AI coding agents.

The company claims that Sonnet 4.5 maintained focus on a single project for more than 30 continuous hours while tackling intricate, multistep tasks, though it has not disclosed specific details about those tasks. Historically, AI models have struggled to stay coherent over extended periods, often losing track as errors accumulate and context windows reach their limits. Anthropic has previously noted that earlier versions of Claude, such as the 4.0 models, managed to play Pokémon for more than 24 hours and could refactor code over a span of seven hours.

To appreciate the significance of Sonnet 4.5, it helps to understand how Anthropic structures its AI language models. The Claude family includes three sizes: Haiku (the smallest), Sonnet (mid-range), and Opus (the largest). Haiku was last updated in November 2024 (to version 3.5), Sonnet in May (to version 4.0), and Opus in August (to version 4.1).

A model's size, determined by the number of parameters in its neural network, correlates with its contextual depth and problem-solving abilities. However, larger models tend to be slower and more costly to operate, so AI companies look for a balance between performance and cost efficiency. For several years, Claude Sonnet has filled this niche for Anthropic.

Developers have shown a fondness for Claude Code, and the company expresses confidence in Sonnet's latest iteration, declaring, "Claude Sonnet 4.5 is the best coding model in the world. It excels in building complex agents, demonstrates superior performance in using computers, and showcases significant improvements in reasoning and mathematics."

Source: Ars Technica

Published on: Sep 29, 2025, 22:15
