Anthropic has to keep revising its technical interview test so you can’t cheat on it with Claude


Since 2024, Anthropic's performance optimization team has given job candidates a take-home test to evaluate their expertise. As AI coding tools have improved, the team has had to revise the test repeatedly to counter AI-assisted cheating. Team lead Tristan Hume described the problem in a recent blog post.

Hume noted that each new Claude model has forced a redesign of the test. Claude Opus 4 outperformed most human applicants under the same time limit, though the test could still distinguish the strongest candidates. Claude Opus 4.5 then matched even those top performers. Because the take-home test is not proctored in person, candidates could use AI to cheat, and unqualified applicants could end up with top scores. "Under the constraints of the take-home test, we no longer had a way to distinguish between the output of our top candidates and our most capable model," Hume explained.

The problem is not unique to Anthropic; schools and universities around the world are dealing with similar issues as AI tools undermine academic integrity. Anthropic, however, is well placed to tackle it. Hume ultimately designed a new test that focuses less on hardware optimization and is novel enough to stump current AI tools. He also shared the original test in the post and invited readers who can outperform Opus 4.5 to share their solutions.

Sources : TechCrunch

Published On : Jan 22, 2026, 15:05
