
Since 2024, the performance optimization team at Anthropic has used a take-home test to evaluate job candidates' expertise. The rise of advanced AI coding tools, however, has forced the team to revise the test repeatedly to counter AI-assisted cheating. Team lead Tristan Hume detailed these ongoing challenges in a recent blog post.

Hume noted that the test has needed a redesign with each new iteration of the company's Claude models. Claude Opus 4 outperformed many human applicants under the same time limit, though the test could still distinguish the strongest candidates. The subsequent Claude Opus 4.5 matched even those top performers. Because take-home tests are completed without in-person oversight, this raises the concern that unqualified candidates could excel by relying on AI. "Under the constraints of the take-home test, we no longer had a way to distinguish between the output of our top candidates and our most capable model," Hume explained.

The dilemma is not unique to Anthropic; educational institutions around the world are grappling with similar problems as AI tools undermine academic integrity. Anthropic, however, is well positioned to tackle the challenge. Hume ultimately designed a new test that focuses less on hardware optimization and is novel enough to stump current AI tools. He also invited readers to try the original test, encouraging anyone who can outperform Opus 4.5 to share their solution.