'Inference whales' are breaching AI coding startup business models

AI coding services face a significant challenge as inference costs continue to rise. Heavy users, known as "inference whales," are driving up expenses and forcing startups to rethink their business strategies and pricing models to avoid substantial losses. Inference, the process of running an AI model to produce output, becomes more expensive with newer reasoning models that break user requests into multiple steps. The problem is especially pronounced in AI coding services, where developers run automated agents on long-running tasks, sending operational costs soaring.

Many of these services sell subscriptions that promise unlimited use for a fixed monthly price. Some users exploit this by submitting massive projects, which puts immense financial pressure on the startups, since they must still pay for the underlying AI models. The result is a precarious balance between steady subscription revenue and escalating backend expenses. Eric Simons, CEO of StackBlitz, warned that businesses that primarily resell AI inference are exposed: "If you're purely reselling AI inference, your business could be very fragile and vulnerable."

Anthropic had previously offered its Claude Code service through a $200-per-month unlimited plan. Some users took advantage of this pricing and ran up usage costs that far exceeded their subscription fees. One developer on the Claude Code Leaderboard reportedly used nearly 11 billion tokens, racking up costs of approximately $35,000 while paying only $200. To address this, Anthropic announced weekly rate limits starting August 28; users who exceed them will need to purchase additional capacity. An Anthropic representative said the change aims to ensure consistent performance for all developers while managing extreme usage by a small number of customers.

One developer, Albert Örwall from Sweden, described his experience using the Claude Code subscription for his projects. He said his regular workflow could generate inference costs of around $500 per day, which shows how quickly usage can outrun a flat subscription fee. Örwall plans to adapt his coding practices to the new limits, a sign of how developers will need to change the way they approach their projects.

Cursor, another popular AI coding service, also changed its pricing from unlimited requests to a tiered system, which confused users. The move reflects a broader industry trend: companies are grappling with rising inference costs despite expectations that prices would fall. Demand for advanced AI models continues, and the assumed decline in costs has not materialized. Instead, new models often arrive with higher prices, which strains the financial viability of these services.

Ethan Ding, CEO of TextQL, noted that the shift toward longer, automated AI workflows means that even if per-token prices drop, overall costs can remain prohibitively high. Offering unlimited usage under any subscription model is becoming increasingly untenable. The math does not add up, and startups are left to adapt their pricing to how their users actually work.
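
The arithmetic behind these figures can be sketched in a few lines of Python. The token count, cost, subscription price, and daily cost are the ones reported above. The implied price per million tokens is derived from them rather than reported, and the "before" and "after" workloads in the second part are hypothetical numbers chosen only to illustrate Ding's point.

```python
# Figures reported in the article.
tokens_used = 11_000_000_000   # "nearly 11 billion tokens"
inference_cost_usd = 35_000    # "approximately $35,000"
subscription_usd = 200         # flat monthly plan
daily_cost_usd = 500           # Örwall's estimate for his regular workflow

# Derived values (not reported; computed from the figures above).
implied_price_per_million = inference_cost_usd / (tokens_used / 1_000_000)
cost_to_revenue = inference_cost_usd / subscription_usd
monthly_loss_usd = inference_cost_usd - subscription_usd
days_to_exceed_fee = subscription_usd / daily_cost_usd


def task_cost(tokens_per_task: int, price_per_million: float) -> float:
    """Cost in USD of one task at a given price per million tokens."""
    return tokens_per_task / 1_000_000 * price_per_million


# Hypothetical illustration: the per-token price halves, but an agentic
# workflow uses ten times as many tokens per task.
cost_before = task_cost(tokens_per_task=50_000, price_per_million=10.0)
cost_after = task_cost(tokens_per_task=500_000, price_per_million=5.0)

if __name__ == "__main__":
    print(f"Implied price: ${implied_price_per_million:.2f} per million tokens")
    print(f"Cost-to-revenue ratio: {cost_to_revenue:.0f}x")
    print(f"Loss on this subscriber: ${monthly_loss_usd:,}")
    print(f"Days until a $500/day workflow exceeds the fee: {days_to_exceed_fee}")
    print(f"Hypothetical task cost: ${cost_before:.2f} -> ${cost_after:.2f}")
```

With the reported figures, the inference bill is 175 times the subscription fee. In the hypothetical scenario, halving the per-token price while a task consumes ten times as many tokens still makes that task five times more expensive.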

Sources : Business Insider

Published On : Aug 12, 2025, 17:00
