Companies are hoarding AI compute because of FOMO — and they're sitting on most of it

Companies are hoarding AI compute because of FOMO — and they're sitting on most of it

Recent analysis reveals that many organizations are investing heavily in AI infrastructure, yet a significant portion of this computing power remains untapped. Cast AI, a platform specializing in automated cloud cost optimization, has released its 2026 State of Kubernetes Optimization Report, highlighting a striking trend among companies like BMW and Cisco. According to the report, organizations are provisioning approximately 20 times the GPU capacity they actually utilize at any moment. Data gathered from 23,000 clusters across a wide range of enterprises indicates that average GPU utilization is a mere 5%, meaning that roughly 95% of the allocated GPU capacity goes unused. Similarly, CPU utilization stands low at 8% of total capacity. The report emphasizes the financial implications of this underutilization. While an idle CPU might waste just a few cents hourly, a GPU, which is essential for tasks like machine learning and video processing, can incur losses of several dollars per hour. Given that GPUs can be up to 50 times more costly than their CPU counterparts, this inefficiency is concerning. These findings emerge as businesses scramble to acquire high-demand AI chips, particularly premium GPUs such as Nvidia's Blackwell models, which are experiencing price hikes due to overwhelming demand. Laurent Gil, CEO of Cast AI, explained the urgency driving this phenomenon. Unlike traditional cloud services that allow for flexible resource allocation, the current scarcity of GPUs leads companies to enter long-term contracts, often resulting in over-purchasing out of fear of missing out. "The act of buying has no correlation with whether you need them or not," Gil pointed out, emphasizing that companies are acquiring GPUs simply because they are available, not necessarily because they require them. Gil urges CTOs to reassess their GPU usage, suggesting they ask their teams, "We already have a few thousand of those GPUs. How are we using them?" He notes that with only 5% utilization, there could be 20 times more latent capacity available that organizations are unaware of before they consider buying additional machines.

Sources : Business Insider

Published On : Apr 21, 2026, 13:31

Streaming
YouTube Directors Dominate Box Office with Horror Hits

This weekend has seen a remarkable surge in cinema, with two films directed by former YouTube stars topping the box offi...

TechCrunch | May 30, 2026, 21:35
YouTube Directors Dominate Box Office with Horror Hits
Gadgets
Transform Your Pizza Nights with the Ninja Artisan Outdoor Oven

For pizza enthusiasts who crave homemade creations without the associated hassle, the Ninja Artisan Outdoor Pizza Oven c...

TechCrunch | May 30, 2026, 13:25
Transform Your Pizza Nights with the Ninja Artisan Outdoor Oven
Science
Unraveling the Roots of Vaccine Opposition: A Historical Perspective

Stanley Plotkin, a pivotal figure in vaccine development, recently expressed his sorrow over the current state of public...

Ars Technica | May 30, 2026, 11:05
Unraveling the Roots of Vaccine Opposition: A Historical Perspective
Computing
Exploring the Next Wave of Web Browsers: Exciting Alternatives to Chrome and Safari in 2026

As the battle for dominance in the web browser market intensifies, Google Chrome and Apple's Safari continue to lead, wi...

TechCrunch | May 30, 2026, 13:25
Exploring the Next Wave of Web Browsers: Exciting Alternatives to Chrome and Safari in 2026
Startups
Snap Alumni Launch Ghost Angels Fund to Revolutionize Social Media

A collective of 20 former Snap employees has united to establish a new investment fund named Ghost Angels, aimed at supp...

TechCrunch | May 30, 2026, 17:20
Snap Alumni Launch Ghost Angels Fund to Revolutionize Social Media
View All News