DeepSeek tests “sparse attention” to slash AI processing costs

DeepSeek tests “sparse attention” to slash AI processing costs

Have you ever noticed that ChatGPT tends to lag during extended conversations? This slowdown can be traced back to a critical mathematical challenge: managing long text sequences necessitates substantial computational power, even with the efficiency strategies that are already in place. While major U.S. tech firms can easily invest in additional hardware, Chinese AI startup DeepSeek faces unique pressures due to export restrictions that limit access to advanced AI chips. Consequently, the company is driven to optimize performance while using fewer resources. On Monday, DeepSeek unveiled an experimental version of its latest simulated reasoning language model, named DeepSeek-V3.2-Exp. This model introduces a novel approach called "DeepSeek Sparse Attention" (DSA). This technique is a variation on a computational method that has already been implemented by some of the leading AI models globally. Notably, OpenAI pioneered sparse transformers in 2019 for its creation of GPT-3, and Google Research explored similar methodologies in its 2020 "Reformer" models. However, the current usage of sparse attention by Western AI companies remains largely under wraps. Although sparse attention has been known for several years, DeepSeek asserts that it has achieved "fine-grained sparse attention for the first time" with its new model, which has resulted in a 50% reduction in API costs as evidence of its efficiency improvements. To appreciate the significance of DeepSeek V3.2, it’s essential to revisit the company’s recent achievements. Earlier this year, DeepSeek's R1 simulated reasoning model reportedly matched OpenAI's performance while requiring only $6 million for training. Additionally, its chat application recently topped the iPhone App Store charts, even surpassing ChatGPT. The spotlight is firmly on DeepSeek as it challenges some of America's top AI research labs.

Sources : Ars Technica

Published On : Sep 30, 2025, 20:20

Automotive
Uber Unveils Ambitious Plan to Power the Future of Autonomous Vehicles

Uber is making significant strides in the realm of autonomous transportation with the launch of its new division, Uber A...

TechCrunch | Feb 23, 2026, 23:35
Uber Unveils Ambitious Plan to Power the Future of Autonomous Vehicles
Startups
Canva Expands Creative Horizons with Dual Startup Acquisitions

Canva, the renowned creative platform, has announced significant strategic moves with the acquisition of two innovative ...

TechCrunch | Feb 24, 2026, 08:00
Canva Expands Creative Horizons with Dual Startup Acquisitions
Startups
Space Force Prioritizes Payload Development Over Rocket Expansion

In a recent address at a space finance conference in Dallas, Maj. Gen. Stephen Purdy, the Space Force officer responsibl...

Ars Technica | Feb 23, 2026, 22:15
Space Force Prioritizes Payload Development Over Rocket Expansion
AI
AI Misfire: A Researcher's Email Disaster Sparks Cautionary Tale

In a surprising turn of events, Summer Yue, a security researcher at Meta AI, recently shared an alarming experience inv...

TechCrunch | Feb 24, 2026, 01:20
AI Misfire: A Researcher's Email Disaster Sparks Cautionary Tale
AI
Anthropic Raises Alarm Over AI Model Misuse by Chinese Competitors

Anthropic, renowned for its Claude chatbot, has raised serious allegations against three Chinese AI companies: DeepSeek,...

Business Today | Feb 24, 2026, 06:50
Anthropic Raises Alarm Over AI Model Misuse by Chinese Competitors
View All News