A single point of failure triggered the Amazon outage affecting millions

A single point of failure triggered the Amazon outage affecting millions

A recent outage affecting Amazon Web Services (AWS) has been attributed to a single point of failure that triggered a domino effect across its vast network. According to a detailed analysis by Amazon's engineers, this incident lasted for over 15 hours and 32 minutes, impacting millions of users globally. Network monitoring firm Ookla reported that its DownDetector service recorded more than 17 million interruptions from approximately 3,500 organizations. The outage primarily affected users in the United States, the United Kingdom, and Germany, with major platforms like Snapchat, AWS itself, and Roblox among those most frequently reported as down. This incident is noted as one of the most significant internet outages in history, according to Ookla. The engineers identified the root cause as a software bug within the DynamoDB DNS management system, which is crucial for overseeing load balancer stability. The system functions by periodically updating DNS configurations for various endpoints within AWS. A race condition, an error linked to the timing of events beyond developers' control, led to unexpected and detrimental failures in this case. This particular race condition occurred in the DNS Enactor, a component of DynamoDB responsible for continuously updating domain lookup tables. It encountered significant delays while trying to refresh DNS updates across several endpoints. Meanwhile, another component, the DNS Planner, continued to create new configurations, leading to a timing conflict with the DNS Enactor. This misalignment ultimately resulted in the failure of the entire DynamoDB system. Amazon engineers have provided insights into this complex failure, highlighting the challenges faced in large-scale network management.

Sources : Ars Technica

Published On : Oct 24, 2025, 21:55

Computing
LexisNexis Defends Its Position Amidst AI Market Fears

As the artificial intelligence landscape continues to evolve, LexisNexis finds itself at the center of investor concerns...

Business Insider | Mar 01, 2026, 10:30
LexisNexis Defends Its Position Amidst AI Market Fears
Gadgets
Honor Unveils the Sleek Magic V6 Foldable with Game-Changing Battery Technology

Honor has officially introduced its latest foldable phone, the Magic V6, featuring an impressive 6,600 mAh battery and a...

TechCrunch | Mar 01, 2026, 15:40
Honor Unveils the Sleek Magic V6 Foldable with Game-Changing Battery Technology
AI
Key Insights from Sam Altman's OpenAI Discussion on Pentagon Partnership

In a recent Saturday night session on social media, Sam Altman, the CEO of OpenAI, provided insights into the company's ...

Business Insider | Mar 01, 2026, 06:10
Key Insights from Sam Altman's OpenAI Discussion on Pentagon Partnership
Gaming
Discover the Top Alternatives to Discord Amid Privacy Concerns

Discord is set to implement mandatory age verification for its users by the end of 2026, raising concerns regarding the ...

TechCrunch | Mar 01, 2026, 19:10
Discover the Top Alternatives to Discord Amid Privacy Concerns
AI
Claude Surges to the Top of the App Store Amid Pentagon Controversy

In a remarkable turn of events, Anthropic's chatbot Claude has ascended to the pinnacle of the Apple App Store's free ap...

TechCrunch | Mar 01, 2026, 15:05
Claude Surges to the Top of the App Store Amid Pentagon Controversy
View All News