System Design

It is your fault if your application is down

Do not blame the infrastructure provider

Uwe Friedrichsen

Recently, AWS experienced one of its rare partial outages. Its DynamoDB service was disrupted in the US-East-1 region, and the disruption was traced to a latent race condition in the DynamoDB DNS management system. A comprehensive post-event summary describing the outage, its cause and the resulting effects can be found here.