Distributed Systems

It is your fault if your application is down

Do not blame the infrastructure provider

Uwe Friedrichsen

16 minute read

(Big) zucchinis on a table

Recently, AWS experienced one of its rare partial outages. Its DynamoDB service experienced a disruption in the US-East-1 region that could be tracked down to a latent race condition in the DynamoDB DNS management system which caused the disruption. A comprehensive post-event summary describing the outage, its cause and the resulting effects can be found here.

A note about eventual consistency - Part 2

Revisiting a massively misunderstood topic

Uwe Friedrichsen

17 minute read

Ducks in the surf

In the previous post we discussed what eventual consistency actually means and why we sometimes need to favor eventual consistency over strong consistency. We also saw that most of the time we will not perceive any differences between eventual and strong consistency if set up properly. The differences only become apparent if the system encounters adverse conditions like, e.g., a network partition, loss of a node, or alike.