Downtime

The HealthCare.gov Experience: Why Critical Systems Fail

Richard Cook, Royal Institute of Technology, Stockholm

The amazing thing isn’t that systems like the healthcare.gov government web site can fail in spectacular fashion, says Dr. Richard Cook. It’s that it doesn’t happen more often. Cook, an expert on failures in complex systems, says it’s human nature to push these systems to the “hairy edge of failure.” Read More

Network Issues Cause Amazon Cloud Outage

It was a stormy week in the cloud, as an outage at Amazon Web Services affected some customers and sparked discussion about resiliency strategies. (Photo by BCP via Flickr.

There was another Amazon cloud outage this morning as the oldest AWS cloud computing region stuttered, reminding folks that if you heavily rely on that the US-EAST region, it’s good to have contingency plans. Read More

Additional Downtime Articles