Problems with UPS equipment and configuration are the most frequently cited cause of data center outages, according to a survey of more than 450 data center professionals.
Facebook was offline for more than two hours today after a configuration change created a feedback loop that overwhelmed a database cluster. The only way to fix the problem was to take the web site offline.
Database industry analyst Curt Monash has an interesting unofficial account of the database crash that led to last week's lengthy outage for the Chase.com online banking portal.
The Chase.com online banking portal is back online and processing customer bill payments that were delayed during lengthy outages Tuesday and Wednesday.
Rackspace Hosting is in the news today after the company invoked its terms of service in terminating web services for the Florida church that has scheduled a Koran-burning event for this Saturday.
Does the downtime at social media hub Digg reflect challenges in deploying NoSQL databases like Cassandra? Or is it simply a case of a company launching a new site architecture before it was ready for prime time?
Many critical services in the state of Virginia were crippled Thursday by computer failures in a state data center in Chesterfield, state officials said.
Police in Salt Lake City say an employee of a mortgage company opened fire on a $100,000 server with a .45 caliber automatic, and then concocted a cover story that his gun had been stolen and used to shoot up the IT equipment.
How important are the number of "nines" in your site's uptime? In a world where getting things right 99 percent of the time is normally a win, 99 percent uptime is widely perceived as a failure. Case in point: Twitter's uptime for June.