Performance Problems for Rackspace Cloud

10 comments

Rackspace reports that its cloud computing service is “degraded,” with many customers reporting their sites are unreachable. The company attributed the problem to an unusual load spike in the storage system supporting its cloud platform. The outage came several hours after the Rackspace Cloud disabled CRON, a command commonly used to automate tasks on Unix and Linux systems. By early evening, the company said performance had improved.

“Starting yesterday we began experiencing very high loads on our storage devices for cluster WC1 in DFW,” Rackspace said on its status page. “In order to reduce load we have shut down processes like CRON to ensure core site content continue to load.  While load spikes are common in our cloud infrastructure, we have not been able to fully identify the root cause of these unusual issues.

“We are working with engineers from inside and outside the company with the best expertise on these issues to resolve them and develop a plan of action to ensure we do not repeat this state. We have a series of changes that are being implemented in real time.  We are being careful to minimize issues as we proceed.”

UPDATE: At about 6 pm Central time Rackspace provided an update: “We have been seeing improved performance on our Cloud Sites WC1 storage cluster for the last few hours. Assuming stability continues we will resume CRON operations this evening. At this time, we cannot declare victory on this issue, but we have many plans in place to continue to increase headroom and ensure stability under all conditions.”

About the Author

Rich Miller is the founder and editor at large of Data Center Knowledge, and has been reporting on the data center sector since 2000. He has tracked the growing impact of high-density computing on the power and cooling of data centers, and the resulting push for improved energy efficiency in these facilities.

Add Your Comments

  • (will not be published)

10 Comments

  1. Don

    These issues at RackspaceCloud are killing me. Today, was the delivery date of an important nat'l client. Well, I will see if they trust my firm to handle their web affairs after multiple days of nothing. I can't tweak or show anything and their staff meeting is today. Crap!!!

  2. We're a very small, niche business that chooses to spend more for a service like RackSpace so that we can deliver a fast website to our clients with nearly zero downtime. These outages are killing us too. It isn't just the downtime, it is what it does to our customer base, like Don said, it is hard enough to establish trust. We then have to expend scarce time explaining what happened to our users. To add insult to injury, did anyone see the email from RackSpace today announcing a $50 monthly increase, and the reasons why? It said in part. "We felt it was necessary to update prices for new customers to reflect the value provided from the service, while still remaining very competitive versus other cloud platform as a service offerings."

  3. jwb

    Wow, I really, really, really don't want my cloud provider reaching into my servers and shutting off my processes. Their intervention should end at gracefully shutting off my instances, not monkeying around with their operating systems.

  4. Rackspace has likely oversubscribed the platform -- and if they are running CRON jobs on the platform, then it is likely based on the Xen hypervisor, which has fewer high-availability features compared to VMware ESX. I am familiar with these technologies because my firm builds private VM solutions for clients who require a higher level of security and performance. Although ESX is more expensive, we chose it over Xen because of the the enhanced high-availability capabilities.

  5. Anonymous

    RS Cloud Sites does not use virtual machines. When you have thousands of poorly written cron jobs the storage will be affected and it's the most non impacting way of lowering the load on the storage. RS Cloud Servers are not being affected by Cloud Site issues. Noone logs into your Cloud Server except for you.

  6. Jane

    If you actually want a fast website for your clients with nearly zero downtime use a company that has enterprise solutions like Carpthia Hosting. Rackspace has lost all my trust.

  7. jim

    Funny (but not really), this is an old article but the same thing just happened this week. i've been really unimpressed with the performance of my (admittedly pretty small -- 1024mb) cloud server instances. 10+ second page loads on a Drupal site. Awesome. Glad we're giving demos to investors this week. I'm over it, Rackspace, i'm moving on.

  8. Dawson

    Rackspace Cloudsites has been having outages and performance problems daily for several weeks now. Tech support gives vagues responses and try to up-sell existing clients like us into traditional hosting solutions. Rack space has failed to deliver a multi-tenant cloud hosting solution and appears unable to fix the problem. What is worse is they are not proactively working with customers. Epic Fail!