<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Bank of America Downtime Rattles Nerves</title>
	<atom:link href="http://www.datacenterknowledge.com/archives/2008/09/23/bank-of-america-downtime-rattles-nerves/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.datacenterknowledge.com/archives/2008/09/23/bank-of-america-downtime-rattles-nerves/</link>
	<description>News and analysis about data centers, cloud computing, managed hosting and disaster recovery</description>
	<lastBuildDate>Mon, 13 Feb 2012 17:24:17 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
	<item>
		<title>By: Steve Henning</title>
		<link>http://www.datacenterknowledge.com/archives/2008/09/23/bank-of-america-downtime-rattles-nerves/comment-page-1/#comment-944</link>
		<dc:creator>Steve Henning</dc:creator>
		<pubDate>Thu, 02 Oct 2008 01:01:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.datacenterknowledge.com/?p=3255#comment-944</guid>
		<description>Outages (and brownouts) like the one that affected  BofA are going to continue until these enterprises get their Operations act together and start taking a more proactive approach to performance management. Every IT department I speak with is using a variety of siloed monitoring solutions and relying on static thresholds for individual metric measurements to determine if they are having a problem. Of course, this is not very effective because if they set the thresholds too high, by the time they get alerts end users are already calling them to complain. Set them lower and they get constant alert flow that masks the real problem precursors and they still find out about problems from end users. Most of these folks are probably not monitoring critical end user experience data and are not incorporating business performance metrics so they focus efforts on problems that are really impacting the business. Even the most sophisticated IT operations, with their fancy BSM dashboards and Event Management systems and complex processes and procedures cannot prevent problems from affecting end users and the bottom line of the business. Lets not even get into the affect on these company’s reputations…

So what is missing… Well… an automated “brain” that can intergrate with their existing monitoring infrastructure and understand the normal behavior of all the components that make up these complex, customer-facing business services. A solution that can add context and tell IT Operations when to pay attention and what to pay attention to. Lets face it… their current tools aren’t giving them these two critical pieces of information. In fact, these tools are confusing the issue unintentionally.

Performance management analytics solutions exist that take metric data from siloed monitoring sources and analyze it holistically, learning the normal behavior of every metric collected and sending a heads up when significant abnormal behaviors indicate a problem is imminent. These solutions often predict problems hours before occurrence and include the most likely root cause symptoms so that action can be taken to prevent them.

Until IT Operations teams embrace solutions such as these, you&#039;ll have plenty more of these types of outages to report on, Rich...</description>
		<content:encoded><![CDATA[<p>Outages (and brownouts) like the one that affected  BofA are going to continue until these enterprises get their Operations act together and start taking a more proactive approach to performance management. Every IT department I speak with is using a variety of siloed monitoring solutions and relying on static thresholds for individual metric measurements to determine if they are having a problem. Of course, this is not very effective because if they set the thresholds too high, by the time they get alerts end users are already calling them to complain. Set them lower and they get constant alert flow that masks the real problem precursors and they still find out about problems from end users. Most of these folks are probably not monitoring critical end user experience data and are not incorporating business performance metrics so they focus efforts on problems that are really impacting the business. Even the most sophisticated IT operations, with their fancy BSM dashboards and Event Management systems and complex processes and procedures cannot prevent problems from affecting end users and the bottom line of the business. Lets not even get into the affect on these company’s reputations…</p>
<p>So what is missing… Well… an automated “brain” that can intergrate with their existing monitoring infrastructure and understand the normal behavior of all the components that make up these complex, customer-facing business services. A solution that can add context and tell IT Operations when to pay attention and what to pay attention to. Lets face it… their current tools aren’t giving them these two critical pieces of information. In fact, these tools are confusing the issue unintentionally.</p>
<p>Performance management analytics solutions exist that take metric data from siloed monitoring sources and analyze it holistically, learning the normal behavior of every metric collected and sending a heads up when significant abnormal behaviors indicate a problem is imminent. These solutions often predict problems hours before occurrence and include the most likely root cause symptoms so that action can be taken to prevent them.</p>
<p>Until IT Operations teams embrace solutions such as these, you&#8217;ll have plenty more of these types of outages to report on, Rich&#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>

