<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:series="http://unfoldingneurons.com/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
xmlns:rawvoice="http://www.rawvoice.com/rawvoiceRssModule/"
>

<channel>
	<title>Gestalt IT &#187; high availability Archives  &#8211; Gestalt IT</title>
	<atom:link href="http://gestaltit.com/tag/high-availability/feed/" rel="self" type="application/rss+xml" />
	<link>http://gestaltit.com</link>
	<description>Independent Experts United</description>
	<lastBuildDate>Wed, 08 Feb 2012 17:00:34 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<image>
			<title>Gestalt IT</title>
			<url>http://gestaltit.com/wp-content/uploads/2009/02/gestalt-it-feedicon-21.png</url>
			<link>http://gestaltit.com</link>
			<width>144</width>
			<height>37</height>
			<description>Independent Experts United</description>
		</image><!-- podcast_generator="Blubrry PowerPress/2.0.4" -->
	<itunes:summary>Gestalt IT is a community of independent IT infrastructure experts. We gather at GestaltIT.com and our Tech FIeld Day events to discuss the topics of the day. This podcast includes video and audio recordings of these discussions.</itunes:summary>
	<itunes:author>Stephen Foskett</itunes:author>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://gestaltit.com/wp-content/uploads/powerpress/Gestalt_IT_Tech_Field_Day_Roundtable_Podcast_600.png" />
	<itunes:owner>
		<itunes:name>Stephen Foskett</itunes:name>
		<itunes:email>stephen@fosketts.net</itunes:email>
	</itunes:owner>
	<managingEditor>stephen@fosketts.net (Stephen Foskett)</managingEditor>
	<itunes:subtitle>The best independent IT commentary</itunes:subtitle>
	<itunes:keywords>Storage, Virtualization, Networking, IT</itunes:keywords>
	<image>
		<title>Gestalt IT &#187; high availability Archives  &#8211; Gestalt IT</title>
		<url>http://gestaltit.com/wp-content/uploads/powerpress/Gestalt_IT_Tech_Field_Day_Roundtable_Podcast_144.png</url>
		<link>http://gestaltit.com</link>
	</image>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:category text="Technology">
		<itunes:category text="Tech News" />
	</itunes:category>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
			<item>
		<title>Four Fundamental Best Practices for Enterprise IT</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/fundamental-practices-enterprise/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/fundamental-practices-enterprise/#comments</comments>
		<pubDate>Wed, 29 Sep 2010 19:00:46 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Tech]]></category>
		<category><![CDATA[alignment]]></category>
		<category><![CDATA[best practices]]></category>
		<category><![CDATA[complexity]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Everything]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[high availability]]></category>
		<category><![CDATA[Personal]]></category>
		<category><![CDATA[Redundancy]]></category>
		<category><![CDATA[SLA]]></category>
		<category><![CDATA[standardization]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=3702</guid>
		<description><![CDATA[Although many people are cynical about the whole idea of best practices, I'm a believer. I think that such beasts do exist, just that too many companies, analysts and especially consultants spend too much time applying the label to whatever works in their best interest at the time. To counteract this cesspool of non-best practices, I thought it best to put down a few ideas of my own. Following are four fundamental best practices I have distilled from almost 20 years in enterprise IT. I wonder if you agree with them.]]></description>
			<content:encoded><![CDATA[<p><!-- google_ad_section_start --></p>
<div id="attachment_3736" class="wp-caption aligncenter" style="width: 310px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/09/Daves-Bike-Tools.jpg" ><img class="size-medium wp-image-3736" title="Dave's Bike Tools" src="http://blog.fosketts.net/wp-content/uploads/2010/09/Daves-Bike-Tools-300x174.jpg" alt="" width="300" height="174" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">Every technician develops a set of tools and practices that serve them well. In this post, I present my best practices for enterprise IT.</p>
</div>
<p>Although many people are cynical about the whole idea of best practices, I’m a believer. I think that such beasts do exist, just that too many companies, analysts and especially consultants spend too much time applying the label to whatever works in their best interest at the time. To counteract this cesspool of non-best practices, I thought it best to put down a few ideas of my own. Following are four fundamental best practices I have distilled from almost 20 years in enterprise IT. I wonder if you agree with them.</p>
<blockquote><p>Before proceeding, I suggest that you take a look at <a href="http://foskettservices.com/2010/09/best-practice-definition-not-opinion/" >the three principle tests that I use to define a best practice</a>.</p></blockquote>
<h3>Best practice 1: Minimize Complexity</h3>
<p>IT has a tough position in business. We are ignored while everything is working well, and in trouble when things start falling apart. We are held accountable, but generally cannot control our own destiny. The only way we have a fighting chance is to minimize the complexity of the solutions we must support.</p>
<blockquote><p>As I said in <a href="http://blog.fosketts.net/2007/07/24/sailing-the-titanic-why-we-need-ilm-and-then-some/" >Sailing the Titanic (Why We Need ILM and Then Some!)</a>:  “To make a timely (and tired) Harry Potter analogy, IT are the <a href="http://en.wikipedia.org/wiki/House_elf" rel="nofollow" >house-elves</a> of the business – powerful but subservient, with little input into what happens above and around them.”</p></blockquote>
<p>This is where the current trends of virtualization, consolidation, and standardization comes in. A single administrator can manage a far-larger environment if it’s standardized, and these systems tend to be far more reliable as well.  Sadly, minimizing complexity is the antithesis of a true hacker’s heart. Who wouldn’t want to play with the latest and greatest toys? Why not build an intricate and interconnected system? But these solutions fail my “best practices” test: They’re risky, unusual and just plain imprudent.</p>
<h3>Best Practice 2: Use the Right Tool For the Job</h3>
<p>Any woodworker or mechanic will tell you that using the right tool makes any job easier. Even if it works, it’s just bad practice to masquerade network-attached storage (NAS) as a block device or a storage area network (SAN) as a collection of shared files. I am always skeptical of so many oddball innovations trumpeted by vendors large and small because they seem to fail the “right tool” practice.</p>
<blockquote><p>I wrote more about this in <a href="http://blog.fosketts.net/2010/04/27/process-solutions-process-problems-technical-solutions-technical/" >Use Process Solutions For Process Problems, Technical Solutions For Technical Ones</a> and <a href="http://blog.fosketts.net/2010/05/07/i-ignore-nas/" >Why Do I Ignore NAS?</a></p></blockquote>
<p>But sometimes they work out just fine. You see, best practices can’t predict the future or what works in a certain situation. It’s better to minimize complexity and stick to what you know than to press for the perfect solution every time. It wouldn’t be right to bring SAN storage into an all-NAS environment just for a single database application, for example. Although we generally shouldn’t try to drive a nail with a screwdriver, it can be the best option when a hammer can’t be found!</p>
<h3>Best Practice 3: Prepare For Failure</h3>
<p>The hacker in me is tempted to stop tinkering as soon as a system is functional, but the realist in me knows that I have to make it twice as good. You can’t prepare for every eventuality, but you can seek out probable points of failure and harden them.</p>
<blockquote><p>For example, see Chris Evans’ discussion of <a href="http://www.thestoragearchitect.com/2010/08/24/choosing-between-monolithic-and-modular-architectures-part-i/" >Choosing Between Monolithic and Modular Architectures</a></p></blockquote>
<p>Many of the fundamental best practices in enterprise IT come down to this. Redundancy from RAID and N+1 components, the widespread use of multi-path technology, and high-availability design are all examples of preparing for the worst. We would not bother with all of this if we didn’t know that errors occur even in well-engineered systems, and that everything we build is sensitive to failure.</p>
<h3><strong>Best Practice 4: Align Expectations with Reality</strong></h3>
<p>We all face conflicting demands, and we are tempted to offer best-case responses to keep everyone happy. But excessive optimism leaves IT folks stressed and end users frustrated when things don’t go right. It’s time to adjust expectations.</p>
<blockquote><p>See <a href="http://blog.fosketts.net/2010/04/29/techie-business-schism/" >The Techie/Business Schism</a>, where I talk about the looming disconnect even within IT</p></blockquote>
<p>Concepts like service level agreements (SLAs) and business impact analyses come from an innate knowledge that the real world doesn’t always conform to the best-case assumption. Although we don’t have real control over user demands, we can try to set correct expectations by admitting what we can and can’t do realistically.  Don’t let those who rely on us labor under misconceptions of our capabilities.</p>
<h3>Stephen’s Stance</h3>
<p>We can’t let others lead us astray with talk of “best practices”, but we can define some of our own. My <a href="http://foskettservices.com/2010/09/best-practice-definition-not-opinion/" >three-part best practices test</a>, and these four general practices, have served me well over the years. What are your best practices?  <em>Image credits: </em><a href="http://www.flickr.com/photos/bre/552152780/" rel="nofollow" ><em>Dave’s Bike Tools</em></a><em> by </em><em><a href="http://www.flickr.com/photos/bre/" rel="nofollow" >bre pettis</a></em></p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/stephen/deallocating-core-issue-thin-provisioning/"  rel="bookmark" class="crp_title">De-Allocating is the Core Issue for Thin Provisioning</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/flexible-path-services-future/"  rel="bookmark" class="crp_title">Flexible IT and the Path to the Services Future</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/fcoe-symbolism-7/"  rel="bookmark" class="crp_title">FCoE Symbolism</a></li><li><a href="http://gestaltit.com/featured/stephen/donate-swag-school-kids/"  rel="bookmark" class="crp_title">Donate Your Swag to School Kids In Need</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/multipath-activepassive-dual-active-activeactive/"  rel="bookmark" class="crp_title">Multipath: Active/Passive, Dual Active, and Active/Active</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/fundamental-practices-enterprise/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/fundamental-practices-enterprise/">Four Fundamental Best Practices for Enterprise IT</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/featured/" title="View all posts in Featured" rel="category tag">Featured</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a>, <a href="http://gestaltit.com/category/all/tech/" title="View all posts in Tech" rel="category tag">Tech</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/fundamental-practices-enterprise/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How do you define high availability and disaster recovery?</title>
		<link>http://gestaltit.com/all/tech/storage/bas/define-high-availability-disaster-recovery/</link>
		<comments>http://gestaltit.com/all/tech/storage/bas/define-high-availability-disaster-recovery/#comments</comments>
		<pubDate>Thu, 08 Jul 2010 20:00:51 +0000</pubDate>
		<dc:creator>Bas</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[gestaltit]]></category>
		<category><![CDATA[high availability]]></category>

		<guid isPermaLink="false">http://basraayman.com/?p=313</guid>
		<description><![CDATA[A while back I was on a call with someone who asked me the difference between high availability (HA) and disaster recovery (DR), saying that there are so many different solutions out there and that a lot of people seem to use the terminology but are unable to explain anything more about these two descriptions. So, here’s an attempt to demystify things.]]></description>
			<content:encoded><![CDATA[<p>A while back I was on a call with someone who asked me the difference between high availability (HA) and disaster recovery (DR), saying that there are so many different solutions out there and that a lot of people seem to use the terminology but are unable to explain anything more about these two descriptions. So, here’s an attempt to demystify things.</p>
<p>First of all, let’s take a look at the individual terms:</p>
<h3><strong>High Availability</strong></h3>
<p>According to Wikipedia, you can define availability in the following ways:</p>
<blockquote><p>The degree to which a system, subsystem, or equipment is operable and in a committable state at the start of a mission, when the mission is called for at an unknown, i.e., a random, time. Simply put, availability is the proportion of time a system is in a functioning condition.</p></blockquote>
<blockquote><p>The ratio of (a) the total time a functional unit is capable of being used during a given interval to (b) the length of the interval.</p></blockquote>
<p>And most online dictionaries seem to have a similar definition of availability. When we are talking about HA, we imply that we want the functioning condition of your system to be increased.</p>
<p>Going by the above you will also notice that there is no fixed definition of the availability. Simply put, it would mean that you need to put your own definition in place when talking about HA. You need to define what HA means in your environment. I’ve had customers that needed HA and defined this as the system having a certain amount of uptime, which is one way to measure it.</p>
<p>On the other hand you would be hard pressed if you were able to work with your system, but the data that you were working with was corrupted because one of your power users made an error during a copy job and wrote an older data set in the wrong spot. This would mean that your system is in itself available. You can log on to it, you can work with it, but the output you are going to get will be wrong.</p>
<p>To me, such a scenario would mean that your system isn’t available. After all, it’s not about everything being online. It’s about using a system in the way you would expect it to work. But when you ask most people in IT about availability, the first thing you will likely hear is something related to uptime or downtime. So, my tip to you is once again:</p>
<p>Define what “available” means to you and your organization/customer!</p>
<h3><strong>Disaster Recovery</strong></h3>
<p><a rel="nofollow" href="http://basraayman.files.wordpress.com/2010/07/tornado-natural-disaster-400a061807-e1278406179387.jpg" ><img class="alignright size-full wp-image-382" title="Tornado" src="http://basraayman.files.wordpress.com/2010/07/tornado-natural-disaster-400a061807-e1278406179387.jpg?w=150&amp;h=150" alt="Natural disaster" width="150" height="150" /></a>Let’s do the same thing as before and let’s turn to some general definitions. Wikipedia defines disaster the following way:</p>
<blockquote><p>disaster is a perceived tragedy, being either a natural calamity or man-made catastrophe. It is a hazard which has comes to fruition. A hazard, in turn, is a situation which poses a level of threat to life, health, property, or that may deleteriously affect society or an environment.</p></blockquote>
<p>And recovery is defined the following way (when it comes to health):</p>
<blockquote><p>Healing, or Cure, the process of recovering from an injury or illness.</p></blockquote>
<p>So, in a nutshell this is about bouncing back to your feet once a disaster strikes. Now again, it’s important to define what you would call a disaster, but at least there seems to be some sort of common understanding that anything that would get you back up and running after an entire site goes down, usually falls under the label of a DR solution.</p>
<h3><strong>It all boils down to definitions!</strong></h3>
<p>When you talk to other companies or vendors about HA and/or DR, you will soon notice that most have a different understanding of what HA and DR are. Your main focus should be to have a clear definition for yourself. Try to find out the importance and value of your solution and base your requirements on that. Ask yourself simple questions like for example:</p>
<ul>
<li>What is the maximum downtime I can cope with before I need to start working again? 8 hours per year? 1 hour per year? 4 hours per month? What is my <abbr>RPO</abbr> and <abbr>RTO</abbr></li>
<li>How do I handle planned maintenance? Can I bring everything down or do I need to distribute my maintenance across independent entities?</li>
<li>Can I afford the loss of any data at all? Can I afford the partial loss of data?</li>
<li>What if we see a city-wide power outage? Do I need a failover site, or are all my users in the same spot and won’t be able to work anyway?</li>
</ul>
<p>Questions like these will help you realize that not everything you have running has the same value. Your development system with 6000 people working on it worldwide might need better protection than your productive system that is only being used by 500 people spread through the Baltic region.</p>
<h3><strong>Or in short</strong></h3>
<p>Knowing what kind of protection you need is key. Fact is that both HA and DR solutions never come cheap. If you need the certainty that your solution is available and able to recover from a disaster, you will notice that the price tag will quickly skyrocket. Which is another reason to make sure that you know exactly what kind of protection you need, and creating that definition is the most important starting point. Once you have your own definition, make sure that you communicate those definitions and requirements so that all parties are on the same page. It should make your life a little easier in the end.</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/martin/disastrous-thinking-2/"  rel="bookmark" class="crp_title">Disastrous Thinking</a></li><li><a href="http://gestaltit.com/all/tech/virtualization/bas/virtualization-challenge-knowing-customer-understands/"  rel="bookmark" class="crp_title">Virtualization and the challenge of knowing what your customer understands</a></li><li><a href="http://gestaltit.com/all/tech/storage/bas/vaai-vmware-admin/"  rel="bookmark" class="crp_title">What is VAAI, and how does it add spice to my life as a VMware admin?</a></li><li><a href="http://gestaltit.com/all/tech/virtualization/simon/veeam-vmware-vexperts-whitepaper/"  rel="bookmark" class="crp_title">Veeam’s VMware vExperts White Paper Series</a></li><li><a href="http://gestaltit.com/all/tech/storage/edsai/business-continuity/"  rel="bookmark" class="crp_title">Why you need to be serious about Business Continuity</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/bas/define-high-availability-disaster-recovery/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Bas for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/bas/define-high-availability-disaster-recovery/">How do you define high availability and disaster recovery?</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/featured/" title="View all posts in Featured" rel="category tag">Featured</a>, <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/bas/define-high-availability-disaster-recovery/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>HDS High Availability Manager: How It Works</title>
		<link>http://gestaltit.com/featured/top/stephen/hds-high-availability-manager-works/</link>
		<comments>http://gestaltit.com/featured/top/stephen/hds-high-availability-manager-works/#comments</comments>
		<pubDate>Fri, 29 May 2009 22:59:14 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Exclusive]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Top Story]]></category>
		<category><![CDATA[clustering]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[HAM]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[high availability]]></category>
		<category><![CDATA[MPIO]]></category>
		<category><![CDATA[NMP]]></category>
		<category><![CDATA[PowerPath]]></category>
		<category><![CDATA[replication]]></category>
		<category><![CDATA[TrueCopy]]></category>
		<category><![CDATA[USP]]></category>
		<category><![CDATA[USP-NEXT]]></category>
		<category><![CDATA[usp-v]]></category>
		<category><![CDATA[V-Max]]></category>
		<category><![CDATA[vmware]]></category>

		<guid isPermaLink="false">http://gestaltit.com/?p=860</guid>
		<description><![CDATA[It has been two days since HDS introduced High Availability Manager ("HAM" to us), disappointing some and confusing others. Now that the dust has settled some, it has become clearer just what HAM is and how it works, and we come away more impressed. HDS has taken simple, proven technologies (path management, clustering, synchronous replication) and remixed them into a super-high-availability solution for the largest enterprises. Perhaps this is not what many expected, but it's certainly a worthwhile addition to the company's family of products.
]]></description>
			<content:encoded><![CDATA[<p>It has been two days since HDS surprised the enterprise storage world by not announcing a new storage platform to take on the EMC Symmetrix V-Max. Instead, HDS introduced High Availability Manager (&#8220;HAM&#8221; to us), disappointing some and confusing others. Now that the dust has settled some, it has become clearer just <a href="http://blogs.rupturedmonkey.com/?p=389"  target="_blank">what HAM is</a> and <a href="http://thestoragearchitect.com/2009/05/27/enterprise-computing-usp-v-so-long-and-thanks-for-all-the-fish/"  target="_blank">how it works</a>, and we come away more impressed. <strong>HDS has taken simple, proven technologies (path management, clustering, synchronous replication) and remixed them into a super-high-availability solution for the largest enterprises</strong>. Perhaps this is not what many expected, but it&#8217;s certainly a worthwhile addition to the company&#8217;s family of products.</p>
<div id="attachment_861" class="wp-caption alignright" style="width: 251px"><img class="size-medium wp-image-861" title="HDS HAM" src="http://gestaltit.com/wp-content/uploads/2009/05/picture-11-241x300.png" alt="HAM combines conventional ingredients to create a whole new flavor" width="241" height="300" /><p class="wp-caption-text">HAM combines conventional ingredients to create a whole new flavor</p></div>
<p>High Availability Manager consists of three main components:</p>
<ol>
<li><strong>A conventional multi-pathing agent</strong> like HDLM on each server. This enables the server to continue accessing the storage if one USP fails. It will &#8220;think&#8221; it&#8217;s talking to a single storage target, but will actually be talking to two USPs that can be metro distances (60-100 miles) away, given proper connectivity. Microsoft MPIO will probably be supported as well, and VMware native multi-pathing (NMP) should come shortly after release. Don&#8217;t hold your breath for PowerPath to be officially supported in the short term, but it ought to work fine without changes.</li>
<li><strong>Existing TrueCopy synchronous storage replication technology</strong> will keep the data and quorum disks (see below) in lock-step. This gives the limitation on distance between systems, since latency is the enemy of storage protocol performance. Once the arrays move too far apart, write performance will suffer on the local array while it waits for data to be copied.</li>
<li><strong>Conventional clustering technology with a heartbeat and shared quorum disk</strong> lets both arrays know what&#8217;s going on. The quorum &#8220;lives&#8221; on the remote side, with that secondary array watching to make sure the primary array is still running. If the heartbeat goes away, the secondary array marks the HAMmed LUNs read/write and starts handling I/O from the servers, which will just have failed over.</li>
</ol>
<p>So <strong>HDS&#8217; secret HAM sauce is ketchup</strong>. There&#8217;s no amazing new technology here, and maybe that&#8217;s for the best in the kind of huge, conservative enterprise environments that will use this product. The big change was in programming the USP controllers to monitor the quorum disk and orchestrate the entire failover. In fact, this might even be considered a feature of TrueCopy, not a standalone product.</p>
<p>HDS ought to reconsider one element of the HAM pitch, however. Although it will undoubtedly yield a very highly-available storage architecture, <strong>nothing provides 100% availability</strong>. There are many moving parts involved, and unplanned outages can still happen. Multipath driver bugs are not unheard of: Back in the day, one version of a certain three-letter company&#8217;s product just plain refused to fail over! An instantaneous outage on one of the host channels or a skipped heartbeat on the USP controllers could also cause a failed failover. Plus, <strong>there is no provision for automated fail-back</strong>. Once the failover occurs, the system would certainly be operating in a degraded-availability mode and would require a (planned) outage to re-establish operations.</p>
<p>All that being said, HAM remains an attractive offering for shops with multiple USPs visible to critical servers. They can turn on the HAM software with existing hardware and add peace of mind, knowing that everything is that much more available. One aspect that really impresses is the fact that each USP can be running a different firmware, <strong>reducing the risk of upgrade-induced outages</strong>. Once it ships to customers (in the fourth quarter of this year), we will have to consider the (as-yet unnamed) cost.</p>
<p>And what about the fact that <strong>HAM will also allow seamless upgrades to a new generation of USP hardware</strong>? HDS still isn&#8217;t talking about that possibility!</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/featured/top/stephen/hds-ham-announcement/"  rel="bookmark" class="crp_title">HDS&#8217; HAM-Fisted Announcement Can&#8217;t Be All</a></li><li><a href="http://gestaltit.com/featured/top/stephen/evostor-wmware-storage/"  rel="bookmark" class="crp_title">EvoStor: VMware Storage Evolved!</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/stec-zeusram-ssd/"  rel="bookmark" class="crp_title">STEC Spills the Beans on ZeusRAM SSD</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/emc-vplex-dreary-storage-cluster/"  rel="bookmark" class="crp_title">EMC VPLEX – A Dreary Storage Cluster?</a></li><li><a href="http://gestaltit.com/featured/top/stephen/emc-symmetrix-vmax-neither-nor/"  rel="bookmark" class="crp_title">EMC Symmetrix V-Max Is Neither Monolithic Nor Midrange</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/featured/top/stephen/hds-high-availability-manager-works/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2009. |
<a href="http://gestaltit.com/featured/top/stephen/hds-high-availability-manager-works/">HDS High Availability Manager: How It Works</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/exclusive/" title="View all posts in Exclusive" rel="category tag">Exclusive</a>, <a href="http://gestaltit.com/category/featured/" title="View all posts in Featured" rel="category tag">Featured</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a>, <a href="http://gestaltit.com/category/featured/top/" title="View all posts in Top Story" rel="category tag">Top Story</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/featured/top/stephen/hds-high-availability-manager-works/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

