<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:series="http://unfoldingneurons.com/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
xmlns:rawvoice="http://www.rawvoice.com/rawvoiceRssModule/"
>

<channel>
	<title>Gestalt IT &#187; performance Archives  &#8211; Gestalt IT</title>
	<atom:link href="http://gestaltit.com/tag/performance/feed/" rel="self" type="application/rss+xml" />
	<link>http://gestaltit.com</link>
	<description>Independent Experts United</description>
	<lastBuildDate>Fri, 03 Feb 2012 17:00:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<image>
			<title>Gestalt IT</title>
			<url>http://gestaltit.com/wp-content/uploads/2009/02/gestalt-it-feedicon-21.png</url>
			<link>http://gestaltit.com</link>
			<width>144</width>
			<height>37</height>
			<description>Independent Experts United</description>
		</image><!-- podcast_generator="Blubrry PowerPress/2.0.4" -->
	<itunes:summary>Gestalt IT is a community of independent IT infrastructure experts. We gather at GestaltIT.com and our Tech FIeld Day events to discuss the topics of the day. This podcast includes video and audio recordings of these discussions.</itunes:summary>
	<itunes:author>Stephen Foskett</itunes:author>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://gestaltit.com/wp-content/uploads/powerpress/Gestalt_IT_Tech_Field_Day_Roundtable_Podcast_600.png" />
	<itunes:owner>
		<itunes:name>Stephen Foskett</itunes:name>
		<itunes:email>stephen@fosketts.net</itunes:email>
	</itunes:owner>
	<managingEditor>stephen@fosketts.net (Stephen Foskett)</managingEditor>
	<itunes:subtitle>The best independent IT commentary</itunes:subtitle>
	<itunes:keywords>Storage, Virtualization, Networking, IT</itunes:keywords>
	<image>
		<title>Gestalt IT &#187; performance Archives  &#8211; Gestalt IT</title>
		<url>http://gestaltit.com/wp-content/uploads/powerpress/Gestalt_IT_Tech_Field_Day_Roundtable_Podcast_144.png</url>
		<link>http://gestaltit.com</link>
	</image>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:category text="Technology">
		<itunes:category text="Tech News" />
	</itunes:category>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
			<item>
		<title>Microsoft Adds Data Deduplication to NTFS in Windows 8</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/microsoft-adds-data-deduplication-ntfs-windows-8/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/microsoft-adds-data-deduplication-ntfs-windows-8/#comments</comments>
		<pubDate>Wed, 11 Jan 2012 17:00:06 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[CSV]]></category>
		<category><![CDATA[data deduplication]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Everything]]></category>
		<category><![CDATA[File System]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[Hyper-V]]></category>
		<category><![CDATA[I/O]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[NTFS]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[Personal]]></category>
		<category><![CDATA[Rick Vanover]]></category>
		<category><![CDATA[server]]></category>
		<category><![CDATA[Terabyte home]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[Windows 8]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=6475</guid>
		<description><![CDATA[The next version of Microsoft Windows Server includes integrated data deduplication technology. Microsoft is positioning this as a boon for server virtualization and claims it has very little performance impact. But how exactly does Microsoft's de-duplication technology work?]]></description>
			<content:encoded><![CDATA[<div id="attachment_6628" class="wp-caption aligncenter" style="width: 310px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://static.fosketts.net/wp-content/uploads/2012/01/Microsoft-Windows-8-Dedupe-Stack.jpg" ><img class="size-medium wp-image-6628 " title="Microsoft Windows 8 Dedupe Stack" src="http://static.fosketts.net/wp-content/uploads/2012/01/Microsoft-Windows-8-Dedupe-Stack-300x225.jpg" alt="" width="300" height="225" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">Windows 8 server editions will include a filter driver for NTFS for data deduplication</p>
</div>
<p>The next version of Microsoft Windows Server includes <strong>integrated data deduplication technology</strong>. Microsoft is positioning this as a boon for server virtualization and claims it has very little performance impact. But how exactly does Microsoft’s de-duplication technology work?</p>
<h3>Introducing Windows 8 Deduplication</h3>
<p>Let’s make one thing clear right from the start: Microsoft started from a clean sheet and invented their own deduplication technology. This is not a licensed, cloned, or copied feature as far as I can tell. There are some clever aspects to it, along with a few head scratchers for folks like me who’ve seen lots of different deduplication approaches.</p>
<p><strong>Microsoft’s deduplication is layered onto NTFS in Windows 8</strong>, and will be a feature add-on for Server users. It is implemented as a filter driver on a per volume basis, with each volume a complete, self describing unit. It is cluster aware, and fully crash consistent on all operations. This is a pretty neat trick: As is typical for Microsoft, deduplication will be a simple, transparent feature.</p>
<p>Now let’s talk for a moment about what Windows 8 deduplication is not.</p>
<ul>
<li>It is a <strong>server-only</strong> feature, like so many of Microsoft’s storage developments. But perhaps we might see it deployed in low-end or home servers in the future.</li>
<li>It is <strong>not supported on boot or system volumes</strong>.</li>
<li>Although it should work just fine on removable drives, <strong>deduplication requires NTFS</strong> so you can forget about FAT or exFAT. And of course the connected system must be running a server edition of Windows 8.</li>
<li>Although <strong>deduplication does not work with clustered shared volumes</strong>, it is supported in Hyper-V configurations that do not use CSV.</li>
<li>Finally, deduplication does not function on encrypted files, files with extended attributes, tiny (less than 64 kB) files, or re-parse points.</li>
</ul>
<h3>Some Technical Details on Deduplication in Windows 8</h3>
<p>Microsoft Research spent 2 years experimenting with algorithms to find the “cheapest” in terms of overhead. <strong>They select a chunk size  for each data set</strong>. This is typically between 32 KB and 128 KB, but smaller chunks can be created as well. Microsoft claims that most real-world use cases are about 80 KB. The system processes all the data looking for “fingerprints” of split points and selects the “best” on the fly for each file.</p>
<p>After data is de-duplicated, Microsoft compresses the chunks and stores them in a special “chunk store” within NTFS. This is actually  part of the System Volume store in the root of the volume, so dedupe is volume-level. The entire setup is self describing, so a deduplication NTFS volume can be read by another server without any external data.</p>
<p>There is some redundancy in the system as well. Any chunk that is referenced more than x times (100 by default) will be kept in a second location. All data in the filesystem is checksummed and will be proactively repaired. The same is done for the metadata. The deduplication service includes a scrubbing job as well as a file system optimization task to keep everything running smoothly.</p>
<p>Windows 8 deduplication cooperates with other elements of the operating system. <strong>The Windows caching layer is dedupe-aware</strong>, and this will greatly accelerate overall performance. Windows 8 also includes a new “express” library that makes compression “20 times faster”. Compressed files are not re-compressed based on filetype, so zip files, Office 2007+ files, etc will be skipped and just deduped.</p>
<p>New writes are not deduped – <strong>this is a post-process technology</strong>. The data deduplication service can be scheduled or can run in “background mode” and wait for idle time. Therefore, I/O impact is between “none and 2x” depending on type. Opening a file is less than 3% greater I/O and can be faster if it’s cached. Copying a large file can make some difference (e.g. 10 GB VHD) since it adds additional disk seeks, but multiple concurrent copies that share data can actually improve performance.</p>
<h3>Stephen’s Stance</h3>
<p>Although I am intrigued by Microsoft’s new deduplication technology in Windows 8 server, I still have many questions about its usefulness and impact on performance. Concentrating duplicate data in the system volume makes sense from a technical perspective, but could lead to an I/O hotspot on the disk. This is especially true for external caching storage systems, since there is no integration between Microsoft deduplication and storage array features. I am particularly concerned about the use of deduplication with VHD files in Hyper-V, since it could eat up valuable system RAM and impact I/O performance.</p>
<p>If you would like to try Microsoft deduplication for yourself, I am happy to report that it is included in <a href="http://msdn.microsoft.com/en-us/windows/br229518"  rel="nofollow">the developer preview of Windows 8 that is available on Dev Center</a>. Here are <a href="http://social.msdn.microsoft.com/Forums/zh/windowsdeveloperpreviewgeneral/thread/3f601771-1400-47c4-9aec-bb9bc45b2d85"  rel="nofollow">a few commands</a> to get you started, and read <a href="http://www.techrepublic.com/blog/networking/configuring-windows-server-8-deduplication/4918" >Rick Vanover’s post</a> too!</p>
<pre>Import-Module ServerManager
Add-WindowsFeature -name FS-Data-Deduplication
Import-Module Deduplication
Enable-DedupVolume E:
get-dedupvolume</pre>
<blockquote><p>Note: I am a Microsoft MVP and Microsoft briefs me on upcoming technologies under NDA. This post is based on a Microsoft briefing from November which was said at the time not to be covered by any NDA. All of this information could be gleaned by experimenting with the Windows 8 developer preview, but it’s much easier to just go to the source.</p></blockquote>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/joerg/netapp-deduplication-indepth/"  rel="bookmark" class="crp_title">NetApp Deduplication An In-depth Look</a></li><li><a href="http://gestaltit.com/all/tech/storage/bill/fixed-block-variable-block-deduplication-quick-primer/"  rel="bookmark" class="crp_title">Fixed Block vs Variable Block Deduplication – A Quick Primer</a></li><li><a href="http://gestaltit.com/featured/top/stephen/windows-storage-server-2008/"  rel="bookmark" class="crp_title">Windows Storage Server-Based Systems Step Into 2008</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-fcoe-support/"  rel="bookmark" class="crp_title">Where Is Microsoft’s FCoE Support?</a></li><li><a href="http://gestaltit.com/all/tech/storage/edsai/data-dedupe-comes-to-zfs/"  rel="bookmark" class="crp_title">Data Dedupe comes to ZFS</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/microsoft-adds-data-deduplication-ntfs-windows-8/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2012. |
<a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-adds-data-deduplication-ntfs-windows-8/">Microsoft Adds Data Deduplication to NTFS in Windows 8</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/microsoft-adds-data-deduplication-ntfs-windows-8/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Flash Storage and Automated Storage Tiering</title>
		<link>http://gestaltit.com/all/tech/storage/joerg/flash-storage-automated-storage-tiering/</link>
		<comments>http://gestaltit.com/all/tech/storage/joerg/flash-storage-automated-storage-tiering/#comments</comments>
		<pubDate>Fri, 16 Sep 2011 16:30:00 +0000</pubDate>
		<dc:creator>Joerg Hallbauer</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[flash]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[tiered storage]]></category>

		<guid isPermaLink="false">http://gestaltit.com/all/tech/storage/joerg/flash-storage-automated-storage-tiering/</guid>
		<description><![CDATA[In recent years, a move toward automated storage tiering has begun in the data center. This move has been inspired by the desire to continue to drive down the cost of storage, as well as the introduction of faster, but more expensive storage in the form of Flash memory in the storage array marketplace.]]></description>
			<content:encoded><![CDATA[<p>In recent years, a move toward automated storage tiering has begun in the data center. This move has been inspired by the desire to continue to drive down the cost of storage, as well as the introduction of faster, but more expensive storage in the form of Flash memory in the storage array marketplace. Flash memory is significantly faster than spinning disks, and thus it’s ability to provide very high performance storage has been of interest. However, its cost is considerable, and therefore a way to utilize it and still bend the cost curve downward was needed. Note that Flash memory has been implemented in different ways. It can be obtained as a card for the storage array controller, or as SSD disk drives, and even, as cache on regular spinning disks. However it is implemented, it’s speed and expense remains the same.</p>
<p>Enter the concept of tiered storage again. The idea was to place only that data which absolutely required the very high performance of Flash on Flash, and to leave the remaining data on spinning disk. The challenge with tiered storage in the way that it has been defined in the past was that it meant that too much data would be placed on very expensive Flash since traditionally an entire application would have all it’s data placed on a single tier. Even if only specific parts of the data at the file, or LUN level were placed on Flash, the quantity needed would still be very high, thus driving the costs of for a particular application up. It was quickly recognized that the only way to make Flash cost effective would be to place only the blocks which are “hot” for an application in Flash storage, thereby minimizing the footprint of Flash storage.</p>
<p>The issue addressed by automated storage tiering is that you no longer need to know ahead of time what the proper tier of storage for a particular application’s data needs to be. Furthermore the classification of the data can occur at a much more fine-grained block level rather than the file or the LUN as with some earlier automated storage tiering implementations.</p>
<p>Flash has changed the landscape of storage for the enterprise. Currently, Fash/SSD storage can cost 16-20X what Fiber channel, SAS, or SATA storage can cost. The dollars per GB model ends up looking something like the following:</p>
<div class="separator" style="clear: both; text-align: center;"><a rel="nofollow" href="http://1.bp.blogspot.com/-epdVz-bwsi8/TdatYM_WjTI/AAAAAAAAAB8/RNBssUAd1QA/s1600/pic1.png" style="margin-left: 1em; margin-right: 1em;" ><img src="http://1.bp.blogspot.com/-epdVz-bwsi8/TdatYM_WjTI/AAAAAAAAAB8/RNBssUAd1QA/s400/pic1.png" alt="" width="392" height="400" border="0" /></a></div>
<p>However the IOPS per $ model looks more like this:</p>
<div class="separator" style="clear: both; text-align: center;"><a rel="nofollow" href="http://4.bp.blogspot.com/-LnMVDmm1_wg/TdatindCiUI/AAAAAAAAACA/7OIqz0hRFvk/s1600/pic2.png" style="margin-left: 1em; margin-right: 1em;" ><img src="http://4.bp.blogspot.com/-LnMVDmm1_wg/TdatindCiUI/AAAAAAAAACA/7OIqz0hRFvk/s400/pic2.png" alt="" width="363" height="400" border="0" /></a></div>
<p>The impact on the tiered storage architectural model of Flash storage has been, in effect, to add a tier-0 level of storage where application data is placed that requires extremely fast random I/O performance. Typical examples of such data are database index tables or key lookup tables, etc. Placing this kind of data, which may only be part of an application’s data, on Flash storage can often have a dramatically positive effect on the performance of an application.  However, due to the cost of Flash storage the question is often raised, how can data centers ensure that only data that requires this level of performance resides on SSD or Flash storage so that they can continue to contain costs? Furthermore, is there a way to put only the “hot” parts of the data in the very expensive tier-0 capacity, and leave less hot, and cold data in slower, less expensive capacity? Block based automated storage tiering is the answer to these questions.</p>
<p>Different storage array vendors have approached this problem in different ways. However, in all cases, the object is to place data at a block level, on tier-0 or Flash storage only while that data is actually being accessed, and then to store the rest of the data on lower tiered storage while the data is at rest. Note that this movement must be done at the block level in order to avoid performance issues, and to truly minimize the capacity of the tier-0 storage.</p>
<p>One approach used by several storage vendors is to move blocks of data between multiple tiers of storage via a policy. For example, the policy might dictate that writes always occur to tier-0, and then if that data is not read immediately it is moved to tier-1. Then if the data isn’t read for 3 months that data is then moved to tier-2. The policy might also dictate that if the data is then read from the tier-2 disk then it is placed back on tier-0 in case additional reads are required and the entire process starts all over again. Logically this mechanism provides what enterprises are looking for, minimizing tier-0 storage and placing blocks of data on the lowest-cost storage possible. The challenge with this approach is that the I/O profile of the application needs to be well understood when the policies are developed in order to avoid accessing data from tier-2 storage too frequently and generally moving data up and down the stack too often since this movement is not “free” from a performance perspective. Additionally, EVT has found that for most customers, data rarely needs to spend time in tier-1 (FC or SAS) storage, that most of the data ends up spending most of it’s live on the SATA storage.</p>
<p>Therefore as the cost of Flash storage continues to come down, the need for the SAS or Fiber Channel storage will continue to decline, and eventually disappear leaving just Flash and SATA storage in most arrays.</p>
<p>Another approach that at least one storage vendor is using is to avoid all the policy based movement and to treat the Flash storage as a large read cache. This places the blocks that are most used on tier-0, and leaves the rest on spinning disk. When the fact that the sequential write performance of Flash, SAS/FC, and SATA is similar is taken into consideration along with a controller that orders its random writes, this approach can provide a much more robust way to implement Flash storage.  In some cases, it allows an application that would not normally be considered a good candidate for SAS or Fiber Channel storage to be able to utilize SATA disks instead. In general, this technique de-couples spindle count from performance thus providing more subtle advantages as well.  For example, applications which has traditionally required very small disk drives so that the spindle could would be might (many, many 146GB FC drives, for example) can now be run on much higher capacity 600GB SAS drives and still provide the same, or better performance.</p>
<p>Overall, automated storage tiering is becoming a de-facto standard in the storage industry. However different storage array vendors have taken very different approaches to the implementation of automated tiering, but in the end the result is uniformly the same. The ability of the enterprise to purchase Flash storage to help improve the performance of their applications while at the same time continuing to bend the cost curve of storage downward.</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/featured/top/joerg/emc-fast-netapp-flashcache-comparison/"  rel="bookmark" class="crp_title">EMC FAST and NetApp FlashCache a Comparison</a></li><li><a href="http://gestaltit.com/all/tech/storage/bas/storage-tiering-dying/"  rel="bookmark" class="crp_title">“Storage tiering is dying.” But purple unicorns exist.</a></li><li><a href="http://gestaltit.com/all/tech/storage/devang/enterprise-flash-drives-efd-emc-symmetrix-vmax-systems/"  rel="bookmark" class="crp_title">Enterprise Flash Drives (EFD) on EMC Symmetrix V-Max Systems</a></li><li><a href="http://gestaltit.com/all/tech/storage/devang/after-all-fast-makes-a-debut/"  rel="bookmark" class="crp_title">After all, FAST makes a debut</a></li><li><a href="http://gestaltit.com/all/tech/storage/martin/fast-furious/"  rel="bookmark" class="crp_title">FAST and Furious</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/joerg/flash-storage-automated-storage-tiering/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Joerg Hallbauer for <a href="http://gestaltit.com">Gestalt IT</a>, 2011. |
<a href="http://gestaltit.com/all/tech/storage/joerg/flash-storage-automated-storage-tiering/">Flash Storage and Automated Storage Tiering</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/" title="View all posts in All" rel="category tag">All</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/joerg/flash-storage-automated-storage-tiering/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is VAAI, and how does it add spice to my life as a VMware admin?</title>
		<link>http://gestaltit.com/all/tech/storage/bas/vaai-vmware-admin/</link>
		<comments>http://gestaltit.com/all/tech/storage/bas/vaai-vmware-admin/#comments</comments>
		<pubDate>Mon, 06 Dec 2010 18:30:34 +0000</pubDate>
		<dc:creator>Bas</dc:creator>
				<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[gestaltit]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[VAAI]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[vSphere]]></category>

		<guid isPermaLink="false">http://basraayman.com/?p=462</guid>
		<description><![CDATA[I spent some days in Cork, Ireland this week presenting to a customer. Besides the fact that I’m now almost two months in to my new job, and I’m loving every part of it, there is one part that is extremely cool about my job.]]></description>
			<content:encoded><![CDATA[<div id="attachment_464" class="wp-caption alignright" style="width: 160px;"><a rel="nofollow" href="http://basraayman.files.wordpress.com/2010/11/photo.jpg" ><img class="size-thumbnail wp-image-464" title="EBC Cork" src="http://basraayman.files.wordpress.com/2010/11/photo.jpg?w=150&amp;h=112" alt="" width="150" height="112" /></a></p>
<p class="wp-caption-text">EMC EBC Cork</p>
</div>
<p>I spent some days in Cork, Ireland this week presenting to a customer. Besides the fact that I’m now almost two months in to my new job, and I’m loving every part of it, there is one part that is extremely cool about my job.</p>
<p>I get to talk to customers about very cool and new technology that can help them get their job done! And while it’s in the heart of every techno loving geek to get caught up in bits and bytes, I’ve noticed one thing very quickly. The technology is usually not the part that is limiting the customer from doing new things.</p>
<p>Everybody know about that last part. Sometimes you will actually run in to a problem, where some new piece of kit is wreaking havoc and we can’t seem to put our finger on what the problem is. But most of the time, we get caught up in entirely different problems altogether. Things like processes, certifications (think of ISO, SOX, ITIL), compliance, security or just something “simple” as people who don’t want to learn something new or feel threatened about their role that might be changing.</p>
<p>And this is where technology comes in again. I had the ability to talk about several things to this customer, but one of the key points was that technology should help make my life easier. One of the cool new things that will actually help me in that area was a topic that was part of my presentation.</p>
<p>Some of the VMware admins already know about this technology, and I would say that most of the folks that read blogs have already heard about it in some form. But when talking to people at conventions or in customer briefings, I get to introduce folks over and over to a new technology called VAAI (vStorage <abbr title="Application Programming Interface">API</abbr> for Array Integration), and I want to explain again in this blog post what it is, and how it might be able to help you.</p>
<h3><strong>So where does VAAI come from?</strong></h3>
<p>Well, you might think that it is something new. And you would be wrong. VAAI was <a href="http://www.vmware.com/company/news/releases/vmware_ready_storage.html" >introduced as a part of the vStorage API</a> during VMworld 2008, even though the release of the VAAI functionality to the customers was part of the vSphere 4.1 update (4.1 Enterprise and Enterprise Plus). But VAAI isn’t the entire vStorage API, since that consists of a family of APIs:</p>
<ul>
<li>vStorage API for Site Recovery Manager</li>
<li>vStorage API for Data Protection</li>
<li>vStorage API for Multipathing</li>
<li>vStorage API for Array Integration</li>
</ul>
<p>Now, the “only API” that was added with the update from vSphere 4.0 to vSphere 4.1 was the last API, called VAAI. I haven’t seen any of the roadmaps yet that contain more info about future vStorage APIs, but personally I would expect to see even more functionality coming in the future.</p>
<h3><strong>And how does VAAI make my life easier?</strong></h3>
<p>If you read back a couple of lines, you will notice that I said that technology should make my life easier. Well, with VAAI this is actually the case. Basically what VAAI allows you to do is offload operations on data to something that was made to do just that: the array. And it does that at the ESX storage stack.</p>
<p>As an admin, you don’t want your ESX(i) machines to be busy copying blocks or creating clones. You don’t want your network being clogged up with storage vMotion traffic. You want your host to be busy with compute operations and with the management of your memory, and that’s about it. You want as much reserve as you can on your machine, because that allows you to leverage virtualization more effectively!</p>
<p>So, this is where VAAI comes in. Using the API that was created by VMware, you can now use a set of SCSI commands:</p>
<ul>
<li><abbr title="Atomic Test and Set">ATS</abbr>: This command helps you out with hardware assisted locking, meaning that you don’t have to lock an entire LUN anymore but can now just lock the blocks that are allocated to the VMDK. This can be of benefit, for example when you have multiple machines on the same datastore and would like to create a clone.</li>
<li>XSET: This one is also called “full copy” and is used to copy data and/or create clones, avoiding that all data is sent back and forth to your host. After all, why would your host need the data if everything is stored on the array already?</li>
<li>WRITE-SAME: This is one that is also know as “bulk zero” and will come in handy when you create the VM. The array takes care of writing zeroes on your thin and thick VMDKs, and helps out at creation time for eager zeroed thick (EZT) guests.</li>
</ul>
<h3><strong>Sounds great, but how do I notice this in reality?</strong></h3>
<p>Well, I’ve seen several scenarios where for example during a storage vMotion, you would see a reduction in CPU utilization of 20% or even more. In the other scenarios, you normally should also see a reduction in the time it takes to complete an operation, and the resources that are allocated to perform such an operation (usually CPU).</p>
<p>Does that mean that VAAI always reduces my CPU usage? Well, in a sense: yes. You won’t always notice a CPU reduction, but one of the key criteria is that with VAAI enabled, all of the SCSI operations mentioned above should always perform faster then without VAAI enabled. That means that even when you don’t see a reduction in CPU usage (which is normally the case), you will see that since the operations are faster, you get your CPU power back more quickly.</p>
<h3><strong>Ok, so what do I need, how do I enable it, and what are the caveats?</strong></h3>
<p>Let’s start off with the caveats, because some of these are easy to overlook:</p>
<ul>
<li>The source and destination VMFS volumes have different block sizes</li>
<li>The source file type is RDM and the destination file type is non-RDM (regular file)</li>
<li>The source VMDK type is eagerzeroedthick and the destination VMDK type is thin</li>
<li>The source or destination VMDK is any sort of sparse or hosted format</li>
<li>The logical address and/or transfer length in the requested operation are not aligned to the minimum alignment required by the storage device (all datastores created with the vSphere Client are aligned automatically)</li>
<li>The VMFS has multiple LUNs/extents and they are all on different arrays</li>
</ul>
<p>Or short and simple: “Make sure your source and target are the same”.</p>
<p>Key criteria to use VAAI are the use of vSphere 4.1 and an array that supports VAAI. If you have those two prerequisites set up you should be set to go. And if you want to be certain you are leveraging VAAI, check these things:</p>
<ul>
<li>In the vSphere Client inventory panel, select the host</li>
<li>Click the Configuration tab, and click Advanced Settings under Software</li>
<li>Check that these options are set to 1 (enabled):
<ul>
<li>DataMover/HardwareAcceleratedMove</li>
<li>DataMover/HardwareAcceleratedInit</li>
<li>VMFS3/HardwareAcceleratedLocking</li>
</ul>
</li>
</ul>
<p>Note that these are enabled by default. And if you need more info, please make sure that you check out the following VMware knowledge base article: <a href="http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&amp;cmd=displayKC&amp;externalId=1021976" >&gt;1021976</a>.</p>
<p>Also, one last word on this. I really feel that this is a technology that will make your life as a VMware admin easier, so talk to your storage admins (if that person isn’t you in the first case) or your storage vendor and ask if their arrays support VAAI. If not, ask them when they will support it. Not because it’s cool technology, but because it’s cool technology that makes your job easier.</p>
<p>And, if you have any questions or comments, please hit me up in the remarks. I would love to see your opinions on this.</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/rich/vaai-virtualization/"  rel="bookmark" class="crp_title">What Is VAAI And What Does It Mean For Virtualization?</a></li><li><a href="http://gestaltit.com/all/tech/storage/craig/vsphere-4-1-storage-sioc-drs-vaai/"  rel="bookmark" class="crp_title">Whats new in vSphere 4.1 Storage: SIOC, VAAI, and DRS</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/hypervisor-hugger-storage-stalwart/"  rel="bookmark" class="crp_title">Are You a Hypervisor Hugger or a Storage Stalwart?</a></li><li><a href="http://gestaltit.com/all/tech/storage/bas/define-high-availability-disaster-recovery/"  rel="bookmark" class="crp_title">How do you define high availability and disaster recovery?</a></li><li><a href="http://gestaltit.com/all/tech/virtualization/stephen/vsphere-4-upgrade-vmfs-update/"  rel="bookmark" class="crp_title">Will the vSphere 4 Upgrade Require Another VMFS Update?</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/bas/vaai-vmware-admin/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Bas for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/bas/vaai-vmware-admin/">What is VAAI, and how does it add spice to my life as a VMware admin?</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/bas/vaai-vmware-admin/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SolarWinds: Live From Tech Field Day San Jose</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/#comments</comments>
		<pubDate>Mon, 15 Nov 2010 18:35:14 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Exclusive]]></category>
		<category><![CDATA[Networking]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[monitoring]]></category>
		<category><![CDATA[network management]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[solarwinds]]></category>
		<category><![CDATA[Tech Field Day]]></category>
		<category><![CDATA[Thwack]]></category>

		<guid isPermaLink="false">http://gestaltit.com/?p=14161</guid>
		<description><![CDATA[After fighting yet more networking gremlins, Tech Field Day resumed at The Hoffman Agency wit a presentation by SolarWinds. We had come to expect quite a lot from this company after their fine presentation at Net Field Day in September, and the delegates were not disappointed. SolarWinds discussed their various management products, as well as their approach to high availability and database design.]]></description>
			<content:encoded><![CDATA[<blockquote><p>The <a href="http://techfieldday.com"  target="_blank">Tech Field Day</a> staff have been working behind the scenes to improve the quality and timeliness of video output from our event. We are pleased to present the first result of this work, in the form of <a href="http://gestaltit.com/field-day/2010-san-jose/tech-field-day-4-live-stream/"  target="_blank">live streaming</a> from <a href="http://gestaltit.com/field-day/2010-san-jose/"  target="_blank">Tech Field Day 4</a> in San Jose this week.</p></blockquote>
<p><iframe frameborder="0" height="225" src="http://player.vimeo.com/video/16783498?title=0&amp;byline=0&amp;portrait=0" width="400"></iframe><br />
After fighting <a href="http://twitter.com/jonisick/status/3203995992588288"  target="_blank">yet more networking gremlins</a>, Tech Field Day resumed at <a href="http://www.hoffman.com/"  target="_blank">The Hoffman Agency</a> wit a presentation by <a href="http://SolarWinds.com"  target="_blank">SolarWinds</a>. We had come to expect quite a lot from this company after their fine presentation at <a href="http://gestaltit.com/field-day/2010-net/"  target="_blank">Net Field Day</a> in September, and the delegates were not disappointed. SolarWinds discussed their various management products, as well as their approach to high availability and database design.<br />
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="360" height="228" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="flashvars" value="autoplay=false" /><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.ustream.tv/flash/viewer.swf?vid=10812092" /><embed type="application/x-shockwave-flash" width="360" height="228" src="http://www.ustream.tv/flash/viewer.swf?vid=10812092" allowscriptaccess="always" allowfullscreen="true" flashvars="autoplay=false"></embed></object><br />
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="360" height="228" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="flashvars" value="autoplay=false" /><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.ustream.tv/flash/viewer.swf?vid=10812434" /><embed type="application/x-shockwave-flash" width="360" height="228" src="http://www.ustream.tv/flash/viewer.swf?vid=10812434" allowscriptaccess="always" allowfullscreen="true" flashvars="autoplay=false"></embed></object><br />
The Delegate roundtable revealed that almost everyone knew of SolarWinds, and two used their products in production! Many positive comments were made about the product as well as their &#8220;Thwack&#8221; online community.<br />
<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="360" height="228" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="flashvars" value="autoplay=false" /><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.ustream.tv/flash/viewer.swf?vid=10813558" /><embed type="application/x-shockwave-flash" width="360" height="228" src="http://www.ustream.tv/flash/viewer.swf?vid=10813558" allowscriptaccess="always" allowfullscreen="true" flashvars="autoplay=false"></embed></object></p>
<p>
<a href='http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/attachment/sony-dsc-25/' title='SONY DSC'><img width="150" height="99" src="http://static.gestaltit.com/wp-content/uploads/2010/11/DSC02421-150x99.jpg" class="attachment-thumbnail" alt="SONY DSC" title="SONY DSC" /></a>
<a href='http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/attachment/sony-dsc-26/' title='SONY DSC'><img width="150" height="99" src="http://static.gestaltit.com/wp-content/uploads/2010/11/DSC02423-150x99.jpg" class="attachment-thumbnail" alt="SONY DSC" title="SONY DSC" /></a>
<a href='http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/attachment/sony-dsc-27/' title='SONY DSC'><img width="150" height="99" src="http://static.gestaltit.com/wp-content/uploads/2010/11/DSC02424-150x99.jpg" class="attachment-thumbnail" alt="SONY DSC" title="SONY DSC" /></a>
<a href='http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/attachment/sony-dsc-28/' title='SONY DSC'><img width="150" height="99" src="http://static.gestaltit.com/wp-content/uploads/2010/11/DSC02425-150x99.jpg" class="attachment-thumbnail" alt="SONY DSC" title="SONY DSC" /></a>
<br />
<div id="tfd-disclaimer">
<em>Note: Tech Field Day is a sponsored event. Although the delegates receive no direct compensation, all event expenses are paid by the sponsors through Gestalt IT Media LLC. No editorial control is exerted over the delegates and they are expected, and explicitly instructed, to maintain the highest standards of integrity and honesty.</em>
</div></p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/networking/stephen/aprius-live-tech-field-day-san-jose/"  rel="bookmark" class="crp_title">Aprius: Live From Tech Field Day San Jose</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/avere-live-tech-field-day-san-jose/"  rel="bookmark" class="crp_title">Avere: Live From Tech Field Day San Jose</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/netapp-live-tech-field-day-san-jose/"  rel="bookmark" class="crp_title">NetApp: Live From Tech Field Day San Jose</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/actifio-live-tech-field-day-san-jose/"  rel="bookmark" class="crp_title">Actifio: Live From Tech Field Day San Jose</a></li><li><a href="http://gestaltit.com/featured/top/stephen/druva-launches-tech-field-day-5/"  rel="bookmark" class="crp_title">Druva Launches at Tech Field Day 5</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/">SolarWinds: Live From Tech Field Day San Jose</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/events/" title="View all posts in Events" rel="category tag">Events</a>, <a href="http://gestaltit.com/category/exclusive/" title="View all posts in Exclusive" rel="category tag">Exclusive</a>, <a href="http://gestaltit.com/category/all/tech/networking/" title="View all posts in Networking" rel="category tag">Networking</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/solarwinds-live-tech-field-day-san-jose/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>STEC Spills the Beans on ZeusRAM SSD</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/stec-zeusram-ssd/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/stec-zeusram-ssd/#comments</comments>
		<pubDate>Wed, 12 May 2010 20:30:47 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Storage]]></category>
		<category><![CDATA[EMC World]]></category>
		<category><![CDATA[flash]]></category>
		<category><![CDATA[IOPS]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[solid state disk]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[STEC]]></category>
		<category><![CDATA[ZeusIOPS]]></category>
		<category><![CDATA[ZeusRAM]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=3092</guid>
		<description><![CDATA[STEC may not have been quite ready to reveal their next-generation ZeusRAM solid-state disk (SSD), but they are demonstrating it anyway at EMC World in Boston this week. The ZeusRAM is a fundamentally different animal from the existing ZeusIOPS drive in one critical way: Rather than using flash memory for primary data storage, the ZeusRAM uses DRAM. This improves reliability and longevity and ought to raise the bar on performance as well.]]></description>
			<content:encoded><![CDATA[<p><!-- google_ad_section_start --></p>
<p><a href="http://www.stec-inc.com/" >STEC</a> may not have been quite ready to reveal their next-generation ZeusRAM solid-state disk (SSD), but they are demonstrating it anyway at EMC World in Boston this week. The ZeusRAM is a fundamentally different animal from the existing ZeusIOPS drive in one critical way: <strong>Rather than using flash memory for primary data storage, the ZeusRAM uses DRAM</strong>. This improves reliability and longevity and ought to raise the bar on performance as well.</p>
<div id="attachment_3094" class="wp-caption aligncenter" style="width: 235px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/05/STEC-ZeusRAM-Banner.jpg" ><img class="size-medium wp-image-3094" title="STEC ZeusRAM Banner" src="http://blog.fosketts.net/wp-content/uploads/2010/05/STEC-ZeusRAM-Banner-225x300.jpg" alt="" width="225" height="300" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">STEC has introduced ZeusRAM, a SDRAM-based drive</p>
</div>
<p>The ZeusRAM is small (just 8 GB) but STEC claims <strong>massive performance</strong>: 80,000 IOPS for both read and write (versus 80,000 read and 45,000 write for the 6 Gb SAS ZeusIOPS) and 500 MB/s sustained read and write (versus 350 and 300 for the ZeusIOPS). These numbers are well beyond the capabilities of whole shelves of traditional hard disk drives.</p>
<p>Should it lose power, the ZeusRAM will de-stage its content to flash RAM using power stored on internal super-capacitors. This is a major differentiator from in-system RAM disks which require whole-system battery backup and software to de-stage.</p>
<p>STEC has not announced pricing or availability information for the ZeusRAM drive at this point. We noted much interest in the drive from EMC staff at the EMC World show, and would not be surprised to see the ZeusRAM show up in a future Symmetrix or CLARiiON.</p>
<p><small> </small></p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/stephen/emc-avamar-iomega-ix12-300r/"  rel="bookmark" class="crp_title">EMC Mixes Avamar Into Iomega ix12-300r</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/enterprise-computing-is-the-solid-state-drive-hype-over/"  rel="bookmark" class="crp_title">Enterprise Computing: Is the Solid State Drive Hype Over?</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/fcoe-symbolism-7/"  rel="bookmark" class="crp_title">FCoE Symbolism</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/multipath-activepassive-dual-active-activeactive/"  rel="bookmark" class="crp_title">Multipath: Active/Passive, Dual Active, and Active/Active</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/micron-bursts-pcie-ssd-market/"  rel="bookmark" class="crp_title">Micron Bursts Into the PCIe SSD Market</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/stec-zeusram-ssd/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/stec-zeusram-ssd/">STEC Spills the Beans on ZeusRAM SSD</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/stec-zeusram-ssd/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Does Cloning A VM From Template Take A Long Time?</title>
		<link>http://gestaltit.com/all/tech/storage/rich/cloning-vm-template-speed/</link>
		<comments>http://gestaltit.com/all/tech/storage/rich/cloning-vm-template-speed/#comments</comments>
		<pubDate>Thu, 15 Apr 2010 16:00:00 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[cloning]]></category>
		<category><![CDATA[ESX]]></category>
		<category><![CDATA[IOPS]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[sqlpass]]></category>

		<guid isPermaLink="false">http://vmetc.com/?p=5708</guid>
		<description><![CDATA[Over the past few years I’ve been asked to troubleshoot and explain why cloning a virtual machine (VM) from a master template would take a longer time than expected more than once.]]></description>
			<content:encoded><![CDATA[<p>Over the past few years I’ve been asked to troubleshoot and explain why cloning a virtual machine (VM) from a master template would take a longer time than expected more than once. Usually when I’m asked the virtualization admin is frustrated at the hypervisor. “This shouldn’t take this long. It needs to be fixed!” they say. “I definitely agree,” I say, “but let’s take a deeper look at what is happening here first before we flame the vendor’s help desk technician on the phone.”</p>
<p>So, this post is about taking a deeper look at where the master template VM resides versus where the cloned template is destined. My math my be a little off or may not account for every factor involved, but my point is to be close enough to demonstrate that the disk/array/LUN design <span style="text-decoration: line-through;">is</span> can be the culprit more times than not.</p>
<p>When I started this post I emailed for some help. I asked for a sanity check from some storage experts. I’ve been reasonably happy with my own answer until now, but I figured I do some research before adding the content to VM /ETC. I got back a single reply that I am paraphrasing: “Sounds about right. Let me think about it some more and if I can stump you with anything else I’ll let you know.” He never did so I’ll take that as a positive confirmation meaning “yes VM moron, it is that simple.” Good enough for me! If anyone can point out any other factors I am not properly accounting for please leave a comment.</p>
<p>The following is part of my email for help. It not only explains my test scenario but it illustrates the problem and resolution as well. At the end of this post I make some suggestions for bettering the time it takes to clone a VM.</p>
<h3>The email for help</h3>
<p>Oh wise and all powerful masters of the disk,</p>
<p>I humbly submit the following concept for your review. Guide me to a greater disk performance understanding when cloning a VM in VMware ESX environment.</p>
<p>Here’s the scenario:</p>
<ul>
<li>Cloning a VM takes a long time – 10 GB VM using only 3.5 GB of space takes roughly 45 min to an hour to clone.</li>
<li>The master template and the clone reside on the same disk and NFS mount.</li>
<li>Yeah, it’s a single SATA disk in a lab. I know, it should suck.</li>
</ul>
<p>I’m trying to explain the expected speed of read and writes using the IOPs calculator here: <a href="http://www.wmarow.com/strcalc/" >http://www.wmarow.com/strcalc/</a></p>
<p>See the attached screen shot for the values I put in the calculator, but the results I’m interested in are:</p>
<ul>
<li>with 50% reads and 50% writes (master and clone on same disk) average throughput (MB/s) is 1.2</li>
<li>I used 50% reads and 50% writes for the cache.</li>
</ul>
<p>which means to me that</p>
<ul>
<li>3548 MB / 1.2 (MB/s) = 2957 secs or 0.82 hrs.</li>
</ul>
<p>Here’s the screen shot of the IOPS Calculator I linked in the email for help:</p>
<p><img style="margin: 5px; display: inline; border: 0px;" title="disk array calculator Capture" src="http://vmetc.com/wp-content/uploads/2010/04/diskarraycalculatorCapture1.png" border="0" alt="disk array calculator Capture" width="518" height="575" /></p>
<h3>Suggestions for improvement</h3>
<p>Obviously, the type/performance of the disks, the number of disks, and the type of array makes a huge difference. I should also point out that I am using 8 ms as the value for the seek latency. I’m not as focused on technical accuracy because my point is served without it, but changing this value makes a significant difference as well. If you want technical accuracy and more explanation about some of the numbers to use in the calculator check out these posts on the topic of IOPS and the impact on a virtual environment:</p>
<ul>
<li><a href="http://www.yellow-bricks.com/2009/12/23/iops/" >http://www.yellow-bricks.com/2009/12/23/iops/</a></li>
<li><a href="http://vmtoday.com/2009/12/storage-basics-part-ii-iops/" >http://vmtoday.com/2009/12/storage-basics-part-ii-iops/</a></li>
<li><a rel="nofollow" href="http://virtualgeek.typepad.com/virtual_geek/2010/02/solving-a-weird-slow-performance-cloning-issue.html" >http://virtualgeek.typepad.com/virtual_geek/2010/02/solving-a-weird-slow-performance-cloning-issue.html</a></li>
<li><a href="http://vpivot.com/2009/09/18/storage-is-the-problem/" >http://vpivot.com/2009/09/18/storage-is-the-problem/</a></li>
</ul>
<p>In my case, moving the VM template to another disk/array or increasing the number of disks used on my NFS server would help because the reads and writes would be separated when the cloned VM resides on a different disk/array and the number of IOPs possible would be increased with more disks. Yes, this post uses a single SATA disk as a simple example, but the point is hopefully clear. Use the same logic and math for shared storage scenarios, all storage protocols, any vendor’s storage device, and all RAID types. Plug those values in the IOPs calculator to calculate your own results.</p>
<p>My ultimate point is to make everyone think about how the disk/array/LUN design decisions impact the behaviors of the virtual infrastructure.</p>
<p>As an example, if my lab NFS server was using 6 SATA disks configured as a RAID 5 array the calculation for expected time to clone changes as follows:</p>
<ul>
<li>3548 MB / 2.99 (MB/s) = 1187 secs or 0.33 hrs.</li>
</ul>
<p>Better, right? Hey, it’s a basement lab. It’s supposed to suck!</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/joerg/wide-striping-is-a-two-edged-sword/"  rel="bookmark" class="crp_title">Wide striping is a two edged sword</a></li><li><a href="http://gestaltit.com/all/tech/desktop/rich/vmware-view-30-perspective-from-the-implementation-angle/"  rel="bookmark" class="crp_title">VMware View 3.0 perspective from the implementation angle</a></li><li><a href="http://gestaltit.com/all/tech/storage/edsai/storage-layout-%e2%80%93-why-care/"  rel="bookmark" class="crp_title">Storage Layout – Why care?</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/review-sun-storage-7000-unified-storage-system-part-ii/"  rel="bookmark" class="crp_title">Review: Sun Storage 7000 Unified Storage System – Part II</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/wide-striping-benefits/"  rel="bookmark" class="crp_title">The Benefits of Wide Striping – Avoiding A Long Tail</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/rich/cloning-vm-template-speed/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Rich for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/rich/cloning-vm-template-speed/">Why Does Cloning A VM From Template Take A Long Time?</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/featured/" title="View all posts in Featured" rel="category tag">Featured</a>, <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/rich/cloning-vm-template-speed/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Multipath: Active/Passive, Dual Active, and Active/Active</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/multipath-activepassive-dual-active-activeactive/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/multipath-activepassive-dual-active-activeactive/#comments</comments>
		<pubDate>Wed, 31 Mar 2010 18:30:14 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Storage]]></category>
		<category><![CDATA[clustering]]></category>
		<category><![CDATA[DMP]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[multipathing]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[PVlinks]]></category>
		<category><![CDATA[SCSI]]></category>
		<category><![CDATA[Symmetrix]]></category>
		<category><![CDATA[Veritas]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=2737</guid>
		<description><![CDATA[Although it's rare in the PC world, multipath I/O is not new in enterprise IT. I've been juggling paths to storage and networks as long as I've been a systems administrator, and that's a bit longer than I care to admit. But the proliferation of technologies has made it difficult to understand path management. What's the difference between "dual active" and "active/active"? Is "active/passive" really that bad?]]></description>
			<content:encoded><![CDATA[<p><!-- google_ad_section_start --></p>
<p>Although it’s rare in the PC world, multipath I/O is not new in enterprise IT. I’ve been juggling paths to storage and networks as long as I’ve been a systems administrator, and that’s a bit longer than I care to admit. But the proliferation of technologies has made it difficult to understand path management. What’s the difference between “dual active” and “active/active”? Is “active/passive” really that bad?</p>
<h3>What is Multipath? And Why?</h3>
<div id="attachment_2844" class="wp-caption aligncenter" style="width: 410px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/03/Single-path.jpg" ><img class="size-full wp-image-2844" title="Single path" src="http://blog.fosketts.net/wp-content/uploads/2010/03/Single-path.jpg" alt="" width="400" height="205" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">The good old days: One device, one path</p>
</div>
<p>In the beginning, computers connected to peripherals and other computers through a single bus or channel and life was easy. Although one might mistake the names of the dominant printer connection (parallel) for some kind of multipath system when compared to the modem connection (serial), this was not the case. Only the bits traveled in parallel – the logical connection was a simple single path.</p>
<div id="attachment_2843" class="wp-caption aligncenter" style="width: 250px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/03/Daisy-chain-1.jpg" ><img class="size-full wp-image-2843 " title="Daisy-chain 1" src="http://blog.fosketts.net/wp-content/uploads/2010/03/Daisy-chain-1.jpg" alt="" width="240" height="175" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">Early-90&#8242;s servers might confuse admins with two SCSI connections to a single device</p>
</div>
<p>Then things got complicated. The SCSI protocol allowed for multiple devices in a chain, and even for two different “initiators” (computers or controllers) to interact with these “targets”. Some folks even dual-attached devices to a single computer with multiple controllers.</p>
<p>Why would one device and one computer need more than one connection? It boils down to two factors:</p>
<ol>
<li><strong>Performance</strong> – I/O channels have typically been slower than the computer could handle, so multiple channels might be used to increase the amount of data that can flow in and out.</li>
<li><strong>Reliability</strong> – If one connection failed, the other might still be usable, reducing the risk of an outage.</li>
</ol>
<div id="attachment_2842" class="wp-caption aligncenter" style="width: 290px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/03/Multiple-paths.jpg" ><img class="size-full wp-image-2842 " title="Multiple paths" src="http://blog.fosketts.net/wp-content/uploads/2010/03/Multiple-paths.jpg" alt="" width="280" height="175" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">Late-90&#8242;s enterprise systems might have four or more paths to a single storage array</p>
</div>
<p>Pretty soon, enterprise computer architecture had gotten incredibly complex. I remember connecting a massive HP V-class server to an EMC Symmetrix with eight separate Fibre Channel cables. Each disk “LUN” showed up twice, and we had hundreds of them. We managed all of these virtual storage paths using HP’s PVLinks dynamic multipathing software. We used Veritas DMP and EMC PowerPath to do pretty much the same thing on Solaris and other UNIX systems.</p>
<h3>Active/Passive to Active/Active</h3>
<p>The earliest path management software provided two incredibly important functions: It figured out which of the SCSI targets it saw were actually different names for the same one, and it allowed the operating system to choose one and fail over to the other in case of an interruption. These were <strong>Active/Passive</strong> links – no matter how many paths were presented (and Fibre Channel switches sometimes presented eight or more), only one was active at any one time.</p>
<div id="attachment_2841" class="wp-caption aligncenter" style="width: 410px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/03/Switched-Fabric.jpg" ><img class="size-full wp-image-2841" title="Switched Fabric" src="http://blog.fosketts.net/wp-content/uploads/2010/03/Switched-Fabric.jpg" alt="" width="400" height="282" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">Modern systems have abstracted and virtual I/O channels, making path management much more important</p>
</div>
<p>But the EMC Symmetrix and similar high-end storage systems changed all this. Symmetrix storage was fully virtualized – the presentation of LUNs to servers was entirely disconnected from the actual disks and RAID sets in the array. This meant the Symmetrix could handle I/O requests across different paths and controllers for the same LUN. EMC and the rest responded with <strong>Active/Active</strong> path management software, allowing I/O to travel in parallel for the first time.</p>
<h3>How is Dual Active Different?</h3>
<p>Not everything called Active/Active is created equal. In fact, many supposed Active/Active setups really shouldn’t be called that since they don’t use both paths for all data. Instead, I like to call these <strong>Dual Active</strong> – both paths are active but with different data.</p>
<p>Consider the differences between the following two solutions:</p>
<div id="attachment_2845" class="wp-caption aligncenter" style="width: 410px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/03/Switched-Fabric-Active-Active.jpeg" ><img class="size-full wp-image-2845" title="Switched Fabric Active Active" src="http://blog.fosketts.net/wp-content/uploads/2010/03/Switched-Fabric-Active-Active.jpeg" alt="" width="400" height="282" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">A true active/active setup uses all paths for all data all the time</p>
</div>
<div id="attachment_2846" class="wp-caption aligncenter" style="width: 410px; border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align: center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/03/Switched-Fabric-Dual-Active.jpeg" ><img class="size-full wp-image-2846" title="Switched Fabric Dual Active" src="http://blog.fosketts.net/wp-content/uploads/2010/03/Switched-Fabric-Dual-Active.jpeg" alt="" width="400" height="282" /></a></p>
<p class="wp-caption-text" style="padding: 0 4px 5px; margin: 0;">A &#8220;dual active&#8221; setup uses both paths, but each target is directed to one or the other</p>
</div>
<p>See the difference? Although the paths are active in both cases, they are not the same. Both approaches have merit, and neither is inherently superior, but they should have different names applied. Even active/passive has its place, since simplicity is often a virtue.</p>
<h3>Dual Active Outside Storage</h3>
<p>These same concepts apply outside the field of storage and I/O. Many server clustering systems use the same terminology, right down to the misapplication of “active/active” when “dual active” is more appropriate. It’s easy to miss the significance of this difference, but it can make more of an impact in clustering since CPU workloads are harder to balance.</p>
<p>Let me know what you think. If there is interest, I might dive into path management strategies like round robin!</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/stephen/fcoe-symbolism-7/"  rel="bookmark" class="crp_title">FCoE Symbolism</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/stec-zeusram-ssd/"  rel="bookmark" class="crp_title">STEC Spills the Beans on ZeusRAM SSD</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-push-one-million-iscsi-iops/"  rel="bookmark" class="crp_title">Microsoft and Intel Push One Million iSCSI IOPS</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/flexible-path-services-future/"  rel="bookmark" class="crp_title">Flexible IT and the Path to the Services Future</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/curtis-prestons-backup-central-live/"  rel="bookmark" class="crp_title">See W. Curtis Preston’s Backup Central Live!</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/multipath-activepassive-dual-active-activeactive/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/multipath-activepassive-dual-active-activeactive/">Multipath: Active/Passive, Dual Active, and Active/Active</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/multipath-activepassive-dual-active-activeactive/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>“Storage tiering is dying.” But purple unicorns exist.</title>
		<link>http://gestaltit.com/all/tech/storage/bas/storage-tiering-dying/</link>
		<comments>http://gestaltit.com/all/tech/storage/bas/storage-tiering-dying/#comments</comments>
		<pubDate>Sun, 28 Mar 2010 18:30:04 +0000</pubDate>
		<dc:creator>Bas</dc:creator>
				<category><![CDATA[Storage]]></category>
		<category><![CDATA[Chris Mellor]]></category>
		<category><![CDATA[gestaltit]]></category>
		<category><![CDATA[NetApp]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[tiered storage]]></category>
		<category><![CDATA[tiering]]></category>
		<category><![CDATA[Tom Georgens]]></category>

		<guid isPermaLink="false">http://basraayman.com/?p=198</guid>
		<description><![CDATA[NetApp CEO Tom Georgens is quoted as dismissive of multi-level tiering, saying: &#8220;The simple fact of the matter is, tiering is a way to manage migration of data between Fibre Channel-based systems and serial ATA based systems.&#8221;]]></description>
			<content:encoded><![CDATA[<p>Chris Mellor over at the Register put <a href="http://www.theregister.co.uk/2010/02/19/netapp_tiering_dying/" >an interview online with NetApp CEO Tom Georgens</a>.</p>
<p>To quote from the Register piece:</p>
<blockquote><p>He is dismissive of multi-level tiering, saying: “The simple fact of the matter is, tiering is a way to manage migration of data between Fibre Channel-based systems and serial ATA based systems.”</p>
<p>He goes further: “Frankly I think the entire concept of tiering is dying.”</p></blockquote>
<p>Now, for those who are not familiar with the concept of tiering, it’s basically moving data between faster and slower media in the background. Clasically tiering is something that every organization is already doing. You consider the value of the information, and based on that you decide if this data should be accessible instantly from your more expensive hardware, and even at home you will see that as the value decreases you will archive that data to a media that has a different type of performance like your USB archiving disk or for example by burning it to a DVD.</p>
<p>For companies the more interesting part in tiering comes with automation. To put it simply, you want your data to be available on a fast drive when you need it, and it can remain on slower drives if you don’t require it at that moment. Several vendors each have their own specific implementation of how they tier their storage, but you find this kind of technology coming from almost any vendor.</p>
<p>Aparrantly, NetApp has a different definition of tiering, since according to their CTO tiering is limited to the “migration of data between Fibre Channel-based systems and serial ATA based systems”. And this is where I heartily disagree with him. I purposely picked the example of home users who are also using different tiers, and it’s no different for all storage vendors.</p>
<p>The major difference? They remove the layer of fibre channel drives in between of the flash and SATA drives. They still tier their data to the medium that is most fitting. They will try to do that automatically (and hopefully succeed in doing so), but just don’t call it tiering anymore.</p>
<p>As with all vendors, NetApp is also trying to remove the fibre channel drive layer, and I am convinced that this will be possible as soon as the prices of flash drives can be compared to those of regular fibre channel drives, and the automated tiering is automated to a point that any actions performed are transparent to the connected system.</p>
<p>But, if NetApp doesn’t want to call it tiering, that’s fine by me but I hope they don’t honestly expect customers to fall for it. The rest of the world will continue to call it tiering, and they will try to sell you a purple unicorn that moves data around disk types as if by magic.</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/joerg/flash-storage-automated-storage-tiering/"  rel="bookmark" class="crp_title">Flash Storage and Automated Storage Tiering</a></li><li><a href="http://gestaltit.com/all/tech/storage/martin/manage-data-tiered-storage/"  rel="bookmark" class="crp_title">Manage Data Not Storage</a></li><li><a href="http://gestaltit.com/all/tech/storage/devang/after-all-fast-makes-a-debut/"  rel="bookmark" class="crp_title">After all, FAST makes a debut</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/fast-v1-emc/"  rel="bookmark" class="crp_title">Do We Need FAST v1, EMC?</a></li><li><a href="http://gestaltit.com/all/tech/storage/devang/enterprise-flash-drives-efd-emc-symmetrix-vmax-systems/"  rel="bookmark" class="crp_title">Enterprise Flash Drives (EFD) on EMC Symmetrix V-Max Systems</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/bas/storage-tiering-dying/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Bas for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/bas/storage-tiering-dying/">“Storage tiering is dying.” But purple unicorns exist.</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/bas/storage-tiering-dying/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Benefits of Wide Striping – Avoiding A Long Tail</title>
		<link>http://gestaltit.com/all/tech/storage/chris/wide-striping-benefits/</link>
		<comments>http://gestaltit.com/all/tech/storage/chris/wide-striping-benefits/#comments</comments>
		<pubDate>Thu, 25 Mar 2010 20:00:15 +0000</pubDate>
		<dc:creator>Chris Evans</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[gestaltit]]></category>
		<category><![CDATA[IOPS]]></category>
		<category><![CDATA[Long Tail]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[wide striping]]></category>
		<category><![CDATA[XIV]]></category>

		<guid isPermaLink="false">http://www.thestoragearchitect.com/?p=1268</guid>
		<description><![CDATA[One of the “key features” of XIV is the wide striping of data across all spindles, a concept we’re seeing more and more.  Have you ever wondered what the point is?]]></description>
			<content:encoded><![CDATA[<div id="attachment_1270" class="wp-caption alignleft" style="width: 310px;">
<p><a href="http://www.thestoragearchitect.com/wp-content/uploads/2010/03/StackedIOPS.jpg" onclick="pageTracker._trackPageview('/outgoing/www.thestoragearchitect.com/wp-content/uploads/2010/03/StackedIOPS.jpg?referer=');" ><img class="size-medium wp-image-1270" title="Stacked IOPS" src="http://www.thestoragearchitect.com/wp-content/uploads/2010/03/StackedIOPS-300x196.jpg" alt="IOPS Per RAID Group, ordered by most to least" width="300" height="196" /></a></p>
<p class="wp-caption-text">IOPS Per RAID Group, ordered by most to least</p>
</div>
<p>I took part in a podcast last night that discussed the XIV platform.  One of the “key features” of XIV is the wide striping of data across all spindles.  It’s a concept we’re seeing more and more in contemporary storage hardware architectures and one that’s being shoe-horned into older storage arrays too.  Have you ever wondered what the point is?  Take a look at the following graphic.  It shows the number of write operations per RAID group, ordered by the busiest RAID group to the least active.  It’s real data from a real system.  What you see is the <a rel="nofollow" href="http://en.wikipedia.org/wiki/Long_Tail" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Long_Tail?referer=');" >Long Tail</a> effect, where a small number of RAID groups are doing most of the I/O.  In this example, 80% of the workload is performed by 50% of the RAID groups; only 3 RAID groups account for 20% of the workload.</p>
<p>The chart shows that in some array designs (typically the older Enterprise arrays), I/O distribution was not evenly balanced and so not all drives were being used to their full capacity.  This was mitigated by using tools to move LUNs or sub-LUNs around; alternatively concatenated devices like metas and LUSEs were employed to spread the load.</p>
<p>The only real solution to the I/O balancing problem is genuine wide striping.  Manual or even automated rebalancing, or the use of metas are just workarounds.  Once wide striping is in place, either more work can be performed or the number of spindles or their “quality” can be reduced, i.e. you can build a complete SATA array like XIV.</p>
<p>There are of course disadvantages to having your data more widely spread.  The most obvious is the increased risk of data loss when the RAID system fails – i.e. a double disk failure.  The wider the striping, the wider the impact.  The tradeoff is the benefit of increased performance.  You have to choose what level of risk/impact you consider acceptable versus the potential gains.</p>
<p>If you’re not doing wide striping today then you should seriously be considering it.  After all, you’re only harnessing performance capacity within the array that you’ve already paid for.</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/chris/enterprise-computing-the-wide-striping-debate/"  rel="bookmark" class="crp_title">The Wide Striping Debate</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/hp-p2000-p4000-storage-array/"  rel="bookmark" class="crp_title">New HP P2000 and P4000 Storage Arrays</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/review-sun-storage-7000-unified-storage-system-part-ii/"  rel="bookmark" class="crp_title">Review: Sun Storage 7000 Unified Storage System – Part II</a></li><li><a href="http://gestaltit.com/all/tech/storage/edsai/storage-layout-%e2%80%93-why-care/"  rel="bookmark" class="crp_title">Storage Layout – Why care?</a></li><li><a href="http://gestaltit.com/all/tech/storage/chris/violin-memory-release-ssd-array/"  rel="bookmark" class="crp_title">Violin Memory Inc Release New All-SSD Array</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/chris/wide-striping-benefits/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Chris for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/chris/wide-striping-benefits/">The Benefits of Wide Striping – Avoiding A Long Tail</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/featured/" title="View all posts in Featured" rel="category tag">Featured</a>, <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/chris/wide-striping-benefits/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Did Microsoft and Intel Get 1 Million iSCSI IOPS?</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/#comments</comments>
		<pubDate>Tue, 23 Mar 2010 20:00:19 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[10 gigabit Ethernet]]></category>
		<category><![CDATA[benchmarks]]></category>
		<category><![CDATA[CRC]]></category>
		<category><![CDATA[CRC32C]]></category>
		<category><![CDATA[datacore]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Ethernet]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[Hyper-V]]></category>
		<category><![CDATA[Intel]]></category>
		<category><![CDATA[IOPS]]></category>
		<category><![CDATA[iSCSI]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[MSI-X]]></category>
		<category><![CDATA[Nagle]]></category>
		<category><![CDATA[Open-E]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[QPI]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[SR-IOV]]></category>
		<category><![CDATA[StarWind]]></category>
		<category><![CDATA[TCP offload]]></category>
		<category><![CDATA[throughput]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[VMDc]]></category>
		<category><![CDATA[VMDq]]></category>
		<category><![CDATA[VMQ]]></category>
		<category><![CDATA[Windows Server 2008 R2]]></category>
		<category><![CDATA[Xeon]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=2812</guid>
		<description><![CDATA[Ever since Microsoft and Intel declared that the combination of Windows and Nehalem could deliver over a million iSCSI IOPS, I've been curious about just how they did it. What black magic could push that many I/Os over a single Ethernet connection? And what was on the other end? Now Intel has revealed all in a whitepaper, and the results are surprising!]]></description>
			<content:encoded><![CDATA[<p><!-- google_ad_section_start --></p>
<p><a href="http://blog.fosketts.net/wp-content/uploads/2010/01/Speed_Limit_Infinity_sm.png" ><img class="aligncenter size-full wp-image-2591" style="display: block; margin-right: auto; margin-left: auto;" title="Speed_Limit_Infinity_sm" src="http://blog.fosketts.net/wp-content/uploads/2010/01/Speed_Limit_Infinity_sm.png" alt="How fast can iSCSI get?" width="200" height="250" /></a>Ever since Microsoft and Intel declared that the combination of Windows and Nehalem could deliver  over a million iSCSI IOPS, I’ve been curious about just how they did it. What black magic could push that many I/Os over a single Ethernet connection? And what was on the other end? Now <strong>Intel has revealed all in a <a href="http://download.intel.com/support/network/sb/inteliscsiwp.pdf" >whitepaper</a></strong><strong>, and the results are surprising</strong>!</p>
<h3>What iSCSI Did</h3>
<p>Let’s review the test for a moment. In March, Microsoft and Intel demonstrated that the combination of Windows Server 2008 R2 and the Xeon 5500 could saturate a 10 Gb Ethernet link, pushing <a href="http://gestaltit.com/featured/top/stephen/wirespeed-10-gb-iscsi/" ><strong>iSCSI throughput to wire speed</strong></a>. That’s 1,174 MB/s, right around the theoretical maximum of a ten-gigabit link, given a tiny bit of overhead. The pair reunited in January to show that this same combination could deliver an astonishing <a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-push-one-million-iscsi-iops/" ><strong>million I/O operations per second</strong></a>, too.</p>
<p>Both of these results are astonishing. Sure, many high-end Fibre Channel SANs and storage systems blast out gigabytes of data and millions of I/O operations every second, but these tests are much more focused. <a href="http://blog.fosketts.net/2010/02/26/fair-technology-benchmarks/" >Benchmarks are perilous</a>, but the folks at Microsoft and Intel devised a fairly clever and focused set. Rather than a “mine’s bigger” contest, the pair only needs to prove that <strong>iSCSI can play with the pros</strong>.</p>
<p>The side effect is a demonstration of the capabilities of Microsoft and Intel components. Microsoft showed off the capabilities of Windows Server 2008 R2, Hyper-V, and their software iSCSI initiator, while Intel can brag about the Xeon 5500 server platform and X520-2 10 Gb Ethernet Server Adapter with their 82599EB controller. Your mileage may vary, but it is possible to construct a true storage monster on an average server budget.</p>
<h3>Intel Inside</h3>
<p>Let’s start by looking at the configuration of the local end of the tested configuration. I’m a storage guy so I think of it as the initiator, but you might say it’s the server, the client, or the host. Regardless, the system under test (SUT) is what was put under the microscope. The configuration was a common one: A high-end computer packing an Intel Xeon CPU and 82599-based 10 Gb Ethernet adapter. Most data centers have a machine or two just like this one.</p>
<p>Looking closely, we see that the test in question relied on the following key components:</p>
<ul>
<li>Intel’s “Shadycove” <a href="http://www.intel.com/products/workstation/motherboards/s5520sc/s5520sc-overview.htm" >S5520SC</a> workstation-class motherboard</li>
<li>The Intel Xeon <a href="http://ark.intel.com/Product.aspx?id=37113" >W5580</a> CPU (4 cores, 8 MB cache, 3.20 GHz)</li>
<li>24 GB of DDR3 RAM</li>
<li>Intel “Niantic” <a href="http://ark.intel.com/Product.aspx?id=32207" >82599EB</a> 10 Gb Ethernet controller</li>
<li>Microsoft Windows Server 2008 R2 x64</li>
</ul>
<p>This combination would set you back about $7,500 – $450 for the motherboard, $1,500 for the CPU, 6 2 GB DDR3 SDRAM modules at $80 each, $1,200 for the Intel X520 NIC, and $4,000 for an Enterprise copy of Windows Server 2008 R2. <strong>Not cheap, but not an exotic server either</strong>.</p>
<h3>Initiate and Optimize</h3>
<p>The secret to push the tested system to perform like it did is in the optimizations in the server platform, the NIC, and Windows Server itself.</p>
<ul>
<li>The Xeon 5500 processor series includes many enhancements:
<ul>
<li>An integrated memory controller allows for faster RAM access</li>
<li><a href="http://en.wikipedia.org/wiki/Intel_QuickPath_Interconnect" rel="nofollow" >QuickPath interconnect (QPI)</a> replaces the old front-side bus and enhances I/O off the core</li>
<li>A new I/O subsystem with PCIe integrated into the CPU</li>
<li><a href="http://en.wikipedia.org/wiki/Message_Signaled_Interrupts" rel="nofollow" >MSI-X</a> expands the number of interrupts a PCI device can use</li>
<li>New instructions for on-board CRC-32C decoding, speeding up iSCSI digest processing</li>
</ul>
</li>
<li>The 82599 Ethernet controller also includes enhanced capabilities:
<ul>
<li><a href="http://www.intel.com/network/connectivity/vtc_vmdq.htm" >VMDq</a> maps I/O queues to multiple cores and virtual machines, reducing I/O bottlenecks</li>
<li>Offload of TCP segmentation and receive-side coalescing</li>
<li>Interestingly, it does not appear that <a href="http://www.intel.com/network/connectivity/solutions/vmdc.htm" >VMDc/SR-IOV</a> was employed in the test</li>
</ul>
</li>
<li>Microsoft Windows Server 2008 R2 and Hyper-V are ready to use all of these features and more:
<ul>
<li>R2 uses multi-core CPUs more effectively in general</li>
<li><a href="http://technet.microsoft.com/en-us/network/dd277646.aspx" rel="nofollow" >Receive-side scaling (RSS)</a> spreads the I/O workload across all four Xeon cores</li>
<li>The iSCSI initiator now allows CRC digest offload (using the new Xeon command set)</li>
<li>Numerous “<a href="http://download.microsoft.com/download/a/f/d/afdfd50d-6eb9-425e-84e1-b4085a80e34e/SVR-T332_WH07.pptx" rel="nofollow" >NUMA I/O</a>” optimizations in the initiator</li>
<li><a href="http://en.wikipedia.org/wiki/Nagle" rel="nofollow" >TCP/IP Nagle</a> can be disabled in the registry</li>
<li>Hyper-V VMQ allows the network packets to be copied directly into the guest virtual machine’s memory</li>
</ul>
</li>
</ul>
<p>Whew! Put all of these optimizations in a blender and Hyper-V virtual machine iSCSI access will be <strong>twice as fast as before</strong>. No kidding!</p>
<h3>Stay On Target</h3>
<p>But we knew all of this back in January. We also saw that a Cisco Nexus 5020 switch was used to fan out to 10 software iSCSI targets. But until now there was no mention of what targets were used exactly.</p>
<p>The final footnotes in Intel’s whitepaper reveals that the storage backing the million IOPS test was none other than <a href="http://www.starwindsoftware.com/" ><strong>StarWind Software</strong></a><strong>’s iSCSI SAN</strong>! It is unclear what led Microsoft and Intel to use this particular iSCSI target (the earlier throughput tests ran on NetApp filers), but it does speak to the quality of this product.</p>
<p>It is not clear how many disk drives were used, but I would guess that SSDs or ramdisks might have been employed to pull a million IOPS. Network optimizations are also not mentioned, though jumbo frames would not be a benefit in an IOPS test.</p>
<p>StarWind’s software runs on Microsoft Windows and creates a full-featured iSCSI target, complete with data mirroring, automatic failover and failback, replication, snapshots, and thin provisioning. The company prices their iSCSI SAN at $6,000 for two nodes and competes with the likes of <a href="http://www.datacore.com/" >DataCore</a> and <a href="http://www.open-e.com/" >Open-E</a>. But the StarWind solution seems at a glance to be more full-featured than these other offerings.</p>
<h3>Try It Yourself!</h3>
<p>I imagine many folks like me might be tempted to try to reproduce these results. More valuable would be a set of best practice guidelines for the deployment of software iSCSI in Windows Server 2008 R2 and Hyper-V environments. Given the relatively modest hardware involved, there should be nothing stopping us!</p>
<p>These test results also prompted me to get in touch with StarWind to try their iSCSI target software. I was pleasantly surprised to learn that they are currently offering <strong>free non-production licenses</strong> to <a href="http://www.starwindsoftware.com/news/30" >VMware vExperts, VCPs, and VCIs</a> as well as <a href="http://www.starwindsoftware.com/news/31" >Microsoft MVPs, MCPs, and MCT Professionals</a>. Many of my readers fall into one (or more) of those buckets, and I applaud the company for this offer. If only more companies realized the value in giving away test licenses to influencers and thought leaders!</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-push-one-million-iscsi-iops/"  rel="bookmark" class="crp_title">Microsoft and Intel Push One Million iSCSI IOPS</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-pushing-iscsi-performance-limits/"  rel="bookmark" class="crp_title">Microsoft and Intel Pushing iSCSI Performance Limits</a></li><li><a href="http://gestaltit.com/featured/top/stephen/wirespeed-10-gb-iscsi/"  rel="bookmark" class="crp_title">Wire-Speed 10 Gb iSCSI, Anyone?</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/essential-reading-for-vmware-esx-iscsi-users/"  rel="bookmark" class="crp_title">Essential Reading for VMware ESX iSCSI Users!</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-fcoe-support/"  rel="bookmark" class="crp_title">Where Is Microsoft’s FCoE Support?</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/">How Did Microsoft and Intel Get 1 Million iSCSI IOPS?</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

