<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:series="http://unfoldingneurons.com/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
xmlns:rawvoice="http://www.rawvoice.com/rawvoiceRssModule/"
>

<channel>
	<title>Gestalt IT &#187; 10 gigabit Ethernet Archives  &#8211; Gestalt IT</title>
	<atom:link href="http://gestaltit.com/tag/10-gigabit-ethernet/feed/" rel="self" type="application/rss+xml" />
	<link>http://gestaltit.com</link>
	<description>Independent Experts United</description>
	<lastBuildDate>Fri, 03 Feb 2012 17:00:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<image>
			<title>Gestalt IT</title>
			<url>http://gestaltit.com/wp-content/uploads/2009/02/gestalt-it-feedicon-21.png</url>
			<link>http://gestaltit.com</link>
			<width>144</width>
			<height>37</height>
			<description>Independent Experts United</description>
		</image><!-- podcast_generator="Blubrry PowerPress/2.0.4" -->
	<itunes:summary>Gestalt IT is a community of independent IT infrastructure experts. We gather at GestaltIT.com and our Tech FIeld Day events to discuss the topics of the day. This podcast includes video and audio recordings of these discussions.</itunes:summary>
	<itunes:author>Stephen Foskett</itunes:author>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://gestaltit.com/wp-content/uploads/powerpress/Gestalt_IT_Tech_Field_Day_Roundtable_Podcast_600.png" />
	<itunes:owner>
		<itunes:name>Stephen Foskett</itunes:name>
		<itunes:email>stephen@fosketts.net</itunes:email>
	</itunes:owner>
	<managingEditor>stephen@fosketts.net (Stephen Foskett)</managingEditor>
	<itunes:subtitle>The best independent IT commentary</itunes:subtitle>
	<itunes:keywords>Storage, Virtualization, Networking, IT</itunes:keywords>
	<image>
		<title>Gestalt IT &#187; 10 gigabit Ethernet Archives  &#8211; Gestalt IT</title>
		<url>http://gestaltit.com/wp-content/uploads/powerpress/Gestalt_IT_Tech_Field_Day_Roundtable_Podcast_144.png</url>
		<link>http://gestaltit.com</link>
	</image>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:category text="Technology">
		<itunes:category text="Tech News" />
	</itunes:category>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
			<item>
		<title>The Scaling Limitations of Etherchannel -Or- Why 1+1 Does Not Equal 2</title>
		<link>http://gestaltit.com/all/tech/networking/ethan/scaling-limitations-etherchannel/</link>
		<comments>http://gestaltit.com/all/tech/networking/ethan/scaling-limitations-etherchannel/#comments</comments>
		<pubDate>Tue, 07 Dec 2010 18:30:26 +0000</pubDate>
		<dc:creator>Ethan Banks</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Networking]]></category>
		<category><![CDATA[10 gigabit Ethernet]]></category>
		<category><![CDATA[Cisco]]></category>
		<category><![CDATA[etherchannel]]></category>
		<category><![CDATA[Ethernet]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[Nerdcore]]></category>
		<category><![CDATA[Network]]></category>
		<category><![CDATA[Network switch]]></category>
		<category><![CDATA[Quality of service]]></category>
		<category><![CDATA[Virtual LAN]]></category>

		<guid isPermaLink="false">http://packetattack.wordpress.com/?p=709</guid>
		<description><![CDATA[Some of you know I took on a new job earlier this year, where the challenge was (and is) to transform a globally distributed network for a growing company into an enterprise class operation. A major focus area has been eliminating single points of failure (SPOFs): single links, single routers, single firewalls, etc. If it can break and consequently interrupt traffic flow, part of my job is to design around the SPOF within the constraints of a finite budget.]]></description>
			<content:encoded><![CDATA[<h3>Slaying SPOFs</h3>
<p>Some of you know I took on a new job earlier this year, where the challenge was (and is) to transform a globally distributed network for a growing company into an enterprise class operation. A major focus area has been eliminating single points of failure (<a rel="nofollow" href="http://en.wikipedia.org/wiki/Single_point_of_failure" class="zem_slink" title="Single point of failure" rel="wikipedia" >SPOF</a>s): single links, single routers, single firewalls, etc. If it can break and consequently interrupt traffic flow, part of my job is to design around the SPOF within the constraints of a finite budget.</p>
<p>The network documentation I inherited ranged from “mostly right but vague and outdated” to “a complete and utter fantasy requiring mind-altering substances to make sense of”. Ergo, untrustworthy to the point of being useless beyond perhaps slideware to show a particularly dim collection of simians. I have therefore been doing complete network explorations, building new documentation as I go.</p>
<p>To my horror, I one day discovered an egregious SPOF, where a single, fragile piece of CAT5 provided the sole physical path between two major concentrations of network activity. If that link ran into any trouble, an entire room containing hundreds of physical and virtual servers (and their storage) would have been cut off from the rest of the company.</p>
<p>To eliminate the physical path SPOF, the easy choice was to transform the single link into an <a rel="nofollow" href="http://en.wikipedia.org/wiki/EtherChannel" class="zem_slink" title="EtherChannel" rel="wikipedia" >etherchannel</a>. This I did; the single 1Gbps became a 4x1Gbps etherchannel plumbed back to one core switch. For good measure, I added a second 4x1Gbps, plumbed to a second core switch. Spanning-tree roots had already been established such that even-numbered VLANs would traverse one of the 4x1Gbps etherchannels, and odds the other…which you can read more about <strong><a rel="nofollow" href="http://packetattack.wordpress.com/2010/09/05/assembly-required-a-basic-spanning-tree-design-for-a-two-tier-data-center/" title="Assembly Required: A Basic Spanning-Tree Design for a Two-Tier Data Center" >here</a></strong> if interested.</p>
<p>All should now be sweetness and light, right? A 1Gbps SPOF (and probable bottleneck) was transformed into a load-distributed pair of 4x1Gbps etherchannels, and hey, if they weren’t complaining about the 1Gbps link before, they ought to be blissfully happy now!</p>
<h3>Mad Maths</h3>
<p>Enter the scaling problem: <strong>when it comes to etherchannel, 1+1 does not equal 2.</strong></p>
<p>The reason adding more physical links does not proportionally grow your available bandwidth is that your friendly neighborhood Cisco switch does not load-balance across etherchannel members frame by frame. You might assume that the frame #1 gets sent down etherchannel member #1, frame #2 down etherchannel member #2, etc. in a round-robin fashion. Reality is rather different. What the switch actually does is math. The sort of math will vary depending on the capabilities of the switch, and on what you have configured.</p>
<p>Commonly available <a href="http://www.cisco.com/en/US/docs/switches/lan/catalyst4500/12.2/54sg/configuration/guide/channel.html#wp1020804"  target="_blank">etherchannel load-balancing methods</a> include source and destination MACs, source and destination IPs, and (my personal favorite) source and destination layer 4 port. To determine which etherchannel member will be used to forward a frame, the switch performs mad maths based on the load-balancing method you’ve selected. The practical upshot is that the same conversation is always going to be forwarded across the same etherchannel member, because the math always works out the same.</p>
<p>This behavior can impact the network. Imagine backup server BEAST with enough horsepower to fill a 1Gbps link who is runing a restore operation to server NEEDY. BEAST and NEEDY are uplinked to different switches interconnected by an etherchannel. As the restore runs, each frame is hashed by the switch to determine which etherchannel member to forward across; the math will work out the same for every frame, meaning the entire conversation between BEAST and NEEDY is going to be forwarded across the same etherchannel member. The result is kind of like the picture above – one member that’s crushed, while the other members lie comparatively idle.</p>
<h3>Congestion Indignities</h3>
<p>The switch is not sensitive to an etherchannel member getting crushed; the switch just keeps on doing mad maths. Therefore, some other conversations heading across the link will just happen to get hashed to the same link that the BEAST-NEEDY restore operation is using. Those other unfortunate conversations will therefore suffer the indignities that happen during link congestion: dropped frames and increased latency. The real-world experience is that certain applications act slow or throw errors. Storage could dismount. Monitoring applications get upset as thresholds are exceeded.</p>
<p>Yuck.</p>
<p>Of course, it’s now up to the network engineer (you) to discover why the alarms are going off, track down the offending traffic flow (you are modeling your interswitch links, right?), and figure out what is to be done about it. In my experience, you won’t have a lot of luck explaining what’s happening to non-network people. I’ve had a hard time explaining that 1+1 doesn’t equal 2, (or 1+1+1+1 doesn’t equal 4). You don’t really have a 2Gbps or 4Gbps link just because you’ve built a fancy etherchannel. You’ve really got multiple parallel 1Gbps links, any one of which can still get congested in BEAST-NEEDY scenarios.</p>
<h3>So Fix It, Network Guy</h3>
<p>There’s a few ways to tackle the challenge of 1+1 not equaling 2.</p>
<ol>
<li><strong>Learn your traffic patterns.</strong> See if you can group heavy hitters into the same switch. That’s a pretty old-school way to go after the problem, and it won’t scale to large data center deployments. But you can find wins in this approach from time to time.</li>
<li><strong>Build a dedicated link.</strong> By this, I mean that you could build a link dedicated to just the traffic that’s causing the interswitch etherchannel all the heartburn. If your etherchannel is a trunk carrying a whole bunch of VLANs, you could build a parallel link that carries traffic for just a problem VLAN, while pruning that problem off of the etherchannel trunk. Might help, might not, depending on your situation…and of course, it’s a “one-off” fix, not a scalable solution necessarily. Some shops build networks dedicated to storage or to backup, and plumb specific interfaces on hosts to these specific networks for exactly this reason. There are increased in hardware, cabling, and complexity to make it happen, though.</li>
<li><strong>Add even more 1Gbps links to the etherchannel</strong>. This is not terribly practical. At the end of the day, you still have a potential bottleneck, but at least you’ve decreased the number of conversations that are likely to get hashed to a congested link.</li>
<li><strong>Replace the 1Gbps links with 10Gbps links.</strong> Increasing bandwidth is always an option. The jump to 10Gbps is a tough one, though: new switch hardware, higher power requirements, and likely new cabling will be required. And don’t forget to break out your checkbook.</li>
<li><strong>Apply QoS</strong>. If you have known offenders or predictable traffic patterns, you can write a QoS scheme to help manage the congestion. I tend to pump traffic like this through a traffic shaper, but there are other approaches, such as guaranteeing minimum bandwidth to important traffic, while dumping the link beast into the scavenger class. I have found that latency still tends to suffer when using a guaranteeing minimum bandwidth (CBWFQ) scheme. I have had the best luck with shaping.</li>
<li><strong>Tweak the beastly application.</strong> It’s not uncommon for certain applications to have a built-in throttle, so that you can cap network utilization right at the app. Talk to your system engineer and see…I’ve heard they’re people, too.</li>
</ol>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/networking/ethan/assembly-required-interconnecting-ethernet-chassis-switch/"  rel="bookmark" class="crp_title">Assembly Required – Interconnecting 2 Ethernet Chassis Switches</a></li><li><a href="http://gestaltit.com/all/tech/networking/scott/more-on-vswitch-load-balancing/"  rel="bookmark" class="crp_title">More on vSwitch Load Balancing</a></li><li><a href="http://gestaltit.com/all/tech/networking/ethan/assembly-required-basic-spanningtree-design-twotier-data-center/"  rel="bookmark" class="crp_title">Assembly Required: A Basic Spanning-Tree Design for a Two-Tier Data Center</a></li><li><a href="http://gestaltit.com/all/tech/storage/ethan/dont-drop-baby-data-center-bridging-storage-trust-ethernet/"  rel="bookmark" class="crp_title">Don’t Drop The Baby: Data Center Bridging Wants Storage To Trust Ethernet</a></li><li><a href="http://gestaltit.com/all/tech/networking/greg/bisectional-bandwidth-l2mp-trill-bridges-design-value/"  rel="bookmark" class="crp_title">Bisectional Bandwidth. And why L2MP and Trill/RBridges is vital to the Virtualised Data Centres.</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/networking/ethan/scaling-limitations-etherchannel/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© ethan for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/networking/ethan/scaling-limitations-etherchannel/">The Scaling Limitations of Etherchannel -Or- Why 1+1 Does Not Equal 2</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/" title="View all posts in All" rel="category tag">All</a>, <a href="http://gestaltit.com/category/all/tech/networking/" title="View all posts in Networking" rel="category tag">Networking</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/networking/ethan/scaling-limitations-etherchannel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>1000Base-What?</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/1000basewhat/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/1000basewhat/#comments</comments>
		<pubDate>Wed, 21 Apr 2010 16:00:09 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Networking]]></category>
		<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[10 gigabit Ethernet]]></category>
		<category><![CDATA[Computer history]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Ethernet]]></category>
		<category><![CDATA[Fast Ethernet]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[gigabit Ethernet]]></category>
		<category><![CDATA[Higher Speed Ethernet]]></category>
		<category><![CDATA[Personal]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=2972</guid>
		<description><![CDATA[Networking may be straightforward, but the world of networking terminology is not. I've been steeped in the strange lingo of Ethernet for many years, but I still get confused by some of the terms. What's the difference between 1000BASE-CX, 1000BASE-SX, and 1000BASE-T? In this post, I'm going to tackle this Ethernet network naming convention.]]></description>
			<content:encoded><![CDATA[<p><!-- google_ad_section_start --></p>
<p>Networking may be straightforward, but the world of networking terminology is not. I&#8217;ve been steeped in the strange lingo of Ethernet for many years, but I still get confused by some of the terms. What&#8217;s the difference between 1000BASE-CX, 1000BASE-SX, and 1000BASE-T? In this post, I&#8217;m going to tackle this Ethernet network naming convention.</p>
<h3>Networking Basics</h3>
<p>Let&#8217;s get the first two components of the network name out of the way:</p>
<p>The first part is the signaling rate in megabits per second. In layman&#8217;s terms, this is the speed of the network at hand. You are likely to come across one of the following:</p>
<ul>
<li>10 megabits</li>
<li>100 megabits &#8211; Fast Ethernet</li>
<li>1000 megabits &#8211; Gigabit Ethernet, GbE, or 1000BASE-X</li>
<li>10,000 megabits &#8211; 10 Gigabit Ethernet, 10GbE, or 10GBASE-X</li>
<li>40,000 megabits &#8211; 40 Gigabit Ethernet, 40GbE, or 40GBASE-X</li>
<li>100,000 megabits &#8211; 100 Gigabit Ethernet, 100GbE, or 100GBASE-X</li>
</ul>
<p>It may strike you as odd that the next part is always the word, &#8220;BASE&#8221;. But there is a reason for this, too.BASE refers &#8220;baseband&#8221;, meaning that this is an unfiltered line not requiring a digital modulation scheme. Back in the day, there was a 10PASS-TS version of Ethernet that used a signaling scheme similar to a modem, but baseband is dominant today.</p>
<p>So 100BASE refers to a Fast Ethernet connection that uses the unfiltered cable for transmission.</p>
<h3>BASE-What?</h3>
<p>The third part of an Ethernet network type refers to the cabling used to carry the signals. The earliest forms of Ethernet used coaxial cable, but thin twisted-pair cabling became popular in the mid-1990s. Faster versions of Ethernet also often use fiber optics rather than electrical signals.</p>
<p>There are a bewildering assortment of physical interconnects for Ethernet. But the naming system isn&#8217;t as complex as it might appear:</p>
<ul>
<li>The first letter tells us which kind of wire we are talking about:
<ul>
<li>&#8220;T&#8221; means twisted-pair cable (e.g. the common Cat5 in use today)</li>
<li>&#8220;K&#8221; means a copper backplane</li>
<li>&#8220;C&#8221; means balanced copper cable</li>
<li>&#8220;F&#8221; means optical cable</li>
<li>&#8220;B&#8221; uses two wavelengths over a single optical cable</li>
<li>&#8220;S&#8221; means short-range multi-mode optical cable (less than 100 m)</li>
<li>&#8220;L&#8221; means long-range single- or multi-mode optical cable (100 m to 10 km)</li>
<li>&#8220;E&#8221; means extended-range optical cable (10 km to 40 km)</li>
<li>&#8220;Z&#8221; means long-range single-mode cable at a higher wavelength</li>
</ul>
</li>
<li>Next is the coding scheme for data on the wire
<ul>
<li>&#8220;X&#8221; means 4B/5B block coding for Fast Ethernet or 8B/10B block coding for Gigabit Ethernet</li>
<li>&#8220;R&#8221; means 64B/66B block coding</li>
</ul>
</li>
<li>Finally, we have a number representing the number of parallel &#8220;lanes&#8221; for data
<ul>
<li>&#8220;1&#8243; would mean serial (non-parallel) but is omitted instead</li>
<li>&#8220;4&#8243; or &#8220;10&#8243; are available for copper wire</li>
<li>Just about any other number could be used for optical lanes or wavelengths</li>
</ul>
</li>
</ul>
<h3>Examples</h3>
<p>Now let&#8217;s look at some examples:</p>
<ul>
<li>Back in the day, 10BASE-T became more common than coaxial 10BASE2. It was a simple 10 megabit baseband signal over common twisted-pair.</li>
<li>When Fast Ethernet first rolled out, there was some concern that traditional (usually Cat3) cabling couldn&#8217;t handle 100 megabits. Some early implementations used four copper pairs (100BASE-T4) or fiber optics (100BASE-FX), but nearly every 100 megabit Ethernet connection today is 100BASE-TX, using plain two pairs on plain Cat5 cable.</li>
<li>Gigabit Ethernet had a similar history. Many were concerned that two pairs on unshielded Cat5 wiring could not handle 1000 megabits per second, so optical (1000BASE-SX) and balanced shielded wiring (1000BASE-CX) were specified. Although an unshielded 2-pair standard was developed (1000BASE-TX), it never really caught on. Therefore, today&#8217;s predominant gigabit LAN connection, 1000BASE-T, uses all four pairs of unshielded twister-pair wiring on a Cat5 cable (see note 1).</li>
<li>The 10 Gigabit Ethernet world has mostly shifted to the block coding scheme from Fibre Channel, 64B/66B, which is denoted by the letter &#8220;R&#8221;. This gives us a family of fiber optic cables (10GBASE-SR, LR, ER, etc), and a copper backplane interconnect (10GBASE-KR). The earlier copper wiring standard (10GBASE-CX4) used InfiniBand-like 4-lane cables and 8B/10B signaling, as did 10GBASE-KX4 on the backplane. A backwards-compatible twisted-pair 10GBASE-T has also been developed, but work continues to make it power-efficient enough to be practical (see note 2).</li>
<li>Looking ahead, we see <a href="http://www.ethernetalliance.org/files/static_page_files/F5BB8944-1D09-3519-ADA94D227208E301/HSE%20FAQ%20_02_0601%20_2_.pdf"  target="_blank">Higher-Speed Ethernet</a> emerging: 40GBASE-KR4 for backplane use, multi-mode optical 40GBASE-SR4 and 100GBASE-SR10, and long-range single-mode optical 40GBASE-LR4 and 100GBASE-LR10.</li>
</ul>
<p>As you can see, all this alphabet soup does have some consistency. Common unshielded twisted pair wiring is all &#8220;BASE-T&#8221;, optics are denoted according to their range (&#8220;S&#8221;, &#8220;L&#8221;, &#8220;E&#8221;), and backplanes use &#8220;K&#8221; copper. Clear as mud?</p>
<p>Note 1: Lots of people (and even equipment makers) incorrectly refer to common Gigabit Ethernet as &#8220;1000BASE-TX&#8221;, but this really should be called &#8220;1000BASE-T&#8221;.</p>
<p>Note 2: We will probably never see a 10GBASE-TX, which would use just 2 pairs of unshielded twisted pair copper wiring.</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/networking/greg/10-gigabit-ethernet-lan-year/"  rel="bookmark" class="crp_title">Is this the year of 10 Gigabit Ethernet in the LAN?</a></li><li><a href="http://gestaltit.com/all/tech/networking/ethan/assembly-required-interconnecting-ethernet-chassis-switch/"  rel="bookmark" class="crp_title">Assembly Required – Interconnecting 2 Ethernet Chassis Switches</a></li><li><a href="http://gestaltit.com/all/tech/storage/edsai/sync-async-replication/"  rel="bookmark" class="crp_title">Sync or Async Replication?</a></li><li><a href="http://gestaltit.com/all/tech/storage/devang/emc-symmetrix-dmx4-symmetrix-vmax-basic-differences/"  rel="bookmark" class="crp_title">EMC Symmetrix DMX-4 and Symmetrix V-Max: Basic Differences</a></li><li><a href="http://gestaltit.com/all/tech/virtualization/rich/cisco-ucs-for-dummies-%e2%80%93-lan-and-san-connectivity/"  rel="bookmark" class="crp_title">Cisco UCS for Dummies – LAN and SAN Connectivity</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/1000basewhat/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/1000basewhat/">1000Base-What?</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/networking/" title="View all posts in Networking" rel="category tag">Networking</a>, <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/1000basewhat/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Did Microsoft and Intel Get 1 Million iSCSI IOPS?</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/#comments</comments>
		<pubDate>Tue, 23 Mar 2010 20:00:19 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[10 gigabit Ethernet]]></category>
		<category><![CDATA[benchmarks]]></category>
		<category><![CDATA[CRC]]></category>
		<category><![CDATA[CRC32C]]></category>
		<category><![CDATA[datacore]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Ethernet]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[Hyper-V]]></category>
		<category><![CDATA[Intel]]></category>
		<category><![CDATA[IOPS]]></category>
		<category><![CDATA[iSCSI]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[MSI-X]]></category>
		<category><![CDATA[Nagle]]></category>
		<category><![CDATA[Open-E]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[QPI]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[SR-IOV]]></category>
		<category><![CDATA[StarWind]]></category>
		<category><![CDATA[TCP offload]]></category>
		<category><![CDATA[throughput]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[VMDc]]></category>
		<category><![CDATA[VMDq]]></category>
		<category><![CDATA[VMQ]]></category>
		<category><![CDATA[Windows Server 2008 R2]]></category>
		<category><![CDATA[Xeon]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=2812</guid>
		<description><![CDATA[Ever since Microsoft and Intel declared that the combination of Windows and Nehalem could deliver over a million iSCSI IOPS, I've been curious about just how they did it. What black magic could push that many I/Os over a single Ethernet connection? And what was on the other end? Now Intel has revealed all in a whitepaper, and the results are surprising!]]></description>
			<content:encoded><![CDATA[<p><!-- google_ad_section_start --></p>
<p><a href="http://blog.fosketts.net/wp-content/uploads/2010/01/Speed_Limit_Infinity_sm.png" ><img class="aligncenter size-full wp-image-2591" style="display: block; margin-right: auto; margin-left: auto;" title="Speed_Limit_Infinity_sm" src="http://blog.fosketts.net/wp-content/uploads/2010/01/Speed_Limit_Infinity_sm.png" alt="How fast can iSCSI get?" width="200" height="250" /></a>Ever since Microsoft and Intel declared that the combination of Windows and Nehalem could deliver  over a million iSCSI IOPS, I’ve been curious about just how they did it. What black magic could push that many I/Os over a single Ethernet connection? And what was on the other end? Now <strong>Intel has revealed all in a <a href="http://download.intel.com/support/network/sb/inteliscsiwp.pdf" >whitepaper</a></strong><strong>, and the results are surprising</strong>!</p>
<h3>What iSCSI Did</h3>
<p>Let’s review the test for a moment. In March, Microsoft and Intel demonstrated that the combination of Windows Server 2008 R2 and the Xeon 5500 could saturate a 10 Gb Ethernet link, pushing <a href="http://gestaltit.com/featured/top/stephen/wirespeed-10-gb-iscsi/" ><strong>iSCSI throughput to wire speed</strong></a>. That’s 1,174 MB/s, right around the theoretical maximum of a ten-gigabit link, given a tiny bit of overhead. The pair reunited in January to show that this same combination could deliver an astonishing <a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-push-one-million-iscsi-iops/" ><strong>million I/O operations per second</strong></a>, too.</p>
<p>Both of these results are astonishing. Sure, many high-end Fibre Channel SANs and storage systems blast out gigabytes of data and millions of I/O operations every second, but these tests are much more focused. <a href="http://blog.fosketts.net/2010/02/26/fair-technology-benchmarks/" >Benchmarks are perilous</a>, but the folks at Microsoft and Intel devised a fairly clever and focused set. Rather than a “mine’s bigger” contest, the pair only needs to prove that <strong>iSCSI can play with the pros</strong>.</p>
<p>The side effect is a demonstration of the capabilities of Microsoft and Intel components. Microsoft showed off the capabilities of Windows Server 2008 R2, Hyper-V, and their software iSCSI initiator, while Intel can brag about the Xeon 5500 server platform and X520-2 10 Gb Ethernet Server Adapter with their 82599EB controller. Your mileage may vary, but it is possible to construct a true storage monster on an average server budget.</p>
<h3>Intel Inside</h3>
<p>Let’s start by looking at the configuration of the local end of the tested configuration. I’m a storage guy so I think of it as the initiator, but you might say it’s the server, the client, or the host. Regardless, the system under test (SUT) is what was put under the microscope. The configuration was a common one: A high-end computer packing an Intel Xeon CPU and 82599-based 10 Gb Ethernet adapter. Most data centers have a machine or two just like this one.</p>
<p>Looking closely, we see that the test in question relied on the following key components:</p>
<ul>
<li>Intel’s “Shadycove” <a href="http://www.intel.com/products/workstation/motherboards/s5520sc/s5520sc-overview.htm" >S5520SC</a> workstation-class motherboard</li>
<li>The Intel Xeon <a href="http://ark.intel.com/Product.aspx?id=37113" >W5580</a> CPU (4 cores, 8 MB cache, 3.20 GHz)</li>
<li>24 GB of DDR3 RAM</li>
<li>Intel “Niantic” <a href="http://ark.intel.com/Product.aspx?id=32207" >82599EB</a> 10 Gb Ethernet controller</li>
<li>Microsoft Windows Server 2008 R2 x64</li>
</ul>
<p>This combination would set you back about $7,500 – $450 for the motherboard, $1,500 for the CPU, 6 2 GB DDR3 SDRAM modules at $80 each, $1,200 for the Intel X520 NIC, and $4,000 for an Enterprise copy of Windows Server 2008 R2. <strong>Not cheap, but not an exotic server either</strong>.</p>
<h3>Initiate and Optimize</h3>
<p>The secret to push the tested system to perform like it did is in the optimizations in the server platform, the NIC, and Windows Server itself.</p>
<ul>
<li>The Xeon 5500 processor series includes many enhancements:
<ul>
<li>An integrated memory controller allows for faster RAM access</li>
<li><a href="http://en.wikipedia.org/wiki/Intel_QuickPath_Interconnect" rel="nofollow" >QuickPath interconnect (QPI)</a> replaces the old front-side bus and enhances I/O off the core</li>
<li>A new I/O subsystem with PCIe integrated into the CPU</li>
<li><a href="http://en.wikipedia.org/wiki/Message_Signaled_Interrupts" rel="nofollow" >MSI-X</a> expands the number of interrupts a PCI device can use</li>
<li>New instructions for on-board CRC-32C decoding, speeding up iSCSI digest processing</li>
</ul>
</li>
<li>The 82599 Ethernet controller also includes enhanced capabilities:
<ul>
<li><a href="http://www.intel.com/network/connectivity/vtc_vmdq.htm" >VMDq</a> maps I/O queues to multiple cores and virtual machines, reducing I/O bottlenecks</li>
<li>Offload of TCP segmentation and receive-side coalescing</li>
<li>Interestingly, it does not appear that <a href="http://www.intel.com/network/connectivity/solutions/vmdc.htm" >VMDc/SR-IOV</a> was employed in the test</li>
</ul>
</li>
<li>Microsoft Windows Server 2008 R2 and Hyper-V are ready to use all of these features and more:
<ul>
<li>R2 uses multi-core CPUs more effectively in general</li>
<li><a href="http://technet.microsoft.com/en-us/network/dd277646.aspx" rel="nofollow" >Receive-side scaling (RSS)</a> spreads the I/O workload across all four Xeon cores</li>
<li>The iSCSI initiator now allows CRC digest offload (using the new Xeon command set)</li>
<li>Numerous “<a href="http://download.microsoft.com/download/a/f/d/afdfd50d-6eb9-425e-84e1-b4085a80e34e/SVR-T332_WH07.pptx" rel="nofollow" >NUMA I/O</a>” optimizations in the initiator</li>
<li><a href="http://en.wikipedia.org/wiki/Nagle" rel="nofollow" >TCP/IP Nagle</a> can be disabled in the registry</li>
<li>Hyper-V VMQ allows the network packets to be copied directly into the guest virtual machine’s memory</li>
</ul>
</li>
</ul>
<p>Whew! Put all of these optimizations in a blender and Hyper-V virtual machine iSCSI access will be <strong>twice as fast as before</strong>. No kidding!</p>
<h3>Stay On Target</h3>
<p>But we knew all of this back in January. We also saw that a Cisco Nexus 5020 switch was used to fan out to 10 software iSCSI targets. But until now there was no mention of what targets were used exactly.</p>
<p>The final footnotes in Intel’s whitepaper reveals that the storage backing the million IOPS test was none other than <a href="http://www.starwindsoftware.com/" ><strong>StarWind Software</strong></a><strong>’s iSCSI SAN</strong>! It is unclear what led Microsoft and Intel to use this particular iSCSI target (the earlier throughput tests ran on NetApp filers), but it does speak to the quality of this product.</p>
<p>It is not clear how many disk drives were used, but I would guess that SSDs or ramdisks might have been employed to pull a million IOPS. Network optimizations are also not mentioned, though jumbo frames would not be a benefit in an IOPS test.</p>
<p>StarWind’s software runs on Microsoft Windows and creates a full-featured iSCSI target, complete with data mirroring, automatic failover and failback, replication, snapshots, and thin provisioning. The company prices their iSCSI SAN at $6,000 for two nodes and competes with the likes of <a href="http://www.datacore.com/" >DataCore</a> and <a href="http://www.open-e.com/" >Open-E</a>. But the StarWind solution seems at a glance to be more full-featured than these other offerings.</p>
<h3>Try It Yourself!</h3>
<p>I imagine many folks like me might be tempted to try to reproduce these results. More valuable would be a set of best practice guidelines for the deployment of software iSCSI in Windows Server 2008 R2 and Hyper-V environments. Given the relatively modest hardware involved, there should be nothing stopping us!</p>
<p>These test results also prompted me to get in touch with StarWind to try their iSCSI target software. I was pleasantly surprised to learn that they are currently offering <strong>free non-production licenses</strong> to <a href="http://www.starwindsoftware.com/news/30" >VMware vExperts, VCPs, and VCIs</a> as well as <a href="http://www.starwindsoftware.com/news/31" >Microsoft MVPs, MCPs, and MCT Professionals</a>. Many of my readers fall into one (or more) of those buckets, and I applaud the company for this offer. If only more companies realized the value in giving away test licenses to influencers and thought leaders!</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-push-one-million-iscsi-iops/"  rel="bookmark" class="crp_title">Microsoft and Intel Push One Million iSCSI IOPS</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-pushing-iscsi-performance-limits/"  rel="bookmark" class="crp_title">Microsoft and Intel Pushing iSCSI Performance Limits</a></li><li><a href="http://gestaltit.com/featured/top/stephen/wirespeed-10-gb-iscsi/"  rel="bookmark" class="crp_title">Wire-Speed 10 Gb iSCSI, Anyone?</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/essential-reading-for-vmware-esx-iscsi-users/"  rel="bookmark" class="crp_title">Essential Reading for VMware ESX iSCSI Users!</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-fcoe-support/"  rel="bookmark" class="crp_title">Where Is Microsoft’s FCoE Support?</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/">How Did Microsoft and Intel Get 1 Million iSCSI IOPS?</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft and Intel Pushing iSCSI Performance Limits</title>
		<link>http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-pushing-iscsi-performance-limits/</link>
		<comments>http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-pushing-iscsi-performance-limits/#comments</comments>
		<pubDate>Thu, 07 Jan 2010 15:17:03 +0000</pubDate>
		<dc:creator>Stephen Foskett</dc:creator>
				<category><![CDATA[Server Virtualization]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[10 GbE]]></category>
		<category><![CDATA[10 gigabit Ethernet]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Fibre Channel]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[Hyper-V]]></category>
		<category><![CDATA[Intel]]></category>
		<category><![CDATA[iSCSI]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[webcast]]></category>
		<category><![CDATA[Windows Server 2008 R2]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=2590</guid>
		<description><![CDATA["Maximizing Hyper-V iSCSI Performance with Microsoft and Intel" might sound like another "blah blah" marketing piece, but a little birdy tells me that this webcast will drop a bombshell about iSCSI performance.]]></description>
			<content:encoded><![CDATA[<p><!-- google_ad_section_start -->
<div id="attachment_2591" class="wp-caption aligncenter" style="width: 210px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; display: block; margin-right: auto; margin-left: auto;"><a href="http://blog.fosketts.net/wp-content/uploads/2010/01/Speed_Limit_Infinity_sm.png"  ><img class="size-full wp-image-2591" title="Speed_Limit_Infinity_sm" src="http://blog.fosketts.net/wp-content/uploads/2010/01/Speed_Limit_Infinity_sm.png" alt="" width="200" height="250" /></a>
<p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">How fast can iSCSI get?</p>
</div>
<p>&#8220;<a href="http://msevents.microsoft.com/CUI/WebCastEventDetails.aspx?EventID=1032432956&amp;EventCategory=4&amp;culture=en-US&amp;CountryCode=US" rel="nofollow"   >Maximizing Hyper-V iSCSI Performance with Microsoft and Intel</a>&#8221; might sound like another &#8220;blah blah&#8221; marketing piece, but a little birdy tells me that <strong>this webcast will drop a bombshell about iSCSI performance</strong>.</p>
<p>Lots of storage and networking folks don&#8217;t give iSCSI and Microsoft the credit they deserve. &#8220;iSCSI is cheap and easy,&#8221; they say, &#8220;but real performance requires Fibre Channel.&#8221; Those of us who have an open mind about such things know that this is simply not the case. The fastest SAN I ever saw was based on iSCSI, and <strong>Microsoft demonstrated </strong><a href="http://gestaltit.com/featured/top/stephen/wirespeed-10-gb-iscsi/"   ><strong>wire-speed iSCSI over 10 Gb Ethernet in March</strong></a>. I never saw a Fibre Channel SAN (even an 8 Gb/s one) push over a gigabyte per second over a single link!</p>
<p>Still, ask the average sysadmin and they will tell you that iSCSI isn&#8217;t for high performance applications. That&#8217;s why <strong>folks should tune in to this webcast, as Microsoft and Intel knock down another iSCSI performance myth</strong>. Note that even though Hyper-V is called out in the title and description, this discussion is really about Windows Server 2008 R2 and applies equally regardless of whether or not you use Microsoft&#8217;s hypervisor.</p>
<p>Watch this space for a summary of the news immediately following the announcement.</p>
<ul>
<li><strong>What:</strong> <a href="http://msevents.microsoft.com/CUI/WebCastEventDetails.aspx?EventID=1032432956&amp;EventCategory=4&amp;culture=en-US&amp;CountryCode=US" rel="nofollow"   >Maximizing Hyper-V iSCSI Performance with Microsoft and Intel</a> webcast</li>
<li><strong>When:</strong> Thursday, January 14, 2010 8:00 AM Pacific Time</li>
<li><strong>Where:</strong> <a href="http://msevents.microsoft.com/CUI/WebCastEventDetails.aspx?EventID=1032432956&amp;EventCategory=4&amp;culture=en-US&amp;CountryCode=US" rel="nofollow"   >MSEvents.Microsoft.com</a></li>
<li><strong>Who:</strong> Anyone interested in high-performance storage and server I/O</li>
</ul>
<div id="crp_related">
<h3>You might also want to read these other posts&#8230;</h3>
<ul>
<li><a href="http://blog.fosketts.net/2010/01/14/microsoft-intel-push-million-iscsi-iops/"   rel="bookmark" class="crp_title">Microsoft and Intel Push One Million iSCSI IOPS</a></li>
<li><a href="http://blog.fosketts.net/2009/03/11/10-gbe-iscsi-fcoe-microsoft/"   rel="bookmark" class="crp_title">10 GbE, iSCSI, FCoE, Microsoft, and the Future</a></li>
<li><a href="http://blog.fosketts.net/2009/05/05/windows-storage-server-2008/"   rel="bookmark" class="crp_title">I Can Finally Talk About Windows Storage Server 2008!</a></li>
<li><a href="http://blog.fosketts.net/2009/01/26/essential-vmware-esx-iscsi/"   rel="bookmark" class="crp_title">Essential Reading for VMware ESX iSCSI Users!</a></li>
<li><a href="http://blog.fosketts.net/2008/07/18/which-storage-protocol-vmware-2/"   rel="bookmark" class="crp_title">Which Storage Protocol For VMware?</a></li>
</ul>
</div>
<p><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://blog.fosketts.net/2010/01/07/microsoft-intel-iscsi-performance/" type="text/javascript" charset="utf-8"></script><!-- google_ad_section_end --><br />
<hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net" >Stephen Foskett, Pack Rat</a>, 2010. |<br />
<a href="http://blog.fosketts.net/2010/01/07/microsoft-intel-iscsi-performance/" >Microsoft and Intel Pushing iSCSI Performance Limits</a><br />
<br/><br />
This post was categorized as <a href="http://blog.fosketts.net/category/everything/enterprisestorage/"  title="View all posts in Enterprise storage" rel="category tag">Enterprise storage</a>,  <a href="http://blog.fosketts.net/category/gestaltit/"  title="View all posts in Gestalt IT" rel="category tag">Gestalt IT</a>,  <a href="http://blog.fosketts.net/category/everything/virtualstorage/"  title="View all posts in Virtual Storage" rel="category tag">Virtual Storage</a>. Each of my categories has its own feed if you&#8217;d like to filter out or focus on posts like this.<br/><br />
</small></p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-push-one-million-iscsi-iops/"  rel="bookmark" class="crp_title">Microsoft and Intel Push One Million iSCSI IOPS</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/essential-reading-for-vmware-esx-iscsi-users/"  rel="bookmark" class="crp_title">Essential Reading for VMware ESX iSCSI Users!</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-intel-1-million-iscsi-iops/"  rel="bookmark" class="crp_title">How Did Microsoft and Intel Get 1 Million iSCSI IOPS?</a></li><li><a href="http://gestaltit.com/featured/top/stephen/wirespeed-10-gb-iscsi/"  rel="bookmark" class="crp_title">Wire-Speed 10 Gb iSCSI, Anyone?</a></li><li><a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-fcoe-support/"  rel="bookmark" class="crp_title">Where Is Microsoft’s FCoE Support?</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-pushing-iscsi-performance-limits/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© Stephen Foskett for <a href="http://gestaltit.com">Gestalt IT</a>, 2010. |
<a href="http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-pushing-iscsi-performance-limits/">Microsoft and Intel Pushing iSCSI Performance Limits</a>
<br/>
Read more posts categorized as <a href="http://gestaltit.com/category/all/tech/virtualization/" title="View all posts in Server Virtualization" rel="category tag">Server Virtualization</a>, <a href="http://gestaltit.com/category/all/tech/storage/" title="View all posts in Storage" rel="category tag">Storage</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://gestaltit.com/all/tech/storage/stephen/microsoft-and-intel-pushing-iscsi-performance-limits/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

