Storage Efficiency and Optimization – The Other Green

For those of you in the New York City area, I will be presenting live in person at Storage Decisions September 23, 2009 conference The Other Green, Storage Efficiency and Optimization.

Throw out the "green“: buzzword, and you’re still left with the task of saving or maximizing use of space, power, and cooling while stretching available IT dollars to support growth and business sustainability. For some environments the solution may be consolation while others need to maintain quality of service response time, performance and availability necessitating faster, energy efficient technologies to achieve optimization objectives.

To accomplish these and other related issues, you can turn to the cloud, virtualization, intelligent power management, data footprint reduction and data management not to mention various types of tiered storage and performance optimization techniques. The session will look at various techniques and strategies to optimize either on-line active or primary as well as near-line or secondary storage environment during tough economic times, as well as to position for future growth, after all, there is no such thing as a data recession!

Topics, technologies and techniques that will be discussed include among others:

  • Energy efficiency (strategic) vs. energy avoidance (tactical), whats different between them
  • Optimization and the need for speed vs. the need for capacity, finding the right balance
  • Metrics & measurements for management insight, what the industry is doing (or not doing)
  • Tiered storage and tiered access including SSD, FC, SAS, tape, clouds and more
  • Data footprint reduction (archive, compress, dedupe) and thin provision among others
  • Best practices, financial incentives and what you can do today

This is a free event for IT professionals, however space I hear is limited, learn more and register here.

For those interested in broader IT data center and infrastructure optimization, check out the on-going seminar series The Infrastructure Optimization and Planning Best Practices (V2.009) – Doing more with less without sacrificing storage, system or network capabilities Seminar series continues September 22, 2009 with a stop in Chicago. This is also a free Seminar, register and learn more here or here.

Ok, nuff said.

Cheers gs

Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
twitter @storageio

All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved

Back to School and Dedupe School

Summers is over hear in the northern hemisphere and its back to school time.

This coming week I will be the substitute teacher filling in for my friend Mr. Backup in Minneapolis and Toronto for TechTargets Dedupe School. If you are in either city and have not yet signed up, check out the link here to learn more.

Hope to see you this week, or, next week at Infrastructure Optimization in Chicago or Storage Decisions in NYC where I will also be presenting or teaching if you prefer, as well as listening and learning from the attendees whats on their minds.

Stay current on other upcoming activities on our events page, as well as see whats new or in the news here.

Ok, nuff said.

Cheers gs

Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
twitter @storageio

All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved

Performance = Availability StorageIOblog featured ITKE guest blog

ITKE - IT Knowledge Exchange

Recently IT Knowledge Exchange named me and StorageIOblog as their weekly featured IT blog too which Im flattered and honored. Consequently, I did a guest blog for them titled Performance = Availability, Availability = Performance that you can read about here.

For those not familiar with ITKE, take a few minutes and go over and check it out, there is a wealth of information there on a diversity of topics that you can read about, or, you can also get involved and participate in the questions and answers discussions.

Speaking of ITKE, interested in “The Green and Virtual Data Center” (CRC), check out this link where you can download a free chapter of my book, along with information on how to order your own copy along with a special discount code from CRC press.

Thank you very much to Sean Brooks of ITKE and his social media team of Michael Morisy and Jenny Mackintosh for being named featured IT blogger, as well as for being able to do a guest post for them. It has been fantastic working them and particularly Jenny who helped with all of the logistics in putting together the various pieces including getting the post up on the web as well as in their news letter.

Ok, nuff said.

Cheers gs

Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
twitter @storageio

All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved

Data Center I/O Bottlenecks Performance Issues and Impacts

This is an excerpt blog version of the popular Server and StorageIO Group white paper "IT Data Center and Data Storage Bottlenecks" originally published August of 2006 that is as much if not more relevant today than it was in the past.

Most Information Technology (IT) data centers have bottleneck areas that impact application performance and service delivery to IT customers and users. Possible bottleneck locations shown in Figure-1 include servers (application, web, file, email and database), networks, application software, and storage systems. For example users of IT services can encounter delays and lost productivity due to seasonal workload surges or Internet and other network bottlenecks. Network congestion or dropped packets resulting in wasteful and delayed retransmission of data can be the results of network component failure, poor configuration or lack of available low latency bandwidth.

Server bottlenecks due to lack of CPU processing power, memory or under sized I/O interfaces can result in poor performance or in worse case scenarios application instability. Application including database systems bottlenecks due to excessive locking, poor query design, data contention and deadlock conditions result in poor user response time. Storage and I/O performance bottlenecks can occur at the host server due to lack of I/O interconnect bandwidth such as an overloaded PCI interconnect, storage device contention, and lack of available storage system I/O capacity.

These performance bottlenecks, impact most applications and are not unique to the large enterprise or scientific high compute (HPC) environments. The direct impact of data center I/O performance issues include general slowing of the systems and applications, causing lost productivity time for users of IT services. Indirect impacts of data center I/O performance bottlenecks include additional management by IT staff to trouble shoot, analyze, re-configure and react to application delays and service disruptions.


Figure-1: Data center performance bottleneck locations

Data center performance bottleneck impacts (see Figure-1) include:

  • Under utilization of disk storage capacity to compensate for lack of I/O performance capability
  • Poor Quality of Service (QoS) causing Service Level Agreements (SLA) objectives to be missed
  • Premature infrastructure upgrades combined with increased management and operating costs
  • Inability to meet peak and seasonal workload demands resulting in lost business opportunity

I/O bottleneck impacts
It should come as no surprise that businesses continue to consume and rely upon larger amounts of disk storage. Disk storage and I/O performance fuel the hungry needs of applications in order to meet SLAs and QoS objectives. The Server and StorageIO Group sees that, even with efforts to reduce storage capacity or improve capacity utilization with information lifecycle management (ILM) and Infrastructure Resource Management (IRM) enabled infrastructures, applications leveraging rich content will continue to consume more storage capacity and require additional I/O performance. Similarly, at least for the next few of years, the current trend of making and keeping additional copies of data for regulatory compliance and business continue is expected to continue. These demands all add up to a need for more I/O performance capabilities to keep up with server processor performance improvements.


Figure-2: Processing and I/O performance gap

Server and I/O performance gap
The continued need for accessing more storage capacity results in an alarming trend: the expanding gap between server processing power and available I/O performance of disk storage (Figure-2). This server to I/O performance gap has existed for several decades and continues to widen instead of improving. The net impact is that bottlenecks associated with the server to I/O performance lapse result in lost productivity for IT personal and customers who must wait for transactions, queries, and data access requests to be resolved.

Application symptoms of I/O bottlenecks
There are many applications across different industries that are sensitive to timely data access and impacted by common I/O performance bottlenecks. For example, as more users access a popular file, database table, or other stored data item, resource contention will increase. One way resource contention manifests itself is in the form of database “deadlock” which translates into slower response time and lost productivity. 

Given the rise and popularity of internet search engines, search engine optimization (SEO) and on-line price shopping, some businesses have been forced to create expensive read-only copies of databases. These read-only copies are used to support more queries to address bottlenecks from impacting time sensitive transaction databases.

In addition to increased application workload, IT operational procedures to manage and protect data help to contribute to performance bottlenecks. Data center operational procedures result in additional file I/O scans for virus checking, database purge and maintenance, data backup, classification, replication, data migration for maintenance and upgrades as well as data archiving. The net result is that essential data center management procedures contribute to performance challenges and impacting business productivity.

Poor response time and increased latency
Generally speaking, as additional activity or application workload including transactions or file accesses are performed, I/O bottlenecks result in increased response time or latency (shown in Figure-3). With most performance metrics more is better; however, in the case of response time or latency, less is better.  Figure-3 shows the impact as more work is performed (dotted curve) and resulting I/O bottlenecks have a negative impact by increasing response time (solid curve) above acceptable levels. The specific acceptable response time threshold will vary by applications and SLA requirements. The acceptable threshold level based on performance plans, testing, SLAs and other factors including experience serves as a guide line between acceptable and poor application performance.

As more workload is added to a system with existing I/O issues, response time will correspondingly decrease as was seen in Figure-3. The more severe the bottleneck, the faster response time will deteriorate (e.g. increase) from acceptable levels. The elimination of bottlenecks enables more work to be performed while maintaining response time below acceptable service level threshold limits.


Figure-3: I/O response time performance impact

Seasonal and peak workload I/O bottlenecks
Another common challenge and cause of I/O bottlenecks is seasonal and/or unplanned workload increases that result in application delays and frustrated customers. In Figure-4 a workload representing an eCommerce transaction based system is shown with seasonal spikes in activity (dotted curve). The resulting impact to response time (solid curve) is shown in relation to a threshold line of acceptable response time performance. For example, peaks due holiday shopping exchanges appear in January then dropping off increasing near mother’s day in May, then back to school shopping in August results in increased activity as does holiday shopping starting in late November.


Figure-4: I/O bottleneck impact from surge workload activity

Compensating for lack of performance
Besides impacting user productivity due to poor performance, I/O bottlenecks can result in system instability or unplanned application downtime. One only needs to recall recent electric power grid outages that were due to instability, insufficient capacity bottlenecks as a result of increased peak user demand.

I/O performance improvement approaches to address I/O bottlenecks have been to do nothing (incur and deal with the service disruptions) or over configure by throwing more hardware and software at the problem. To compensate for lack of I/O performance and counter the resulting negative impact to IT users, a common approach is to add more hardware to mask or move the problem.

However, this often leads to extra storage capacity being added to make up for a short fall in I/O performance. By over configuring to support peak workloads and prevent loss of business revenue, excess storage capacity must be managed throughout the non-peak periods, adding to data center and management costs. The resulting ripple affect is that now more storage needs to be managed, including allocating storage network ports, configuring, tuning, and backing up of data. This can and does result in environments that have storage utilization well below 50% of their useful storage capacity. The solution is to address the problem rather than moving and hiding the bottleneck elsewhere (rather like sweeping dust under the rug).

Business value of improved performance
Putting a value on the performance of applications and their importance to your business is a necessary step in the process of deciding where and what to focus on for improvement. For example, what is the value of reducing application response time and the associated business benefit of allowing more transactions, reservations or sales to be made? Likewise, what is the value of improving the productivity of a designer or animator to meet tight deadlines and market schedules? What is business benefit of enabling a customer to search faster for and item, place an order, access media rich content, or in general improve their productivity?

Server and I/O performance gap as a data center bottleneck
I/O performance bottlenecks are a wide spread issue across most data centers, affecting many applications and industries. Applications impacted by data center I/O bottlenecks to be looked at in more depth are electronic design automation (EDA), entertainment and media, database online transaction processing (OLTP) and business intelligence. These application categories represent transactional processing, shared file access for collaborative work, and processing of shared, time sensitive data.

Electronic design
Computer aided design (CAD), computer assisted engineering (CAE), electronic design automaton (EDA) and other design tools are used for a wide variety of engineering and design functions. These design tools require fast access to shared, secured and protected data. The objective of using EDA and other tools is to enable faster product development with better quality and improved worker productivity. Electronic components manufactured for the commercial, consumer and specialized markets rely on design tools to speed the time-to-market of new products as well as to improve engineer productivity.

EDA tools, including those from Cadence, Synopsis, Mentor Graphics and others, are used to develop expensive and time sensitive electronic chips, along with circuit boards and other components to meet market windows and suppler deadlines. An example of this is a chip vendor being able to simulate, develop, test, produce and deliver a new chip in time for manufacturers to release their new products based on those chips. Another example is aerospace and automotive engineering firms leveraging design tools, including CATIA and UGS, on a global basis relying on their suppler networks to do the same in a real-time, collaborative manner to improve productivity and time-to-market. These results in contention of shared file and data access and, as a work-around, more copies of data kept as local buffers.

I/O performance impacts and challenges for EDA, CAE and CAD systems include:

  • Delays in drawing and file access resulting in lost productivity and project delays
  • Complex configurations to support computer farms (server grids) for I/O and storage performance
  • Proliferation of dedicated storage on individual servers and workstations to improve performance

Entertainment and media
While some applications are characterized by high bandwidth or throughput, such as streaming video and digital intermediate (DI) processing of 2K (2048 pixels per line) and 4K (4096 pixels per line) video and film, there are many other applications that are also impacted by I/O performance time delays. Even bandwidth intensive applications for video production and other applications are time sensitive and vulnerable to I/O bottleneck delays. For example, cell phone ring tone, instant messaging, small MP3 audio, and voice- and e-mail are impacted by congestion and resource contention.

Prepress production and publishing requiring assimilation of many small documents, files and images while undergoing revisions can also suffer. News and information websites need to look up breaking stories, entertainment sites need to view and download popular music, along with still images and other rich content; all of this can be negatively impacted by even small bottlenecks.  Even with streaming video and audio, access to those objects requires accessing some form of a high speed index to locate where the data files are stored for retrieval. These indexes or databases can become bottlenecks preventing high performance storage and I/O systems from being fully leveraged.

Index files and databases must be searched to determine the location where images and objects, including streaming media, are stored. Consequently, these indices can become points of contention resulting in bottlenecks that delay processing of streaming media objects. When cell phone picture is taken phone and sent to someone, chances are that the resulting image will be stored on network attached storage (NAS) as a file with a corresponding index entry in a database at some service provider location. Think about what happens to those servers and storage systems when several people all send photos at the same time.

I/O performance impacts and challenges for entertainment and media systems include:

  • Delays in image and file access resulting in lost productivity
  • Redundant files and storage local servers to improve performance
  • Contention for resources causing further bottlenecks during peak workload surges

OLTP and business intelligence
Surges in peak workloads result in performance bottlenecks on database and file servers, impacting time sensitive OLTP systems unless they are over configured for peak demand. For example, workload spikes due to holiday and back-to-school shopping, spring break and summer vacation travel reservations, Valentines or Mothers Day gift shopping, and clearance and settlement on peak stock market trading days strain fragile systems. For database systems maintaining performance for key objects, including transaction logs and journals, it is important to eliminate performance issues as well as maintain transaction and data integrity.

An example tied to eCommerce is business intelligence systems (not to be confused with back office marketing and analytics systems for research). Online business intelligence systems are popular with online shopping and services vendors who track customer interests and previous purchases to tailor search results, views and make suggestions to influence shopping habits.

Business intelligence systems need to be fast and support rapid lookup of history and other information to provide purchase histories and offer timely suggestions. The relative performance improvements of processors shift the application bottlenecks from the server to the storage access network. These applications have, in some cases, resulted in an exponential increase in query or read operations beyond the capabilities of single database and storage instances, resulting in database deadlock and performance problems or the proliferation of multiple data copies and dedicated storage on application servers.

A more recent contribution to performance challenges, caused by the increased availability of on-line shopping and price shopping search tools, is low cost craze (LCC) or price shopping. LCC has created a dramatic increase in the number of read or search queries taking place, further impacting database and file systems performance. For example, an airline reservation system that supports price shopping while preventing impact to time sensitive transactional reservation systems would create multiple read-only copies of reservations databases for searches. The result is that more copies of data must be maintained across more servers and storage systems thus increasing costs and complexity. While expensive, the alternative of doing nothing results in lost business and market share.

I/O performance impacts and challenges for OLTP and business intelligence systems include:

  • Application and database contention, including deadlock conditions, due to slow transactions
  • Disruption to application servers to install special monitoring, load balance or I/O driver software
  • Increased management time required to support additional storage needed as a I/O workaround

Summary/Conclusion
It is vital to understand the value of performance, including response time or latency, and numbers of I/O operations for each environment and particular application. While the cost per raw TByte may seem relatively in-expensive, the cost for I/O response time performance also needs to be effectively addressed and put into the proper context as part of the data center QoS cost structure.

There are many approaches to address data center I/O performance bottlenecks with most centered on adding more hardware or addressing bandwidth or throughput issues. Time sensitive applications depend on low response time as workload including throughput increase and thus latency can not be ignored. The key to removing data center I/O bottlenecks is to find and address the problem instead of simply moving or hiding it with more hardware and/or software. Simply adding fast devices such as SSD may provide relief, however if the SSDs are attached to high latency storage controllers, the full benefit may not be realized. Thus, identify and gain insight into data center and I/O bottleneck paths eliminating issues and problems to boost productivity and efficiency.

Where to Learn More
Additional information about IT data center, server, storage as well as I/O networking bottlenecks along with solutions can be found at the Server and StorageIO website in the tips, tools and white papers, as well as news, books, and activity on the events pages. If you are in the New York area on September 23, 2009, check out my presentation on The Other Green – Storage Optimization and Efficiency that will touch on the above and other related topics. Download your copy of "IT Data Center and Storage Bottlenecks" by clicking here.

Ok, nuff said.

Cheers gs

Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
twitter @storageio

All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved

Upcoming Out and About Events

Following up on previous Out and About updates ( here and here ) of where I have been, heres where I’m going to be over the next couple of weeks.

On September 15th and 16th 2009, I will be the keynote speaker along with doing a deep dive discussion around data deduplication in Minneapolis, MN and Toronto ON. Free Seminar, register and learn more here.

The Infrastructure Optimization and Planning Best Practices (V2.009) – Doing more with less without sacrificing storage, system or network capabilities Seminar series continues September 22, 2009 with a stop in Chicago. Free Seminar, register and learn more here.

On September 23, 2009 I will be in New York City at Storage Decisions conference participating in the Ask the Experts during the expo session as well as presenting The Other Green — Storage Efficiency and Optimization.

Throw out the "green“: buzzword, and you’re still left with the task of saving or maximizing use of space, power, and cooling while stretching available IT dollars to support growth and business sustainability. For some environments the solution may be consolation while others need to maintain quality of service response time, performance and availability necessitating faster, energy efficient technologies to achieve optimization objectives. To accomplish these and other related issues, you can turn to the cloud, virtualization, intelligent power management, data footprint reduction and data management not to mention various types of tiered storage and performance optimization techniques. The session will look at various techniques and strategies to optimize either on-line active or primary as well as near-line or secondary storage environment during tough economic times, as well as to position for future growth, after all, there is no such thing as a data recession!

Topics, technologies and techniques that will be discussed include among others:

  • Energy efficiency (strategic) vs. energy avoidance (tactical)
  • Optimization and the need for speed vs. the need for capacity
  • Metrics and measurements for management insight
  • Tiered storage and tiered access including SSD, FC, SAS and clouds
  • Data footprint reduction (archive, compress, dedupe) and thin provision
  • Best practices, financial incentives and what you can do today

Free event, learn more and register here.

Check out the events page for other upcoming events and hope to see you this fall while Im out and about.

Cheers – gs

Greg Schulz – StorageIOblog, twitter @storageio Author “The Green and Virtual Data Center” (CRC)

I/O, I/O, Its off to Virtual Work and VMworld I Go (or went)

Ok, so I should have used that intro last week before heading off to VMworld in San Francisco instead of after the fact.

Think of it as a high latency title or intro, kind of like attaching a fast SSD to a slow, high latency storage controller, or a fast server attached to a slow network, or fast network with slow storage and servers, it is what it is.

I/O virtualization (IOV), Virtual I/O (VIO) along with I/O and networking convergence have been getting more and more attention lately, particularly on the convergence front. In fact one might conclude that it is trendy to all of a sudden to be on the IOV, VIO and convergence bandwagon given how clouds, soa and SaaS hype are being challenged, perhaps even turning to storm clouds?

Lets get back on track, or in the case of the past week, get back in the car, get back in the plane, get back into the virtual office and what it all has to do with Virtual I/O and VMworld.

The convergence game has at its center Brocade emanating from the data center and storage centric I/O corner challenging Cisco hailing from the MAN, WAN, LAN general networking corner.

Granted both vendors have dabbled with success in each others corners or areas of focus in the past. For example, Brocade as via acquisitions (McData+Nishan+CNT+INRANGE among others) a diverse and capable stable of local and long distance SAN connectivity and channel extension for mainframe and open systems supporting data replication, remote tape and wide area clustering. Not to mention deep bench experience with the technologies, protocols and partners solutions for LAN, MAN (xWDM), WAN (iFCP, FCIP, etc) and even FAN (file area networking aka NAS) along with iSCSI in addition to Fibre Channel and FICON solutions.

Disclosure: Here’s another plug ;) Learn more about SANs, LANs, MANs, WANs, POTs, PANs and related technologies and techniques in my book “Resilient Storage NetworksDesigning Flexible Scalable Data Infrastructures" (Elsevier).

Cisco not to be outdone has a background in the LAN, MAN, WAN space directly, or similar to Brocade via partnerships with product and experience and depth. In fact while many of my former INRANGE and CNT associates ended up at Brocade via McData or in-directly, some ended up at Cisco. While Cisco is known for general networking, the past several years they have gone from zero to being successful in the Fibre Channel and yes, even the FICON mainframe space while like Brocade (HBAs) dabbling in other areas like servers and storage not to mention consumer products.

What does this have to do with IOV and VIO, let alone VMworld and my virtual office, hang on, hold that thought for a moment, lets get the convergence aspect out of the way first.

On the I/O and networking convergence (e.g. Fibre Channel over Ethernet – FCoE) scene both Brocade (Converged Enhanced Ethernet-CEE) and Cisco (Data Center Ethernet – DCE) along with their partners are rallying around each others camps. This is similar to how a pair of prize fighters maneuvers in advance of a match including plenty of trash talk, hype and all that goes with it. Brocade and Cisco throwing mud balls (or spam) at each other, or having someone else do it is nothing new, however in the past each has had their core areas of focus coming from different tenets in some cases selling to different people in an IT environment or those in VAR and partner organizations. Brocade and Cisco are not alone nor is the I/O networking convergence game the only one in play as it is being complimented by the IOV and VIO technologies addressing different value propositions in IT data centers.

Now on to the IOV and VIO aspect along with VMworld.

For those of you that attended VMworld and managed to get outside of session rooms, or media/analyst briefing or reeducation rooms, or out of partner and advisory board meetings walking the expo hall show floor, there was the usual sea of vendors and technology. There were the servers (physical and virtual), storage (physical and virtual), terminals, displays and other hardware, I/O and networking, data protection, security, cloud and managed services, development and visualization tools, infrastructure resource management (IRM) software tools, manufactures and VARs, consulting firms and even some analysts with booths selling their wares among others.

Likewise, in the onsite physical data center to support the virtual environment, there were servers, storage, networking, cabling and associated hardware along with applicable software and tucked away in all of that, there were also some converged I/O and networking, and, IOV technologies.

Yes, IOV, VIO and I/O networking convergence were at VMworld in force, just ask Jon Torr of Xsigo who was beaming like a proud papa wanting to tell anyone who would listen that his wares were part of the VMworld data center (Disclosure: Thanks for the T-Shirt).

Virtensys had their wares on display with Bob Nappa more than happy to show the technology beyond an UhiGui demo including how their solution includes disk drives and an LSI MegaRAID adapter to support VM boot while leveraging off-the shelf or existing PCIe adapters (SAS, FC, FCoE, Ethernet, SATA, etc.) while allowing adapter sharing across servers, not to mention, they won best new technology at VMworld award.

NextIO who is involved in the IOV / VIO game was there along with convergence vendors Brocade, Cisco, Qlogic and Emulex among others. Rest assured, there are many other vendors and VARs in the VIO and IOV game either still in stealth, semi-stealth or having recently launched.

IOV and VIO are complimentary to I/O and networking convergence in that solutions like those from Aprius, Virtensys, Xsigo, NextIO and others. While they sound similar, and in fact there is confusion as to if Fibre Channel N_Port Virtual ID (FC_NPVID) and VMware virtual adapters are IOV and VIO vs. solutions that are focused on PCIe device/resource extension and sharing.

Another point of confusion around I/O virtualization and virtual I/O are blade system or blade center connectivity solutions such as HP Virtual Connect or IBM Fabric Manger not to mention those form Engenera add confusion to the equation. Some of the buzzwords that you will be hearing and reading more about include PCIe Single Root IOV (SR-IOV) and Multi-Root IOV (MR-IOV). Think of it this way, within VMware you have virtual adapters, and Fibre Channel Virtualization N_Port IDs for LUN mapping/masking, zone management and other tasks.

IOV enables localized sharing of physical adapters across different physical servers (blades or chassis) with distances measured in a few meters; after all, it’s the PCIe bus that is being extended. Thus, it is not a replacement for longer distance in the data center solutions such as FCoE or even SAS for that matter, thus they are complimentary, or at least should be considered complimentary.

The following are some links to previous articles and related material including an excerpt (yes, another plug ;)) from chapter 9 “Networking with you servers and storage” of new book “The Green and Virtual Data Center” (CRC). Speaking of virtual and physical, “The Green and Virtual Data Center” (CRC) was on sale at the physical VMworld book store this week, as well as at the virtual book stores including Amazon.com

The Green and Virtual Data Center

The Green and Virtual Data Center (CRC) on book shelves at VMworld Book Store

Links to some IOV, VIO and I/O networking convergence pieces among others, as well as news coverage, comments and interviews can be found here and here with StorageIOblog posts that may be of interest found here and here.

SearchSystemChannel: Comparing I/O virtualization and virtual I/O benefits – August 2009

Enterprise Storage Forum: I/O, I/O, It’s Off to Virtual Work We Go – December 2007

Byte and Switch: I/O, I/O, It’s Off to Virtual Work We Go (Book Chapter Excerpt) – April 2009

Thus I went to VMworld in San Francisco this past week as much of the work I do is involved with convergence similar to my background, that is, servers, storage, I/O networking, hardware, software, virtualization, data protection, performance and capacity planning.

As to the virtual work, well, I spent some time on airplanes this week which as is often the case, my virtual office, granted it was real work that had to be done, however I also had a chance to meet up with some fellow tweeters at a tweet up Tuesday evening before getting back in a plane in my virtual office.

Now, I/O, I/O, its back to real work I go at Server and StorageIO , kind of rhymes doesnt it!

I/O, I/O, Its off to Virtual Work and VMworld I Go (or went)

Summer Book Update and Back to School Reading

August and thus Summer 2009 in the northern hemisphere are swiftly passing by and start of a new school year is just around the corner which means it is also time for final vacations, time at the beach, pool, golf course, amusement park or favorite fishing hole among other past times. In order to help get you ready for fall (or late summer) book shopping for those with IT interests, here are some Amazon lists (here, here and here) for ideas, after all, the 2009 holiday season is not that far away!

Here’s a link to my Amazon.com Authors page that includes coverage of both my books, "The Green and Virtual Data Center" (CRC) and "Resilient Storage Networks – Designing Scalable Flexible Data Infrastructures" (Elsevier).

The Green and Virtual Data Center (CRC)Resilient Storage Networks - Designing Flexible Scalable Data Infrastructures (Elsevier)

Click here to look inside "The Green and Virtual Data Center" (CRC) and or inside "Resilient Storage Networks" (Elsevier).

Its been six months since the launch announcement of my new book "The Green and Virtual Data Center" (CRC) and general availability at Amazon.com and other global venues here and here. In celebration of the six month anniversary of the book launch (thank you very much to all who have bought a copy!), here is some coverage including what is being said, related articles, interviews, book reviews and more.

Article: New Green Data Center: shifting from avoidance to becoming more efficient IT-World August 2009

wsradio.com interview discussing themes and topics covered in the book including closing the green gap and shifting towards an IT efficiency and productivity for business sustainability.

Closing the green gap: Discussion about expanding data centers with environmental benefits at SearchDataCenter.com

From Greg Brunton – EDS/An HP Company: “Greg Schulz has presented a concise and visionary perspective on the Green issues, He has cut through the hype and highlighted where to start and what the options are. A great place to start your green journey and a useful handbook to have as the journey continues.”

From Rick Bauer – Storage Networking Industry Association (SNIA) – Education and Technology Director”
“Greg is one of the smartest “good guys” in the storage industry.
He has been a voice of calm amid all the “green IT hype” over the past few years. So when he speaks of the possible improvements that Green Tech can bring, it’s a much more realistic approach…”

From CMG (Computer Measurement Group) MeasureIT
I must admit that I have been slightly skeptical at times, when it comes to what the true value is behind all of the discussions on “green” technologies in the data center. As someone who has seen both the end user and vendor side of things, I think my skepticism gets heightened more than it normally would be. This book really helped dispel my skepticism.

The book is extremely well organized and easy to follow. Each chapter has a very good introduction and comprehensive summary. This book could easily serve as a blueprint for organizations to follow when they look for ideas on how to design new data centers. It’s a great addition to an IT Bookshelf. – Reviewed by Stephen R. Guendert, PhD (Brocade and CMG MeasureIT). Click here to read the full review in CMG MeasureIT.

From Tom Becchetti – IT Architect: “This book is packed full of information. From ecological and energy efficiencies, to virtualization strategies and what the future may hold for many of the key enabling technologies. Greg’s writing style benefits both technologists and management levels.”

From MSP Business Journal: Greg Schulz named an Eco-Tech Warrior – April 2009

From David Marshall at VMblog.com: If you follow me on Linked in, you might have seen that I had been reading a new book that came out at the beginning of the year titled, “The Green and Virtual Data Center” by Greg Schulz. Rather than writing about a specific virtualization platform and how to get it up and running, Schulz takes an interesting approach at stepping back and looking at the big picture. After reading the book, I reached out to the author to ask him a few more questions and to share his thoughts with readers of VMBlog.com. I know I’m not Oprah’s Book Club, but I think everyone here will enjoy this book. Click here to read more what David Marshal has to say.

From Zen Kishimoto of Altaterra Research: Book Review May 2009

From Kurt Marko of Processor.com Green and Virtual Book Review – April 2009

From Serial Storage Wire (STA): Green and SASy = Energy and Economic, Effective Storage – March 2009

From Computer Technology Review: Recent Comments on The Green and Virtual Data Center – March 2009

From Alan Radding in Big Fat Finance Blog: Green IT for Finance Operations – April 2009

From VMblog: Comments on The Green and Virtual Data CenterMarch 2009

From StorageIO Blog: Recent Comments and Tips – March 2009

From VMblog: Comments on The Green and Virtual Data CenterMarch 2009

From Data Center Links John Rath comments on “The Green and Virtual Data Center

From InfoStor Dave Simpson comments on “The Green and Virtual Data Center

From Sys-Con Georgiana Comsa comments on “The Green and Virtual Data Center

From Ziff Davis Heather Clancy comments on “The Green and Virtual Data Center”

From Byte & Switch Green IT and the Green Gap February 2009

From GreenerComputing: Enabling a Green and Virtual Data Center February 2009

From Sys-con: Comments on The Green and Virtual Data Center – March 2009

From ServerWatch: Green IT: Myths vs. Realities – February 2009

From Byte & Switch: Going Green and the Economic Downturn – February 2009

From Business Wire: Comments on The Green and Virtual Data Center Book – January 2009

Additional content and news can be found here and here with upcoming events listed here.

Interested in Kindle? Here’s a link to get a Kindle copy of "Resilient Storage Networks" (Elsevier) or to send a message via Amazon to publisher CRC that you would like to see a Kindle version of "The Green and Virtual Data Center". While you are at it, I also invite you to become a fan of my books at Facebook.

Thanks again to everyone who has obtained their copy of either of my books, also thanks to all of those who have done reviews, interviews and helped in many other ways!

Enjoy the rest of your summer!

Cheers – gs

Greg Schulz – twitter @storageio

StorageIO in the news

StorageIO is regularly quoted and interviewed in various industry and vertical market venues and publications both on-line and in print on a global basis. The following is coverage, perspectives and commentary by StorageIO on IT industry trends including servers, storage, I/O networking, hardware, software, services, virtualization, cloud, cluster, grid, SSD, data protection, Green IT and more.

Realizing that some prefer blogs to webs to twitters to other venues, here are some recent links among others to media coverage and comments by me on a different topics that are among others found at www.storageio.com/news.html:

  • Virtualization Review: Comments on Clouds, Virtualizaiton and Cisco move into servers – July 2009
  • SearchStorage: Comments on Storage Resource Managemet (SRM) and related tools – July 2009
  • SearchStorage: Comments on flash SSD – July 2009
  • SearchDataBackup: Comments on Data backup reporting tools’ trends – July 2009
  • SearchServerVirtualization: Comments on Hyper-V R2 matches VMware with 64-processor support – July 2009
  • SearchStorage: Comments on HP buying IBRIX for clustered and Cloud NAS – July 2009
  • Enterprise Storage Forum: Comments on HP buying IBRIX for clustered and Cloud NAS – July 2009
  • eWeek: Comments on NetApps next moves after DDUP and EMC – July 2009
  • Enterprise Storage Forum: Comments on NetApps next moves after DDUP and EMC – July 2009
  • SearchStorage: Comments on EMC buying DataDomain, NetApps next moves – July 2009
  • SearchVirtualization: Comments on Microsft HyperV features and VMware – July 2009
  • SearchITchannel: Comments on social media for business – June 2009
  • SearchSMBstorage: Comments on Storage Resource Management (SRM) for SMBs – June 2009
  • Enterprise Storage Forum: Comments on IT Merger & Acquisition activity – June 2009
  • Evolving Solutions: Comments on Storage Consolidation, Networking & Green IT – June 2009
  • Enterprise Storage Forum: Comments on EMC letter to DDUP – June 2009
  • SearchStorage: Comments on best practices for effective thin provisioning – June 2009
  • Processor: Comments on Cloud computing, SaaS and SOAs – June 2009
  • Serverwatch: Comments in How EMC’s World Pulls the Data Center Together – June 2009
  • Processor: Comments on Virtual Security Is No Walk In The Park – May 2009
  • SearchStorage: Comments on EPA launching Green Storage specification – May 2009
  • SearchStorage: Comments on Storage Provisioning Tools – May 2009
  • Enterprise Systems Journal: Comments on Tape: The Zombie Technology – May 2009
  • Enterprise Storage Forum: Comments on Oracle Keeping Sun Storage Business – May 2009
  • IT Health Blogging: Discussion about iSCSI vs. Fibre Channel for Virtual Environments – May 2009
  • IT Business Edge: Discussion about IT Data Center Futures – May 2009
  • IT Business Edge: Comments on Tape being a Green Technology – April 2009
  • Big Fat Finance Blog: Quoted in story about Green IT for Finance Operaitons – April 2009
  • SearchStorage: Comments on FLASH and SSD Storage – April 2009
  • SearchStorage AU: Comments on Data Classificaiton – April 2009
  • IT Knowledge Exchange: Comments on FCoE and Converged Networking Coming Together – April 2009
  • SearchSMBStorage: Comments on Data Deduplicaiton for SMBs – April 2009
  • SearchSMBStorage: Comments on Blade Storage for SMBs – April 2009
  • SearchStorage: Comments on MAID technology remaining underutilized – April 2009
  • SearchDataCenter: Closing the green gap: Expanding data centers with environmental benefits – April 2009
  • ServerWatch: Comments on What’s Selling In the Data Storage Market? – April 2009
  • ServerWatch: Comments on Oracle Buys Sun: The Consequences – April 2009
  • SearchStorage: Comments on Tiered Storage – April 2009
  • SearchStorage: Comments on Data Classification for Storage Managers – April 2009
  • wsradio.com Interview closing the Green Gap


  • IT Knowledge Exchange: Comments on FCoE eco-system maturing – April 2009
  • Internet Revolution: Comments on the Pre-mature death of the disk drive – April 2009
  • Enterprise Storage Forum: Comments on EMC V-MAX announcement – April 2009
  • MSP Business Journal: Greg Schulz named an Eco-Tech Warrior – April 2009
  • Storage Magazine: Comments on Power-smart disk systems – April 2009
  • Storage Magazine: Comments on Replication Alternatives – April 2009
  • StorageIO Blog: Comments and Tape as a Green Storage Medium – April 2009
  • Inside HPC: Recent Comments on Tape and Green IT – April 2009
  • Processor.com: Recent Comments on Green and Virtual – April 2009
  • SearchDataCenter: Interview: Closing the green gap: Expanding data centers with environmental benefits – April 2009
  • Enterprise Systems Journal: Recent Comments and Tips – March 2009
  • Computer Technology Review: Recent Comments on The Green and Virtual Data Center – March 2009
  • VMblog: Comments on The Green and Virtual Data Center – March 2009
  • Sys-con: Comments on The Green and Virtual Data Center – March 2009
  • Server Watch: Comments on IBM possibly buying Sun – March 2009
  • Bnet: Comments on IBM possibly buying Sun – March 2009
  • SearchStorage: Comments on Tiered Storage 101 – March 2009
  • SearchStorage: Comments – Cisco pushes into Servers March 2009
  • Enterprise Storage Forum: Comments – Cisco Entering Server Market March 2009
  • Enterprise Storage Forum: Comments – State of Storage Job Market – March 2009
  • SearchSMBStorage: Comments on SMB Storage Options – March 2009
  • Enterprise Storage Forum: Comments on Sun Proposes New Solid State Storage Spec – March 2009
  • Enterprise Storage Forum: Comments on Despite Economy, Storage Bargains Hard to Find – March 2009
  • TechWorld: Comments on Where to Stash Your Data – February 2009
  • ServerWatch: Green IT: Myths vs. Realities – February 2009
  • Byte & Switch: Going Green and the Economic Downturn – February 2009
  • CTR: Comments on Tape Hardly Being On Way Out – February 2009
  • Processor: Comments on SSD (FLASH and RAM) – February 2009
  • Internet News: Comments on Steve Wozniak joining SSD startup – February 2009
  • SearchServerVirtualization: Comments on I/O and Virtualization – February 2009
  • Technology Inc.: Comments on Data De-dupe for DR – February 2009
  • SearchStorage: Comments on NetApp SMB NAS – February 2009
  • Check out the Tips, Tools and White Papers, and News pages for additional commentary, coverage and related content or events.

    Ok, nuff said.

    Cheers gs

    Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press, 2011), The Green and Virtual Data Center (CRC Press, 2009), and Resilient Storage Networks (Elsevier, 2004)

    twitter @storageio

    All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2012 StorageIO and UnlimitedIO All Rights Reserved

    Recent tips, videos, articles and more

    Its been a busy year so far and there is still plenty more to do. Taking advantage of a short summer break, I’m getting caught up on some items including putting up a link to some of the recent articles, tips, reports, webcasts, videos and more that I have eluded to in recent posts. Realizing that some prefer blogs to webs to tweets to other venues, here are some links to recent articles, tips, videos, podcasts, webcasts, white papers and more that can be found on the StorageIO Tips, tools and White Papers pages.

    Recent articles, columns, tips, white papers and reports:

  • ITworld: The new green data center: From energy avoidance to energy efficiency August 2009
  • SearchSystemsChannel: Comparing I/O virtualization and virtual I/O benefits July 2009
  • SearchDisasterRecovery: Top server virtualization myths in DR and BC July 2009
  • Enterprise Storage Forum: Saving Money with Green Data Storage Technology July 2009
  • SearchSMB ATE Tips: SMB Tips and ATE by Greg Schulz
  • SearchSMB ATE Tip: Tape library storage July 2009
  • SearchSMB ATE Tip: Server-based operating systems vs. PC-based operating systems June 2009
  • SearchSMB ATE Tip: Pros/cons of block/variable block dedupe June 2009
  • FedTechAt the Ready: High-availability storage hinges on being ready for a system failure May 2009
  • Byte & Switch Part XI – Key Elements For A Green and Virtual Data Center May 2009
  • Byte & Switch Part X – Basic Steps For Building a Green and Virtual Data Center May 2009
  • InfoStor Technology Options for Green Storage: April 2009
  • Byte & Switch Part IX – I/O, I/O, Its off to Virtual Work We Go: Networks role in Virtual Data Centers April 2009
  • Byte & Switch Part VIII – Data Storage Can Become Green: There are many steps you can take April 2009
  • Byte & Switch Part VII – Server Virtualization Can Save Costs April 2009
  • Byte & Switch Part VI – Building a Habitat for Technology April 2009
  • Byte & Switch Part V – Data Center Measurement, Metrics & Capacity Planning April 2009
  • zJournal Storage & Data Management: Tips for Enabling Green and Virtual Efficient Data Management March 2009
  • Serial Storage Wire (STA): Green and SASy = Energy and Economic, Effective Storage March 2009
  • SearchSystemsChannel: FAQs: Green IT strategies for solutions providers March 2009
  • Computer Technology Review: Recent Comments on The Green and Virtual Data Center March 2009
  • Byte & Switch Part IV – Virtual Data Centers Can Promote Business Growth March 2009
  • Byte & Switch Part III – The Challenge of IT Infrastructure Resource Management March 2009
  • Byte & Switch Part II – Building an Efficient & Ecologically Friendly Data Center March 2009
  • Byte & Switch Part I – The Green Gap – Addressing Environmental & Economic Sustainability March 2009
  • Byte & Switch Green IT and the Green Gap February 2009
  • GreenerComputing: Enabling a Green and Virtual Data Center February 2009
  • Some recent videos and podcasts include:

  • bmighty.com The dark side of SMB virtualization July 2009
  • bmighty.com SMBs Are Now Virtualization’s “Sweet Spot” July 2009
  • eWeek.com Green IT is not dead, its new focus is about efficiency July 2009
  • SearchSystemsChannel FAQ: Using cloud computing services opportunities to get more business July 2009
  • SearchStorage FAQ guide – How Fibre Channel over Ethernet can combine networks July 2009
  • SearchDataCenter Business Benefits of Boosting Web hosting Efficiency June 2009
  • SearchStorageChannel Disaster recovery services for solution providers June 2009
  • The Serverside The Changing Dynamic of the Data Center April 2009
  • TechTarget Virtualization and Consolidation for Agility: Intels Xeon Processor 5500 series May 2009
  • TechTarget Virtualization and Consolidation for Agility: Intels Xeon Processor 5500 series May 2009
  • Intel Reduce Energy Usage while Increasing Business Productivity in the Data Center May 2009
  • WSRadio Closing the green gap and shifting towards an IT efficiency and productivity April 2009
  • bmighty.com July 2009
  • Check out the Tips, Tools and White Papers, and News pages for more commentary, coverage and related content or events.

    Ok, nuff said.

    Cheers gs

    Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press, 2011), The Green and Virtual Data Center (CRC Press, 2009), and Resilient Storage Networks (Elsevier, 2004)

    twitter @storageio

    All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2012 StorageIO and UnlimitedIO All Rights Reserved

    Green IT Confusion Continues, Opportunities Missed!

    I continue to see those looking for fast silver bullets in the quest to be green, efficient, optimized or sustainable while addressing issues ranging from power/energy, cooling, floor-space/footprint, EH&S (environmental health & safety) not to mention recycling. Yet, I’m also continued to be  amazed by the focus and emphasis around reduce as in reduce your capacity and your performance or processing capabilities in the form of consolidation or aggregation along with energy avoidance which for some is applicable.

    However, there is also the other side of the tale which is shifting from avoidance to becoming more efficient, that is doing more with what you have or with less while boosting productivity. For example, having a server or processor that can do more work in the same or smaller physical footprint drawing the same or less energy and requiring less cooling is a form of reducing overall impact yet boosting productivity. The same can be done with data and I/O networks, storage and even software.

    Similar to automobiles after the 1970s oil and energy crisis, the focus was on reduction, conservation and avoidance as the form of being efficient. Over time, this approach gave way to levering more efficient engines and vehicles that boosted the MPG city and highway, change in driving or usage habits, awareness of issues including applicable metrics and energy costs, as well as the continuing quest for alternative fuels.

    This is no different than what is happening with the IT organizations or compute focused entities in that there has been an initial focus of avoidance to meet short term tactical requirements, not to mention all of the green hype of a few years ago. Today there is a shift taking place towards efficiency and awareness that optimization and efficiency is more than consolidation, that it also includes boosting productivity as part of achieving reduced energy and cooling demands.

    How this can be done is to leverage multiple different techniques including new servers with processors that have intelligent power management (IPM) also known as adaptive voltage scaling (AVS) or other marketing terms enabling variable performance and energy consumption. For example, vary clock cycles and turn on cores when needed, then to turn off cores, slow clock speed down when there is less work to be done. Likewise there are improvements with cooling closer to the heat source ranging from leveraging inert liquid cooling inside the cabinet of computers to surface attached cooling to emerging micro cooling located inside silicon. There is a fascination with using virtualization to consolidate and reduce servers that are underutilized, which again is applicable for some environments and applications.

    However not all servers including many that are underutilized lend themselves to being consolidated for various reasons including quality of service (QoS) or performance, security, vendor support or software compatibility, politics or finance among others. This however does not mean that they cannot be virtualized, it more than likely mean that they cannot be consolidated. There is a common myth that virtualization equals consolidation and vice versa, however virtualization can also be used for abstraction, transparency, emulation and enabling agility including support for load-balancing, scale-up and scale-out performance oriented clustering among other uses. Thus there is another side of virtualization and that is to achieve   efficiency, life beyond consolidation.

    Needless to say there are many more technologies and techniques to address various issues now along with those that are emerging. The good news in all of this is the growing awareness that there are many different faces or facets of being green. That green wash and green hype may be on the endangered species list, that green means more than reducing carbon footprints or recycling or energy avoidance. That green is really about shifting and becoming more efficient, more optimized to support more processing, more work in a cost effective manner to sustain growth on a go forward basis. For high performance compute (HPC) or other large scale IT organizations, there is a notion that small improvements on a large broad scale have significant impact.

    Some organizations are in pursuit of technologies of solutions that promise significant saving ratios over small sets or instances, solutions that provide  smaller reduction or savings over a larger basis can prove to be more effective. For example, if power is a concern, powering down servers or storage that promises 85-100% savings might only be applicable to less than 5% of the devices. However, if 85-100% of the devices can be upgraded to newer models that boost productivity by 5-15% (or more) in the same or smaller footprint, using 5-15% (or more) less power, the results add up quickly. Think of it this way, a 1% saving for an environment using 1,000 kilo watt hour (kWh) or 1mWh of energy is a savings of 10kWh. The point being that for large environments, don’t forget to look at small savings that apply to a large installed base that then add up to big benefits.

    The net result is that one can pursue being green or being perceived as being green which can have a high cost, or, can pursue various efficiency that help the overall organization by boosting productivity, helping the top and bottom line, doing more in a smaller footprint and guess what, the result is not only economic, it’s also environmental positive. Thus, the byproduct of shifting towards efficiency (and not just avoidance) is to become green!

    Ok, nuff said.

    Cheers gs

    Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
    twitter @storageio

    All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved

    Mirror mirror on the wall, who’s the greenest of them all?

    If you subscribe to the notion that Green IT is all about carbon footprints, you may be missing out on some real opportunities to go green. After all, carbon is part of the green movement, there are many other aspects including supply chain, efficiency, sustainability in addition to recycling, not to mention optimizing power, cooling footprints in order to do more work in a productive manner.

    So who is the greenest of them all? Could it be Brocade, CA, Cisco, EMC, Hitachi, IBM, Intel, LSI, Microsoft, NetApp, Oracle, Symantec, VMware or 3PAR? What about the cloud crowd or perhaps one of the industry trade groups such as Green grid, SNIA GSI, Climate Savers Computing or Carbon disclosure project perhaps among others?

    You might be surprised, now granted, this list is for consumer products. However, given their broad adoption, and looking at Green as more than carbon impact, and with the EPA implanting Energy Star for Servers and now Energy Star for storage in the works, not to mention factoring in the green supply chain, have a look here.

    Here’s an interesting read about how the Internet is causing global warming. How ironic, given Al Gore’s carbon crusade, and the folk-lore claim about  (or mistaken have claimed) to have invented the Internet, no wonder he has been able to cash-in and transform Green to Gold.

    For those interested in saving money with efficient and optimized storage (e.g. the new Green) to boost productivity, here’s an article to check out.

    Ok, that’s enough "Green" fun for now.

    Cheers gs

    Clarifying Clustered Storage Confusion

    Clustered storage can be iSCSI, Fibre Channel block based or NAS (NFS or CIFS or proprietary file system) file system based. Clustered storage can also be found in virtual tape library (VTL) including dedupe solutions along with other storage solutions such as those for archiving, cloud, medical or other specialized grids among others.

    Recently in the IT and data storage specific industry, there has been a flurry of merger and acquisition (M&A) (Here and here), new product enhancement or announcement activity around clustered storage. For example, HP buying clustered file system vendor IBRIX complimenting their previous acquisition of another clustered file system vendor (PolyServe) a few years ago, or, of iSCSI block clustered storage software vendor LeftHand earlier this year. Another recent acquisition is that of LSI buying clustered NAS vendor ONstor, not to mention Dell buying iSCSI block clustered storage vendor EqualLogic about a year and half ago, not to mention other vendor acquisitions or announcements involving storage and clustering.

    Where the confusion enters into play is the term cluster which means many things to different people, and even more so when clustered storage is combined with NAS or file based storage. For example, clustered NAS may infer a clustered file system when in reality a solution may only be multiple NAS filers, NAS heads, controllers or storage processors configured for availability or failover.

    What this means is that a NFS or CIFS file system may only be active on one node at a time, however in the event of a failover, the file system shifts from one NAS hardware device (e.g. NAS head or filer) to another. On the other hand, a clustered file system enables a NFS or CIFS or other file system to be active on multiple nodes (e.g. NAS heads, controllers, etc.) concurrently. The concurrent access may be for small random reads and writes for example supporting a popular website or file serving application, or, it may be for parallel reads or writes to a large sequential file.

    Clustered storage is no longer exclusive to the confines of high-performance sequential and parallel scientific computing or ultra large environments. Small files and I/O (read or write), including meta-data information, are also being supported by a new generation of multipurpose, flexible, clustered storage solutions that can be tailored to support different applications workloads.

    There are many different types of clustered and bulk storage systems. Clustered storage solutions may be block (iSCSI or Fibre Channel), NAS or file serving, virtual tape library (VTL), or archiving and object-or content-addressable storage. Clustered storage in general is similar to using clustered servers, providing scale beyond the limits of a single traditional system—scale for performance, scale for availability, and scale for capacity and to enable growth in a modular fashion, adding performance and intelligence capabilities along with capacity.

    For smaller environments, clustered storage enables modular pay-as-you-grow capabilities to address specific performance or capacity needs. For larger environments, clustered storage enables growth beyond the limits of a single storage system to meet performance, capacity, or availability needs.

    Applications that lend themselves to clustered and bulk storage solutions include:

    • Unstructured data files, including spreadsheets, PDFs, slide decks, and other documents
    • Email systems, including Microsoft Exchange Personal (.PST) files stored on file servers
    • Users’ home directories and online file storage for documents and multimedia
    • Web-based managed service providers for online data storage, backup, and restore
    • Rich media data delivery, hosting, and social networking Internet sites
    • Media and entertainment creation, including animation rendering and post processing
    • High-performance databases such as Oracle with NFS direct I/O
    • Financial services and telecommunications, transportation, logistics, and manufacturing
    • Project-oriented development, simulation, and energy exploration
    • Low-cost, high-performance caching for transient and look-up or reference data
    • Real-time performance including fraud detection and electronic surveillance
    • Life sciences, chemical research, and computer-aided design

    Clustered storage solutions go beyond meeting the basic requirements of supporting large sequential parallel or concurrent file access. Clustered storage systems can also support random access of small files for highly concurrent online and other applications. Scalable and flexible clustered file servers that leverage commonly deployed servers, networking, and storage technologies are well suited for new and emerging applications, including bulk storage of online unstructured data, cloud services, and multimedia, where extreme scaling of performance (IOPS or bandwidth), low latency, storage capacity, and flexibility at a low cost are needed.

    The bandwidth-intensive and parallel-access performance characteristics associated with clustered storage are generally known; what is not so commonly known is the breakthrough to support small and random IOPS associated with database, email, general-purpose file serving, home directories, and meta-data look-up (Figure 1). Note that a clustered storage system, and in particular, a clustered NAS may or may not include a clustered file system.

    Clustered Storage Model: Source The Green and Virtual Data Center (CRC)
    Figure 1 – Generic clustered storage model (Courtesy “The Green and Virtual Data Center  (CRC)”

    More nodes, ports, memory, and disks do not guarantee more performance for applications. Performance depends on how those resources are deployed and how the storage management software enables those resources to avoid bottlenecks. For some clustered NAS and storage systems, more nodes are required to compensate for overhead or performance congestion when processing diverse application workloads. Other things to consider include support for industry-standard interfaces, protocols, and technologies.

    Scalable and flexible clustered file server and storage systems provide the potential to leverage the inherent processing capabilities of constantly improving underlying hardware platforms. For example, software-based clustered storage systems that do not rely on proprietary hardware can be deployed on industry-standard high-density servers and blade centers and utilizes third-party internal or external storage.

    Clustered storage is no longer exclusive to niche applications or scientific and high-performance computing environments. Organizations of all sizes can benefit from ultra scalable, flexible, clustered NAS storage that supports application performance needs from small random I/O to meta-data lookup and large-stream sequential I/O that scales with stability to grow with business and application needs.

    Additional considerations for clustered NAS storage solutions include the following.

    • Can memory, processors, and I/O devices be varied to meet application needs?
    • Is there support for large file systems supporting many small files as well as large files?
    • What is the performance for small random IOPS and bandwidth for large sequential I/O?
    • How is performance enabled across different application in the same cluster instance?
    • Are I/O requests, including meta-data look-up, funneled through a single node?
    • How does a solution scale as the number of nodes and storage devices is increased?
    • How disruptive and time-consuming is adding new or replacing existing storage?
    • Is proprietary hardware needed, or can industry-standard servers and storage be used?
    • What data management features, including load balancing and data protection, exists?
    • What storage interface can be used: SAS, SATA, iSCSI, or Fibre Channel?
    • What types of storage devices are supported: SSD, SAS, Fibre Channel, or SATA disks?

    As with most storage systems, it is not the total number of hard disk drives (HDDs), the quantity and speed of tiered-access I/O connectivity, the types and speeds of the processors, or even the amount of cache memory that determines performance. The performance differentiator is how a manufacturer combines the various components to create a solution that delivers a given level of performance with lower power consumption.

    To avoid performance surprises, be leery of performance claims based solely on speed and quantity of HDDs or the speed and number of ports, processors and memory. How the resources are deployed and how the storage management software enables those resources to avoid bottlenecks are more important. For some clustered NAS and storage systems, more nodes are required to compensate for overhead or performance congestion.

    Learn more about clustered storage (block, file, VTL/dedupe, archive), clustered NAS, clustered file system, grids and cloud storage among other topics in the following links:

    "The Many faces of NAS – Which is appropriate for you?"

    Article: Clarifying Storage Cluster Confusion
    Presentation: Clustered Storage: “From SMB, to Scientific, to File Serving, to Commercial, Social Networking and Web 2.0”
    Video Interview: How to Scale Data Storage Systems with Clustering
    Guidelines for controlling clustering
    The benefits of clustered storage

    Along with other material on the StorageIO Tips and Tools or portfolio archive or events pages.

    Ok, nuff said.

    Cheers gs

    Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
    twitter @storageio

    All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved

    Worried about IT M&A, here come the new startups!

    Storage I/O trends

    Late last year , I did a post (see here) countering the notion that there is a lack of innovation in IT and specifically around data storage. Recently I did a post about a Funeral for Friend, not to mention yesterdays post about Summer marriages.

    For those who are concerned about lack of innovation, or, that consolidation will result in just a few big vendors, here’s some food for thought. Those big vendors in addition to growing via internal organic growth, also grow by buying or merging with other vendors. Those other vendors emerge as startups, some grow, blossom and are bought, some make a decent business on their own, some are looking to be bought, some need to be bought, some will see fire sales, liquidation or simply closing their doors and perhaps re-launching as a new company.

    With all the M&A activity currently that has taken place, and I’m sure (speculation only ;) ) that there will be plenty more, here’s a short and far from comprehensive list of some startups or companies you may not have heard of yet. There are additional ones who are still in deep stealth, some on the list are still in stealth, yet talking and letting information trickle out, thus only non-NDA information is being shown here. In other words, you can find out about these via publicly available information and sources.

    Something that I have noticed and talked with others in the industry about is that this generation of startups, at least for now are taking a far more low-key approach to their launches than in the past. Gone at least for now are the Dot COM era over the top announcements in some cases before there was even a product or shipping for actual customer production deployment scenario. This crop or corps of startups are taking their time leveraging the current economic situation to further incubate their technologies and go to market strategies, not to mention minimizing the amount of over the top VC funding we have seen in the past. Some of these may not appear to be storage related and that would be correct. This list includes those associated with data infrastructure technolgies from servers, to storage to networking, hardware, software and services among othes as a common theme.

    Disclosure Notice: None of these companies mentioned are nor have ever been clients of StorageIO. Why do I mention this, why not!

    Balesio – File compression solutions
    Box.net – Internet/web/cloud storage service with high availability and backup
    Cirrustore – Backup data protection tools
    Dataslide – Hard rectangular disk (HRD)
    Enclarity – Healthcare CRM and analysis tools
    Enstratus – Amazon cloud computing management tools
    Exludas – Multi core optimize
    Firescope – CMDB data solutions
    Greenbytes – ZFS based storage management solutions
    Likewise – Open backup software for macs/linux/windows
    Liquidcomputing – High density servers
    Maxiscale – Web infrastructure (Stealth)
    Metalogix – Archiving solutions
    Neptuny – Capacity Planning
    Netronome – Network and I/O optimization technology
    Newboundary – IT policy management and IRM tools
    Nexenta ZFS – based storage management solutions
    Pergamumsystems – Archive solutions (Stealth)
    Pranah – SMB Storage vendor formerly known as Marner
    Procedo – Archiving and migration solutions
    Rebit – Backup and data protection solutions
    Rightscale – Amazon cloud computing management tools
    Rmsource – Cloud backup solutions
    RNAnetworks – Virtual memory management solutions
    Scale Computing – Clustered storage management software
    ScaleMP – Multi-core virtualization for scale out
    SiberSystems – Goodsync data protection solutions
    Sparebackup – Backup data protection solutions
    StorageFusion – Storage resource analysis
    Storspeed – NAS/NFS optimization solutions (Stealth)
    Sugarsync – Backup and data protection solutions
    Surgient – Cloud computing solutions
    Synology – SMB storage solutions
    TwinStrata – BC/DR analysis and assessment tools
    Vadium – Security and encryption tools
    Vembu – Backup data protection tools
    Versant – Object database management solutions
    Vipre – Security, data loss, data leak prevention
    VirtenSys – Virtual I/O and I/O virtualization (IOV)
    Vizrt – Video management software tools
    WhipTail – Flash SSD solutions
    Xenos – Archive and data footprint reduction solutions

    Links to the above along with many other companies including manufactures and vars can be found on the Interesting Links page at StorageIO.

    Food for thought for your summer technology picnic fun.

    Nuf said for now.

    Ok, nuff said.

    Cheers gs

    Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press, 2011), The Green and Virtual Data Center (CRC Press, 2009), and Resilient Storage Networks (Elsevier, 2004)

    twitter @storageio

    All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2012 StorageIO and UnlimitedIO All Rights Reserved

    Summer Weddings: EMC+Datadomain and HP+IBRIX

    Storage I/O trends

    Are you friend or family of the bride or groom?

    Here’s comes the bride! (Audio)

    That’s a question me and Mrs. Schulz were asked recently when we attended a wedding.

    Summer months particularly June and August are known as wedding months (Hmmm, more merger & acquisition activity to come?). Summer is a nice time of the year for marriages at least in the U.S. and how ironic that we have already seen two well publicized IT data storage industry unions in the past couple of weeks, not to mention other smaller less publicized ones.

    In one case, the California based bride (Datadomain-DDUP) had two courtiers (Massachusetts based EMC and California based NetApp, plus rumors of others). Fortunately one of those had a prenuptial that earned them a cool $57 million for their efforts (NetApp-NTAP) when EMC won the bride. Read more including some of my comments and perspectives among others about EMC, NTAP and DDUP here and here.

    Yesterday, on a mid-July Friday, when things are normally quiet, in true wedding industry forum, news was released (here and here) that California based HP announced that it had bought Massachusetts based data and storage management software vendor IBRIX.

    That’s a lot of activity involving California and Massachusetts in the past couple of weeks, not to mention the tornado sightings in the vicinity of EMCs Hopington Massachusetts headquarters coincidently around the same time the marriage to DDUP was formerly announced! What’s’ next, Aerosmith is out on tour, perhaps the Del Fuegos or Boston will perform at one of these wedding parties?

    Within the data storage industry, publicly traded Datadomain (DDUP) is fairly well known to many for their role in helping to popularize the data footprint impact reduction technique refereed to as de-duplication (e.g. normalization, commonality factoring, intelligent compression, etc.). Adding to the awareness of DDUP was the recent highly public courtship with EMC eventually out-bidding NTAP with a dowry of about $2.1B USD. That type of press coverage and monetary amounts might normally be expected for the likes of a Madonna, Brittney Spears, Michael Jackson-RIP, Paris Hilton, Elizabeth Taylor or other celebrity unions covered by paparazzi with a similar number of attorneys involved.

    On the other hand, IBRIX while known to some, is a lessor known entity compared to DDUP having taken a lower profile than even some of their close competitors. However for those who have been following and covering the clustered storage market (see here, here, here, here, here, here, here, here and here ), IBRIX is a well known entity.

    IBRIX also has had ties to EMC having been involved in a pre-mari age affair with an reseller arrangement along with being "rumored" ;) to have been involved with ATMOS cloud or policy based storage solution formerly known as "Hulk". IBRIX has also quietly been involved with others like Dell as well as HP in similar to EMC reseller arrangements. Where IBRIX has been positioned is to address high performance, scale out parallel or concurrent clustered file system needs, both big and small I/O, sequential and random data storage and access. For example, in the media/entertainment and other industries along with enabling large Internet providers a bulk (low cost, high capacity) scale-out NAS (NFS & CIFS) option.

    One of the reasons that IBRIX has been involved with the likes of EMC, Dell and HP among others is that unlike other vendors such as BlueArc, the once high-flying Isilon, NetApp, Onstor or Panasas, not to mention EMC Cellera NAS , is that those solutions are all bundled with proprietary hardware while IBRIX is software based. Where IBRIX Fusion fits is to enable NAS storage solutions using industry standard hardware (servers and storage) that are capable of being configured for both high performance compute (HPC) along with for low-cost general purpose bulk storage to support Web 2.0, social networking, home directories or on-line archives.

    Consequently, and HP or Dell who just happen to sell servers, have had the ability of meeting large scale out and scale up NAS file serving applications by re-selling IBRIX installed on their servers or blade servers with either their own entry to mid-range lower cost, high performance and high capacity storage along with that of 3rd party vendors.

    Ironically one of IBRIX’s competitors in the software NAS solution market was and remains PolyServe, software that HP acquired a couple of years ago to create their own scale out NAS solution (e.g. EFS). Other software based solutions include among others Lustre (Sun), CXFS (SGI), EMC ATMOS (I’m sure some will argue this is not scale out or NAS, will leave it at that for now) ;) not to mention those from IBM, Microsoft, Quantum (also re-sold by HP) or Symantec.

    What does HP get with IBRIX?

    Simple, the ability to own the IP (intellectual proprietary) that one of their competitors had been "rumored" to have been working with at one point, IP that their competitors had been reselling like themselves.

    Thus HP gets more software IP that can and has been sold along with their hardware such as the Proliant servers and blade servers giving their customers choice, similar to what HP and other vendors do with their open servers. For example, HP had the ExDS9000 extreme storage system built on a blade server with high density, low cost, high capacity HP storage (e.g. HP Modular Disk System 600, HP MSA or even EVA).

    This makes for a nice solution for bulk on-line and near-line storage applications where the emphasis is not as much on performance, rather massive scalability for storing on-line documents, archives, videos, images and other unstructured content which is where there is a lot of growth activity. The challenge is that the ExDS9x00 has only been available with the HP PolyServe software which works good for some environments, yet, for others, the clustered file system scale out capabilities of IBRIX were deployed.

    With the addition of IBRIX, HP now should be able to provide their customers and prospects the choice of software to meet specific needs while maintaining an HP footprint, that is both hardware, software and services. HP has several different storage software stacks that they now own (e.g. Lefthand for clustered iSCSI, PolyServe for NFS/CIFS NAS, IBRIX for Clustered File system scale out NAS) not to mention those that it OEMS including among others Bycast (Medical Archive System) that is also OEM’d by IBM as their Medical Grid combined with IBM SOFS, Quantum StorNext and Microsoft Windows Storage Server and Sepaton (VTL and Dedupe) to name a few.

    Do I think this was a good move by HP?

    Yes as it gives them control over IP that they had been reselling as had some of their competitors who left IBRIX to HP to grab up. HP now has the IP which they can package with their hardware similar to how they have been doing, and giving customers choices to align the right hardware and software technology to the task at hand.

    Whether it be Bycast for medical archiving, PolyServe or IBRIX for scale out NAS, Lefthand for clustered iSCSI, Sepaton for VTL and dedupe, Microsoft, Quantum StorNext for shared block storage serving or any of the other software packages HP offers with their industry standard servers, the customer has options.

    For IBRIX customers and prospects, this move will give them a boost in a confidence that their decisions and investments are safe.

    Ironically, vendors like Symantec with their Scaleable File Serving (SFS) clustered NAS solution that is also software based and runs on anyone’s open servers including those from HP gets a potential shot in the arm with HP validating the model and approach for bulk-storage and clustered NAS (Oh Mr. Salem, Mr. Dell is holding on Line 1, Mr. Chambers is on line 2 and Mr. Ellison on line 3 ;) )

    Who’s going to be at the alter next? IMHO, I would keep an eye on (and this all just pure speculation) Bycast, Symantec, EMLX (Broadcom was a wake up call), Quantum, Sepaton, STEC, StorMagic, or ACS, maybe even 3PAR among other possibilities (think outside of the lines). I would not rule out a major game changer such as someone buying NetApp or the likes of an HP buying an EMC or Oracle buying a CSC, maybe even a CSCO buying someone like NTAP, how about Oracle buying NTAP and putting some attorneys out of work, not to mention, who will MSFT hook up with? Anything is possible as we have seen and traditional M&A wisdom is out the window.

    Have fun at the next wedding you attend, go easy on the cake and wedding punch, especially if you will be doing any dancing (please, no You tube videos of the chicken dance) and be careful throwing rice or other items.

    Ok, nuff said.

    Cheers gs

    Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
    twitter @storageio

    All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved