It only makes sense that a book about Clouds, Virtualization, Data Storage and Networking be available via a cloud service in electronic format. Today Amazon and my publisher (CRC Press Taylor and Francis) released a Kindle version of my new book Cloud and Virtual Data Storage Networking, which joins the previously released hardcopy version, also available at Amazon.com among other venues.
Cloud and Virtual Data Storage Networking has been declared The New Enterprise Tech Bible by noted industry blogger and host of the Nekkid Tech (@NekkidTech) podcast, Greg Knieriemen (@Knieriemen). Check out Episode #11 (The Enterprise Tech Bible) of the Nekkid Tech podcast here.
Comments and reviews about Cloud and Virtual Data Storage Networking can be found at Amazon.com, along with those from Stephen Guendert, PhD (@DrSteveGuendert) at CMG MeasureIT (@cmgnews), who says: "Greg's latest book is the ibuprofen that will make these cloud computing information overload headaches go away. Cloud and Virtual Data Storage Networking is the single source you can read to get a clear understanding of the fundamentals of the cloud."
Greg Brunton of EDS, an HP Company, commented: "With all the chatter in the market about cloud storage and how it can solve all your problems, the industry needed a clear breakdown of the facts and how to use cloud storage effectively. Greg's latest book does exactly that."
Want to know more besides viewing the Google preview above?
Molly Rector, VP of marketing at tape vendor Spectra Logic, mentioned that what she does not like is companies that base their business plan on patent law trolling. I would have expected something different, along the lines of countering or correcting people who say tape sucks, tape is dead, or that tape is the cause of anything wrong with storage, thus clearing the air or putting up a fight for tape. Go figure…
Another of my comments involved clouds, of which there are plenty of conversations taking place. I do like clouds (I even recently wrote a book involving them), however I'm a fan of using them where applicable to coexist with and enhance other IT resources. Don't be scared of clouds, however be ready: do your homework, listen, learn, and do proof of concepts to decide best practices along with when, where, what and how to use them.
Mike Karp (aka twitter @storagewonk), an analyst with Ptak Noel, mentions that midrange environments don't get respect from big (or even startup) vendors.
I would take that a step further by saying that compared to six or so years ago, SMBs are getting far better respect and attention from most vendors; what is lacking is respect for the SOHO sector (e.g. the lower end of SMB down to, or just above, consumer).
Granted, some that have traditionally sold into those sectors, such as server vendors including Dell and HP, get it or at least see the potential, along with traditional enterprise vendor EMC via its Iomega unit. Yet I still see many vendors, including startups, discounting, shrugging off or sneering at the SOHO space, similar to those who dissed or did not respect the SMB space several years ago. Like the SMB space, SOHO requires different products, packaging, pricing and routes to market via channel or etail mechanisms, which means change for some vendors. Those vendors who embraced the SMB and realized what needed to change to adapt to those markets will also stand to do better with the SOHO.
Here is the reason that I think SOHO needs respect.
Simple: SOHOs grow up to become SMBs, SMBs grow up to become SMEs, SMEs grow up to become enterprises, not to mention that the amount of data being generated, moved, processed and stored continues to grow. The net result is that SMB along with SOHO storage demands will continue to grow, and those vendors who can adjust to support those markets will also stand to gain new customers that in turn become candidates for other solution offerings.
Not surprisingly, Eran Farajun of Asigra, which was doing cloud backups decades before they were known as clouds, loves backup (and restores). However I am surprised that Eran did not jump on the "it's time to modernize and re-architect data protection" theme. Oh well, I will have to have a chat with Eran on that sometime.
What was surprising were comments from Panzura, who have a good distributed (read also: cloud) file system that can be used for various things including online reference data. Panzura has a solution that normally I would not even think about in the context of being pulled into a Data Domain or dedupe appliance type discussion (e.g. tape sucks or other similar themes). So it is odd that they are playing to the tape sucks camp and theme vs. playing to where the technology can really shine, which IMHO is the global, distributed, scale out and cloud file system space. Oh well, I guess you go with what you know or what has worked in the past to get some attention.
Molly Rector of Spectra also mentioned that she likes High Performance Computing; I am surprised that she did not throw in high productivity computing as well, in conjunction with big data, big bandwidth, green, dedupe, power, disk, tape and related buzzword bingo terms.
Also there are some comments from me about cost cutting.
While I see the need for organizations to cut costs during tough economic times, I'm not a fan of simply cutting cost for the sake of cost cutting, as opposed to finding and removing the complexity that drives up the cost of doing work. In other words, I'm a fan of finding and removing waste, becoming more effective and productive, and removing the cost of doing a particular piece of work. This in the end meets the aim of bean counters to cut costs, however it can be done in a way that does not degrade service levels or the customer service experience. For example, instead of looking to cut backup costs, do you know where the real costs of doing data protection exist (hint: swapping out media is treating the symptoms), and if so, what can be done to streamline those from the source of the problem downstream to the target (e.g. media or medium)? In other words, redesign, review and modernize how data protection is done, and leverage data footprint reduction (DFR) techniques including archive, compression, consolidation, data management, dedupe and other technologies in effective and creative ways. After all, return on innovation is the new ROI.
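To make that concrete, here is a back of the envelope sketch (in Python) of how different DFR techniques can compound; the ratios and capacities below are illustrative assumptions on my part, not measured results from any product:

```python
# Illustrative DFR estimate; every ratio below is an assumption, not a measurement
primary_tb = 100.0        # hypothetical primary storage footprint

archive_fraction = 0.40   # assume 40% of data is inactive and can be archived
compress_ratio = 2.0      # assume 2:1 compression on the remaining active data
dedupe_ratio = 5.0        # assume 5:1 dedupe on backup copies

active_tb = primary_tb * (1 - archive_fraction)   # 60 TB stays on primary
compressed_tb = active_tb / compress_ratio        # 30 TB after compression
backup_tb = primary_tb / dedupe_ratio             # 20 TB of deduped protection copies

print(f"Active data after archive + compression: {compressed_tb:.0f} TB")
print(f"Backup footprint after dedupe: {backup_tb:.0f} TB")
```

The point is not the specific numbers, rather that combining techniques across primary and protection copies reduces the footprint more than any single tool on its own.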
Check out Drew's article here to read more on the above topics and themes.
Seagate recently announced the next generation Momentus XT Hybrid Hard Disk Drive (HHDD) with a capacity of 750GB in a 2.5 inch form factor and an MSRP of $245.00 USD, including integrated NAND flash solid state device (SSD). As a refresher, the Momentus XT is an HHDD in that it includes a 4GB NAND flash SSD integrated with a 500GB (or larger) 7,200 RPM hard disk drive (HDD) in a single 2.5 inch package.
HHDD with integrated NAND flash SSD, photo courtesy Seagate.com
What is new with the new generation? Besides extra storage space capacity of up to 750GB (was 500GB), there is twice as much single level cell (SLC) NAND flash memory (8GB vs. 4GB in the previous generation) along with an enhanced 6Gb per second SATA interface that supports native command queuing (NCQ) for better performance. Note that NCQ was also available on the previous generation Momentus XT, which used a 3Gb SATA interface. Other enhancements include a larger block or sector size of 4,096 bytes vs. the traditional 512 bytes on previous generation storage devices.
This bigger sector size results in less overhead when managing data blocks on large capacity storage devices. Also new are caching enhancements: FAST Factor Flash Management, FAST Factor Boot and Adaptive Memory Technology. Not to be confused with EMC Fully Automated Storage Tiering (the other FAST), Seagate FAST is technology that exists inside the storage drive itself. FAST Factor Boot enables systems to boot and be productive at speeds similar to SSD, or several times faster than traditional HDDs.
FAST Factor Flash Management provides the integrated intelligence to maximize use of the NAND flash or SSD capabilities along with the spinning HDD to boost performance while maintaining compatibility with different systems and their operating systems. In addition to performance and interoperability, data integrity and SSD flash endurance are also enhanced for investment protection. Adaptive Memory Technology is a self learning algorithm that gives SSD like performance for often used applications and data, closing the storage capacity to performance gap that has grown along with data center bottlenecks.
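Seagate has not published how Adaptive Memory works internally, but conceptually a self learning cache promotes the most frequently read blocks into the flash portion of the drive. Here is a toy sketch of that idea in Python; it is my simplification for illustration, not Seagate's actual algorithm:

```python
from collections import Counter

class ToyAdaptiveCache:
    """Toy model of hot block promotion; not Seagate's actual implementation."""

    def __init__(self, flash_slots: int = 2):
        self.access_counts = Counter()   # how often each block has been read
        self.flash_slots = flash_slots   # limited NAND flash capacity

    def read(self, block: int) -> str:
        # Blocks already promoted to flash are served fast; others hit the disk
        hot = {b for b, _ in self.access_counts.most_common(self.flash_slots)}
        self.access_counts[block] += 1
        return "flash hit" if block in hot else "disk read"

cache = ToyAdaptiveCache()
for block in [1, 1, 2, 3, 1, 4, 2, 1]:
    print(f"block {block}: {cache.read(block)}")
```

Repeated reads of the same blocks (think boot files and frequently used applications) quickly become flash hits, which is the behavior behind the faster boot times described above.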
Some questions and discussion comments:
When to use SSD vs. HDD vs. HHDD? If you need the full speed of SSD to boost performance across all data access and cost is not an issue for the capacity you need, that is where you should be focused. However if you are looking for the lowest total cost of storage capacity with no need for performance, then lower cost high capacity HDDs should be on your shopping list. On the other hand, if you want a mix of performance and capacity at an effective price, then HHDDs should be considered.
Why the price jump compared to the first generation HHDD? IMHO, it has a lot to do with current market conditions, supply and demand.
With recent floods in Thailand and forecasted HDD and other technology shortages, the law of supply and demand applies. This means that the supply of some products may be constrained, causing demand to rise for others. Your particular vendor or supplier may have inventory, however they will be less likely to heavily discount while there are shortages or market opportunities to keep prices high. There are already examples of this if you check around on various sites to compare prices now vs. a few months ago. Granted, it is the holiday shopping season for both people and organizations spending the last of their available budgets, so there is more demand for available supplies.
What kind of performance or productivity have I seen with HHDDs? While I have not yet tested and compared the second generation or new devices, I can attest to the performance improvements resulting in better productivity over the past year using Seagate Momentus XT HHDDs compared to traditional HDDs. Here is a post that you can follow to see some boot performance comparisons as part of some virtual desktop infrastructure (VDI) sizing testing I did earlier this year that included both HHDD and HDD.
When will I jump on the SSD bandwagon? Great question, I have actually been on the SSD train for several decades using them, selling them, covering, analyzing and consulting around them along with other storage mediums including HDD, HHDD, cloud and tape. I have some SSDs and will eventually put them into my laptops, workstations and servers as primary storage when the opportunity makes sense.
Will HHDDs help backup and other data protection tasks? Yes, in fact I initially used my Momentus XTs as backup or data protection targets, as well as for moving large amounts of data between systems faster than what my network could support.
Why not use a SSD? If you need the performance and can afford the price, go SSD!
On the other hand, if you are looking to add a small 64GB, 128GB or even 256GB SSD while retaining a larger capacity, slower and lower cost HDD, an HHDD should be considered as an option. By using an HHDD instead of both an SSD and an HDD, you avoid having to figure out how to install both in space constrained laptops, desktops or workstations. In addition, you avoid having to either manually move data between the different devices or acquire software or drivers to do that for you.
How much does the new Seagate Momentus XT HHDD cost? The Manufacturer's Suggested Retail Price (MSRP) is listed at $245 for the 750GB version.
Does the Momentus XT HHDD need any special drivers, adapters or software? No, they are plug and play. There is no need for caching or performance acceleration drivers, utilities or other software. Likewise there is no need for tiering or data movement tools.
How do you install an HHDD into an existing system? It is similar to installing a new HDD to replace an existing one, if you are familiar with that process. If not, it goes like this (or use your own preferred approach); a hedged command line sketch follows the steps below.
Attach a new HHDD to an existing system using a cable
Utilize a disk clone or image tool to make a copy of the existing HDD to the HHDD
Note that the system may not be able to be used during the copy, so plan ahead.
After the clone or image copy is made, shut down the system, remove the existing HDD and replace it with the HHDD that was connected to the system during the copy (remember to remove the copy cable).
Reboot the system to verify all is well; note that it will take a few reboots before the HHDD starts to learn your data and files along with how they are used.
Regarding your old HDD: save it, put it in a safe place and use it as a disaster recovery (DR) backup. For example, if you have a safe deposit box or somewhere else safe, put it there for when you need it in the future.
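For those comfortable at a command line, here is a minimal sketch of the clone step using Python to drive dd on a Linux system. The device paths are hypothetical examples; verify yours with lsblk before running anything, since copying to the wrong device destroys data, and the command needs root privileges:

```python
import subprocess

# Hypothetical device paths; confirm with `lsblk` before running (run as root)
SOURCE_HDD = "/dev/sda"    # existing internal drive
TARGET_HHDD = "/dev/sdb"   # HHDD attached via the USB to SATA cable

def clone_disk(source: str, target: str) -> None:
    """Block level copy of the source drive onto the target HHDD using dd."""
    subprocess.run(
        ["dd", f"if={source}", f"of={target}", "bs=4M", "status=progress"],
        check=True,  # raise an error if dd exits nonzero
    )

if __name__ == "__main__":
    clone_disk(SOURCE_HDD, TARGET_HHDD)
```

A dedicated clone or image tool with verification is the safer route; the sketch just shows what the copy step boils down to.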
Seagate Momentus XT and USB to SATA cable
Can an HHDD fit into an existing slot in a laptop, workstation or server? Yes. In fact, unlike an HDD and SSD combination, which requires multiple slots or forces one device to be external, HHDDs like the Momentus XT simply use the space where your current HDD is installed.
How do you move data to it? Beyond the first installation described above, the HHDD appears as just another local device meaning you can move data to or from it like any other HDD, SSD or CD.
Do you need automated tiering software? No, not unless you need it for some other reason or if you want to use an HHDD as the lower cost, larger capacity option as a companion to a smaller SSD.
Do I have any of the new or second generation HHDDs? Not yet, maybe soon, and I will do another Momentus moment post when that time arrives. For the time being, I will continue to use the first generation Momentus XT HHDDs.
Bottom line (for now): if you are considering large capacity HDDs, check out the HHDDs for an added performance boost, including faster boot times as well as quicker access to other data.
On the other hand if you want an SSD however your budget restricts you to a smaller capacity version, look into how an HHDD can be a viable option for some of your needs.
Solid state devices (SSD) are a popular topic gaining both industry adoption and customer deployment to speed up storage performance. Here is a link to a recent conversation that I had with John Hillard to discuss industry trends and perspectives pertaining to using SSD to boost performance and productivity for SMB and other environments.
SSDs can be a great way for organizations to do IO consolidation to reduce costs, in place of using many hard disk drives (HDDs) grouped together to achieve a certain level of performance. By consolidating the IOs off of many HDDs that often end up being underutilized on a space capacity basis, organizations can boost performance for applications while reducing, or reusing, HDD based storage capacity for other purposes including growth.
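As a back of the envelope illustration of that consolidation math (the per device IOPS figures below are rough rules of thumb I am assuming for the example, not vendor specifications):

```python
# Rough rule of thumb numbers, assumed for illustration only
HDD_IOPS_15K = 180     # approximate random IOPS for one 15K RPM HDD
SSD_IOPS = 20_000      # approximate random IOPS for one SSD (varies widely)

target_iops = 9_000    # hypothetical application requirement

hdds_needed = -(-target_iops // HDD_IOPS_15K)   # ceiling division: 50 drives
ssds_needed = -(-target_iops // SSD_IOPS)       # 1 drive

print(f"{hdds_needed} x 15K HDDs vs. {ssds_needed} x SSD for {target_iops} IOPS")
```

Fifty spindles bought for IOPS usually means a lot of stranded capacity, which is exactly the space that can be reduced or reused once the IOs move to SSD.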
Have you ever read an article, blog post or a book and said to yourself that you could do that, perhaps even better?
Well, unless you have already done so, what are you waiting for to write a book, blog, article or create some other form of content using different mediums or venues?
The other evening I attended a local Stillwater (ArtReach St. Croix) event (Publishers Forum) with my wife (karenofarcola.com). Karen is working on getting her first book (fiction for children and young adults) published, so she was interested in meeting the different publishers. For me, I wanted to learn about the local publishers and hear what they had to say, in addition to meeting the purveyor of a local book store (Valley Bookseller) who helped promote the event.
It was interesting listening to the panel, made up of a nonprofit publisher (Milkweed Editions), a full service self publishing venue (Beaver's Pond Press) and a regional publishing house (Tristan Publishing).
Having formally published books (e.g. with traditional publishers (Elsevier and CRC/Taylor Francis), ISBNs, and Library of Congress (LOC) registration) along with contributing to other projects, not to mention over a thousand articles, tips, reports, white papers, solution briefs, videos and other content, I often get asked what it takes to write a book, blog or other material.
I also get told by people that they could do a better job, to which I ask: then why don't they do something about it vs. simply saying that they could do something better?
Back to the ArtReach St. Croix publishers forum event: the attendees were mainly aspiring authors looking to get their first works published. Having already been down the path that many in the room were looking to go (getting published), it was interesting to hear the various questions and discussion topics. Some of those questions were about the process of self publishing vs. working with a publisher (large or small), in addition to how much it costs or how to get discovered. It was also great to hear the panelists discuss some of the hurdles authors face in getting their books published along with promoting their works.
Something I learned several years ago, before I did my first solo book, was a tip from another author about the importance of promotion. That is, your publisher will help enable, however it is up to you, the author, to promote your works by creating a platform or means of interacting with different audiences. Consequently it was fun to hear the panelists talk with the authors about the importance of creating a platform, including a blog, Twitter, Google Plus, Facebook, doing articles and appearances to help create awareness. What was fun to watch were the authors who seemed more comfortable with creating their works and then waiting for the results to occur, as opposed to helping make their work a success.
Anyways, for those who are aspiring to write a book, blog or article, or even for those who are content being armchair authors or Monday morning quarterbacks, here is a link to a series about how to write a book or blog. The series (how to write a book or blog) can be read over at the VMware communities site that I'm contributing to as a vExpert.
Oh, and for you aspiring authors or bloggers wondering about creating and developing a platform, what you are reading here is an example of doing just that. In other words, my platform includes what you are reading here in addition to my regular blog and other venues including Google Plus (G+), Facebook, LinkedIn and Twitter.
So what are you waiting for? Go get your book or blog or article written and published, and start promoting it.
When you hear HDS in the context of information technology do you think of Hitachi Data Systems?
Along with a bunch of other IT industry advisors, analysts, bloggers, consultants, financiers and pundits or influencers, I'm attending an event being sponsored by HDS this week in San Jose, California (SJC).
For those not familiar, as a division of the much larger Japan based conglomerate named Hitachi, HDS sells various types of data storage systems and associated management tools along with services.
While on the airplane from Seattle (SEA) to SJC the other night (disclosure: HDS picked up the one way coach ticket), it occurred to me that HDS could refer to different things besides Hitachi Data Systems.
In addition to being the International Air Transport Association (IATA) code for Hoedspruit Airport in South Africa, where HDS is in the process of buying Shoeden Data Systems (SDS), here are some other possibilities of what HDS could mean.
Now that Halloween and talk of zombies has passed (at least for now), next up on the social or holiday calendar in the U.S. is Thanksgiving, which means turkey themes.
With turkey themes in mind, how about some past, current and maybe future technology flops, or a "where are they now"?
A technology turkey can be a product, trend, technique or theme that was touted (or hyped) and then flopped, for various reasons not flying up to or meeting its expectations. That means a technology turkey may have had industry adoption, however it lacked customer deployment.
Let's try a few. How about holographic storage, or is that still a future technology?
Were the NeXT computer and the Apple Newton turkeys?
Disclosure: I have a Newton that has not been used since the mid 90s.
Is ATA over Ethernet (AoE) a future turkey candidate along with FCoE aka Fibre Channel over Ethernet (or here or here), or is that just some people's wishful thinking regarding FCoE being a turkey?
Speaking of AoE, whatever happened to Zetera (aka Hammer storage), the iSCSI alternative of a few years ago?
To be fair, how about IPFC, not to be confused with FCIP (Fibre Channel frames mapped to IP for distance) or iFCP, nor with FCoE or iSCSI. IPFC mapped IP as an upper level protocol (ULP) onto Fibre Channel, coexisting with FCP and FICON. There were only a few adopters of IPFC, who used it as a low latency channel to channel (CTC) mechanism for open systems before InfiniBand and other technologies matured.
I'm guessing that someone will step up to defend the honor of Microsoft Windows Vista, however until then, IMHO it is or was a turkey. While on the topic of operating systems, anyone have an opinion on IBM's OS/2? Speaking of PCs, how about the DEC Rainbow and its sibling the Robin? Remember when IBM was in the PC business before selling it off to Lenovo? How about the IBM PCjr, turkey candidate or not?
HP should be on the turkey list with their now ex-CEO Leo Apotheker, whom they put out to pasture. On the technology front, anybody remember AutoRAID?
How about the Britton Lee database machine, which today would be referred to as a storage appliance or application optimized storage system, such as the Oracle Exadata II (or Oracle Exadata I based on HP hardware) among others. Note that I'm not saying Exadata I or Exadata II are turkeys, as that will be left to your own determination. Both are cool from a technology standpoint, however there is more to moving from announcement to industry adoption to customer deployment than having neat or interesting technology, something Oracle has been having some success with.
Speaking of Oracle, remember when Sun bought the Encore storage system and renamed it the A7000 (not to be confused with the A5000, aka Photon) in an attempt to compete against the EMC Symmetrix? The Encore folks after Sun went on to their next project, which today is called DataCore. Meanwhile Sun discontinued the A7000 after a period of time, similar to what they did with other acquisitions such as Pirus, which became the 6920 and was end-of-lifed as part of a deal where Sun increased their resell activity of HDS, which too has since been archived. Hmmm, that begs the question of what happens with Oracle acquiring Pillar under an earn out scheme: if there is revenue there is a payout, if there is no revenue then there is a tax write off.
What about big data, will that become a turkey following in the footsteps of other former high flyers such as cloud, virtualization, data classification, CDP, Green IT and SOA among many others? IMHO that depends upon your view or definition, along with your expectations, of big data as a buzzword bingo topic. Depending on your view, that will determine if the above joins others that fade away from the limelight, shifting into productive modes for customers and profitable activity for vendors.
Want to read what others have to say about technology turkeys or flops?
Over the past several years I have done an annual post about IBM and their XIV storage system and this is the fourth in what has become a series. You can read the first one here, the second one here, and last years here and here after the announcement of the IBM V7000.
IBM XIV Gen3 IBM recently announced the generation 3 or Gen3 version of XIV, along with releasing for the first time public performance comparison benchmarks using the Storage Performance Council (SPC) SPC2 throughput workload.
The XIV Gen3 is positioned by IBM as having up to four (4) times the performance of earlier generations of the storage system. In terms of speeds and feeds, the Gen3 XIV supports up to 180 2TB SAS hard disk drives (HDD), which provides up to 161TB of usable storage space capacity. For connectivity, the Gen3 XIV supports up to 24 8Gb Fibre Channel (8GFC) ports or, for iSCSI, 22 1Gb Ethernet (1GbE) ports, with a total of up to 360GBytes of system cache. In addition to the large cache to boost performance, other enhancements include leveraging multi core processors along with an internal InfiniBand network to connect nodes, replacing the former 1GbE interconnect. Note that InfiniBand is only used to interconnect the various nodes in the XIV cluster and is not used for attachment to application servers, which is handled via iSCSI and Fibre Channel.
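As a quick sanity check on those speeds and feeds (the overhead interpretation is my assumption; IBM states the 161TB usable figure, not the breakdown):

```python
# XIV Gen3 capacity sanity check based on the figures above
drives = 180
drive_tb = 2.0

raw_tb = drives * drive_tb      # 360 TB raw
mirrored_tb = raw_tb / 2        # 180 TB after mirroring
usable_tb = 161                 # IBM's stated usable capacity

overhead = 1 - usable_tb / mirrored_tb
print(f"~{overhead:.0%} of mirrored capacity presumably goes to spares/metadata")
```

In other words, mirroring accounts for half the raw space, with roughly another tenth going to system overhead.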
IBM and SPC storage performance history IBM has a strong history of, if not leading the industry in, benchmarking and workload simulation of their storage systems, including Storage Performance Council (SPC) results among others. The exception for IBM over the past couple of years has been the lack of SPC benchmarks for XIV. Last year when IBM released their new V7000 storage system, benchmarks including SPC were available close to, if not at, the product launch. I have in the past commented about IBM's lack of SPC benchmarks for XIV to confirm their marketing claims, given their history of publishing results for all of their other storage systems. Now that IBM has released SPC2 results for the XIV, it is only fitting that I compliment them for doing so.
Benchmark brouhaha Performance workload simulation results can often lead to apples and oranges comparisons, benchmark brouhaha battles or storage performance games. For example, a few years back NetApp submitted an SPC performance result on behalf of their competitor EMC. Now to be clear on something: I'm not saying that SPC is the best or definitive benchmark or comparison tool for storage or any other purpose, as it is not. However it is representative, and most storage vendors have released some SPC results for their storage systems in addition to TPC and Microsoft ESRP among others. SPC2 is focused on streaming such as video, backup or other throughput centric applications, where SPC1 is centered around IOPS or transactional activity. The metrics for SPC2 are megabytes per second (MBps) for large file processing (LFP), large database query (LDQ) and video on demand delivery (VOD) at a given price and protection level.
What is the best benchmark? Simple: your own application with as close to actual workload activity as possible. If that is not possible, then use a workload simulation that most closely resembles your needs.
Does this mean that XIV G3 should be used for every environment? Generally speaking no. However its performance enhancements should allow it to be considered for more applications than in the past. Plus with the public comparisons now available, that should help to silence questions (including those from me) about what the systems can really do vs. marketing claims.
How does XIV compare to some other IBM storage systems using SPC2 comparisons?
| System | SPC2 MBps | Cost per SPC2 MBps | Storage GBytes | Price tested | Discount | Protection |
|--------|-----------|--------------------|----------------|--------------|----------|------------|
| DS5300 | 5,634.17 | $74.13 | 16,383 | $417,648 | 0% | R5 |
| V7000 | 3,132.87 | $71.32 | 29,914 | $223,422 | 38-39% | R5 |
| XIV G3 | 7,467.99 | $152.34 | 154,619 | $1,137,641 | 63-64% | Mirror |
| DS8800 | 9,705.74 | $270.38 | 71,537 | $2,624,257 | 40-50% | R5 |
In the above comparisons, the DS5300 (NetApp/Engenio based) is a dual controller system (4GB of cache per controller) with 128 x 146.8GB 15K HDDs configured as RAID 5, with no discount applied to the price submitted. The V7000, which is based on the IBM SVC along with other enhancements, consists of dual controllers each with 8GB of cache and 120 x 10K 300GB HDDs configured as RAID 5, with just under a 40% discount off list price for the system tested. For the XIV Gen3 system tested, the discount off list price for the submission is about 63%, with 15 nodes, a total of 360GB of cache and 180 2TB 7.2K SAS HDDs configured as mirrors. The DS8800 system with dual controllers has 256GB of cache and 768 x 146GB 15K HDDs configured in RAID 5, with a discount between 40 and 50% off of list.
What the various metrics do not show is the benefit of various features and functionality, which should be considered relative to your particular needs. Likewise, if your applications are not centered around bandwidth or throughput, then the above performance comparisons would not be relevant. Also note that the systems above have various discounted prices as submitted, which can be a hint to a smart shopper on where to begin negotiations. You can also do some analysis of the various systems based on their performance, configuration, physical footprint, functionality and cost; the links below take you to the complete reports with more information.
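To see how the cost per SPC2 MBps column is derived, here is the arithmetic using the price tested and throughput values from the table above:

```python
# Cost per SPC2 MBps derived from the table: (price tested) / (SPC2 MBps)
systems = {
    "DS5300": (417_648, 5_634.17),
    "V7000":  (223_422, 3_132.87),
    "XIV G3": (1_137_641, 7_467.99),
    "DS8800": (2_624_257, 9_705.74),
}

for name, (price_usd, mbps) in systems.items():
    print(f"{name}: ${price_usd / mbps:.2f} per SPC2 MBps")
```

The same approach works for dollars per usable GByte or any other ratio you care about when reading benchmark submissions.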
Bottom line: benchmarks and performance comparisons are just that, comparisons that may or may not be relevant to your particular needs. Consequently they should be used as a tool, combined with other information, to see how a particular solution might be a fit for your specific needs. The best benchmark, however, is your own application running as close to a realistic workload as possible to get a representative perspective of a system's capabilities.
Besides activity in support of the launch of my new book Cloud and Virtual Data Storage Networking (CRC Press), I have been busy with various client research, consulting and advisory projects. In addition to Las Vegas for VMworld, out and about travel activities for attending conferences and presenting seminars have included visits to Minneapolis (local), Nijkerk Holland and Denver (in the same week) and Orlando (SNW). Upcoming out and about events are scheduled for Los Angeles, Atlanta, Chicago, Seattle and a couple of trips to the San Jose area before a brief Thanksgiving holiday break.
Beer and bitterballen on the left, coffee machine in Nijkerk on the right
Day one of two day seminar in Nijkerk
Bicycles lined up at the Nijkerk train station, waiting for the 6:30 train to Schiphol
Changing trains on the way to Schiphol to board a flight to MSP and then on to DEN
After Denver back to MSP for a few days before SNW in Orlando
While being out and about I have had the chance to meet and visit with many different people. Here are some questions and comments that I have heard while out and about:
What comes after cloud?
Are there standards for clouds and virtualization?
Should cost savings be the justification for going to cloud, virtual or dynamic environments?
How is big data different than traditional stream and flat file analytics and processing using tools such as SAS (Statistical Analysis Software)?
Is big data only about map reduce and hadoop?
Are clouds any less secure or safe for storage and applications?
Do clouds and virtualization remove complexity and simplify infrastructures?
Are cloud storage services cheaper than buying and managing your own?
Is object based storage a requirement for public or private cloud?
Do solution bundles such as EMC vBlock and NetApp FlexPods reduce complexity?
Why is FCoE taking so long to be adopted and is it dead?
Should cost savings be the basis for deciding to do a VDI or virtualization project?
What is the best benchmark or comparison for making storage decisions?
In addition, there continues to be plenty of cloud confusion, FUD and hype around public, private, hybrid along with AaaS, SaaS, PaaS and IaaS among other XaaS. The myth that virtualization of servers, storage and workstations is only for consolidation continues. However there are more people beginning to see the next wave of life beyond consolidation where the focus expands to flexibility, agility and speed of deployment for non aggregated workloads and applications. Another popular myth that is changing is that data footprint reduction (DFR) is only about dedupe and backup. What is changing is an awareness that DFR spans all types of storage and data from primary to secondary leveraging different techniques including archive, backup modernization, compression, consolidation, data management and dedupe along with thin provisioning among other techniques.
Archiving for email, databases and file systems needs to be rescued from being perceived as only for compliance purposes. If you want or need to reduce your data footprint, optimize your storage for performance or capacity, enable backup, BC and DR to be performed faster, or achieve Green IT and efficiency objectives, expand your awareness around archiving. When discussing archiving, the focus is often on the target or data storage medium, such as disk, tape, optical or cloud, along with DFR techniques such as compression and dedupe, or functionality including ediscovery and WORM. The other aspects of archive that need to be looked at include policies, retention, and application and software plugins for Exchange, SQL, SharePoint, Sybase, Oracle, SAP, VMware and others.
Boot storms continue to be a common theme for applying solid state devices (SSD) in support of virtual desktop initiatives (VDI). There is however a growing awareness and discussion around shutdown storms and day to day maintenance, including virus scans, in addition to applications that increase the number of writes. Consequently the discussions around VDI are expanding to include both reads and writes as well as reduced latency for storage and networks.
Some other general observations, thoughts and comments:
Getting into Holland as a visitor is easier than returning to the U.S. as a citizen
Airport security screening is more thorough and professional in Europe than in the U.S.
Hops add latency to beer (when you drink it) and to networks (time delay)
Fast tape drives need disk storage to enable streaming for reads and writes
SSD is keeping HDDs alive, HDDs are keeping tape alive, and all their roles are evolving while the technologies continue to evolve.
Hybrid Hard Disk Drives (HHDDs) are gaining in awareness and deployments in workstations as well as laptops.
Confusion exists around what flat layer 2 networks are for LANs and SANs
Click here to view additional comments and perspectives
A couple of weeks ago I traveled down to Orlando Florida for a few days to attend the fall 2011 SNW (Storage Networking World) produced in conjunction by IDG Computerworld and the Storage Networking Industry Association (SNIA).
While at the Orlando event, SNIA executive director Leo Legar asked me how many SNWs I had attended, and my response was: on which continent?
My answer was partly in fun, however also serious, as I have been attending SNWs (in addition to other SNIA events) for over ten years in both North and South America as well as in Europe, including presenting SNIA tutorials and SNW sessions.
SNW is always good for meeting up with old friends and acquaintances along with meeting new ones, including twitter tweeps (hashtag #snwusa #snw2011 @sniacloud @snwusa), and the recent event was no exception. Granted, SNW is smaller than it was during its peak in the mid 2000s, however it was great to go for a couple of days of meetings, checking out the expo hall and some sessions, as well as getting out and about meeting people involved with servers, storage, networking, virtualization, cloud, hardware, software and services.
SNW remains, as its name implies (Storage Networking World), an event about networking as in conversations, learning, knowledge exchange, information gathering and meetings, not to mention the hands on lab. I found the two days I was there adequate for the meetings and other activities I had planned, along with time for impromptu meetings. Another observation: during the peak of the large mega SNW events, while there were more meetings, they were also much shorter, along the lines of speed dating, vs. those a couple of weeks ago where there was time for quality conversations.
Also during the recent SNW event, I did a book signing sponsored by SNIA. If you have not done so, check out the SNIA Cloud Storage Initiative (CSI), who arranged for several copies of my new book Cloud and Virtual Data Storage Networking to be given away. Book signings are fun in that I get to meet lots of people and hear what they are doing, encountering, looking for, have done, or are concerned or excited about. It was handy having SNIA CSI material available at the table as I was signing books and visiting with people, to be able to give them information about things such as CDMI, not to mention hearing what they were doing or looking for. Note to SNIA: if we do this again, let's make sure to have someone from the CSI at the table to join in the fun and conversations, as there were some good ones. Learn more about the activities of the SNIA CSI, including their Cloud Data Management Interface (CDMI), here.
Thanks again to Wayne Adams (@wma01606), Leo Legar and Michael Meleedy among others who arranged for a fantastic fall 2011 SNW event along with everyone who participated in the book signing event and other conversations while in Orlando and to those who were involved virtually via twitter.
Rather than doing a bunch of separate posts, here is a collection of different perspectives and commentary on various IT and data storage industry activity.
In this link are comments and perspectives regarding thin provisioning including how it works as well as when to use it for optimizing storage space capacity. Speaking of server and storage capacity, here in this link are comments on what server and storage would be needed to support an SMB office of 50 people (or more, or less) along with how to back it up.
Storage networking interface or protocol debates (battles) can be interesting; in this link, see the role of iSCSI SANs for data storage environments. Let's not forget about Fibre Channel over Ethernet (FCoE), which is discussed in this link and here in this link. Here in this link are comments about how integrated rack-em, stack-em and package bundles stack up. To support continued demand for managed service providers (MSP), cloud and hosted services providers are continuing to invest in their infrastructures; read some comments here. While technology plays a role, particularly as it matures, there is another barrier to leveraging converged solutions and that is organizational; read some perspectives and thoughts here.
Storage optimization, including data footprint reduction (DFR), can be used to cut costs as well as support growth. In this link see tips on reducing storage costs, and additional perspectives in this link on doing more with what you have. Here in this link are some wit and wisdom comments on the world of disaster recovery solutions. Meanwhile in this link are perspectives for choosing the right business continuity (BC) and disaster recovery (DR) consultant. In this link are comments on BC and DR, including planning for virtualization and life beyond consolidation. Are disk based dedupe and virtual tape libraries a holdover for old backup, or a gateway to the future? See some perspectives on those topics and technologies in this link.
Here are some more comments on DR and BC leveraging the cloud, while this piece here offers perspectives on various size organizations looking at clouds for backup. What is the right local, cloud or hybrid backup for SMBs? Check out some commentary here, while viewing some perspectives on cloud disaster recovery here. Not to be forgotten, laptop data protection can also be a major headache, however there are also many cures discussed in this piece here.
The Storage Networking Industry Association (SNIA) Green Storage Initiative (GSI) debuted their Emerald power efficiency measurement specification recently; read some perspectives and comments in this link here. While we are on the topic of data center efficiency and effectiveness, here in this link are perspectives on micro servers or mini blade systems. Solution bundles, also known as data center in a box or SAN in a CAN, have been popular with solutions from EMC (vBlocks) and NetApp (FlexPods) among others; read perspectives on them in this link.
What would a conversation involving data storage and IT (particularly buzzword bingo) be without comments about Big Data and Big Bandwidth which you can read here.
Want to watch some videos from Spring 2011 SNW? Check out around the 15:00 to 55:00 marks in this video from the Cube, where various topics are discussed. Interested in how to scale data storage with clustered or scale up and out solutions? Check out this video here, or if you want to see some perspectives on data deduplication, watch this clip.
What's your take on choosing the best SMB NAS? Here are some of my perspectives on choosing an SMB NAS storage system. Meanwhile here are some perspectives on enterprise class storage features finding their way into SMB NAS storage systems.
Meanwhile industry leaders EMC and NetApp have been busy enhancing their NAS storage solutions; you can read comments here.
Are you familiar with the Open Virtualization Alliance (OVA)? Here are some comments about OVA and other server virtualization topics.
What's your take on Thunderbolt, the new interconnect Apple is using in place of USB? Here are my thoughts. Meanwhile various other tips, Ask the Expert (AtE) items and discussion can be found here.
Check out the above links, and view more perspectives, comments and news here, here, here, here and here.
I recently came across a piece by Carl Brooks over at IT Tech News Daily that caught my eye; the title was Cloud Storage Often Results in Data Loss. The piece has an effective title (good for search engine optimization: SEO) as it stood out from many others I saw on that particular day.
What caught my eye about Carl's piece is that it reads as if the facts, based on a quick survey, point to clouds resulting in data loss, as opposed to being an opinion that some cloud usage can result in data loss.
My opinion is that if not used properly, including ignoring best practices, any form of data storage medium or media could result in, or be blamed for, data loss. Some people have lost data as a result of using cloud storage services, just as other people have lost data or access to information on other storage mediums and solutions. For example, data has been lost on tape, Hard Disk Drives (HDDs), Solid State Devices (SSD), Hybrid HDDs (HHDD), RAID and non RAID, local and remote and even optical based storage systems large and small. In some cases there have been errors or problems with the medium or media; in other cases storage systems have lost access to, or lost, data due to hardware, firmware, software or configuration issues, including human error, among other causes.
Technology failure: not if, rather when, and how to decrease the impact Any technology, regardless of what it is or who it is from, along with its architecture design and implementation, can fail. It is not if, rather when, and how gracefully, along with what safeguards decrease the impact, in addition to how faults are contained or isolated, that differentiates various products or solutions. How they automatically repair and self heal to keep running or support accessibility and maintain data integrity is important, as is how those options are used. Granted, a failure may not be technology related per se; rather something associated with human intervention, configuration, change management (or lack thereof) along with accidental or intentional activities.
I follow my advice and best practices when selecting cloud providers looking for good value, service level agreements (SLAs) and service level objectives (SLOs) over low cost or for free services.
In the several years of using cloud based storage and services there has been some loss of access, however no loss of data. Those service disruptions or loss of access to data and services ranged from a few minutes to a little over an hour. In those scenarios, if I could not have waited for cloud storage to become accessible, I could have accessed a local copy if it were available.
Had a major disruption occurred where it would have been several days before I could gain access to that information, or if it were actually lost, I have a data insurance policy. That data insurance policy is part of my business continuance (BC) and disaster recovery (DR) strategy. My BC and DR strategy is a multi layered approach combining local, offline and offsite along with online cloud data protection and archiving.
Assuming my cloud storage service could get data back to a given point (RPO) in a given amount of time (RTO), I have some options. One option is to wait for the service or information to become available again, assuming a local copy is no longer valid or available. Another option is to start restoration from a master gold copy and then roll forward changes from the cloud services as that information becomes available. In other words, I am using cloud storage as another resource that both protects what is local and complements how I locally protect things.
Minimize or cut data loss or loss of access Anything important should be protected locally and remotely, meaning leveraging cloud along with a master or gold backup copy.
To cut the cost of protecting information, I also leverage archives, which means not all data gets protected the same. Important data is protected more often, reducing RPO exposure and speeding up RTO during restoration. Other data that is not as important is still protected, however on a different frequency with other retention cycles; in other words, tiered data protection. By implementing tiered data protection, best practices, and various technologies including data footprint reduction (DFR) such as archive, compression and dedupe, in addition to local disk to disk (D2D), disk to disk to cloud (D2D2C), along with routine copies to offline media (removable HDDs or RHDDs) that go offsite, I'm able to stretch my data protection budget further. Not only is my data protection budget stretched further, I have more options to speed up RTO along with better granularity for recovery and enhanced RPOs.
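Here is a minimal sketch of what such a tiered data protection policy might look like written down as configuration; the tier names, frequencies, targets and retention values are hypothetical examples, not a prescription:

```python
# Hypothetical tiered data protection policy (illustrative values only)
protection_tiers = {
    "critical": {                          # e.g., active business data
        "backup_frequency_hours": 4,       # small RPO exposure
        "targets": ["local_disk", "cloud"],
        "retention_days": 90,
    },
    "standard": {                          # e.g., general project files
        "backup_frequency_hours": 24,
        "targets": ["local_disk", "cloud"],
        "retention_days": 30,
    },
    "reference": {                         # e.g., archived data, protected less often
        "backup_frequency_hours": 168,     # weekly
        "targets": ["removable_hdd_offsite"],
        "retention_days": 365,
    },
}

for tier, policy in protection_tiers.items():
    print(f"{tier}: worst case RPO ~{policy['backup_frequency_hours']}h, "
          f"targets {policy['targets']}")
```

The budget stretch comes from the fact that only the critical tier pays for frequent copies to multiple targets.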
If you are looking to avoid losing data, or loss of access, it is a simple equation in no particular order:
Strategy and design
Best practices and processes
Various technologies
Quality products
Robust service delivery
Configuration and implementation
SLO and SLA management metrics
People skill set and knowledge
Usage guidelines or terms of service (ToS)
Unfortunately, clouds like other technologies or solutions get a bad reputation or get blamed when something goes wrong. Sometimes it is the technology or service that fails; other times it is a combination of errors that resulted in loss of access or lost data. With clouds, as has been the case with other storage mediums and systems in the past, when something goes wrong and it has been hyped, chances are it will become a target for blame or finger pointing vs. determining what went wrong so that it does not occur again. For example, cloud storage has been hyped as easy to use: don't worry, just put your data there, you can get out of the business of managing storage as the cloud will do that magically for you behind the scenes.
The reality is that while cloud storage solutions can offload functions, someone is still responsible for making decisions on their usage and configuration that impact availability. What separates various providers is their ability to design in best practices, isolate and contain faults quickly, and have resiliency integrated as part of a solution, along with various SLAs aligned to the service level you are expecting, in an easy to use manner.
Does that mean the more you pay the more reliable and resilient a solution should be? No, not necessarily, as there can still be risks including how the solution is used.
Does that mean low cost or for free solutions have the most risk? No, not necessarily as it comes down to how you use or design around those options. In other words, while cloud storage services remove or mask complexity, it still comes down to how you are going to use a given service.
Shared responsibility for cloud (and non cloud) storage data protection Anything important enough that you cannot afford to lose, or need quick access to, should be protected in different locations and on various mediums. In other words, balance your risk. Cloud storage service providers need to take responsibility for meeting service expectations for a given SLA and the SLOs that you agree to pay for (unless free).
As the customer, you have the responsibility of following the best practices supplied by the service provider, including reading the ToS. Part of the responsibility as a customer or consumer is to understand what the ToS, SLA and SLOs are for a given level of service that you are using. As a customer or consumer, this means doing your homework to be a smart, educated buyer or consumer of cloud storage services.
If you are a vendor or value added reseller (VAR), your opportunity is to help customers with the acquisition process to make informed decisions. For VARs and solution providers, this can mean up selling customers to a higher level of service by making them aware of the risk and reward benefits, as opposed to focusing on cost. After all, if an order taker at McDonalds can ask "Would you like to supersize your order?", why can't you as a vendor or solution provider also have a value oriented up sell message?
Additional related links to read more and sources of information:
Poll: Who is responsible for cloud storage data loss?
Taking action, what you should (or should not) do Don't be scared of clouds, however do your homework, be ready, look before you leap and follow best practices. Look into the service level agreements (SLAs) associated with a given cloud storage product or service. Follow best practices for how you or someone else will protect the data that is put into the cloud.
For critical data or information, consider having a copy of that data in the cloud as well as at or in another place, which could be a different cloud, local, or offsite and offline. Keep in mind the theme for critical information and data is not if, rather when, so consider what can be done to decrease the risk or impact of something happening; in other words, be ready.
Data put into the cloud can be lost, or loss of access to it can occur for some amount of time, just as happens when using non cloud storage such as tape, disk or SSD. What impacts or minimizes your risk of using traditional local or remote as well as cloud storage are best practices and how the storage is configured, protected, secured and managed. Another consideration is that the type and quality of the storage product or cloud service can have a big impact. Sure, a quality product or service can fail; however, you can also design and configure to decrease those impacts.
Wrap up Bottom line: do not be scared of cloud storage, however be ready, do your homework, review best practices, and understand the benefits and caveats, risk and reward. For those who want to learn more about cloud storage (public, private and hybrid) along with data protection, data management, and data footprint reduction among other related topics and best practices, I happen to know of some good resources. Those resources, in addition to the links provided above, include Cloud and Virtual Data Storage Networking (CRC Press), which you can learn more about here as well as find at Amazon among other venues. Also check out Enterprise Systems Backup and Recovery: A Corporate Insurance Policy by Preston De Guise (aka twitter @backupbear), which is a great resource for protecting data.