The Human Face of Big Data, a Book Review

StorageIO industry trends cloud, virtualization and big data

My copy of the new book The Human Face of Big Data created by Rick Smolan and Jennifer Erwitt arrived yesterday compliments of EMC (the lead sponsor). In addition to EMC, the other sponsors of the book are Cisco, VMware, FedEx, Originate and Tableau software.

To say this is a big book would be an understatement, then again, big data is a big topic with a lot of diversity if you open your eyes and think in a pragmatic way, which once you open and see the pages you will see. This is physically a big book (11x 14 inches) with lots of pictures, texts, stories, factoids and thought stimulating information of the many facets and dimensions of big data across 224 pages.

While Big Data as a buzzword and industry topic theme might be new, along with some of the related technologies, techniques and focus areas, other as aspects have been around for some time. Big data means many things to various people depending on their focus or areas of interest ranging from analytics to images, videos and other big files. A common theme is the fact that there is no such thing as an information or data recession, and that people and data are living longer, getting larger, and we are all addicted to information for various reasons.

Big data needs to be protected and preserved as it has value, or its value can increase over time as new ways to leverage it are discovered which also leads to changing data access and life cycle patterns. With many faces, facets and areas of interests applying to various spheres of influence, big data is not limited to programmatic, scientific, analytical or research, yet there are many current and use cases in those areas.

Big data is not limited to videos for security surveillance, entertainment, telemetry, audio, social media, energy exploration, geosciences, seismic, forecasting or simulation, yet those have been areas of focus for years. Some big data files or objects are millions of bytes (MBytes), billion of bytes (GBytes) or trillion of bytes (TBytes) in size that when put into file systems or object repositories, add up to Exabytes (EB – 1000 TBytes) or Zettabytes (ZB – 1000 EBs). Now if you think those numbers are far-fetched, simply look back to when you thought a TByte, GByte let alone a MByte was big or far-fetched future. Remember, there is no such thing as a data or information recession, people and data are living longer and getting larger.

Big data is more than hadoop, map reduce, SAS or other programmatic and analytical focused tool, solution or platform, yet those all have been and will be significant focus areas in the future. This also means big data is more than data warehouse, data mart, data mining, social media and event or activity log processing which also are main parts have continued roles going forward. Just as there are large MByte, GByte or TByte sized files or objects, there are also millions and billions of smaller files, objects or pieces of information that are part of the big data universe.

You can take a narrow, product, platform, tool, process, approach, application, sphere of influence or domain of interest view towards big data, or a pragmatic view of the various faces and facets. Of course you can also spin everything that is not little-data to be big data and that is where some of the BS about big data comes from. Big data is not exclusive to the data scientist, researchers, academia, governments or analysts, yet there are areas of focus where those are important. What this means is that there are other areas of big data that do not need a data science, computer science, mathematical, statistician, Doctoral Phd or other advanced degree or training, in other words big data is for everybody.

Cover image of Human Face of Big Data Book

Back to how big this book is in both physical size, as well as rich content. Note the size of The Human Face of Big Data book in the adjacent image that for comparison purposes has a copy of my last book Cloud and Virtual Data Storage Networking (CRC), along with a 2.5 inch hard disk drive (HDD) and a growler. The Growler is from Lift Bridge Brewery (Stillwater, MN), after all, reading a big book about big data can create the need for a big beer to address a big thirst for information ;).

The Human Face of Big Data is more than a coffee table or picture book as it is full of with information, factoids and perspectives how information and data surround us every day. Check out the image below and note the 2.5 inch HDD sitting on the top right hand corner of the page above the text. Open up a copy of The Human Face of Big Data and you will see examples of how data and information are all around us, and our dependence upon it.

A look inside the book The Humand Face of Big Data image

Book Details:
Copyright 2012
Against All Odds Productions
ISBN 978-1-4549-0827-2
Hardcover 224 pages, 11 x 0.9 x 14 inches
4.8 pounds, English

There is also an applet to view related videos and images found in the book at HumanFaceofBigData.com/viewer in addition to other material on the companion site www.HumanFacesofBigData.com.

Get your copy of
The Human Face of Big Data at Amazon.com by clicking here or at other venues including by clicking on the following image (Amazon.com).

Some added and related material:
Little data, big data and very big data (VBD) or big BS?
How many degrees separate you and your information?
Hardware, Software, what about Valueware?
Changing Lifecycles and Data Footprint Reduction (Data doesnt have to lose value over time)
Garbage data in, garbage information out, big data or big garbage?
Industry adoption vs. industry deployment, is there a difference?
Is There a Data and I/O Activity Recession?
Industry trend: People plus data are aging and living longer
Supporting IT growth demand during economic uncertain times
No Such Thing as an Information Recession

For those who can see big data in a broad and pragmatic way, perhaps using the visualization aspect this book brings forth the idea that there are and will be many opportunities. Then again for those who have a narrow or specific view of what is or is not big data, there is so much of it around and various types along with focus areas you too will see some benefits.

Do you want to play in or be part of a big data puddle, pond, or lake, or sail and explore the oceans of big data and all the different aspects found in, under and around those bigger broader bodies of water.

Bottom line, this is a great book and read regardless of if you are involved with data and information related topics or themes, the format and design lend itself to any audience. Broaden your horizons, open your eyes, ears and thinking to the many facets and faces of big data that are all around us by getting your copy of The Human Face of Big Data (Click here to go to Amazon for your copy) book.

Ok, nuff said.

Cheers gs

Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press, 2011), The Green and Virtual Data Center (CRC Press, 2009), and Resilient Storage Networks (Elsevier, 2004)

twitter @storageio

All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved

Is There a Data and I/O Activity Recession?

Storage I/O trends

With all the focus on both domestic and international economic woes and discussion of recessions and depressions and possible future rapid inflation, recent conversations with IT professionals from organizations of all size across different industry sectors and geographies prompted the question, is there also a data and I/O activity recession?

Here’s the premise, if you listen to current economic and financial reports as well as employment information, the immediate conclusion is that yes, there should also be an I recession in the form of contraction in the amount of data being processed, moved and stored which would also impact I/O (e.g. DAS,, LAN, SAN, FAN or NAS, MAN, WAN) networking activity as well. After all, the server, storage, I/O and networking vendors earnings are all being impacted right?

As is often the case, there is more to the story, certainly vendor earnings are down and some vendors are shipping less product than during corresponding periods from a year or more ago. Likewise, I continue to hear from both IT organizations, vars and vendors of lengthened sales cycles due to increased due diligence and more security of IT acquisitions meaning that sales and revenue forecasts continue to be very volatile with some vendors pulling back on their future financial guidance.

However, does that mean fewer servers, storage, I/O and networking components not to mention less software is being shipped? In some cases there is or has been a slow down. However in other cases, due to pricing pressures, increased performance and capacity density where more work can be done by fewer devices, consolidation, data footprint reduction, optimization, virtualization including VMware and other techniques, not to mention a decrease in some activity, there is less demand. On the other hand, while some retail vendors are seeing their business volume decrease, others such as Amazon are seeing continued heavy demand and activity.

Been on a trip lately through an airport? Granted the airlines have instituted capacity management (e.g. capacity planning) and fleet optimization to align the number of flights or frequency as well as aircraft type (tiering) to the demand. In some cases smaller planes, in other cases larger planes, for some more stops at a lower price (trade time for money) or in other cases shorter direct routes for a higher fee. The point being is that while there is an economic recession underway, and granted there are fewer flights, many if not most of those flights are full which means transactions and information to process by the airlines reservations and operational as well as customer relations and loyalty systems.

Mergers and acquisitions usually mean a reduction or consolidation of activity resulting in excess and surplus technologies, yet talking with some financial services organizations, over time some of their systems will be consolidated to achieve operating efficiency and synergies, near term, in some cases, there is the need for more IT resources to support the increased activity of supporting multiple applications, increased customer inquiry and conversion activity.

On a go forward basis, there is the need to support more applications and services that will generate more I/O activity to enable data to be moved, processed and stored. Not to mention, data being retained in multiple locations for longer periods of time to meet both compliance and non regulatory compliance requirements as well as for BC/DR and business intelligence (BI) or data mining for marketing and other purposes.

Speaking of the financial sector, while the economic value of most securities is depressed, and with the wild valuation swings in the stock markets, the result is more data to process, move and store on a daily basis, all of which continues to place more demand on IT infrastructure resources including servers, storage, I/O networking, software, facilities and the people to support them.

Dow Jones Trading Activity Volume
Dow Jones Trading Activity Volume (Courtesy of data360.org)

For example, the amount of Dow Jones trading activity is on a logarithmic upward trend curve in the example chart from data360.org which means more transactions selling and buying. The result of more transactions is that there are also an increase in the number of back-office functions for settlement, tracking, surveillance, customer inquiry and reporting among others activities. This means that more I/Os are generated with data to be moved, processed, replicated, backed-up with additional downstream activity and processing.

Shifting gears, same things with telephone and in particular cell phone traffic which indirectly relates on IT systems particular for support email and other messaging activity. Speaking of email, more and more emails are sent every day, granted many are spam, yet these all result in more activity as well as data.

What’s the point in all of this?

There is a common awareness among most IT professionals that there is more data generated and stored every year and that there is also an awareness of the increased threats and reliance upon data and information. However what’s either not as widely discussed is the increase in I/O and networking activity. That is, the space capacity often gets talked about, however, the I/O performance, response time, activity and data movement can be forgotten about or its importance to productivity diminished. So the point is, keep performance, response time, and latency in focus as well as IOPS and bandwidth when looking at, and planning IT infrastructure to avoid data center bottlenecks.

Finally for now, what’s your take, is there a data and/or I/O networking recession, or is it business and activity as usual?

Ok, nuff said.

Cheers gs

Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
twitter @storageio

All Comments, (C) and (TM) belong to their owners/posters, Other content (C) Copyright 2006-2024 Server StorageIO and UnlimitedIO LLC All Rights Reserved