AWS Archives

April 30, 2015 | April 27, 2025

April 2015 Server StorageIO Update Newsletter

Volume 15, Issue IV

Hello and welcome to this April 2015 Server and StorageIO update newsletter.

This month's newsletter has a focus on cloud and object storage for bulk data, unstructured data, big data and archiving, among other scenarios.

Enjoy this edition of the Server and StorageIO update newsletter and watch for new tips, articles, StorageIO lab report reviews, blog posts, videos and podcasts, along with in-the-news commentary, appearing soon.

Storage I/O trends

StorageIOblog posts

April StorageIOblog posts include:

Cloud conversations: AWS EFS Elastic File System (Cloud NAS) First Preview Look
Blog and Podcast S3motion Buckets Containers Objects AWS S3 Cloud and EMCcode
Data Protection Gumbo (Blog & Podcast) Protect Preserve and Serve Information
In case you missed it, March 2015 Server StorageIO Newsletter

View other recent as well as past blog posts here

April Newsletter Feature Theme
Cloud and Object Storage Fundamentals

There are many facets to object storage including technology implementation, products, services, access and architectures for various applications and use scenarios. The following is a short synopsis of some basic terms and concepts associated with cloud and object storage.

Common cloud and object storage terms

Account or project – Top of the hierarchy, representing the owner or billing information for a service, and where buckets are attached.
Availability Zone (AZ) – Can be a rack of servers and storage, or a data center, across which data is spread for storage and durability.

Example of some AWS Regions and AZ’s

Bucket or Container – Where objects or sub-folders containing objects are attached and accessed. Note in some environments such as AWS S3 you can have sub-folders in a bucket.
Connector – How your applications access the cloud or object storage, such as via an API, S3, Swift, REST, CDMI, Torrent, JSON, NAS file, block or other access gateway or software.
Durability – Data dispersed with copies in multiple locations to survive failure of storage or server hardware, software, zone or even region. Availability = Access + Durability.
End-point – Where or what your software, application or tool and utilities or gateways attach to for accessing buckets and objects.
Ephemeral – Temporary or non-persistent
Eventual consistency – Data is eventually made consistent; think in terms of asynchronous or deferred writes where there is a time lag vs. synchronous or real-time updates.
Immutable – Persistent, non-altered or write once read many copy of data. Objects generally are not updated, rather new objects created.

Object storage and cloud
Via Cloud and Virtual Data Storage Networking (CRC)

Object – Byte (or bit) stream that can be as small as one byte or as large as several TBytes (some solutions and services support up to 5TByte sized objects). The object contains whatever data, in any organization, along with metadata. Different solutions and services support from a couple hundred KBytes of metadata to MBytes worth of metadata. In terms of what can be stored in an object: anything from files, videos, images, virtual disks (VMDK's, VHDX), ZIP or tar files, backup and archive save sets, executable images or ISO's, anything you want.
OPS – Objects per second, or how many objects are accessed, similar to an IOP. Access includes gets, puts, list, head and deletes for a CRUD interface, e.g. Create, Read, Update, Delete.
Region – Location where data is stored that can include one or more data centers also known as Availability Zones.
Sub-folder – While object storage can be accessed in a flat name space, for commonality and organization some solutions and services support the notion of sub-folders that resemble a traditional directory hierarchy.
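
To make the bucket, object and sub-folder terms a bit more concrete, here is a minimal Python sketch (not any particular service's API) of how a flat name space can be presented as sub-folders by splitting object keys on a delimiter. The function name list_prefix, the "/" delimiter and the sample keys are assumptions made up for illustration.

```python
def list_prefix(keys, prefix="", delimiter="/"):
    """Return (objects, sub_folders) found directly under prefix.

    keys is a flat list of object names, as in a bucket or container.
    Nothing is nested in storage; the "folders" are derived from the names.
    """
    objects, sub_folders = [], set()
    for key in sorted(keys):
        if not key.startswith(prefix):
            continue  # object lives under a different prefix
        rest = key[len(prefix):]
        if delimiter in rest:
            # Deeper object: report only its first-level "sub-folder"
            sub_folders.add(prefix + rest.split(delimiter, 1)[0] + delimiter)
        else:
            objects.append(key)
    return objects, sorted(sub_folders)


# Hypothetical bucket contents (a flat list of keys)
keys = [
    "readme.txt",
    "videos/2015/april.mp4",
    "videos/2015/march.mp4",
    "videos/intro.mp4",
    "backups/full.tar",
]

print(list_prefix(keys))             # top level of the bucket
print(list_prefix(keys, "videos/"))  # contents of the "videos/" sub-folder
```

The point of the sketch is that the hierarchy is a view over the object names rather than something stored separately, which is why the same bucket can be treated as flat or as folders depending on the tool.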

Learn more in Cloud and Virtual Data Storage Networking (CRC) and www.objectstoragecenter.com

OpenStack Manila (e.g. Folders and Files)

AWS recently announced their new cloud based Elastic File System (EFS) to complement their existing Elastic Block Store (EBS) offerings. However, are you aware of what is going on with cloud files within OpenStack?

For those who are familiar with OpenStack or simply talk about it and Swift object storage, or perhaps Cinder block storage, are you aware that there is also a file (NAS or Network Attached Storage) component called Manila?

In concept Manila should provide a similar capability to what AWS has recently announced with their Elastic File System (EFS), or depending on your perspective, perhaps the other way around. If you are familiar with Manila and have done anything with it, what are your initial thoughts and perspectives?

What this all means

People routinely tell me these are the most exciting and interesting times ever in servers, storage, I/O networking, hardware, software, backup or data protection, performance, cloud and virtual, or take your pick, with which I would not disagree.

However, for the past several years (no, make that decades), there have been new and more interesting things, including in adjacent areas.

I predict that at least for the next few years (no, make that decades), we will continue to see plenty of new and interesting things.

The question, however, is what's applicable to you and your environment vs. simply fun and interesting to watch?

Ok, nuff said, for now

Cheers gs

In This Issue

Industry Trends Perspectives News

Commentary in the news

Tips and Articles

StorageIOblog posts

Events and Webinars

Server StorageIO Lab reports

Resources and Links

Industry News and Activity

Recent Industry news and activity

GovTech: Storage Costs Cloud Police Cam
Via BostonHerald: Booting Up: Storage costs cloud police cam issue
Via ComputerWorld: Amazon offers network file storage in the cloud
Via ComputerWeekly: HGST marries helium HDD’s and Himalaya in object storage
Via GoogleCloudPlatform Blog: GCS Nearline Online storage at Offline price
Via MarketWatch: Global Data Center Provider CyrusOne Announces Direct Connectivity to Google Cloud Platform
Via PRNewsWire: Quantum Announces New Archive Solutions Designed To Reduce Unstructured Data Storage Costs
Via StorageIOblog: AWS S3 Cross Region Replication storage enhancements
Via StreetInsider: Western Digital (WDC) to Acquire Object Storage Software Amplidata
Via Enterprise Storage Forum: Dell Invests in Object Storage Startup Exablox
Via Enterprise Storage Forum: Introducing s3motion (S3 and object access docker based appliance)
Via Computerworld: Quantum enhances their cloud and object storage management with new StorNext software version
ScaleOut Software Releases Version 5.2 of Its In-Memory Computing Platform
HP Inks Global Reseller Agreement With Object Storage Startup Scality
NetApp Introduces Software-Defined Object Storage for the Hybrid Cloud
Via InsideHPC: Deploying Hadoop on Lustre Storage: Lessons Learned and Best Practices
Via Yahoo Engineering Blog: Yahoo Cloud Object Store – Object Storage at Exabyte Scale
Via the Platform: Inside The Ceph Exascale Storage At Yahoo
Via Swift Summit: Taking the Mystery out of Erasure Codes: A Swift Implementation
Enterprise Storage Forum: Lustre buying guide

View other recent industry activity here

StorageIO Commentary in the news

Recent Server StorageIO commentary and industry trends perspectives about news, activities and announcements.

CyberTrend: Comments on Software Defined Data Center and Virtualization

View more trends comments here

StorageIO Tips and Articles

Check out these resources and links on server storage I/O performance and benchmarking tools. View more tips and articles here

Various Industry Events

EMCworld – May 4-6 2015 (Las Vegas)

Interop – April 29 2015 (Las Vegas)
Presenting
Smart Shopping for Your Enterprise Storage Strategy

View other recent and upcoming events here

Webinars

BrightTalk Webinar – June 23 2015
Server Storage I/O Innovation Update

View other webinars here

Videos and Podcasts

Data Protection Gumbo Podcast
Protect Preserve and Serve Data

In this episode, Greg Schulz is a guest on Data Protection Gumbo hosted by Demetrius Malbrough (@dmalbrough). The conversation covers various aspects of data protection, with a focus on protecting, preserving and serving information, applications and data across different environments and customer segments.

While we discuss enterprise and SMB data protection, we also talk about trends from mobile to the cloud, among many other tools, technologies and techniques. Check out the podcast here.

Springtime in Kentucky
With Kendrick Coleman of EMCcode
Cloud Object Storage S3motion and more

In this episode, @EMCcode (part of EMC) developer advocate Kendrick Coleman (@KendrickColeman) joins me (i.e. Greg Schulz) for a conversation.

Conversation covers what is EMCcode, EMC Federation, Cloud Foundry, clouds, object storage, buckets, containers, objects, node.js, Docker, OpenStack, AWS S3, micro services, and the S3motion tool Kendrick developed.

S3motion is a good tool to have in your server storage I/O tool box for working with cloud and object storage, along with others such as Cloudberry, S3fs, Cyberduck and S3 browser among many others. You can get S3motion for free from GitHub here. Check out the companion blog post for this podcast here.

StorageIO podcasts are also available at StorageIO.tv

From StorageIO Labs

Research, Reviews and Reports

AWS S3 Cross-Region Replication

Moving and Replicating Buckets/Containers, Sub folders and Objects (Click on Image to read about AWS Cross-Region Replication)

View other StorageIO lab review reports here

Resources and Links

Cloud conversations: If focused on cost you might miss other cloud benefits
AWS overview and primer
Avoid Cloud Storage Pricing Surprises
Are more than 5 nines of availability possible?
Primary storage clouds vs cloud for backup
storageio.com/links
objectstoragecenter.com
storageioblog.com/data-protection-diaries-main/
storageperformance.us
thessdplace.com
storageio.com/raid

Greg Schulz – Author Cloud and Virtual Data Storage Networking (CRC Press), The Green and Virtual Data Center (CRC Press) and Resilient Storage Networks (Elsevier)
twitter @storageio

April 12, 2015 | March 7, 2022

S3motion Buckets Containers Objects AWS S3 Cloud and EMCcode

It’s springtime in Kentucky, and I recently had the opportunity to talk with Kendrick Coleman about S3motion, Buckets, Containers, Objects, AWS S3, Cloud and Object Storage, node.js, EMCcode and open source, among other related topics. The conversation is available as a podcast here or video here, as well as at StorageIO.tv.

In this Server StorageIO industry trends perspective podcast episode, @EMCcode (Part of EMC) developer advocate Kendrick Coleman (@KendrickColeman) joins me for a conversation. Our conversation spans spring-time in Kentucky (where Kendrick lives) which means Bourbon and horse racing as well as his blog (www.kendrickcoleman.com).

Btw, in the podcast I refer to Captain Obvious and Kendrick’s beard; for those not familiar with who or what @Captainobvious is, click here to learn more.

@Kendrickcoleman & @Captainobvious

What about Clouds Object Storage Programming and other technical stuff?

Of course we also talk some tech, including what is EMCcode, EMC Federation, Cloud Foundry, clouds, object storage, buckets, containers, objects, node.js, Docker, OpenStack, AWS S3, micro services, and the S3motion tool that Kendrick developed.

Cloud and Object Storage Access
Click to view video

Kendrick explains the motivation behind S3motion along with trends in and around objects (including GET, PUT vs. traditional Read, Write) as well as programming among related topic themes and how context matters.

S3motion for AWS S3 Google and object storage
Click to listen to podcast

I have used S3motion for moving buckets, containers and objects around including between AWS S3, Google Cloud Storage (GCS) and Microsoft Azure as well as to/from local. S3motion is a good tool to have in your server storage I/O tool box for working with cloud and object storage along with others such as Cloudberry, S3fs, Cyberduck, S3 browser among many others.

You can get S3motion free from GitHub here.

Where to learn more

Here are some links to learn more about AWS S3, Cloud and Object Storage along with related topics

AWS EFS Elastic File System (Cloud NAS) First Preview Look
Cross-Region Replication for Amazon S3
Cloud conversations: If focused on cost you might miss other cloud storage benefits
Data Protection Diaries
Cloud Conversations: AWS overview and primer
Eight Ways to Avoid Cloud Storage Pricing Surprises
Cloud and Object Storage Center
Are more than five nines of availability really possible?
How do primary storage clouds and cloud for backup differ?
What’s most important to know about my cloud privacy policy?

What this all means and wrap-up

Context matters when it comes to many things, particularly objects, as they can mean different things. Tools such as S3motion make it easy to move your buckets or containers, along with objects, from one cloud storage system, solution or service to another. Also check out EMCcode to see what they are doing on different fronts, from supporting new and greenfield development with Cloud Foundry and PaaS, to OpenStack, to bridging current environments to the next generation of platforms. Also check out Kendrick's blog site, as he has a lot of good technical content as well as some other fun stuff to learn about. I look forward to having Kendrick on as a guest again soon to continue our conversations. In the meantime, check out S3motion to see how it can fit into your server storage I/O tool box.

Ok, nuff said, for now..

Cheers gs

April 9, 2015 | March 7, 2022

Cloud Conversations: AWS EFS Elastic File System (Cloud NAS) First Preview Look

Amazon Web Services (AWS) recently announced (in preview) its new Elastic File System (EFS), providing Network File System (NFS) NAS (Network Attached Storage) capabilities for AWS Elastic Compute Cloud (EC2) instances. EFS complements other AWS storage offerings including Simple Storage Service (S3) along with Elastic Block Store (EBS), Glacier and Relational Database Service (RDS) among others.

Ok, that’s a lot of buzzwords and acronyms, so let’s break this down a bit.

AWS EFS and Cloud Storage, Beyond Buzzword Bingo

EC2 – Instances exist in various Availability Zones (AZ’s) in different AWS Regions. Compute instances are available with various operating systems, including Windows and Ubuntu among others, and can also be pre-configured with applications such as SQL Server or web services. EC2 instances vary from low-cost to high-performance compute, memory, GPU, storage or general-purpose optimized. For example, some EC2 instances rely solely on EBS, S3, RDS or other AWS storage offerings, while others include on-board Solid State Disk (SSD), like the DAS SSD found on traditional servers. EC2 instances on EBS volumes can be snapshotted to S3 storage, which in turn can be replicated to another region.
EBS – Scalable block accessible storage for EC2 instances that can be configured for performance or bulk storage, as well as for persistent images for EC2 instances (if you choose to configure your instance to be persistent)
EFS – New file (aka NAS) accessible storage service accessible from EC2 instances in various AZ’s in a given AWS region
Glacier – Cloud based near-line (or by some comparisons off-line) cold-storage archives.
RDS – Relational Database Services for SQL and other data repositories
S3 – Provides durable, scalable low-cost bulk (aka object) storage accessible from inside AWS as well as externally. S3 can be used by EC2 instances for bulk durable storage as well as being used as a target for EBS snapshots.
Learn more about EC2, EBS, S3, Glacier, Regions, AZ’s and other AWS topics in this primer here

What is EFS

EFS implements NFS V4 (SNIA NFS V4 primer), providing network attached storage (NAS), meaning data sharing. AWS is indicating initial pricing for EFS at $0.30 per GByte per month. EFS is designed for storage and data sharing from multiple EC2 instances in different AZ’s in the same AWS region, with scalability into the PBs.
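
As a rough back-of-the-envelope sketch of what the indicated preview price could mean, the following Python snippet multiplies capacity by the $0.30 per GByte per month figure above. The function name and sample capacities are made up for illustration, 1 TByte is assumed to be 1,024 GBytes, and actual AWS billing (metering granularity, any other charges) may differ.

```python
# USD per GByte per month: the initial EFS price indicated in the preview announcement
PRICE_PER_GBYTE_MONTH = 0.30


def efs_monthly_cost(gbytes, price=PRICE_PER_GBYTE_MONTH):
    """Estimated monthly cost in USD for an average stored capacity in GBytes."""
    if gbytes < 0:
        raise ValueError("capacity cannot be negative")
    return round(gbytes * price, 2)


# Hypothetical capacities: 100 GBytes, 1 TByte and 10 TBytes
for gbytes in (100, 1024, 10 * 1024):
    print(f"{gbytes:>6} GBytes -> ${efs_monthly_cost(gbytes):,.2f} per month")
```

For comparison shopping, the same function can be called with a different price argument for other storage services, keeping in mind that file, block and object storage are not interchangeable on price alone.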

What EFS is not

Currently it seems that EFS has an end-point inside AWS, accessible via an EC2 instance, like EBS. That is, the storage service appears to be accessible only to AWS EC2 instances, unlike S3, which can be accessed from the outside world as well as via EC2 instances.

Note however, that depending on how you configure your EC2 instance with different software, as well as configure a Virtual Private Cloud (VPC) and other settings, it is possible to have an application, software tool or operating system running on EC2 accessible from the outside world. For example, NAS software such as those from SoftNAS and NetApp among many others can be installed on an EC2 instance and with proper configuration, as well as being accessible to other EC2 instances, they can also be accessed from outside of AWS (with proper settings and security).

AWS EFS at this time is NFS version 4 based; however, it does not support Windows SMB/CIFS, HDFS or other NAS access protocols. In addition, AWS EFS is accessible from multiple AZ’s within a region. To share NAS data across regions, some other software would be required.

As of this writing EFS is not yet released, and AWS is currently accepting requests to join the EFS preview here.

Where to learn more

Here are some links to learn more about AWS S3 and related topics

Cross-Region Replication for Amazon S3
Cloud conversations: If focused on cost you might miss other cloud storage benefits
Data Protection Diaries
Cloud Conversations: AWS overview and primer
Eight Ways to Avoid Cloud Storage Pricing Surprises
Cloud and Object Storage Center
Are more than five nines of availability really possible?
How do primary storage clouds and cloud for backup differ?
What’s most important to know about my cloud privacy policy?

What this all means and wrap-up

AWS continues to extend its cloud platform, including both compute and storage offerings. EFS complements EBS along with S3, Glacier and RDS. For many environments NFS support will be welcome, while for others CIFS/SMB would be appreciated, and others are starting to find value in HDFS accessible NAS.

Overall I like this announcement and look forward to moving beyond the preview.

Ok, nuff said, for now..

Cheers gs

March 29, 2015 | April 27, 2025

March 2015 Server StorageIO Update Newsletter

Volume 15, Issue III

Hello and welcome to this March 2015 Server and StorageIO update newsletter. Here in the northern hemisphere spring is here, at least by the calendar, although weather wise winter continues to linger in some areas. In the US, March also means college and university sports tournaments, with many focused on their NCAA men’s basketball championship brackets.

Besides various college championships, March also has a connection to backup and data protection. Thus this month's newsletter has a focus on data protection; after all, March 31 is World Backup Day, which means it should also be World Restore test day!

Focus on Data Protection

Data protection including backup/restore, business continuance (BC), disaster recovery (DR), business resiliency (BR) and archiving across physical, virtual and cloud environments.

Data Protection Fundamentals

A reminder on the importance of data protection, including backup, BC, DR and related technologies, is to make sure they are occurring as planned. Also test your copies and remember the 4 3 2 1 rule or guide.

4 – Versions (different time intervals)
3 – Copies of critical data (including versions)
2 – Different media, devices or systems
1 – Off-site (cloud or elsewhere)

The above means having at least four (4) different versions of your data from various points in time. Having three (3) copies, including various versions, protects against one or more copies being corrupt or damaged. Place those versions and copies on at least two (2) different storage systems, devices or media in case something happens to one of them.

While it might be common sense, a bad April Fools recovery joke would be finding out all of your copies were on the same device which is damaged. That might seem obvious however sometimes the obvious needs to be stated. Also make sure that at least one (1) of your copies is off-site either on off-line media (tape, disk, ssd, optical) or cloud.

Take a few moments to verify that your data protection strategy is being implemented and practiced as intended. Also test what is being copied: not only restore the data from cloud, disk, ssd or tape, but also make sure you can actually read or use the data being protected. This means making sure that your security credentials, including access certificates and decryption, work as expected.
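
As one way to turn the 4 3 2 1 guide into a quick self-check, here is a minimal Python sketch that counts versions, copies, media and off-site placement from a simple inventory. The BackupCopy record, its field names and the sample inventory are assumptions made up for illustration, and "3 copies" is interpreted here as three or more copy records in total; it does not replace actually testing a restore.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BackupCopy:
    version: str   # point-in-time label, for example a date
    media: str     # device, system or media holding the copy
    offsite: bool  # True if the copy is stored off-site (cloud or elsewhere)


def check_4321(copies):
    """Return a pass/fail result for each part of the 4 3 2 1 guide."""
    return {
        "4 versions": len({c.version for c in copies}) >= 4,
        "3 copies": len(copies) >= 3,
        "2 media": len({c.media for c in copies}) >= 2,
        "1 off-site": any(c.offsite for c in copies),
    }


# Hypothetical inventory of protected copies
inventory = [
    BackupCopy("2015-03-28", "local disk", False),
    BackupCopy("2015-03-29", "local disk", False),
    BackupCopy("2015-03-30", "tape", False),
    BackupCopy("2015-03-31", "cloud", True),
]

for rule, ok in check_4321(inventory).items():
    print(f"{rule}: {'ok' if ok else 'MISSING'}")
```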

Watch for more news, updates, industry trends perspectives, commentary, tips, articles and other information at Storageio.com, StorageIOblog.com, various partner venues as well as in future newsletters.