A BEGINNER S GUIDE TO NEXT GENERATION OBJECT STORAGE

Size: px
Start display at page:

Download "A BEGINNER S GUIDE TO NEXT GENERATION OBJECT STORAGE"

Transcription

1 A BEGINNER S GUIDE TO NEXT GENERATION OBJECT STORAGE WHITEPAPER INTRODUCTION Object Storage is the new storage paradigm. There is a high level of interest from organizations, as this new approach resolves the challenges of efficiently storing massive volumes of unstructured data - Big Unstructured Data. This paper addresses the why, what and how of object storage. WHY SHOULD COMPANIES USE OBJECT STORAGE FOR UNSTRUCTURED DATA AND HOW IS IT DIFFERENT FROM NAS OR SAN? The biggest problem with traditional approaches is scalability. NAS lacks the ability to scale as a single system, especially in Petabyte environments. Today s SANs are already complex, when deployed with a file system layer on top. Scaling-out makes the problem a lot worse. Object Storage is essentially just a different way of storing, organizing and accessing data on disk. An Object Storage platform provides a storage infrastructure to store files with lots of metadata added to them referred to as objects. The backend architecture of an object storage platform is designed to present all the storage nodes as one single pool. With Object Storage, there is no file system hierarchy. The architecture of the platform, and its new data protection schemes (vs. RAID, the de-facto data protection scheme for SAN and NAS), allow this pool to scale virtually to an unlimited size, while keeping the system simple to manage. Users access object storage through applications that typically use a REST API (an internet protocol, optimized for online applications). This makes object storage ideal for all online, cloud, environments. When objects are stored, an identifier is created to locate the object in the pool. Applications can very quickly retrieve the right data for the users through the object identifier or by querying the metadata (information about the objects, like the name, when it was created, by who etc.). This approach enables significantly faster access and much less overhead than locating a file through a traditional file system. DDN WOS is a true object storage platform, designed to scale beyond petabytes as a single system, optimizing TCO without compromising performance or durability. This makes WOS a perfect platform for a variety of storage cloud solutions, including online collaboration, active archives, cloud backup and worldwide data distribution.

2 TABLE OF CONTENTS INTRODUCTION...2 Why should companies use Object Storage for unstructured data and how is it different from NAS or SAN?... 2 HISTORY OF OBJECT STORAGE...3 WHAT IS OBJECT STORAGE?...3 SAN vs NAS... 4 Object Storage, The Third Paradigm... 4 Cloud Storage, Storage Clouds, Object Storage... 6 REST API s... 7 DirectProtect Real-Time Error Detection & Correction... 7 Object Storage Summary... 7 WHY OBJECT STORAGE?...7 Massive Data Growth... 7 Always Online... 7 Power to the Applications... 8 The Big Data Explosion... 8 We All Use Object Storage Everyday... 8 Use Cases... 9 HOW DOES OBJECT STORAGE WORK?...9 Issues with File Storage... 9 Data Protection: Erasure Coding or Not? WOS : WEB OBJECT SCALER...13 True Object Storage Platform Optimized for Small and Large Files Choice of Data Protection Schemes Self-healing Architecture Single Storage Infrastructure Widest Selection of Interfaces; Out of the Box Applications Enterprise-grade Platform WOS Benefits ECOSYSTEM...14 RESOURCES... 16

3 HISTORY OF OBJECT STORAGE Object Storage is not a new concept, it even predates the 2006 launch of Amazon s S3. Much advancement have been made, and today s current generation of Object Storage platforms cannot be compared to the earlier generations - which were merely black boxes designed to store immutable copies of documents, mostly for compliance environments. EMC Centera, based on Content Addressed Storage (CAS) innovator Filepool, was one such early implementation of an object-based construct. Today, Centera users face a big challenge to move to newer, faster storage infrastructures: Centera provides no interfaces to migrate data to other platforms. THE INDUSTRY HAS LEARNED SOME TOUGH LESSONS USING PROPRIETARY SYSTEMS. The current generation of object storage platforms is designed with this openness & flexibility in mind. Most platforms support a subset of Amazon s REST API and some platforms are designed to be independent of the hardware platform. The industry has learned some tough lessons from using proprietary systems. One initiative to prevent Vendor Lock-in, is SNIA s Cloud Data Management Interface (CDMI). This is a set of pre-defined RESTful HTTP operations for assessing the capabilities of the cloud storage system, allocating and accessing containers and objects, managing users and groups, implementing access control, attaching metadata, billing, moving data between cloud systems, exporting data, etc. 1 Figure 1 Object Storage Archit ecture WHAT IS OBJECT STORAGE? Object Storage is essentially just a different way of storing, organizing and accessing data on disk. To really understand how object storage is different from traditional storage platforms, it is important to understand the what and how of traditional storage, and what the challenges are 2. 1 From Wikipedia: 2 For the sake of briefness, we will stick to the very basics. There are hundreds, if not thousands of blog articles and papers about this topic available online. 3

4 SAN VS NAS A SAN is block storage device, not that different from an external USB disk drive, just bigger. Systems connect to a SAN with a block interface; common protocols for block storage include iscsi, Fibre Channel, Fibre Channel over Ethernet (FCoE), etc. A device attaching to SAN will see the storage presented as a disk drive. SANs allow multiple servers to share a pool of storage that cannot be accessed by individual users. This is to prevent the overwriting each other s data. SANs are typically used by large applications, such as enterprise databases, that handle data locking through the application. SAN storage can be presented as a file system (by putting a file system layer on top), which is generally referred to as a clustered file system. As we will explain later in this document, SANs are complex systems to manage, especially when used for file storage. A NAS is a file storage device. NAS exposes its storage as a network file system. Devices that attach to a NAS see a mountable file system. Common protocols for file storage devices include, NFS and SMB / CIFS. A NAS operates at the file level and is accessible to users with proper access rights - so it needs to manage user privileges, file locking and other security measures. A NAS environment is a much better fit than SANs for to store files. THE BIGGEST CHALLENGE WITH NAS AND SAN IS SYSTEM SCALABILITY. Figure 2 - Simplified SAN infrastructure with Clustered file system and enterprise applications OBJECT STORAGE, THE THIRD PARADIGM So, if NAS & SAN can store files, then, why Object Storage? How is it different? As we will explain in greater detail to follow, the biggest problem with both systems is scalability. NAS cannot scale as a single system in petabyte-size environments. To scale-out a NAS environment requires a combination of multiple systems (management!) or forklift upgrades, with labor-intensive data migration projects. As we mentioned before, SANs are pretty complex when deployed with a file system layer on top. Scaling-out makes the problem a lot worse. Again, with lots of management! Also, most of the unstructured data that is stored online (or in active archives) is immutable data meaning the file will not be modified. Much of the functionality built-in to traditional file systems addresses user access rights and permissions for appending and amending files. These complex functions create a lot of overhead in terms of performance, IOs required to access data and the 4

5 OBJECT STORAGE IS A MUCH SIMPLER ARCHITECTURE THAN TRADITIONAL FILE SYSTEMS. Figure 3 - Object Storage Timeline ability to scale. Object Storage does not have this functionality. If a user modifies a file, the new version is simply stored as a new object. This results in a much simpler architecture than traditional file systems have. An Object Storage platform is a storage infrastructure to store objects. For now we will refer to objects as similar to files (collection of data blocks with metadata), later in this document we will explain how this is actually only partly true. The backend architecture of an object storage platform is designed so that all the storage nodes are presented as one single pool. There is no file system Figure 4 - Scale out object storage with simple REST API for applications 5

6 APPLICATIONS CAN RETRIEVE OBJECTS MUCH FASTER THAN FILES. Figure 5 - NAS Storage is presented as a file system to the clients REST API S HAVE BECOME THE PREDOMINANT INTERFACE FOR CLOUD APPLICATIONS. hierarchy. The architecture of the platform, and new data protection schemes (vs. RAID, the defacto data protection scheme for SAN and NAS) allow this pool to scale to virtually unlimited capacities, while keeping the system simple to manage. Users access object storage through applications that will typically use a REST API. They use a set of simple commands: GET (read), PUT (save) and DELETE. REST is an internet protocol, optimized for online applications. This makes object storage ideal for all online, Cloud, environments. When objects are stored, an identifier is created to locate the object in the pool. Applications can very quickly retrieve the right data for the users through the object identifier - or by querying the metadata (information about objects: name, when it was created, by who, etc.). This is much faster than attempting to locate a file through a traditional file system. Applications also handle user access management. Each time a file (object) is changed, it is stored as a new object. This prevents corruption through simultaneous changes. CLOUD STORAGE, STORAGE CLOUDS, OBJECT STORAGE What is the difference between Cloud Storage and Storage Clouds? How does Object Storage fit? The answer is pretty straightforward. Cloud Storage is the storage used for Compute Cloud infrastructures - in other words: to run VM s on. Compute Clouds are very IOPS intensive and usually block storage is used in these applications. Storage Clouds are storage in the cloud, whether public or private. So, Storage Clouds are simply storage capacity that is made available through the Internet. Most of today s storage clouds use object storage technologies. 6

7 LITTLE PROGRESS HAS BEEN MADE ON API STANDARDIZATION. REST API S REST stands for Representational State Transfer. It is a software architecture that is used for distributed application environments, such as the internet. An API, short for Application Programming Interface, is an interface used for an application (client) to talk to its environment (backend servers, storage, databases etc.). With the success of cloud-style computing (running applications in the cloud, rather than on the user s computer), REST API s have become the predominant interface for cloud applications to connect to the cloud. For storage-centric cloud applications, a REST API is the interface between the application and the object storage platform. DIRECTPROTECT REAL-TIME ERROR DETECTION & CORRECTION The three most common commands in REST API s for object storage environments are GET, PUT and DELETE, which are the equivalents of reading a file, saving a file (technically save as because object storage does not allow you to update an object), or deleting a file. Since the early days of Cloud Computing, there s been a lot of discussion about standardizing on a specific REST API to avoid vendor lock-in. The general idea behind this is, if all vendors (of applications, cloud infrastructures, object storage platforms etc.) use a standard API, users will never be locked-in to a specific environment. Without having to reprogram their applications, they would be able to freely move their data from one platform to another - or keep it on more than one platform. Little progress has been made on the standardization front however, and the result is that object storage platforms will either support the Amazon S3 API, the OpenStack API or a native API (i.e. an API of their own, typically a very easy to use, lightweight interface). OBJECT STORAGE SUMMARY Data is stored as objects in one large, scalable pool of storage Objects are stored with metadata information about the object An Object ID is stored, to locate the data REST is the standard interface, simple commands used by applications Objects are immutable; edits are saved as a new object WHY OBJECT STORAGE? IN 10 YEARS, THE AVERAGE STORAGE OPERATOR WILL HAVE TO MANAGE TIMES MORE STORAGE THAN TODAY. MASSIVE DATA GROWTH Depending on which analyst firm you talk to, you will hear storage growth predictions that vary between 30x-40x for the next decade. That means we will all be storing 30 to 40 times as much digital data ten years from now, compared to today. At the same time, companies will only invest an additional 50% in personnel to manage their storage infrastructures. This means that the average storage operator will have to manage times as much storage a decade from now. This will drive the need for storage platforms that require little management effort and scale out to virtually unlimited capacities. ALWAYS ONLINE Much of that data growth is driven by the recent innovations in cloud and mobile computing. We already mentioned Amazon S3, but there are also Google, Facebook, Apple and several smaller public storage cloud offerings that set a new level of expectations where all data needs to be available anywhere at anytime. 7

8 80% OF DATA GROWTH WILL BE UNSTRUCTURED DATA POWER TO THE APPLICATIONS File-based storage platforms not only fail to scale sufficiently, they also become obsolete as more and more applications are designed to use REST API s (the default interface for object storage platforms) to talk directly to the storage, without additional (file system) layers in between. This greatly simplifies architectures and delivers significant performance gains. THE BIG DATA EXPLOSION Essentially, there are three types of Big Data: Big Structured Data, Big Semi-structured Data (Big Data Analytics) and Big Unstructured Data. All three require one or more of the three V s, the commonly accepted definition of Big Data: Big Data refers to any set of data that comes in great Volumes, has a large Variety of information and/or is consumed at high Velocity. Big Structured Data refers to large enterprise databases. Velocity is key here, hence the success of the superfast SSD drives. Big Semistructured Data refers to massive volumes of small log files (often sensor information), that is collected for analytics. Therefore, we also talk about Big Data Analytics. This data is stored in distributed frameworks that support distributed processing. Think of Hadoop and MapReduce. Object Storage is not commonly used for structured or semi-structured Big Data, unless it is for archival purposes. The sweet spot for object storage is Big Unstructured Data, which refers to all data that users best understand as files. Think of image and movie data always growing and always in higher resolution music files, office documents etc. Analysts believe that 80% or more of the expected data growth will be unstructured data. File based storage platforms cannot support this growth. This is the problem Object Storage solves. AMAZON ADVERTISES VERY LOW COSTS TO STORE DATA, BUT THERE ARE HIDDEN COSTS. WE ALL USE OBJECT STORAGE EVERYDAY Possibly the largest object storage infrastructure, and one of the drivers for the adoption of object storage is Amazon s S3. This public storage cloud service was launched in 2006 and has stimulated many application developers to have their applications use S3 as backend storage. The benefits were clear: no hassle with a private infrastructure, relatively low cost, pay as you go, scale as needed and a very simple interface. However, while Amazon advertises very low cost to store data on their infrastructure, there are some hidden costs such as network traffic. At a certain volume of data, there is a point of cost inflection. Many of the startups that launched on Amazon over the past years and who clearly see the benefits of object storage, are now deploying their own infrastructure using object storage platforms. Also, not everyone wants their data in a public environment with debatable SLA s and moderate security at best. More and more enterprises are choosing to deploy their own internal storage clouds to facilitate cloud-based applications. These infrastructures need similar or better availability and durability than the available public services. 8

9 USE CASES Object Storage is more than a smarter paradigm that allows you to store large volumes of unstructured data. Features like massive scalability, REST APIs, geographic distribution, enable a series of compelling use cases. An interesting side effect is that solutions tend to overlap. Dropbox is not just file sharing, it s backup, collaboration, archiving and mobile storage. Here are a few popular use cases: Online Web Services: As we mentioned earlier, one of the drivers for object storage is the trend to use more and more online cloud applications. Previously, without Amazon s S3, none of this would have been possible. The more successful web services companies are now gradually making the move to in-house infrastructures. Also, with corporate security policies, IP and compliance considerations, most enterprises prefer to run cloud applications on private storage infrastructures. File Sharing is by far the most popular object storage use case. Dropbox offered a solution for a need that most of us did not know we had. Today, service providers are now deploying similar services and enterprises are deploying private file sharing services as people utilize a variety of devices at home, at work and on-the-go. They collaborate with people across the office or around the world. Cloud Backup is increasingly popular. There are dozens and dozens of online services for backup. For enterprises, the idea of backing up to low cost, highly scalable disk infrastructures - rather than tape, which can be cumbersome for recovery - is also very compelling Cloud Archives: Data archiving decisions used to be very simple: data that was infrequently accessed was moved off disk to tape. Very few arguments could beat the low TCO of tape. Disk archives were hard to justify and reserved for those exceptional use cases where latency outweighed the huge cost of disk archives. With object storage, it is now possible to deploy disk archives at an acquisition cost and TCO close to that of tape. Many organizations are opting for hybrid environments - with a really, really superfast hot disk tier and a very cheap cold tape tier. Worldwide Collaboration: Globally distributed teams have become standard practice. Think of researchers from different institutions working on the same project. Think of a movie being shot in New Zealand and produced in Los Angeles - or software being developed in California and then tested in India. Geographically distributed storage pools enable teams to work in real-time on the same datasets. HOW DOES OBJECT STORAGE WORK? ISSUES WITH FILE STORAGE As we explained earlier in this document, file based storage is a great concept: users can access the same resources through a corporate network. The file system takes care of permissions, access rights and avoids users overwriting each other s data. File systems can even present data in a hierarchical Directory structure, which until now has been a very useful tool to keep data organized. The underlying software for such file systems contains a lot of ingenuity, which rapidly becomes complexity when scaling-out the infrastructure. 9

10 FILES ARE ABSTRACTIONS OF DATA IN A COMPUTER SYSTEM AND ARE REALLY STORED IN FIXED SIZE BLOCKS WHICH ARE ADDRESSED WITH A NUMBER. The concept of a file on a computer system is so well ingrained that it is often difficult to think of computer storage in different terms. It is clearly a very powerful and natural way to think of data. Object Storage is distinct from file storage, but in some ways is even a more natural and powerful way to organize data. The file concept is an abstraction. In actuality, data in a computer system is stored in fixed size blocks which are addressed with a number which ultimately is a physical location in a storage device. This is the case for data stored in a NAS, a SAN or when using Object Storage. The system presents those blocks to the user or application in a form that is useful. For nontransactional data, that form is usually a file. File storage systems will also store a small amount of information that tells which of those data blocks make up the file, in which order, and finally what name has been applied to the collection of blocks. This additional data about data is called file system metadata. Keeping track of the file system metadata is the responsibility of the file system. To keep even more than a few files organized, a file system imposes a hierarchy on the metadata in the form of directory structure. A key concept of the file system is the notion that the files themselves have relationships to one another as one could think of files being co- located in a directory. When the system is instructed to read a file, the repository of file system metadata is consulted and the required data blocks are retrieved from the storage device. Writing data into a file system has the additional complexity of requiring that the file system metadata must be written or updated - potentially by several users or processes simultaneously. Numerous techniques and designs exist that attempt to minimize the impact of dealing with file system metadata, and the locking problem associated with simultaneous access. Unfortunately, as the number of files in the system grows large, keeping the file system metadata correctly organized (so that the names and the data blocks that make up files can be found) becomes increasingly complex. As this requirement increases, keeping track of billions of files (which may be distributed across a number of network connected computer systems), the abstraction of the file system begins to breakdown. Moreover, the hierarchical structure of the file system is insufficient to adequately categorize the data in the system. File systems require at least three layers of software constructs to execute any file operation. As they allow files to be amended by multiple users, they must maintain complex lock structures with OPEN and CLOSE semantics. These lock structures must be distributed coherently to all of the servers used for access. AS FILE SYSTEMS GROW, THE ABSTRACTION OF THE FILE SYSTEM BEGINS TO BREAKDOWN AND INSUFFICIENTLY CATEGORIZES DATA. 10

11 IN AN OBJECT STORAGE SYSTEM, THE PHYSICAL STORAGE BLOCKS ARE ORGANIZED INTO OBJECTS WHICH ARE COLLECTIONS OF DATA BLOCKS REPRESENTED BY AN IDENTIFIER. Also, as data is placed (based on random block availability), traditional file systems are always fragmented. This is especially true in environments where the data is unstructured and it is not uncommon to write widely varied file sizes. Using a traditional file system designed for amendable data, storing immutable data constitutes an inappropriate and wasteful use of bandwidth and compute resources. This highly inefficient approach requires a great deal of additional hardware and network resources to achieve data distribution goals. These systems now become exponentially more complex as they are scaled-out. Object storage systems dispense with the overburdened concept of file system metadata. This approach allows the system to separate the storage of data from the relationship that the individual data items have to each other. In an object storage system, the physical storage blocks are organized into objects which are collections of data blocks represented by an identifier. There is no hierarchy imposed on the data and no repository of the objects metadata to be consulted when reads or writes are requested. This approach allows an object storage system to scale with both the requirements and size of the system, well beyond the technical & practical boundaries of traditional file systems. While Object Storage systems do not use file system metadata, they do employ object metadata (customizable information about the objects). This information can later be used to query or analyze the information stored. Object metadata for a photo could be the day it was taken, the last time it was modified, the type of camera that was used, whether a flash was used, where it was taken, etc. Object metadata will play an increasingly important role as we store more and more information, but it does not add complexity to the system like file system metadata does. At the highest level, storage servers are, like NAS and SAN, simply boxes with a lot of disks in there. Typically, object storage vendors will use SATA disks in their systems, and may include SSDs for caching. Some platforms opt for separate controllers, but in essence that does not make a difference, as the storage is presented as one pool (namespace). When choosing an object storage platform, it s important to understand the limitations of the namespace and how the system combines different pools or namespaces. Many vendors claim infinite scalability, but there is no such thing. The important thing is to understand how namespaces are combined, presented and managed. How many such namespaces can be combined? Are they managed as one system? The system software manages most of that. The actual software layer is where vendors can differentiate. The list of possible features is endless. A single management interface is always great. Self-healing capabilities are a must for environments that will scale into the hundreds of petabytes. The software layer also provides data protection mechanisms, which we will cover in the next section.an Object ID is stored, to locate the data The standard interface to access data in an object storage platform is a RESTful interface or REST API. This is a set of simple commands that application developers use in their code to let the application access the data. The basic REST commands are LIST, GET, PUT and DELETE, which are used to list (a selection of) objects, read an object, store an object or delete it. There is no standard for REST yet, but the so called Amazon API is by far the most popular amongst developers. Hence, most object storage providers will provide an Amazon compatible API, which is typically a subset of the commands that are supported by Amazon S3. As most legacy applications were designed to interface with a file system, most object storage platforms will also provide one or more file interfaces (a file system layer on top of the object storage pool also called a file system gateway) and often a selection of programming language-specific API s will be provided as well. DDN s WOS has the widest selection of interfaces on the market. 11

12 THE PROBLEM WITH RAID IS THAT IT WAS ORIGINALLY DESIGNED FOR SMALL CAPACITY DISKS. WOS WOS HAS THE WIDEST SELECTION OF INTERFACES ON THE MARKET. WOS WAS DESIGNED AS A TRUE OBJECT STORAGE PLATFORM. DATA PROTECTION: ERASURE CODING OR NOT? Several data protection mechanisms are used for object storage. The one method that is being abandoned, however, is RAID (which has been the de facto data protection scheme for SAN and NAS for the past two decades). The problem with RAID is that it was originally designed for small capacity disks. The larger the drive capacity, the longer it takes to restore a failed drive. During this restore, the data is less protected. If you are on RAID 5, and a second disk fails in your RAID group during the lengthy restore, then data loss will occur. Also, as all processing capacity is used for the restore, users will experience severe performance drops as data is being written to the replacement disk. Large systems with hundreds of TBs or Petabytes, will routinely be in constant rebuild mode, as drives routinely fail. In an effort to shorten these longer rebuild cycles, RAID systems ship with faster processors, which also consume more energy, but this only masks the problem at best. The simplest way to protect data is to make several copies, replication. A popular concept is called three copies in the cloud, promoted mostly by public cloud platforms like Amazon S3 and Rackspace. While three copies in the cloud provides acceptable data protection, it is also very lucrative for the cloud provider as they are in the business of selling more storage capacity. Swift, the object storage component of Rackspace s open source cloud infrastructure also uses pure replication. A more efficient data protection mechanism is erasure coding. Today, there are several flavors of erasure coding, each one with its own benefits. Erasure coding s key advantage is that you can break up your data into n fragments, add m additional fragments, store the fragments across n+m devices, and then recover the original data from any n of the devices. Survive 4 failures? 10? Pick a number! Also when a disk is lost, the system only has to calculate new fragments, to be spread over any selection of disks with available capacity. This is a lot faster and more efficient than restoring an entire RAID-based disk, even if it was only 20% full. Erasure coding can be implemented locally or distributed, which means that fragments are spread over multiple data centers (at least three) and the system can survive failure of a full datacenter. Distributed erasure coding drastically reduces the overhead (the extra storage that is needed to protect the data). Five 9 s or more are guaranteed with overhead numbers as low as 20% - as opposed to 3 copies requiring 200% overhead. The problem with distributed erasure coding is that it creates a huge WAN cost, as rebuilds require data transfer between the data centers. Also, the availability highly depends on the WAN connectivity. Each data read from a distributed erasure coding pool, requires data to be read from three data centers. If any of these connections have high latency, the user will notice the delayed response. An interesting, best of both worlds solution is local erasure coding with replication. Such architecture combines the benefits of erasure coding with those of replication (a full copy is present in each datacenter). While such a setup requires more overhead than the distributed alternative, the TCO is typically a lot lower due to the reduced WAN traffic. 12

13 WEB OBJECT SCALER DDN s legacy is designing high-performance storage systems, but without making things more complex than they need to be. WOS is the perfect example of achieving operational excellence through reverse engineering - stripping the architecture down to the very basics. The architecture of WOS consists of three components: WOS building blocks, WOS Core software and a choice of simple interfaces. The backend of a WOS storage infrastructure are the WOS storage nodes. The storage nodes are essentially 4U servers filled with 60 SATA disks. A WOS infrastructure can contain as few as 3 nodes and scales to virtually unlimited capacity by adding more nodes. Smart storage requires intelligent software. WOS Core has a single, straightforward management console for the entire infrastructure - even when distributed across multiple sites. WOS Core s self-healing capabilities and other features drastically reduce operator-driven maintenance interventions. WOS provides the most complete choice of interfaces, including a set of native API s, file access interfaces and S3 REST. TRUE OBJECT STORAGE PLATFORM Most object storage platforms still have a POSIX file system layer on the disk level. WOS, however, was designed as a true object storage platform, a flat, single layer, address structure where objects are stored in a contiguous group of blocks so that disk operations are minimized (singledisk-operation reads dual-operation writes), performance is maximized and disks are used at full capacity. OPTIMIZED FOR SMALL AND LARGE FILES WOS is the only object storage system that is optimized for high-speed throughput of large data volumes and super-fast I/O operations for small files. For multi-site deployments, the built-in WOS Latency-Aware Access Manager will automatically address data access requests to the location with the lowest latency. WOS IS THE ONLY OBJECT STORAGE PLATFORM THAT SEAMLESSLY INTEGRATES WITH OTHER STORAGE TIERS. CHOICE OF DATA PROTECTION SCHEMES WOS offers a choice of data protection mechanisms to ensure the highest data durability AND availability. Reduce your storage overhead while maximizing durability for single site deployments with local ObjectAssure, DDN s implementation of Erasure Coding. Alternatively, you can choose Replicated ObjectAssure, to improve availability without increasing WAN costs. Finally, ObjectAssure can be implemented in a distributed way to ensure higher durability, at a lower cost. SELF-HEALING ARCHITECTURE Keeping traditional storage infrastructures healthy is management-intensive. Disks need to be replaced and restored. Rebuild windows need to be kept to a minimum to avoid data loss and preserve application performance. This is not the case with WOS. The built-in data protection algorithm, ObjectAssure has unique self-healing capabilities that further reduce the management effort. Also, in case of a broken disk, ObjectAssure only has to reconstruct the actual data that was lost - as opposed to the entire disk. This dramatically reduces the rebuild window. SINGLE STORAGE INFRASTRUCTURE WOS is the only object storage platform that seamlessly integrates with other storage tiers. It has one management interface for the entire infrastructure and supports easy data movement between different tiers, e.g. from GPFS to WOS and back, or from and to Lustre environments. 13

14 UNLIKE COMMODITY HARDWARE, WOS WAS DESIGNED TO REDUCE TCO. WOS SCALES IN CLUSTERS AS LARGE AS 30PB WIDEST SELECTION OF INTERFACES; OUT OF THE BOX APPLICATIONS WOS provides the most complete choice of interfaces, including a set of native API s, file access interfaces and S3 REST. In addition, WOS can be configured with preinstalled applications such as irods for data management or WOS Share for secure global file sharing. ENTERPRISE-GRADE PLATFORM Most vendors recommend commodity hardware for their object storage platforms. In the short term, this could mean initial CAPEX savings, but as such devices typically have shorter replacement cycles, this highly impacts the OPEX further down the road. This is especially so for multi petabyte deployments. While WOS was designed to be hardware agnostic, we designed the WOS 7000 hardware to reduce TCO. Unlike commodity hardware, the WOS 7000 has an ultra dense form factor, so there are fewer systems to house, manage, power, cool and maintain. Leveraging over 15 years of hardware design for the most demanding HPC environments, WOS 7000 was built to run many more years than cheaper commodity hardware. WOS BENEFITS Lowest Global Access Latency WOS was designed with the intent of maximizing performance for storage of massive volumes of immutable data. Scales with All Varieties of Applications WOS scales virtually unlimited in clusters as large as 30PB. Those clusters can consist of any mix of small (kilobytes) or large (terabytes) files. Best Durability & System Availability WOS choice of data protection schemes allows the customer to deploy object storage that combines durability with availability. Lowest Administration Overhead, Lowest TCO Through automated management, lower hardware costs, less power usage, simple architecture, optimized disk usage and reduced WAN bandwidth usage; WOS enables organizations to store more data at a much lower cost. Simple Integration Integrate WOS with your GRIDScaler GPFS storage or your EXAScaler Lustre platform. Use WOS as an archive for your HScaler Big Data Storage, or build an Active Archive of WOS with a tape library for offline cold archiving. Maximum Portability WOS features the most complete set of interfaces to facilitate your application integration, including C++ and Java APIs for direct application integration, REST for web applications (S3 or not) and file gateways to support file-based workflows. Best Data Center Density Designed for massive HPC deployments, WOS 7000 provides the highest data center density possible. ECOSYSTEM Object Storage is clearly the hot space in the storage industry, with offerings from both startups and established storage solution providers. But, there is more than just object storage on the market: object storage has fostered a wave of innovation that enables or leverages the paradigm. The list of Tier 2 object storage players can be endless, especially when including the application providers. Here is a short selection of popular gateways, WAN optimizers, collaboration platforms and other applications. This should help to provide a better understanding of the object storage ecosystem and the opportunities and use cases. 14

15 THERE ARE MANY DIFFERENT FLAVORS OF OBJECT STORAGE ON THE MARKET. Ctera CTERA leverages object storage to offer a range of solutions for SMBs, enterprise branch offices and remote users, including: data backup and recovery, file-based collaboration and mobile access. Mezeo also provides a number of storage solutions that leverage object storage, including: an AWS compatible REST API and a number of file sync and share clients that give users access and collaboration capabilities from their PC/Mac, smartphone, tablet or browser interface. Panzura built a NAS gateway for storage clouds. The gateway enables enterprises to combine multiple storage (cloud) resources and make them accessible to multiple locations, presented as a unified global file system. Aspera both leverages and facilitates object storage. On the one hand, they have a number of applications for collaboration, distribution etc., but the core of their technology is a protocol that optimizes how data is sent from the object storage pool, over the WAN to a user application - or between sites, if an object storage infrastructure is distributed over multiple locations. Bitspeed and Silverpeak are active in the same space: WAN optimization, which enables faster, more reliable and] secure data transfer between storage sites - or between the object storage pool and the application. These technologies are becoming increasingly important in the deployment of object storage based storage clouds. Dropbox is probably the best-know object storage success case. This early AWS S3 customer launched a file-sharing application when no one even knew they needed one. The power of Dropbox lies in their use of deduplication (when multiple users store the same file in their Dropbox, only one copy is kept). This way, Dropbox saves a lot on storage costs. Deduplication is not new, but Dropbox pioneered its use in an online, object storage based application. This also allowed them to quickly gain a large user base through a fermium model, which would have been unaffordable otherwise. Box(.net) also started as an online file sharing application but with some very important differences. Box runs on their own (object storage) infrastructure, which gave them more control over security, data integrity etc. (as compared to using S3). This allowed them to bring their solution to the SMB and Enterprise markets. Today, Box.net grew to what can probably best be described as a storage-centric Platform as a Service, enabling organizations to customize apps, integrate with their own applications etc. Netflix, which launched as a DVD rental by mail is an early adopter of object storage: in 2007 it launched a movie streaming service which would disrupt the market. Well before Apple added movies and tv shows to their store, Netflix leveraged S3 to offer movies in an online format. Apple, Google and Facebook also have massive object storage deployments, but little is known about their architectures. Apple and Google are going after the S3 end users with document sharing and other storage in the cloud services. With this, they compete both with Amazon and the applications that use S3 such as Dropbox and Evernote. 15

16 RESOURCES FOR MORE INFORMATION VISIT: ABOUT DDN DataDirect Networks (DDN) is the world s leading big data storage supplier to data-intensive, global organizations. For more than 15 years, DDN has designed, developed, deployed and optimized systems, software and solutions that enable enterprises, service providers, universities and government agencies to generate more value and to accelerate time to insight from their data and information, on premise and in the cloud. Organizations leverage the power of DDN technology and the deep technical expertise of its team to capture, store, process, analyze, collaborate and distribute data, information and content at largest scale in the most efficient, reliable and cost effective manner. DDN customers include many of the world s leading financial services firms and banks, healthcare and life science organizations, manufacturing and energy companies, government and research facilities, and web and cloud service providers. For more information, visit our website or call SALES@DDN.COM DataDirect Networks, Inc. All Rights Reserved. DataDirect Networks, EXAScaler, GRIDScaler, ReACT, SFA12K, SFA, SFX, Storage Fusion Xceleration, Web Object Storage, WOS are trademarks of DataDirect Networks. Other Names and Brands May Be Claimed as the Property of Others. v2 (5/15)

A Beginner s Guide To Next Generation Object Storage. ddn.com. DDN Whitepaper. Tom Leyden, Director of Product Marketing WOS

A Beginner s Guide To Next Generation Object Storage. ddn.com. DDN Whitepaper. Tom Leyden, Director of Product Marketing WOS DDN Whitepaper A Beginner s Guide To Next Generation Object Storage Tom Leyden, Director of Product Marketing WOS 2013 DataDirect Networks. All Rights Reserved. Executive Summary Object Storage is the

More information

WOS. High Performance Object Storage

WOS. High Performance Object Storage Datasheet WOS High Performance Object Storage The Big Data explosion brings both challenges and opportunities to businesses across all industry verticals. Providers of online services are building infrastructures

More information

WOS 360 FULL SPECTRUM OBJECT STORAGE

WOS 360 FULL SPECTRUM OBJECT STORAGE WOS 360 FULL SPECTRUM OBJECT STORAGE WHITEPAPER Collaborate Distribute Archive EXECUTIVE SUMMARY The industry has come to understand that an alternative storage methodology is required to effi..ciently.and.securely.store.the.exabytes.of.unstructured.information.we.generate.every.day..

More information

ANY SURVEILLANCE, ANYWHERE, ANYTIME

ANY SURVEILLANCE, ANYWHERE, ANYTIME ANY SURVEILLANCE, ANYWHERE, ANYTIME WHITEPAPER DDN Storage Powers Next Generation Video Surveillance Infrastructure INTRODUCTION Over the past decade, the world has seen tremendous growth in the use of

More information

WOS OBJECT STORAGE PRODUCT BROCHURE DDN.COM 1.800.837.2298. 360 Full Spectrum Object Storage

WOS OBJECT STORAGE PRODUCT BROCHURE DDN.COM 1.800.837.2298. 360 Full Spectrum Object Storage PRODUCT BROCHURE WOS OBJECT STORAGE 360 Full Spectrum Object Storage The promise of object storage is simple: to enable organizations to build highly Performance Scalability Reliability Efficiency Security

More information

WOS Cloud. ddn.com. Personal Storage for the Enterprise. DDN Solution Brief

WOS Cloud. ddn.com. Personal Storage for the Enterprise. DDN Solution Brief DDN Solution Brief Personal Storage for the Enterprise WOS Cloud Secure, Shared Drop-in File Access for Enterprise Users, Anytime and Anywhere 2011 DataDirect Networks. All Rights Reserved DDN WOS Cloud

More information

HadoopTM Analytics DDN

HadoopTM Analytics DDN DDN Solution Brief Accelerate> HadoopTM Analytics with the SFA Big Data Platform Organizations that need to extract value from all data can leverage the award winning SFA platform to really accelerate

More information

ECMWF HPC Workshop: Accelerating Data Management

ECMWF HPC Workshop: Accelerating Data Management October 2012 ECMWF HPC Workshop: Accelerating Data Management Massively-Scalable Platforms and Solutions Engineered for the Big Data and Cloud Era Glenn Wright Systems Architect, DDN Data-Driven Paradigm

More information

Object storage in Cloud Computing and Embedded Processing

Object storage in Cloud Computing and Embedded Processing Object storage in Cloud Computing and Embedded Processing Jan Jitze Krol Systems Engineer DDN We Accelerate Information Insight DDN is a Leader in Massively Scalable Platforms and Solutions for Big Data

More information

Scala Storage Scale-Out Clustered Storage White Paper

Scala Storage Scale-Out Clustered Storage White Paper White Paper Scala Storage Scale-Out Clustered Storage White Paper Chapter 1 Introduction... 3 Capacity - Explosive Growth of Unstructured Data... 3 Performance - Cluster Computing... 3 Chapter 2 Current

More information

Geospatial Imaging Cloud Storage Capturing the World at Scale with WOS TM. ddn.com. DDN Whitepaper. 2011 DataDirect Networks. All Rights Reserved.

Geospatial Imaging Cloud Storage Capturing the World at Scale with WOS TM. ddn.com. DDN Whitepaper. 2011 DataDirect Networks. All Rights Reserved. DDN Whitepaper Geospatial Imaging Cloud Storage Capturing the World at Scale with WOS TM Table of Contents Growth and Complexity Challenges for Geospatial Imaging 3 New Solutions to Drive Insight, Simplicity

More information

Introduction to NetApp Infinite Volume

Introduction to NetApp Infinite Volume Technical Report Introduction to NetApp Infinite Volume Sandra Moulton, Reena Gupta, NetApp April 2013 TR-4037 Summary This document provides an overview of NetApp Infinite Volume, a new innovation in

More information

T a c k l i ng Big Data w i th High-Performance

T a c k l i ng Big Data w i th High-Performance Worldwide Headquarters: 211 North Union Street, Suite 105, Alexandria, VA 22314, USA P.571.296.8060 F.508.988.7881 www.idc-gi.com T a c k l i ng Big Data w i th High-Performance Computing W H I T E P A

More information

Caringo Swarm 7: beyond the limits of traditional storage. A new private cloud foundation for storage needs at scale

Caringo Swarm 7: beyond the limits of traditional storage. A new private cloud foundation for storage needs at scale Caringo Swarm 7: beyond the limits of traditional storage. A new private cloud foundation for storage needs at scale Prepared for: Caringo May 2014 TABLE OF CONTENTS TABLE OF CONTENTS 1 EXECUTIVE SUMMARY

More information

NetApp Big Content Solutions: Agile Infrastructure for Big Data

NetApp Big Content Solutions: Agile Infrastructure for Big Data White Paper NetApp Big Content Solutions: Agile Infrastructure for Big Data Ingo Fuchs, NetApp April 2012 WP-7161 Executive Summary Enterprises are entering a new era of scale, in which the amount of data

More information

DDN updates object storage platform as it aims to break out of HPC niche

DDN updates object storage platform as it aims to break out of HPC niche DDN updates object storage platform as it aims to break out of HPC niche Analyst: Simon Robinson 18 Oct, 2013 DataDirect Networks has refreshed its Web Object Scaler (WOS), the company's platform for efficiently

More information

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything BlueArc unified network storage systems 7th TF-Storage Meeting Scale Bigger, Store Smarter, Accelerate Everything BlueArc s Heritage Private Company, founded in 1998 Headquarters in San Jose, CA Highest

More information

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage Growth of Unstructured Data & Object Storage Marcel Laforce Sr. Director, Object Storage Agenda Unstructured Data Growth Contrasting approaches: Objects, Files & Blocks The Emerging Object Storage Market

More information

CTERA Enterprise File Services Platform Architecture for HP Helion Content Depot

CTERA Enterprise File Services Platform Architecture for HP Helion Content Depot CTERA Enterprise File Services Platform Architecture for HP Helion Content Depot Whitepaper by CTERA Networks Highlights How unstructured data growth drives cloud storage adoption Putting cloud storage

More information

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

MaxDeploy Hyper- Converged Reference Architecture Solution Brief MaxDeploy Hyper- Converged Reference Architecture Solution Brief MaxDeploy Reference Architecture solutions are configured and tested for support with Maxta software- defined storage and with industry

More information

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how EMC Elastic Cloud Storage (ECS ) can be used to streamline the Hadoop data analytics

More information

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products MaxDeploy Ready Hyper- Converged Virtualization Solution With SanDisk Fusion iomemory products MaxDeploy Ready products are configured and tested for support with Maxta software- defined storage and with

More information

Object Storage: Out of the Shadows and into the Spotlight

Object Storage: Out of the Shadows and into the Spotlight Technology Insight Paper Object Storage: Out of the Shadows and into the Spotlight By John Webster December 12, 2012 Enabling you to make the best technology decisions Object Storage: Out of the Shadows

More information

With DDN Big Data Storage

With DDN Big Data Storage DDN Solution Brief Accelerate > ISR With DDN Big Data Storage The Way to Capture and Analyze the Growing Amount of Data Created by New Technologies 2012 DataDirect Networks. All Rights Reserved. The Big

More information

High Performance Server SAN using Micron M500DC SSDs and Sanbolic Software

High Performance Server SAN using Micron M500DC SSDs and Sanbolic Software High Performance Server SAN using Micron M500DC SSDs and Sanbolic Software White Paper Overview The Micron M500DC SSD was designed after months of close work with major data center service providers and

More information

OPTIMIZING PRIMARY STORAGE WHITE PAPER FILE ARCHIVING SOLUTIONS FROM QSTAR AND CLOUDIAN

OPTIMIZING PRIMARY STORAGE WHITE PAPER FILE ARCHIVING SOLUTIONS FROM QSTAR AND CLOUDIAN OPTIMIZING PRIMARY STORAGE WHITE PAPER FILE ARCHIVING SOLUTIONS FROM QSTAR AND CLOUDIAN CONTENTS EXECUTIVE SUMMARY The Challenges of Data Growth SOLUTION OVERVIEW 3 SOLUTION COMPONENTS 4 Cloudian HyperStore

More information

HGST Object Storage for a New Generation of IT

HGST Object Storage for a New Generation of IT Enterprise Strategy Group Getting to the bigger truth. SOLUTION SHOWCASE HGST Object Storage for a New Generation of IT Date: October 2015 Author: Scott Sinclair, Storage Analyst Abstract: Under increased

More information

Big data management with IBM General Parallel File System

Big data management with IBM General Parallel File System Big data management with IBM General Parallel File System Optimize storage management and boost your return on investment Highlights Handles the explosive growth of structured and unstructured data Offers

More information

CTERA Cloud Storage Platform Architecture

CTERA Cloud Storage Platform Architecture CTERA Cloud Storage Platform Architecture Whitepaper by CTERA Networks Highlights How unstructured data growth drives cloud storage adoption The killer apps of cloud storage reviewed Putting cloud storage

More information

Designing a Cloud Storage System

Designing a Cloud Storage System Designing a Cloud Storage System End to End Cloud Storage When designing a cloud storage system, there is value in decoupling the system s archival capacity (its ability to persistently store large volumes

More information

ANY THREAT, ANYWHERE, ANYTIME Scalable.Infrastructure.to.Enable.the.Warfi.ghter

ANY THREAT, ANYWHERE, ANYTIME Scalable.Infrastructure.to.Enable.the.Warfi.ghter WHITEPAPER ANY THREAT, ANYWHERE, ANYTIME Scalable.Infrastructure.to.Enable.the.Warfi.ghter THE BIG DATA CHALLENGE AND OPPORTUNITY The.proliferation,.management.and.analysis.of.intelligence.data.is.a.fast.growing.concern.

More information

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved. Object Storage: A Growing Opportunity for Service Providers Prepared for: White Paper 2012 Neovise, LLC. All Rights Reserved. Introduction For service providers, the rise of cloud computing is both a threat

More information

Amazon Cloud Storage Options

Amazon Cloud Storage Options Amazon Cloud Storage Options Table of Contents 1. Overview of AWS Storage Options 02 2. Why you should use the AWS Storage 02 3. How to get Data into the AWS.03 4. Types of AWS Storage Options.03 5. Object

More information

Red Hat Storage Server

Red Hat Storage Server Red Hat Storage Server Marcel Hergaarden Solution Architect, Red Hat marcel.hergaarden@redhat.com May 23, 2013 Unstoppable, OpenSource Software-based Storage Solution The Foundation for the Modern Hybrid

More information

I D C T E C H N O L O G Y S P O T L I G H T. T i m e t o S c ale Out, Not Scale Up

I D C T E C H N O L O G Y S P O T L I G H T. T i m e t o S c ale Out, Not Scale Up I D C T E C H N O L O G Y S P O T L I G H T M a naging the Explosion of Enterprise Data: T i m e t o S c ale Out, Not Scale Up July 2014 Adapted from Scale-Out Meets Virtualization by Ashish Nadkarni,

More information

Breaking the Storage Array Lifecycle with Cloud Storage

Breaking the Storage Array Lifecycle with Cloud Storage Breaking the Storage Array Lifecycle with Cloud Storage 2011 TwinStrata, Inc. The Storage Array Lifecycle Anyone who purchases storage arrays is familiar with the many advantages of modular storage systems

More information

IBM ELASTIC STORAGE SEAN LEE

IBM ELASTIC STORAGE SEAN LEE IBM ELASTIC STORAGE SEAN LEE Solution Architect Platform Computing Division IBM Greater China Group Agenda Challenges in Data Management What is IBM Elastic Storage Key Features Elastic Storage Server

More information

StorReduce Technical White Paper Cloud-based Data Deduplication

StorReduce Technical White Paper Cloud-based Data Deduplication StorReduce Technical White Paper Cloud-based Data Deduplication See also at storreduce.com/docs StorReduce Quick Start Guide StorReduce FAQ StorReduce Solution Brief, and StorReduce Blog at storreduce.com/blog

More information

Maxta Storage Platform Enterprise Storage Re-defined

Maxta Storage Platform Enterprise Storage Re-defined Maxta Storage Platform Enterprise Storage Re-defined WHITE PAPER Software-Defined Data Center The Software-Defined Data Center (SDDC) is a unified data center platform that delivers converged computing,

More information

Managing the Unmanageable: A Better Way to Manage Storage

Managing the Unmanageable: A Better Way to Manage Storage Managing the Unmanageable: A Better Way to Manage Storage Storage growth is unending, but there is a way to meet the challenge, without worries about scalability or availability. October 2010 ISILON SYSTEMS

More information

Technology Insight Series

Technology Insight Series Evaluating Storage Technologies for Virtual Server Environments Russ Fellows June, 2010 Technology Insight Series Evaluator Group Copyright 2010 Evaluator Group, Inc. All rights reserved Executive Summary

More information

SwiftStack Filesystem Gateway Architecture

SwiftStack Filesystem Gateway Architecture WHITEPAPER SwiftStack Filesystem Gateway Architecture March 2015 by Amanda Plimpton Executive Summary SwiftStack s Filesystem Gateway expands the functionality of an organization s SwiftStack deployment

More information

August 2009. Transforming your Information Infrastructure with IBM s Storage Cloud Solution

August 2009. Transforming your Information Infrastructure with IBM s Storage Cloud Solution August 2009 Transforming your Information Infrastructure with IBM s Storage Cloud Solution Page 2 Table of Contents Executive summary... 3 Introduction... 4 A Story or three for inspiration... 6 Oops,

More information

Protecting Big Data Data Protection Solutions for the Business Data Lake

Protecting Big Data Data Protection Solutions for the Business Data Lake White Paper Protecting Big Data Data Protection Solutions for the Business Data Lake Abstract Big Data use cases are maturing and customers are using Big Data to improve top and bottom line revenues. With

More information

Block based, file-based, combination. Component based, solution based

Block based, file-based, combination. Component based, solution based The Wide Spread Role of 10-Gigabit Ethernet in Storage This paper provides an overview of SAN and NAS storage solutions, highlights the ubiquitous role of 10 Gigabit Ethernet in these solutions, and illustrates

More information

VMware Software-Defined Storage Vision

VMware Software-Defined Storage Vision VMware Software-Defined Storage Vision Lee Dilworth (@leedilworth) Principal Systems Engineer 2014 VMware Inc. All rights reserved. The Software-Defined Data Center Expand virtual compute to all applications

More information

Quantum StorNext. Product Brief: Distributed LAN Client

Quantum StorNext. Product Brief: Distributed LAN Client Quantum StorNext Product Brief: Distributed LAN Client NOTICE This product brief may contain proprietary information protected by copyright. Information in this product brief is subject to change without

More information

D e c e n t r a lized Scale - Out Ar c h i t e c t u r e s

D e c e n t r a lized Scale - Out Ar c h i t e c t u r e s I D C T E C H N O L O G Y S P O T L I G H T Object-Based Storage: The Need for D e c e n t r a lized Scale - Out Ar c h i t e c t u r e s February 2015 Adapted from IDC MarketScape: Worldwide Object-Based

More information

WHITE PAPER. Reinventing Large-Scale Digital Libraries With Object Storage Technology

WHITE PAPER. Reinventing Large-Scale Digital Libraries With Object Storage Technology WHITE PAPER Reinventing Large-Scale Digital Libraries With Object Storage Technology CONTENTS Introduction..........................................................................3 Hitting The Limits

More information

EMC CLOUDARRAY PRODUCT DESCRIPTION GUIDE

EMC CLOUDARRAY PRODUCT DESCRIPTION GUIDE EMC CLOUDARRAY PRODUCT DESCRIPTION GUIDE INTRODUCTION IT organizations today grapple with two critical data storage challenges: the exponential growth of data and an increasing need to keep more data for

More information

IBM Global Technology Services September 2007. NAS systems scale out to meet growing storage demand.

IBM Global Technology Services September 2007. NAS systems scale out to meet growing storage demand. IBM Global Technology Services September 2007 NAS systems scale out to meet Page 2 Contents 2 Introduction 2 Understanding the traditional NAS role 3 Gaining NAS benefits 4 NAS shortcomings in enterprise

More information

Understanding Object Storage and How to Use It

Understanding Object Storage and How to Use It SWIFTSTACK WHITEPAPER An IT Expert Guide: Understanding Object Storage and How to Use It November 2014 The explosion of unstructured data is creating a groundswell of interest in object storage, certainly

More information

Object Oriented Storage and the End of File-Level Restores

Object Oriented Storage and the End of File-Level Restores Object Oriented Storage and the End of File-Level Restores Stacy Schwarz-Gardner Spectra Logic Agenda Data Management Challenges Data Protection Data Recovery Data Archive Why Object Based Storage? The

More information

TECHNICAL WHITE PAPER: ELASTIC CLOUD STORAGE SOFTWARE ARCHITECTURE

TECHNICAL WHITE PAPER: ELASTIC CLOUD STORAGE SOFTWARE ARCHITECTURE TECHNICAL WHITE PAPER: ELASTIC CLOUD STORAGE SOFTWARE ARCHITECTURE Deploy a modern hyperscale storage platform on commodity infrastructure ABSTRACT This document provides a detailed overview of the EMC

More information

Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows

Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows Sponsored by: Prepared by: Eric Slack, Sr. Analyst May 2012 Storage Infrastructures for Big Data Workflows Introduction Big

More information

THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved.

THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved. THE EMC ISILON STORY Big Data In The Enterprise 2012 1 Big Data In The Enterprise Isilon Overview Isilon Technology Summary 2 What is Big Data? 3 The Big Data Challenge File Shares 90 and Archives 80 Bioinformatics

More information

IBM Global Technology Services November 2009. Successfully implementing a private storage cloud to help reduce total cost of ownership

IBM Global Technology Services November 2009. Successfully implementing a private storage cloud to help reduce total cost of ownership IBM Global Technology Services November 2009 Successfully implementing a private storage cloud to help reduce total cost of ownership Page 2 Contents 2 Executive summary 3 What is a storage cloud? 3 A

More information

Building Storage Clouds for Online Applications A Case for Optimized Object Storage

Building Storage Clouds for Online Applications A Case for Optimized Object Storage Building Storage Clouds for Online Applications A Case for Optimized Object Storage Agenda Introduction: storage facts and trends Call for more online storage! AmpliStor: Optimized Object Storage Cost

More information

EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise

EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise ESSENTIALS Easy-to-use, single volume, single file system architecture Highly scalable with

More information

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration Solutions Integrated Storage Appliances Management Optimized Storage & Migration Archive Data Retention & Compliance Services Global Installation & Support SECURING THE FUTURE OF YOUR DATA w w w.q sta

More information

Collaborative Research Infrastructure Deployments. ddn.com. Accelerate > DDN Case Study

Collaborative Research Infrastructure Deployments. ddn.com. Accelerate > DDN Case Study DDN Case Study Accelerate > Collaborative Research Infrastructure Deployments University College London Transforms Research Collaboration and Data Preservation with Scalable Cloud Object Storage Appliance

More information

EMC IRODS RESOURCE DRIVERS

EMC IRODS RESOURCE DRIVERS EMC IRODS RESOURCE DRIVERS PATRICK COMBES: PRINCIPAL SOLUTION ARCHITECT, LIFE SCIENCES 1 QUICK AGENDA Intro to Isilon (~2 hours) Isilon resource driver Intro to ECS (~1.5 hours) ECS Resource driver Possibilities

More information

Introduction to Red Hat Storage. January, 2012

Introduction to Red Hat Storage. January, 2012 Introduction to Red Hat Storage January, 2012 1 Today s Speakers 2 Heather Wellington Tom Trainer Storage Program Marketing Manager Storage Product Marketing Manager Red Hat Acquisition of Gluster What

More information

Cloud OS Vision. Modern platform for the world s apps

Cloud OS Vision. Modern platform for the world s apps Cloud OS Vision Modern platform for the world s apps transforms the datacenter unlocks insights on any data empowers people-centric IT enables modern business apps Development Management Identity Data

More information

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved.

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved. Getting performance & scalability on standard platforms, the Object vs Block storage debate 1 December Webinar Session Getting performance & scalability on standard platforms, the Object vs Block storage

More information

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN Whitepaper NexentaConnect for VMware Virtual SAN Full Featured File services for Virtual SAN Table of Contents Introduction... 1 Next Generation Storage and Compute... 1 VMware Virtual SAN... 2 Highlights

More information

Building Storage-as-a-Service Businesses

Building Storage-as-a-Service Businesses White Paper Service Providers Greatest New Growth Opportunity: Building Storage-as-a-Service Businesses According to 451 Research, Storage as a Service represents a large and rapidly growing market with

More information

Data Centric Computing Revisited

Data Centric Computing Revisited Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data

More information

Diagram 1: Islands of storage across a digital broadcast workflow

Diagram 1: Islands of storage across a digital broadcast workflow XOR MEDIA CLOUD AQUA Big Data and Traditional Storage The era of big data imposes new challenges on the storage technology industry. As companies accumulate massive amounts of data from video, sound, database,

More information

How To Make A Backup System More Efficient

How To Make A Backup System More Efficient Identifying the Hidden Risk of Data De-duplication: How the HYDRAstor Solution Proactively Solves the Problem October, 2006 Introduction Data de-duplication has recently gained significant industry attention,

More information

Driving the New Paradigm of Software Defined Storage Solutions White Paper July 2014

Driving the New Paradigm of Software Defined Storage Solutions White Paper July 2014 Sanapptx Driving the New Paradigm of Software Defined Storage Solutions White Paper July 2014 Page 1 The New Software Defined Storage Opportunity Organizations everywhere are experiencing an explosion

More information

SECURE, ENTERPRISE FILE SYNC AND SHARE WITH EMC SYNCPLICITY UTILIZING EMC ISILON, EMC ATMOS, AND EMC VNX

SECURE, ENTERPRISE FILE SYNC AND SHARE WITH EMC SYNCPLICITY UTILIZING EMC ISILON, EMC ATMOS, AND EMC VNX White Paper SECURE, ENTERPRISE FILE SYNC AND SHARE WITH EMC SYNCPLICITY UTILIZING EMC ISILON, EMC ATMOS, AND EMC VNX Abstract This white paper explains the benefits to the extended enterprise of the on-

More information

3 common cloud challenges eradicated with hybrid cloud

3 common cloud challenges eradicated with hybrid cloud 3 common cloud eradicated 3 common cloud eradicated Cloud storage may provide flexibility and capacityon-demand benefits but it also poses some difficult that have limited its widespread adoption. Consequently,

More information

EMC XTREMIO EXECUTIVE OVERVIEW

EMC XTREMIO EXECUTIVE OVERVIEW EMC XTREMIO EXECUTIVE OVERVIEW COMPANY BACKGROUND XtremIO develops enterprise data storage systems based completely on random access media such as flash solid-state drives (SSDs). By leveraging the underlying

More information

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures 1 Refreshing Your Data Protection Environment with Next-Generation Architectures Dale Rhine, Principal Sales Consultant Kelly Boeckman, Product Marketing Analyst Program Agenda Storage

More information

HyperQ Storage Tiering White Paper

HyperQ Storage Tiering White Paper HyperQ Storage Tiering White Paper An Easy Way to Deal with Data Growth Parsec Labs, LLC. 7101 Northland Circle North, Suite 105 Brooklyn Park, MN 55428 USA 1-763-219-8811 www.parseclabs.com info@parseclabs.com

More information

Redefining Microsoft SQL Server Data Management. PAS Specification

Redefining Microsoft SQL Server Data Management. PAS Specification Redefining Microsoft SQL Server Data Management APRIL Actifio 11, 2013 PAS Specification Table of Contents Introduction.... 3 Background.... 3 Virtualizing Microsoft SQL Server Data Management.... 4 Virtualizing

More information

WHITE PAPER. QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives

WHITE PAPER. QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives WHITE PAPER QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives CONTENTS Executive Summary....................................................................3 The Limits of Traditional

More information

The Design and Implementation of the Zetta Storage Service. October 27, 2009

The Design and Implementation of the Zetta Storage Service. October 27, 2009 The Design and Implementation of the Zetta Storage Service October 27, 2009 Zetta s Mission Simplify Enterprise Storage Zetta delivers enterprise-grade storage as a service for IT professionals needing

More information

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research Introduction to Cloud : Cloud and Cloud Storage Lecture 2 Dr. Dalit Naor IBM Haifa Research Storage Systems 1 Advanced Topics in Storage Systems for Big Data - Spring 2014, Tel-Aviv University http://www.eng.tau.ac.il/semcom

More information

Traditionally, a typical SAN topology uses fibre channel switch wiring while a typical NAS topology uses TCP/IP protocol over common networking

Traditionally, a typical SAN topology uses fibre channel switch wiring while a typical NAS topology uses TCP/IP protocol over common networking Network Storage for Business Continuity and Disaster Recovery and Home Media White Paper Abstract Network storage is a complex IT discipline that includes a multitude of concepts and technologies, like

More information

Implementing Multi-Tenanted Storage for Service Providers with Cloudian HyperStore. The Challenge SOLUTION GUIDE

Implementing Multi-Tenanted Storage for Service Providers with Cloudian HyperStore. The Challenge SOLUTION GUIDE Implementing Multi-Tenanted Storage for Service Providers with Cloudian HyperStore COST EFFECTIVE SCALABLE STORAGE PLATFORM FOR CLOUD STORAGE SERVICES SOLUTION GUIDE The Challenge Service providers (MSPs/ISPs/ASPs)

More information

How To Protect Data On Network Attached Storage (Nas) From Disaster

How To Protect Data On Network Attached Storage (Nas) From Disaster White Paper EMC FOR NETWORK ATTACHED STORAGE (NAS) BACKUP AND RECOVERY Abstract This white paper provides an overview of EMC s industry leading backup and recovery solutions for NAS systems. It also explains

More information

Cloudian delivers object storage for next generation infrastructures

Cloudian delivers object storage for next generation infrastructures Cloudian delivers object storage for next generation infrastructures Prepared for: Cloudian July 2014 TABLE OF CONTENTS TABLE OF CONTENTS 1 EXECUTIVE SUMMARY 2 INTRODUCTION 3 The challenges 3 Considering

More information

Accelerate > Converged Storage Infrastructure. DDN Case Study. ddn.com. 2013 DataDirect Networks. All Rights Reserved

Accelerate > Converged Storage Infrastructure. DDN Case Study. ddn.com. 2013 DataDirect Networks. All Rights Reserved DDN Case Study Accelerate > Converged Storage Infrastructure 2013 DataDirect Networks. All Rights Reserved The University of Florida s (ICBR) offers access to cutting-edge technologies designed to enable

More information

How To Manage A Single Volume Of Data On A Single Disk (Isilon)

How To Manage A Single Volume Of Data On A Single Disk (Isilon) 1 ISILON SCALE-OUT NAS OVERVIEW AND FUTURE DIRECTIONS PHIL BULLINGER, SVP, EMC ISILON 2 ROADMAP INFORMATION DISCLAIMER EMC makes no representation and undertakes no obligations with regard to product planning

More information

Four Ways High-Speed Data Transfer Can Transform Oil and Gas WHITE PAPER

Four Ways High-Speed Data Transfer Can Transform Oil and Gas WHITE PAPER Transform Oil and Gas WHITE PAPER TABLE OF CONTENTS Overview Four Ways to Accelerate the Acquisition of Remote Sensing Data Maximize HPC Utilization Simplify and Optimize Data Distribution Improve Business

More information

Modernizing Hadoop Architecture for Superior Scalability, Efficiency & Productive Throughput. ddn.com

Modernizing Hadoop Architecture for Superior Scalability, Efficiency & Productive Throughput. ddn.com DDN Technical Brief Modernizing Hadoop Architecture for Superior Scalability, Efficiency & Productive Throughput. A Fundamentally Different Approach To Enterprise Analytics Architecture: A Scalable Unit

More information

How To Use An Npm On A Network Device

How To Use An Npm On A Network Device WHITE PAPER: CA ARCserve Backup Network Data Management Protocol (NDMP) Network Attached Storage (NAS) Option: Integrated Protection for Heterogeneous NAS Environments CA ARCserve Backup: Protecting heterogeneous

More information

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000 Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000 Clear the way for new business opportunities. Unlock the power of data. Overcoming storage limitations Unpredictable data growth

More information

A Virtual Filer for VMware s Virtual SAN A Maginatics and VMware Joint Partner Brief

A Virtual Filer for VMware s Virtual SAN A Maginatics and VMware Joint Partner Brief A Virtual Filer for VMware s Virtual SAN A Maginatics and VMware Joint Partner Brief With the massive growth of unstructured data in today s enterprise environments, storage IT administrators are constantly

More information

EMC SOLUTION FOR SPLUNK

EMC SOLUTION FOR SPLUNK EMC SOLUTION FOR SPLUNK Splunk validation using all-flash EMC XtremIO and EMC Isilon scale-out NAS ABSTRACT This white paper provides details on the validation of functionality and performance of Splunk

More information

Scale-out NAS Unifies the Technical Enterprise

Scale-out NAS Unifies the Technical Enterprise Scale-out NAS Unifies the Technical Enterprise Panasas Inc. White Paper July 2010 Executive Summary Tremendous effort has been made by IT organizations, and their providers, to make enterprise storage

More information

WOS for Research. ddn.com. DDN Whitepaper. Utilizing irods to manage collaborative research. 2012 DataDirect Networks. All Rights Reserved.

WOS for Research. ddn.com. DDN Whitepaper. Utilizing irods to manage collaborative research. 2012 DataDirect Networks. All Rights Reserved. DDN Whitepaper WOS for Research Utilizing irods to manage collaborative research. 2012 DataDirect Networks. All Rights Reserved. irods and the DDN Web Object Scalar (WOS) Integration irods, an open source

More information

Cloud-Era File Sharing and Collaboration

Cloud-Era File Sharing and Collaboration Cloud-Era File Sharing and Collaboration Maginatics Cloud Storage Platform Perspective 2012 Neovise, LLC. All Rights Reserved. Report Published July 29, 2014 Introduction Distributed file systems have

More information

STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER

STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER STORAGE CENTER DATASHEET STORAGE CENTER Go Beyond the Boundaries of Traditional Storage Systems Today s storage vendors promise to reduce the amount of time and money companies spend on storage but instead

More information

Enterprise Data Lake Platforms: Deep Storage for Big Data and Analytics

Enterprise Data Lake Platforms: Deep Storage for Big Data and Analytics Insight Enterprise Data Lake Platforms: Deep Storage for Big Data and Analytics Ashish Nadkarni Laura DuBois IDC OPINION In the past 18 months or so, the term data lakes has surfaced as yet another phrase

More information

Hitachi NAS Platform and Hitachi Content Platform with ESRI Image

Hitachi NAS Platform and Hitachi Content Platform with ESRI Image W H I T E P A P E R Hitachi NAS Platform and Hitachi Content Platform with ESRI Image Aciduisismodo Extension to ArcGIS Dolore Server Eolore for Dionseq Geographic Uatummy Information Odolorem Systems

More information

EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage

EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage Applied Technology Abstract This white paper describes various backup and recovery solutions available for SQL

More information

Simple. Extensible. Open.

Simple. Extensible. Open. White Paper Simple. Extensible. Open. Unleash the Value of Data with EMC ViPR Global Data Services Abstract The following paper opens with the evolution of enterprise storage infrastructure in the era

More information