The Mojette Erasure Code: Application to fault tolerant Distributed File System (DFS)
|
|
|
- Ethelbert Daniel
- 10 years ago
- Views:
Transcription
1 The Mojette Erasure Code: Application to fault tolerant Distributed File System (DFS) Architecture de codes correcteurs d'erreurs Journée inter GDR ISIS et SoCSiP 4 Novembre 2014, salle B007, Télécom Bretagne Benoît Parrein, Université de Nantes, IRCCyN Lab, UMR 6597 Joint work with FIZIANS SAS
2 Outline MDS erasure codes FEC4Cloud project Mojette erasure code Performances Application to DFS: 2
3 Erasure codes (MDS property) k messages Encoding n code-words k received code-words Decoding k messages 3
4 FEC4Cloud Project ANR 2012 (appel Emergence) Partners: IRCCyN (lead), ISAE, SATT-Ouest Valorisation Budget: 256 K Duration: 24 months (product oriented) Goal: promoting erasure codes within Cloud storage infrastructure 4
5 The forward Mojette transform based on Radon transform [Guédon, 1995] compute 1D projections from a 2D geometrical buffer 5
6 Conditions of reconstruction Myron Katz criteria (1978) N N pi P i or qi Q i With a rectangular geometrical buffer of PxQ pixels And a projections set S = {(pi, qi)}. Mathematical Morphology for non rectangular shape [Normand, 1997] 6
7 Ancillary data for inversion Number of pixels (ixels) contributing to one bin Sum of coordinates (not represented here) 7
8 The reverse Mojette transform Check Katz Criteria (or mathematical morphology if necessary) While 2D geometrical buffer is not completely reconstructed do Find one-to-one correspondence into the projection set Retroprojection at the right location Update the projections (bins and ancillary data) 8
9 Properties of Mojette erasure code (1+є) MDS Systematic and non systematic coding Asynchronous reconstruction No algebraic constraints (as Galois fields) No prime size (as in MDS array or FRT) Linear complexity in coding/decoding [O(IN)] Soft coding and decoding 9
10 The reverse Mojette transform Check Katz Criteria (or mathematical morphology if necessary) While 2D geometrical buffer is not completely reconstructed do Find one-to-one correspondence into the projection set - costly Retroprojection at the right location Update the projections (bins and ancillary data) - costly 10
11 Optimizations (1/2) Deterministic path of reconstruction [Normand, 2006] (if geometrical buffer appears as a stripe) Example on a 4 lines geometrical buffer with 4 projections +drastic reduction in writes [engineers of Fizians, 2013] 11
12 Optimizations (2/2) processeur transform_forward: support pixel[0] 2) read(pixel[0])... pixel[128]... pixel[255] 6) read(pixel[128]) 1) write(bin[0],0) 2)data1 = read(pixel[0]) 3)data2 = read(bin[0]) 4) data3 = data2 xor data1 5) write(bin[0],data3) 6) data1= read(pixel[128] 7)data2 = read(bin[0]) 8) data3 = data2 xor data1 9) write(bin[0]),data3 proj 1 1) write_bin(bin0,0) bin[0] 3) read_bin(bin[0]) 5) write_bin(bin0,data3) 8) read_bin(bin[0]) 9) write_bin(bin0,data3) ;... bin[127] Classical forward mojette transform Cadifra Evaluation 12
13 Optimizations (2/2) processeur support pixel[0]... pixel[128] transform_forward: 1) read(pixel[0] 2) read(pixel[128] 1)data1 = read(pixel[0]) proj 1 bin[0] 2) data2= read(pixel[128] 3) data3 = data2 xor data1 4) write_bin(bin0,data3) 4) write(bin[0],data3)... ;... pixel[255] bin[127] Optimized forward mojette transform [Féron et al., 2014] Cadifra Evaluation 13
14 Related works (software) Reed-Solomon (by Cauchy matrices [Byers, 1995]) Reed-Solomon (by Vandermonde matrices [Rizzo, 1998] now a RFC5510) Cauchy Good [Planck, 2008] in Jerasure 1.2 Intel ISA-L (includes SSE instructions)... 14
15 Performances (coding) Coding (k=4,m=6) CPU Cycles ISA-L d 4+2 coding +memcpy 2500 Mojette RozoFS 4+2 coding blocksize (Bytes) that means GB/s (resp GB/s with 4KB) for Mojette coding (in purple) and GB/s (resp GB/s with 4KB) for RS coding (in blue) x1.8 (resp. x2.22) faster (for a 3x more coding blocks) 15
16 Performances (decoding) Decoding (m=4,k=6) CPU cycles 3000 ISA-L d 4+2 decoding +memcpy Mojette RozoFS 4+2 decoding memcpy Blocksize (Bytes) that means 9.6 GB/s (resp GB/s with 4KB) for Mojette coding (in purple) and GB/s (resp GB/s with 4KB) for RS coding (in blue) x3 (resp. x3.25) faster (for a 2x more coding blocks) 16
17 The Mojette Erasure Code: Application to fault tolerant Distributed File System (DFS) Architecture de codes correcteurs d'erreurs Journée inter GDR ISIS et SoCSiP 4 Novembre 2014, salle B007, Télécom Bretagne Benoît Parrein, Université de Nantes, IRCCyN Lab, UMR 6597 Joint work with FIZIANS SAS
18 High availability means % reachable Copies and copies and copies... (up to 7 times) Hard disks and hard disks and hard disks... High consumption of energy Privacy problems Erasure codes reduce drastically the size of the infrastructure for the same availability rate (2x) and facilitate privacy policy 18
19 Distributed File Systems HDFS (Hadoop) Facebook file system (f4)...not really I/O centric CephFS, GlusterFS,... Scality (based on Chord)... Mix of replicas (hot data) and erasure coding (cold data) : none use erasure codes always 19
20 20
21 RozoFS I/O Centric Distributed File System POSIX Scale-out storage Commodity hardware Fault tolerance (up to 4 failures) Based on erasure coding (Mojette coding) Dedicated to cold and hot data Open source project 21
22 Client Node Metadata Server ol p r t n Co ath Rozofsmount exportd th a pa Dat metadata n Mo g rin ito Storage Node Storaged Storage Node Storage Node Storaged... Pool of storage Nodes Storaged 22
23 Read/Write function (in non-sys coding) In a layout 0 i.e (2, 3) coding i.e two projections are necessary for reconstruction 23
24 Testbed 24
25 Performances Sequential access: layout 0, 4K blocks...6 GB/s in read...3 GB/s in write 25
26 Performances Random access: layout 0, 4K blocks...100k IOPS in read...80k IOPS in write 26
27 RozoFS + Exportd External Network Rozofsmount Storage Rozofsmount Storage Rozofsmount Storage Rozofsmount Storage Niveau clients/applications Standard GigE Infrastructure GigE infrastructure (data storage and metadata)
28 Credits Sylvain David Pierre Evenou Alex Van Kempen Jeanpierre Guédon Quentin Lebourgeois Jean-Pierre Monchanin Didier Féron Louis Legouriellec UMR 6597 Dimitri Pertin Nicolas Normand Bastien Confais Christophe de la Guérrande Olivier Blin 28
29 Vielen Danke! 29
30 Backup slides 30
31 The storage in the world 40 Exabytes (1018 bytes) stored in EB (37%) in the Cloud(s) 7,5 EB (50%) video, images,... 31
32 Server type Fujistu RX300-S8 (R3008S0035FR) CPU model name 2 x Intel Xeon CPU E GHz (8 cores & 16 threads/core) Memory (GB) 64 GB RAID card RAID Controller SAS 6Gbit/s 1GB (D3116C) Virtual DRIVE 0 - Seagate Constellaton.2, SAS 6Gb/s, 1TB, 2.5", 7200 RPM (ST SS) - 11 drives - RAID 5 Virtual DRIVE 1 - Seagate Pulsar.2, SAS 6Gb/s, 100GB, 2.5", MLC (ST100FM0002) - 1 drive - RAID 0 Virtual DRIVE 2 - WD Xe, SAS 6Gb/s, 900GB, 2.5", RPM (WD9001BKHG) - 4 drives - RAID 0 Ethernet controllers - Intel 82599EB 10-Gigabit SFI/SFP+ - 2*10Gb - Intel I350 Gigabit Network - 2*1Gb - Intel I350 Gigabit Network - 4*1Gb 32
33 33
34 Conclusions RozoFS is an I/O centric distributed file system based on a erasure code (always) Performances: 100K IOPS, throughput of 6 Gbps... RozoFS follows up the infrastructure Apps: on line video editing, virtualisation (QEMU), database... participate to the convergence of cold and hot data Next: privacy (to check), grid5000 experiments (to come), deduplication (to attach) 34
RozoFS: a fault tolerant I/O intensive distributed file system based on Mojette erasure code
RozoFS: a fault tolerant I/O intensive distributed file system based on Mojette erasure code Workshop Autonomic Oct. 16-17 2014 Laas, Europe, Toulouse Benoît Parrein, Dimitri Pertin*, Nicolas Normand,
New Storage System Solutions
New Storage System Solutions Craig Prescott Research Computing May 2, 2013 Outline } Existing storage systems } Requirements and Solutions } Lustre } /scratch/lfs } Questions? Existing Storage Systems
Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com
Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...
technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels
technology brief RAID Levels March 1997 Introduction RAID is an acronym for Redundant Array of Independent Disks (originally Redundant Array of Inexpensive Disks) coined in a 1987 University of California
Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms
Distributed File System 1 How do we get data to the workers? NAS Compute Nodes SAN 2 Distributed File System Don t move data to workers move workers to the data! Store data on the local disks of nodes
How To Encrypt Data With A Power Of N On A K Disk
Towards High Security and Fault Tolerant Dispersed Storage System with Optimized Information Dispersal Algorithm I Hrishikesh Lahkar, II Manjunath C R I,II Jain University, School of Engineering and Technology,
Architecting a High Performance Storage System
WHITE PAPER Intel Enterprise Edition for Lustre* Software High Performance Data Division Architecting a High Performance Storage System January 2014 Contents Introduction... 1 A Systematic Approach to
Accelerate SQL Server 2014 AlwaysOn Availability Groups with Seagate. Nytro Flash Accelerator Cards
Accelerate SQL Server 2014 AlwaysOn Availability Groups with Seagate Nytro Flash Accelerator Cards Technology Paper Authored by: Mark Pokorny, Database Engineer, Seagate Overview SQL Server 2014 provides
THE SUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9
PRODUCT CATALOG THE SUMMARY ARKSERIES - pg. 3 ULTRASERIES - pg. 5 EXTREMESERIES - pg. 9 ARK SERIES THE HIGH DENSITY STORAGE FOR ARCHIVE AND BACKUP Unlimited scalability Painless Disaster Recovery The ARK
Isilon IQ Scale-out NAS for High-Performance Applications
Isilon IQ Scale-out NAS for High-Performance Applications Optimizing Performance with Isilon IQ Storage By Shai Harmelin, Sr. Solutions Architect An Isilon Systems Technical Whitepaper July 2009 ISILON
Milestone Solution Partner IT Infrastructure MTP Certification Report Scality RING Software-Defined Storage 11-16-2015
Milestone Solution Partner IT Infrastructure MTP Certification Report Scality RING Software-Defined Storage 11-16-2015 Table of Contents Introduction... 4 Certified Products... 4 Key Findings... 5 Solution
SMB Direct for SQL Server and Private Cloud
SMB Direct for SQL Server and Private Cloud Increased Performance, Higher Scalability and Extreme Resiliency June, 2014 Mellanox Overview Ticker: MLNX Leading provider of high-throughput, low-latency server
Building Storage Clouds for Online Applications A Case for Optimized Object Storage
Building Storage Clouds for Online Applications A Case for Optimized Object Storage Agenda Introduction: storage facts and trends Call for more online storage! AmpliStor: Optimized Object Storage Cost
Certification Document macle GmbH Grafenthal-S1212M 24/02/2015. macle GmbH Grafenthal-S1212M Storage system
macle GmbH Grafenthal-S1212M Storage system Executive summary After performing all tests, the macle GmbH Grafenthal-S1212M has been officially certified according to the Open-E Hardware Certification Program
Using Synology SSD Technology to Enhance System Performance Synology Inc.
Using Synology SSD Technology to Enhance System Performance Synology Inc. Synology_SSD_Cache_WP_ 20140512 Table of Contents Chapter 1: Enterprise Challenges and SSD Cache as Solution Enterprise Challenges...
Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers
Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers White Paper rev. 2015-11-27 2015 FlashGrid Inc. 1 www.flashgrid.io Abstract Oracle Real Application Clusters (RAC)
F600Q 8Gb FC Storage Performance Report Date: 2012/10/30
F600Q 8Gb FC Storage Performance Report Date: 2012/10/30 Table of Content IO Feature Highlights Test Configurations Maximum IOPS & Best Throughput Maximum Sequential IOPS Test Configurations Random IO
Intel Solid- State Drive Data Center P3700 Series NVMe Hybrid Storage Performance
Intel Solid- State Drive Data Center P3700 Series NVMe Hybrid Storage Performance Hybrid Storage Performance Gains for IOPS and Bandwidth Utilizing Colfax Servers and Enmotus FuzeDrive Software NVMe Hybrid
THESUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9
PRODUCT CATALOG THESUMMARY ARKSERIES - pg. 3 ULTRASERIES - pg. 5 EXTREMESERIES - pg. 9 ARKSERIES THE HIGH DENSITY STORAGE FOR ARCHIVE AND BACKUP Unlimited scalability Painless Disaster Recovery The ARK
Comparing Dynamic Disk Pools (DDP) with RAID-6 using IOR
Comparing Dynamic Disk Pools (DDP) with RAID-6 using IOR December, 2012 Peter McGonigal [email protected] Abstract Dynamic Disk Pools (DDP) offer an exciting new approach to traditional RAID sets by substantially
HP reference configuration for entry-level SAS Grid Manager solutions
HP reference configuration for entry-level SAS Grid Manager solutions Up to 864 simultaneous SAS jobs and more than 3 GB/s I/O throughput Technical white paper Table of contents Executive summary... 2
Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage
Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage Technical white paper Table of contents Executive summary... 2 Introduction... 2 Test methodology... 3
RAID 5 rebuild performance in ProLiant
RAID 5 rebuild performance in ProLiant technology brief Abstract... 2 Overview of the RAID 5 rebuild process... 2 Estimating the mean-time-to-failure (MTTF)... 3 Factors affecting RAID 5 array rebuild
IBM System x GPFS Storage Server
IBM System x GPFS Storage Crispin Keable Technical Computing Architect 1 IBM Technical Computing comprehensive portfolio uniquely addresses supercomputing and mainstream client needs Technical Computing
Freezing Exabytes of Data at Facebook s Cold Storage. Kestutis Patiejunas ([email protected])
Freezing Exabytes of Data at Facebook s Cold Storage Kestutis Patiejunas ([email protected]) 1990 vs. 2014 Seagate 94171-327 (300MB) iphone 5 16 GB Specs Value Form 3.5" Platters 5 Heads 9 Capacity 300MB
BookKeeper. Flavio Junqueira Yahoo! Research, Barcelona. Hadoop in China 2011
BookKeeper Flavio Junqueira Yahoo! Research, Barcelona Hadoop in China 2011 What s BookKeeper? Shared storage for writing fast sequences of byte arrays Data is replicated Writes are striped Many processes
Blueprints for Scalable IBM Spectrum Protect (TSM) Disk-based Backup Solutions
Blueprints for Scalable IBM Spectrum Protect (TSM) Disk-based Backup Solutions Jason Basler Software Test Architect IBM Technical University/Symposia materials may not be reproduced in whole or in part
POSIX and Object Distributed Storage Systems
1 POSIX and Object Distributed Storage Systems Performance Comparison Studies With Real-Life Scenarios in an Experimental Data Taking Context Leveraging OpenStack Swift & Ceph by Michael Poat, Dr. Jerome
Performance Report Modular RAID for PRIMERGY
Performance Report Modular RAID for PRIMERGY Version 1.1 March 2008 Pages 15 Abstract This technical documentation is designed for persons, who deal with the selection of RAID technologies and RAID controllers
Big Fast Data Hadoop acceleration with Flash. June 2013
Big Fast Data Hadoop acceleration with Flash June 2013 Agenda The Big Data Problem What is Hadoop Hadoop and Flash The Nytro Solution Test Results The Big Data Problem Big Data Output Facebook Traditional
Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1
Introduction - 8.1 I/O Chapter 8 Disk Storage and Dependability 8.2 Buses and other connectors 8.4 I/O performance measures 8.6 Input / Ouput devices keyboard, mouse, printer, game controllers, hard drive,
Mit Soft- & Hardware zum Erfolg. Giuseppe Paletta
Mit Soft- & Hardware zum Erfolg IT-Transformation VCE Converged and Hyperconverged Infrastructure VCE VxRack EMC VSPEX Blue IT-Transformation IT has changed dramatically in last past years The requirements
VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014
VMware SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014 VMware SAN Backup Using VMware vsphere Table of Contents Introduction.... 3 vsphere Architectural Overview... 4 SAN Backup
GeoGrid Project and Experiences with Hadoop
GeoGrid Project and Experiences with Hadoop Gong Zhang and Ling Liu Distributed Data Intensive Systems Lab (DiSL) Center for Experimental Computer Systems Research (CERCS) Georgia Institute of Technology
Comparing SMB Direct 3.0 performance over RoCE, InfiniBand and Ethernet. September 2014
Comparing SMB Direct 3.0 performance over RoCE, InfiniBand and Ethernet Anand Rangaswamy September 2014 Storage Developer Conference Mellanox Overview Ticker: MLNX Leading provider of high-throughput,
Introducing NetApp FAS2500 series. Marek Stopka Senior System Engineer ALEF Distribution CZ s.r.o.
Introducing NetApp FAS2500 series Marek Stopka Senior System Engineer ALEF Distribution CZ s.r.o. Complex data storage portfolio Corporate Data Centers Cloud Data Centers Flash Arrays for extreme performance
GPFS Storage Server. Concepts and Setup in Lemanicus BG/Q system" Christian Clémençon (EPFL-DIT)" " 4 April 2013"
GPFS Storage Server Concepts and Setup in Lemanicus BG/Q system" Christian Clémençon (EPFL-DIT)" " Agenda" GPFS Overview" Classical versus GSS I/O Solution" GPFS Storage Server (GSS)" GPFS Native RAID
Certification Document bluechip STORAGEline R54300s NAS-Server 03/06/2014. bluechip STORAGEline R54300s NAS-Server system
bluechip STORAGEline R54300s NAS-Server system Executive summary After performing all tests, the Certification Document bluechip STORAGEline R54300s NAS-Server system has been officially certified according
Data Corruption In Storage Stack - Review
Theoretical Aspects of Storage Systems Autumn 2009 Chapter 2: Double Disk Failures André Brinkmann Data Corruption in the Storage Stack What are Latent Sector Errors What is Silent Data Corruption Checksum
File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System
CS341: Operating System Lect 36: 1 st Nov 2014 Dr. A. Sahu Dept of Comp. Sc. & Engg. Indian Institute of Technology Guwahati File System & Device Drive Mass Storage Disk Structure Disk Arm Scheduling RAID
How SSDs Fit in Different Data Center Applications
How SSDs Fit in Different Data Center Applications Tahmid Rahman Senior Technical Marketing Engineer NVM Solutions Group Flash Memory Summit 2012 Santa Clara, CA 1 Agenda SSD market momentum and drivers
HP Proliant BL460c G7
HP Proliant BL460c G7 The HP Proliant BL460c G7, is a high performance, fully fault tolerant, nonstop server. It s well suited for all mid-level operations, including environments with local storage, SAN
Accelerating Server Storage Performance on Lenovo ThinkServer
Accelerating Server Storage Performance on Lenovo ThinkServer Lenovo Enterprise Product Group April 214 Copyright Lenovo 214 LENOVO PROVIDES THIS PUBLICATION AS IS WITHOUT WARRANTY OF ANY KIND, EITHER
I/O Performance of Cisco UCS M-Series Modular Servers with Cisco UCS M142 Compute Cartridges
White Paper I/O Performance of Cisco UCS M-Series Modular Servers with Cisco UCS M142 Compute Cartridges October 2015 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public.
Using Synology SSD Technology to Enhance System Performance Synology Inc.
Using Synology SSD Technology to Enhance System Performance Synology Inc. Synology_WP_ 20121112 Table of Contents Chapter 1: Enterprise Challenges and SSD Cache as Solution Enterprise Challenges... 3 SSD
HYPER-CONVERGED INFRASTRUCTURE STRATEGIES
1 HYPER-CONVERGED INFRASTRUCTURE STRATEGIES MYTH BUSTING & THE FUTURE OF WEB SCALE IT 2 ROADMAP INFORMATION DISCLAIMER EMC makes no representation and undertakes no obligations with regard to product planning
The Data Placement Challenge
The Data Placement Challenge Entire Dataset Applications Active Data Lowest $/IOP Highest throughput Lowest latency 10-20% Right Place Right Cost Right Time 100% 2 2 What s Driving the AST Discussion?
Disk Storage & Dependability
Disk Storage & Dependability Computer Organization Architectures for Embedded Computing Wednesday 19 November 14 Many slides adapted from: Computer Organization and Design, Patterson & Hennessy 4th Edition,
DELL TM PowerEdge TM T610 500 Mailbox Resiliency Exchange 2010 Storage Solution
DELL TM PowerEdge TM T610 500 Mailbox Resiliency Exchange 2010 Storage Solution Tested with: ESRP Storage Version 3.0 Tested Date: Content DELL TM PowerEdge TM T610... 1 500 Mailbox Resiliency
PowerVault MD1200/MD1220 Storage Solution Guide for Applications
PowerVault MD200/MD220 Storage Solution Guide for Applications A Dell Technical White Paper Dell PowerVault Storage Systems Joe Noyola Systems Performance Analysis Group Chuck Colburn Storage Advanced
Using Synology SSD Technology to Enhance System Performance. Based on DSM 5.2
Using Synology SSD Technology to Enhance System Performance Based on DSM 5.2 Table of Contents Chapter 1: Enterprise Challenges and SSD Cache as Solution Enterprise Challenges... 3 SSD Cache as Solution...
Optimizing LTO Backup Performance
Optimizing LTO Backup Performance July 19, 2011 Written by: Ash McCarty Contributors: Cedrick Burton Bob Dawson Vang Nguyen Richard Snook Table of Contents 1.0 Introduction... 3 2.0 Host System Configuration...
Windows 8 SMB 2.2 File Sharing Performance
Windows 8 SMB 2.2 File Sharing Performance Abstract This paper provides a preliminary analysis of the performance capabilities of the Server Message Block (SMB) 2.2 file sharing protocol with 10 gigabit
CSE-E5430 Scalable Cloud Computing P Lecture 5
CSE-E5430 Scalable Cloud Computing P Lecture 5 Keijo Heljanko Department of Computer Science School of Science Aalto University [email protected] 12.10-2015 1/34 Fault Tolerance Strategies for Storage
DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering
DELL RAID PRIMER DELL PERC RAID CONTROLLERS Joe H. Trickey III Dell Storage RAID Product Marketing John Seward Dell Storage RAID Engineering http://www.dell.com/content/topics/topic.aspx/global/products/pvaul/top
Storage Architectures for Big Data in the Cloud
Storage Architectures for Big Data in the Cloud Sam Fineberg HP Storage CT Office/ May 2013 Overview Introduction What is big data? Big Data I/O Hadoop/HDFS SAN Distributed FS Cloud Summary Research Areas
MESOS CB220. Cluster-in-a-Box. Network Storage Appliance. A Simple and Smart Way to Converged Storage with QCT MESOS CB220
MESOS CB220 Cluster-in-a-Box Network Storage Appliance A Simple and Smart Way to Converged Storage with QCT MESOS CB220 MESOS CB220 A Simple and Smart Way to Converged Storage Tailored for SMB storage
Convergence-A new keyword for IT infrastructure transformation
Convergence-A new keyword for IT infrastructure transformation www.huawei.com Derek Liu, Sr. Marketing Director Singapore, Nov. 2013 HUAWEI TECHNOLOGIES CO., LTD. Evolution of IT Infrastructure/Stack IBM/DEC/
N8103-149/150/151/160 RAID Controller. N8103-156 MegaRAID CacheCade. Feature Overview
N8103-149/150/151/160 RAID Controller N8103-156 MegaRAID CacheCade Feature Overview April 2012 Rev.1.0 NEC Corporation Contents 1 Introduction... 3 2 Types of RAID Controllers... 3 3 New Features of RAID
Integrated Grid Solutions. and Greenplum
EMC Perspective Integrated Grid Solutions from SAS, EMC Isilon and Greenplum Introduction Intensifying competitive pressure and vast growth in the capabilities of analytic computing platforms are driving
Can High-Performance Interconnects Benefit Memcached and Hadoop?
Can High-Performance Interconnects Benefit Memcached and Hadoop? D. K. Panda and Sayantan Sur Network-Based Computing Laboratory Department of Computer Science and Engineering The Ohio State University,
Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1
Performance Study Performance Characteristics of and RDM VMware ESX Server 3.0.1 VMware ESX Server offers three choices for managing disk access in a virtual machine VMware Virtual Machine File System
Hadoop: Embracing future hardware
Hadoop: Embracing future hardware Suresh Srinivas @suresh_m_s Page 1 About Me Architect & Founder at Hortonworks Long time Apache Hadoop committer and PMC member Designed and developed many key Hadoop
Building low cost disk storage with Ceph and OpenStack Swift
Background photo from: http://edelomahony.com/2011/07/25/loving-money-doesnt-bring-you-more/ Building low cost disk storage with Ceph and OpenStack Swift Paweł Woszuk, Maciej Brzeźniak TERENA TF-Storage
The Pros and Cons of Erasure Coding & Replication vs. RAID in Next-Gen Storage Platforms. Abhijith Shenoy Engineer, Hedvig Inc.
The Pros and Cons of Erasure Coding & Replication vs. RAID in Next-Gen Storage Platforms Abhijith Shenoy Engineer, Hedvig Inc. @hedviginc The need for new architectures Business innovation Time-to-market
RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1
RAID HARDWARE On board SATA RAID controller SATA RAID controller card RAID drive caddy (hot swappable) Anne Watson 1 RAID The word redundant means an unnecessary repetition. The word array means a lineup.
SSDs and RAID: What s the right strategy. Paul Goodwin VP Product Development Avant Technology
SSDs and RAID: What s the right strategy Paul Goodwin VP Product Development Avant Technology SSDs and RAID: What s the right strategy Flash Overview SSD Overview RAID overview Thoughts about Raid Strategies
Parallels Cloud Storage
Parallels Cloud Storage White Paper Best Practices for Configuring a Parallels Cloud Storage Cluster www.parallels.com Table of Contents Introduction... 3 How Parallels Cloud Storage Works... 3 Deploying
Building All-Flash Software Defined Storages for Datacenters. Ji Hyuck Yun ([email protected]) Storage Tech. Lab SK Telecom
Building All-Flash Software Defined Storages for Datacenters Ji Hyuck Yun ([email protected]) Storage Tech. Lab SK Telecom Introduction R&D Motivation Synergy between SK Telecom and SK Hynix Service & Solution
Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF
Panasas at the RCF HEPiX at SLAC Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory Centralized File Service Single, facility-wide namespace for files. Uniform, facility-wide
SOLUTION BRIEF. Resolving the VDI Storage Challenge
CLOUDBYTE ELASTISTOR QOS GUARANTEE MEETS USER REQUIREMENTS WHILE REDUCING TCO The use of VDI (Virtual Desktop Infrastructure) enables enterprises to become more agile and flexible, in tune with the needs
EMC SYMMETRIX VMAX 20K STORAGE SYSTEM
EMC SYMMETRIX VMAX 20K STORAGE SYSTEM The EMC Virtual Matrix Architecture is a new way to build storage systems that transcends the physical constraints of all existing architectures by scaling system
Certification Document macle GmbH GRAFENTHAL R2208 S2 01/04/2016. macle GmbH GRAFENTHAL R2208 S2 Storage system
macle GmbH GRAFENTHAL R2208 S2 Storage system Executive summary After performing all tests, the macle GmbH GRAFENTHAL R2208 S2 has been officially certified according to the Open-E Hardware Certification
Fusionstor NAS Enterprise Server and Microsoft Windows Storage Server 2003 competitive performance comparison
Fusionstor NAS Enterprise Server and Microsoft Windows Storage Server 2003 competitive performance comparison This white paper compares two important NAS operating systems and examines their performance.
The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000
The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000 Summary: This document describes how to analyze performance on an IBM Storwize V7000. IntelliMagic 2012 Page 1 This
HP ProLiant DL380p Gen8 1000 mailbox 2GB mailbox resiliency Exchange 2010 storage solution
Technical white paper HP ProLiant DL380p Gen8 1000 mailbox 2GB mailbox resiliency Exchange 2010 storage solution Table of contents Overview 2 Disclaimer 2 Features of the tested solution 2 Solution description
LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE. Matt Kixmoeller, Pure Storage
LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE Matt Kixmoeller, Pure Storage SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies
Achieving Higher VDI Scalability and Performance on Microsoft Hyper-V with Seagate 1200 SAS SSD Drives & Proximal Data AutoCache Software
Achieving Higher VDI Scalability and Performance on Microsoft Hyper-V with Seagate 1200 SAS SSD Drives & Proximal Data AutoCache Software Technology Paper After the advent of the Hyper-V version 3 hypervisor
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database Built up on Cisco s big data common platform architecture (CPA), a
Deploying Flash- Accelerated Hadoop with InfiniFlash from SanDisk
WHITE PAPER Deploying Flash- Accelerated Hadoop with InfiniFlash from SanDisk 951 SanDisk Drive, Milpitas, CA 95035 2015 SanDisk Corporation. All rights reserved. www.sandisk.com Table of Contents Introduction
Flash Memory Arrays Enabling the Virtualized Data Center. July 2010
Flash Memory Arrays Enabling the Virtualized Data Center July 2010 2 Flash Memory Arrays Enabling the Virtualized Data Center This White Paper describes a new product category, the flash Memory Array,
HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW
HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW 757 Maleta Lane, Suite 201 Castle Rock, CO 80108 Brett Weninger, Managing Director [email protected] Dave Smelker, Managing Principal [email protected]
Measuring Interface Latencies for SAS, Fibre Channel and iscsi
Measuring Interface Latencies for SAS, Fibre Channel and iscsi Dennis Martin Demartek President Santa Clara, CA 1 Demartek Company Overview Industry analysis with on-site test lab Lab includes servers,
Advanced Knowledge and Understanding of Industrial Data Storage
Dec. 3 rd 2013 Advanced Knowledge and Understanding of Industrial Data Storage By Jesse Chuang, Senior Software Manager, Advantech With the popularity of computers and networks, most enterprises and organizations
Scientific Computing Data Management Visions
Scientific Computing Data Management Visions ELI-Tango Workshop Szeged, 24-25 February 2015 Péter Szász Group Leader Scientific Computing Group ELI-ALPS Scientific Computing Group Responsibilities Data
Protect Data... in the Cloud
QUASICOM Private Cloud Backups with ExaGrid Deduplication Disk Arrays Martin Lui Senior Solution Consultant Quasicom Systems Limited Protect Data...... in the Cloud 1 Mobile Computing Users work with their
Q & A From Hitachi Data Systems WebTech Presentation:
Q & A From Hitachi Data Systems WebTech Presentation: RAID Concepts 1. Is the chunk size the same for all Hitachi Data Systems storage systems, i.e., Adaptable Modular Systems, Network Storage Controller,
Understanding Flash SSD Performance
Understanding Flash SSD Performance Douglas Dumitru CTO EasyCo LLC August 16, 2007 DRAFT Flash based Solid State Drives are quickly becoming popular in a wide variety of applications. Most people think
Design and Evolution of the Apache Hadoop File System(HDFS)
Design and Evolution of the Apache Hadoop File System(HDFS) Dhruba Borthakur Engineer@Facebook Committer@Apache HDFS SDC, Sept 19 2011 Outline Introduction Yet another file-system, why? Goals of Hadoop
High-Availability and Scalable Cluster-in-a-Box HPC Storage Solution
Intel Solutions Reference Architecture High-Availability and Scalable Cluster-in-a-Box HPC Storage Solution Using RAIDIX Storage Software Integrated with Intel Enterprise Edition for Lustre* Audience and
Accelerating and Simplifying Apache
Accelerating and Simplifying Apache Hadoop with Panasas ActiveStor White paper NOvember 2012 1.888.PANASAS www.panasas.com Executive Overview The technology requirements for big data vary significantly
The Design and Implementation of the Zetta Storage Service. October 27, 2009
The Design and Implementation of the Zetta Storage Service October 27, 2009 Zetta s Mission Simplify Enterprise Storage Zetta delivers enterprise-grade storage as a service for IT professionals needing
Investigation of storage options for scientific computing on Grid and Cloud facilities
Investigation of storage options for scientific computing on Grid and Cloud facilities Overview Context Test Bed Lustre Evaluation Standard benchmarks Application-based benchmark HEPiX Storage Group report
RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29
RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f. one large disk) Parallelism improves performance Plus extra disk(s) for redundant data storage Provides fault tolerant
Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology
Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology Evaluation report prepared under contract with NetApp Introduction As flash storage options proliferate and become accepted in the enterprise,
Solid State Storage in the Evolution of the Data Center
Solid State Storage in the Evolution of the Data Center Trends and Opportunities Bruce Moxon CTO, Systems and Solutions stec Presented at the Lazard Capital Markets Solid State Storage Day New York, June
Hadoop Architecture. Part 1
Hadoop Architecture Part 1 Node, Rack and Cluster: A node is simply a computer, typically non-enterprise, commodity hardware for nodes that contain data. Consider we have Node 1.Then we can add more nodes,
