Seagate Lustre Update. Peter Bojanic 2015-04-13


Seagate Cloud Systems and Solutions
Delivering next-generation workloads with Intelligent Information Infrastructure.
Business areas: OEM, HPC, Scale-Out Infrastructure Systems, and Cloud Services.
Points listed on the slide:
2+ million enclosures
17+ petabytes shipped
Drive variety (HDD, SAS, SATA, SSD, hybrid)
Enclosures, controllers
Customer-driven partnership
Services: logistics, fulfillment, warranty, design, supply chain
40% fewer racks required
>1TB/sec file system performance
Validated architectures for open source and software-defined storage
Private cloud appliances for backup and recovery
Modular, scalable components for DIY customers
Backup as a service
Disaster recovery as a service
Archive as a service
Endpoint backup
Managed services

ClusterStor Product Line Overview
ClusterStor 1500: 1GB/sec increments, 1 to 110GB/sec, up to 7PB, standard rack
ClusterStor 6000: 6GB/sec increments, up to 1TB/sec, up to 25PB, custom rack
ClusterStor 9000: 9GB/sec increments, up to 1TB/sec+, up to 25PB+, custom rack
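The increment figures above lend themselves to a quick sizing estimate. The following is a minimal sketch, not a vendor sizing rule: it assumes throughput scales linearly with the number of increments and that 1TB/sec means 1000GB/sec. The function name `increments_needed` and the lookup table are hypothetical helpers introduced only for illustration.

```python
import math

# Throughput increment per model in GB/sec, taken from the product line overview.
INCREMENT_GB_PER_SEC = {
    "ClusterStor 1500": 1,
    "ClusterStor 6000": 6,
    "ClusterStor 9000": 9,
}


def increments_needed(model: str, target_gb_per_sec: float) -> int:
    """Return how many throughput increments reach the target.

    Assumption: throughput grows linearly with the number of increments.
    Real systems are also bounded by network, capacity, and rack limits.
    """
    if target_gb_per_sec <= 0:
        raise ValueError("target throughput must be positive")
    return math.ceil(target_gb_per_sec / INCREMENT_GB_PER_SEC[model])


if __name__ == "__main__":
    # Estimate the increments needed for a 1TB/sec (1000GB/sec) file system.
    for model in INCREMENT_GB_PER_SEC:
        print(model, increments_needed(model, 1000))
```

Under these assumptions, a 1TB/sec target works out to 112 increments of 9GB/sec on a ClusterStor 9000 and 167 increments of 6GB/sec on a ClusterStor 6000.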

ClusterStor Engineered End-to-End Solution
Creating and supporting your own Lustre solution with high availability and productivity is challenging. Seagate is the only vendor to provide a complete end-to-end Lustre solution:
High-density disk enclosures
Storage Bay Bridge Object Storage Servers
Dedicated Lustre software development, integration, and support team
System administration and management CLI and GUI
Manufacturing integration and test validation
Support and professional services
Storage media (HDD, SSD, PCIe)

Vertical Markets: Weather, Healthcare, Energy, Finance, Pharmacology, Engineering, Academic, Defense

Top 5 Misconceptions about Seagate ClusterStor: Dispelling Some Myths
1. ClusterStor uses an ancient version of Lustre
2. ClusterStor does not use an open source compatible version of Lustre
3. ClusterStor is a proprietary, closed solution
4. Seagate cannot support Lustre
5. Seagate does not contribute to Lustre

Lustre 2.5 Distributed Namespace (DNE)
DNE is a hardware option with ClusterStor v2.0 or greater.
Same base rack as ClusterStor v1.x, with an active/passive MDS for MDT0.
Additional Directory Units (ADUs) are configured with active/active MDT pairs.
Scale up to 8 ADUs, for up to 16 additional MDTs.
1.885 billion additional files per ADU.
Achieve up to 280K file creates per second and 15.984 billion files.
Integrated management, monitoring, and serviceability.
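The DNE scaling figures can be checked with a little arithmetic. The sketch below is illustrative only: the per-ADU and maximum figures come from the slide, but the base-rack (MDT0) capacity is not stated there and is inferred here as the remainder, so treat `BASE_FILES_M` as an assumption. The function names are hypothetical.

```python
# All file counts are in millions, so the arithmetic stays in exact integers.
FILES_PER_ADU_M = 1_885      # 1.885 billion additional files per ADU (from the slide)
MAX_ADUS = 8                 # scale up to 8 ADUs (from the slide)
MDTS_PER_ADU = 2             # one active/active MDT pair per ADU (from the slide)
MAX_TOTAL_FILES_M = 15_984   # 15.984 billion files at full scale (from the slide)

# Assumption, not stated on the slide: files not accounted for by the ADUs
# are attributed to MDT0 in the base rack.
BASE_FILES_M = MAX_TOTAL_FILES_M - MAX_ADUS * FILES_PER_ADU_M


def additional_mdts(adus: int) -> int:
    """Return the number of extra MDTs provided by the given number of ADUs."""
    if not 0 <= adus <= MAX_ADUS:
        raise ValueError(f"ADU count must be between 0 and {MAX_ADUS}")
    return adus * MDTS_PER_ADU


def total_files_millions(adus: int) -> int:
    """Return the estimated file capacity, in millions, for a given ADU count."""
    if not 0 <= adus <= MAX_ADUS:
        raise ValueError(f"ADU count must be between 0 and {MAX_ADUS}")
    return BASE_FILES_M + adus * FILES_PER_ADU_M


if __name__ == "__main__":
    for adus in (0, 4, 8):
        print(adus, additional_mdts(adus), total_files_millions(adus))
```

With 8 ADUs the ADUs alone account for 15.08 billion files, which leaves roughly 0.904 billion for the base rack if the slide's 15.984 billion total is taken at face value.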

ClusterStor Lustre HSM Partners: Cray TAS and SGI DMF
ClusterStor v2.0 or greater with Lustre 2.5 is HSM ready.
Partners supply the additional HSM gateway hardware and the back-end archive engine.

Seagate Lustre
Seagate Lustre 2.5.1 is focused on production quality: 3541 hours of Lustre testing per month.
Seagate's Lustre distribution incorporates hundreds of stability and performance improvements.
Contributions from Intel, Seagate, and the community are curated to improve the quality of Lustre beyond the original OpenSFS release baseline.
Seagate's Lustre 2.5 distribution is engineered to operate reliably at scale in production environments.
[Diagram: Seagate developers, Intel developers, and other community developers contribute to the Lustre community development branch. The Seagate ClusterStor development branch takes its Neo Lustre baselines from that branch: Neo 1.x from Lustre 2.1, with 336 patches (bug fixes) ported, and Neo 2.x from Lustre 2.5, with 868 patches (bug fixes) ported.]

Seagate Participation in the Lustre Community
Seagate roles in the Lustre community:
OpenSFS Board Member and EOFS Member. Direct funding of, and contributions to, Lustre tree and roadmap development. Leadership and participation in working groups and PACs.
Active contributor to Lustre. Developed and provided over 292 patches. Liberator of the Lustre trademark and IP.
Integration into Lustre ClusterStor. Based on the community Lustre tree, with stabilization efforts contributed back. Proven at scale on Blue Waters and at sites around the world.

Seagate Financial Investment in Lustre
Seagate has been an OpenSFS Promoter member for 4 consecutive years.
Seagate has invested $2.3M in Lustre development and tree maintenance through contributions to OpenSFS.
This is above and beyond Seagate's considerable investment (in millions) to maintain its own internal Lustre development and support capability and capacity.
[Chart: Seagate Financial Contributions to OpenSFS, by year from 2011 to 2015, on a scale of $0 to $600,000]

Lustre.org Donated to the Lustre Community
Lustre.org will be jointly managed by the Open Scalable File Systems, Inc. (OpenSFS) and the European Open File System (EOFS) as trusted stewards of the community.
[Photo, R->L: Galen Shipman, OpenSFS; Ken Claffey, Seagate; Hugo Falter, EOFS]
"...this move is a great indication that Seagate intends to be an active, contributing member of the Lustre community." (Rich Brueckner, Inside HPC)
"We wanted to send out a special thank you to Seagate for showing such a strong commitment to Lustre...They continue to show their excitement and commitment to Lustre with concrete moves like this..." (OpenSFS)

Seagate Contributes Hadoop on Lustre Connector
Improves I/O performance and eliminates HDFS pre-staging overhead.
Improves workflow efficiency for users of both Lustre and Hadoop by eliminating the need to copy data into HDFS before running Apache Hadoop jobs.
Provides an alternative to Hadoop's reliance on the HDFS file system.
Enables Hadoop ecosystem tools such as Mahout, Hive, and Pig to take advantage of the Lustre file system.
[Chart: TeraSort benchmark, HDFS vs. Hadoop Workflow Accelerator]

Seagate Open Sources Xperior: Automated Build and Test Platform for Lustre
Used by the Seagate Engineering and System Test groups for Lustre test automation:
Test management
Execution control
Test result reporting
Flexibility for extensions: supports the addition of new test models for new Lustre releases via wrapper functions
Next steps:
Connect directly to the community Jenkins repository
Scale testing on Amazon Web Services (AWS)

Support for Lustre and Beyond
The World's Leading Data Services Company
HD, flash, silicon, hybrid, solutions
Over 37 years of services: with a satisfaction rating above 95%, we have proven that we are the place to go for data recovery.
Over 18 years of services: founded in 1997 as the original cloud backup and archive services company.
Over 23 years of services: with locations in 15 countries, a third of our worldwide staff belong to our technical teams.
Over 21 years of services: services expertise in both global data storage and networking technologies.

Seagate Powers 4 of 5 1+ TB/sec File Systems
Supported 100% by Seagate and our partners:
1TB/sec, 20PB Lustre file system
1.7TB/sec, 80PB Lustre file system
1TB/sec, 18PB Lustre file system
1TB/sec, 22PB Lustre file system

Seagate Collaborative Research Efforts
Results for collective I/O performance optimization through E10 work in the DEEP-ER project.
New E10 hints infrastructure built on top of MPI-IO.
funded initiative which will target E10 work
a "Marie Curie Initial Training Network" program in Europe which funds PhD students.