Lustre * Filesystem for Cloud and Hadoop *

Similar documents
Performance Comparison of SQL based Big Data Analytics with Lustre and HDFS file systems

Performance Comparison of Intel Enterprise Edition for Lustre* software and HDFS for MapReduce Applications

Developing High-Performance, Scalable, cost effective storage solutions with Intel Cloud Edition Lustre* and Amazon Web Services

Hadoop MapReduce over Lustre* High Performance Data Division Omkar Kulkarni April 16, 2013

Amazon EC2 Product Details Page 1 of 5

Hadoop* on Lustre* Liu Ying High Performance Data Division, Intel Corporation

MapReduce and Lustre * : Running Hadoop * in a High Performance Computing Environment

Cloud Bursting with SLURM and Bright Cluster Manager. Martijn de Vries CTO

A Shared File System on SAS Grid Manger in a Cloud Environment

A Shared File System on SAS Grid Manager * in a Cloud Environment

Scalable Architecture on Amazon AWS Cloud

High Performance Computing OpenStack Options. September 22, 2015

Alfresco Enterprise on AWS: Reference Architecture

New Storage System Solutions

An Alternative Storage Solution for MapReduce. Eric Lomascolo Director, Solutions Marketing

Quick Reference Selling Guide for Intel Lustre Solutions Overview

February, 2015 Bill Loewe

Dell Reference Configuration for Hortonworks Data Platform

Scala Storage Scale-Out Clustered Storage White Paper

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Maximizing Hadoop Performance and Storage Capacity with AltraHD TM

Experiences with Lustre* and Hadoop*

CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop)

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Hadoop & Spark Using Amazon EMR

Hadoop IST 734 SS CHUNG

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Performance Analysis of Mixed Distributed Filesystem Workloads

Deploying Hadoop on Lustre Sto rage:

THE HADOOP DISTRIBUTED FILE SYSTEM

Hadoop Cluster Applications

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging

Aspera Direct-to-Cloud Storage WHITE PAPER

Commoditisation of the High-End Research Storage Market with the Dell MD3460 & Intel Enterprise Edition Lustre

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

Lustre* is designed to achieve the maximum performance and scalability for POSIX applications that need outstanding streamed I/O.

Lustre failover experience

Unstructured Data Accelerator (UDA) Author: Motti Beck, Mellanox Technologies Date: March 27, 2012

EOFS Workshop Paris Sept, Lustre at exascale. Eric Barton. CTO Whamcloud, Inc Whamcloud, Inc.

Ali Ghodsi Head of PM and Engineering Databricks

Implement Hadoop jobs to extract business value from large and varied data sets

Lustre Monitoring with OpenTSDB

Energy Efficient MapReduce

Analysis and Optimization of Massive Data Processing on High Performance Computing Architecture

Evaluating MapReduce and Hadoop for Science

Pilot-Streaming: Design Considerations for a Stream Processing Framework for High- Performance Computing

Lecture 5: GFS & HDFS! Claudia Hauff (Web Information Systems)! ti2736b-ewi@tudelft.nl

Design and Evolution of the Apache Hadoop File System(HDFS)

Hadoop Architecture. Part 1

NetApp High-Performance Computing Solution for Lustre: Solution Guide

HADOOP BIG DATA DEVELOPER TRAINING AGENDA

HPCHadoop: MapReduce on Cray X-series

PARALLELS CLOUD STORAGE

Mambo Running Analytics on Enterprise Storage

Enabling High performance Big Data platform with RDMA

Investigation of storage options for scientific computing on Grid and Cloud facilities

DEPLOYING AND MONITORING HADOOP MAP-REDUCE ANALYTICS ON SINGLE-CHIP CLOUD COMPUTER

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

HDFS Space Consolidation

Hadoop Distributed File System. T Seminar On Multimedia Eero Kurkela

Mixing Hadoop and HPC Workloads on Parallel Filesystems

Financial Services Grid Computing on Amazon Web Services January 2013 Ian Meyers

Chapter 7. Using Hadoop Cluster and MapReduce

High Availability and Backup Strategies for the Lustre MDS Server

Take An Internal Look at Hadoop. Hairong Kuang Grid Team, Yahoo! Inc

TRAINING PROGRAM ON BIGDATA/HADOOP

Reduction of Data at Namenode in HDFS using harballing Technique

Hadoop on the Gordon Data Intensive Cluster

Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7

Performance measurement of a Hadoop Cluster

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

Big Data Analytics - Accelerated. stream-horizon.com

CSE-E5430 Scalable Cloud Computing Lecture 2

Big Fast Data Hadoop acceleration with Flash. June 2013

A Framework for Performance Analysis and Tuning in Hadoop Based Clusters

Lustre & Cluster. - monitoring the whole thing Erich Focht

Accelerating and Simplifying Apache

Testing Big data is one of the biggest

Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components

The Hadoop Distributed File System

HiBench Introduction. Carson Wang Software & Services Group

COURSE CONTENT Big Data and Hadoop Training

IBM General Parallel File System (GPFS ) 3.5 File Placement Optimizer (FPO)

Understanding Hadoop Performance on Lustre

GraySort on Apache Spark by Databricks

Storage Architectures for Big Data in the Cloud

A Brief Introduction to Apache Tez

Workshop on Parallel and Distributed Scientific and Engineering Computing, Shanghai, 25 May 2012

Application Performance for High Performance Computing Environments

POSIX and Object Distributed Storage Systems

Can High-Performance Interconnects Benefit Memcached and Hadoop?

Amazon Elastic Beanstalk

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Hadoop Job Oriented Training Agenda

Big Data Course Highlights

Big Data on Microsoft Platform

The Comprehensive Performance Rating for Hadoop Clusters on Cloud Computing Platform

Architecting a High Performance Storage System

Transcription:

OpenFabrics Software User Group Workshop Lustre * Filesystem for Cloud and Hadoop * Robert Read, Intel

Lustre * for Cloud and Hadoop * Brief Lustre History and Overview Using Lustre with Hadoop Intel Cloud Edition for Lustre March 15 18, 2015 #OFADevWorkshop 2

What is Lustre * High performance parallel filesystem for Linux environments Designed to use RDMA on high performance fabrics Allows large number of users to share a file system High speed and low latencies Across local or wide area networks Designed for reliable storage Ideal for streaming IO and large, shared datasets Evolving to bring parallel filesystem to new workloads March 15 18, 2015 #OFADevWorkshop 3

Quick History Originally built to solve next gen IO problems for HPC An open source (GPL) project from the beginning Core team has survived numerous transitions and is now safely established in Intel s HPDD. Used in production systems since 2002 Intel Enterprise Edition for Lustre * since 2013 Intel Cloud Edition now available on AWS Marketplace March 15 18, 2015 #OFADevWorkshop 4

Lustre * Storage Components Management Target Lustre mount service Initial point of contact for Clients MGS MGT Metadata Targets Namespace of file system File layouts, no data Scalable MDS MDT Object Storage Targets File content stored as objects Striped across multiple targets Scales to 100s OST 5

Hadoop * Introduction Open source framework for data-intensive computing Parallelism hidden by framework Highly scalable: can be applied to large datasets (Big Data) and run on commodity clusters Comes with its own user-space distributed file system (HDFS) based on the local storage of cluster nodes 6

Hadoop * with HDFS HDFS locality has advantages, but HDFS requires import/ export to share data Compute nodes require local storage Hadoop nodes are both compute and IO nodes Hadoop Cluster HPC Cluster HDFS Import / Export Lustre Filesystem Hadoop nodes are single-purpose March 15 18, 2015 #OFADevWorkshop 7

Hadoop * Adapter for Lustre * Shared data repository for all compute resources Use data in place (no import/export) Dedicated Compute and IO nodes Hadoop Cluster HPC Cluster Lustre Filesystem March 15 18, 2015 #OFADevWorkshop 8

HPC Adapter for MapReduce Run Hadoop * on HPC cluster Replace YARN with Slurm Single compute cluster used for variety of workloads Hadoop HPC Cluster Lustre Filesystem March 15 18, 2015 #OFADevWorkshop 9

Optimized Shuffle: HDFS vs Lustre * Input Spill File : Reducer à N : 1 LUSTRE (Global Namespace) Output Spill Map Sort Merge Shuffle Merge Reduce Spill Repetitive Local Disk Local Disk Map O/P Map O/P Map O/P Map O/P Map O/P Input HDFS Output MAP REDUCE

Performance Comparison Tata Consultancy Services performed comparison in 2014 Performance comparison of Lustre* and HDFS for MR implementation of FSI workload using HDDP cluster hosted in the Intel BigData Lab in Swindon (UK) and Intel Enterprise Edition for Lustre* software Intel EE for Lustre = 3 X HDFS for single job Intel EE for Lustre = 5.5 X HDFS for concurrent workload http://insidebigdata.com/2014/09/29/performance-comparison-intel-enterpriseedition-lustre-hdfs-mapreduce/ March 15 18, 2015 #OFADevWorkshop 11

Degree of Concurrency = 1 Intel EE for Lustre* = 3 X HDFS for optimal SC settings* http://www.eofs.eu/fileadmin/lad2014/slides/21_rekha_singhal_lad2014_lustre_vs_hdfs_mr.pdf March 15 18, 2015 #OFADevWorkshop 12 23

Degree of Concurrency = 1 Intel EE for Lustre* optimal SC gives 70% improvement over HDFS http://www.eofs.eu/fileadmin/lad2014/slides/21_rekha_singhal_lad2014_lustre_vs_hdfs_mr.pdf March 15 18, 2015 #OFADevWorkshop 13 24

Intel Cloud Edition for Lustre * Three Intel Lustre AMIs available on AWS Marketplace*: Community Version Global Support Global Support HVM Lustre v2.5.3 Automated Lustre configuration Lustre monitoring with Ganglia and LMT/ltop Automated S3 data import Global Support includes additional capabilities: Intel Lustre Support High Availability(requires VPC) Enhanced Networking (HVM + VPC) Higher performance instance types 14

Automated Lustre * Deployment 1 MGS MDS 2 4 3 1 CloudFormation creates a stack of AWS resources from a template 2 MGS Initializes itself CloudFormation 5 Dynamo DB 3 MGS updates DB with NID 5 5 4 5 MDS formats MDT, registers with MGS, updates DB. s format local targets, updates DB 15

Using Virtual Private Cloud VPC Compute Subnet Compute Compute Compute Compute AWS APIs Public Subnet Lustre* Private Subnet ELB MGS & Console MDS Worker Client Public Network NAT March 15 18, 2015 #OFADevWorkshop 16

Automated S3 Import Option to import an S3 bucket into a new Lustre * filesystem Initially only the file metadata is imported File contents are retrieved from S3 on demand and stored on Lustre March 15 18, 2015 #OFADevWorkshop 17

Lustre * HA in the cloud HA template is available Storage is managed independently of instances Failure scenario AWS AutoScaling detects and terminates a failed instance AutoScaling creates a new instance New instance identifies orphaned storage and network interface Adopts the storage and NIC Target is brought back online and recovers March 15 18, 2015 #OFADevWorkshop 18

ICE-L Monitoring tools 19

Large File Benchmark Using IOR benchmark Comparing 3 Lustre cluster configurations Increase the number of s 4 8 16 Configurations of MGS and MDS are fixed 1-32 clients RAID0 8x 40GB Standard 8x 100GB Standard MGS m1.medium MDS EBS Optimized m3.2xlarge EBS Optimized m3.2xlarge Client m3.2xlarge 94 MB/sec 110 MB/sec 110 MB/sec 110 MB/sec 20

IOR Sequential Read FPP MB/sec 1600 1400 1200 Close to the network 1000 4 800 8 s network bottleneck 16 600 400 200 0 s network bottleneck Client s network bottleneck N. Clients 1 2 4 8 16 32 21

IOR Sequential Write FPP MB/sec 1600 1400 Ops. 1200 1000 800 600 4 8 s network bottleneck 16 400 200 0 s network bottleneck Client s network bottleneck N. Clients 1 2 4 8 16 32 22

Aggregate Performance During Run LMT used to record the OST metrics during the IOR run. With a simple python script we create this graph: aggregate performance vs time to analyze the problem. Long tail 1920 MB/ sec time 23

MDTEST on 16 Cluster Configuration MOPS 30000 25000 20000 Directory crea9on Directory stat Directory removal 15000 File crea9on File stat File removal 10000 Tree crea9on Tree removal 5000 0 32 64 128 256 N. Threads/32 clients 24

#OFSUserGroup Thank You OpenFabrics Software User Group Workshop