Linux Performance Optimizations for Big Data Environments




Dominique A. Heger, Ph.D.
DHTechnologies (Performance, Capacity, Scalability) www.dhtusa.com
Data Nubes (Big Data, Hadoop, ML) www.datanubes.com

Performance & Capacity Studies
Availability & Reliability Studies
Systems Modeling
Scalability & Speedup Studies
Linux & UNIX Internals
Design, Architecture & Feasibility Studies
Systems Stress-Testing & Benchmarking
Cloud Computing
Research, Education & Training
Machine Learning
Operations Research
BI, Data Analytics, Data Mining, Predictive Analytics
Hadoop Ecosystem & MapReduce
www.dhtusa.com

Agenda
  Linux & Big Data (Hadoop Ecosystem)
  Performance Management Methodology
  Linux 3.x Task & I/O Framework
  Quantifying Linux & Application Performance
  Q&A

Linux Engineers: Big Demand & Small Talent Pool
  Big Data, the Hadoop ecosystem, and cloud computing in general are powered by Linux.
  91.4% of the top 500 supercomputers are Linux-based (source: TOP500, 2012).
  Linux talent is needed now.
  A 2013 job report compiled by Dice showed that 93% of the contacted US companies (850 firms) are hiring Linux professionals this year (2013).
  In the same study, 90% of the firms stated that it is currently very difficult to find Linux talent in the US, up from 80% in the 2012 study.
  According to Dice, the average salary increase for a Linux professional in the US is approximately 9% this year, while the average IT salary increase in the US is approximately 5%.

Hadoop Ecosystem (Partial View)
  Diagram labels: Twitter Real-Time Processing; Data Handlers; Data Serialization System; Configuration Management Tools; KAFKA Distributed Messaging System; Schedulers; RDBMS Database & No

Hadoop Linux Interaction
  Hadoop: Language Abstraction, Java API, MapReduce Framework (*), Hadoop Distributed Filesystem
  Linux OS: OS & Local FS
  Node HW: HW Components
  (*) Some Hadoop projects bypass MapReduce

Hadoop MR2 Environment

Performance Management - Building Blocks
  Phase 1: Understand Goals & Objectives
  Phases 2 and 3: HW Profiles, Workload Profiles, Application & OS Traces, Data Post-Processing
  Phase 4: Performance Study, CSA Study
  Phase 5: Capacity Study, Scalability Study, Speedup Study

Performance Evaluation - Goals & Objectives
  Identify bottlenecks, predict future capacity shortcomings, and determine the most adequate (cost-effective) way to configure, tune, and optimize computing environments to overcome performance problems and cope with increasing application workload demands.
  The approach combines analytical, simulation, and empirical studies that draw on tracing techniques, HW profiles, actual application workload profiles, application log files, and performance data collected in either a lab or a production environment.
  If no empirical data is available, performance budgets are used (PE).

Application Centric Systems Analysis
  System Hierarchy: Application Abstraction, Operating System Abstraction, Hardware Abstraction
  Performance Hierarchy: Application Vector; Performance Code Path - Application to OS Interface; Performance Code Path - High to Low Level OS Interface; Performance Code Path - OS to HW Interface; OS Vector
  Application Primitives, High-Level OS Primitives, Low-Level OS Primitives, Hardware Primitives
  Process/Thread Monitors; Application Trace Tools & Macro Benchmarks; OS Performance Tools & Micro Benchmarks

Linux & Hadoop Tools & Techniques
Linux Performance Evaluation Tools (Code Path Analysis)
  strace
  nmon, blktrace, blkparse, btt, blkiomon, iostat
  perf
  valgrind, kcachegrind
Workload Generators (Macro & Micro Benchmarks)
  DHTUX toolset (Unix, Linux 46 systems benchmarks)
  TeraSort (Hadoop)
  K-Means Clustering (Hadoop)
  Bayesian Classification (Hadoop)

Performance by the Numbers (Ballpark Figures)

  Operation                                 Latency
  L1 cache reference                        1 ns
  TLB miss                                  4 ns
  Branch misprediction                      5 ns
  L2 cache reference                        7 ns
  Mutex lock/unlock                         25 ns
  Main memory reference                     100 ns
  Compress 1 Kbytes with Zippy              3,000 ns
  Send 2 Kbytes over 1 Gbps network         20,000 ns
  Read 1 MB sequentially from memory        250,000 ns
  Round trip within same datacenter         500,000 ns
  Disk seek (HD)                            10,000,000 ns
  Read 1 MB sequentially from disk (HD)     20,000,000 ns
  Send packet CA->UK->CA                    150,000,000 ns

Execute Micro & Macro Benchmarks to Baseline the HW and the OS
Hadoop MapReduce: With large-scale projects, the performance focus is on disk and interconnect/network performance rather than on the CPU and the DRAM subsystems
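To make the gap between the subsystems concrete, the ballpark figures above can be turned into ratios and implied throughputs. The following is a minimal Python sketch; the dictionary simply restates the slide's numbers, and the helper names (`ratio`, `sequential_mb_per_s`) are hypothetical, introduced only for this illustration.

```python
# Ballpark latencies in nanoseconds, restated from the figures above.
LATENCY_NS = {
    "L1 cache reference": 1,
    "Main memory reference": 100,
    "Read 1MB sequentially from memory": 250_000,
    "Round trip within same datacenter": 500_000,
    "Disk seek (HD)": 10_000_000,
    "Read 1MB sequentially from disk (HD)": 20_000_000,
}


def ratio(slow, fast):
    """How many times longer the 'slow' operation takes than the 'fast' one."""
    return LATENCY_NS[slow] / LATENCY_NS[fast]


def sequential_mb_per_s(operation):
    """Throughput implied by moving 1 MB in LATENCY_NS[operation] nanoseconds."""
    return 1e9 / LATENCY_NS[operation]


if __name__ == "__main__":
    disk = "Read 1MB sequentially from disk (HD)"
    memory = "Read 1MB sequentially from memory"
    print("disk vs. memory, 1 MB sequential read:", ratio(disk, memory))
    print("disk seek vs. main memory reference:",
          ratio("Disk seek (HD)", "Main memory reference"))
    print("implied disk throughput (MB/s):", sequential_mb_per_s(disk))
    print("implied memory throughput (MB/s):", sequential_mb_per_s(memory))
```

With these figures, a sequential 1 MB read from disk is 80 times slower than the same read from memory, which is consistent with the slide's point that disk and network dominate at scale.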

Micro & Macro Benchmarks
  Benchmarking & Stress-Testing the HW & the OS prior to deploying the Cluster Nodes
  Establish a Sound Performance Baseline
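As an illustration of what a very small micro benchmark might look like, the Python sketch below times a sequential write and a sequential read of a temporary file. It is not part of the DHTUX toolset or any tool named in this deck; the function name, the default sizes, and the returned keys are assumptions made for the example.

```python
import os
import tempfile
import time


def sequential_io_baseline(size_mb=64, block_kb=1024, directory=None):
    """Time a sequential write and read of a temporary file.

    Hypothetical helper for illustration only. Returns MB/s figures.
    """
    block = os.urandom(block_kb * 1024)          # incompressible payload
    blocks = max(1, (size_mb * 1024) // block_kb)
    written_mb = blocks * block_kb / 1024

    fd, path = tempfile.mkstemp(dir=directory)
    try:
        start = time.perf_counter()
        with os.fdopen(fd, "wb") as f:
            for _ in range(blocks):
                f.write(block)
            f.flush()
            os.fsync(f.fileno())                 # force data to the device
        write_s = time.perf_counter() - start

        # Note: this read is likely served from the page cache, so it
        # measures memory rather than disk unless caches are dropped first.
        start = time.perf_counter()
        with open(path, "rb") as f:
            while f.read(len(block)):
                pass
        read_s = time.perf_counter() - start
    finally:
        os.remove(path)

    return {
        "write_mb_s": written_mb / write_s,
        "read_mb_s": written_mb / read_s,
    }


if __name__ == "__main__":
    print(sequential_io_baseline(size_mb=16))
```

Running the same sketch on every node before deployment, against the data directories the cluster will use, gives a crude per-node baseline to compare against later measurements. A real baseline would use a dedicated benchmark and control for caching.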

Application (User Space) -> Linux I/O Requests -> File System Layer -> Linux bio Layer -> Linux enqueue Function -> I/O Scheduler (I/O Task Queue) -> Linux dequeue Function -> Device Driver -> Disk/RAID/SAN Subsystem

Linux 3.x IO Schedulers
  CFQ (default): synchronous versus asynchronous requests, IO priority, read favored over write requests, time-out value
  noop: unordered FIFO queue, only merging, good for environments where IO is optimized at a lower level
  Deadline: 5 IO queues, reorder requests, deadline value, read favored over write requests
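On Linux, the scheduler in use for a block device is exposed through sysfs at /sys/block/<device>/queue/scheduler, with the active one shown in square brackets. The Python sketch below reads and parses that file; the function names are hypothetical and the device name `sda` is only an example.

```python
from pathlib import Path


def parse_scheduler_line(line):
    """Parse a sysfs scheduler line such as 'noop deadline [cfq]'.

    Returns (active, available); the active scheduler is the bracketed name.
    """
    active = None
    available = []
    for name in line.split():
        if name.startswith("[") and name.endswith("]"):
            name = name[1:-1]
            active = name
        available.append(name)
    return active, available


def read_scheduler(device):
    """Read the scheduler setting for a block device, e.g. 'sda' (example name)."""
    path = Path("/sys/block") / device / "queue" / "scheduler"
    return parse_scheduler_line(path.read_text())


if __name__ == "__main__":
    print(parse_scheduler_line("noop deadline [cfq]"))
```

Writing a scheduler name into the same sysfs file as root switches the scheduler for that device at runtime, which makes it straightforward to rerun a benchmark under each scheduler and compare results.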

Application Layer - strace
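As a rough illustration of application-layer analysis, `strace -c` prints a per-system-call summary (time share, seconds, calls, errors) when the traced command exits. The Python sketch below parses such a summary and ranks system calls by time. The sample text is invented for the example and is not output from the original presentation; the function name is hypothetical.

```python
# Invented sample in the layout printed by `strace -c`; not real measurements.
SAMPLE = """\
% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 60.00    0.000300           3       100           read
 40.00    0.000200          20        10         2 open
------ ----------- ----------- --------- --------- ----------------
100.00    0.000500                   110         2 total
"""


def parse_strace_summary(text):
    """Return per-syscall rows from an `strace -c` summary, slowest first."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 5 or fields[-1] == "total":
            continue
        try:
            seconds = float(fields[1])
            usecs_per_call = int(fields[2])
            calls = int(fields[3])
        except ValueError:
            continue  # header or separator line
        # The errors column is blank when a syscall never failed.
        errors = int(fields[4]) if len(fields) == 6 else 0
        rows.append({
            "syscall": fields[-1],
            "seconds": seconds,
            "usecs_per_call": usecs_per_call,
            "calls": calls,
            "errors": errors,
        })
    return sorted(rows, key=lambda row: row["seconds"], reverse=True)


if __name__ == "__main__":
    for row in parse_strace_summary(SAMPLE):
        print(row)
```

A summary like this is a quick first pass; the full trace is still needed to see which files or sockets the expensive calls touch.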

Kernel Layer - blktrace/blkparse
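A common way to use these tools is to pipe live trace data from blktrace into blkparse, for example `blktrace -d /dev/sda -o - | blkparse -i -` (the device name is only an example). The Python sketch below estimates dispatch-to-completion (D2C) latency from blkparse's default text output by pairing D and C events on device and start sector. The sample lines are invented, the function name is hypothetical, and the pairing is a simplification that ignores merged or split requests; btt performs this analysis properly.

```python
# Invented sample lines in blkparse's default layout:
# device, CPU, sequence, timestamp, PID, action, RWBS, sector + blocks, [process]
SAMPLE = [
    "  8,0    3        1     0.000000000   697  Q   W 223490 + 8 [kjournald]",
    "  8,0    3        2     0.000010000   697  D   W 223490 + 8 [kjournald]",
    "  8,0    3        3     0.000510000     0  C   W 223490 + 8 [0]",
]


def d2c_latencies(lines):
    """Return dispatch-to-completion latencies in seconds.

    Simplified: a completion is matched to the last dispatch seen for the
    same device and start sector.
    """
    dispatched = {}
    latencies = []
    for line in lines:
        fields = line.split()
        if len(fields) < 10 or fields[8] != "+":
            continue  # not a per-request event line
        try:
            timestamp = float(fields[3])
            sector = int(fields[7])
        except ValueError:
            continue
        key = (fields[0], sector)
        action = fields[5]
        if action == "D":
            dispatched[key] = timestamp
        elif action == "C" and key in dispatched:
            latencies.append(timestamp - dispatched.pop(key))
    return latencies


if __name__ == "__main__":
    for latency in d2c_latencies(SAMPLE):
        print(f"D2C: {latency * 1e6:.1f} us")
```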

Kernel Layer - blktrace - Summary

Kernel Layer - btt

Kernel Layer - btt (time-line)

perf Linux Performance Tool

valgrind memcheck (Memory Leaks)

valgrind kcachegrind (Call Profiler)

Q & A