Hadoop Scalability at Facebook. Dmytro Molkov (dms@fb.com) YaC, Moscow, September 19, 2011



Similar documents
The Hadoop Distributed File System

Design and Evolution of the Apache Hadoop File System(HDFS)

HADOOP MOCK TEST HADOOP MOCK TEST I

Hadoop Architecture. Part 1

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Hadoop implementation of MapReduce computational model. Ján Vaňo

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Overview. Big Data in Apache Hadoop. - HDFS - MapReduce in Hadoop - YARN. Big Data Management and Analytics

HDFS Federation. Sanjay Radia Founder and Hortonworks. Page 1

Large scale processing using Hadoop. Ján Vaňo

Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components

HDFS Under the Hood. Sanjay Radia. Grid Computing, Hadoop Yahoo Inc.

Prepared By : Manoj Kumar Joshi & Vikas Sawhney

Certified Big Data and Apache Hadoop Developer VS-1221

THE HADOOP DISTRIBUTED FILE SYSTEM

The Evolving Apache Hadoop Eco-System

Journal of science STUDY ON REPLICA MANAGEMENT AND HIGH AVAILABILITY IN HADOOP DISTRIBUTED FILE SYSTEM (HDFS)

BookKeeper. Flavio Junqueira Yahoo! Research, Barcelona. Hadoop in China 2011

Apache Hadoop. Alexandru Costan

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, UC Berkeley, Nov 2012

Hadoop Distributed File System. Dhruba Borthakur Apache Hadoop Project Management Committee June 3 rd, 2008

Hadoop Distributed File System. Jordan Prosch, Matt Kipps

<Insert Picture Here> Big Data

Real-time Analytics at Facebook: Data Freeway and Puma. Zheng Shao 12/2/2011

CS2510 Computer Operating Systems

CS2510 Computer Operating Systems

EXPERIMENTATION. HARRISON CARRANZA School of Computer Science and Mathematics

Sujee Maniyam, ElephantScale

Apache HBase. Crazy dances on the elephant back

Operations and Big Data: Hadoop, Hive and Scribe. Zheng 铮 9 12/7/2011 Velocity China 2011

DATA MINING WITH HADOOP AND HIVE Introduction to Architecture

Apache Hadoop FileSystem and its Usage in Facebook

Big Data With Hadoop

Case Study : 3 different hadoop cluster deployments

Application Development. A Paradigm Shift

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Lecture 2 (08/31, 09/02, 09/09): Hadoop. Decisions, Operations & Information Technologies Robert H. Smith School of Business Fall, 2015

High Availability on MapR

Hadoop: Embracing future hardware

Cloudera Manager Health Checks

Distributed File Systems

Open source software framework designed for storage and processing of large scale data on clusters of commodity hardware

Realtime Apache Hadoop at Facebook. Jonathan Gray & Dhruba Borthakur June 14, 2011 at SIGMOD, Athens

Benchmarking Hadoop & HBase on Violin

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com

Open source Google-style large scale data analysis with Hadoop

Hadoop & its Usage at Facebook

The Hadoop Distributed File System

Data-Intensive Computing with Map-Reduce and Hadoop

Extending Hadoop beyond MapReduce

Take An Internal Look at Hadoop. Hairong Kuang Grid Team, Yahoo! Inc

Distributed File Systems

Hadoop Parallel Data Processing

A very short Intro to Hadoop

HDFS Design Principles

Hadoop Ecosystem B Y R A H I M A.

CURSO: ADMINISTRADOR PARA APACHE HADOOP

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray VMware

CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop)

Accelerating and Simplifying Apache

Hadoop Distributed File System. Dhruba Borthakur Apache Hadoop Project Management Committee

Lecture 5: GFS & HDFS! Claudia Hauff (Web Information Systems)! ti2736b-ewi@tudelft.nl

Big Data Technology Core Hadoop: HDFS-YARN Internals

Hadoop. Apache Hadoop is an open-source software framework for storage and large scale processing of data-sets on clusters of commodity hardware.

Deploying Hadoop with Manager

HDFS Users Guide. Table of contents

Distributed File Systems

Hadoop & its Usage at Facebook

Near Real Time Indexing Kafka Message to Apache Blur using Spark Streaming. by Dibyendu Bhattacharya

Hadoop IST 734 SS CHUNG

International Journal of Advancements in Research & Technology, Volume 3, Issue 2, February ISSN

Snapshots in Hadoop Distributed File System

Apache Hadoop: Past, Present, and Future

Cloudera Manager Health Checks

Hadoop Distributed File System (HDFS)

HDFS 2015: Past, Present, and Future

Storage Architectures for Big Data in the Cloud

HDFS: Hadoop Distributed File System

!"#$%&' ( )%#*'+,'-#.//"0( !"#$"%&'()*$+()',!-+.'/', 4(5,67,!-+!"89,:*$;'0+$.<.,&0$'09,&)"/=+,!()<>'0, 3, Processing LARGE data sets

HADOOP MOCK TEST HADOOP MOCK TEST II

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

Chapter 7. Using Hadoop Cluster and MapReduce

Apache Hadoop FileSystem Internals

Qsoft Inc

HadoopRDF : A Scalable RDF Data Analysis System

and HDFS for Big Data Applications Serge Blazhievsky Nice Systems

Extended Attributes and Transparent Encryption in Apache Hadoop

Hadoop Architecture and its Usage at Facebook

Big Data Storage Options for Hadoop Sam Fineberg, HP Storage

INTRODUCTION TO APACHE HADOOP MATTHIAS BRÄGER CERN GS-ASE

White Paper. Managing MapR Clusters on Google Compute Engine

marlabs driving digital agility WHITEPAPER Big Data and Hadoop

BBM467 Data Intensive ApplicaAons

Transcription:

Hadoop Scalability at Facebook Dmytro Molkov (dms@fb.com) YaC, Moscow, September 19, 2011

How Facebook uses Hadoop Hadoop Scalability Hadoop High Availability HDFS Raid

How Facebook uses Hadoop

Usages of Hadoop at Facebook Warehouse Thousands of machines in the cluster Tens of petabytes of data Tens of thousands of jobs/queries a day Over a hundred million files Scribe-HDFS Dozens of small clusters Append support High availability High throughput

Usages of Hadoop at Facebook (contd.) Realtime Analytics Medium sized hbase clusters High throughput/low latency FB Messages Storage Medium sized hbase clusters Low latency High data durability High Availability Misc Storage/Backup clusters Small to medium sized Various availability/performance requirements

Hadoop Scalability

Hadoop Scalability Warehouse Cluster - A Single Cluster approach Good data locality Ease of data access Operational Simplicity NameNode is the bottleneck Memory pressure - too many files and blocks CPU pressure - too many metadata operations against a single node Long Startup Time JobTracker is the bottleneck Memory Pressure - too many jobs/tasks/counters in memory CPU pressure - scheduling computation is expensive

HDFS Federation Wishlist Single Cluster Preserve Data Locality Keep Operations Simple Distribute both CPU and Memory Load

Hadoop Federation Design NameN ode #1 NameN ode #N Data Node Data Node... Data Node

HDFS Federation Overview Each NameNode holds a part of the NameSpace Hive tables are distributed between namenodes Hive Metastore stores full locations of the tables (including the namenode) -> Hive clients know which cluster the data is stored in HDFS Clients have a mount table to know where the data is Each namespace uses all datanodes for storage -> the cluster load is fully balanced (Storage and I/O) Single Datanode process per node ensures good utilization of resources

Map-Reduce Federation Backward Compatibility with the old code Preserve data locality Make scheduling faster Ease the resource pressure on the JobTracker

Map Reduce Federation Resource Request Job Client Cluster Resource Manager Job Communication Resource Heartbeats Task Track er... Task Track er

MapReduce Federation Overview Cluster Manager only allocates resources JobTracker per user -> few tasks per JobTracker -> more responsive scheduling ClusterManager is stateless -> shorter restart times -> better availability

Hadoop High Availability

Warehouse High Availability Full cluster restart takes 90-120 mins Software upgrade is 20-30 hrs of downtime/year Cluster crash is 5 hrs of downtime/year MapReduce tolerates failures

HDFS High Availability Design Primary NN Edits Log Edits Log Standb y NN Block Reports/ Block Received NFS Block Reports/ Block Received DataNodes

Clients Design Using ZooKeeper as a method of name resolution Under normal conditions ZooKeeper contains a location of the primary node During the failover ZooKeeper record is empty and the clients know to wait for the failover to complete On a network failure clients check if the ZooKeeper entry has changed and retry the command agains the new Primary NameNode if the failover has occurred For the large clusters Clients also cache the location of the primary on the local node to ease the load on the zookeeper cluster

HDFS Raid

HDFS Raid 3 way replication Data locality - necessary only for the new data Data availability - necessary for all kinds of data Erasure codes Data locality is worse than 3 way replication Data availability is at least as good as 3 way replication

HDFS Raid Detais XOR 10 blocks replicated 3 times = 30 physical blocks Effective replication factor 3.0 Reed Solomon Encoding 10 blocks replicated twice + checksum (XOR) block replicated twice = 22 physical blocks. Effective replication factor 2.2 10 blocks replicated 3 times = 30 physical blocks Effective replication factor 3.0 10 blocks with replication factor 1 + erasure codes (RS) replicated once = 14 physical blocks. Effective replication factor 1.4

HDFS Raid Pros and Cons Saves a lot of space Provides same guarantees for data availability Worse data locality Need to reconstruct blocks instead of replicating (CPU + Network cost) Block location in the cluster is important and needs to be maintained

facebook.com/dms dms@fb.com

(c) 2007 Facebook, Inc. or its licensors. "Facebook" is a registered trademark of Facebook, Inc.. All rights reserved. 1.0