Apache Hadoop Goes Realtime at Facebook ~ Borthakur, Sarma, Gray, Muthukkaruppan, Spiegelberg, Kuang, Ranganathan, Molkov, Menon, Rash, Scmidt and

Similar documents
Apache HBase. Crazy dances on the elephant back

Realtime Apache Hadoop at Facebook. Jonathan Gray & Dhruba Borthakur June 14, 2011 at SIGMOD, Athens

Apache Hadoop Goes Realtime at Facebook

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

Non-Stop for Apache HBase: Active-active region server clusters TECHNICAL BRIEF

Facebook: Cassandra. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation

BookKeeper. Flavio Junqueira Yahoo! Research, Barcelona. Hadoop in China 2011

HBase Schema Design. NoSQL Ma4ers, Cologne, April Lars George Director EMEA Services

Comparison of the Frontier Distributed Database Caching System with NoSQL Databases

Near Real Time Indexing Kafka Message to Apache Blur using Spark Streaming. by Dibyendu Bhattacharya

Integrating Big Data into the Computing Curricula

NoSQL Data Base Basics

Big Data With Hadoop

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com

Hadoop Ecosystem B Y R A H I M A.

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Lecture 10: HBase! Claudia Hauff (Web Information Systems)!

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

CitusDB Architecture for Real-Time Big Data

Hypertable Goes Realtime at Baidu. Yang Dong Sherlock Yang(

CS2510 Computer Operating Systems

CS2510 Computer Operating Systems

Apache Cassandra Present and Future. Jonathan Ellis

LARGE-SCALE DATA STORAGE APPLICATIONS

Design and Evolution of the Apache Hadoop File System(HDFS)

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

How To Scale Out Of A Nosql Database

Processing of massive data: MapReduce. 2. Hadoop. New Trends In Distributed Systems MSc Software and Systems

Cloud Computing at Google. Architecture

Data Management in the Cloud

High Availability on MapR

Building Mission Critical Messaging System On Top Of HBase

Introduction to Big Data Training

Prepared By : Manoj Kumar Joshi & Vikas Sawhney

Snapshots in Hadoop Distributed File System

Programming Hadoop 5-day, instructor-led BD-106. MapReduce Overview. Hadoop Overview

Data-Intensive Programming. Timo Aaltonen Department of Pervasive Computing

Big Data Technology Core Hadoop: HDFS-YARN Internals

Apache Hadoop: Past, Present, and Future

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Peers Techno log ies Pv t. L td. HADOOP

NoSQL and Hadoop Technologies On Oracle Cloud

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

Design Patterns for Distributed Non-Relational Databases

Real-time Analytics at Facebook: Data Freeway and Puma. Zheng Shao 12/2/2011

How To Use Big Data For Telco (For A Telco)

Hadoop Scalability at Facebook. Dmytro Molkov YaC, Moscow, September 19, 2011

Overview. Big Data in Apache Hadoop. - HDFS - MapReduce in Hadoop - YARN. Big Data Management and Analytics

Hadoop IST 734 SS CHUNG

Building Scalable Big Data Infrastructure Using Open Source Software. Sam William

Cassandra A Decentralized, Structured Storage System

Benchmarking Hadoop & HBase on Violin

Apache HBase: the Hadoop Database

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

Storage of Structured Data: BigTable and HBase. New Trends In Distributed Systems MSc Software and Systems

Reusable Data Access Patterns

Distributed storage for structured data

CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop)

Using distributed technologies to analyze Big Data

Non-Stop Hadoop Paul Scott-Murphy VP Field Techincal Service, APJ. Cloudera World Japan November 2014

THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

The Hadoop Distributed File System

Apache Hama Design Document v0.6

Journal of science STUDY ON REPLICA MANAGEMENT AND HIGH AVAILABILITY IN HADOOP DISTRIBUTED FILE SYSTEM (HDFS)

CSE-E5430 Scalable Cloud Computing Lecture 2

Application Development. A Paradigm Shift

Hypertable Architecture Overview

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Unified Batch & Stream Processing Platform

Big Data Benchmark. The Cloudy Approach. Dhruba Borthakur Nov 7, 2012 at the SDSC Industry Roundtable on Big Data and Big Value

Open source large scale distributed data management with Google s MapReduce and Bigtable

Distributed Systems. Tutorial 12 Cassandra

MapReduce with Apache Hadoop Analysing Big Data

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Infrastructures for big data

White Paper. Managing MapR Clusters on Google Compute Engine

Trafodion Operational SQL-on-Hadoop

Practical Cassandra. Vitalii

Cloudera Certified Developer for Apache Hadoop

F1: A Distributed SQL Database That Scales. Presentation by: Alex Degtiar (adegtiar@cmu.edu) /21/2013

Challenges for Data Driven Systems

these three NoSQL databases because I wanted to see a the two different sides of the CAP

An Industrial Perspective on the Hadoop Ecosystem. Eldar Khalilov Pavel Valov

Apache Hadoop FileSystem and its Usage in Facebook

Complete Java Classes Hadoop Syllabus Contact No:

Introduction to Apache Cassandra

Avoid a single point of failure by replicating the server Increase scalability by sharing the load among replicas

Cassandra vs MySQL. SQL vs NoSQL database comparison

Google Bing Daytona Microsoft Research

Bigdata High Availability (HA) Architecture

An Approach to Implement Map Reduce with NoSQL Databases

Big Data and Hadoop. Module 1: Introduction to Big Data and Hadoop. Module 2: Hadoop Distributed File System. Module 3: MapReduce

The big data revolution

!"#$%&' ( )%#*'+,'-#.//"0( !"#$"%&'()*$+()',!-+.'/', 4(5,67,!-+!"89,:*$;'0+$.<.,&0$'09,&)"/=+,!()<>'0, 3, Processing LARGE data sets

Introduction to Hbase Gkavresis Giorgos 1470

How To Handle Big Data With A Data Scientist

Hadoop Distributed File System. Dhruba Borthakur Apache Hadoop Project Management Committee

Big Data Storage

Pro Apache Hadoop. Second Edition. Sameer Wadkar. Madhu Siddalingaiah

Transcription:

Apache Hadoop Goes Realtime at Facebook ~ Borthakur, Sarma, Gray, Muthukkaruppan, Spiegelberg, Kuang, Ranganathan, Molkov, Menon, Rash, Scmidt and Aiyer

Problem and Context Ever increasing data at Facebook Launch of Facebook Messages Other Young Turks at Facebook Leaving MySQL and its sharding Migration challeneges Problem in words: Unpredictable growth, write throughput and latency requirements 2

Problem and Context (contd.) The Usual Suspects Cassandra Other NoSQL Other considerations Solution: A near realtime Hadoop/HBase that is modified from the vanilla versions to provide scalability, consistency, availability and a compatible data model. 3

Key contributions Making Hadoop and HBase more real-time Adapting Hadoop and HBase to Facebook's unique requirements Implementation of RealTime HDFS Implementation of Production HBase Operational Optimizations 4

Overview Problem and Context Facebook stands alone (Small) Introduction to Hadoop and HBase Enter the Hs Realtime HDFS Production HBase Operational Optimization The present future 5

Facebook's unique requirements Facebook and the Hadoop ecosystem Offline and sequential Requirement Type 1 Realtime concurrent read access to large stream of realtime data Example: Scribe Requirement Type 2 - Dynamically index a rapidly growing data set for fast random lookups Example: Facebook Messages 6

Facebook's unique requirements Facebook Messaging: Unweildly tables High Write Throughput Data Migration Facebook Insights Realtime Analytics Aggregators Facebook Metrics System Quick reads Automatic Sharding 7

Problem and Context Overview Facebook stands alone (Small) Introduction to Hadoop and HBase Enter the Hs Realtime HDFS Production Hbase Operational Optimization The present future 8

Introduction to Hadoop 9

Introduction to HBase Hbase: A NoSQL database that utilizes an on-disk column storage format. Hbase USP: Provides fast key-based access to a specific cell or data or a range of cells. Based on Google's BigTable but extends it Has Row atomicity and read-modify-write consistency Simplifies a lot of tasks related to distributed databases. Tagline: Random access to web-scale data 10

Introduction to Zookeeper Zookeeper: A software service for a distributed environment that coordinates and configures different machines in a centralized way. A change is not considered successful until it has been written to a quorum A leader is elected within the ensemble for conflicts In HBase, ZooKeeper coordinates and shares state between the Masters and RegionServers. Tagline: Enables highly reliable distributed coordination 11

HBase + Zookeeper 12

Problem and Context Overview Facebook stands alone (Small) Introduction to Hadoop and HBase Enter the Hs Realtime HDFS Production HBase Operational Optimization The present future 13

The Why Hadoop/HBase question Scalability Range Scans Efficient low-latency strong consistency Atomic Read-Modify-Write Random reads Fault Isolation 14

The Why Hadoop/HBase question High write throughput Data model High Availability Non-requirements Tolerance of network partitions Individual data centre failure zero downtime Federation comfort 15

Problem and Context Overview Facebook stands alone (Small) Introduction to Hadoop and HBase Enter the Hs Realtime HDFS Production HBase Operational Optimization The present future 16

Realtime HDFS - AvatarNode 17

Realtime HDFS - AvatarNode 18

Realtime HDFS AvatarNode view 19

Realtime HDFS Logging Enhancements to Transcation logging: Conventional HDFS Change: Let the StandbyNode always know about block ids. Avoidance of partial reads between Active and Standby node 20

Improved block availability Challenge: Placement of non-local blocks is not optimal; can be on any rack or within any node therein. Soution: A new block placement policy which has reduced the probabilty of data loss by orders of magnitude. Define a 'window' of logical racks and logical machines around the original block. 21

Hadoop performance improvements RPC Timeout Live free or fail fast File Lease recovery Local replica awareness New tricks: HDFS sync Concurrent readers 22

Problem and Context Overview Facebook stands alone (Small) Introduction to Hadoop and HBase Enter the Hs Realtime HDFS Production HBase Operational Optimization The present future 23

HBase ACID compliance Requirement: Row-level atomicity and consistency of ACID compliance RegionServer failure during log write for row transactions. Consistency of replicas Solution: WAL edits ~ Write Ahead Log policy Immediate rollback 24

HBase Availability Improvements Master Rewrite Store transient state in Zookeeper Rolling upgrades Handled by reassigning of regions Distributed Logsplitting Outsource to Zookeeper 25

Hbase Performance Improvement Compaction Improvement put latency dropped from 25 ms to 3 ms! Read Optimization Skipping certain unnecessary files for certain queries, reducing I/O Using Bloom filters A new special timestamp file selection algorithm Ensuring that Regions are local to their data 26

Problem and Context Overview Facebook stands alone (Small) Introduction to Hadoop and HBase Enter the Hs Realtime HDFS Production Hbase Operational Optimization The present future 27

Operational Optimizations Facebook's HBase testing program HBase Verify HBCK Added metrics for long running operations too! Manual split instead of automatic 28

Operational Optimizations Dark Launch Dashboard/ODS integration Cross-cluster dashboards for higher analysis Visualize version differences Backups Do it using Scribe as an alternate application log Piggyback on the date sent for Hive analytics 29

Operational Optimizations Importing the data Challenge: Importing legacy data in HBase from a Hadoop job saturates the production network Solution: Use Bulk Import with compression Enhanced by GZIP of the intermediate map output Reducing Network I/O: Decreased the periodicity of major compactions Certain column families excluded from logging 30

The present future Apache Hadoop 2.0 was released in 2012 One addition was YARN A powerful cluster resource management Added the High Availability feature to NameNode by introducing the Hot/Standby NameNode. Greater integration with Zookeeper, especially for the ZKFC (Implementation of failover in DAFS) 31

Thank you and GG! 32