Tachyon: A Reliable Memory Centric Storage for Big Data Analytics



Similar documents
Tachyon: memory-speed data sharing

Tachyon: Reliable File Sharing at Memory- Speed Across Cluster Frameworks

An Open Source Memory-Centric Distributed Storage System

Beyond Hadoop with Apache Spark and BDAS

Spark: Making Big Data Interactive & Real-Time

Next-Gen Big Data Analytics using the Spark stack

Conquering Big Data with BDAS (Berkeley Data Analytics)

Fast and Expressive Big Data Analytics with Python. Matei Zaharia UC BERKELEY

What s next for the Berkeley Data Analytics Stack?

Spark and Shark. High- Speed In- Memory Analytics over Hadoop and Hive Data

How To Create A Data Visualization With Apache Spark And Zeppelin

Unified Big Data Processing with Apache Spark. Matei

How Companies are! Using Spark

GraySort on Apache Spark by Databricks

Can t We All Just Get Along? Spark and Resource Management on Hadoop

Big Data Processing. Patrick Wendell Databricks

Mesos: A Platform for Fine- Grained Resource Sharing in Data Centers (II)

Apache Spark and Distributed Programming

CSE-E5430 Scalable Cloud Computing Lecture 11

Spark in Action. Fast Big Data Analytics using Scala. Matei Zaharia. project.org. University of California, Berkeley UC BERKELEY

The Flink Big Data Analytics Platform. Marton Balassi, Gyula Fora" {mbalassi,

The Berkeley AMPLab - Collaborative Big Data Research

Improving MapReduce Performance in Heterogeneous Environments

CS 294: Big Data System Research: Trends and Challenges

Big Data Analytics Hadoop and Spark

Near Real Time Indexing Kafka Message to Apache Blur using Spark Streaming. by Dibyendu Bhattacharya

Moving From Hadoop to Spark

Rakam: Distributed Analytics API

Dell In-Memory Appliance for Cloudera Enterprise

Introduction to Big Data! with Apache Spark" UC#BERKELEY#

Spark. Fast, Interactive, Language- Integrated Cluster Computing

Hadoop Architecture. Part 1

Unified Big Data Analytics Pipeline. 连 城

Spark Application on AWS EC2 Cluster:

Apache Spark 11/10/15. Context. Reminder. Context. What is Spark? A GrowingStack

Machine- Learning Summer School

Architectures for massive data management

Learning. Spark LIGHTNING-FAST DATA ANALYTICS. Holden Karau, Andy Konwinski, Patrick Wendell & Matei Zaharia

Apache Spark : Fast and Easy Data Processing Sujee Maniyam Elephant Scale LLC sujee@elephantscale.com

Big Data Research in the AMPLab: BDAS and Beyond

Real Time Data Processing using Spark Streaming

Linux Clusters Ins.tute: Turning HPC cluster into a Big Data Cluster. A Partnership for an Advanced Compu@ng Environment (PACE) OIT/ART, Georgia Tech

Mambo Running Analytics on Enterprise Storage

Ali Ghodsi Head of PM and Engineering Databricks

Spark and the Big Data Library

Brave New World: Hadoop vs. Spark

Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

Significantly Speed up real world big data Applications using Apache Spark

Big Data Analytics. Lucas Rego Drumond

CS555: Distributed Systems [Fall 2015] Dept. Of Computer Science, Colorado State University

Apache Spark and the future of big data applica5ons. Eric Baldeschwieler

Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components

The Berkeley Data Analytics Stack: Present and Future

The Stratosphere Big Data Analytics Platform

YARN, the Apache Hadoop Platform for Streaming, Realtime and Batch Processing

Conquering Big Data with Apache Spark

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

6. How MapReduce Works. Jari-Pekka Voutilainen

Apache Flink Next-gen data analysis. Kostas

In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet

Survey of the Benchmark Systems and Testing Frameworks For Tachyon-Perf

Processing NGS Data with Hadoop-BAM and SeqPig

CSE-E5430 Scalable Cloud Computing. Lecture 4

Hadoop Ecosystem B Y R A H I M A.

GraySort and MinuteSort at Yahoo on Hadoop 0.23

HiBench Introduction. Carson Wang Software & Services Group

Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

An Architecture for Fast and General Data Processing on Large Clusters

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

Beyond Batch Processing: Towards Real-Time and Streaming Big Data

Maximizing Hadoop Performance and Storage Capacity with AltraHD TM

IMPROVED FAIR SCHEDULING ALGORITHM FOR TASKTRACKER IN HADOOP MAP-REDUCE

Hadoop & Spark Using Amazon EMR

Hadoop: Embracing future hardware

A Brief Introduction to Apache Tez

Introduction to Big Data with Apache Spark UC BERKELEY

Computing at Scale: Resource Scheduling Architectural Evolution and Introduction to Fuxi System

Take An Internal Look at Hadoop. Hairong Kuang Grid Team, Yahoo! Inc

Big Data Storage Architecture Design in Cloud Computing

Real-Time Analytical Processing (RTAP) Using the Spark Stack. Jason Dai Intel Software and Services Group

Hadoop MapReduce and Spark. Giorgio Pedrazzi, CINECA-SCAI School of Data Analytics and Visualisation Milan, 10/06/2015

Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics

Streaming items through a cluster with Spark Streaming

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Big Data and Scripting Systems beyond Hadoop

Spark and Shark: High-speed In-memory Analytics over Hadoop Data

How to properly misuse Hadoop. Marcel Huntemann NERSC tutorial session 2/12/13

Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN

Extending Hadoop beyond MapReduce

Introduc8on to Apache Spark

Berkeley Data Analytics Stack:! Experience and Lesson Learned

From GWS to MapReduce: Google s Cloud Technology in the Early Days

Enhancing Dataset Processing in Hadoop YARN Performance for Big Data Applications

Practical Hadoop. Security. Bhushan Lakhe

Transcription:

Tachyon: A Reliable Memory Centric Storage for Big Data Analytics a Haoyuan (HY) Li, Ali Ghodsi, Matei Zaharia, Scott Shenker, Ion Stoica June 30 th, 2014 Spark Summit @ San Francisco UC Berkeley

Outline Overview Research Open Source Future

Outline Overview Research Open Source Future

Memory is King RAM throughput increasing exponen9ally Disk throughput increasing slowly Memory- locality key to interac9ve response 9me

Realized by many Frameworks already leverage memory 5

Problem solved? 6

An Example: - Fast in- memory data processing framework Keep one in- memory copy inside JVM Track lineage of opera9ons used to derive data Upon failure, use lineage to recompute data map Lineage Tracking join reduce filter map

Issue 1 Different jobs share data: Slow writes to disk storage engine & execution engine same process (slow writes) block 1 block 3 Spark Task Spark mem block manager block 3 block 1 Spark Task Spark mem block manager block 1 block 3 block 2 block 4 HDFS disk 8

Issue 1 Different frameworks share data: Slow writes to disk storage engine & execution engine same process (slow writes) block 1 block 3 Spark Task Spark mem block manager Hadoop MR YARN block 1 block 3 block 2 Block 4 HDFS disk 9

Issue 2 execution engine & storage engine same process block 1 block 3 Spark Task Spark memory block manager block 1 block 3 block 2 block 4 HDFS disk 10

Issue 2 execution engine & storage engine same process block 1 block 3 crash Spark memory block manager block 1 block 3 block 2 block 4 HDFS disk 11

Issue 2 Process crash: lose all cache execution engine & storage engine same process crash block 1 block 3 block 2 block 4 HDFS disk 12

Issue 3 duplicate memory per job & GC execution engine & storage engine same process (duplication & GC) block 1 block 3 Spark Task Spark mem block manager block 3 Block 1 Spark Task Spark mem block manager block 1 block 3 block 2 block 4 HDFS disk 13

Tachyon Reliable data sharing at memory- speed within and across cluster frameworks/jobs 14

Solu9on Overview Basic idea Push lineage down to storage layer Use memory aggressively Facts One data copy in memory Rely on recomputa9on for fault- tolerance

Stack Computation Frameworks (Spark, MapReduce, Impala, Tez, ) Tachyon Existing Storage Systems (HDFS, S3, GlusterFS, )

Architecture 17

Lineage 18

Issue 1 revisited Different frameworks share at memory- speed execution engine & storage engine same process (fast writes) block 1 Spark Task Spark mem Hadoop MR YARN block 1 block 3 block 1 Block 3 block 2 block 4 block 2 Block 4 Tachyon HDFS in- memory disk HDFS disk 19

Issue 2 revisited execution engine & storage engine same process block 1 Spark Task Spark memory block manager block 1 block 3 block 2 block 4 Tachyon HDFS in- memory disk 20

Issue 2 revisited execution engine & storage engine same process crash Spark memory block manager block 1 block 3 block 2 block 4 Tachyon HDFS in- memory disk block 1 block 3 block 2 block 4 HDFS disk 21

Issue 2 revisited process crash: keep memory- cache execution engine & storage engine same process crash block 1 block 3 block 2 block 4 Tachyon HDFS in- memory disk block 1 block 3 block 2 block 4 HDFS disk 22

Issue 3 revisited Off- heap memory storage one memory copy & no GC execution engine & storage engine same process (no duplication & GC) Spark Task Spark mem Spark Task Spark mem block 1 block 3 block 1 Block 3 block 2 block 4 block 2 Block 4 Tachyon HDFS in- memory disk HDFS disk 23

Outline Overview Research Open Source Future

Ques9on 1: How long to get missing data back? That server contains the data computed last month! 25

Lineage enables Asynchronous Checkpoin9ng 26

Edge Algorithm Checkpoint leaves Checkpoint hot files Bounded Recovery Cost 27

Ques9on 2: How to allocate recomputa9on resource? Would recomputa9on slow down my high priority jobs? Priority Inversion? 28

Recomputa9on Resource Alloca9on Priority Based Scheduler Fair Sharing Based Scheduler 29

Comparison with in Memory HDFS

Further Improve Spark s Performance Grep

Master Faster Recovery

Recomputa9on Resource Consump9on 33

Outline Overview Research Open Source Future

A Open Source Status Apache License, Version 0.4.1 (Feb 2014) 15+ Companies Spark and MapReduce applications can run without any code change

Release Growth Tachyon 0.5: -? contributors Tachyon 0.4: - 30 contributors Tachyon 0.2: - 3 contributors Tachyon 0.3: - 15 contributors Apr 13 Oct 13 Feb 14 July 14 36

Contributors Outside of Berkeley More than 75% 37

Tachyon is the Default Off- Heap Storage SoluHon for. Thanks to Redhat!

Thanks to Redhat!

Commercially Supported By ADgeo Thanks to Redhat!

in 41

Spark/MapReduce/Shark without Tachyon Spark val file = sc.textfile( hdfs://ip:port/path ) Hadoop MapReduce hadoop jar hadoop- examples- 1.0.4.jar wordcount hdfs://localhost:19998/input hdfs://localhost: 19998/output Shark CREATE TABLE orders_cached AS SELECT * FROM orders;

Spark/MapReduce/Shark with Tachyon Spark val file = sc.textfile( tachyon://ip:port/path ) Hadoop MapReduce hadoop jar hadoop- examples- 1.0.4.jar wordcount tachyon://localhost:19998/input tachyon:// localhost:19998/output Shark CREATE TABLE orders_tachyon AS SELECT * FROM orders;

Spark OFF_HEAP with Tachyon // Input data from Tachyon s Memory val file = sc.textfile( tachyon://ip:port/path ) // Store RDD OFF_HEAP in Tachyon s Memory file.persist(off_heap)

Thanks to our Code Contributors! Aaron Davidson Achal Soni Ali Ghodsi Andrew Ash Anurag Khandelwal Aslan Bekirov Bill Zhao Calvin Jia Colin Patrick McCabe Shivaram Venkataraman Cheng Chang Du Li Fei Wang Gerald Zhang Grace Huang Hao Cheng Haoyuan Li Henry Saputra Hobin Yoon Huamin Chen Jey Kottalam Joseph Tang Lukasz Jastrzebski Manu Goyal Mark Hamstra Nick Lanham Orcun Simsek Pengfei Xuan Qifan Pu Qianhao Dong Raymond Liu Reynold Xin Robert Metzger Rong Gu Sean Zhong Srinivas Parayya Tao Wang Timothy St. Clair Vamsi Chitters Xi Liu Xiaomin Zhang 45

Outline Overview Research Open Source Future

Goal? 47

Beper Assist Other Components MapRe duce Spark Tez Shark GraphX Impala Tachyon HDFS S3 Localfs Gluster fs NFS Ceph Welcome CollaboraHon!

Short Term Roadmap Ceph Integra9on (Redhat) Hierarchical Local Storage (Intel) Further improver Shark Performance (Yahoo) Beper support for Mul9- tenancy (AMPLab) Many more from AMPLab and Industry Collaborators. 49

Your Requirements?

Tachyon Summary High-throughput, fault-tolerant memory centric storage, with lineage as a first class citizen Further improve performance for frameworks such as Spark, Hadoop, and Shark etc. Healthy community with 15+ companies contributing

Thanks! Ques,ons? More Informa,on: h5p://tachyon- project.org h5ps://github.com/amplab/tachyon Email: haoyuan@cs.berkeley.edu