Unstructured Data Accelerator (UDA) Author: Motti Beck, Mellanox Technologies Date: March 27, 2012



2 Market Trends: Big Data
- Growing technology deployments are creating an exponential increase in the volume of data available
- Existing analytical techniques are not adequate for business decision-making processes
- A successful approach to big data analytics will be a critical core competency
- Delivering significant competitive advantage to organizations
[Figure: Nationwide Electricity Grid Analysis]


3 Big Data: Not Just a Volume Play
Facebook statistics, 2009:
- 350 million named users
- 175 million active users in one day
- 35 million users updating status each day
- 2.5 billion photos / month
- 1.6 million active pages
- Growth: 12 TB/day, 2 PB/year
- Global data volume: 8.7 PB
Source: Gartner


4 Market Trends: Serial to Parallel Computing
- Big data analytics is parallel processing
- Single core to multi-core
- Single computer to clustered computing
- Traditional data-processing pipeline to big data parallel processing over Hadoop

5 Hadoop Cluster over High Performance Networking
- I/O-intensive phases
- Bursty characteristics
[Figure: MapReduce data flow diagram with the labels Data Split, Mapper, Shuffle, Merge Sort, Reducer, Out Split, and HDFS Replication]


6 I/O Bottlenecks: from Disk to Network
Trends:
- Use multiple SATA drives
- SSD is around the corner
- Matching network speed: 10 Gigabit Ethernet and beyond
[Figure: "Disk and Network Performance". Peak bandwidth (MB/s) for SATA, 4 SATA, 12 SATA, SSD (SAS), 12 SSD (SAS), SSD (PCIe), and Quad SSD (PCIe), compared against 1GE, 10GE, and 40GE.]
[Figure: "Enterprise Server SSD Trends". Avg Capacity (GB), ASP ($), and ASP/GB (Cent/GB). Source: "SSD Summary", Gartner, May; the year and the plotted values did not survive extraction.]
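The disk-versus-network comparison on this slide can be sketched as a small calculation. The link figures below are raw line rates (1, 10, and 40 Gbit/s divided by 8, ignoring protocol overhead). The per-drive throughput is a hypothetical input, since the chart values did not survive extraction, and the function name is illustrative only.

```python
import math

# Raw line rates in MB/s (Gbit/s * 1000 / 8); protocol overhead is ignored.
LINK_MB_PER_S = {"1GE": 125.0, "10GE": 1250.0, "40GE": 5000.0}


def drives_to_saturate(link: str, per_drive_mb_s: float) -> int:
    """Return how many drives, streaming in parallel, match one link.

    per_drive_mb_s is an assumed sequential throughput for a single drive;
    it is not a figure taken from the slide.
    """
    if per_drive_mb_s <= 0:
        raise ValueError("per_drive_mb_s must be positive")
    return math.ceil(LINK_MB_PER_S[link] / per_drive_mb_s)


if __name__ == "__main__":
    # Assumption for illustration: 100 MB/s per SATA drive.
    for link in ("1GE", "10GE", "40GE"):
        print(link, drives_to_saturate(link, 100.0))
```

Under that assumed 100 MB/s figure, a handful of drives already exceeds 1GE, which is the point the slide makes about moving to 10 Gigabit Ethernet and beyond.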

7 Efficient Fabric Services
- Scalable and non-blocking: matches extensive data exchange at peak rates; non-blocking east-west traffic
- High capacity: match all I/O bandwidth of the server
- Losslessness: efficiency and avoidance of retransmission
- Offload: full transport offload reduces CPU utilization; RDMA zero-copy operations


8 UDA Plug-In Architecture
- Plug-in architecture: Hadoop applications are unmodified; plugs in to Apache Hadoop; enabled via XML configuration
- Efficient MapReduce: data communication over RDMA; uses RDMA for in-memory processing; allows the Reduce operation to start in parallel with the Shuffle operation; reduces disk I/O operations
- Supports InfiniBand and Ethernet: zero copy, transport offload, kernel bypass

9 Software Architecture
- Hadoop (Java): JobTracker, TaskTracker with MapTask, TaskTracker with ReduceTask
- UDA Plugin (C++): MOFSupplier, NetMerger, Data Engine, RDMA Server, RDMA Client, merging threads
- RDMA NIC / HCA
- Plug-in benefits: zero-copy datapath, transport offload, improved merge algorithm


10 New Pipelined Data Flow
[Figure: timeline comparing two data flows after the Map stage. Vanilla algorithm: repeated shuffle and merge steps, followed by Reduce. New algorithm: header fetch, then shuffle and merge together with Reduce, with a "Reduce start" marker on the time axis.]


11 UDA MapReduce for Hadoop 1.X
- Shuffle portion executed in memory, eliminating time-consuming HDD reads and writes
- UDA reads Map Output Files (MOFs) from mappers in predefined portions (default 1 KB)
- Reduce tasks begin only once all mapper tasks have completed
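The idea of reading map outputs in fixed portions and merging them in memory can be illustrated with a lazy k-way merge. This is a minimal sketch and not the UDA implementation: `Segment` is a hypothetical stand-in for one mapper's sorted output, portions are counted in records rather than bytes, and the standard-library `heapq.merge` is swapped in for whatever merge algorithm the plug-in actually uses.

```python
import heapq
from typing import Iterator, List, Tuple

Record = Tuple[str, str]  # (key, value)


class Segment:
    """Hypothetical stand-in for one mapper's sorted output file (MOF)."""

    def __init__(self, records: List[Record], portion: int = 2) -> None:
        self._records = records
        self._portion = portion  # records per fetch; the slide quotes 1 KB
        self.fetches = 0         # counts simulated remote reads

    def portions(self) -> Iterator[List[Record]]:
        # Each iteration models one remote read of a fixed-size portion.
        for start in range(0, len(self._records), self._portion):
            self.fetches += 1
            yield self._records[start:start + self._portion]


def stream(segment: Segment) -> Iterator[Record]:
    """Flatten a segment's portions into a record stream, fetching on demand."""
    for portion in segment.portions():
        yield from portion


def merged(segments: List[Segment]) -> Iterator[Record]:
    """Lazily merge sorted segments by key without materializing them."""
    return heapq.merge(*(stream(s) for s in segments), key=lambda r: r[0])


if __name__ == "__main__":
    a = Segment([("a", "1"), ("d", "2"), ("f", "3")])
    b = Segment([("b", "4"), ("c", "5"), ("g", "6")])
    for key, value in merged([a, b]):
        print(key, value)
```

Because the merge pulls records only as the consumer asks for them, the first merged record is available after a single portion has been fetched from each segment, and nothing is written to local disk along the way.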


12 UDA Enables Higher Performance
[Figure: TeraSort benchmark* (20GB file size, 16GB data per node, 8 mappers, 4 reducers, 4 disks). Execution time (sec) on a 0 to 1000 axis for 1 GE, 10 GE, and UDA 10GE at 8, 10, and 12 nodes. Annotations: "45%", "~2X Acceleration". Lower is better.]
*TeraSort is a popular benchmark used to measure the performance of a Hadoop cluster
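The two chart annotations can be related with a short calculation. This sketch assumes the "45%" label means a 45% reduction in execution time, which the extracted text does not state explicitly; the function name is illustrative.

```python
def speedup_from_reduction(reduction: float) -> float:
    """Convert a fractional cut in execution time into a speedup factor.

    If the new time is (1 - reduction) times the old time, the speedup is
    old / new = 1 / (1 - reduction).
    """
    if not 0.0 <= reduction < 1.0:
        raise ValueError("reduction must be in [0, 1)")
    return 1.0 / (1.0 - reduction)


if __name__ == "__main__":
    # Assumption: the 45% annotation is a reduction in execution time.
    print(round(speedup_from_reduction(0.45), 2))
```

Under that reading, a 45% shorter run corresponds to a speedup of roughly 1.8x, which is in line with the "~2X Acceleration" label; an exact 2x would need a 50% reduction.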


13 Accelerating Big Data Analytics
- EMC 1000-node analytic platform accelerates the industry's Hadoop development
- 24 petabytes of physical storage: half of every written word since the inception of mankind
- Mellanox VPI solutions
- Hadoop acceleration: 2X faster Hadoop job run-time
- High throughput, low latency, RDMA: critical for ROI

14 Thank You
