Leveraging In-Memory Key/Value Stores in HPC Kelpie Craig Ulmer



Similar documents
Cloud Development Strategies

Cloud Scale Distributed Data Storage. Jürmo Mehine

NoSQL for SQL Professionals William McKnight

Infiniband Monitoring of HPC Clusters using LDMS (Lightweight Distributed Monitoring System)

How To Scale Out Of A Nosql Database

Big Data: A Storage Systems Perspective Muthukumar Murugan Ph.D. HP Storage Division

Scalable Architecture on Amazon AWS Cloud

Enabling High performance Big Data platform with RDMA

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

NoSQL and Hadoop Technologies On Oracle Cloud

Challenges for Data Driven Systems

Storage Architectures for Big Data in the Cloud

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Open source large scale distributed data management with Google s MapReduce and Bigtable

So#ware Tools and Techniques for HPC, Clouds, and Server- Class SoCs Ron Brightwell

NoSQL Data Base Basics

Moving From Hadoop to Spark

GigaSpaces Real-Time Analytics for Big Data

Sentimental Analysis using Hadoop Phase 2: Week 2

Structured Data Storage

Lecture Data Warehouse Systems

BigMemory and Hadoop: Powering the Real-time Intelligent Enterprise

X4-2 Exadata announced (well actually around Jan 1) OEM/Grid control 12c R4 just released

Pilot-Streaming: Design Considerations for a Stream Processing Framework for High- Performance Computing

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Making Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY MANNING ANN KELLY. Shelter Island

bigdata Managing Scale in Ontological Systems

A bit about Hadoop. Luca Pireddu. March 9, CRS4Distributed Computing Group. (CRS4) Luca Pireddu March 9, / 18

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

Take An Internal Look at Hadoop. Hairong Kuang Grid Team, Yahoo! Inc

Apache Flink Next-gen data analysis. Kostas

Big Fast Data Hadoop acceleration with Flash. June 2013

Achieving Mainframe-Class Performance on Intel Servers Using InfiniBand Building Blocks. An Oracle White Paper April 2003

Hadoop. Sunday, November 25, 12

Jeffrey D. Ullman slides. MapReduce for data intensive computing

Comparison of the Frontier Distributed Database Caching System with NoSQL Databases

A survey of big data architectures for handling massive data

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

Luncheon Webinar Series May 13, 2013

BASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

The Quest for Extreme Scalability

Big Data and Data Science: Behind the Buzz Words

Big Data Processing with Google s MapReduce. Alexandru Costan

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

The Importability of Performance Tools: The Good, the Bad, and the Ugly of PMUs

Web Application Platform for Sandia

NoSQL replacement for SQLite (for Beatstream) Antti-Jussi Kovalainen Seminar OHJ-1860: NoSQL databases

NA-ASC-128R-14-VOL.2-PN

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

Parallel Computing. Benson Muite. benson.

Data Modeling for Big Data

Can the Elephants Handle the NoSQL Onslaught?

Open Source Technologies on Microsoft Azure

Introduction to Big Data Training

Big Data in Test and Evaluation by Udaya Ranawake (HPCMP PETTT/Engility Corporation)

OS/Run'me and Execu'on Time Produc'vity

Big Data on Microsoft Platform

Hadoop. for Oracle database professionals. Alex Gorbachev Calgary, AB September 2013

How To Write A Nosql Database In Spring Data Project

Big Data Approaches. Making Sense of Big Data. Ian Crosland. Jan 2016

Architectures for Big Data Analytics A database perspective

How To Use Big Data For Telco (For A Telco)

Performance and Scalability Overview

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Peers Techno log ies Pv t. L td. HADOOP

NoSQL Database Options

EMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst

Apache Hadoop. Alexandru Costan

Accelerating and Simplifying Apache

Big Data With Hadoop

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

Big Data with Component Based Software

I/O Considerations in Big Data Analytics

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

Application Development. A Paradigm Shift

PaaS - Platform as a Service Google App Engine

The evolution of database technology (II) Huibert Aalbers Senior Certified Executive IT Architect

Microsoft Research Windows Azure for Research Training

Database Scalability and Oracle 12c

Design and Evolution of the Apache Hadoop File System(HDFS)

Talend Open Studio for Big Data. Release Notes 5.2.1

Getting Started with SandStorm NoSQL Benchmark

Open source Google-style large scale data analysis with Hadoop

Microsoft Research Microsoft Azure for Research Training

3 Case Studies of NoSQL and Java Apps in the Real World

Transcription:

Photos placed in horizontal position with even amount of white space between photos and header Leveraging In-Memory Key/Value Stores in HPC Kelpie Craig Ulmer Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy s National Nuclear Security Administration under contract DE-AC04-94AL85000. SAND NO. 2011-XXXXP

Overview: Leveraging KV Stores NoSQL: Dramatic impact on Out-of-Core processing Frameworks handle scheduling, placement, and reliability Enables novice users to take advantage of parallel processing Good for information apps, but not Scientific Computing NoSQL primary concern is disks, assumes cluster hardware Can we leverage some of the concepts of NoSQL in HPC? Distributed In-Memory Key/Value stores are extremely useful Kelpie: Experimental 2D KV for HPC built on Nessie (portable RDMA) Examples: Caching, Glue, Communication, New Frameworks 2

LESSONS FROM NOSQL 3

Time (s) Time (S) How did we get here? Sandia started investigating NoSQL frameworks in 2008 Can we leverage for post-processing on clusters? Make it easier to build one-off analysis Ported analysis kernels to different frameworks/stores Hadoop, Pig, Accumulo, Cassandra, MongoDB, SectorSphere LexisNexis DAS, InfoSphere, XtremeData, Netezza Hadoop MapReduce most practical 1,600 1,400 1,200 1,000 800 600 400 Netezza Mustang Netezza TwinFin6 XtremeData Prototype XtremeData dbx 1008 LexisNexis DAS-20 LexisNexis DAS-60 Hadoop Decline 32 Hadoop Amazon 32 Hadoop Amazon 128 200 0 1M 2M 5M 10M 20M 50M 100M Mesh Mesh Elements 4

MapReduce Recap 5

Good Points of MapReduce Objectify Data Objectify Calculations Framework Handles Scheduling, Reliability Load Balancing 6

Bad Points of MapReduce Inflexible Data Flow Data Suitcase Disk-Based Communication Pull-Based I/O 7

Other Lessons from NoSQL Frameworks trick users into providing useful info Atomic operations, data boundaries Indirection is worthwhile Frameworks can do more, Users micromanage less Basic functionality is often good enough HDFS: Append-only writes 2D Stores > 1D Stores Accumulo: Use a row to build an index generated by multiple writers Be open to alternatives SQL vs NoSQL 8

Meanwhile, Back in HPC What are the main concerns in HPC today? Reliability concerns Architecture changes: Fat nodes Application pipelines New programming models Are any of the NoSQL ideas relevant to HPC? Building blocks: Key/Value Stores 9

KEY/VALUE STORES 10

In-Memory, Key/Value Stores KV Stores: an old idea with renewed interest Distribute a number of servers to hold values Use a key label to reference data Put/Get interface Many existing packages Cluster: Memcached, Redis, Kyoto Cabinet, RAMCloud, HPC: IBM PIMD Clients Servers K/V Server = Hash(key) 11

Kelpie: Distributed KV Store for HPC SNL Developing experimental KV Store for HPC Built on top of Nessie Portable RPC+RDMA library (IB, Gemini, BG/P, Portals, MPI) Typical KV with tweaks 1D or 2D Keys Optional Backing store (local/remote disks) Different modes with Location Hints Remote Mode Remote + Peer Mode Peer Only K/V K/V 12

KV USE STORIES 13

[1/4] Application Caching and I/O Leverage K/V store simply as a way to store intermediate data Users put/get blocks of data via references Scale performance tradeoffs by adjusting K/V store size Also useful for I/O swells K/V Compute Job I/O 14

Core Forward/Reverse Cache BSP Code with forward/reverse dataflow During reverse stages, reuse intermediates from forward stages Requires in-memory storage to be practical Regular access pattern makes prefetch easy Proxy application for tradeoff studies Forward Reverse Wall Clock Time 15

[2/4] Muli-Simulation Glue Many users need to join one monolithic job to another Painful to add coupling some use I/O as the medium KV Stores provide a natural exchange point Analysis 1 K/V K/V Compute Job Analysis 2 I/O 16

Glue: Hardware Fuzzing SNL: Digital hardware fuzzing for reliability testing Flight Recorder Design: Operates in RF-hostile environment Will data corruption lock up the hardware? Parameter study: Many long-running VHDL/Verilog sims Previously: Discovered two design problems over multi-month sim Current: Capture waveforms, characterize, look for abnormalities Kelpie 17

[3/4] Communication without MPI Use KV Store for Inter-Process Communication Work at a higher level of abstraction May be of use in workloads with dynamic data flows Warning: MPI does IPC very, very well Early experiences porting Mantevo applications Positive: Feels more like symbolic language Negative: Performance loss due to indirection K/V K/V Direct Indirect Indirect with Peer Help 18

[4/4] Frameworks Many are working on computational frameworks for HPC ASCR FOX: Fault-Oblivious execution SNL: DAG Task Scheduling for Reliability KV Stores provide a useful starting point Example: Go/MR: GoLang implementation of MapReduce Utilizes Kelpie for storing objects 2D Keys useful for collecting related items K/V 19

Summary NoSQL efforts tested out a variety of interesting concepts Key/Value Stores are a valuable service for HPC Kelpie Currently tested at small scale on IB, BG/P, and Gemini Expanding our user examples and making more robust Planned external release in June 20