World s fastest database and big data analytics platform



Similar documents
GPU System Architecture. Alan Gray EPCC The University of Edinburgh

HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK

NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist

STORAGE HIGH SPEED INTERCONNECTS HIGH PERFORMANCE COMPUTING VISUALISATION GPU COMPUTING

Parallel Computing: Strategies and Implications. Dori Exterman CTO IncrediBuild.

Data Centric Interactive Visualization of Very Large Data

Graphics Cards and Graphics Processing Units. Ben Johnstone Russ Martin November 15, 2011

Accelerating Real Time Big Data Applications. PRESENTATION TITLE GOES HERE Bob Hansen

CLOUD GAMING WITH NVIDIA GRID TECHNOLOGIES Franck DIARD, Ph.D., SW Chief Software Architect GDC 2014

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Bringing Big Data Modelling into the Hands of Domain Experts

Oracle Database In-Memory The Next Big Thing

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

Overview: X5 Generation Database Machines

SGI High Performance Computing

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

The Flash-Transformed Financial Data Center. Jean S. Bozman Enterprise Solutions Manager, Enterprise Storage Solutions Corporation August 6, 2014

GPU File System Encryption Kartik Kulkarni and Eugene Linkov

Performance Beyond PCI Express: Moving Storage to The Memory Bus A Technical Whitepaper

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

In-Memory Databases Algorithms and Data Structures on Modern Hardware. Martin Faust David Schwalb Jens Krüger Jürgen Müller

Benchmarking Cassandra on Violin

Computer Graphics Hardware An Overview

Optimizing GPU-based application performance for the HP for the HP ProLiant SL390s G7 server

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

GPU Hardware and Programming Models. Jeremy Appleyard, September 2015

Scaling Your Data to the Cloud

Optimizing a 3D-FWT code in a cluster of CPUs+GPUs

Next Generation GPU Architecture Code-named Fermi

GPUs for Scientific Computing

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU

MapGraph. A High Level API for Fast Development of High Performance Graphic Analytics on GPUs.

Jeff Adie Principal Systems Engineer.

5x in 5 hours Porting SEISMIC_CPML using the PGI Accelerator Model

The GPU Accelerated Data Center. Marc Hamilton, August 27, 2015

Home Exam 3: Distributed Video Encoding using Dolphin PCI Express Networks. October 20 th 2015

Accelerating CFD using OpenFOAM with GPUs

Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age

OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC

NVIDIA GeForce GTX 580 GPU Datasheet

E6895 Advanced Big Data Analytics Lecture 14:! NVIDIA GPU Examples and GPU on ios devices

Parallel Programming Survey

The Design and Implement of Ultra-scale Data Parallel. In-situ Visualization System

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

Understanding the Value of In-Memory in the IT Landscape

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

Can Flash help you ride the Big Data Wave? Steve Fingerhut Vice President, Marketing Enterprise Storage Solutions Corporation

Towards Fast SQL Query Processing in DB2 BLU Using GPUs A Technology Demonstration. Sina Meraji sinamera@ca.ibm.com

Steven C.H. Hoi School of Information Systems Singapore Management University

SAP HANA - Main Memory Technology: A Challenge for Development of Business Applications. Jürgen Primsch, SAP AG July 2011

SQL Server 2012 Performance White Paper

Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga

Oracle Big Data SQL Technical Update

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

Colgate-Palmolive selects SAP HANA to improve the speed of business analytics with IBM and SAP

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

GPGPU for Real-Time Data Analytics: Introduction. Nanyang Technological University, Singapore 2

Introduction to GP-GPUs. Advanced Computer Architectures, Cristina Silvano, Politecnico di Milano 1

Guided Performance Analysis with the NVIDIA Visual Profiler

Big Data and Analytics: A Conceptual Overview. Mike Park Erik Hoel

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

FPGA-based Multithreading for In-Memory Hash Joins

In-Memory Data Management for Enterprise Applications

Introduction to GPGPU. Tiziano Diamanti

Introducing PgOpenCL A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Final Project Report. Trading Platform Server

Turbomachinery CFD on many-core platforms experiences and strategies

Express5800 Scalable Enterprise Server Reference Architecture. For NEC PCIe SSD Appliance for Microsoft SQL Server

2009 Oracle Corporation 1

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router

Scaling Analysis Services in the Cloud

Architectures for Big Data Analytics A database perspective

HPC Wales Skills Academy Course Catalogue 2015

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

Next Generation Operating Systems

The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System

TUT NoSQL Seminar (Oracle) Big Data

NVIDIA GRID OVERVIEW SERVER POWERED BY NVIDIA GRID. WHY GPUs FOR VIRTUAL DESKTOPS AND APPLICATIONS? WHAT IS A VIRTUAL DESKTOP?

Binary search tree with SIMD bandwidth optimization using SSE

Drivers to support the growing business data demand for Performance Management solutions and BI Analytics

The Flash Transformed Data Center & the Unlimited Future of Flash John Scaramuzzo Sr. Vice President & General Manager, Enterprise Storage Solutions

Big Data Challenges in Bioinformatics

Cloud Data Center Acceleration 2015

High Performance GPGPU Computer for Embedded Systems

Hadoop & SAS Data Loader for Hadoop

High Performance Computing and Big Data: The coming wave.

Four Keys to Successful Multicore Optimization for Machine Vision. White Paper

When to Use Oracle Database In-Memory

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Exadata Database Machine

Parallel Algorithm Engineering

Transcription:

World s fastest database and big data analytics platform www.map-d.com @datarefined 33 Concord Ave, Suite 6, Cambridge, MA 238 Todd Mostak Tom Graham Ι Ι todd@map-d.com tom@map-d.com Ι Ι + 67 83 76 + 67 459 796

Win a Free All-Access Pass to GTC 24 Give us your feedback on today s webinar. Fill out the survey for a chance to win a free pass to GTC 24! https://www.surveymonkey.com/s/cqszkj6

World s fastest database and big data analytics platform www.map-d.com @datarefined 33 Concord Ave, Suite 6, Cambridge, MA 238 Todd Mostak Tom Graham Ι Ι todd@map-d.com tom@map-d.com Ι Ι + 67 83 76 + 67 459 796

map-d? Do? Demo? super-fast database built into GPU memory world s fastest real-time big data analytics interactive visualization twitter analytics platform billion+ tweets milliseconds

Core Innovation SQL-enabled column store database built into the memory architecture on GPUs and CPUs Code developed from scratch to take advantage of: Memory bandwidth Massive parallelism across multiple GPUs Systems with both GPU and CPU memory Near-linear scaling to clusters of GPU nodes Data stored as a high-level cache in GPU or CPU memory or cycled through other data stores Standard SQL operations to import and query any type of dataset s of database scans per second running from a single node or cluster with billions of records

Demo: billion+ Tweetmap The Map-D Tweetmap demo at SC3 ran on 8 NVIDIA K4 GPUs (total 96GB of GPU memory) in a single server hosted locally Map-D scanned the entire database of +billion tweets with full text and metadata in 5 milliseconds At the same time, Map-D also rendered HD data visualizations and sent them to Tweetmap s interactive analytics GUI Live demo: www.mapd.csail.mit.edu SC3 video and write up: mapd.it/sc3pres mapd.it/sc3article

billion+ Tweetmap Search tweet text Search by user Live streaming tweets Stats + census data Animate over time Identify trends Heatmap Chloropleth Point map Base maps Share maps

What is Map-D? Database coded into hardware s onboard memory Massive parallelism across multiple GPUs Data streams live on to system GPUs, CPUs, Phi, mobile and custom cards Scales linearly across clusters Any size data set (mobile to TB+) What can Map-D do? Millisecond latency, no need to pre-compute Interactive analysis on any size dataset Animated HD visuals rendered on the fly Multiple data layers Resource intense analytics: ML, network, trending Scientific process at speed of thought Real-time monitoring and visuals of complex systems Socialization and collaboration

Ultra-fast database and analytics engine Runs 7-x faster than other in-memory databases and analytics platforms and getting faster Room for database optimization Growth in hardware speed, parallelism and memory size/bandwidth (GPUs, CPUs, mobile, custom cards)

Application All industry or data-driven computational processes that need instant access to data, real-time analytics and interactive data visualizations GPU-fueled computation engine powers real-time, complex and resource intensive analytical operations, e.g. machine learning, network graphing, trend detection, and semantic analysis Compute-intensive GIS, mapping and analytic operations across multiple data layers Real-time HD streaming of big data visualizations to a monitor or mobile device at 3fps straight from the GPU server using H.264 encoding Real-time analytics and data visualization on mobile devices using the latest multi-core NVIDIA Tegra chips Scalable millisecond-latency analytics across 2GB (mobile device) to TB+ (multi-node GPU cluster with next-gen flash extn.) datasets

Use Cases Real time monitoring of complex systems Paypal needs a real time visualization platform to monitor the 3 million plus data points they generate per second Interactive discovery in large datasets Novartis needs to interactively pattern match candidate molecules from large databases to speed up pharmaceutical R&D Interactive analytics Harvard Worldmap needs interactive mapping and analysis of multiple big data layers Big data visualizations for broadcast Twitter and its broadcast media partners (eg CNN, BBC) need geo-located social analytics to track trending issues in real time for investigative journalism and broadcast Major League Baseball needs an analytics and visualization platform for all pitches pitched in the MLB since 98 for both in-game broadcast and a public web interface NASA and Uni. Colorado need interactive maps of historical ice sheet movement in Antarctica to validate climate change models MIT and King Abdulaziz City for Science and Technology are building a multi-layer smart city big data analytics platform

Demo: Campaign Finance Map Interactively track how money changes political discourse over time Data: donations since 98, full-text political speeches, census Full TweetMap functionality + network analysis

development strategy

Build out the database NOW SQL column store standard operations One node with multiple GPUs Shared nothing architecture Supports WMS SOON Enterprise grade, supported database Linear multi-node scaling CUDA, OpenCL Shared scans to run multiple queries simultaneously 3fps H.264 rendering straight to mobile host Machine learning, trend detection, network graphing

Multiple solutions One architecture Project Partners

the technology

Shared Nothing Processing Multiple GPUs, with data partitioned between them Filter text ILIKE rain Filter text ILIKE rain Filter text ILIKE rain Node Node 2 Node 3

Map-D hardware architecture Large Data Big Data Single GPU Map-D code 2GB memory Map-D code integrated into GPU memory Single CPU 768GB memory Map-D code integrated into CPU memory NVIDIA TEGRA Mobile chip 4GB memory Map-D code integrated into chip memory 8 cards = 4U box 4 sockets = 4U box Small Data Mobile Map-D code runs on GPU + CPU memory 36U rack: ~4GB GPU ~2TB CPU Next Gen Flash 4TB GB/s Map-D running small datasets Native App Web-based service

NOW TB Compute power vs. Onboard memory across a single 4U node FUTURE Intel Xeon E5-267 Max 4 sockets Max power: 2-3 TFLOP Max mem: up to 3 TB Onboard memory FUTURE Intel Phi Knight s Landing ( 5) 4 per node Max power:? TFLOP Max mem:? GB Custom Cards 8- per node Max power: TFLOP Max mem: >GB FUTURE GB NOW NVIDIA Tesla K4 8 per node Max power: 34 TFLOP Max mem: 96 GB NVIDIA Maxwell ( 4) NVIDIA Volta ( 5) 8 per node Max power:? TFLOP Max mem:?? GB Compute power (TFLOP) 5GB NOW 5 3 NOW Intel Phi 72 2 per node Max power: 4.4 TFLOP Max mem: 32 GB NVIDIA Titan 8 per node Max power: 36 TFLOP Max mem: 48 GB

4TB FUTURE Memory Bandwidth: GB/TB per second across a single 4U node Onboard memory TB NOW SSD drive B/w: ~5Mb/s GB 3GB Next Gen Flash memory 4TB/node PCIe3 B/w: ~GB/s NOW Intel Xeon E5-267 4/node Chip: ~4GB/s Node: ~6GB/s Mem: ~3TB FUTURE Intel Phi KLand g 4/node Chip: ~GB/s Node: ~4GB/s Mem: ~.5TB NOW Phi 72 2/node Card: 352GB/s Node: 7GB/s Mem: 32GB NOW Titan GTX 8/node Card: 288GB/s Node: 2.3TB/s Mem: 48GB NOW Tesla K4 8/node Card: 288GB/s Node: 2.3TB/s Mem: 96GB FUTURE Volta 5 8/node Card: ~TB/s Node: ~8TB/s Mem: ~2GB FUTURE Custom /node Card: >4TB/s Node: >4TB/s Mem: ~GB GB Memory Bandwidth (/s) 7GB 2.5TB 8TB 4TB

Tweet Indexing on GPU Encode tweets using a dictionary Filter text ILIKE rain Word Filter SELECT tweetid FROM words WHERE id = 57663 Encoding Rain 57663 Rainbow 57664 Rainman 57665 Rainy 57666

Filtering in Parallel Column-oriented execution Avoids wasting memory bandwidth Plan: Filter SELECT tweet id FROM words WHEREid = 57663 Produce bitmap of tweets to read Read tweets, increment output bins in bitmap TweetId WordId TweetId Lat Lon 57663-4.5 23. 2 57664 2-4.7 77.4 2 27 3-37.4 48.2 3 884 4 28.4-44. Data Tables Reside in GPU Memory

Filtering in Parallel + GPU threads Running in warps Threads in same warp run the exact same instructions Need same amount of data to be efficient Warp Warp 2 Warp 3 TweetId WordId 57663 2 57664 2 27 3 884 Bitmap Tweet Tweet n

Filtering in Parallel + GPU threads Running in warps Threads in same warp run the exact same instructions Need same amount of data to be efficient Warp Warp 2 Warp 3 TweetId WordId 57663 2 57664 2 27 3 884 Bitmap Tweet Tweet n

Filtering in Parallel + GPU threads Running in warps Threads in same warp run the exact same instructions Need same amount of data to be efficient Warp Warp 2 Warp 3 TweetId WordId 57663 2 57664 2 27 3 884 Bitmap Tweet Tweet n

Filtering in Parallel + GPU threads Running in warps Threads in same warp run the exact same instructions Need same amount of data to be efficient TweetId WordId 57663 2 57664 2 27 3 884 Bitmap Tweet Tweet n

Filtering in Parallel + GPU threads Running in warps Threads in same warp run the exact same instructions Need same amount of data to be efficient Bitmap Tweet Tweet n Lat Lon -4.5 23. -4.7 77.4-37.4 48.2 28.4-44.

Filtering in Parallel + GPU threads Running in warps Threads in same warp run the exact same instructions Need same amount of data to be efficient Output buffer Warp Warp 2 Warp 3 Bitmap Lat Lon -4.5 23. -4.7 77.4-37.4 48.2 28.4-44.

Filtering in Parallel + GPU threads Running in warps Threads in same warp run the exact same instructions Need same amount of data to be efficient Output buffer Warp Warp 2 Warp 3 Bitmap Lat Lon -4.5 23. -4.7 77.4-37.4 48.2 28.4-44.

What does the market look like?

Hype around big data Many database players none deliver interactive big data Interactive Lag Time Wait Time Interactive Graphics pipeline GPU integration Hardware dev Fast flash Gigabyte Terabyte Petabyte

Todd Mostak Todd was a researcher at MIT CSAIL, where he worked in the database group. Seeking adventure upon finishing his undergrad, Todd moved to the Middle East, spending two years in Syria and Egypt teaching English, studying Arabic and eventually working as a translator for an Egyptian newspaper. He then completed his MA in Middle East Studies at Harvard University, afterwards taking a position as a Research Fellow at Harvard s Kennedy School of Government, focusing on the analysis of Islamism using forum and social media datasets. The impetus to build Map-D came from how slow he found conventional GIS tools to spatially aggregate and analyze large Twitter datasets. Tom Graham Recently a researcher at Harvard Law School, he focused on the intersection between social networks, big data and law reform. Tom researched privacy and the development of social science methodologies that allow legal scholars, governments and interest groups to interact with social network data. Tom lived in China for many years where he studied Chinese and dabbled in Chinese cooking and calligraphy. He is admitted to the New York Bar and was previously an attorney with Davis Polk in Hong Kong, where he focused on capital markets and M&A across Asia's emerging markets. He is also admitted to practice law in Australia. Tom holds a LLM from Harvard Law School and a LLB, BA and Dip. Languages from Melbourne University.

Thanks for watching www.map-d.com @datarefined info@map-d.com

Upcoming GTC Express Webinars January 3: Debugging CUDA Fortran using Allinea DDT February 5: OpenMM - Accelerating and Customizing Molecular Dynamic Simulations on GPUs February 25: Using GPUs to supercharge visualization and analysis of molecular dynamics simulations with VMD Register at www.gputechconf.com/gtcexpress

GTC 24 Registration is Open Hundreds of sessions in areas including Big Data Analytics & Data Algorithms Large Scale Data Analytics Video & Image Processing Signal & Audio Processing Computer Vision Machine Learning & AI Plus, Christopher White, Program Manager, DARPA Register with GM2EXP for 2% discount www.gputechconf.com

Win a Free All-Access Pass to GTC 24 Give us your feedback on today s webinar. Fill out the survey for a chance to win a free pass to GTC 24! https://www.surveymonkey.com/s/cqszkj6