Introduction to urika. Multithreading. urika Appliance. SPARQL Database. Use Cases



Similar documents
Introduc)on to urika. Mul)threading. SPARQL Database. urika Appliance. XMT- 2 Programming. Use Cases

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Cray: Enabling Real-Time Discovery in Big Data

YarcData urika Technical White Paper

excellent graph matching capabilities with global graph analytic operations, via an interface that researchers can use to plug in their own

urika! Unlocking the Power of Big Data at PSC

The Fusion of Supercomputing and Big Data. Peter Ungaro President & CEO

Six Days in the Network Security Trenches at SC14. A Cray Graph Analytics Case Study

Technical White Paper. October Real-Time Discovery in Big Data Using the Urika-GD. Appliance G OVERN M ENT.

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Supercomputing and Big Data: Where are the Real Boundaries and Opportunities for Synergy?

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

High Performance Spatial Queries and Analytics for Spatial Big Data. Fusheng Wang. Department of Biomedical Informatics Emory University

Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Increasing Flash Throughput for Big Data Applications (Data Management Track)

Graph Database Performance: An Oracle Perspective

Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing. October 29th, 2015

Scaling Out With Apache Spark. DTL Meeting Slides based on

Cray Gemini Interconnect. Technical University of Munich Parallel Programming Class of SS14 Denys Sobchyshak

Cray DVS: Data Virtualization Service

LDIF - Linked Data Integration Framework

Graph Analytics in Big Data. John Feo Pacific Northwest National Laboratory

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I)

Comparison of Triple Stores

Manifest for Big Data Pig, Hive & Jaql

Advanced In-Database Analytics

Big Fast Data Hadoop acceleration with Flash. June 2013

Benchmarking Cassandra on Violin

Inge Os Sales Consulting Manager Oracle Norway

How To Write An Article On An Hp Appsystem For Spera Hana

Performance And Scalability In Oracle9i And SQL Server 2000

Cray XT3 Supercomputer Scalable by Design CRAY XT3 DATASHEET

bigdata Managing Scale in Ontological Systems

Infrastructure Matters: POWER8 vs. Xeon x86

Oracle Big Data SQL Technical Update

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints

Maximum performance, minimal risk for data warehousing

Minimize cost and risk for data warehousing

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

Big Data and Graph Analytics in a Health Care Setting

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Bringing Big Data Modelling into the Hands of Domain Experts

Scalable Cloud Computing Solutions for Next Generation Sequencing Data

The Flash-Transformed Financial Data Center. Jean S. Bozman Enterprise Solutions Manager, Enterprise Storage Solutions Corporation August 6, 2014

Hadoop MapReduce and Spark. Giorgio Pedrazzi, CINECA-SCAI School of Data Analytics and Visualisation Milan, 10/06/2015

BIG DATA What it is and how to use?

ABSTRACT 1. INTRODUCTION. Kamil Bajda-Pawlikowski

Deliverable Billion Triple dataset hosted on the LOD2 Knowledge Store Cluster. LOD2 Creating Knowledge out of Interlinked Data

Building Scalable Big Data Infrastructure Using Open Source Software. Sam William

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Big Graph Processing: Some Background

Unified Batch & Stream Processing Platform

How To Build A Cisco Ukcsob420 M3 Blade Server

> Semantic Web Use Cases and Case Studies

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

Big Data With Hadoop

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

Platfora Big Data Analytics

Performance and Scalability Overview

The IBM Cognos Platform for Enterprise Business Intelligence

Non Volatile Memory Invades the Memory Bus: Performance and Versatility is the Result

Unified Computing Systems

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging

HadoopRDF : A Scalable RDF Data Analysis System

The Hardware Dilemma. Stephanie Best, SGI Director Big Data Marketing Ray Morcos, SGI Big Data Engineering

Big Data and the Data Lake. February 2015

TUT NoSQL Seminar (Oracle) Big Data

Xeon+FPGA Platform for the Data Center

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

4 th Workshop on Big Data Benchmarking

CMSC 611: Advanced Computer Architecture

Big Data Functionality for Oracle 11 / 12 Using High Density Computing and Memory Centric DataBase (MCDB) Frequently Asked Questions

SGI High Performance Computing

Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association

Accelerating and Simplifying Apache

Integrating Open Sources and Relational Data with SPARQL

Apache Hama Design Document v0.6

Big Data Are You Ready? Thomas Kyte

Overview: X5 Generation Database Machines

Lecture 2 Parallel Programming Platforms

From Spark to Ignition:

The deployment of OHMS TM. in private cloud

Bigdata : Enabling the Semantic Web at Web Scale

White Paper. Version 1.2 May 2015 RAID Incorporated

An Overview of SAP BW Powered by HANA. Al Weedman

The Fusion of Supercomputing and Big Data: The Role of Global Memory Architectures in Future Large Scale Data Analytics

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

FPGA-based Multithreading for In-Memory Hash Joins

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

Apigee Insights Increase marketing effectiveness and customer satisfaction with API-driven adaptive apps

Using Data Mining and Machine Learning in Retail

ANALYTICS IN BIG DATA ERA

Turbo-Charging Open Source Hadoop for Faster, more Meaningful Insights

Transcription:

1

Introduction to urika Multithreading urika Appliance SPARQL Database Use Cases 2

Gain business insight by discovering unknown relationships in big data Graph analytics warehouse supports ad hoc queries, pattern-based searches, inferencing and deduction on dynamic data sets Hardware and software dedicated to scalable graph analytics Purpose built to solve big data graph problems with large shared-memory and massive multi-threading Ease adoption and training with industry standards support Data center ready appliance with open-source interfaces enables re-use of in-house skill sets, standard languages and simplified integration 3

Social Networking Intelligence/Security Telecom/Mobile Life Sciences/Biology Healthcare/Medicine Finance Internet/WWW Supply Chain 4

6 Graphs are hard to Partition Graphs are not Predictable Graphs are highly Dynamic? High cost to follow relationships that span Cluster Nodes High cost to follow multiple competing paths which cannot be prefetched/cached High cost to load multiple, constantly changing datasets into in-memory graph models Network is 100 times SLOWER than Memory* Memory is 100 times SLOWER than Processor* Storage I/O is 1000 times SLOWER than Memory I/O* *Source: Hennessy, J. and Patterson, D., Computer Architecture: A Quantitative Approach, 2012 edition 6

7 Graphs are hard to Partition Large Shared Memory Up to 512 TB Graphs are not Predictable Massively Multi-threaded 128 threads/processor Graphs are highly Dynamic Highly Scalable I/O Up to 350 TB/hr Real-time, Interactive Analytics on Big Data Graph Problems 7

Cray Hardware Engine Originally designed for deep analysis of large datasets Very large scalable shared memory Architecture can support 512TB shared memory Typical systems are 2 TB to 32 TB Multithreading Unique highly multithreaded architecture 128 hardware threads per processor Extreme parallelism, hides memory latency Multithreaded Graph Database Highly parallel in-memory RDF quad store High performance inference engine High performance parallel I/O Industry Standard Front End Based on Jena open source semantic DB All standard Linux infrastructure and languages Lustre parallel file system 8 8

Data Warehouse Hadoop Other Big Data Appliances Existing Analytic Environment(s) Export Analytic/ Relationship Results Import Graph Datasets User/App Visualization 9

1 0

MTA-1 1998 Gallium arsenide: Proof of concept First production implementation of latency-tolerant multithreading MTA-2 2002 Transition the microprocessor to CMOS Invention of scaling memory in a multithreading system XMT-1 2008 Integrated MTA and main Cray product line XT First hybrid system to support X86 and multithreading CPUs XMT-2 2011 Made system ready for enterprise production Largest shared memory machine in the world YarcData urika 2012 Optimized for Scalable Graph Analytics 1 1

Relative latency to memory continues to increase Vector processors amortize memory latency Cache-based microprocessors reduce memory latency Multithreaded processors tolerate memory latency Multithreading is most effective when: Parallelism is abundant Data locality is scarce Large graph problems perform well on this architecture Semantic databases Social network analysis 1 2

Many threads per processor core; small thread state Thread-level context switch at every instruction cycle registers ALU stream conventional processor program counter multithreaded processor Slide 13 1 3

Conventional processor Multithreaded processor memory memory When one or a few threads stall, memory/network bandwidth become idle Although some threads stall, others keep issuing local/remote memory requests, keeping most precious resources busy network network Slide 1 4

bandwidth Caches Reduce latency by storing some data in fast, nearby memory Vectors Amortize latency by fetching N words at a time Parallelism Tolerate latency by switching tasks Multithreading tries to balance Little s Law: concurrency = bandwidth * latency latency Slide 15 1 5

Memory addresses are hashed, on cache line (64 byte) boundaries Full/empty bits on all data words sync Data type and HLL functions (readff, readfe, writeef) Loads/stores succeed only when full bit is set/unset Loads/stores leave full bit unset/set future Loads/stores succeed only when full bit is set Lazy evaluation and barriers 63 full-empty trap 2 trap 1 forward 0 tag bits data values Slide 16 1 6

To the programmer, a multiple processor XMT looks like a single processor, except that the number of threads is increased. Bokhari/Sauer 2003 1 7

4 DIMM Slots Redundant VRMs L0 RAS Computer CRAY Seastar2 CRAY Seastar2 CRAY Seastar2 CRAY Seastar2 1 8

Cray urika High Density Node 4 Threadstorm processors 4 memory controllers 4 SeaStar2 NICs Up to 64 GB per node Coherent HyperTransport (2 links between sockets HyperTransport 8 DIMMs Registered DDR2 > 17 GB/sec per node Threadstorm 4 processors Cray SeaStar2 Interconnect 1 9

Compute MTK Service & IO Linux Service Partition Linux OS Specialized Linux nodes Login PEs IO Server PEs Network Server PEs FS Metadata Server PEs System Server PEs Compute Partition MTK (BSD) PCI-E PCI-E 10 GigE Fibre Channel Network RAID Controllers 2 0

Uses the same cabinets, boards, scalable interconnect, I/O and storage infrastructure, user environment, and administrative tools just changes the processor Cabinet 24 blades per cabinet Vertical airflow with optional liquid assist Compute blades 4 Threadstorm processors 16-64 GB per processor Cray XT service and I/O subsystem PCIe connections to storage and networks Scalable Lustre global file system Cray XT high-speed 3D torus network Cray XT power and RAS systems Linux based user environment 2 1

2 2

23 RDF SPARQL Java Linux Apache Tomcat/Jena Graph Analytic Tools Management, Security, Data Pipeline Graph Analytic Database RDF datastore and SPARQL engine Gadgets /Mashups Graph Analytic Platform Shared-memory, Multi-threaded, Scalable I/O graph-optimized hardware Data Center sees another Linux server Applications see industry standard interface RDF, SPARQL, Java, Gadgets Reuse Existing Skillsets Java, OSGI, App Server, SOA, ESB, Web toolkit Standard languages All applications and artifacts built on urika can be run on other platforms 2 3

SELECT?p?x WHERE { (?x type person) (?p sells DVD ) (?x shops-at?p) } API SPARQL Engine database BGP SPARQL Ingest Application Jena/Fuseki HTTP Interface Send query Parse, interpret query Set up query engine ORDER DISTINCT OPTIONAL UNION interface to Web Receive results Translate, send results FILTER generate low-level query urika service nodes urika compute nodes 24 2 4

2 5

RDF triples databases are inherently graphical Some researchers call semantic databases semantic graph databases Vancouver Canada is-type country object Edmonton has-population 730,000 North America 2 6

Resource Description Framework (RDF) Designed to enable semantic web searching and integration of disparate data sources W3C standard formats Every datum represented as subject/predicate/object Ideally with each of those expressed with a URI Standard ontologies in some domains e.g., Open Biological and Biomedical Ontologies (OBO) Examples: subject predicate object <ncbitax:ncbitaxon_840261> rdf:type owl:class <ncbitax:ncbitaxon_195644> rdfs:subclassof <ncbitax:ncbitaxon_185881> <ncbitax:ncbitaxon_816681> rdfs:label "Characiformes sp. BOLD:AAG5151"@en 2 7

SPARQL Protocol and RDF Query Language (SPARQL) Enables matching of graph patterns in the semantic DB Reminiscent of SQL # Lehigh University BenchMark (LUBM) Query 9 PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX ub: <http://www.lehigh.edu/~zhp2/2004/0401/univ-bench.owl#> SELECT?X,?Y,?Z WHERE {?X rdf:type ub:student.?y rdf:type ub:faculty.?z rdf:type ub:course.?x ub:advisor?y.?y ub:teacherof?z.?x ub:takescourse?z} PREFIX == shorthand for a URI variables to be returned from the query find sets of (X, Y, Z) with a subject X of type Student, a subject Y of type Faculty, and a subject Z of type Course, where X is an advisee of Y, Y teaches course Z, and X takes course Z 2 8

2 9

Discover new drugs and treatments The Challenge Multiple massive datasets describing biological network graphs in cancer cells from published literature and experimental data, constantly updated Un-partitionable, densely and irregularly connected graphs Multiple researchers concurrently searching for relationships not found in published literature urika Solution urika holds un-partitioned genomic network graph in memory Contrast experimental models and theories with published results to discover previously unknown relationships Interactive, real time access by multiple researchers Business Value Identify new pathways in cell models to refine cancer treatments Confirmation of elevated VEGF levels by tissue microarray: 3 0

Biotech researchers have mapped most of the basic building blocks of genes but the interactions are not understood Parts List for the human body, but where s the Haynes manual Biotech researchers are constantly looking for new and non-obvious interactions between genes and protein families. Medical research churns out massive quantities of literature, but most researchers don t have time to read even their own specialty publications Over 10,000 genomics publications per month currently 3 1

The Cancer Genome Atlas (TCGA) data combined with literature relationships from Medline Protein domain interactions are extracted from TCGA using Random Forest classification A Normalized Medline Distance (NMD) between proteins is calculated from Medline titles and abstracts Protein data from Uniprot and Pfam are also included 1.2 billion entries that generate 6B triples New data sets constantly being addedmutation data, patient data, methylation data, statistical analysis results and more Current database technology does not have the performance for useful full-scale queries across this data Mutation Methylation Genes (TCGA) Protein Data (UniProt) GRAPH Query Literature (Medline) Protein Families (PFAM) Correlations (Statistical) Patients (Health Records) 3 2

Discover similar Patients to optimize treatment The Challenge Longitudinal, historical data spanning all events, symptoms, diagnoses, diseases, treatments, prescriptions, etc of 10M patients including genetics and family history Ad-hoc, constantly changing definition of similarity based on thousands of parameters Interactive, real-time response during consultation urika Solution urika holds entire relationship graph in memory Identify similar patients based on ad-hoc physician specified patterns Interactive, real-time access by entire physician community Business Value Consistent selection of the most effective treatment for each patient by each doctor every time Blood pressure Prior myocardial infarction Hypertension Body mass index HDL Anti-hypertension meds Patient: Jean Generic 3 3

3 4