adaptive loading adaptive indexing dbtouch 3 Ideas for Big Data Exploration



Similar documents
NoDB: Efficient Query Execution on Raw Data Files

class 1 welcome to CS265! BIG DATA SYSTEMS prof. Stratos Idreos

Overview of Data Exploration Techniques

PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor

SWISSBOX REVISITING THE DATA PROCESSING SOFTWARE STACK

Report Data Management in the Cloud: Limitations and Opportunities

DiNoDB: Efficient Large-Scale Raw Data Analytics

NUMA obliviousness through memory mapping

iservdb The database closest to you IDEAS Institute

Telemetry Database Query Performance Review

Arrays in database systems, the next frontier?

Integrating Apache Spark with an Enterprise Data Warehouse

How To Write A Database Program

SQL Server Performance Tuning and Optimization

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

Hadoop s Entry into the Traditional Analytical DBMS Market. Daniel Abadi Yale University August 3 rd, 2010

Business Analytics in (a) Blink

Master s Thesis Nr. 14

Data Mining and Database Systems: Where is the Intersection?

Microsoft SQL Server: MS Performance Tuning and Optimization Digital

Performance Tuning and Optimizing SQL Databases 2016

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

Overview of Scalable Distributed Database System SD-SQL Server

In-Memory Data Management for Enterprise Applications

MAGENTO HOSTING Progressive Server Performance Improvements

Net Developer Role Description Responsibilities Qualifications

Introduction. Part I: Finding Bottlenecks when Something s Wrong. Chapter 1: Performance Tuning 3

Overview. Introduction to Database Systems. Motivation... Motivation: how do we store lots of data?

Technical Challenges for Big Health Care Data. Donald Kossmann Systems Group Department of Computer Science ETH Zurich

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

Monitoring and Diagnosing Oracle RAC Performance with Oracle Enterprise Manager

PERFORMANCE EVALUATION OF DATABASE MANAGEMENT SYSTEMS BY THE ANALYSIS OF DBMS TIME AND CAPACITY

MySQL Enterprise Backup

There and Back Again As Quick As a Flash

In-Memory Databases Algorithms and Data Structures on Modern Hardware. Martin Faust David Schwalb Jens Krüger Jürgen Müller

OVERVIEW OF DATA EXPLORATION TECHNIQUES. Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri SIGMOD 2015, Melbourne

Database Internals (Overview)

CSE 544 Principles of Database Management Systems. Magdalena Balazinska (magda) Fall 2007 Lecture 1 - Class Introduction

IN-MEMORY DATABASE SYSTEMS. Prof. Dr. Uta Störl Big Data Technologies: In-Memory DBMS - SoSe

DataCell: Building a Data Stream Engine on top of a Relational Database Kernel

Load Distribution in Large Scale Network Monitoring Infrastructures

DBMS Performance Monitoring

In-Memory Performance for Big Data

Monitoring and Diagnosing Oracle RAC Performance with Oracle Enterprise Manager. Kai Yu, Orlando Gallegos Dell Oracle Solutions Engineering

The Uncracked Pieces in Database Cracking

Vertica Live Aggregate Projections

QuickDB Yet YetAnother Database Management System?

How To Write A Store On A Microsoft Database On A Flash (Orchestra) On A Linux Computer (Ortho) On An Ipro (Orroster) On Your Computer Or Your Computer (For Ahem) On The

Designing Access Methods: The RUM Conjecture

A Generic Auto-Provisioning Framework for Cloud Databases

High-Volume Data Warehousing in Centerprise. Product Datasheet

Simple Solutions for Compressed Execution in Vectorized Database System

Benchmarking filesystems and PostgreSQL shared buffers

Big Data Analytics Nokia

Big Data & Cloud Computing. Faysal Shaarani

Using Toaster in a Production Environment

Performance And Scalability In Oracle9i And SQL Server 2000

Business Intelligence in SharePoint 2013

In Memory Accelerator for MongoDB

Do Relational Databases Belong in the Cloud? Michael Stiefel

High Availability for Database Systems in Cloud Computing Environments. Ashraf Aboulnaga University of Waterloo

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

In-Memory Columnar Databases HyPer. Arto Kärki University of Helsinki

SLIDE 1 Previous Next Exit

PRODUCT OVERVIEW SUITE DEALS. Combine our award-winning products for complete performance monitoring and optimization, and cost effective solutions.

Portable Scale-Out Benchmarks for MySQL. MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc.

Future Prospects of Scalable Cloud Computing

Architecture Sensitive Database Design: Examples from the Columbia Group

An Approach to Implement Map Reduce with NoSQL Databases

PUBLIC Performance Optimization Guide

The MonetDB Architecture. Martin Kersten CWI Amsterdam. M.Kersten

SQL Databases Course. by Applied Technology Research Center. This course provides training for MySQL, Oracle, SQL Server and PostgreSQL databases.

SSIS Scaling and Performance

A Comparison of Approaches to Large-Scale Data Analysis

Various Load Testing Tools

Compared to MySQL database, Oracle has the following advantages:

Fallacies of the Cost Based Optimizer

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

MySQL performance in a cloud. Mark Callaghan

Case Study - I. Industry: Social Networking Website Technology : J2EE AJAX, Spring, MySQL, Weblogic, Windows Server 2008.

Real Life Performance of In-Memory Database Systems for BI

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Expedia

Transcription:

adaptive loading adaptive indexing dbtouch Ideas for Big Data Exploration Stratos Idreos CWI, INS-, Amsterdam

data is everywhere

daily data years

daily data years Eric Schmidt: Every two days we create as much data as much we did from dawn of humanity to 00

- exabytes daily data years 0 Eric Schmidt: Every two days we create as much data as much we did from dawn of humanity to 00

not always sure what we are looking for (until we find it)

not always sure what we are looking for (until we find it) data exploration

database systems great... declarative processing, back-end to numerous apps

database systems great... declarative processing, back-end to numerous apps but databases are too heavy and blind!

database systems great... declarative processing, back-end to numerous apps but databases are too heavy and blind! timeline

database systems great... declarative processing, back-end to numerous apps but databases are too heavy and blind! load timeline

database systems great... declarative processing, back-end to numerous apps but databases are too heavy and blind! load index timeline

database systems great... declarative processing, back-end to numerous apps but databases are too heavy and blind! load index query timeline

expert users - idle time - workload knowledge database systems great... declarative processing, back-end to numerous apps but databases are too heavy and blind! load index query timeline

systems tailored for data exploration

no workload knowledge systems tailored for data exploration

no workload knowledge no installation steps systems tailored for data exploration

no workload knowledge no installation steps quick response times systems tailored for data exploration

no workload knowledge no installation steps quick response times minimize data-to-query time systems tailored for data exploration

ideas adaptive indexing ACM SIGMOD Jim Gray Diss. Award - Cor Baayen Award years, papers

ideas adaptive indexing ACM SIGMOD Jim Gray Diss. Award - Cor Baayen Award years, papers adaptive loading In use in Hadapt and Tableau years, papers

ideas adaptive indexing ACM SIGMOD Jim Gray Diss. Award - Cor Baayen Award years, papers adaptive loading In use in Hadapt and Tableau years, papers dbtouch VENI 0 year, paper

ideas adaptive indexing ACM SIGMOD Jim Gray Diss. Award - Cor Baayen Award years, papers adaptive loading In use in Hadapt and Tableau years, papers dbtouch VENI 0 year, paper Martin Kersten, Stefan Manegold, Felix Halim, Panagiotis Karras, Roland Yap, Goetz Graefe, Harumi Kuno, Eleni Petraki, Anastasia Ailamaki, Ioannis Alagiannis, Renata Borovica, Miguel Branco, Ryan Johnson, Erietta Liarou

indexing load index query tune= create proper indexes offline performance 0-00X

indexing load index query tune= create proper indexes offline performance 0-00X but it depends on workload! which indices to build? on which data parts? and when to build them?

load index query

load index query timeline

load index query sample workload timeline

load index query sample workload analyze timeline

load index query sample workload analyze create indices timeline

load index query sample workload analyze create indices query timeline

load index query sample workload analyze create indices query timeline complex and time consuming process

load index query human administrators + auto-tuning tools sample workload analyze create indices query timeline complex and time consuming process

what can go wrong?

what can go wrong? not enough idle time to finish proper tuning

what can go wrong? not enough idle time to finish proper tuning by the time we finish tuning, the workload changes

database cracking

database cracking remove all tuning steps and need for human input incremental, adaptive, partial indexing

initialization indexing database cracking remove all tuning steps and need for human input incremental, adaptive, partial indexing

initialization query indexing database cracking remove all tuning steps and need for human input incremental, adaptive, partial indexing

initialization query indexing database cracking remove all tuning steps and need for human input incremental, adaptive, partial indexing

initialization query indexing database cracking remove all tuning steps and need for human input incremental, adaptive, partial indexing every query is treated as an advice on how data should be stored

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Result tuples Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Result tuples Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example gain knowledge on how data is organized Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Result tuples Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Result tuples Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example the more we crack, the more we learn Column A Cracker column of A Cracker column of A Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Q (copy) Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Result tuples Dynamically/on-the-fly within the select-operator

Database Cracking CIDR 00 cracking example the more we crack, the more we learn Q: select * from R where R.A > 0 and R.A < Q: select * from R where R.A > and R.A <= Column A Q (copy) Cracker column of A Piece : A <= 0 Q (in place) Piece : 0 < A < Piece : <= A Cracker column of A every query is treated as an advice on how data should be stored Piece : A <= Piece : < A <= 0 Piece : 0 < A < Piece : <= A <= Piece 5: < A Result tuples Dynamically/on-the-fly within the select-operator

Sideways Cracking, SIGMOD 0 TPC-H 0000 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) 000 0 00 50 00 0 TPC-H Query 5 50 00 0 0 5 0 5 0 5 0 Query sequence

Sideways Cracking, SIGMOD 0 TPC-H 0000 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) 000 0 00 50 00 50 0 TPC-H Query 5 Normal MonetDB selection cracking 00 0 0 5 0 5 0 5 0 Query sequence

Sideways Cracking, SIGMOD 0 TPC-H 0000 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) 000 0 00 50 00 50 0 TPC-H Query 5 Normal MonetDB selection cracking Presorted MonetDB Preparation cost - minutes 00 0 0 5 0 5 0 5 0 Query sequence

Sideways Cracking, SIGMOD 0 TPC-H 0000 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) Presorted MonetDB Preparation cost - minutes 000 0 00 50 00 50 00 0 0 TPC-H Query 5 0 5 0 5 0 5 0 Query sequence Normal MonetDB selection cracking MonetDB with sideways cracking

Sideways Cracking, SIGMOD 0 TPC-H 0000 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) Presorted MonetDB Preparation cost - minutes 000 0 00 50 00 50 00 0 0 TPC-H Query 5 0 5 0 5 0 5 0 Query sequence Normal MonetDB selection cracking MonetDB with sideways cracking

Sideways Cracking, SIGMOD 0 TPC-H 0000 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) Presorted MonetDB Preparation cost - minutes 000 0 00 50 00 50 00 0 0 TPC-H Query 5 0 5 0 5 0 5 0 Query sequence Normal MonetDB selection cracking MonetDB with sideways cracking

Sideways Cracking, SIGMOD 0 TPC-H 0000 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) Presorted MonetDB Preparation cost - minutes 000 0 00 50 00 50 00 0 0 TPC-H Query 5 0 5 0 5 0 5 0 Query sequence Normal MonetDB selection cracking MonetDB with sideways cracking

Stochastic Cracking, PVLDB skyserver experiments cracking answers 0.000 queries while full indexing is still half way creating the index

cracking databases updates (SIGMOD0) multiple columns (SIGMOD0) storage restrictions (SIGMOD0) benchmarking (TPCTC0) algorithms (PVLDB) concurrency control (PVLDB) robustness (PVLDB)

cracking databases updates (SIGMOD0) multiple columns (SIGMOD0) storage restrictions (SIGMOD0) benchmarking (TPCTC0) joins disk based multi-dimensional holistic indexing optimization algorithms (PVLDB) row-stores multi-cores concurrency control (PVLDB) robustness (PVLDB) aggregations vectorwised

loading load index query

loading load index query copy data inside the database database now has full control

loading load index query copy data inside the database database now has full control slow process...not all data might be needed all the time

database vs. unix tools file, attributes, billion tuples DB Awk single query cost (secs) 0 550,00,50,00

database vs. unix tools file, attributes, billion tuples DB Awk single query cost (secs) 0 550,00,50,00 break down db cost Loading Query Processing % %

database vs. unix tools file, attributes, billion tuples DB Awk single query cost (secs) 0 550,00,50,00 break down db cost Loading Query Processing % loading is a major bottleneck %

database vs. unix tools file, attributes, billion tuples DB Awk single query cost (secs) 0 550,00,50,00 break down db cost Loading Query Processing % loading is a major bottleneck %... but writing/maintaining scripts is hard too

adaptive loading load/touch only what is needed and only when it is needed

but raw data access is expensive tokenizing - parsing - no indexing - no statistics

but raw data access is expensive tokenizing - parsing - no indexing - no statistics challenge: fast raw data access

query plan

query plan scan

query plan scan db

query plan scan db

query plan scan zero loading cost

query plan scan files access raw data adaptively on-the-fly zero loading cost

query plan scan files cache access raw data adaptively on-the-fly zero loading cost

selective parsing - file indexing file splitting - online statistics query plan scan files cache access raw data adaptively on-the-fly zero loading cost

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C 0

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C 0

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C 0

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C 0

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C 0

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C 0

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C 0

NoDB SIGMOD 0 Execution Time (sec) 00 00 000 00 00 00 ~500 ~500 Q0 Q Q Q Q Q5 Q Q Q Q Q0 Q Q Q Q Q5 Q Q Q Q Load 00 0 MySQL CSV Engine MySQL DBMS X DBMS X w/ external files PostgreSQL PostgresRaw PM + C reducing data-to-query time 0

querying load index query

querying load index query declarative SQL interface

querying load index query declarative SQL interface correct and complete answers

querying load index query declarative SQL interface correct and complete answers

querying complex and slow - not fit for exploration load index query declarative SQL interface correct and complete answers

dbtouch dbtouch CIDR 0

dbtouch dbtouch CIDR 0 data

dbtouch dbtouch CIDR 0 data query

dbtouch dbtouch CIDR 0 no expert knowledge - no set-up costs interactive mode fit for exploration data query

dbtouch CIDR 0 challenge design database systems tailored for touch exploration

dbtouch CIDR 0 challenge design database systems tailored for touch exploration user drives query processing actions

adaptive loading adaptive indexing dbtouch Ideas for Big Data Exploration Stratos Idreos CWI, INS-, Amsterdam

adaptive loading adaptive indexing dbtouch Ideas for Big Data Exploration Stratos Idreos CWI, INS-, Amsterdam systems tailored for exploration

more in INS- data vaults sciborg cyclotron sciql holistic indexing datacell charles hardware-conscious

more in INS- data vaults sciborg cyclotron sciql holistic indexing Thank you! datacell charles hardware-conscious