Preparing Your Data For Cloud



Similar documents
SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford


Advanced Data Management Technologies

Cloud Scale Distributed Data Storage. Jürmo Mehine

NoSQL Databases. Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015

Structured Data Storage

How To Write A Database Program

NoSQL Databases. Nikos Parlavantzas

So What s the Big Deal?

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Slave. Master. Research Scholar, Bharathiar University

NoSQL Data Base Basics

Making Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY MANNING ANN KELLY. Shelter Island

Databases 2 (VU) ( )

NoSQL in der Cloud Why? Andreas Hartmann

INTRODUCTION TO CASSANDRA

Open Source Technologies on Microsoft Azure

Introduction to NOSQL

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

NoSQL Database Systems and their Security Challenges

A COMPARATIVE STUDY OF NOSQL DATA STORAGE MODELS FOR BIG DATA

NoSQL Databases. Polyglot Persistence

Lecture Data Warehouse Systems

NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre

Referential Integrity in Cloud NoSQL Databases

Step by Step: Big Data Technology. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

An Approach to Implement Map Reduce with NoSQL Databases

Big Data Management in the Clouds. Alexandru Costan IRISA / INSA Rennes (KerData team)

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

Big Data Technologies. Prof. Dr. Uta Störl Hochschule Darmstadt Fachbereich Informatik Sommersemester 2015

NewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing (OLTP) for Big Data Management

MongoDB in the NoSQL and SQL world. Horst Rechner Berlin,

Big Data Analytics. Rasoul Karimi

Can the Elephants Handle the NoSQL Onslaught?

2.1.5 Storing your application s structured data in a cloud database

The NoSQL Ecosystem, Relaxed Consistency, and Snoop Dogg. Adam Marcus MIT CSAIL

these three NoSQL databases because I wanted to see a the two different sides of the CAP

Mobile + HA + Cloud. Eugene Ciurana! ! pr3d4t0r - irc.freenode.net! ##java, ##security, #awk, #python, #bitcoin! irc.oftc.net: #tor, #tor-dev, #tails!

Making Sense of NoSQL Dan McCreary Wednesday, Nov. 13 th 2014

Challenges for Data Driven Systems

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

Survey of NoSQL Database Engines for Big Data

Benchmarking and Analysis of NoSQL Technologies

Not Relational Models For The Management of Large Amount of Astronomical Data. Bruno Martino (IASI/CNR), Memmo Federici (IAPS/INAF)

CloudDB: A Data Store for all Sizes in the Cloud

Cloud data store services and NoSQL databases. Ricardo Vilaça Universidade do Minho Portugal

Table of Contents. Développement logiciel pour le Cloud (TLC) Table of Contents. 5. NoSQL data models. Guillaume Pierre

Overview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB

Introduction to Apache Cassandra

3 Case Studies of NoSQL and Java Apps in the Real World

nosql and Non Relational Databases

Applications for Big Data Analytics

NoSQL. Thomas Neumann 1 / 22

Domain driven design, NoSQL and multi-model databases

Open source large scale distributed data management with Google s MapReduce and Bigtable

The Quest for Extreme Scalability

A Scalable Relational Database Model for Cloud Computing

References. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline

Introduction to Database Systems CSE 444

Cloud Database Emergence

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

NOSQL DATABASE SYSTEMS

Study concluded that success rate for penetration from outside threats higher in corporate data centers

Introduction to Big Data Training

Cloud & Big Data a perfect marriage? Patrick Valduriez

Microsoft Azure Data Technologies: An Overview

NOSQL DATABASES AND CASSANDRA

Integrating Big Data into the Computing Curricula

Report Data Management in the Cloud: Limitations and Opportunities

Understanding NoSQL Technologies on Windows Azure

Databases in the Cloud

A survey of big data architectures for handling massive data

CISC 432/CMPE 432/CISC 832 Advanced Database Systems

Cloud Data Management Big Data

DATA DE-DUPLICATION IN. NoSQL DATABASES. A Thesis Submitted to the College of. Graduate Studies and Research

Introduction to NoSQL Databases. Tore Risch Information Technology Uppsala University

Introduction to Polyglot Persistence. Antonios Giannopoulos Database Administrator at ObjectRocket by Rackspace

How To Scale Out Of A Nosql Database

Practical Cassandra. Vitalii

Scaling in the Cloud with AWS. By: Eli White (CTO & mojolive) eliw.com - mojolive.com

Scalable Architecture on Amazon AWS Cloud

NoSQL and Hadoop Technologies On Oracle Cloud

Вовченко Алексей, к.т.н., с.н.с. ВМК МГУ ИПИ РАН

Extending a Methodology for Migration of the Database Layer to the Cloud Considering Relational Database Schema Migration to NoSQL

Big Data Technologies Compared June 2014

What Next for DBAs in the Big Data Era

The evolution of database technology (II) Huibert Aalbers Senior Certified Executive IT Architect

Introduction to NoSQL

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

Choosing The Right Big Data Tools For The Job A Polyglot Approach

Performance Management of SQL Server

Understanding NoSQL on Microsoft Azure

How graph databases started the multi-model revolution

An Open Source NoSQL solution for Internet Access Logs Analysis

TRAINING PROGRAM ON BIGDATA/HADOOP

Big Data Analytics. Lucas Rego Drumond

Choosing the right NoSQL database for the job: a quality attribute evaluation

NoSQL storage and management of geospatial data with emphasis on serving geospatial data using standard geospatial web services

BRAC. Investigating Cloud Data Storage UNIVERSITY SCHOOL OF ENGINEERING. SUPERVISOR: Dr. Mumit Khan DEPARTMENT OF COMPUTER SCIENCE AND ENGEENIRING

Transcription:

Preparing Your Data For Cloud Narinder Kumar Inphina Technologies 1

Agenda Relational DBMS's : Pros & Cons Non-Relational DBMS's : Pros & Cons Types of Non-Relational DBMS's Current Market State Applicability of Different Data-Bases in different environments 2

Relational DBMS - Pros Data Integrity ACID Capabilities High Level Query Model Data Normalization Data Independence 3

Relational DBMS - Cons Scaling Issues By Duplication (Master-Slave) By Sharding/Division (Not transparent) Fixed Schema Mostly disk-oriented (Performance) May fair poorly with large data 4

Non-Relational DBMS - Pros Scalability Replication / Availability Performance Deployment Flexibility Modelling Flexibility Faster Development (?) 5

Non-Relational DBMS - Cons Lack of Transactional Support Data Integrity is Application's responsibility Data Duplication / Application Dependent Eventually Consistent (mostly) No Standardization New Technology 6

RDBMS's and Cloud 7

Cloud Capable RDBMS s Almost every RDBMS can run in a IAAS Cloud Platform 8

Cloud Native RDBMS 9

Types of Non-Relational DBMS Key Value Stores Document Stores Column Stores Graph Stores 10

Key Value Data-Bases Object is completely Opaque to DB Mostly GET, PUT & DELETE operations are supported There may be limits on size of Objects Inspired by Amazon Dynamo Paper 11

Key Value DataBases & Cloud Project Voldemort MemCachedDB Tokyo Tyrant 12

Document Data-Bases Object is not completely opaque to DB Every Object has it's own schema FirstName="Bob", Address="5 Oak St.", Hobby="sailing". FirstName="Jonathan", Children=("Michael,10", "Jennifer,8") Can perform queries based on Object's attributes Possible to describe relationships between Objects Joins and Transactions are not supported Good for XML or JSON objects 13

Document Data-Bases & Cloud 14

Column-Store Data-Bases Richer than Document Stores Multi-Dimensional Map Tables Row Column Time-Stamp Supports Multiple Data Types Usually use an Underlying DFS Inspired by Google Big Table Paper 15

Column-Store Data-Bases & Cloud 16

Key Factors while Making a Choice Application Architecture Requirements Platform choices Non-Functional Requirements Consistency Availability Partition Security Data Redemption 17

Different Requirements = Different Solutions 18

Scenario 1 Feature First Corporate Data Consistency Requirements Business Intelligence Legacy Application RDBMS on Amazon Cloud, RackSpace (IaaS) or Microsoft Azure/Amazon RDS (PaaS) 19

Scenario 2 Consumer Facing Application Big Files (Images, BLOB's, Files) Geographically Distributed Mostly writes Not heavy requirement on Rich Queries Key-Value Data Stores (Amazon S3, Project Voldemort, Redis) 20

Scenario 3 Hundreds Of Government Documents with different schemas Need to serve on Web Data Mining Document Data-Stores (Amazon SimpleDB, Apache Couche DB, MongoDB) 21

Scenario 4 Scale First Huge Data-Set Analytical Requirements Consumer Facing High Availability over Consistency Column Data-Stores (Google App Engine, Hbase, Cassandra) 22

Mix & Match of Earlier Scenarios Polyglot Persistence RDBMS for low-volume and high value Key-Value DB for large files with little queries Memcached DB for shortlived Data Column DB for Analytics 23

CAP Theorem 24

Conclusions One Size does not Fit all Many choices No-SQL DB's providing Alternatives RDBMS serve useful purpose 25

nkumar@inphina.com www.inphina.com http://thoughts.inphina.com 26

References http://nosql-database.org/ http://www.drdobbs.com/database/218900502 http://perspectives.mvdirona.com/2009/11/03/onesizedoesnotfitall.aspx http://blog.nahurst.com/visual-guide-to-nosql-systems http://blog.heroku.com/archives/2010/7/20/nosql/ http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html http://project-voldemort.com/ http://code.google.com/p/redis/ http://memcachedb.org/ http://aws.amazon.com/simpledb/ http://couchdb.apache.org/ http://www.mongodb.org http://riak.basho.com/ 27