Similar documents
nosql and Non Relational Databases

Preparing Your Data For Cloud

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

Lecture Data Warehouse Systems

Structured Data Storage

Slave. Master. Research Scholar, Bharathiar University

BRAC. Investigating Cloud Data Storage UNIVERSITY SCHOOL OF ENGINEERING. SUPERVISOR: Dr. Mumit Khan DEPARTMENT OF COMPUTER SCIENCE AND ENGEENIRING

Not Relational Models For The Management of Large Amount of Astronomical Data. Bruno Martino (IASI/CNR), Memmo Federici (IAPS/INAF)

The NoSQL Ecosystem, Relaxed Consistency, and Snoop Dogg. Adam Marcus MIT CSAIL

NoSQL. Thomas Neumann 1 / 22

Advanced Data Management Technologies

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Cloud Scale Distributed Data Storage. Jürmo Mehine

Using Cloud Services for Test Environments A case study of the use of Amazon EC2

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

Do Relational Databases Belong in the Cloud? Michael Stiefel

Overview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB

Introduction to NoSQL

Introduction to NOSQL

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

A COMPARATIVE STUDY OF NOSQL DATA STORAGE MODELS FOR BIG DATA

An Open Source NoSQL solution for Internet Access Logs Analysis

Can the Elephants Handle the NoSQL Onslaught?

NoSQL Databases. Polyglot Persistence

NoSQL Data Base Basics

The Quest for Extreme Scalability

3 Case Studies of NoSQL and Java Apps in the Real World

MongoDB in the NoSQL and SQL world. Horst Rechner Berlin,

Benchmarking and Analysis of NoSQL Technologies

So What s the Big Deal?

NoSQL Database Options

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Databases : Lecture 11 : Beyond ACID/Relational databases Timothy G. Griffin Lent Term Apologies to Martin Fowler ( NoSQL Distilled )

Database Management System Choices. Introduction To Database Systems CSE 373 Spring 2013

Big Systems, Big Data

.NET User Group Bern

JAVA IN THE CLOUD PAAS PLATFORM IN COMPARISON

NOSQL INTRODUCTION WITH MONGODB AND RUBY GEOFF

NoSQL Systems for Big Data Management

Challenges for Data Driven Systems

NoSQL Databases. Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015

Logistics. Database Management Systems. Chapter 1. Project. Goals for This Course. Any Questions So Far? What This Course Cannot Do.

Department of Software Systems. Presenter: Saira Shaheen, Dated:

Mobile + HA + Cloud. Eugene Ciurana! ! pr3d4t0r - irc.freenode.net! ##java, ##security, #awk, #python, #bitcoin! irc.oftc.net: #tor, #tor-dev, #tails!

Cloud data store services and NoSQL databases. Ricardo Vilaça Universidade do Minho Portugal

Databases 2 (VU) ( )

Introduction to NoSQL

NoSQL and Graph Database

Cloud Computing Is In Your Future

CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #13: NoSQL and MapReduce

NoSQL Databases. Nikos Parlavantzas

NoSQL Database Systems and their Security Challenges

Scalable Architecture on Amazon AWS Cloud

Comparison of the Frontier Distributed Database Caching System with NoSQL Databases

A programming model in Cloud: MapReduce

Introduction to NoSQL Databases. Tore Risch Information Technology Uppsala University

RDF graph Model and Data Retrival

BIG DATA TOOLS. Top 10 open source technologies for Big Data

Introduction to Big Data Training

Application of NoSQL Database in Web Crawling

NoSQL. What Is NoSQL? Why NoSQL?

Applications for Big Data Analytics

Web 2.0 Technology Overview. Lecture 8 GSL Peru 2014

Big Data Management. Big Data Management. (BDM) Autumn Povl Koch September 30,

How To Write A Database Program

NoSQL: Going Beyond Structured Data and RDBMS

Understanding NoSQL on Microsoft Azure

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

An Approach to Implement Map Reduce with NoSQL Databases

Understanding NoSQL Technologies on Windows Azure

The CAP theorem and the design of large scale distributed systems: Part I

Open Source Technologies on Microsoft Azure

Crazy NoSQL Data Integration with Pentaho

Big Data Database Revenue and Market Forecast,

APP DEVELOPMENT ON THE CLOUD MADE EASY WITH PAAS

INTRODUCTION TO CASSANDRA

these three NoSQL databases because I wanted to see a the two different sides of the CAP

Sentimental Analysis using Hadoop Phase 2: Week 2

Table of Contents. Développement logiciel pour le Cloud (TLC) Table of Contents. 5. NoSQL data models. Guillaume Pierre

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

DROSS Distributed & Resilient Open Source Software

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

Cloud Application Development (SE808, School of Software, Sun Yat-Sen University) Yabo (Arber) Xu

Yahoo! Cloud Serving Benchmark

Distributed Data Parallel Computing: The Sector Perspective on Big Data

NoSQL Evaluation. A Use Case Oriented Survey

Cloud Computing with Microsoft Azure

Study concluded that success rate for penetration from outside threats higher in corporate data centers

NoSQL and Hadoop Technologies On Oracle Cloud

Hacettepe University Department Of Computer Engineering BBM 471 Database Management Systems Experiment

NOSQL DATABASES AND CASSANDRA

NoSQL replacement for SQLite (for Beatstream) Antti-Jussi Kovalainen Seminar OHJ-1860: NoSQL databases

How To Scale Out Of A Nosql Database

Referential Integrity in Cloud NoSQL Databases

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

NoSQL in der Cloud Why? Andreas Hartmann

NewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing (OLTP) for Big Data Management

Comparative Analysis of Nosql Specimen with Relational Data Store for Big Data in Cloud

Transcription:

wow

CPSC350 relational schemas table normalization practical use of relational algebraic operators tuple relational calculus and their expression in a declarative query language

relational schemas

CPSC350 relational schemas table normalization practical use of relational algebraic operators tuple relational calculus and their expression in a declarative query language

practical knowledge the stuff you need to know to be a better software developer

An introduction to the non- SQL part of the course

Cloud Storage in a! Post SQL World Ian WIlkes ars technica

Since the rise of the Web, SQL-based relational databases have been the dominant structured storage technology behind online applications.

The past few years have seen the emergence of the cloud as a compelling environment for online application development, bringing true utility computing into the infrastructure pantheon

But the cloud and SQL do not mix well, and multiple efforts are now underway to offer viable alternatives to the venerable database.

while relational databases are by no means doomed, they will soon be joined in the cloud, and possibly out-shined by, new non-relational database technologies

<disect>

more and more computing is in the cloud google docs dropbox

more and more computing is in the cloud google docs dropbox chromeos Google Cr-48

more and more computing is in the cloud google docs dropbox chromeos Google Cr-48 ipad / Android / mobile computing

more and more computing is in the cloud Even cloud/browser based IDEs.

Take a moment! do back of envelope calculation! what % of your computer time on cloud?

results

some believe the cloud and SQL do not mix well

<on one side>

SQL databases work fine setup straightforward programming not that hard wide range of support for small projects XAMPP stack near plug and play. SQL on a Amazon ec2 instance easy to set up and work with

SQL can scale vertically

big iron

SQL can grow vertically

easily SQL cannot grow horizontally

Brewer s Theorem a.k.a. CAP Theorem A system cannot have high consistency, availability, and partition tolerance simultaneously

SQL focuses on consistency

SQL focuses on consistency constraints on foreign keys!

ACID semantics (ch11)

Brewer s Theorem a.k.a. CAP Theorem A system cannot have high consistency, availability, and partition tolerance simultaneously

availability online web apps - availability is a must

Brewer s Theorem a.k.a. CAP Theorem A system cannot have high consistency, availability, and partition tolerance simultaneously

mismatch Clustered machines -- SQL semantics

BASE instead of ACID semantics

BASE Basically Available, Soft state, Eventual consistency

Basically Available: This constraint states that the system does guarantee the availability of the data as regards CAP Theorem; there will be a response to any request. But, that response could still be failure to obtain the requested data or the data may be in an inconsistent or changing state, much like waiting for a check to clear in your bank account.

Soft state: The state of the system could change over time, so even during times without input there may be changes going on due to eventual consistency, thus the state of the system is always soft.

Eventual consistency: The system will eventually become consistent once it stops receiving input. The data will propagate to everywhere it should sooner or later, but the system will continue to receive input and is not checking the consistency of every transaction before it moves onto the next one.

Wait, there s more

Our model JOIN JOIN ORDER BY PythonFlask SQL Server

Extending our model Python Flask JOIN JOIN ORDER BY Python Flask ORDER BY SQL Server JOIN ORDER BY Python Flask JOIN JOIN Python Flask ORDER BY JOIN Python Flask Python Flask Python Flask

Our model JOIN JOIN ORDER BY Apache S. SQL Server

Extending our model Apache S. JOIN JOIN ORDER BY Apache S. ORDER BY SQL Server JOIN ORDER BY Apache S. JOIN JOIN Apache S. ORDER BY JOIN Apache S. Apache S. Apache S.

All data processing on database server! adds load to CPU! further inhibits scaling

Some SQL platform providers offer cloud relational db Amazon s mysql based RDS Heroku s PostgreSQL but these more like managed hosting than true cloud (distributed) computing

push to NoSQL alternatives! sacrifice consistency in favor of scalability and availability!

first implementations proprietary! Google Big Table Amazon Dynamo

NoSQL features No schemas -- no fixed table structure, no fixed columns little to no protection against invalid data

NoSQL features No or limited joins - no built in methods for connecting /joining entries. must be done at application level

NoSQL features Restricted query interface - SQL has rich query interface. nosql somewhat limited. although some NoSQL systems (e.g., CouchDB) offer different views.

NoSQL features Limited transactions or locks - ACID transactions not distributed system friendly.

NoSQL open source projects

Cassandra developed by Facebook and used in-house now an Apache project used in a large number of production environments Digg, Rackspace, Twitter (internally) Netflix Redit CERN not Facebook

CouchDB an Apache project operates on JSON documents the read state of the db is not always consistent with the latest writes. reasonably stable used by: wego, BBC, World Wildlife Fund, http://www.couchbase.com/ customers/case-studies

mongodb operates on JSON documents considered faster than CouchDB. good stability used by foursquare, bit.ly, intuit, shutterfly, nytimes, Etsy, Cern (Large Hadron Collider)justin.tv

Voldemort developed by LinkedIn now open source bare-bones key-value pair db. all data management work done in the applications.

HBase A Big Table clone uses map-reduce

Amazon s simpledb not open source disappointing performance not mature

The top ones Cassandra CouchDB mongodb Redis

Failure of NoSQL DIGG

Road Map for remainder of course mongodb CouchDB Redis