Data Management in the Cloud. Zhen Shi



Similar documents
Daniel J. Adabi. Workshop presentation by Lukas Probst

Report Data Management in the Cloud: Limitations and Opportunities

Data Management in the Cloud

Data Management in the Cloud: Limitations and Opportunities. Annies Ductan

Ethopian Database Management system as a Cloud Service: Limitations and advantages

bigdata Managing Scale in Ontological Systems

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

The Inside Scoop on Hadoop

Data Management in the Cloud: Limitations and Opportunities

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

Distribution transparency. Degree of transparency. Openness of distributed systems

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

Automated Control in a Cloud Computing Infrastructure

Highly available, scalable and secure data with Cassandra and DataStax Enterprise. GOTO Berlin 27 th February 2014

Real Time Big Data Processing

How to Do/Evaluate Cloud Computing Research. Young Choon Lee

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Case Study. Highly Available, Fault Tolerant Cloud Solution & AWS Managed Support. Case Study. A Telehealthcare Company

Divy Agrawal and Amr El Abbadi Department of Computer Science University of California at Santa Barbara

The Modern Online Application for the Internet Economy: 5 Key Requirements that Ensure Success

Introduction to Cloud Computing

Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

Amazon EC2 Product Details Page 1 of 5

Cloud Based Distributed Databases: The Future Ahead

Cloud Computing Disaster Recovery (DR)

The Private Cloud Your Controlled Access Infrastructure

Improving MapReduce Performance in Heterogeneous Environments

Cloud Computing and Advanced Relationship Analytics

BIG DATA TRENDS AND TECHNOLOGIES

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

Designing a Data Solution with Microsoft SQL Server 2014

AWS Account Setup and Services Overview

ZADARA STORAGE. Managed, hybrid storage EXECUTIVE SUMMARY. Research Brief

Alfresco Enterprise on AWS: Reference Architecture

Pervasive PSQL Meets Critical Business Requirements

INTRODUCING APACHE IGNITE An Apache Incubator Project

NCTA Cloud Architecture

Fault-Tolerant Computer System Design ECE 695/CS 590. Putting it All Together

ISSN: (Online) Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

NoSQL in der Cloud Why? Andreas Hartmann

Making Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY MANNING ANN KELLY. Shelter Island

Amr El Abbadi. Computer Science, UC Santa Barbara

Learning Management Redefined. Acadox Infrastructure & Architecture

Three Ways Enterprises are Protecting SQL Server in the Cloud

Hadoop & Spark Using Amazon EMR

This paper defines as "Classical"

From Grid Computing to Cloud Computing & Security Issues in Cloud Computing

Cloud Storage and Backup

SERVER 101 COMPUTE MEMORY DISK NETWORK

Cluster, Grid, Cloud Concepts

How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns

Operating Stoop for Efficient Parallel Data Processing In Cloud

MarkLogic Enterprise Data Layer

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

From Spark to Ignition:

Distributed Data Stores

Amazon Elastic Beanstalk

Written examination in Cloud Computing

Deploying for Success on the Cloud: EBS on Amazon VPC Session ID#11312

Data Management in the Cloud -

Challenges for Data Driven Systems

Migrating Your Databases to Amazon Aurora. June 2016

EMC VPLEX FAMILY. Continuous Availability and data Mobility Within and Across Data Centers

To run large data set applications in the cloud, and run them well,

P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland

Big Data Governance Certification Self-Study Kit Bundle

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

Hosting Transaction Based Applications on Cloud

Big Data on Cloud Computing- Security Issues

Can the Elephants Handle the NoSQL Onslaught?

Storage Architectures for Big Data in the Cloud

NoSQL for SQL Professionals William McKnight

PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015

Introduction to Apache Cassandra

Case Study. Cloud Adoption, Fault Tolerant AWS Support & Magento ecommerce Implementation. Case Study

Big Data and Industrial Internet

IT and Storage for Big Data Analytics

Hadoop & its Usage at Facebook

Integrating Big Data into the Computing Curricula

A programming model in Cloud: MapReduce

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Hadoop Operations Management for Big Data Clusters in Telecommunication Industry

5-Layered Architecture of Cloud Database Management System

White Paper. Managing MapR Clusters on Google Compute Engine

Chapter 18: Database System Architectures. Centralized Systems

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

Transactions and ACID in MongoDB

Hadoop Parallel Data Processing

White Paper. Prepared by: Neil Shah Director, Product Management March, 2014 Version: 1. Copyright 2014, ezdi, LLC.

In-memory databases and innovations in Business Intelligence

MagFS: The Ideal File System for the Cloud

Transcription:

Data Management in the Cloud Zhen Shi

Overview Introduction 3 characteristics of cloud computing 2 types of cloud data management application 2 types of cloud data management architecture Conclusion

Introduction What is cloud computing? Hail as revolutionizing IT Free corporation Plug into extremely powerful computing resource

Cloud computing platform Introduction

3 characteristics of cloud computing Three characteristics of a cloud computing Compute power is elastic Data is stored at an untrusted host Data is replicated by crossing large geographic distances

3 characteristics of cloud computing Computer power is elastic Computer resource can be scaled up and down

3 characteristics of cloud computing Data is stored at untrusted host Not really deliver from a celestial location subject to local rules and regulations Example: Amazon S3

3 characteristics of cloud computing Data is replicated by crossing large geographic distances Example: Amazon EC2 Regions availability zones Persist even in the face of failures of an entire location

2 types of cloud data management applications Based on cloud characteristics Transactional data management Analytical data management

2 types of cloud data management applications Transactional data management Not typically use shared-nothing architecture

2 types of cloud data management applications What is shared-nothing architecture Distributed computing architecture Node is independent and self-sufficient Single point of connection across the system

2 types of cloud data management applications Transactional data management Hard to maintain ACID when data replication over large geographic distances

2 types of cloud data management applications What is ACID guarantee? Set of properties that guarantee that database transactions are processed reliably Atomicity, Consistency, Isolation, Durability

2 types of cloud data management applications Transactional data management Risks to storing transactional data on an untrusted hosts mission-critical business processes Mission-critical business processes Customer data Credit card number

2 types of cloud data management applications Analytical data management Perfect match with shared-nothing architecture

2 types of cloud data management applications Analytical data management Unnecessary for ACID guarantees

2 types of cloud data management applications Analytical data management Left out sensitive data

2 types of cloud data management applications Comparison with two cloud data management application Transactional data management Sharednothing architecture ACID guarantees Sensitive data Not match Necessary Take care Analytical data management Match Unnecessary Left out

2 types of cloud data analysis DBMS market to move into the cloud data management system Software solutions to perform the data analysis MapReduce-like software Commercially shared-nothing parallel databases

2 types of cloud data analysis Requirement of cloud DBMS Efficiency Fault tolerance Working under a heterogeneous environment Operation on encrypted data Interfacing with business intelligences products

2 types of cloud data analysis MapReduce-like software Fault tolerance High priority Working under a heterogeneous environment Able to run in a heterogeneous environment

2 types of cloud data analysis Operation on encrypted data No ability to operate on encrypted data Interfacing with business intelligences products Not easy interfacing with business intelligences products Efficiency Need to discuss

2 types of cloud data analysis Shared-Nothing parallel database Fault tolerance Restart a query upon a failure Working under a heterogeneous environment Design to run on homogeneous

2 types of cloud data analysis Operating on encrypted data Not able to operate on encrypted data Interfacing with business intelligences products Working perfectly Efficiency Need to discuss

Conclusion Parallel database Advantage: Efficiency Performance MapReduce-like software Advantage: fault tolerance Heterogeneous cluster

Opinion and extension reading Hybrid solution Example: Pig project at Yahoo SCOPE project at Microsoft

References https://aws.amazon.com/ec2/ https://aws.amazon.com/s3/ http://cs.yale.edu/homes/dna/papers/abadi-cloudieee09.pdf