Tutorial on Scalable Cloud-Databases in Research and Practice
|
|
- Abraham Atkinson
- 8 years ago
- Views:
Transcription
1 Tutorial on Scalable Cloud-Databases in Research and Practice Norbert Ritter, Felix Gessert
2 Outline Motivation ORESTES: a Cloud- Database Middleware Solving Latency and Polyglot Storage Overview The New Field Cloud Data Management Cloud Database Models Research Challenges Wrap-up
3 Introduction: Which classes of cloud databases are there?
4 Cloud Databases Backend-as-a- Service Managed NoSQL Databases Cloud-only DBaaS-Systems Analytics-asa-Service Orestes Parse SQL Azure Amazon RDS Heroku Pos. Compose Cloudant DynamoDB Google F1 Database-as-a-Service BigQuery EMR Managed RDBMS Platform-as-a-Service Infrastructure-as-a-Service Cloud-Deployment of DBMSs
5 Data Management Data Analytics Architecture Typical Data Architecture: Analytics Reporting Data Mining Data Warehouse The era of one-size-fits-all database systems is over Specialized cloud DBaaS databases Operative Database Applications
6 Database Sweetspots RDBMS General-purpose ACID transactions Parallel DWH Aggregations/OLAP for massive data amounts NewSQL High throughput relational OLTP Wide-Column Store Long scans over structured data Document Store Deeply nested data models Key-Value Store Large-scale session storage Graph Database Graph algorithms & queries In-Memory KV-Store Counting & statistics Wide-Column Store Massive usergenerated content
7 Cloud-Database Sweetspots Realtime BaaS Communication and collaboration Managed RDBMS General-purpose ACID transactions Managed Cache Caching and transient storage Azure Tables Wide-Column Store Very large tables Wide-Column Store Massive usergenerated content Backend-as-a-Service Small Websites and Apps Managed NoSQL Full-Text Search Google Cloud Storage Object Store Massive File Storage Amazon Elastic MapReduce Hadoop-as-a-Service Big Data Analytics
8 Cloud Data Management New field tackling the design, implementation, evaluation and application implications of database systems in cloud environments: Protocols, APIs, Caching Load distribution, Auto-Scaling, SLAs Workload Management, Metering Replication, Partitioning, Transactions, Indexing Application architecture, Data Models Multi-Tenancy, Consistency, Availability, Query Processing, Security
9 Cloud-Database Models unstructured Data Model unstructured Analytics machine image Analytics- as-a- Service Analytics/ ML APIs schemafree NoSQL machine image Managed NoSQL NoSQL Service Database-as-a-Service relational structured RDBMS machine image Managed RDBMS/ DWH RDBMS/ DWH Service Deployment Model
10 Cloud-Deployed Database Database-image provisioned in IaaS/PaaS-cloud IaaS/PaaS deployment of database system IaaS-Cloud Does not solve: Provisioning, Backups, Security, Scaling, Elasticity, Performance Tuning, Failover, Replication,...
11 Managed RDBMS/DWH/NoSQL DB Cloud-hosted database SQL Azure DBaaS-Provider RDBMS DWH NoSQL DB IaaS-Cloud Provisioning, Backups, Google Security, Cloud SQL Scaling, Elasticity, Performance Tuning, Failover, Replication,... Amazon Redshift RDBMS NoSQL DB DWH
12 Proprietary Cloud Database Designed for and deployed in vendor-specific cloud environment Amazon SimpleDB Black-box system Managed by Cloud Provider Cloud Provider s API Google Cloud Datastore Database.com Azure Blob Storage Azure Tables BigTable, Megastore, Spanner, F1, Dynamo, PNuts, Relational Cloud, Openstack Swift Google Cloud Storage Database Object Store
13 Analytics-as-a-Service Analytic frameworks and machine learning with service APIs Provisioning, Data Ingest Analytics Cluster Amazon Elastic MapReduce Azure HDInsight Google BigQuery Google Prediction API Analytics ML Cloud
14 Backend-as-a-Service DBaaS with embedded custom and predefined application logic Authentication, Users, Validation,etc. Maps to (different) databases Backend API Service-Layer Data API AppCelerator Cloud (mobile) BaaS IaaS-Cloud
15 Pricing Models Pay-per-use and plan-based Account Plan-based Pay-per-use Parameters: Allocated Plan (e.g. Parameters: 2 Network, instances Bandwidth, + X GB storage) Storage, CPU, Requests, etc. Payment: Pre-Paid, Post-Paid Variants: On-Demand, Auction, Reserved e.g. Compose e.g. DynamoDB Usage End of month
16 Database-as-a-Service Approaches to Multi-Tenancy Private OS Private Process/DB Private Schema Shared Schema Virtual Schema Schema Schema Schema Schema Database Database Database Database Database Process Database Process Database Process Database Process VM VM VM VM Hardware Resources Hardware Resources Hardware Resources Hardware Resources e.g. Amazon RDS e.g. Compose e.g. Google DataStore Most SaaS Apps T. Kiefer, W. Lehner Private table database virtualization for dbaas UCC, 2011
17 Multi-Tenancy: Trade-Offs App. indep. Ressource Util. Isolation Maintenance, Provisioning Private OS Private Process/DB Private Schema Shared Schema W. Lehner, U. Sattler Web-scale Data Management for the Cloud Springer, 2013
18 Authentication & Authorization Checking Permissions and Indentity Internal Schemes External Identity Provider e.g. Amazon IAM e.g. OpenID e.g. SAML Federated Identity (Single Sign On) Authenticate/Login Token Authenticated Request Response Authentication Authorization API Database-aa-Service User-based Access Control Role-based Access Control Policies e.g. Amazon S3 ACLs e.g. Amazon IAM e.g. XACML
19 Service Level Agreements (SLAs) Specification of Application/Tenant Requirements SLA Technical Part 1. SLO 2. SLO 3. SLO Legal Part 1. Fees 2. Penalties Service Level Objectives: Availability Durability Consistency/Staleness Query Response Time
20 Service Level Agreements Expressing application requirements Functional Service Level Objectives Guarantee a feature Determined by database system Examples: transactions, join Non-Functional Service Level Objectives Guarantee a certain quality of service (QoS) Determined by database system and service provider Examples: Continuous: response time (latency), throughput Binary: Elasticity, Read-your-writes
21 Service Level Objects Making SLOs measurable through utilities
22 Workload Management Guaranteeing SLAs Typical approach: Maximize: W. Lehner, U. Sattler Web-scale Data Management for the Cloud Springer, 2013
23 Resource & Capacity Planning From a DBaaS provider s perspective Goal: minimize penalty and resource costs Resources Provisioned Resources: #No of Shard- or Replica servers Computing, Storage, Underprovisioning: Network Capacities SLAs violated Usage maximized Actual Expected Load Load Overprovisioning: SLAs met Excess Capacities Time T. Lorido-Botran, J. Miguel-Alonso et al.: Auto-scaling Techniques for Elastic Applications in Cloud Environments. Technical Report, 2013
24 SLAs in the wild Most DBaaS systems offer no SLAs, or only a a simple uptime guarantee Model CAP SLAs SimpleDB Dynamo-DB Azure Tables AE/Cloud DataStore S3, Az. Blob, GCS Table-Store (NoSQL Service) Table-Store (NoSQL Service) Table-Store (NoSQL Service) Entity-Group Store (NoSQL Service) Object-Store (NoSQL Service) CP CP CP 99.9% uptime CP AP 99.9% uptime (S3)
25 Open Research Questions in Cloud Data Management Service-Level Agreements How can SLAs be guaranteed in a virtualized, multi-tenant cloud environment? Consistency Which consistency guarantees can be provided in a georeplicated system without sacrificing availability? Performance & Latency How can a DBaaS deliver low latency in face of distributed storage and application tiers? Transactions Can ACID transactions be aligned with NoSQL and scalability?
26 Location : Santa Clara Submission Deadline: August 30
27 Outline Motivation ORESTES: a Cloud- Database Middleware Two problems: Latency Polyglot Storage Vision: Orestes Middleware Solving Latency and Polyglot Storage Wrap-up
28 Latency & Polyglot Storage Two central problems Goal of ORESTES: Solve both problems through a scalable cloud-database middleware If the application is geographically distributed, how can we guarantee fast database access? If one size doesn t fit all how can polyglot persistence be leveraged on a declarative, automated basis?
29 Problem I: Latency ms 500 ms 1s ms Loading Average: 9,3s -7% Conversions -20% Traffic -9% Visitors -1% Revenue
30 If perceived speed is such an import factor...what causes slow page load times?
31 State of the art Two bottlenecks: latency und processing High Latency Processing Time
32 Network Latency The underlying problem of high page load times I. Grigorik, High performance browser networking. O Reilly Media, 2013.
33 The low-latency vision Data is served by ubiquitous web-caches Low Latency Less Processing
34 The web s caching model Staleness as a consequence of scalability Expiration-based Every object has a defined Time-To-Live (TTL) Revalidations Allow clients and caches to check freshness at the server Stale Data Research Question: Can database services leverage the web caching infrastructure for low latency with rich consistency guarantees?
35 Problem II: Polyglot Persistence Current best practice Research Question: Billing Data Application Layer Nested Application Data Can we automate the Friend network Cached data & metrics data Search Index Session data database Recommendation Engine Files mapping problem? Google Cloud Storage Amazon Elastic MapReduce
36 Vision Schemas can be annotated with requirements - Write Throughput > 10,000 RPS - Read Availability > % - Scans = true - Full-Text-Search = true - Monotonic Read = true Schema DBs Tables Fields
37 Vision The Polyglot Persistence Mediator chooses the database Application Data and Operations Database Metrics Polyglot Persistence Mediator Annotated Schema Latency < 30ms db 1 db 2 db 3
38 The Big Picture Implementation in ORESTES Polyglot Storage and Low Latency are the central goals of ORESTES Database-as-a-Service Middleware: Unified REST Caching, Transactions, Polyglot Storage Schemas, Standard HTTP Caching API Authorization, Multi-Tenancy Desktop Cache Sketch Mobile Internet Reverse-Proxy Caches Orestes Servers Tablet Content-Delivery- Network
39 Outline Motivation ORESTES: a Cloud- Database Middleware Solving Latency and Polyglot Storage Wrap-up Cache Sketch Approach Caching Arbitrary Data Predicting TTLs Polyglot Persistence Mediator SLA-Approach Database Selection
40 Web Caching Concepts Invalidation- and expiration-based caches Request Path Client Expirationbased Caches Cache Hits Invalidationbased Caches Browser Caches, Forward Proxies, ISP Caches Content Delivery Networks, Reverse Proxies Expiration-based Caches: An object x is considered fresh for TTL x seconds The server assigns TTLs for each object Invalidation-based Caches: Expose object eviction operation to the server Server/DB
41 Invalidation-Minimization Staleness-Minimization The Cache Sketch approach Letting the client handle cache coherence Client Needs Revalidation? Bloom filter Client Cache Sketch Expirationbased Caches Browser Caches, Forward Proxies, ISP Caches at connect Periodic every Δ seconds at transaction begin Request Path Cache Hits Invalidationbased Caches Content Delivery Networks, Reverse Proxies Invalidations, Records Report Expirations and Writes Non-expired Record Keys Counting Bloom Filter Server/DB Needs Invalidation? Server Cache Sketch
42 The End-to-End Path of Requests The Caching Hierarchy Low Latency Reduced Database Load DB.posts.get(id) GET /db/posts/{id} JavaScript Cache-Hit: Return Flash-Crowd Object Higher Cache-Miss HTTP or Protection Revalidation: Return Availability record from Forward Request DB with caching TTL Hit Client- (Browser-) Cache Miss Proxy Caches Miss ISP Caches Miss CDN Caches Miss Reverse- Proxy Cache Miss Orestes Updated by Cache Sketch Updated by the server
43 The Client Cache Sketch Let c t be the client Cache Sketch generated at time t, containing the key key x of every record x that was written before it expired in all caches, i.e. every x for which holds: find(key) key h 1... Client Cache Sketch Hit key GET request no Bits = 1 Cache Miss k hash functions h k m Bloom filter bits yes Revalidation key
44 1 Slow initial page loads Solution: Cached Initialization Clients load the Cache Sketch at connection Every non-stale cached record can be reused without degraded consistency purge(obj) False-Positive Hash- Rate: Browser Functions: CDN Cache With distinct updates and 5% error rate: 11 KByte hasha(oid) hashb(oid) hasha(oid) hashb(oid) Flat(Counting Bloomfilter)
45 2 Slow CRUD performance Client Expirationbased Caches Invalidationbased Caches Server Query Cache Sketch Cache Sketch c t fresh records Cache Hits Revalidate record & Refresh Cache Sketch Fresh record & new Cache Sketch -time t -time t + Δ
46 3 High Abort Rates in OCC Client Begin Transaction Bloom Filter Cache Reads Cache Writes Cache Commit: read- & write-set versions Committed OR aborted + stale objects REST-Server REST-Server REST-Server Writes (Hidden) Read all DB Writes (Public) 5 validation 4 Coordinator prevent conflicting validations
47 TTL Estimation Determining the cache expiration Shorter TTLs Longer TTLs less invalidations less stale reads Higher cache-hit rates more invalidations
48 Performance Northern California Ireland Setup: Client CDN Orestes MongoDB Page load times with cached initialization (simulation): With Facebook s cache hit rate: >2,5x improvement Average Latency for YCSB Workloads A and B (real): 95% Read 5% Writes 5x latency improvement
49 Low Latency If the application Transparent is geographically end-to-end distributed, caching how can using we guarantee the Cache fast database Sketch. access? If one size doesn t fit all how can polyglot persistence be leveraged on a declarative, automated basis?
50 Towards Automated Polyglot Persistence Necessary steps Goal: Extend classic workload management to polyglot persistence Leverage hetereogeneous (NoSQL) databases 1. Requirements 2. Resolution 3. Mediation Tenant specifies requirements as Service- Level-Agreements Find or provision a suitable combination of databases Mediate data and database operations
51 Step I - Requirements Expressing the application s needs Tenant annotates schema with his requirements Tenant 1. Define schema 2. Annotate Database Table Field Table Field Field Field annotated Inherits continuous annotations Annotations Continuous non-functional e.g. write latency < 15ms Binary functional e.g. Atomic updates Binary non-functional e.g. Read-your-writes 1 Requirements
52 Step II - Resolution Finding the best database The Provider resolves the requirements RANK: scores available database systems Routing Model: defines the optimal mapping from schema elements to databases Capabilities for available DBs 1. Find optimal Provider Either: Refuse or Provision new DB 2a. If unsatisfiable RANK(schema_root, DBs) through recursive descent using annotated schema and metrics 2b. Generates routing model Routing Model Route schema_element db transform db-independent to dbspecific operations 2 Resolution
53 Step II - Resolution Ranking algorithm by example Annotations Schema DBs = { MongoDB, Riak, Cassandra, CouchDB, Redis, MySQL, S3, Hbase } RANK Algorithm Linearizability Availability Read latency ECommerceDB database Customers Table ShoppingBasket List<String> UserName String Database MongoDB Redis No annotation Cassandra, Availability CouchDB, Redis, recursive descent to child 99% % 0.05 Binary requirement 1. Exclude DBs that do not support 99.9% 0.9 it 2. Recursive descent MySQL 94% 0.04 HBase DBs = { MongoDB, Riak, MySQL, S3, Hbase } Latency 10ms 1 1ms 1 40ms ms 0.1 Continuous requirement databases calculate
54 Step II - Resolution Ranking algorithm by example Annotations Linearizability Availability Read latency Schema ECommerceDB database Customers Table ShoppingBasket List<String> UserName String DB RANK Algorithm Score MongoDB 0.9 Redis MySQL 0.12 HBase 0.5 Binary requirement 1. Exclude DBs that do not support it 2. Recursive descent 3. Pick DB with best total score and add it to routing model Routing Model: Customers MongoDB
55 Step III - Mediation Routing data and operations The PPM routes data Operation Rewriting: translates from abstract to database-specific operations Runtime Metrics: Latency, availability, etc. are reported to the resolver Primary Database Option: All data periodically gets materialized to designated database Report metrics Application 1. CRUD, queries, transactions, etc. Polyglot Persistence Mediator Uses Routing Model Triggers periodic materialization 2. route db 1 db 2 db 3 3 Mediation
56 Evaluation: News Article Prototype of Polyglot Persistence Mediator in ORESTES Scenario: news articles with impression counts Objectives: low-latency top-k queries, highthroughput counts, article-queries Article Counter
57 Evaluation: News Article Prototype built on ORESTES Scenario: news articles with impression counts Objectives: low-latency top-k queries, highthroughput counts, article-queries Mediator Counter updates kill performance
58 Evaluation: News Article Prototype built on ORESTES Scenario: news articles with impression counts Objectives: low-latency top-k queries, highthroughput counts, article-queries Mediator No powerful queries
59 Evaluation: News Article Prototype built on ORESTES Scenario: news articles with impression counts Objectives: low-latency top-k queries, highthroughput counts, article-queries Article ID Title Document Imp. Imp. ID Sorted Set Found Resolution
60 Outline Motivation ORESTES: a Cloud- Database Middleware Current/Future Work Summary Putting ORESTES into practice Solving Latency and Polyglot Storage Wrap-up
61 Outlook: Real-Time Combining Query Caching, Continuous Queries, Polyglot Queries Continuous Queries (Websockets) Create Update Delete ORESTES Polyglot Views Fresh Caches Fresh Cache Sketch Pub-Sub Pub-Sub
62 Summary Cache Sketch: web caching for database services Consistent (Δ-atomic) expiration-based caching Invalidation-based caching with minimal purges Bloom filter of stale objects & TTL Estimation Polyglot Persistence Mediator: 1. SLA-annotated Schemas 2. Score DBs and choose best 3. Route data and operations Requirements Resolution Mediation
63 Team: Felix Gessert, Florian Bücklers, Hannes Kuhlmann, Malte Lauenroth, Michael Schaarschmidt 19. August 2014
64 0,6s 3,0s 5,0s 5,7s 7,2s 0,7s 0,5s 1,8s 2,8s 3,6s 3,4s 0,5s 1,8s 1,5s 1,3s 2,4s 2,9s 4,0s 4,7s 5,7s Page-Load Times What impact does the Cache Sketch have? CALIFORNIEN FRANKFURT TOKYO +156% [WE RT] 0,5s SYDNEY FRANKFURT
65 Backend-as-a-Service Tutorial on the BaaS paradigm from app perspective
66 Thank you orestes.info, baqend.com
Slides: baqend.com/nosql.pdf
Slides: baqend.com/nosql.pdf Felix Gessert (29.04.2014) About me PhD student (database group university of hamburg) About me PhD student (database group university of hamburg) Research Project for PhD
More informationBaqend: a Spin-Off from Database Research
Baqend: a Spin-Off from Database Research Felix Gessert, CEO fg@baqend.com Team: Felix Gessert, Florian Bücklers, Hannes Kuhlmann, Malte Lauenroth, Michael Schaarschmidt 19. August 2014 Presentation is
More informationPreparing Your Data For Cloud
Preparing Your Data For Cloud Narinder Kumar Inphina Technologies 1 Agenda Relational DBMS's : Pros & Cons Non-Relational DBMS's : Pros & Cons Types of Non-Relational DBMS's Current Market State Applicability
More informationGigaSpaces Real-Time Analytics for Big Data
GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and
More informationCloudDB: A Data Store for all Sizes in the Cloud
CloudDB: A Data Store for all Sizes in the Cloud Hakan Hacigumus Data Management Research NEC Laboratories America http://www.nec-labs.com/dm www.nec-labs.com What I will try to cover Historical perspective
More informationHow To Write A Database Program
SQL, NoSQL, and Next Generation DBMSs Shahram Ghandeharizadeh Director of the USC Database Lab Outline A brief history of DBMSs. OSs SQL NoSQL 1960/70 1980+ 2000+ Before Computers Database DBMS/Data Store
More informationStructured Data Storage
Structured Data Storage Xgen Congress Short Course 2010 Adam Kraut BioTeam Inc. Independent Consulting Shop: Vendor/technology agnostic Staffed by: Scientists forced to learn High Performance IT to conduct
More informationWhy NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1
Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots
More informationOn- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...
More informationAnalytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world
Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3
More informationChallenges for Data Driven Systems
Challenges for Data Driven Systems Eiko Yoneki University of Cambridge Computer Laboratory Quick History of Data Management 4000 B C Manual recording From tablets to papyrus to paper A. Payberah 2014 2
More informationWhere We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344
Where We Are Introduction to Data Management CSE 344 Lecture 25: DBMS-as-a-service and NoSQL We learned quite a bit about data management see course calendar Three topics left: DBMS-as-a-service and NoSQL
More informationCloud Scale Distributed Data Storage. Jürmo Mehine
Cloud Scale Distributed Data Storage Jürmo Mehine 2014 Outline Background Relational model Database scaling Keys, values and aggregates The NoSQL landscape Non-relational data models Key-value Document-oriented
More informationBenchmarking and Analysis of NoSQL Technologies
Benchmarking and Analysis of NoSQL Technologies Suman Kashyap 1, Shruti Zamwar 2, Tanvi Bhavsar 3, Snigdha Singh 4 1,2,3,4 Cummins College of Engineering for Women, Karvenagar, Pune 411052 Abstract The
More informationScalable Architecture on Amazon AWS Cloud
Scalable Architecture on Amazon AWS Cloud Kalpak Shah Founder & CEO, Clogeny Technologies kalpak@clogeny.com 1 * http://www.rightscale.com/products/cloud-computing-uses/scalable-website.php 2 Architect
More informationComparison of the Frontier Distributed Database Caching System with NoSQL Databases
Comparison of the Frontier Distributed Database Caching System with NoSQL Databases Dave Dykstra dwd@fnal.gov Fermilab is operated by the Fermi Research Alliance, LLC under contract No. DE-AC02-07CH11359
More informationEnabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings
Solution Brief Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings Introduction Accelerating time to market, increasing IT agility to enable business strategies, and improving
More informationSo What s the Big Deal?
So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data
More informationAmr El Abbadi. Computer Science, UC Santa Barbara amr@cs.ucsb.edu
Amr El Abbadi Computer Science, UC Santa Barbara amr@cs.ucsb.edu Collaborators: Divy Agrawal, Sudipto Das, Aaron Elmore, Hatem Mahmoud, Faisal Nawab, and Stacy Patterson. Client Site Client Site Client
More informationextensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010
System/ Scale to Primary Secondary Joins/ Integrity Language/ Data Year Paper 1000s Index Indexes Transactions Analytics Constraints Views Algebra model my label 1971 RDBMS O tables sql-like 2003 memcached
More informationNewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing (OLTP) for Big Data Management
NewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing (OLTP) for Big Data Management A B M Moniruzzaman Department of Computer Science and Engineering, Daffodil International
More informationBig Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect
on AWS Services Overview Bernie Nallamotu Principle Solutions Architect \ So what is it? When your data sets become so large that you have to start innovating around how to collect, store, organize, analyze
More informationElastic Application Platform for Market Data Real-Time Analytics. for E-Commerce
Elastic Application Platform for Market Data Real-Time Analytics Can you deliver real-time pricing, on high-speed market data, for real-time critical for E-Commerce decisions? Market Data Analytics applications
More informationAmazon Web Services. 18.11.2015 Yu Xiao
Amazon Web Services 18.11.2015 Yu Xiao Agenda Introduction to Amazon Web Services(AWS) 7 Steps to Select the Right Architecture for Your Web Applications Private, Public or Hybrid Cloud? AWS Case Study
More informationCloud data store services and NoSQL databases. Ricardo Vilaça Universidade do Minho Portugal
Cloud data store services and NoSQL databases Ricardo Vilaça Universidade do Minho Portugal Context Introduction Traditional RDBMS were not designed for massive scale. Storage of digital data has reached
More informationORESTES: a Scalable Database-as-a-Service Architecture for Low Latency
ORESTES: a Scalable -as-a-service Architecture for Low Latency Felix Gessert #, Florian Bücklers #, Norbert Ritter # # Computer Science Department, University of Hamburg Vogt-Kölln Straße, 57 Hamburg,
More informationNoSQL Databases. Nikos Parlavantzas
!!!! NoSQL Databases Nikos Parlavantzas Lecture overview 2 Objective! Present the main concepts necessary for understanding NoSQL databases! Provide an overview of current NoSQL technologies Outline 3!
More informationCluster Computing. ! Fault tolerance. ! Stateless. ! Throughput. ! Stateful. ! Response time. Architectures. Stateless vs. Stateful.
Architectures Cluster Computing Job Parallelism Request Parallelism 2 2010 VMware Inc. All rights reserved Replication Stateless vs. Stateful! Fault tolerance High availability despite failures If one
More informationCloud Computing Is In Your Future
Cloud Computing Is In Your Future Michael Stiefel www.reliablesoftware.com development@reliablesoftware.com http://www.reliablesoftware.com/dasblog/default.aspx Cloud Computing is Utility Computing Illusion
More informationIntroduction to NOSQL
Introduction to NOSQL Université Paris-Est Marne la Vallée, LIGM UMR CNRS 8049, France January 31, 2014 Motivations NOSQL stands for Not Only SQL Motivations Exponential growth of data set size (161Eo
More informationBASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS
WHITEPAPER BASHO DATA PLATFORM BASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS INTRODUCTION Big Data applications and the Internet of Things (IoT) are changing and often improving our
More informationCan the Elephants Handle the NoSQL Onslaught?
Can the Elephants Handle the NoSQL Onslaught? Avrilia Floratou, Nikhil Teletia David J. DeWitt, Jignesh M. Patel, Donghui Zhang University of Wisconsin-Madison Microsoft Jim Gray Systems Lab Presented
More informationNoSQL in der Cloud Why? Andreas Hartmann
NoSQL in der Cloud Why? Andreas Hartmann 17.04.2013 17.04.2013 2 NoSQL in der Cloud Why? Quelle: http://res.sys-con.com/story/mar12/2188748/cloudbigdata_0_0.jpg Why Cloud??? 17.04.2013 3 NoSQL in der Cloud
More informationData Modeling for Big Data
Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes
More informationMaking Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY MANNING ANN KELLY. Shelter Island
Making Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY ANN KELLY II MANNING Shelter Island contents foreword preface xvii xix acknowledgments xxi about this book xxii Part 1 Introduction
More informationNoSQL Systems for Big Data Management
NoSQL Systems for Big Data Management Venkat N Gudivada East Carolina University Greenville, North Carolina USA Venkat Gudivada NoSQL Systems for Big Data Management 1/28 Outline 1 An Overview of NoSQL
More informationOverview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB
Overview of Databases On MacOS Karl Kuehn Automation Engineer RethinkDB Session Goals Introduce Database concepts Show example players Not Goals: Cover non-macos systems (Oracle) Teach you SQL Answer what
More informationAssignment # 1 (Cloud Computing Security)
Assignment # 1 (Cloud Computing Security) Group Members: Abdullah Abid Zeeshan Qaiser M. Umar Hayat Table of Contents Windows Azure Introduction... 4 Windows Azure Services... 4 1. Compute... 4 a) Virtual
More informationEvaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing
Evaluating NoSQL for Enterprise Applications Dirk Bartels VP Strategy & Marketing Agenda The Real Time Enterprise The Data Gold Rush Managing The Data Tsunami Analytics and Data Case Studies Where to go
More informationCloud Computing with Microsoft Azure
Cloud Computing with Microsoft Azure Michael Stiefel www.reliablesoftware.com development@reliablesoftware.com http://www.reliablesoftware.com/dasblog/default.aspx Azure's Three Flavors Azure Operating
More informationCUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series www.cumulux.com
` CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS Review Business and Technology Series www.cumulux.com Table of Contents Cloud Computing Model...2 Impact on IT Management and
More informationReferences. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline
References Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of
More informationIntroduction to Database Systems CSE 444
Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon References Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of
More informationSQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford
SQL VS. NO-SQL Adapted Slides from Dr. Jennifer Widom from Stanford 55 Traditional Databases SQL = Traditional relational DBMS Hugely popular among data analysts Widely adopted for transaction systems
More informationWINDOWS AZURE DATA MANAGEMENT
David Chappell October 2012 WINDOWS AZURE DATA MANAGEMENT CHOOSING THE RIGHT TECHNOLOGY Sponsored by Microsoft Corporation Copyright 2012 Chappell & Associates Contents Windows Azure Data Management: A
More informationOpen Source Technologies on Microsoft Azure
Open Source Technologies on Microsoft Azure A Survey @DChappellAssoc Copyright 2014 Chappell & Associates The Main Idea i Open source technologies are a fundamental part of Microsoft Azure The Big Questions
More informationWINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS
WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS Managing and analyzing data in the cloud is just as important as it is anywhere else. To let you do this, Windows Azure provides a range of technologies
More informationMigrating SaaS Applications to Windows Azure
Migrating SaaS Applications to Windows Azure Lessons Learned 04.04.2012 Speaker Introduction Deepthi Raju Marketing Technology Services Deepthi joined Smartbridge in 2005 and has over twenty years of technology
More informationThe NoSQL Ecosystem, Relaxed Consistency, and Snoop Dogg. Adam Marcus MIT CSAIL marcua@csail.mit.edu / @marcua
The NoSQL Ecosystem, Relaxed Consistency, and Snoop Dogg Adam Marcus MIT CSAIL marcua@csail.mit.edu / @marcua About Me Social Computing + Database Systems Easily Distracted: Wrote The NoSQL Ecosystem in
More informationRyan Horn, Lead Software Engineer at Twilio. November 12, 2014 Las Vegas. BDT312 Using the Cloud to Scale from a Database to a Data Platform
BDT312 Using the Cloud to Scale from a Database to a Data Platform Ryan Horn, Lead Software Engineer at Twilio November 12, 2014 Las Vegas 2014 Amazon.com, Inc. and its affiliates. All rights reserved.
More informationReal Time Big Data Processing
Real Time Big Data Processing Cloud Expo 2014 Ian Meyers Amazon Web Services Global Infrastructure Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
More informationNOSQL INTRODUCTION WITH MONGODB AND RUBY GEOFF LANE <GEOFF@ZORCHED.NET> @GEOFFLANE
NOSQL INTRODUCTION WITH MONGODB AND RUBY GEOFF LANE @GEOFFLANE WHAT IS NOSQL? NON-RELATIONAL DATA STORAGE USUALLY SCHEMA-FREE ACCESS DATA WITHOUT SQL (THUS... NOSQL) WIDE-COLUMN / TABULAR
More informationnosql and Non Relational Databases
nosql and Non Relational Databases Image src: http://www.pentaho.com/big-data/nosql/ Matthias Lee Johns Hopkins University What NoSQL? Yes no SQL.. Atleast not only SQL Large class of Non Relaltional Databases
More informationGoogle Cloud Platform The basics
Google Cloud Platform The basics Who I am Alfredo Morresi ROLE Developer Relations Program Manager COUNTRY Italy PASSIONS Community, Development, Snowboarding, Tiramisu' Reach me alfredomorresi@google.com
More informationNoSQL Data Base Basics
NoSQL Data Base Basics Course Notes in Transparency Format Cloud Computing MIRI (CLC-MIRI) UPC Master in Innovation & Research in Informatics Spring- 2013 Jordi Torres, UPC - BSC www.jorditorres.eu HDFS
More informationReference Model for Cloud Applications CONSIDERATIONS FOR SW VENDORS BUILDING A SAAS SOLUTION
October 2013 Daitan White Paper Reference Model for Cloud Applications CONSIDERATIONS FOR SW VENDORS BUILDING A SAAS SOLUTION Highly Reliable Software Development Services http://www.daitangroup.com Cloud
More informationLecture Data Warehouse Systems
Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART C: Novel Approaches in DW NoSQL and MapReduce Stonebraker on Data Warehouses Star and snowflake schemas are a good idea in the DW world C-Stores
More informationStudy concluded that success rate for penetration from outside threats higher in corporate data centers
Auditing in the cloud Ownership of data Historically, with the company Company responsible to secure data Firewall, infrastructure hardening, database security Auditing Performed on site by inspecting
More informationSoftware-Defined Networks Powered by VellOS
WHITE PAPER Software-Defined Networks Powered by VellOS Agile, Flexible Networking for Distributed Applications Vello s SDN enables a low-latency, programmable solution resulting in a faster and more flexible
More informationLARGE-SCALE DATA STORAGE APPLICATIONS
BENCHMARKING AVAILABILITY AND FAILOVER PERFORMANCE OF LARGE-SCALE DATA STORAGE APPLICATIONS Wei Sun and Alexander Pokluda December 2, 2013 Outline Goal and Motivation Overview of Cassandra and Voldemort
More informationPerformance Management for Cloudbased STC 2012
Performance Management for Cloudbased Applications STC 2012 1 Agenda Context Problem Statement Cloud Architecture Need for Performance in Cloud Performance Challenges in Cloud Generic IaaS / PaaS / SaaS
More informationCloud Database Emergence
Abstract RDBMS technology is favorable in software based organizations for more than three decades. The corporate organizations had been transformed over the years with respect to adoption of information
More informationMASTER PROJECT. Resource Provisioning for NoSQL Datastores
Vrije Universiteit Amsterdam MASTER PROJECT - Parallel and Distributed Computer Systems - Resource Provisioning for NoSQL Datastores Scientific Adviser Dr. Guillaume Pierre Author Eng. Mihai-Dorin Istin
More informationArchitecting Applications to Scale in the Cloud
Architecting Applications to Scale in the Cloud Nuxeo White Paper White Paper Architecting Applications to Scale in the Cloud Table of Contents Executive Summary... 3! Between IaaS and SaaS... 3! Nuxeo
More informationThe Cloud to the rescue!
The Cloud to the rescue! What the Google Cloud Platform can make for you Aja Hammerly, Developer Advocate twitter.com/thagomizer_rb So what is the cloud? The Google Cloud Platform The Google Cloud Platform
More informationA Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012)
1. Computation Amazon Web Services Amazon Elastic Compute Cloud (Amazon EC2) provides basic computation service in AWS. It presents a virtual computing environment and enables resizable compute capacity.
More informationAmazon Redshift & Amazon DynamoDB Michael Hanisch, Amazon Web Services Erez Hadas-Sonnenschein, clipkit GmbH Witali Stohler, clipkit GmbH 2014-05-15
Amazon Redshift & Amazon DynamoDB Michael Hanisch, Amazon Web Services Erez Hadas-Sonnenschein, clipkit GmbH Witali Stohler, clipkit GmbH 2014-05-15 2014 Amazon.com, Inc. and its affiliates. All rights
More informationCloud Platforms, Challenges & Hadoop. Aditee Rele Karpagam Venkataraman Janani Ravi
Cloud Platforms, Challenges & Hadoop Aditee Rele Karpagam Venkataraman Janani Ravi Cloud Platform Models Aditee Rele Microsoft Corporation Dec 8, 2010 IT CAPACITY Provisioning IT Capacity Under-supply
More informationYahoo! Cloud Serving Benchmark
Yahoo! Cloud Serving Benchmark Overview and results March 31, 2010 Brian F. Cooper cooperb@yahoo-inc.com Joint work with Adam Silberstein, Erwin Tam, Raghu Ramakrishnan and Russell Sears System setup and
More informationOracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>
s Big Data solutions Roger Wullschleger DBTA Workshop on Big Data, Cloud Data Management and NoSQL 10. October 2012, Stade de Suisse, Berne 1 The following is intended to outline
More informationChapter 7: Distributed Systems: Warehouse-Scale Computing. Fall 2011 Jussi Kangasharju
Chapter 7: Distributed Systems: Warehouse-Scale Computing Fall 2011 Jussi Kangasharju Chapter Outline Warehouse-scale computing overview Workloads and software infrastructure Failures and repairs Note:
More informationCloud Computing: Meet the Players. Performance Analysis of Cloud Providers
BASEL UNIVERSITY COMPUTER SCIENCE DEPARTMENT Cloud Computing: Meet the Players. Performance Analysis of Cloud Providers Distributed Information Systems (CS341/HS2010) Report based on D.Kassman, T.Kraska,
More informationNoSQL Databases. Polyglot Persistence
The future is: NoSQL Databases Polyglot Persistence a note on the future of data storage in the enterprise, written primarily for those involved in the management of application development. Martin Fowler
More informationCloud Computing. Lecture 24 Cloud Platform Comparison 2014-2015
Cloud Computing Lecture 24 Cloud Platform Comparison 2014-2015 1 Up until now Introduction, Definition of Cloud Computing Pre-Cloud Large Scale Computing: Grid Computing Content Distribution Networks Cycle-Sharing
More informationIntegrating Big Data into the Computing Curricula
Integrating Big Data into the Computing Curricula Yasin Silva, Suzanne Dietrich, Jason Reed, Lisa Tsosie Arizona State University http://www.public.asu.edu/~ynsilva/ibigdata/ 1 Overview Motivation Big
More informationScalable Web Application
Scalable Web Applications Reference Architectures and Best Practices Brian Adler, PS Architect 1 Scalable Web Application 2 1 Scalable Web Application What? An application built on an architecture that
More informationPetabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013
Petabyte Scale Data at Facebook Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013 Agenda 1 Types of Data 2 Data Model and API for Facebook Graph Data 3 SLTP (Semi-OLTP) and Analytics
More informationAdvanced Data Management Technologies
ADMT 2014/15 Unit 15 J. Gamper 1/44 Advanced Data Management Technologies Unit 15 Introduction to NoSQL J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE ADMT 2014/15 Unit 15
More informationCloud Computing Trends
UT DALLAS Erik Jonsson School of Engineering & Computer Science Cloud Computing Trends What is cloud computing? Cloud computing refers to the apps and services delivered over the internet. Software delivered
More informationCloud Computing: Making the right choices
Cloud Computing: Making the right choices Kalpak Shah Clogeny Technologies Pvt Ltd 1 About Me Kalpak Shah Founder & CEO, Clogeny Technologies Passionate about economics and technology evolving through
More informationF1: A Distributed SQL Database That Scales. Presentation by: Alex Degtiar (adegtiar@cmu.edu) 15-799 10/21/2013
F1: A Distributed SQL Database That Scales Presentation by: Alex Degtiar (adegtiar@cmu.edu) 15-799 10/21/2013 What is F1? Distributed relational database Built to replace sharded MySQL back-end of AdWords
More informationAzure Data Lake Analytics
Azure Data Lake Analytics Compose and orchestrate data services at scale Fully managed service to support orchestration of data movement and processing Connect to relational or non-relational data
More informationDo Relational Databases Belong in the Cloud? Michael Stiefel www.reliablesoftware.com development@reliablesoftware.com
Do Relational Databases Belong in the Cloud? Michael Stiefel www.reliablesoftware.com development@reliablesoftware.com How do you model data in the cloud? Relational Model A query operation on a relation
More informationWeb Application Hosting in the AWS Cloud Best Practices
Web Application Hosting in the AWS Cloud Best Practices September 2012 Matt Tavis, Philip Fitzsimons Page 1 of 14 Abstract Highly available and scalable web hosting can be a complex and expensive proposition.
More informationAn Approach to Implement Map Reduce with NoSQL Databases
www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 4 Issue 8 Aug 2015, Page No. 13635-13639 An Approach to Implement Map Reduce with NoSQL Databases Ashutosh
More informationBig Data Technologies Compared June 2014
Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development
More informationDISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2
DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing Slide 1 Slide 3 A style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet.
More informationTECHNOLOGY WHITE PAPER Jun 2012
TECHNOLOGY WHITE PAPER Jun 2012 Technology Stack C# Windows Server 2008 PHP Amazon Web Services (AWS) Route 53 Elastic Load Balancing (ELB) Elastic Compute Cloud (EC2) Amazon RDS Amazon S3 Elasticache
More informationData Services Advisory
Data Services Advisory Modern Datastores An Introduction Created by: Strategy and Transformation Services Modified Date: 8/27/2014 Classification: DRAFT SAFE HARBOR STATEMENT This presentation contains
More informationCISC 432/CMPE 432/CISC 832 Advanced Database Systems
CISC 432/CMPE 432/CISC 832 Advanced Database Systems Course Info Instructor: Patrick Martin Goodwin Hall 630 613 533 6063 martin@cs.queensu.ca Office Hours: Wednesday 11:00 1:00 or by appointment Schedule:
More informationbigdata Managing Scale in Ontological Systems
Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural
More informationNoSQL replacement for SQLite (for Beatstream) Antti-Jussi Kovalainen Seminar OHJ-1860: NoSQL databases
NoSQL replacement for SQLite (for Beatstream) Antti-Jussi Kovalainen Seminar OHJ-1860: NoSQL databases Background Inspiration: postgresapp.com demo.beatstream.fi (modern desktop browsers without
More information<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store
Oracle NoSQL Database A Distributed Key-Value Store Charles Lamb, Consulting MTS The following is intended to outline our general product direction. It is intended for information
More informationTowards a Scalable and Unified REST API for Cloud Data Stores
Towards a Scalable and Unified REST API for Cloud Data Stores Felix Gessert, Steffen Friedrich, Wolfram Wingerath, Michael Schaarschmidt, Norbert Ritter Database and Information Systems Group University
More informationMicrosoft Azure Data Technologies: An Overview
David Chappell Microsoft Azure Data Technologies: An Overview Sponsored by Microsoft Corporation Copyright 2014 Chappell & Associates Contents Blobs... 3 Running a DBMS in a Virtual Machine... 4 SQL Database...
More informationMyDBaaS: A Framework for Database-as-a-Service Monitoring
MyDBaaS: A Framework for Database-as-a-Service Monitoring David A. Abreu 1, Flávio R. C. Sousa 1 José Antônio F. Macêdo 1, Francisco J. L. Magalhães 1 1 Department of Computer Science Federal University
More informationSlave. Master. Research Scholar, Bharathiar University
Volume 3, Issue 7, July 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper online at: www.ijarcsse.com Study on Basically, and Eventually
More informationPreparing Your IT for the Holidays. A quick start guide to take your e-commerce to the Cloud
Preparing Your IT for the Holidays A quick start guide to take your e-commerce to the Cloud September 2011 Preparing your IT for the Holidays: Contents Introduction E-Commerce Landscape...2 Introduction
More informationCloud Computing. Up until now
Cloud Computing Lecture 20 Cloud Platform Comparison & Load Balancing 2010-2011 Up until now Introduction, Definition of Cloud Computing Pre-Cloud Large Scale Computing: Grid Computing Content Distribution
More informationThe Sierra Clustered Database Engine, the technology at the heart of
A New Approach: Clustrix Sierra Database Engine The Sierra Clustered Database Engine, the technology at the heart of the Clustrix solution, is a shared-nothing environment that includes the Sierra Parallel
More information