HDB++: HIGH AVAILABILITY WITH. l TANGO Meeting l 20 May 2015 l Reynald Bourtembourg
|
|
|
- Clifton Carson
- 10 years ago
- Views:
Transcription
1 HDB++: HIGH AVAILABILITY WITH Page 1
2 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 2
3 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 3
4 WHAT IS CASSANDRA? Mythology: an excellent Oracle not believed. A massively scalable open source NoSQL (Not Only SQL) database Created by Facebook Open Source since 2008 Apache license, 2.0, compatible with GPLV3 Page 4
5 WHAT IS CASSANDRA? Peer to peer architecture No Single Point of Failure Replication Continuous Availability Multi Data Centers support 100s to 1000s nodes Java High Write Throughput Read efficiency Page 5
6 WHAT IS CASSANDRA? Source: Page 6
7 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 7
8 WHO IS USING CASSANDRA? Page 8
9 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 9
10 CASSANDRA QUERY LANGUAGE CQL: Cassandra Query Language Very similar to SQL But restrictions and limitations JOIN requests are forbidden No subqueries String comparisons are limited (when not using SOLR) select * from my_table where mystring like %tango% ; No OR operator Can only apply a WHERE condition on an indexed column (or primary key) Page 10
11 CASSANDRA QUERY LANGUAGE Collections (64K Limitation): list set map TTL INSERT = UPDATE (UPSERT) Doc: cqlsh Page 11
12 CASSANDRA QUERY LANGUAGE CREATE TABLE IF NOT EXISTS att_scalar_devdouble_ro ( att_conf_id timeuuid, period text, data_time timestamp, data_time_us int, value_r double, quality int, error_desc text, PRIMARY KEY ((att_conf_id,period),data_time,data_time_us) ) WITH comment='scalar DevDouble ReadOnly Values Table ; Page 12
13 CASSANDRA QUERY LANGUAGE CREATE TABLE IF NOT EXISTS att_scalar_devdouble_ro ( att_conf_id timeuuid, period text, data_time timestamp, data_time_us int, value_r double, quality int, error_desc text, PRIMARY KEY ((att_conf_id,period),data_time,data_time_us) ); Partition key Clustering columns Page 13
14 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 14
15 CASSANDRA ARCHITECTURE Node: one Cassandra instance (Java process) Token Range Node 5 Node 7 Node 6 Node 8 Page 15
16 CASSANDRA ARCHITECTURE Partition: ordered and replicable unit of data on a node identified by a token Partitioner (based on mumur3 algorithm by default) will distribute the data across the nodes. Token Range Node 5 Node 7 Node 6 Node 8 Page 16
17 CASSANDRA ARCHITECTURE Rack: logical set of nodes Token Range Rack 1 Rack 3 Node 5 Rack 2 Node 7 Node 6 Rack 4 Node 8 Page 17
18 CASSANDRA ARCHITECTURE Data Center: logical set of racks Data Center 1 Data Center 2 Token Range Rack 1 Rack 3 Node 5 Rack 2 Node 7 Node 6 Rack 4 Node 7 Page 18
19 REQUEST COORDINATION Cluster: full set of nodes which maps to a single complete token ring Cassandra Cluster Data Center 1 Data Center 2 Token Range Rack 1 Rack 3 Node 5 Rack 2 Node 7 Node 6 Rack 4 Node 7 Page 19
20 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 20
21 REQUEST COORDINATION Coordinator: the node chosen by the client to receive a particular read or write request to its cluster Data Center 1 Client Page 21
22 REQUEST COORDINATION Coordinator: the node chosen by the client to receive a particular read or write request to its cluster Data Center 1 Coordinator Client Page 22
23 REQUEST COORDINATION Coordinator: the node chosen by the client to receive a particular read or write request to its cluster Data Center 1 Coordinator Read/Write Client Page 23
24 REQUEST COORDINATION Any node can coordinate any request Each client request may be coordinated by a different node Data Center 1 Coordinator Acknowledge No Single Point of Failure Client Page 24
25 REQUEST COORDINATION The Cassandra driver chooses the coordinator node Round-Robin pattern, token-aware pattern Client library to manage requests Many open source drivers for many programming languages Java Python C++ Node.js C# Perl PHP Go Clojure Ruby Scala R (GNU S) Client Driver Coordinator Erlang Haskell ODBC Page 25 Rust
26 REQUEST COORDINATION The coordinator manages the replication process Replication Factor (RF): onto how many nodes should a write be copied The write will occur on the nodes responsible for that partition 1 RF (#nodes in cluster) Every write is time-stamped RF=3 Coordinator Client Driver Page 26
27 REQUEST COORDINATION The coordinator manages the replication process Replication Factor (RF): onto how many nodes should a write be copied The write will occur on the nodes responsible for that partition 1 RF (#nodes in cluster) Every write is time-stamped RF=3 Coordinator Client Driver Page 27
28 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 28
29 CONSISTENCY The coordinator applies the Consistency Level (CL) Consistency Level (CL): Number of nodes which must acknowledge a request Examples of CL: ONE TWO THREE ANY ALL (Not recommended) QUORUM (= RF/2 + 1) EACH_QUORUM LOCAL_QUORUM CL may vary for each request On success, the coordinator notifies the client (with most recent partition data in case of read request) Page 29
30 CONSISTENCY ONE - READ - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Direct Read Request Digest Read Request (Hash) + eventual read repair Page 30
31 CONSISTENCY ONE - READ - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Direct Read Request Digest Read Request (Hash) + eventual read repair Page 31
32 CONSISTENCY ONE READ - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Direct Read Request Digest Read Request (Hash) + eventual read repair Page 32
33 CONSISTENCY ONE - READ - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Direct Read Request Digest Read Request (Hash) + eventual read repair Page 33
34 CONSISTENCY QUORUM READ - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Direct Read Request Digest Read Request (Hash) Page 34
35 CONSISTENCY QUORUM READ - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Direct Read Request Digest Read Request (Hash) Page 35
36 CONSISTENCY QUORUM READ - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Direct Read Request Digest Read Request (Hash) Page 36
37 CONSISTENCY QUORUM READ - SINGLE DC Coordinator Client Driver Node 6 In case of inconsistency: the most recent data is returned RF=3 Node 5 Direct Read Request Digest Read Request (Hash) Page 37
38 CONSISTENCY QUORUM READ - SINGLE DC Coordinator Client Driver Node 6 Read repair if needed RF=3 Node 5 Direct Read Request Digest Read Request (Hash) Page 38
39 CONSISTENCY ONE WRITE - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Write Request Page 39
40 CONSISTENCY ONE WRITE - SINGLE DC Coordinator Client Driver Node 6 ACK RF=3 Node 5 ACK Page 40
41 CONSISTENCY ONE WRITE - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Write Request Page 41
42 CONSISTENCY ONE WRITE - SINGLE DC Client Driver SUCCESS Coordinator Node 6 ACK RF=3 Node 5 ACK Page 42
43 CONSISTENCY ONE WRITE - SINGLE DC Client Driver SUCCESS Coordinator hint max_hint_window_in_ms property in cassandra.yaml file Node 6 ACK RF=3 Node 5 ACK Hinted handoff mechanism Page 43
44 CONSISTENCY ONE WRITE - SINGLE DC Client Driver Coordinator hint max_hint_window_in_ms property in cassandra.yaml file Node 6 RF=3 Node 5 Write Request Hinted handoff mechanism Page 44
45 CONSISTENCY ONE WRITE - SINGLE DC Client Driver Coordinator hint max_hint_window_in_ms property in cassandra.yaml file Node 6 RF=3 Node 5 Write Request Hinted handoff mechanism Page 45
46 CONSISTENCY ONE WRITE - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Hinted handoff mechanism Page 46
47 CONSISTENCY if node downtime > max_hint_window_in_ms Anti-entropy node repair Page 47
48 CONSISTENCY QUORUM WRITE - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Write Request Page 48
49 CONSISTENCY QUORUM WRITE - SINGLE DC Coordinator Client Driver Node 6 ACK RF=3 Node 5 ACK Page 49
50 CONSISTENCY QUORUM WRITE - SINGLE DC Client Driver SUCCESS Coordinator Node 6 ACK RF=3 Node 5 ACK Page 50
51 CONSISTENCY QUORUM WRITE - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Write Request Page 51
52 CONSISTENCY QUORUM WRITE - SINGLE DC Client Driver SUCCESS Coordinator Node 6 ACK RF=3 Node 5 ACK Page 52
53 CONSISTENCY QUORUM WRITE - SINGLE DC Coordinator Client Driver Node 6 RF=3 Node 5 Write Request Page 53
54 CONSISTENCY QUORUM WRITE - SINGLE DC Client Driver FAILURE Coordinator Node 6 ACK RF=3 Node 5 ACK Page 54
55 CONSISTENCY LOCAL QUORUM WRITE - MULTI DC Client Driver Coordinator Node 5 Node 6 RF=2 RF=3 Node 5 DC1 DC2 Write Request Page 55
56 CONSISTENCY LOCAL QUORUM WRITE - MULTI DC Client Driver SUCCESS Coordinator Node 5 Node 6 ACK RF=3 RF=2 ACK Node 5 DC1 DC2 ACK Page 56
57 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 57
58 MONITORING TOOL: OPSCENTER Page 58
59 OVERVIEW What is Cassandra (C*)? Who is using C*? CQL C* architecture Request Coordination Consistency Monitoring tool HDB++ Page 59
60 HDB++ libhdb++ <<implements>> <<implements>> libhdb++mysql <<use>> libhdb++cassandra <<use>> hdb++cm-srv hdb++es-srv hdb++es-srv hdb++es-srv hdb++es-srv hdb++cm-srv hdb++es-srv hdb++es-srv hdb++es-srv hdb++es-srv MySQL Cassandra Cassandra Cassandra Page 60
61 CONCLUSION: C* PROS High Availability SW upgrade with no downtime HW failure Linear Scalability Need more performances? => Add nodes Big community with industrial support Page 61
62 CONCLUSION: C* PROS Can use Apache Spark for analytics (distributed processing) List, Set, Map data types (tuples and user defined types soon) Tries not to let you do actions which do not perform well Backups = snapshot = hard links => very fast (+Replication) Difficult to lose data Good fit for time series data Page 62
63 CONCLUSION: C* CONS Requires more total disk space and machines sstable (C* data files) format can change from one version to another No easy way to come back to a previous version once the sstables have been converted to a newer version Cannot rename keyspaces or tables easily (not foreseen in CQL) Tedious to modify existing partitions (Needs to duplicate the data at some point in the process) Page 63
64 CONCLUSION: C* CONS Different way of modeling Not designed for huge read requests Can be tricky to tune to avoid long GC pauses Maintenance: Need to run nodetool repair regularly if some data are deleted to avoid resurrections (CPU intensive operation) Can take quite some time to redeem disk space after deletion in some specific cases. Page 64
65 THE END
66 USEFUL LINKS Planet Cassandra ( Datastax academy ( Cassandra Java Driver getting started ( Cassandra C++ Driver: Datastax documentation ( Users mailing list: #Cassandra channel on IRC ( Page 66
67 CASSANDRA FUTURE DEPLOYMENT DC Prod 1 partition/hour Keyspace prod RF:3 (write LOCAL_QUORUM) 7200 RPM Disks Big CPU - 64GB RAM DC Analytics 1 Keyspace prod RF:3 (read LOCAL_QUORUM) Keyspace analytics RF:3 (write LOCAL_QUORUM) SSD Disks Big CPU 128 GB RAM DC Analytics 2 Keyspace analytics RF:5 (read LOCAL_QUORUM) 7200 RPM Disks Tiny CPU 32 GB RAM Page 67
68 CASSANDRA FUTURE DEPLOYMENT DC Prod 1 partition/hour Keyspace prod RF:3 (write LOCAL_QUORUM) 7200 RPM Disks Big CPU - 64GB RAM DC Analytics 1 Keyspace prod RF:3 (read LOCAL_QUORUM) Keyspace analytics RF:3 (write LOCAL_QUORUM) SSD Disks Big CPU 128 GB RAM Page 68
69 CASSANDRA S NODE-BASED ARCHITECTURE Page 69
70 BASIC WRITE PATH CONCEPT Page 70
71 BASIC READ PATH CONCEPT Page 71
Enabling SOX Compliance on DataStax Enterprise
Enabling SOX Compliance on DataStax Enterprise Table of Contents Table of Contents... 2 Introduction... 3 SOX Compliance and Requirements... 3 Who Must Comply with SOX?... 3 SOX Goals and Objectives...
Introduction to Cassandra
Introduction to Cassandra DuyHai DOAN, Technical Advocate Agenda! Architecture cluster replication Data model last write win (LWW), CQL basics (CRUD, DDL, collections, clustering column) lightweight transactions
Highly available, scalable and secure data with Cassandra and DataStax Enterprise. GOTO Berlin 27 th February 2014
Highly available, scalable and secure data with Cassandra and DataStax Enterprise GOTO Berlin 27 th February 2014 About Us Steve van den Berg Johnny Miller Solutions Architect Regional Director Western
Practical Cassandra. Vitalii Tymchyshyn [email protected] @tivv00
Practical Cassandra NoSQL key-value vs RDBMS why and when Cassandra architecture Cassandra data model Life without joins or HDD space is cheap today Hardware requirements & deployment hints Vitalii Tymchyshyn
No-SQL Databases for High Volume Data
Target Conference 2014 No-SQL Databases for High Volume Data Edward Wijnen 3 November 2014 The New Connected World Needs a Revolutionary New DBMS Today The Internet of Things 1990 s Mobile 1970 s Mainfram
Distributed Systems. Tutorial 12 Cassandra
Distributed Systems Tutorial 12 Cassandra written by Alex Libov Based on FOSDEM 2010 presentation winter semester, 2013-2014 Cassandra In Greek mythology, Cassandra had the power of prophecy and the curse
Apache Cassandra for Big Data Applications
Apache Cassandra for Big Data Applications Christof Roduner COO and co-founder [email protected] Java User Group Switzerland January 7, 2014 2 AGENDA Cassandra origins and use How we use Cassandra Data
Introduction to Apache Cassandra
Introduction to Apache Cassandra White Paper BY DATASTAX CORPORATION JULY 2013 1 Table of Contents Abstract 3 Introduction 3 Built by Necessity 3 The Architecture of Cassandra 4 Distributing and Replicating
Apache Cassandra and DataStax. DataStax EMEA
Apache Cassandra and DataStax DataStax EMEA Agenda 1. Apache Cassandra 2. Cassandra Query Language 3. Sensor/Time Data-Modeling 4. DataStax Enterprise 5. Realtime Analytics 6. What s New 2 About me Christian
JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra
JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra January 2014 Legal Notices Apache Cassandra, Spark and Solr and their respective logos are trademarks or registered trademarks
Apache Cassandra 1.2
Apache Cassandra 1.2 Documentation January 21, 2016 Apache, Apache Cassandra, Apache Hadoop, Hadoop and the eye logo are trademarks of the Apache Software Foundation 2016 DataStax, Inc. All rights reserved.
Going Native With Apache Cassandra. QCon London, 2014 www.datastax.com @DataStaxEMEA
Going Native With Apache Cassandra QCon London, 2014 www.datastax.com @DataStaxEMEA About Me Johnny Miller Solutions Architect www.datastax.com @DataStaxEU [email protected] @CyanMiller https://www.linkedin.com/in/johnnymiller
LARGE-SCALE DATA STORAGE APPLICATIONS
BENCHMARKING AVAILABILITY AND FAILOVER PERFORMANCE OF LARGE-SCALE DATA STORAGE APPLICATIONS Wei Sun and Alexander Pokluda December 2, 2013 Outline Goal and Motivation Overview of Cassandra and Voldemort
Apache Cassandra 1.2 Documentation
Apache Cassandra 1.2 Documentation January 13, 2013 2013 DataStax. All rights reserved. Contents Apache Cassandra 1.2 Documentation 1 What's new in Apache Cassandra 1.2 1 Key Improvements 1 Concurrent
CASSANDRA. Arash Akhlaghi, Badrinath Jayakumar, Wa el Belkasim. Instructor: Dr. Rajshekhar Sunderraman. CSC 8711 Project Report
CASSANDRA Arash Akhlaghi, Badrinath Jayakumar, Wa el Belkasim Instructor: Dr. Rajshekhar Sunderraman CSC 8711 Project Report 1 Introduction The relational model was brought by E.F. Codd s 1970 paper which
Apache Cassandra 2.0
Apache Cassandra 2.0 Documentation December 16, 2015 Apache, Apache Cassandra, Apache Hadoop, Hadoop and the eye logo are trademarks of the Apache Software Foundation 2015 DataStax, Inc. All rights reserved.
So What s the Big Deal?
So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data
Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra
Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra A Quick Reference Configuration Guide Kris Applegate [email protected] Solution Architect Dell Solution Centers Dave
CQL for Cassandra 2.2 & later
CQL for Cassandra 2.2 & later Documentation January 21, 2016 Apache, Apache Cassandra, Apache Hadoop, Hadoop and the eye logo are trademarks of the Apache Software Foundation 2016 DataStax, Inc. All rights
Best Practices in Deploying and Managing DataStax Enterprise
Best Practices in Deploying and Managing DataStax Enterprise 1 Table of Contents Table of Contents... 2 Abstract... 3 DataStax Enterprise: The Fastest, Most Scalable Distributed Database Technology for
Facebook: Cassandra. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation
Facebook: Cassandra Smruti R. Sarangi Department of Computer Science Indian Institute of Technology New Delhi, India Smruti R. Sarangi Leader Election 1/24 Outline 1 2 3 Smruti R. Sarangi Leader Election
Big Data Development CASSANDRA NoSQL Training - Workshop. March 13 to 17-2016 9 am to 5 pm HOTEL DUBAI GRAND DUBAI
Big Data Development CASSANDRA NoSQL Training - Workshop March 13 to 17-2016 9 am to 5 pm HOTEL DUBAI GRAND DUBAI ISIDUS TECH TEAM FZE PO Box 121109 Dubai UAE, email training-coordinator@isidusnet M: +97150
MariaDB Cassandra interoperability
MariaDB Cassandra interoperability Cassandra Storage Engine in MariaDB Sergei Petrunia Colin Charles Who are we Sergei Petrunia Principal developer of CassandraSE, optimizer developer, formerly from MySQL
Cassandra vs MySQL. SQL vs NoSQL database comparison
Cassandra vs MySQL SQL vs NoSQL database comparison 19 th of November, 2015 Maxim Zakharenkov Maxim Zakharenkov Riga, Latvia Java Developer/Architect Company Goals Explore some differences of SQL and NoSQL
Simba Apache Cassandra ODBC Driver
Simba Apache Cassandra ODBC Driver with SQL Connector 2.2.0 Released 2015-11-13 These release notes provide details of enhancements, features, and known issues in Simba Apache Cassandra ODBC Driver with
Why NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1
Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots
DataStax Enterprise Reference Architecture
DataStax Enterprise Reference Architecture DataStax Enterprise Reference Architecture 7.8.15 1 Table of Contents ABSTRACT... 3 INTRODUCTION... 3 DATASTAX ENTERPRISE... 3 ARCHITECTURE... 3 OPSCENTER: EASY-
Overview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB
Overview of Databases On MacOS Karl Kuehn Automation Engineer RethinkDB Session Goals Introduce Database concepts Show example players Not Goals: Cover non-macos systems (Oracle) Teach you SQL Answer what
Cassandra. Jonathan Ellis
Cassandra Jonathan Ellis Motivation Scaling reads to a relational database is hard Scaling writes to a relational database is virtually impossible and when you do, it usually isn't relational anymore The
Open Source Technologies on Microsoft Azure
Open Source Technologies on Microsoft Azure A Survey @DChappellAssoc Copyright 2014 Chappell & Associates The Main Idea i Open source technologies are a fundamental part of Microsoft Azure The Big Questions
Evaluation of NoSQL databases for large-scale decentralized microblogging
Evaluation of NoSQL databases for large-scale decentralized microblogging Cassandra & Couchbase Alexandre Fonseca, Anh Thu Vu, Peter Grman Decentralized Systems - 2nd semester 2012/2013 Universitat Politècnica
Going Native With Apache Cassandra. NoSQL Matters, Cologne, 2014 www.datastax.com @DataStaxEU
Going Native With Apache Cassandra NoSQL Matters, Cologne, 2014 www.datastax.com @DataStaxEU About Me Johnny Miller Solutions Architect @CyanMiller www.linkedin.com/in/johnnymiller We are hiring www.datastax.com/careers
Case study: CASSANDRA
Case study: CASSANDRA Course Notes in Transparency Format Cloud Computing MIRI (CLC-MIRI) UPC Master in Innovation & Research in Informatics Spring- 2013 Jordi Torres, UPC - BSC www.jorditorres.eu Cassandra:
NoSQL: Going Beyond Structured Data and RDBMS
NoSQL: Going Beyond Structured Data and RDBMS Scenario Size of data >> disk or memory space on a single machine Store data across many machines Retrieve data from many machines Machine = Commodity machine
Apache Cassandra Query Language (CQL)
REFERENCE GUIDE - P.1 ALTER KEYSPACE ALTER TABLE ALTER TYPE ALTER USER ALTER ( KEYSPACE SCHEMA ) keyspace_name WITH REPLICATION = map ( WITH DURABLE_WRITES = ( true false )) AND ( DURABLE_WRITES = ( true
A survey of big data architectures for handling massive data
CSIT 6910 Independent Project A survey of big data architectures for handling massive data Jordy Domingos - [email protected] Supervisor : Dr David Rossiter Content Table 1 - Introduction a - Context
Benchmarking Couchbase Server for Interactive Applications. By Alexey Diomin and Kirill Grigorchuk
Benchmarking Couchbase Server for Interactive Applications By Alexey Diomin and Kirill Grigorchuk Contents 1. Introduction... 3 2. A brief overview of Cassandra, MongoDB, and Couchbase... 3 3. Key criteria
F1: A Distributed SQL Database That Scales. Presentation by: Alex Degtiar ([email protected]) 15-799 10/21/2013
F1: A Distributed SQL Database That Scales Presentation by: Alex Degtiar ([email protected]) 15-799 10/21/2013 What is F1? Distributed relational database Built to replace sharded MySQL back-end of AdWords
MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)
MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!) Erdélyi Ernő, Component Soft Kft. [email protected] www.component.hu 2013 (c) Component Soft Ltd Leading Hadoop Vendor Copyright 2013,
Welcome to Virtual Developer Day MySQL!
Welcome to Virtual Developer Day MySQL! Keynote: Developer and DBA Guide to What s New in MySQL Andrew Morgan - MySQL Product Management @andrewmorgan www.clusterdb.com 1 Program Agenda 1:00 PM Keynote:
The OpenStack TM Object Storage system
The OpenStack TM Object Storage system Deploying and managing a scalable, open- source cloud storage system with the SwiftStack Platform By SwiftStack, Inc. [email protected] Contents Introduction...
Designing Performance Monitoring Tool for NoSQL Cassandra Distributed Database
Designing Performance Monitoring Tool for NoSQL Cassandra Distributed Database Prasanna Bagade, Ashish Chandra, Aditya B.Dhende Pune Institute of Computer Technology University of Pune Pune ABSTRACT: The
Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam [email protected]
Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam [email protected] Agenda The rise of Big Data & Hadoop MySQL in the Big Data Lifecycle MySQL Solutions for Big Data Q&A
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB Planet Size Data!? Gartner s 10 key IT trends for 2012 unstructured data will grow some 80% over the course of the next
MADOCA II Data Logging System Using NoSQL Database for SPring-8
MADOCA II Data Logging System Using NoSQL Database for SPring-8 A.Yamashita and M.Kago SPring-8/JASRI, Japan NoSQL WED3O03 OR: How I Learned to Stop Worrying and Love Cassandra Outline SPring-8 logging
High Throughput Computing on P2P Networks. Carlos Pérez Miguel [email protected]
High Throughput Computing on P2P Networks Carlos Pérez Miguel [email protected] Overview High Throughput Computing Motivation All things distributed: Peer-to-peer Non structured overlays Structured
Cassandra A Decentralized, Structured Storage System
Cassandra A Decentralized, Structured Storage System Avinash Lakshman and Prashant Malik Facebook Published: April 2010, Volume 44, Issue 2 Communications of the ACM http://dl.acm.org/citation.cfm?id=1773922
.NET User Group Bern
.NET User Group Bern Roger Rudin bbv Software Services AG [email protected] Agenda What is NoSQL Understanding the Motivation behind NoSQL MongoDB: A Document Oriented Database NoSQL Use Cases What is
NoSQL Database Options
NoSQL Database Options Introduction For this report, I chose to look at MongoDB, Cassandra, and Riak. I chose MongoDB because it is quite commonly used in the industry. I chose Cassandra because it has
Big Data Analytics with Cassandra, Spark & MLLib
Big Data Analytics with Cassandra, Spark & MLLib Matthias Niehoff AGENDA Spark Basics In A Cluster Cassandra Spark Connector Use Cases Spark Streaming Spark SQL Spark MLLib Live Demo CQL QUERYING LANGUAGE
Cassandra A Decentralized Structured Storage System
Cassandra A Decentralized Structured Storage System Avinash Lakshman, Prashant Malik LADIS 2009 Anand Iyer CS 294-110, Fall 2015 Historic Context Early & mid 2000: Web applicaoons grow at tremendous rates
NoSQL and Hadoop Technologies On Oracle Cloud
NoSQL and Hadoop Technologies On Oracle Cloud Vatika Sharma 1, Meenu Dave 2 1 M.Tech. Scholar, Department of CSE, Jagan Nath University, Jaipur, India 2 Assistant Professor, Department of CSE, Jagan Nath
Benchmarking Top NoSQL Databases Apache Cassandra, Couchbase, HBase, and MongoDB Originally Published: April 13, 2015 Revised: May 27, 2015
Benchmarking Top NoSQL Databases Apache Cassandra, Couchbase, HBase, and MongoDB Originally Published: April 13, 2015 Revised: May 27, 2015 http://www.endpoint.com/ Table of Contents Executive Summary...
Ankush Cluster Manager - Cassandra Technology User Guide
Ankush Cluster Manager - Cassandra Technology User Guide Ankush User s Guide for Cassandra, Version 1.5 This manual, and the accompanying software and other documentation, is protected by U.S. and international
Introduction to Big Data Training
Introduction to Big Data Training The quickest way to be introduce with NOSQL/BIG DATA offerings Learn and experience Big Data Solutions including Hadoop HDFS, Map Reduce, NoSQL DBs: Document Based DB
Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world
Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3
HBase A Comprehensive Introduction. James Chin, Zikai Wang Monday, March 14, 2011 CS 227 (Topics in Database Management) CIT 367
HBase A Comprehensive Introduction James Chin, Zikai Wang Monday, March 14, 2011 CS 227 (Topics in Database Management) CIT 367 Overview Overview: History Began as project by Powerset to process massive
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and
Real-Time Big Data in practice with Cassandra. Michaël Figuière @mfiguiere
Real-Time Big Data in practice with Cassandra Michaël Figuière @mfiguiere Speaker Michaël Figuière @mfiguiere 2 Ring Architecture Cassandra 3 Ring Architecture Replica Replica Replica 4 Linear Scalability
NOSQL DATABASES AND CASSANDRA
NOSQL DATABASES AND CASSANDRA Semester Project: Advanced Databases DECEMBER 14, 2015 WANG CAN, EVABRIGHT BERTHA Université Libre de Bruxelles 0 Preface The goal of this report is to introduce the new evolving
Windows Azure Storage Scaling Cloud Storage Andrew Edwards Microsoft
Windows Azure Storage Scaling Cloud Storage Andrew Edwards Microsoft Agenda: Windows Azure Storage Overview Architecture Key Design Points 2 Overview Windows Azure Storage Cloud Storage - Anywhere and
NoSQL Databases. Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015
NoSQL Databases Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015 Database Landscape Source: H. Lim, Y. Han, and S. Babu, How to Fit when No One Size Fits., in CIDR,
Tips and Tricks for Using Oracle TimesTen In-Memory Database in the Application Tier
Tips and Tricks for Using Oracle TimesTen In-Memory Database in the Application Tier Simon Law TimesTen Product Manager, Oracle Meet The Experts: Andy Yao TimesTen Product Manager, Oracle Gagan Singh Senior
An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database
An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct
CQL for Cassandra 2.0 & 2.1
CQL for Cassandra 2.0 & 2.1 Documentation January 21, 2016 Apache, Apache Cassandra, Apache Hadoop, Hadoop and the eye logo are trademarks of the Apache Software Foundation 2016 DataStax, Inc. All rights
A Brief Outline on Bigdata Hadoop
A Brief Outline on Bigdata Hadoop Twinkle Gupta 1, Shruti Dixit 2 RGPV, Department of Computer Science and Engineering, Acropolis Institute of Technology and Research, Indore, India Abstract- Bigdata is
Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.
Oracle BI EE Implementation on Netezza Prepared by SureShot Strategies, Inc. The goal of this paper is to give an insight to Netezza architecture and implementation experience to strategize Oracle BI EE
Client Overview. Engagement Situation. Key Requirements
Client Overview Our client is one of the leading providers of business intelligence systems for customers especially in BFSI space that needs intensive data analysis of huge amounts of data for their decision
How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns
How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns Table of Contents Abstract... 3 Introduction... 3 Definition... 3 The Expanding Digitization
Scaling Database Performance in Azure
Scaling Database Performance in Azure Results of Microsoft-funded Testing Q1 2015 2015 2014 ScaleArc. All Rights Reserved. 1 Test Goals and Background Info Test Goals and Setup Test goals Microsoft commissioned
Database Scalability and Oracle 12c
Database Scalability and Oracle 12c Marcelle Kratochvil CTO Piction ACE Director All Data/Any Data [email protected] Warning I will be covering topics and saying things that will cause a rethink in
Amazon Redshift & Amazon DynamoDB Michael Hanisch, Amazon Web Services Erez Hadas-Sonnenschein, clipkit GmbH Witali Stohler, clipkit GmbH 2014-05-15
Amazon Redshift & Amazon DynamoDB Michael Hanisch, Amazon Web Services Erez Hadas-Sonnenschein, clipkit GmbH Witali Stohler, clipkit GmbH 2014-05-15 2014 Amazon.com, Inc. and its affiliates. All rights
Hadoop IST 734 SS CHUNG
Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to
Comparing the Hadoop Distributed File System (HDFS) with the Cassandra File System (CFS)
Comparing the Hadoop Distributed File System (HDFS) with the Cassandra File System (CFS) White Paper BY DATASTAX CORPORATION August 2013 1 Table of Contents Abstract 3 Introduction 3 Overview of HDFS 4
ORACLE DATABASE 10G ENTERPRISE EDITION
ORACLE DATABASE 10G ENTERPRISE EDITION OVERVIEW Oracle Database 10g Enterprise Edition is ideal for enterprises that ENTERPRISE EDITION For enterprises of any size For databases up to 8 Exabytes in size.
[Hadoop, Storm and Couchbase: Faster Big Data]
[Hadoop, Storm and Couchbase: Faster Big Data] With over 8,500 clients, LivePerson is the global leader in intelligent online customer engagement. With an increasing amount of agent/customer engagements,
Time series IoT data ingestion into Cassandra using Kaa
Time series IoT data ingestion into Cassandra using Kaa Andrew Shvayka [email protected] Agenda Data ingestion challenges Why Kaa? Why Cassandra? Reference architecture overview Hands-on Sandbox
Axway API Gateway. Version 7.4.1
K E Y P R O P E R T Y S T O R E U S E R G U I D E Axway API Gateway Version 7.4.1 26 January 2016 Copyright 2016 Axway All rights reserved. This documentation describes the following Axway software: Axway
Introduction to Multi-Data Center Operations with Apache Cassandra and DataStax Enterprise
Introduction to Multi-Data Center Operations with Apache Cassandra and DataStax Enterprise White Paper BY DATASTAX CORPORATION October 2013 1 Table of Contents Abstract 3 Introduction 3 The Growth in Multiple
MapReduce with Apache Hadoop Analysing Big Data
MapReduce with Apache Hadoop Analysing Big Data April 2010 Gavin Heavyside [email protected] About Journey Dynamics Founded in 2006 to develop software technology to address the issues
MyISAM Default Storage Engine before MySQL 5.5 Table level locking Small footprint on disk Read Only during backups GIS and FTS indexing Copyright 2014, Oracle and/or its affiliates. All rights reserved.
MakeMyTrip CUSTOMER SUCCESS STORY
MakeMyTrip CUSTOMER SUCCESS STORY MakeMyTrip is the leading travel site in India that is running two ClustrixDB clusters as multi-master in two regions. It removed single point of failure. MakeMyTrip frequently
THE REALITIES OF NOSQL BACKUPS
THE REALITIES OF NOSQL BACKUPS White Paper Trilio Data, Inc. March 2015 1 THE REALITIES OF NOSQL BACKUPS TABLE OF CONTENTS INTRODUCTION... 2 NOSQL DATABASES... 2 PROBLEM: LACK OF COMPREHENSIVE BACKUP AND
References. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline
References Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of
Introduction to Database Systems CSE 444
Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon References Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of
Introduction to Multi-Data Center Operations with Apache Cassandra, Hadoop, and Solr WHITE PAPER
Introduction to Multi-Data Center Operations with Apache Cassandra, Hadoop, and Solr WHITE PAPER By DataStax Corporation August 2012 Contents Introduction...3 The Growth in Multiple Data Centers...3 Why
DBA'S GUIDE TO NOSQL APACHE CASSANDRA
DBA'S GUIDE TO NOSQL APACHE CASSANDRA THE ENLIGHTENED DBA Smashwords Edition Copyright 2014 The Enlightened DBA This ebook is licensed for your personal enjoyment only. This ebook may not be re-sold or
HiBench Introduction. Carson Wang ([email protected]) Software & Services Group
HiBench Introduction Carson Wang ([email protected]) Agenda Background Workloads Configurations Benchmark Report Tuning Guide Background WHY Why we need big data benchmarking systems? WHAT What is
Moving From Hadoop to Spark
+ Moving From Hadoop to Spark Sujee Maniyam Founder / Principal @ www.elephantscale.com [email protected] Bay Area ACM meetup (2015-02-23) + HI, Featured in Hadoop Weekly #109 + About Me : Sujee
May 6, 2013. DataStax Cassandra South Bay Meetup. Cassandra Modeling. Best Practices and Examples. Jay Patel Architect, Platform Systems @pateljay3001
May 6, 2013 DataStax Cassandra South Bay Meetup Cassandra Modeling Best Practices and Examples Jay Patel Architect, Platform Systems @pateljay3001 That s me Technical Architect @ ebay Passion for building
An Approach to Implement Map Reduce with NoSQL Databases
www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 4 Issue 8 Aug 2015, Page No. 13635-13639 An Approach to Implement Map Reduce with NoSQL Databases Ashutosh
