YouTube Vitess. Cloud-Native MySQL. Oracle OpenWorld Conference October 26, Anthony Yeh, Software Engineer, YouTube.
|
|
|
- Vernon Richards
- 10 years ago
- Views:
Transcription
1 YouTube Vitess Cloud-Native MySQL Oracle OpenWorld Conference October 26, 2015 Anthony Yeh, Software Engineer, YouTube
2 Spoiler Alert
3 Spoilers 1. History of Vitess 2. What is Cloud-Native Computing? 3. What is Cloud-Native MySQL? 4. Vitess Architecture 5. Transparent Live Resharding 6. Example Resharding Workflow 7. Cloud Scaling Benchmarks 8. Vitess Roadmap 9. It was all a dream
4 Histoir de Vitess
5 Vitess 1.0 golang.org YouTube is a MySQL app Vitess started in YouTube data centers Connection-pooling proxy Early adopter of Go (golang.org) First commit (in original repo) in years before Go 1.0 Cheap connections, goroutines In production at YouTube since
6 Vitess 1.5 YouTube moved into Borg [1] Google MySQL Adapted Vitess to Borg environment Dynamically scheduled container cluster Over time, Vitess evolved within Borg Database protection Query rewriting/blacklisting Row-based cache Shard routing Cluster management Monitoring [1] 6
7 Growing with YouTube YouTube stats [1] >1B users 400 hrs/min of new videos (24K s/s) 80% of views from outside U.S. Schema constantly evolving Content ID, channel admin, live streams, video editing, music Developed live resharding [1] youtube.com/yt/press/statistics.html 7
8 Vitess 2.0 Vitess in 2014 Worked great in YouTube, but outside Users can't get past build step Custom patched MySQL No docs for setting up a deployment Along comes Kubernetes Like open-source Borg Vitess in 2015 Bring-your-own MySQL 5.6 Docker images Out-of-the-box deployment config Runs on any cloud platform supported by Kubernetes (AWS, Azure, GCP, ) Lots more documentation Step-by-step guides 8
9 Cloud-Native Computing
10 Cloud Native Computing Foundation (cncf.io) Distributed systems paradigm Container packaged Dependency management Resource isolation Dynamically scheduled Increase utilization Simplify operations Microservices-oriented Distributed applications Service endpoints 10
11 Kubernetes (κυβερνήτης) kubernetes.io Provides glue for distributed apps Dynamic scheduling Declarative deployment config Service discovery High-availability replica pools Rolling updates Component grouping/metadata Abstracts cloud-platform-specific pieces VM instance creation and config Networking config External load balancers Persistent storage volumes (PD, EBS) 11
12 Cloud-Native MySQL
13 Pets vs. Cattle [1] Your servers might be pets if... You can log into them by hostname or IP. You load data onto them and tell them what to do. You try to fix them when they go down. Your app knows which servers to send which queries to. You know when your last master failover was. Your servers might be cattle if... You know only that a bunch of them are out there, somewhere. They go off and join the herd without being told to. You wait for others to take over their jobs when they go down. Your app throws queries over a magic wall and results appear. You have a monitoring graph of failovers per day. [1] Noah Slater, blog.engineyard.com/2014/pets-vs-cattle 13
14 Dynamic Scale-Out Scale-out by adding more cattle to the herd $ kubectl create -f vttablet.yaml Dynamic scheduling means you don't worry about... What server is this going to be launched on? How is it going to get a snapshot of existing data? How is it going to find and connect to the master? How are app servers going to find out it exists? With live resharding, it even works for scaling master traffic. 14
15 Query Routing as a Service Separation of concerns App doesn't know anything about cluster topology MySQL doesn't know it's inside Kubernetes Vitess connects everything Distributed Vitess components Use service discovery to find each other Communicate over defined RPC APIs 15
16 Vitess Architecture
17 Vitess Components app server vtctld etcd shard vttablet app server mysqld app server vtgate master vtgate vttablet vttablet vttablet vttablet batch job mysqld mysqld mysqld mysqld App Vitess replicas batch replicas 17
18 Vitess Concepts Vitess range custom unsharded Keyspace Shard master master master master master Tablet replica replica replica replica replica Tablet batch batch batch batch batch Tablet 18
19 Range-Based Sharding Basic Hashing Consistent Hashing User ID Shard User ID Keyspace ID Shard F FF FF FF cursor = conn.cursor('test_keyspace', 'replica', keyspace_ids=[my_hash(user_id)]) cursor.execute( 'SELECT message FROM messages WHERE user_id=%(user_id)s ORDER BY time_created_ns', {'user_id': user_id}) return [row[0] for row in cursor.fetchall()] 19
20 Transparent Live Resharding
21 Vitess Resharding User ID Keyspace ID Shard Key Range Before F FF FF FF F FF FF FF F FF FF FF FF FF FF FF User ID Keyspace ID Shard Key Range After F FF FF FF C F FF FF FF F FF FF FF BF FF FF FF C0- C FF FF FF FF 21
22 Transparent to the Application The app has no knowledge of what shards exist. The app keeps running normally during resharding. def my_hash(user_id): m = hashlib.md5() m.update(uint64.pack(user_id)) return m.digest()[:8] cursor = conn.cursor('test_keyspace', 'replica', keyspace_ids=[my_hash(user_id)]) cursor.execute( 'SELECT message FROM messages WHERE user_id=%(user_id)s ORDER BY time_created_ns', {'user_id': user_id}) return [row[0] for row in cursor.fetchall()] 22
23 Live Migration Old and new shards overlap during migration. Old Shards Running New Shards Running DB Read Availability DB Write Availability DB Write Downtime DB Write Availability < 5 seconds 23
24 Masters are cattle too Transparent Live Resharding means you can scale master traffic too, by adding cattle. When resharding stops being scary You don't have to predict and provision for years of growth. Provision for good utilization. Start small and scale out as needed. Cool down hot shards one at a time. 24
25 Resharding Workflow vitess.io/user-guide/sharding-kubernetes.html
26 Prepare Schema Initial sharding Add Keyspace ID column Populate along with new rows Backfill Keyspace ID for existing rows Example: Multi-tenant Guestbook (vitess.io/getting-started/) CREATE TABLE messages ( page BIGINT(20) UNSIGNED, time_created_ns BIGINT(20) UNSIGNED, keyspace_id BIGINT(20) UNSIGNED, message VARCHAR(10000), PRIMARY KEY (page, time_created_ns) ) ENGINE=InnoDB 26
27 Provision New Shards Moo $ kubectl create -f vttablet.yaml 27
28 Copy Data Split Clone Pauses replication on a backup slave So it's consistent with a known GTID set Streams rows to new shards Doesn't use extra disk space on source shard $ vtworker SplitClone --strategy=-populate_blp_checkpoint test_keyspace/0 28
29 Filtered Replication Vitess annotates DMLs with the Keyspace ID New shards subscribe to filtered replication stream Let new shards catch up to old shard master INSERT INTO messages (page,...) VALUES (...) Filtered Replication 80-C0 80- C0-29
30 Verify Data Integrity Split Diff Pauses replication on backup slave in old shard Pauses filtered replication on new shards At the equivalent GTID set Performs a row-by-row comparison Also detects rows only present on one side $ vtworker SplitDiff test_keyspace/0 30
31 Shard Migration Migrate non-master traffic to new shards $ vtctlclient MigrateServedTypes test_keyspace/0 replica Migrate master traffic to new shards $ vtctlclient MigrateServedTypes test_keyspace/0 master 31
32 Scaling Benchmarks
33 Scaling Writes by adding Shards 33
34 Scaling Reads by adding Replicas 34
35 Vitess Roadmap
36 Vitess Stable releases API freeze (no breaking changes) Release changelogs (github.com/youtube/vitess/releases) Versioned Docker images (hub.docker.com/u/vitess/) What's new in 2.0 Deployment configs for Kubernetes 1.0 Users reported success on both AWS and Google Cloud Platform Official client libraries for Java, Python, PHP, Go, and Hadoop MySQL 5.6 support Built-in cloud backup/restore HTTP/2-based grpc protocol (grpc.io) 36
37 Future Plans Drop-in drivers for JDBC, PDO, PEP 249 Already have Go driver for database/sql Cross-shard joins/aggregations Automatic master election Automated out-of-band schema changes 37
38 Resources Try Vitess vitess.io/getting-started/ vitess.io/user-guide/sharding-kubernetes.html Contribute github.com/youtube/vitess Cloud Native Computing Foundation cncf.io Kubernetes kubernetes.io Contact Us Get Updates groups.google.com/d/forum/vitess-announce
Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings
Solution Brief Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings Introduction Accelerating time to market, increasing IT agility to enable business strategies, and improving
Elastic Scalability in MySQL Fabric using OpenStack
Elastic Scalability in MySQL Fabric using OpenStack Mats Kindahl Senior Principal Software Developer Narayanan Venkateswaran Principal Software Developer Copyright 2014, Oracle and/or its affiliates. All
Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam [email protected]
Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam [email protected] Agenda The rise of Big Data & Hadoop MySQL in the Big Data Lifecycle MySQL Solutions for Big Data Q&A
Overview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB
Overview of Databases On MacOS Karl Kuehn Automation Engineer RethinkDB Session Goals Introduce Database concepts Show example players Not Goals: Cover non-macos systems (Oracle) Teach you SQL Answer what
Zero Downtime Deployments with Database Migrations. Bob Feldbauer twitter: @bobfeldbauer email: [email protected]
Zero Downtime Deployments with Database Migrations Bob Feldbauer twitter: @bobfeldbauer email: [email protected] Deployments Two parts to deployment: Application code Database schema changes (migrations,
Solving Large-Scale Database Administration with Tungsten
Solving Large-Scale Database Administration with Tungsten Neil Armitage, Sr. Software Engineer Robert Hodges, CEO. Introducing Continuent The leading provider of clustering and replication for open source
MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)
MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!) Erdélyi Ernő, Component Soft Kft. [email protected] www.component.hu 2013 (c) Component Soft Ltd Leading Hadoop Vendor Copyright 2013,
Practical Cassandra. Vitalii Tymchyshyn [email protected] @tivv00
Practical Cassandra NoSQL key-value vs RDBMS why and when Cassandra architecture Cassandra data model Life without joins or HDD space is cheap today Hardware requirements & deployment hints Vitalii Tymchyshyn
DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2
DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing Slide 1 Slide 3 A style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet.
Welcome to Virtual Developer Day MySQL!
Welcome to Virtual Developer Day MySQL! Keynote: Developer and DBA Guide to What s New in MySQL Andrew Morgan - MySQL Product Management @andrewmorgan www.clusterdb.com 1 Program Agenda 1:00 PM Keynote:
MyISAM Default Storage Engine before MySQL 5.5 Table level locking Small footprint on disk Read Only during backups GIS and FTS indexing Copyright 2014, Oracle and/or its affiliates. All rights reserved.
MongoDB Developer and Administrator Certification Course Agenda
MongoDB Developer and Administrator Certification Course Agenda Lesson 1: NoSQL Database Introduction What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL Types of NoSQL
High-Availability in the Cloud Architectural Best Practices
1 High-Availability in the Cloud Architectural Best Practices Josh Fraser, VP Business Development, RightScale Brian Adler, Sr. Professional Services Architect 2 # RightScale World s #1 cloud management
XDB. Shared MySQL hosting at Facebook scale. Evan Elias Percona Live MySQL Conference, April 2015
XDB Shared MySQL hosting at Facebook scale Evan Elias Percona Live MySQL Conference, April 2015 What is XDB? In-house system for self-service database creation Web UI API for automated creation and management
Achieving Zero Downtime and Accelerating Performance for WordPress
Application Note Achieving Zero Downtime and Accelerating Performance for WordPress Executive Summary WordPress is the world s most popular open source website content management system (CMS). As usage
F1: A Distributed SQL Database That Scales. Presentation by: Alex Degtiar ([email protected]) 15-799 10/21/2013
F1: A Distributed SQL Database That Scales Presentation by: Alex Degtiar ([email protected]) 15-799 10/21/2013 What is F1? Distributed relational database Built to replace sharded MySQL back-end of AdWords
How To Choose Between A Relational Database Service From Aws.Com
The following text is partly taken from the Oracle book Middleware and Cloud Computing It is available from Amazon: http://www.amazon.com/dp/0980798000 Cloud Databases and Oracle When designing your cloud
Storing and Processing Sensor Networks Data in Public Clouds
UWB CSS 600 Storing and Processing Sensor Networks Data in Public Clouds Aysun Simitci Table of Contents Introduction... 2 Cloud Databases... 2 Advantages and Disadvantages of Cloud Databases... 3 Amazon
Scalable Architecture on Amazon AWS Cloud
Scalable Architecture on Amazon AWS Cloud Kalpak Shah Founder & CEO, Clogeny Technologies [email protected] 1 * http://www.rightscale.com/products/cloud-computing-uses/scalable-website.php 2 Architect
The Cloud to the rescue!
The Cloud to the rescue! What the Google Cloud Platform can make for you Aja Hammerly, Developer Advocate twitter.com/thagomizer_rb So what is the cloud? The Google Cloud Platform The Google Cloud Platform
MySQL Strategy. Morten Andersen, MySQL Enterprise Sales. Copyright 2014 Oracle and/or its affiliates. All rights reserved.
MySQL Strategy Morten Andersen, MySQL Enterprise Sales Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not
From the Monolith to Microservices: Evolving Your Architecture to Scale. Randy Shoup @randyshoup linkedin.com/in/randyshoup
From the Monolith to Microservices: Evolving Your Architecture to Scale Randy Shoup @randyshoup linkedin.com/in/randyshoup Background Consulting CTO at Randy Shoup Consulting o o Helping companies from
Distributed Scheduling with Apache Mesos in the Cloud. PhillyETE - April, 2015 Diptanu Gon Choudhury @diptanu
Distributed Scheduling with Apache Mesos in the Cloud PhillyETE - April, 2015 Diptanu Gon Choudhury @diptanu Who am I? Distributed Systems/Infrastructure Engineer in the Platform Engineering Group Design
Scalability of web applications. CSCI 470: Web Science Keith Vertanen
Scalability of web applications CSCI 470: Web Science Keith Vertanen Scalability questions Overview What's important in order to build scalable web sites? High availability vs. load balancing Approaches
Cloud Based Application Architectures using Smart Computing
Cloud Based Application Architectures using Smart Computing How to Use this Guide Joyent Smart Technology represents a sophisticated evolution in cloud computing infrastructure. Most cloud computing products
Database Scalability and Oracle 12c
Database Scalability and Oracle 12c Marcelle Kratochvil CTO Piction ACE Director All Data/Any Data [email protected] Warning I will be covering topics and saying things that will cause a rethink in
High Availability Solutions for the MariaDB and MySQL Database
High Availability Solutions for the MariaDB and MySQL Database 1 Introduction This paper introduces recommendations and some of the solutions used to create an availability or high availability environment
WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS
WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS Managing and analyzing data in the cloud is just as important as it is anywhere else. To let you do this, Windows Azure provides a range of technologies
MySQL Cluster 7.0 - New Features. Johan Andersson MySQL Cluster Consulting [email protected]
MySQL Cluster 7.0 - New Features Johan Andersson MySQL Cluster Consulting [email protected] Mat Keep MySQL Cluster Product Management [email protected] Copyright 2009 MySQL Sun Microsystems. The
Enterprise-level EE: Uptime, Speed, and Scale
Enterprise-level EE: Uptime, Speed, and Scale Reaching beyond EE tools and techniques to service enterprise clients 1. Intro 2. In-memory Caching 3. Load Balancing 4. Multi-environment setup with Docker
Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity
P3 InfoTech Solutions Pvt. Ltd http://www.p3infotech.in July 2013 Created by P3 InfoTech Solutions Pvt. Ltd., http://p3infotech.in 1 Web Application Deployment in the Cloud Using Amazon Web Services From
Implementing the Future of PostgreSQL Clustering with Tungsten
Implementing the Future of PostgreSQL Clustering with Tungsten Robert Hodges CTO, Continuent, Inc. Agenda / Introductions / Framing the Problem: Clustering for the Masses / Introducing Tungsten / Adapting
Cloud Computing: Meet the Players. Performance Analysis of Cloud Providers
BASEL UNIVERSITY COMPUTER SCIENCE DEPARTMENT Cloud Computing: Meet the Players. Performance Analysis of Cloud Providers Distributed Information Systems (CS341/HS2010) Report based on D.Kassman, T.Kraska,
Oracle Net Service Name Resolution
Oracle Net Service Name Resolution Getting Rid of the TNSNAMES.ORA File! Simon Pane Oracle Database Principal Consultant March 19, 2015 ABOUT ME Working with the Oracle DB since version 6 Oracle Certified
Search Big Data with MySQL and Sphinx. Mindaugas Žukas www.ivinco.com
Search Big Data with MySQL and Sphinx Mindaugas Žukas www.ivinco.com Agenda Big Data Architecture Factors and Technologies MySQL and Big Data Sphinx Search Server overview Case study: building a Big Data
Why Zalando trusts in PostgreSQL
Why Zalando trusts in PostgreSQL A developer s view on using the most advanced open-source database Henning Jacobs - Technical Lead Platform/Software Zalando GmbH Valentine Gogichashvili - Technical Lead
Sharding with postgres_fdw
Sharding with postgres_fdw Postgres Open 2013 Chicago Stephen Frost [email protected] Resonate, Inc. Digital Media PostgreSQL Hadoop [email protected] http://www.resonateinsights.com Stephen
A programming model in Cloud: MapReduce
A programming model in Cloud: MapReduce Programming model and implementation developed by Google for processing large data sets Users specify a map function to generate a set of intermediate key/value
Web Application Hosting in the AWS Cloud Best Practices
Web Application Hosting in the AWS Cloud Best Practices May 2010 Matt Tavis Page 1 of 12 Abstract Highly-available and scalable web hosting can be a complex and expensive proposition. Traditional scalable
Scaling Database Performance in Azure
Scaling Database Performance in Azure Results of Microsoft-funded Testing Q1 2015 2015 2014 ScaleArc. All Rights Reserved. 1 Test Goals and Background Info Test Goals and Setup Test goals Microsoft commissioned
Scaling Pinterest. Yash Nelapati Ascii Artist. Pinterest Engineering. Saturday, August 31, 13
Scaling Pinterest Yash Nelapati Ascii Artist Pinterest is... An online pinboard to organize and share what inspires you. Growth March 2010 Page views per day Mar 2010 Jan 2011 Jan 2012 May 2012 Growth
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...
TECHNOLOGY WHITE PAPER Jun 2012
TECHNOLOGY WHITE PAPER Jun 2012 Technology Stack C# Windows Server 2008 PHP Amazon Web Services (AWS) Route 53 Elastic Load Balancing (ELB) Elastic Compute Cloud (EC2) Amazon RDS Amazon S3 Elasticache
Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture
Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent
Comparing MySQL and Postgres 9.0 Replication
Comparing MySQL and Postgres 9.0 Replication An EnterpriseDB White Paper For DBAs, Application Developers, and Enterprise Architects March 2010 Table of Contents Introduction... 3 A Look at the Replication
Docker : devops, shared registries, HPC and emerging use cases. François Moreews & Olivier Sallou
Docker : devops, shared registries, HPC and emerging use cases François Moreews & Olivier Sallou Presentation Docker is an open-source engine to easily create lightweight, portable, self-sufficient containers
Managing MySQL Scale Through Consolidation
Hello Managing MySQL Scale Through Consolidation Percona Live 04/15/15 Chris Merz, @merzdba DB Systems Architect, SolidFire Enterprise Scale MySQL Challenges Many MySQL instances (10s-100s-1000s) Often
Docker Java Application with Solr, Mongo, & Cassandra: Design, Deployment, Service Discovery, and Management in Production
Docker Java Application with Solr, Mongo, & Cassandra: Design, Deployment, Service Discovery, and Management in Production You can clone this sample Names Directory Java application from GitHub. git clone
Web Application Hosting in the AWS Cloud Best Practices
Web Application Hosting in the AWS Cloud Best Practices September 2012 Matt Tavis, Philip Fitzsimons Page 1 of 14 Abstract Highly available and scalable web hosting can be a complex and expensive proposition.
Oracle Database 12c Plug In. Switch On. Get SMART.
Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.
Virtualizing Apache Hadoop. June, 2012
June, 2012 Table of Contents EXECUTIVE SUMMARY... 3 INTRODUCTION... 3 VIRTUALIZING APACHE HADOOP... 4 INTRODUCTION TO VSPHERE TM... 4 USE CASES AND ADVANTAGES OF VIRTUALIZING HADOOP... 4 MYTHS ABOUT RUNNING
CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment
CloudCenter Full Lifecycle Management An application-defined approach to deploying and managing applications in any datacenter or cloud environment CloudCenter Full Lifecycle Management Page 2 Table of
Portable Scale-Out Benchmarks for MySQL. MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc.
Portable Scale-Out Benchmarks for MySQL MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc. Continuent 2008 Agenda / Introductions / Scale-Out Review / Bristlecone Performance Testing Tools /
Tushar Joshi Turtle Networks Ltd
MySQL Database for High Availability Web Applications Tushar Joshi Turtle Networks Ltd www.turtle.net Overview What is High Availability? Web/Network Architecture Applications MySQL Replication MySQL Clustering
SCALABLE DATA SERVICES
1 SCALABLE DATA SERVICES 2110414 Large Scale Computing Systems Natawut Nupairoj, Ph.D. Outline 2 Overview MySQL Database Clustering GlusterFS Memcached 3 Overview Problems of Data Services 4 Data retrieval
NetflixOSS A Cloud Native Architecture
NetflixOSS A Cloud Native Architecture LASER Session 5 Availability September 2013 Adrian Cockcroft @adrianco @NetflixOSS http://www.linkedin.com/in/adriancockcroft Failure Modes and Effects Failure Mode
Preparing for the Big Oops! Disaster Recovery Sites for MySQL. Robert Hodges, CEO, Continuent MySQL Conference 2011
Preparing for the Big Oops! Disaster Recovery Sites for Robert Hodges, CEO, Continuent Conference 2011 Topics / Introductions / A Motivating Story / Master / Slave Disaster Recovery Replication Tungsten
Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015
Pulsar Realtime Analytics At Scale Tony Ng April 14, 2015 Big Data Trends Bigger data volumes More data sources DBs, logs, behavioral & business event streams, sensors Faster analysis Next day to hours
Big Data Analytics Platform @ Nokia
Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform
In Memory Accelerator for MongoDB
In Memory Accelerator for MongoDB Yakov Zhdanov, Director R&D GridGain Systems GridGain: In Memory Computing Leader 5 years in production 100s of customers & users Starts every 10 secs worldwide Over 15,000,000
White Paper. Optimizing the Performance Of MySQL Cluster
White Paper Optimizing the Performance Of MySQL Cluster Table of Contents Introduction and Background Information... 2 Optimal Applications for MySQL Cluster... 3 Identifying the Performance Issues.....
The Future of PostgreSQL High Availability Robert Hodges - Continuent, Inc. Simon Riggs - 2ndQuadrant
The Future of PostgreSQL High Availability Robert Hodges - Continuent, Inc. Simon Riggs - 2ndQuadrant Agenda / Introductions / Framing the High Availability (HA) Problem / Hot Standby + Log Streaming /
Dave Stokes MySQL Community Manager
The Proper Care and Feeding of a MySQL Server for Busy Linux Admins Dave Stokes MySQL Community Manager Email: [email protected] Twiter: @Stoker Slides: slideshare.net/davidmstokes Safe Harbor Agreement
Future-Proofing MySQL for the Worldwide Data Revolution
Future-Proofing MySQL for the Worldwide Data Revolution Robert Hodges, CEO. What is Future-Proo!ng? Future-proo!ng = creating systems that last while parts change and improve MySQL is not losing out to
Deploying and Managing SolrCloud in the Cloud ApacheCon, April 8, 2014 Timothy Potter. Search Discover Analyze
Deploying and Managing SolrCloud in the Cloud ApacheCon, April 8, 2014 Timothy Potter Search Discover Analyze My SolrCloud Experience Currently, working on scaling up to a 200+ node deployment at LucidWorks
Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344
Where We Are Introduction to Data Management CSE 344 Lecture 25: DBMS-as-a-service and NoSQL We learned quite a bit about data management see course calendar Three topics left: DBMS-as-a-service and NoSQL
SQL Server 2014 New Features/In- Memory Store. Juergen Thomas Microsoft Corporation
SQL Server 2014 New Features/In- Memory Store Juergen Thomas Microsoft Corporation AGENDA 1. SQL Server 2014 what and when 2. SQL Server 2014 In-Memory 3. SQL Server 2014 in IaaS scenarios 2 SQL Server
Cloud Computing. Lecture 24 Cloud Platform Comparison 2014-2015
Cloud Computing Lecture 24 Cloud Platform Comparison 2014-2015 1 Up until now Introduction, Definition of Cloud Computing Pre-Cloud Large Scale Computing: Grid Computing Content Distribution Networks Cycle-Sharing
Building Success on Acquia Cloud:
Building Success on Acquia Cloud: 10 Layers of PaaS TECHNICAL Guide Table of Contents Executive Summary.... 3 Introducing the 10 Layers of PaaS... 4 The Foundation: Five Layers of PaaS Infrastructure...
How to choose High Availability solutions for MySQL MySQL UC 2010 Yves Trudeau Read by Peter Zaitsev. Percona Inc MySQLPerformanceBlog.
How to choose High Availability solutions for MySQL MySQL UC 2010 Yves Trudeau Read by Peter Zaitsev Percona Inc MySQLPerformanceBlog.com -2- About us http://www.percona.com http://www.mysqlperformanceblog.com/
MySQL and Virtualization Guide
MySQL and Virtualization Guide Abstract This is the MySQL and Virtualization extract from the MySQL Reference Manual. For legal information, see the Legal Notices. For help with using MySQL, please visit
<Insert Picture Here> Oracle In-Memory Database Cache Overview
Oracle In-Memory Database Cache Overview Simon Law Product Manager The following is intended to outline our general product direction. It is intended for information purposes only,
Hypertable Architecture Overview
WHITE PAPER - MARCH 2012 Hypertable Architecture Overview Hypertable is an open source, scalable NoSQL database modeled after Bigtable, Google s proprietary scalable database. It is written in C++ for
Adding Indirection Enhances Functionality
Adding Indirection Enhances Functionality The Story Of A Proxy Mark Riddoch & Massimiliano Pinto Introductions Mark Riddoch Staff Engineer, VMware Formally Chief Architect, MariaDB Corporation Massimiliano
<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store
Oracle NoSQL Database A Distributed Key-Value Store Charles Lamb, Consulting MTS The following is intended to outline our general product direction. It is intended for information
Open Source Technologies on Microsoft Azure
Open Source Technologies on Microsoft Azure A Survey @DChappellAssoc Copyright 2014 Chappell & Associates The Main Idea i Open source technologies are a fundamental part of Microsoft Azure The Big Questions
Storage and Disaster Recovery
Storage and Disaster Recovery Matt Tavis Principal Solutions Architect The Business Continuity Continuum High Data Backup Disaster Recovery High, Storage Backup and Disaster Recovery form a continuum of
Big Data with Component Based Software
Big Data with Component Based Software Who am I Erik who? Erik Forsberg Linköping University, 1998-2003. Computer Science programme + lot's of time at Lysator ACS At Opera Software
MySQL High-Availability and Scale-Out architectures
MySQL High-Availability and Scale-Out architectures Oli Sennhauser Senior Consultant [email protected] 1 Introduction Who we are? What we want? 2 Table of Contents Scale-Up vs. Scale-Out MySQL Replication
Cloud Computing Trends
UT DALLAS Erik Jonsson School of Engineering & Computer Science Cloud Computing Trends What is cloud computing? Cloud computing refers to the apps and services delivered over the internet. Software delivered
Best Practices for Using MySQL in the Cloud
Best Practices for Using MySQL in the Cloud Luis Soares, Sr. Software Engineer, MySQL Replication, Oracle Lars Thalmann, Director Replication, Backup, Utilities and Connectors THE FOLLOWING IS INTENDED
Hadoop and Map-Reduce. Swati Gore
Hadoop and Map-Reduce Swati Gore Contents Why Hadoop? Hadoop Overview Hadoop Architecture Working Description Fault Tolerance Limitations Why Map-Reduce not MPI Distributed sort Why Hadoop? Existing Data
Sharding and MongoDB. Release 3.2.1. MongoDB, Inc.
Sharding and MongoDB Release 3.2.1 MongoDB, Inc. February 08, 2016 2 MongoDB, Inc. 2008-2015 This work is licensed under a Creative Commons Attribution-NonCommercial- ShareAlike 3.0 United States License
Parallel Replication for MySQL in 5 Minutes or Less
Parallel Replication for MySQL in 5 Minutes or Less Featuring Tungsten Replicator Robert Hodges, CEO, Continuent About Continuent / Continuent is the leading provider of data replication and clustering
Low-cost Open Data As-a-Service in the Cloud
Low-cost Open Data As-a-Service in the Cloud Marin Dimitrov, Alex Simov, Yavor Petkov Ontotext AD, Bulgaria {first.last}@ontotext.com Abstract. In this paper we present the architecture and prototype of
Apache HBase. Crazy dances on the elephant back
Apache HBase Crazy dances on the elephant back Roman Nikitchenko, 16.10.2014 YARN 2 FIRST EVER DATA OS 10.000 nodes computer Recent technology changes are focused on higher scale. Better resource usage
Linas Virbalas Continuent, Inc.
Linas Virbalas Continuent, Inc. Heterogeneous Replication Replication between different types of DBMS / Introductions / What is Tungsten (the whole stack)? / A Word About MySQL Replication / Tungsten Replicator:
Linux A first-class citizen in Windows Azure. Bruno Terkaly [email protected] Principal Software Engineer Mobile/Cloud/Startup/Enterprise
Linux A first-class citizen in Windows Azure Bruno Terkaly [email protected] Principal Software Engineer Mobile/Cloud/Startup/Enterprise 1 First, I am software developer (C/C++, ASM, C#, Java, Node.js,
MySQL Security: Best Practices
MySQL Security: Best Practices Sastry Vedantam [email protected] Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes
owncloud Architecture Overview
owncloud Architecture Overview Time to get control back Employees are using cloud-based services to share sensitive company data with vendors, customers, partners and each other. They are syncing data
Drupal in the Cloud. Scaling with Drupal and Amazon Web Services. Northern Virginia Drupal Meetup
Drupal in the Cloud Scaling with Drupal and Amazon Web Services Northern Virginia Drupal Meetup 3 Dec 2008 Cast of Characters Eric at The Case Foundation: The Client With typical client challenges Cost:
Log management with Logstash and Elasticsearch. Matteo Dessalvi
Log management with Logstash and Elasticsearch Matteo Dessalvi HEPiX 2013 Outline Centralized logging. Logstash: what you can do with it. Logstash + Redis + Elasticsearch. Grok filtering. Elasticsearch
Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,[email protected]
Data Warehousing and Analytics Infrastructure at Facebook Ashish Thusoo & Dhruba Borthakur athusoo,[email protected] Overview Challenges in a Fast Growing & Dynamic Environment Data Flow Architecture,
Veeam Summer School. Thomas Zaatman Veeam Software
Veeam Summer School Thomas Zaatman Veeam Software Availability for the your modern datacentre modern datacenter Veeam Availability Suite v8 Welcome to Veeam Veeam was founded in 2006 Exponential revenue
