MySQL synchronous replication in practice with Galera

Similar documents

MySQL always-up with Galera Cluster

High-availability with Galera Cluster for MySQL

Synchronous multi-master clusters with MySQL: an introduction to Galera

Galera Replication. Synchronous Multi-Master Replication for InnoDB. ...well, why not for any other DBMS as well. Seppo Jaakola Alexey Yurchenko

High Availability Solutions for the MariaDB and MySQL Database

High Availability Solutions for MySQL. Lenz Grimmer DrupalCon 2008, Szeged, Hungary

MySQL High-Availability and Scale-Out architectures

Preparing for the Big Oops! Disaster Recovery Sites for MySQL. Robert Hodges, CEO, Continuent MySQL Conference 2011

MySQL High Availability Solutions. Lenz Grimmer OpenSQL Camp St. Augustin Germany

XtraBackup. Vadim Tkachenko CTO, Co-Founder, Percona Inc Percona Live SF Feb 2011

High Availability and Scalability for Online Applications with MySQL

Top 10 Reasons why MySQL Experts Switch to SchoonerSQL - Solving the common problems users face with MySQL

XtraBackup: Hot Backups and More

Zero Downtime Deployments with Database Migrations. Bob Feldbauer

How to evaluate which MySQL High Availability solution best suits you

MySQL Administration and Management Essentials

Tushar Joshi Turtle Networks Ltd

Monitoring Galera with MONyog

Architectures Haute-Dispo Joffrey MICHAÏE Consultant MySQL

High Availability Using MySQL in the Cloud:

DBA Tutorial Kai Voigt Senior MySQL Instructor Sun Microsystems Santa Clara, April 12, 2010

Comparing MySQL and Postgres 9.0 Replication

Database Scalability {Patterns} / Robert Treat

Percona Server features for OpenStack and Trove Ops

Solving Large-Scale Database Administration with Tungsten

How To Monitor Mysql With Zabbix

Not Your Grandpa s Replication

MySQL backups: strategy, tools, recovery scenarios. Akshay Suryawanshi Roman Vynar

SCALABLE DATA SERVICES

Portable Scale-Out Benchmarks for MySQL. MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc.

SkySQL Data Suite. A New Open Source Approach to MySQL Distributed Systems. Serge Frezefond V

Module 14: Scalability and High Availability

MySQL Replication. openark.org

Database Replication with MySQL and PostgreSQL

High Availability, Disaster Recovery and Extreme Read Scaling using Binlog Servers. Jean-François Gagné jeanfrancois DOT gagne AT booking.

Future-Proofing MySQL for the Worldwide Data Revolution

Load balancing MySQL with HaProxy. Peter Boros Percona 4/23/13 Santa Clara, CA

Zero-Downtime MySQL Backups

Non-Stop for Apache HBase: Active-active region server clusters TECHNICAL BRIEF

Flash Databases: High Performance and High Availability

MySQL/MariaDB Multi-Master Replication & Failover

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

High Availability Solutions with MySQL

How to choose High Availability solutions for MySQL MySQL UC 2010 Yves Trudeau Read by Peter Zaitsev. Percona Inc MySQLPerformanceBlog.

Scaling DBMail with MySQL

Adding Indirection Enhances Functionality

MySQL Backup and Recovery: Tools and Techniques. Presented by: René Senior Operational DBA

Cloud Based Application Architectures using Smart Computing

Contents. SnapComms Data Protection Recommendations

Availability Digest. MySQL Clusters Go Active/Active. December 2006

The Future of PostgreSQL High Availability Robert Hodges - Continuent, Inc. Simon Riggs - 2ndQuadrant

PRM and DRBD tutorial. Yves Trudeau October 2012

Implementing the Future of PostgreSQL Clustering with Tungsten

Migration Scenario: Migrating Backend Processing Pipeline to the AWS Cloud

How To Manage Myroster Database With Hp And Myroberty

Red Hat Cluster Suite

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

MySQL Backup and Security. Best practices on how to run MySQL on Linux in a secure way Lenz Grimmer <lenz@mysql.com>

MySQL Fabric: High Availability Solution for Connector/Python

Effective MySQL Monitoring. Baron Schwartz March 2012

MySQL and the Open Source Sphere. openark.org

Preventing con!icts in Multi-master replication with Tungsten

ScaleArc for SQL Server

High-Availability Using Open Source Software

Database Administration with MySQL

High Availability Database Solutions. for PostgreSQL & Postgres Plus

MySQL. Leveraging. Features for Availability & Scalability ABSTRACT: By Srinivasa Krishna Mamillapalli

Part 3. MySQL DBA I Exam

McAfee Endpoint Encryption Hot Backup Implementation

MySQL Backup IEDR

MySQL performance in a cloud. Mark Callaghan

SQL Server AlwaysOn (HADRON)

Database Replication with MySQL and PostgresSQL

Tier Architectures. Kathleen Durant CS 3200

Zabbix. At Scale. By Steve Mushero September, Running the World s Internet Servers. Copyright 2014 ChinaNetCloud

The Evolution of Database Management Systems

SAP HANA Operation Expert Summit BUILD - High Availability & Disaster Recovery

Managed Hoster gains competitive edge through new cross-region multi-datacenter architecture

Implementing and Managing Windows Server 2008 Clustering

MySQL Enterprise Backup

MySQL Cluster Deployment Best Practices

Vess A2000 Series HA Surveillance with Milestone XProtect VMS Version 1.0

Eloquence Training What s new in Eloquence B.08.00

Monitoring MySQL. Geert Vanderkelen MySQL Senior Support Engineer Sun Microsystems

Appendix A Core Concepts in SQL Server High Availability and Replication

Björn Kraus. Session Aware Full Page Caching For Magento With Varnish ESI

MySQL backup and restore best practices. Roman Vynar September 21, 2015

F1: A Distributed SQL Database That Scales. Presentation by: Alex Degtiar (adegtiar@cmu.edu) /21/2013

Scalability of web applications. CSCI 470: Web Science Keith Vertanen

Bigdata High Availability (HA) Architecture

Managing MySQL Scale Through Consolidation

Couchbase Server Under the Hood

Welcome to Virtual Developer Day MySQL!

Cloud DBMS: An Overview. Shan-Hung Wu, NetDB CS, NTHU Spring, 2015

Transcription:

MySQL synchronous replication in practice with Galera FOSDEM MySQL and Friends Devroom February 5, 2012, ULB Brussels Oli Sennhauser Senior MySQL Consultant, FromDual oli.sennhauser@fromdual.com

Content Galera Cluster Why Galera? Characteristics Set-up Configuration Starting / stopping SST Information

Galera Cluster Synchronous Multi-Master Replication App App App Load balancing (LB) Node 1 Node 2 Node 3 wsrep wsrep wsrep Galera replication Scaling reads (and writes)

Galera Cluster If one node fails: App App App Load balancing (LB) Node 1 Node 2 Node 3 wsrep wsrep wsrep Galera replication High Availability (HA)

Why Galera? Master-Slave Replication Not multi-master, asynchronous, inconsistencies Master-Master Replication Some kind of multi-master, asynchronous, inconsistencies, conflicts MHA, MMM (v1, v2), Tungsten Bases on MySQL Replication MySQL Cluster Not like InnoDB, Know-How, Network-DB! Active/passive Failover Cluster Operations, resources idling Schooner Expensive, not sure what technology (Memcached, Replication,...)

Characteristics Synchronous replication Based on InnoDB SE (other SE theoretically possible) Active-active real multi-master topology Read and write to any cluster node Automatic membership control True parallel replication, on row level No slave lag No lost transactions Read AND write scalability (Read Scale-Out!) Patch off MySQL binaries (Codership provides binaries) Be aware of Hot Spots on rows Higher probability of dead locks Full sync blocks for writing 3 nodes Initial sync for very big databases (>>50 Gbyte) with mysqldump rsync, xtrabackup

Set-up 3 nodes is recommended Or 2 + 1 (2 mysqld + garbd) SST!!! 2 nodes split brain! Codership MySQL + Galera Plug-in (wsrep) User for SST is root! We recommend to use your own user. On all nodes: GRANT ALL PRIVILEGES ON *.* TO 'sst'@'%' IDENTIFIED BY 'secret'; GRANT ALL PRIVILEGES ON *.* TO 'sst'@'localhost' IDENTIFIED BY 'secret';

Configuration my.cnf (galera.conf) default_storage_engine = InnoDB binlog_format = row innodb_autoinc_lock_mode = 2 innodb_locks_unsafe_for_binlog = 1 innodb_flush_log_at_trx_commit = 0 innodb_doublewrite = 0 query_cache_size = 0 query_cache_type = 0 # wsrep_provider = none wsrep_provider = /lib/plugin/libgalera_smm.so # wsrep_cluster_address = "gcomm://192.168.42.3" wsrep_cluster_address = "gcomm://" wsrep_sst_method wsrep_sst_auth = mysqldump = sst:secret

Start nodes First node: /etc/init.d/mysql start Other nodes: wsrep_cluster_address = "gcomm://192.168.42.1" /etc/init.d/mysql start Check with MySQL error log (on this AND remote node!) SHOW GLOBAL STATUS;

Checks 120131 07:37:17 mysqld_safe Starting mysqld daemon... 120131 7:37:18 [Note] WSREP: wsrep_load(): loading provider library 'libgalera_smm.so' 120131 7:37:18 [Note] WSREP: Start replication... 120131 7:37:18 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0) 120131 7:37:18 [Note].../mysql/bin/mysqld: ready for connections.... 120131 7:37:23 [Note] WSREP: Quorum results: conf_id = 2, members = 3/3 (joined/total) SHOW GLOBAL STATUS LIKE 'wsrep%'; +----------------------------+--------------------------------------+ Variable_name Value +----------------------------+--------------------------------------+ wsrep_local_state_comment Synced (6) wsrep_cluster_conf_id 3 wsrep_cluster_size 3 wsrep_cluster_status Primary wsrep_connected ON wsrep_local_index 0 wsrep_ready ON +----------------------------+--------------------------------------+

SST Snapshot State Transfer (SST) Initial full sync between 1 st and other nodes mysqldump, rsync, (xtrabackup?, LVM?) Blocks the Donor! ( 3 nodes) With v2.0 there is an Incremental State Transfer (IST) 120131 16:26:42 [Note] WSREP: Quorum results: conf_id = 4, members = 2/3 (joined/total) 120131 16:26:44 [Note] WSREP: Node 2 (Node 3) requested state transfer from '*any*'. Selected 0 (Node 1)(SYNCED) as donor. 120131 16:26:44 [Note] WSREP: Shifting SYNCED -> DONOR/DESYNCED (TO: 2695744) 120131 16:27:10 [Note] WSREP: 2 (Node 3): State transfer from 0 (Node 1) complete. 120131 16:27:10 [Note] WSREP: Member 2 (Node 3) synced with group. 120131 16:27:10 [Note] WSREP: 0 (Node 1): State transfer to 2 (Node 3) complete. 120131 16:27:10 [Note] WSREP: Shifting DONOR/DESYNCED -> JOINED (TO: 2695744) 120131 16:27:10 [Note] WSREP: Member 0 (Node 1) synced with group. 120131 16:27:10 [Note] WSREP: Shifting JOINED -> SYNCED (TO: 2695744) 120131 16:27:10 [Note] WSREP: Synchronized with group, ready for connections

Restarting a node 2 nd and 3 rd node no problem 1 st node: # wsrep_cluster_address = "gcomm://192.168.42.3" wsrep_cluster_address = "gcomm:// This is IMHO non optimal because we have 2 different situations: Initial 1 st node start 1 st node restart

Variables Currently (1.1) 27 Variables SHOW GLOBAL VARIABLES LIKE 'wsrep%'; +--------------------------------+-----------------------------+ Variable_name Value +--------------------------------+-----------------------------+ wsrep_cluster_address gcomm:// wsrep_cluster_name Galera-1.0 wsrep-21 wsrep_max_ws_rows 131072 wsrep_max_ws_size 1073741824 wsrep_node_incoming_address 192.168.42.1:3306 wsrep_node_name Node 1 wsrep_notify_cmd wsrep_on ON wsrep_provider.../plugin/libgalera_smm.so wsrep_retry_autocommit 1 wsrep_slave_threads 1 wsrep_sst_auth ******** wsrep_sst_donor wsrep_sst_method mysqldump wsrep_sst_receive_address AUTO +--------------------------------+-----------------------------+

wsrep_provider_options evs.debug_log_mask = 0x1; evs.inactive_check_period = PT0.5S evs.inactive_timeout = PT15S; evs.info_log_mask = 0; evs.install_timeout = PT15S; evs.join_retrans_period = PT0.3S; evs.keepalive_period = PT1S; evs.max_install_timeouts = 1; evs.send_window = 4; evs.stats_report_period = PT1M; evs.suspect_timeout = PT5S; evs.use_aggregate = true; evs.user_send_window = 2; evs.version = 0; evs.view_forget_timeout = PT5M; gcache.dir =...; gcache.keep_pages_size = 0; gcache.mem_size = 0; gcache.name =.../galera.cache; gcache.page_size = 128M; gcache.size = 128M; gcs.fc_debug = 0; gcs.fc_factor = 0.5; gcs.fc_limit = 16; gcs.fc_master_slave = NO; gcs.max_packet_size = 64500; gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807; gcs.recv_q_soft_limit = 0.25; gmcast.listen_addr = tcp://127.0.0.1:4567; gmcast.mcast_addr = ; gmcast.mcast_ttl = 1; gmcast.peer_timeout = PT3S; gmcast.time_wait = PT5S; gmcast.version = 0; pc.checksum = true; pc.ignore_quorum = false; pc.ignore_sb = false; pc.linger = PT2S; pc.npvo = false; pc.version = 0; protonet.backend = asio; protonet.version = 0; replicator.commit_order = 3

Status Currently (1.1) 38 Status information SHOW GLOBAL STATUS LIKE 'wsrep%'; Cluster status Performance metrics General information +----------------------------+--------------------------------------+ Variable_name Value +----------------------------+--------------------------------------+ wsrep_last_committed 2695744 wsrep_replicated 1 wsrep_replicated_bytes 576 wsrep_received 9 wsrep_received_bytes 1051 wsrep_local_commits 1 wsrep_local_send_queue 0 wsrep_local_recv_queue 0 wsrep_flow_control_sent 0 wsrep_flow_control_recv 0 wsrep_provider_version 22.1.1(r95) +----------------------------+--------------------------------------+

Load Balancing In your Application (on your own) Connectors Connector/J PHP: MySQLnd replication and load balancing plug-in SW Load Balancer GLB, Pen, LVS, HAProxy, MySQL Proxy, SQL Relay, HW Load Balancer

Operations 2 Modes: Master-Master Master-Slave Initial configuration (do not mess it up) SST (DB size, NW bandwidth (WAN)) Start / restart Deadlocks and hot spots

Galera Replication Graph from Vadim Tkachenko (Percona): http://www.mysqlperformanceblog.com/2012/01/19/percona-xtradb-cluster-feature-2-multi-master-replication/

Q & A Questions? Discussion? We have some time for face-to-face talks... FromDual provides neutral and independent: Consulting Remote-DBA Support for MySQL, Galera, Percona Server Training www.fromdual.com