High Availability Low Dollar Clustered Storage



Similar documents
High Availability Low Dollar Load Balancing

High Availability with DRBD & Heartbeat. Chris Barber

Building Elastix-2.4 High Availability Clusters with DRBD and Heartbeat (using a single NIC)

High availability infrastructures for TYPO3 Websites

Open-Xchange Server High Availability

Twin Peaks Software High Availability and Disaster Recovery Solution For Linux Server

High Availability Solutions for the MariaDB and MySQL Database

Overview: Clustering MySQL with DRBD & Pacemaker

PRM and DRBD tutorial. Yves Trudeau October 2012

Highly Available NFS Storage with DRBD and Pacemaker

High Availability Storage

High Availability Solutions with MySQL

High-Availability Using Open Source Software

ZCP trunk (build 50384) Zarafa Collaboration Platform. Zarafa HA Manual

Implementing SAN & NAS with Linux by Mark Manoukian & Roy Koh

This document may be freely distributed provided that it is not modified and that full credit is given to the original author.

MySQL High Availability Solutions. Lenz Grimmer OpenSQL Camp St. Augustin Germany

Implementing the SUSE Linux Enterprise High Availability Extension on System z

P03 Linux on POWER OSS High Availability

How to Choose your Red Hat Enterprise Linux Filesystem

Pacemaker. A Scalable Cluster Resource Manager for Highly Available Services. Owen Le Blanc. I T Services University of Manchester

Open Source High Availability on Linux

How To Install Eucalyptus (Cont'D) On A Cloud) On An Ubuntu Or Linux (Contd) Or A Windows 7 (Cont') (Cont'T) (Bsd) (Dll) (Amd)

Building Storage Service in a Private Cloud

HAOSCAR 2.0: an open source HA-enabling framework for mission critical systems

Enterprise Linux Business Continuity Solutions for Critical Applications

INUVIKA TECHNICAL GUIDE

High Availability Solutions for MySQL. Lenz Grimmer DrupalCon 2008, Szeged, Hungary

short introduction to linux high availability description of problem and solution possibilities linux tools

Red Hat Cluster Suite

Architectures Haute-Dispo Joffrey MICHAÏE Consultant MySQL

Ultimate Guide to Oracle Storage

Eliminate SQL Server Downtime Even for maintenance

CentOS Cluster Server. Ryan Matteson

How To Install Storegrid Server On Linux On A Microsoft Ubuntu 7.5 (Amd64) Or Ubuntu (Amd86) (Amd77) (Orchestra) (For Ubuntu) (Permanent) (Powerpoint

A High Performance, High Reliability Perforce Server

SYNNEFO: A COMPLETE CLOUD PLATFORM OVER GOOGLE GANETI WITH OPENSTACK APIs VANGELIS KOUKIS, TECH LEAD, SYNNEFO

Parallels Virtuozzo Containers 4.7 for Linux Readme

IdP Clustering. You want to prevent service outages. High Availability and Load Balancing. Possible problems: HW failures

Selling Virtual Private Servers. A guide to positioning and selling VPS to your customers with Heart Internet

High Availability with Postgres Plus Advanced Server. An EnterpriseDB White Paper

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

SUSE Linux Enterprise High Availability Extension. Piotr Szewczuk konsultant

High Availability & Disaster Recovery Development Project. Concepts, Design and Implementation

High Availability, Replicating, Load Balanced MySQL Databases Oliver Hookins Systems Administrator Network Fixinator Script Hacker All round su

Vess A2000 Series HA Surveillance with Milestone XProtect VMS Version 1.0

Active/Active HA Clustering on Shared Storage with Samba

High Availability and Disaster Recovery for Exchange Servers Through a Mailbox Replication Approach

TechTarget Windows Media

Lightweight Virtualization: LXC Best Practices

Distributed File System Choices: Red Hat Storage, GFS2 & pnfs

Lustre SMB Gateway. Integrating Lustre with Windows

High Availability and Backup Strategies for the Lustre MDS Server

Nutanix NOS 4.0 vs. Scale Computing HC3

Acronis Backup & Recovery 10 Server for Linux. Installation Guide

My OwnCloud Project. by Simon Heisterkamp

High Availability and Load Balancing Cluster for Linux Terminal Services

PVFS High Availability Clustering using Heartbeat 2.0

1. Configuring Apache2 Load Balancer with failover mechanism

Implementing the SUSE Linux Enterprise High Availability Extension on System z Mike Friesenegger

Media Exchange really puts the power in the hands of our creative users, enabling them to collaborate globally regardless of location and file size.

Multiple Public IPs (virtual service IPs) are supported either to cover multiple network segments or to increase network performance.

Introduction to Gluster. Versions 3.0.x

High Availability with PostgreSQL and Pacemaker

Peter Ruissen Marju Jalloh

Building Highly Available OpenZFS Storage Appliances Grenville Whelan

Automation and DevOps Best Practices. Rob Hirschfeld, Dell Matt Ray, Opscode

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 7

Advanced Linux System Administration on Red Hat

SLIDE 1 Previous Next Exit

VIDEO SURVEILLANCE WITH SURVEILLUS VMS AND EMC ISILON STORAGE ARRAYS

HADOOP MOCK TEST HADOOP MOCK TEST I

Diploma in Computer Science

High Availability of the Polarion Server

HighlyavailableiSCSIstoragewith DRBDandPacemaker

Stretching A Wolfpack Cluster Of Servers For Disaster Tolerance. Dick Wilkins Program Manager Hewlett-Packard Co. Redmond, WA dick_wilkins@hp.

Open-Xchange Server High availability Daniel Halbe, Holger Achtziger

Achieve Automated, End-to-End Firmware Management with Cisco UCS Manager

ZCP 7.0 (build 41322) Zarafa Collaboration Platform. Zarafa Archiver Deployment Guide

Acronis Backup & Recovery 10 Server for Linux. Update 5. Installation Guide

HA clustering made simple with OpenVZ

BDR TM v3.0 PRODUCTION RELEASE NOTES

Acronis Backup & Recovery 10 Server for Linux. Installation Guide

ovirt and Gluster hyper-converged! HA solution for maximum resource utilization

Acronis Backup & Recovery 10 Server for Linux. Installation Guide

Virtualization and network mirroring to deliver High Availability to Grid services

RedHat (RHEL) System Administration Course Summary

ORACLE DATABASE HIGH AVAILABILITY STRATEGY, ARCHITECTURE AND SOLUTIONS

Michael Thomas, Dorian Kcira California Institute of Technology. CMS Offline & Computing Week

Tushar Joshi Turtle Networks Ltd

Oracle Net Service Name Resolution

Content Distribution Management

CONDOR CLUSTERS ON EC2

Transcription:

High Availability Low Dollar Clustered Storage Simon Karpen skarpen@shodor.org / simon@karpens.org Thanks to Shodor for use of this space for the meeting. This document licensed under the Creative Commons Attribution Share-Alike 3.0 license, http://creativecommons.org/licenses/by-sa/3.0/us/

Overview Shared storage with no shared hardware Based entirely on an open source software stack Cost to try is minimal; a pair of $500 Dell special boxes will work Scales down enough that I can demo on a pair of virtual machines on this laptop Can scale up with hardware within reason

What Can This Do? File services SMB, NFS, etc Databases MySQL, PgSQL Authentication LDAP, etc Network services DHCP, etc Web services better used for backend file/db than actual web services Other applications? As long as persistent data is on the filesystem,

Limitations Not suitable for applications with very high I/O rates limited by the commodity hardware plus overhead Cross-site replication depends on available bandwidth and write rate Automating failover between more than two hosts can be complex Linux support only (no BSD, etc)

Components Linux Operating system. Examples are based on CentOS. Hardware Just about anything that can run Linux and that has local storage DRBD Distributed Redundant Block Device Heartbeat Part of the Linux-HA project, manages device and service failover Network You need connectivity between the hosts. Gigabit is preferred if available.

Operating System No point in paying for RHEL; you'd have to add DRBD yourself or from the CentOS repositories Redhat wouldn't support you anyway! Other Linux variants should be fine; Ubuntu and SuSE even ship DRBD in some versions Watch support cycles; in production, these clusters will be long lived. Fedora is probably a poor choice.

Other Operating Systems FreeBSD geom_gate/geom_mirror may be able to do something similar, but are not yet stable Nothing really there on Solaris yet; the Sun cluster tools in particular want shared storage I am not aware of anything similar on Windows or OSX

Hardware Think about internal redundancy versus external redundancy Where cost is the primary consideration, concentrate on external redundancy (two servers) plus backups When downtime is the primary consideration, look for 'sturdy' hardware (hardware RAID, redundant power supplies and feeds, etc) A pair of $500 Dell special servers is great for a proof of concept

DRBD Distributed Redundant Block Device Web site at http://www.drbd.org/ Open source, but commercial support is available from LinBit Supports both traditional active/passive cluster and new support for active/active with a real cluster filesystem (OCFS2, GFS2, etc) Can run on LVM, or LVM on DRBD, or both

DRBD cont'd FAQ is at http://wiki.linux-ha.org/drbd/faq Heartbeat plus DRBD's integrity checks work respectably as a fence Status in /proc/drbd Configured in /etc/drbd.conf, configuration for each resource must match on each node active/passive configuration has been very reliable in practice

Sample DRBD Configuration resource "files" { protocol C; on c5drbd0 { device /dev/drbd0; disk /dev/vg0/voldrbd0; address 192.168.122.2:7788; meta-disk internal; } on c5drbd1 { device /dev/drbd0; disk /dev/vg0/voldrbd0; address 192.168.122.3:7788; meta-disk internal; } syncer { rate 5M; } }

Sample DRBD Command Lines By request, I have sample DRBD command lines. All as root on the host or hosts indicated. <resource> is the name of the drbd, i.e. files in the examples On both hosts: drbdadm create-md <resource> drbdadm up <resource> On one host (the primary): drbdadm overwrite-data-of-peer primary <resource> wait for sync to complete create your filesystem on the drbd device (in our example, mke2fs -j /dev/drbd0 ) drbdadm secondary <resource> (assuming you will hand control to heartbeat)

What is Heartbeat? Tool to manage failover of services between nodes Web resources for more advanced configuration are mostly at http://www.linux-ha.org/ Including with or readily available with most Linux distributions You could use other cluster management tools, but would have to do the integration ha.cf and haresources must match between nodes

Heartbeat Notes Examples all use Heartbeat v1 style configuration for simplicty, support of all DRBD features Heartbeat v2 CRM style config can work; see http://wiki.linux-ha.org/drbd/howtov2 Controls access to DRBD devices and services that run on top of DRBD devices (i.e. NFS, MySQL, Samba, etc) Not shown here, but you also need /etc/ha.d/authkeys (trivial)

Sample Minimalist /etc/ha.d/ha.cf ucast eth0 192.168.122.2 ucast eth0 192.168.122.3 keepalive 2 warntime 10 deadtime 30 initdead 120 udpport 694 auto_failback on node c5drbd0 node c5drbd1 respawn hacluster /usr/lib64/heartbeat/ipfail

Sample Minimalist /etc/ha.d/haresources c5drbd0 192.168.122.100 drbddisk::files Filesystem::/dev/drbd0::/files::ext3::noatime nfs c5drbd1

Haresources Notes Additional services, filesystems, etc are space separated Centos5/RHEL5 NFS startup scripts have a bug that will break repeated failover/failback Patch is on the next slide; you WILL need this for reliable NFS failover Again, this is a v1 non-crm configuration. You can use a v2 CRM type configuration; there are advantages and disadvantages of both.

/etc/init.d/nfs patch --- nfs.orig 2008-06-08 11:56:02.000000000-0400 +++ nfs 2008-06-08 11:56:09.000000000-0400 @@ -134,6 +134,7 @@ action $"Shutting down NFS services: " /bin/false fi [ -x /usr/sbin/rpc.svcgssd ] && /sbin/service rpcsvcgssd stop + killall -9 nfsd rm -f /var/lock/subsys/nfs ;; status)

Actual Demonstration Two virtual machines Both running CentOS 5.1 x86_64 KVM virtualization, default Fedora configuration Using the heartbeat and DRBD configuration already shown This is obviously a simple minimal setup; this is how you get started. You will need to customize for your own applications

Final Thoughts This is a good enough HA solution for many applications, at a non-ha price Better but not faster or cheaper than a single server. Cheaper but not better or faster than a replicated SAN or NAS (i.e. Netapp cluster) Replication is not a replacement for backups Replication is not a replacement for backups Replication is not a replacement for backups

Questions? Any Questions? (Q&A and Disucssion) A link to the slides will be up on http://ncsysadmin.org/ A link to the video will also eventually make its way to http://ncsysadmin.org/