High Availability Storage



Similar documents
Implementing the SUSE Linux Enterprise High Availability Extension on System z

SUSE Customer Center Roadmap

Based on Geo Clustering for SUSE Linux Enterprise Server High Availability Extension

Open Source High Availability Writing Resource Agents for your own services. Lars Marowsky-Brée Team Lead SUSE Labs

Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP

SUSE Storage. FUT7537 Software Defined Storage Introduction and Roadmap: Getting your tentacles around data growth. Larry Morris

Ceph Distributed Storage for the Cloud An update of enterprise use-cases at BMW

High Availability and Disaster Recovery for SAP HANA with SUSE Linux Enterprise Server for SAP Applications

Relax-and-Recover. Johannes Meixner. on SUSE Linux Enterprise 12.

Advanced Systems Management with Machinery

HO15982 Deploy OpenStack. The SUSE OpenStack Cloud Experience. Alejandro Bonilla. Michael Echavarria. Cameron Seader. Sales Engineer

Implementing Linux Authentication and Authorisation Using SSSD

SUSE Linux Enterprise High Availability Extension. Piotr Szewczuk konsultant

SUSE Linux uutuudet - kuulumiset SUSECon:sta

Build Platform as a Service (PaaS) with SUSE Studio, WSO2 Middleware, and EC2 Chris Haddad

How To Use Suse Linux High Availability Extension 11 Sp (Suse Linux)

High Availability Solutions for the MariaDB and MySQL Database

Challenges Implementing a Generic Backup-Restore API for Linux

Running SAP HANA One on SoftLayer Bare Metal with SUSE Linux Enterprise Server CAS19256

Implementing the SUSE Linux Enterprise High Availability Extension on System z Mike Friesenegger

SUSE Enterprise Storage Highly Scalable Software Defined Storage. Gábor Nyers Sales

Using btrfs Snapshots for Full System Rollback

Deploying Hadoop with Manager

We are watching SUSE

Installing, Tuning, and Deploying Oracle Database on SUSE Linux Enterprise Server 12 Technical Introduction

Btrfs and Rollback How It Works and How to Avoid Pitfalls

Big Data, SAP HANA. SUSE Linux Enterprise Server for SAP Applications. Kim Aaltonen

Operating System Security Hardening for SAP HANA

TUT8155 Best Practices: Linux High Availability with VMware Virtual Machines

Public Cloud. Build, Use, Manage. Robert Schweikert. Public Cloud Architect

SUSE Cloud 5 Private Cloud based on OpenStack

Administration Guide. SUSE Linux Enterprise High Availability Extension 12 SP1

SUSE OpenStack Cloud 4 Private Cloud Platform based on OpenStack. Gábor Nyers Sales gnyers@suse.com

CAS18543 Migration from a Windows Environment to a SUSE Linux Enterprise based Infrastructure Liberty Christian School

How To Make A Cloud Work For You

How an Open Source Cloud Will Help Keep Your Cloud Strategy Options Open

Red Hat Cluster Suite

Oracle Products on SUSE Linux Enterprise Server 11

TUT5605: Deploying an elastic Hadoop cluster Alejandro Bonilla

HO5604 Deploying MongoDB. A Scalable, Distributed Database with SUSE Cloud. Alejandro Bonilla. Sales Engineer abonilla@suse.com

Using SUSE Linux Enterprise to "Focus In" on Retail Optical Sales

EMC DOCUMENTUM xplore 1.1 DISASTER RECOVERY USING EMC NETWORKER

Workflow und Identity Management - Genehmigungsprozesse, Role Mining, Role Design und Compliance Management

Highly Available NFS Storage with DRBD and Pacemaker

Software Defined Everything

INUVIKA TECHNICAL GUIDE

Configuration Management in SUSE Manager 3

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Data Protection with IBM TotalStorage NAS and NSI Double- Take Data Replication Software

Data Center Automation with SUSE Manager Federal Deployment Agency Bundesagentur für Arbeit Data Center Automation Project

Contingency Planning and Disaster Recovery

Contents. SnapComms Data Protection Recommendations

MySQL High Availability Solutions. Lenz Grimmer OpenSQL Camp St. Augustin Germany

Kangaroot SUSE TechUpdate Interoperability SUSE Linux Enterprise and Windows

SanDisk ION Accelerator High Availability

HIGHLY AVAILABLE MULTI-DATA CENTER WINDOWS SERVER SOLUTIONS USING EMC VPLEX METRO AND SANBOLIC MELIO 2010

Deploying Global Clusters for Site Disaster Recovery via Symantec Storage Foundation on Infortrend Systems

Novell Collaboration Vibe OnPrem

Linux w chmurze publicznej SUSE na platformie Microsoft Azure

High Availability & Disaster Recovery Development Project. Concepts, Design and Implementation

Wicked A Network Manager Olaf Kirch

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

Whitepaper - Disaster Recovery with StepWise

kgraft Live patching of the Linux kernel

Zen and The Art of High-Availability Clustering Lars Marowsky-Brée

Novell PlateSpin Orchestrate

HRG Assessment: Stratus everrun Enterprise

QNAP in vsphere Environment

Deploying Exchange Server 2007 SP1 on Windows Server 2008

High-Availability Using Open Source Software

Of Pets and Cattle and Hearts

High Availability Solutions with MySQL

Eliminate SQL Server Downtime Even for maintenance

High Availability and Disaster Recovery for Exchange Servers Through a Mailbox Replication Approach

Oracle BI Publisher Enterprise Cluster Deployment. An Oracle White Paper August 2007

Novell Open Enterprise Server

High Availability & Disaster Recovery. Sivagopal Modadugula/SAP HANA Product Management Session # 0506 May 09, 2014

Astaro Deployment Guide High Availability Options Clustering and Hot Standby

SUSE Linux Enterprise 12 Security Certifications Common Criteria, EAL, FIPS, PCI DSS,... What's All This About?

File Management Suite. Novell. Intelligently Manage File Storage for Maximum Business Benefit. Sophia Germanides

PVFS High Availability Clustering using Heartbeat 2.0

DevOps and SUSE From check-in to deployment

Real-time Protection for Hyper-V

Wicked Trip into Wicked Network Management

Building Storage Service in a Private Cloud

istorage Server: High-Availability iscsi SAN for Windows Server 2008 & Hyper-V Clustering

Network Attached Storage. Jinfeng Yang Oct/19/2015

High Availability with Postgres Plus Advanced Server. An EnterpriseDB White Paper

SAP Solutions High Availability on SUSE Linux Enterprise Server for SAP Applications

Cisco Active Network Abstraction Gateway High Availability Solution

Informix Dynamic Server May Availability Solutions with Informix Dynamic Server 11

Stretching A Wolfpack Cluster Of Servers For Disaster Tolerance. Dick Wilkins Program Manager Hewlett-Packard Co. Redmond, WA dick_wilkins@hp.

Transcription:

High Availability Storage High Availability Extensions Goldwyn Rodrigues High Availability Storage Engineer SUSE

High Availability Extensions Highly available services for mission critical systems Integrated suite over robust open-source technologies Business Continuity Protect Data Integrity Reduce Unplanned Downtime Commodity Hardware for high availability 2

Cluster A set of computers interacting with each other One goes down, another picks up its responsibility Should be available as much as possible Cluster Types: Active/Active vs Active/Passive (N+1, N+M) Physical vs Virtual vs Hybrid Local Clusters vs Metro vs Geo Clusters 3

Why Cluster Increased Availability Improved Performance Low cost of operation Scalability Disaster Recovery Data Protection Server Consolidation Storage Consolidation 4

CAP Theorem Brewers Theorem All nodes see the same data at the same time Consistency Availability A guarantee that every request receives a response about whether it was successful or failed Partition Tolerance The system continues to operate despite arbitrary message loss or failure of part of the system 5

Fault Tolerance vs High Availability Specialized hardware to detect a hardware fault and switch to redundant hardware Expensive redundant and replicated components A set of computers system-wide, shared resources that cooperate to guarantee essential services Software and hardware to quickly restore services 6

Cluster NFS Server DB Server Web Server Application Server Mail Server 7

A Dysfunctional Cluster SPLIT BRAIN 8

STONITH Shoot the Other Node In the Head RESET 9

10 Quorum

Quorum Policies Ignore Continue cluster operations as usual Freeze Resource management continues New resources are not started Stop All resources affected partition are stopped Suicide Fence all nodes in affected partition 11

Resource Agents Open Cluster Framework (OCF) Manage resources Web Server IP Address Shared Filesystem Resource Operations Id Name Interval timeout 12

Configuration Tools Cluster Resource Manager (CRM) Powerful Command line tool YasT Basic Cluster setup DRBD IP Load balancing High Availability Web Konsole (Hawk) Web based 13

14 Architecture

Resource Agent Constraints Colocation Database Order 1. IPAddr 2. Web Server Location 15

Shared Storage What's the problem here? 16

Shared Storage A common view for all nodes Node Failure: One node can pick up from where the other left Local filesystems don't work Data corruptions because of writing files other nodes access Node Cache Inconsistencies 17

DLM Distributed Lock Manager Provides a Cluster-wide locking for data access Different Level for wide variety of uses No centralized-control Easy for take over The node accessing the object first gets to create the distributed lock object Lock Value Block (LVB) for data synchronization 18

DLM Distributed Lock Manager Mode NL CR CW PR PW EX NL Yes Yes Yes Yes Yes Yes CR Yes Yes Yes Yes Yes No CW Yes Yes Yes No No No PR Yes Yes No Yes No No PW Yes Yes No No No No EX Yes No No No No No 19

clvm Clustered Logical Volume Manager Logical Volume Manager for the cluster Add or remove devices as storage needs change Linear Add storage as required Simple addition of devices in a linear form Mirrored Redundancy of devices over the cluster 20

DRBD Distributed Replicated Block Device Primary Node Secondary Node DRBD Disk 2 DRBD Disk 1 21

DRBD Distributed Replicated Block Device Shared Storage without a SAN Governed by network speeds RAID1 over network block device and local device Fully synchronous, memory synchronous or asynchronous modes of operation Dual Primary for clustered filesytems 22

Shared Filesystem ocfs2 Simultaneously mounted on all nodes All nodes should have access to the data Cluster Filesystem with a good throughput Should not be bogged down with multiple access Force cache flush on other nodes using filesystem when current node access the file 23

OCFS2 Features B-tree Extent based Inline data Indexed Directories Metadata ECC Refcount Extended Attributes ACL Quota Read man mkfs.ocfs2 for more information.. 24

Usage Scenarios Database Server Applications Common database repository CTDB (Samba) Web Servers WWW root Highly Available Virtual Machines VM Disks 25

Generic Clustering Tips Always use STONITH Recommend multiple STONITH devices SBD is an alternative Time synchronization Read the logs when things are not working Record Time of event Redundant Communications Network Device bonding Redundant Ring Protocol 26

OCFS2 Tips Prefer hardware based RAID (mirroring) If you don't want a feature don't enable it Quota, ACL, Inline directories use additional data on disk and additional lookups You can always enable it later using tunefs.ocfs2 Inline directories for large number of files MetaECC for better data protection Protects filesystem from getting corrupted further Immediately run fsck.ocfs2 27

SUSE Linux Enterprise High Availability Extension https://www.suse.com/products/ highavailability/ Thank you. 28

29

Unpublished Work of SUSE LLC. All Rights Reserved. This work is an unpublished work and contains confidential, proprietary and trade secret information of SUSE LLC. Access to this work is restricted to SUSE employees who have a need to know to perform tasks within the scope of their assignments. No part of this work may be practiced, performed, copied, distributed, revised, modified, translated, abridged, condensed, expanded, collected, or adapted without the prior written consent of SUSE. Any use or exploitation of this work without authorization could subject the perpetrator to criminal and civil liability. General Disclaimer This document is not to be construed as a promise by any participating company to develop, deliver, or market a product. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. SUSE makes no representations or warranties with respect to the contents of this document, and specifically disclaims any express or implied warranties of merchantability or fitness for any particular purpose. The development, release, and timing of features or functionality described for SUSE products remains at the sole discretion of SUSE. Further, SUSE reserves the right to revise this document and to make changes to its content, at any time, without obligation to notify any person or entity of such revisions or changes. All SUSE marks referenced in this presentation are trademarks or registered trademarks of Novell, Inc. in the United States and other countries. All third-party trademarks are the property of their respective owners.