z/vm and Linux Disaster Recovery A Customer Experience Lee Stewart Sirius Computer Solutions (DSP)



Similar documents
Virtual Linux Server Disaster Recovery Planning

Implementing Tivoli Storage Manager on Linux on System z

Big Data Storage in the Cloud

Oracle Networking and High Availability Options (with Linux on System z) & Red Hat/SUSE Oracle Update

Using Virtualization to Help Move a Data Center

IBM Infrastructure Suite for z/vm and Linux

Virtualization: TCP/IP Performance Management in a Virtualized Environment Orlando Share Session 9308

TECHNICAL SPECIFICATIONS For DISASTER RECOVERY SERVICES TKG /09/2015

UPSTREAM for Linux on System z

Virtual Networking with z/vm Guest LAN and Virtual Switch

Understanding the New Enterprise Cloud System

Virtual Networking with z/vm Guest LANs and the z/vm Virtual Switch

IBM Software Group. Lotus Domino 6.5 Server Enablement

Understand the New Enterprise Cloud System

Table of Contents. Authors Wendy Wong, Senior Software Architect, and the SOLVE:Operations/Linux Connector team. Print Edition: WW180311

Open Source Mainframe Backup with Hercules

Disaster Recovery Cookbook for Linux Part 1: Before the Storm

Backing Up the Linux/MF & Distributed Environments to One Place. Timothy O Brien Computer Associates International, Inc. ca.com

Backup Solutions with Linux on System z and z/vm. Wilhelm Mild IT Architect IBM Germany mildw@de.ibm.com 2011 IBM Corporation

Customer Experiences:

Options for Backing Up and Restoring z/vm and Linux Guests

P&V Group June The Group P&V is comprised of the brands P&V, VIVIUM, Actel, PNP, Euresa-Life and Arces 1

Options for Backing Up and Restoring a z/vm Cluster and Linux on System z Guests

FDR/UPSTREAM and INNOVATION s other z/os Solutions for Managing BIG DATA. Protection for Linux on System z.

Tracy Dean, IBM August Backing Up and Restoring a z/vm Cluster and Linux on System z Guests. SHARE Session #15727.

Linux on System z. ProvisioningscenarioswithIBMTivoli Service Automation Manager 7.2.2

Overview Customer Login Main Page VM Management Creation... 4 Editing a Virtual Machine... 6

Cloud Computing with xcat on z/vm 6.3

BACKUP AND RECOVERY PLAN MS SQL SERVER

VM Backup methodologies Oren Wolf, TSM Product Manager 11 Mar 2011

Oracle on System z Linux- High Availability Options Session ID 252

Data Center Optimization. Disaster Recovery

Administration Guide - Documentum idataagent (DB2)

High Availability Architectures for Linux in a Virtual Environment

Performance of a webapp.secure Environment

IBM SAP International Competence Center. The Home Depot moves towards continuous availability with IBM System z

Private Cloud for WebSphere Virtual Enterprise Application Hosting

An Oracle White Paper November Oracle Real Application Clusters One Node: The Always On Single-Instance Database

GNR TSM documentation Page 1of 10. TSM Documentation. Finn Henningsen - Sagitta Performance Systems Version th April 2002

How To Handle A Disaster Recovery Plan

Capacity Planning for 1000 virtual servers (What happens when the honey moon is over?) (SHARE SESSION 10334)

z/vm Capacity Planning Overview SHARE 117 Orlando Session 09561

SUSE Linux Enterprise Server For System Z

The Consolidation Process

IBM Systems and Technology Group Technical Conference

Overview. Business value

z/vm Capacity Planning Overview

Crypto and Disaster Recovery. Greg Boyd

High Availability for Linux on IBM System z Servers

Freddy Saldana, Product Manager Tivoli Storage Manager Oxford TSM Symposium September 23-25

Virtualization, SDN and NFV

How To Plan Out A Disaster Recovery Plan For Mip

1. Backup and Recovery Policy

Can You Teach a New Capacity & Performance Specialist Old Tricks? Hindsight and Insight

ITCertMaster. Safe, simple and fast. 100% Pass guarantee! IT Certification Guaranteed, The Easy Way!

Centralize AIX LPAR and Server Management With NIM

BACKING-UP HYPER-V VIRTUAL MACHINES WITH VICEVERSA PRO

Backup Exec System Recovery Management Solution 2010 FAQ

IOS110. Virtualization 5/27/2014 1

Introduction. Setup of Exchange in a VM. VMware Infrastructure

BMC Mainframe Solutions. Optimize the performance, availability and cost of complex z/os environments

Top 10 Tips for z/os Network Performance Monitoring with OMEGAMON Session 11899

EXAM - E nterprise Backup Recovery Design Exam for Data Center Architects Exam.

CA s Cloud Storage for System z

IBM Tivoli Storage Manager FastBack

Arwed Tschoeke, Systems Architect IBM Systems and Technology Group

CommuniGate Pro SIP Performance Test on IBM System z9. Technical Summary Report Version V03

<Insert Picture Here> Oracle Database Support for Server Virtualization Updated December 7, 2009

z/os Curriculum Job Control Language (JCL) Curriculum JES Curriculum WebSphere Curriculum TSO/ISPF for z/os Curriculum

Hyper-V recovery. User Guide

Backup Strategies for z/vm and Linux on z Systems

Introduction to Microsoft Small Business Server

Linux System z Why linux experts are choosing the System z platform.

SHARE in Anaheim. August 6-8, Get the 411 on FDR/UPSTREAM

Cyber Security: Guidelines for Backing Up Information. A Non-Technical Guide

How To Install And Service An Ipa Vmmses/E On Z/Vm And Linux On A Microsoft Vmme (Vmms) On A Z/Mme 3.5 (Vms

Running Oracle Databases in a z Systems Cloud environment

Aktuelles aus z/vm, z/vse, Linux on System z

Local Government Cyber Security:

IIB for Everyone: Affordable Integration

DeltaV Virtualization High Availability and Disaster Recovery

Best Practices for Monitoring Databases on VMware. Dean Richards Senior DBA, Confio Software

Evaluation of Enterprise Data Protection using SEP Software

SuSE Linux High Availability Extensions Hands-on Workshop

Backing Up and Restoring a z/vm Cluster and Linux on System z Guests

Transcription:

z/vm and Disaster Recovery A Customer Experience Lee Stewart Sirius Computer Solutions (DSP) Date Thursday, August 14 th, 2008 Session 9210 2008 Sirius Computer Solutions

The Business Partner Sirius Computer Solutions No, not the satellite radio people IBM Reseller Not a DR vendor Most hardware sales are bundled with software services My role Session 9210 2008 Sirius Computer Solutions 2

The Customer An Insurance Company z9 BC with 2 IFLs 2 Production z/os LPARs 1 VM & LPAR z/vm 5.2; SLES9 SP3; 20+ Images 8-10 Production Migrating to SLES10 SP2 WebSphere; MQ; DB2 Connect Fair Isaac Blaze Advisor Session 9210 2008 Sirius Computer Solutions 3

The Customer Setup Real Processor Hipersockets z/os z/os Router Guest LAN z/vm LPAR 2 IFLs Vswitch OSA OSA Session 9210 2008 Sirius Computer Solutions 4

Two Part Backups Full pack backup from z/os weekly machines shut down briefly on Sunday evenings, FLASHCOPYed, then backed up to tape. z/vm backed up periodically TSM backup for incremental changes and file level restores Server on a Windows box Session 9210 2008 Sirius Computer Solutions 5

The DR Vendor A Large DR Vendor Large z9 Plenty of Memory Lots of Processors All CPs, no IFLs All full speed z/os DR always done under VM in the past Session 9210 2008 Sirius Computer Solutions 6

The Previous DR Test A year earlier, before z/vm and Started on a Monday morning z/os LPARs Restored, Up and Running by Monday afternoon Distributed Systems Mostly up by the end of Wednesday In fairness, a large hodgepodge of systems Some never made it up in full 4 day test Successful test, but next time needs to be better Session 9210 2008 Sirius Computer Solutions 7

Last Year s Test Fall 07 Monday to Thursday Customer personnel NOT onsite at DR vendor site 2 z/os LPARs 1 z/vm LPAR 20+ Guests 8-10 Production Why not bring them all up Still many Distributed Systems And many moved to on Z Fewer unique environments Session 9210 2008 Sirius Computer Solutions 8

Last Year s DR Setup Real Processor, all CPs, running z/vm Customer s z/vm Guest LAN Simulating customer network z/os z/o S Router Vswitch Guest LAN OSA Session 9210 2008 Sirius Computer Solutions 9

Last Year s Test Monday morning: Start restoring tapes Monday early afternoon: 2 z/os LPARs Up and running 1 z/vm LPAR Up and running 20+ Guests All Starting Distributed Systems Working on it Session 9210 2008 Sirius Computer Solutions 10

First Problem Monday late afternoon Performance! Customer s VM image only given less than ½ the requested memory, and no Xstor VM Paging at thousands of pages per second First fix attempt force off non-production guests Still slow, still heavy paging Shutdown VM, increase VM s memory, restart Session 9210 2008 Sirius Computer Solutions 11

Second Problem Tuesday mid-morning Performance! Memory is now correct, even Xstor VM can now see 2 real processors but begin to suspect they aren t dedicated to our LPAR Discover VM is NOT in it s own LPAR, but under the DR vendor VM! Can t shutdown z/os systems to reconfigure LPARs Session 9210 2008 Sirius Computer Solutions 12

Last Year s Real DR Setup Real Processor, all CPs, running z/vm Guest LAN Simulating customer network z/vm z/os z/os Router Vswitch Guest LAN OSA Session 9210 2008 Sirius Computer Solutions 13

Second Problem Why? Customer VM & CMS runs great, runs terrible Why? Hardware vs. Software System Z hardware only supports 2 levels of virtualization 2 levels of SIE LPAR support uses the 1 st level SIE to run the LPARs (DR Vendor s VM system) The DR vendor s VM system (1 st level) uses the 2 nd level SIE to run the 2 nd level systems (the customer s z/os & VM images) The customer s VM system (2 nd level) cannot use SIE to manage the guests (3 rd level) all privops and management tasks must be simulated by CP Session 9210 2008 Sirius Computer Solutions 14

Second Problem The Fix Avoid that 2 nd level of VM Two ways depending on DR vendor and your configuration #1 - Put your VM in an LPAR on the DR vendor s machine Recreates more of your environment Easiest if a large number of guests Change the OSA addresses for VM s TCPIP and Vswitches Usually costs a little more #2 - Run your guests directly under the DR vendor s VM Your Vswitch and guest LANs have to be created by the DR vendor You don t have all your stuff in case Easiest if you are only going to run a couple guests Usually a little cheaper Session 9210 2008 Sirius Computer Solutions 15

Second Problem Bypass Customer decides bringing up a single production will be a successful DR test this time DR vendor dedicates 2 CPs to the VM userid Kill all but one guest Slow but livable for the test Session 9210 2008 Sirius Computer Solutions 16

Third Problem The guest runs ok, but can t contact z/os Oh yea Start the Hipersocket router Slow, but it comes up Guest talks to router ok, but not to z/os No DR definition for Hipersocket connection Session 9210 2008 Sirius Computer Solutions 17

The Missing Hipersocket Router Real Processor, all CPs, running z/vm Guest LAN Simulating customer network z/vm z/os z/os Router Vswitch Guest LAN Session 9210 2008 Sirius Computer Solutions 18

This Year s DR Test Customer s Under DR Vendor s VM Decided by DR coordinator for cost reasons Monday morning: Start restoring tapes Monday early afternoon: 2 z/os LPARs Up and running 3 machines Up and running (under the DR VM) Only the production machines Distributed Systems Working on it One small problem Missing Vswitch controller userids Session 9210 2008 Sirius Computer Solutions 19

This Year s Real DR Setup Real Processor, all CPs, running under DR Vendor s z/vm Guest LAN Simulating customer network Pusedo Hipersocket Guest LAN Guest LAN z/os z/os Router Vswitch DTCVSW1 OSA DTCVSW2 Session 9210 2008 Sirius Computer Solutions 20

Next Year s DR Test Customer VM in a DR LPAR Continue to migrate non-mainframe workload to on z DR recovery speed is only one benefit Bring up all the machines DR their Development and Test environments as well Expect all z/os and z/vm and on System z to be up and running by Monday afternoon. Session 9210 2008 Sirius Computer Solutions 21

Lessons Learned 1. Communicate! What we thought was going to be the configuration, wasn t What we thought was being changed, wasn t Miscommunication all the way around sysprog, DR coordinator, DR contract, DR sales, DR support 2. Be proactive Don t just tell your DR coordinator once and leave it all to them Don t skip out on DR planning meetings You know your systems much better than anyone else Session 9210 2008 Sirius Computer Solutions 22

Lessons Learned 3. Be SURE of your DR configuration For each LPAR Hardware Processors, memory, OSAs Network all connections, including Hipersockets Know what will be in LPARs and what will be under VM 4. Plan ahead to avoid 3 rd level No under VM under VM Run your VM in a DR LPAR -or - Run your directly under the DR VM Session 9210 2008 Sirius Computer Solutions 23

Lessons Learned 5. Check your configuration when you get on the DR machine Hardware, memory, network Don t assume it s set up right Don t assume you have all the pieces you need 6. Don t forget Hipersockets And if running a Hipersocket router, don t forget that Simulate via a Guest LAN or real OSA if necessary Session 9210 2008 Sirius Computer Solutions 24

Summary Session 9210 2008 Sirius Computer Solutions 25

Thank You Session 9210 Lee Stewart lee.stewart@siriuscom.com 2008 Sirius Computer Solutions