Developing a Highly Available Organization. Executive Presentation for: ACP Boston
- Logan Hamilton
- 10 years ago
Transcription
1 Developing a Highly Available Organization Executive Presentation for: ACP Boston
2 Agenda: Welcome; The Need for Options; Replication Defined; Replication Technologies (Vaulting, Server Replication, Storage Replication; each covered as technology and how it works, diagrams, case study); SunGard Overview; Q&A and Wrap-up
3 Introductions and Welcome: Dennis A. Musolino, SunGard Regional Vice President
4 The Need for Options: Organizations are under increasing pressure to align recovery times with business expectations. Organizations are also less and less tolerant of data loss. Only 43 percent of businesses suffering a data loss disaster ever recover sufficiently to resume business (National Archives and Records Administration).
5 Replication Defined: Gregory Boutsikaris, Sr. Solution Engineer, Advanced Recovery Specialist
6 Common Recovery Terms. RPO = Recovery Point Objective: the maximum amount of lost data that a business can sustain due to an outage. RPO is measured in units of time. RTO = Recovery Time Objective: the time it takes to recover operations as required by the business. ATOT/ATOD = At Time Of Test/At Time Of Disaster.
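As a rough illustration of the two terms, the sketch below measures the data-loss window and the downtime for a single outage and compares each with its objective. The function names, timestamps, and objectives are hypothetical and are not taken from the presentation.

```python
from datetime import datetime, timedelta

def achieved_rpo(last_recoverable_copy: datetime, outage_start: datetime) -> timedelta:
    """Data-loss window: time between the last recoverable copy and the outage."""
    return outage_start - last_recoverable_copy

def achieved_rto(outage_start: datetime, service_resumed: datetime) -> timedelta:
    """Downtime: time between the outage and the resumption of operations."""
    return service_resumed - outage_start

def meets_objective(achieved: timedelta, objective: timedelta) -> bool:
    """True when the measured value is within the business-endorsed objective."""
    return achieved <= objective

# Hypothetical outage: nightly backup at 22:00, failure at 09:30, service back at 15:30.
last_backup = datetime(2024, 1, 10, 22, 0)
outage = datetime(2024, 1, 11, 9, 30)
resumed = datetime(2024, 1, 11, 15, 30)

rpo = achieved_rpo(last_backup, outage)   # 11 hours 30 minutes of data at risk
rto = achieved_rto(outage, resumed)       # 6 hours of downtime
```

In this made-up case a 4-hour RPO would be missed while a 24-hour RTO would be met, which is the kind of gap that pushes an application toward replication rather than nightly backup.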
7 Recovery Timeline [Diagram labels: Systems, Applications, Data, Lost Data, Vital Records; Notifications, Restore Technology Capability, Resume Business, Move to Interim Site, Return Home; Restore / Failover, Communications, Restore Business Functions, Data Synchronization, Relocate Office Equipment / Supplies, Work Flow; Recovery Point Objective, Recovery Time Objective]
8 RPO & RTO: Recovery Solutions [Chart: Hours of Lost Data (RPO) and Hours Required to Resume Business (RTO) for Traditional Recovery, Standby OS, Vaulting Services, Replication Services, and Failover. Stages shown: Transactions Not Captured, Declaration, Data Retrieval, Transit, System Restore, Start-Up & Network, Database Restore, Transaction Recreation]
9 Tiered Recovery Approach to a Successful DATA Recovery Methodology
Process/Application Recovery Tiers, by RTO (diagram labels: BIA, RTO):
Tier 4, > 24 hrs: Vaulting & Recovery Services. Restore from tape/storage ATOT/ATOD.
Tier 3, 4-24 hrs: Server & Storage Replication. Dedicated and/or shared equipment w/ startup services.
Tier 2, 1-4 hrs: Storage Replication. Dedicated and shared equipment.
Tier 1, 0-1 hrs: Multi Center Managed Services. Dedicated equipment, clustering, load balancing.
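The tier boundaries on this slide can be written as a small lookup. This is only a sketch: the slide's ranges share their endpoints (1, 4, and 24 hours), so assigning each endpoint to the faster tier is an assumption, and the function and dictionary names are hypothetical.

```python
# Service named on the slide for each tier.
TIER_SERVICES = {
    1: "Multi Center Managed Services",
    2: "Storage Replication",
    3: "Server & Storage Replication",
    4: "Vaulting & Recovery Services",
}

def recovery_tier(rto_hours: float) -> int:
    """Map a business-endorsed RTO (in hours) to a recovery tier.

    Assumption: a boundary value (1, 4, or 24 hours) falls into the faster tier.
    """
    if rto_hours < 0:
        raise ValueError("RTO cannot be negative")
    if rto_hours <= 1:
        return 1
    if rto_hours <= 4:
        return 2
    if rto_hours <= 24:
        return 3
    return 4

# Example: an application that must be back within 6 hours.
tier = recovery_tier(6)
service = TIER_SERVICES[tier]
```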
10 Why Replicate Data? Protect information; improve recovery time objectives (RTO); establish recovery point objectives (RPO); enable higher levels of availability; free up your technical resources.
11 Design Requirements: Commitment to recoverability; define thresholds of pain (business-endorsed RTO/RPO) for service interruptions; define required environment (configurations, data, interdependencies, leads/feeds); define tiers of applications; define network requirements.
12 Vaulting Technology Overview. Option 1: Do Nothing (Not an Option). Option 2: Box and Self-store (Not Reliable). Option 3: Warehouse (Costly and Slow). Option 4: Off-site Vault (Reliable and Affordable). * Vaulting servers must have a software agent installed. [Diagram labels: Primary Data Center (File Server, Print Server; Microsoft Windows, UNIX, Linux, Sun Solaris, OS/400); Internet; VS; ATOT/ATOD Systems; End-User Recovery Center LAN (File Server, Print Server)]
13 Vaulting Architecture [Diagram: Customer Site connected to Off-site Vault. Network access supported: Internet, VNET or Direct. Platforms: Windows road warrior, Novell NetWare, Windows NT/2000/2003/XP, HP-UX, IBM AIX, Sun Solaris, iSeries/AS400, Red Hat/SUSE Linux]
14 Delta Technology
15 Case Study: Vaulting
16 Server Replication Overview. * Server must have software and management agents installed. [Diagram labels: Primary Data Center (File Server, Print Server, Microsoft SQL Server, BlackBerry Server, Exchange Server, Databases, S/W, SAN, Monitor/Management); Internet or WAN; Failover; Failback; Recovery Data Center (Microsoft SQL Server, BlackBerry Server, Exchange Server, Databases, S/W, SAN); End-User Recovery Center LAN (File Server, Print Server)]
17 Case Study: Server Replication
18 Storage Replication Technologies [Diagram labels: Primary Data Center (File Server, Print Server; Microsoft Windows, UNIX, Linux, Sun Solaris, OS/400, Mainframe; Storage Array; Monitoring/Management); OC-x or DS-x point-to-point link; Failback; Secondary Data Center Option A and Option B (Storage Array; Dedicated Systems; ATOT/ATOD Inventory; Microsoft Windows, UNIX, Linux, Sun Solaris, OS/400, Mainframe); End-User Recovery Center LAN (File Server, Print Server)]
19 Synchronous Mode. 1) Write received by Symmetrix containing source volume. 2) Source RLD sends write data to target RLD. 3) Target RLD sends acknowledgement to source RLD. 4) Write complete sent to host. Application does not receive I/O acknowledgement until data is received. Write completion time is extended; no impact on reads. Most often used in campus solutions and when point-in-time recoverability is an absolute requirement. [Diagram: Source Host and Target Host; Symmetrix containing source (R1) volumes and Symmetrix containing target (R2) volumes, each with Channel Directors, Cache, Disk Directors, and Remote Link Directors (RLD); callouts 1-4 mark the steps]
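The four steps can be modeled with a toy simulation to show why writes slow down while reads do not. This is a sketch under stated assumptions, not EMC's implementation: the class names, the fixed 0.5 ms local service time, and the 2.0 ms link round trip are all hypothetical.

```python
class TargetArray:
    """Stands in for the array holding the target (R2) volumes."""

    def __init__(self):
        self.cache = {}

    def receive(self, block, data):
        # Steps 2 and 3: accept the write from the source and acknowledge it.
        self.cache[block] = data
        return True


class SourceArray:
    """Stands in for the array holding the source (R1) volumes."""

    def __init__(self, target, link_round_trip_ms):
        self.target = target
        self.link_round_trip_ms = link_round_trip_ms
        self.cache = {}

    def write(self, block, data, local_ms=0.5):
        # Step 1: the write lands in the source array's cache.
        self.cache[block] = data
        # Steps 2 and 3: forward to the target and wait for its acknowledgement.
        acked = self.target.receive(block, data)
        if not acked:
            raise IOError("target did not acknowledge the write")
        # Step 4: only now is completion returned to the host, so the host
        # sees the local service time plus the link round trip.
        return local_ms + self.link_round_trip_ms

    def read(self, block, local_ms=0.5):
        # Reads are served locally and never cross the replication link.
        return self.cache[block], local_ms


target = TargetArray()
source = SourceArray(target, link_round_trip_ms=2.0)
write_ms = source.write(7, b"payload")
data, read_ms = source.read(7)
```

Because the link round trip is added to every write, distance between sites directly limits write performance, which is consistent with the slide's note that synchronous mode is most often used in campus solutions.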
20 EMC SRDF/A Operation: ❶ Capture, ❷ Transmit, ❸ Receive, ❹ Apply, then repeat. 1) Capture: collects application write I/O. 2) Transmit: sends final set of writes to target. 3) Receive: receives writes from the Transmit Delta Set. 4) Apply: once receive is complete, data is applied to disk. SRDF/A performs Write Folding: it transmits only the final writes from the Capture Delta Set.
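The cycle and the effect of write folding can be sketched with dictionaries keyed by block number. This is an illustrative model only; the class and method names are hypothetical, and real delta-set switching, link transfer, and consistency handling are far more involved.

```python
class AsyncReplicator:
    """Toy model of a capture / transmit / receive / apply cycle."""

    def __init__(self):
        self.capture = {}       # capture delta set: block -> latest data
        self.target_disk = {}   # data applied at the target
        self.writes_seen = 0    # host writes accepted since start

    def write(self, block, data):
        # Capture: the host write is acknowledged locally. A repeated write to
        # the same block overwrites the earlier one (write folding).
        self.capture[block] = data
        self.writes_seen += 1

    def cycle(self):
        # Transmit: the capture set becomes the transmit set and a fresh
        # capture set starts collecting new writes.
        transmit, self.capture = self.capture, {}
        # Receive: the target collects the complete set before touching disk.
        receive = dict(transmit)
        # Apply: only once the whole set has arrived is it applied to disk.
        self.target_disk.update(receive)
        return len(receive)     # number of blocks actually sent


rep = AsyncReplicator()
rep.write(1, b"v1")
rep.write(1, b"v2")
rep.write(1, b"v3")
rep.write(2, b"x")
sent = rep.cycle()
```

Four host writes result in two blocks crossing the link, and the target ends up with only the final version of block 1.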
21 IBM Global Mirror
22 Hitachi Replication: Universal Replicator (UR). Journal key technology: written data is chronologically reflected to the secondary site via journal volumes (JNL VOLs). JNL keeps consistency across multiple VOLs. JNL data is copied by means of a special read I/O (Read JNL) initiated by the remote system. [Diagram, simple DR configuration using UR: at the primary site (USP V), writes (WRT) go to the application volume and journal data is stored in the JNL volume; the journal is read asynchronously and the journal file is transferred to the remote subsystem; at the secondary site (USP V), journal data is restored to the application volume while keeping consistency]
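The pull-based journal idea can be sketched as follows: the primary appends sequence-numbered entries to a journal, the secondary initiates the read, and entries are restored in sequence order so that several volumes stay mutually consistent. All class and method names are hypothetical; this is not Hitachi's interface.

```python
from collections import deque


class PrimarySite:
    def __init__(self):
        self.volumes = {}        # volume name -> {block: data}
        self.journal = deque()   # entries in write order
        self.seq = 0

    def write(self, volume, block, data):
        # Each write updates the application volume and is also journaled
        # with a sequence number shared by all volumes.
        self.seq += 1
        self.volumes.setdefault(volume, {})[block] = data
        self.journal.append((self.seq, volume, block, data))

    def read_journal(self, max_entries):
        # Served in response to a read issued by the remote system.
        out = []
        while self.journal and len(out) < max_entries:
            out.append(self.journal.popleft())
        return out


class SecondarySite:
    def __init__(self, primary):
        self.primary = primary
        self.journal = deque()
        self.volumes = {}
        self.last_applied = 0

    def pull(self, max_entries=100):
        # The secondary initiates the transfer (asynchronous journal read).
        self.journal.extend(self.primary.read_journal(max_entries))

    def restore(self):
        # Entries are applied strictly in sequence order across all volumes.
        while self.journal:
            seq, volume, block, data = self.journal.popleft()
            self.volumes.setdefault(volume, {})[block] = data
            self.last_applied = seq


primary = PrimarySite()
secondary = SecondarySite(primary)
primary.write("db", 0, b"begin")
primary.write("log", 0, b"entry")
primary.write("db", 0, b"commit")
secondary.pull()
secondary.restore()
```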
23 EMC RecoverPoint Continuous Remote Replication (CRR). [Diagram: two sites, each with a SAN, joined over a WAN; callouts ❶-❺] ❶ RecoverPoint splitter drivers: mirror writes to the RecoverPoint appliance; reside on host, on CLARiiON, or in fabric. ❷ RecoverPoint appliance: runs RecoverPoint software; performs all bi-directional replication; handles monitoring, management, and control; maintains write-order fidelity. ❸ Journal: tracks all data changes to every protected LUN; utilizes bookmarks for application-aware recovery. ❹ Provides advanced functionality: 3-15x data compression; no need for expensive Fibre Channel/IP converters. ❺ Supports heterogeneous environments: works with EMC and third-party storage*; true any-to-any volume replication.
24 RecoverPoint/SE CRR. [Diagram: two sites, each with a SAN, joined over a WAN; callouts ❶-❺] ❶ RecoverPoint/SE splitter driver: mirrors writes to the RecoverPoint appliance; resides on the Windows Server host or on the CLARiiON CX3. ❷ RecoverPoint appliance: runs RecoverPoint/SE software; performs all bi-directional replication; handles monitoring, management, and control; maintains write-order fidelity. ❸ Journal: tracks all data changes to every protected LUN; utilizes bookmarks for application-aware recovery. ❹ Provides advanced functionality: 3-15x data compression*; no need for expensive FC/IP converters. ❺ Supports CLARiiON CX series arrays: supports CX3-10, CX3-20, CX3-40, CX3-80; also supports legacy CLARiiON CX300, CX500, CX700; true volume replication between two CLARiiON CX arrays. * Requires optional Bandwidth Reduction Module.
25 Continuous Data Protection (CDP). [Diagram: application, database, messaging, and file and print servers on a SAN with heterogeneous disk systems and local CDP journals; callouts ❶-❹] ❶ RecoverPoint splitter driver: mirrors server writes to the RecoverPoint appliance; resides on host, on CLARiiON, or in fabric. ❷ RecoverPoint appliance: runs RecoverPoint software; writes changes to the CDP Journal; distributes changes to target volumes; maintains write-order consistency across all volumes. ❸ Journal: tracks all data changes to every protected LUN; stores bookmarks for application-aware recovery; stores historic data for LUNs to roll back to any point in time. ❹ Supports heterogeneous environments: works with EMC and third-party storage; fabric splitters support Brocade Fabric Application Platform (FAP) and Cisco SANTap.
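The journal's role (recording every change in write order, holding named bookmarks, and allowing rollback to any point) can be sketched in a few lines. The class below is a hypothetical illustration and not RecoverPoint's design or API; a real journal stores block-level history on disk rather than rebuilding an image in memory.

```python
class CdpJournal:
    """Toy CDP journal: every write is kept, in order, with a sequence number."""

    def __init__(self):
        self.entries = []     # (seq, lun, block, data) in write order
        self.bookmarks = {}   # bookmark name -> sequence number

    def record(self, lun, block, data):
        seq = len(self.entries) + 1
        self.entries.append((seq, lun, block, data))
        return seq

    def bookmark(self, name):
        # Marks the current point, e.g. just after an application is quiesced.
        self.bookmarks[name] = len(self.entries)

    def image_at(self, seq):
        # Replay the journal up to and including `seq` to rebuild that point in time.
        image = {}
        for entry_seq, lun, block, data in self.entries:
            if entry_seq > seq:
                break
            image[(lun, block)] = data
        return image

    def image_at_bookmark(self, name):
        return self.image_at(self.bookmarks[name])


journal = CdpJournal()
journal.record("lun0", 0, b"good")
journal.bookmark("before-upgrade")
journal.record("lun0", 0, b"corrupt")
journal.record("lun1", 5, b"other")

latest = journal.image_at(3)
rolled_back = journal.image_at_bookmark("before-upgrade")
```

Rolling back to the bookmark recovers the block as it was before the bad write, while the latest image still reflects every change.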
26 RecoverPoint/SE CDP. [Diagram: application, database, messaging, and file and print servers on a SAN with a CLARiiON array and local CDP journals; callouts ❶-❹] ❶ RecoverPoint/SE splitter driver: mirrors server writes to the RecoverPoint/SE appliance; resides on Windows host or on the CLARiiON CX3 and supports Linux, Solaris, VMware, and Windows. ❷ RecoverPoint appliance: runs RecoverPoint/SE software; writes changes to the CDP Journal; distributes changes to target volumes; maintains write-order consistency across all volumes. ❸ Journal: tracks all data changes to every protected CLARiiON LUN; stores bookmarks for application-aware recovery. ❹ Supports one CLARiiON CX series array: supports current generation CLARiiON CX3 UltraScale series and legacy CLARiiON CX300, CX500, CX700 arrays; CLARiiON splitter supported on CLARiiON UltraScale CX3 series.
27 RecoverPoint Concurrent Local and Remote Data Protection. [Diagram: Oracle, Exchange, and SQL hosts at each site; SAN and RecoverPoint at each site joined over a WAN; Production LUN, Local CDP Copy, Local CDP Journal, Remote CRR Journal, Remote CRR Copy] Create CDP and CRR copies of the same LUNs. Local copy is a CDP replica tracking all changes to the production LUN. Remote copy is a CRR replica tracking significant changes to the production LUN. Can independently recover from local site (any point in time) and remote site (significant point in time). Local copy for operational recovery with single-write recovery point objective (RPO) to any point in time: application recovery using local point-in-time image. Remote copy enables disaster recovery with customer-selected RPO to any significant point in time: disaster-recovery failover using either local or remote point-in-time image.
28 Data De-Duplication Technology. Data Reduction: driven by inline deduplication and compression technology. Compatible with Enterprise Backup and Archiving Software: either as a file server or virtual tape library (VTL). Local and Remote Site Data Protection: replication software enables deduplication storage systems to function as a highly efficient WAN vaulting solution for DR, remote office data protection and multi-site tape consolidation. Advanced Data Integrity: provides defense against data integrity issues with continuous fault detection and healing, and end-to-end verification of data recoverability at time of backup.
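A minimal sketch of inline deduplication with compression, assuming fixed-size chunks identified by their SHA-256 digest: a chunk is compressed and stored only the first time it is seen. The class name, chunk size, and chunking strategy are hypothetical and are not a description of any vendor's product.

```python
import hashlib
import zlib


class DedupStore:
    """Toy deduplicating store: unique chunks are kept once, compressed."""

    def __init__(self, chunk_size=4096):
        self.chunk_size = chunk_size
        self.chunks = {}   # digest -> compressed chunk

    def put(self, data):
        """Store `data` and return its recipe (the ordered list of chunk digests)."""
        recipe = []
        for offset in range(0, len(data), self.chunk_size):
            chunk = data[offset:offset + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            if digest not in self.chunks:        # inline check before storing
                self.chunks[digest] = zlib.compress(chunk)
            recipe.append(digest)
        return recipe

    def get(self, recipe):
        """Rebuild the original data from a recipe."""
        return b"".join(zlib.decompress(self.chunks[d]) for d in recipe)


store = DedupStore()
backup = b"A" * 8192 + b"B" * 4096
first = store.put(backup)
chunks_after_first = len(store.chunks)
second = store.put(backup)          # a repeat backup of unchanged data
restored = store.get(second)
```

The repeat backup adds no new chunks, which is why replicating only the chunks missing at the remote site can make WAN vaulting efficient.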
29 Case Study: VTL Replication [Network diagram]
30 SunGard Availability Services
31 SunGard: The Pioneer of Information Availability. Over 25 years' experience and nearly 10,000 Information Availability clients worldwide, including 70 of the Fortune 100. Over 1,450 Managed IT Services customers. 2,500 Information Availability experts and professionals. More platforms and North American locations than any other vendor. Offers Electronic Vaulting with On-Demand Local Servers. Financial stability of a Fortune 500 company.
32 SunGard Solutions: Information Availability. Information Availability Assessments; Security Assessments; Business & Technology Profiles; Information Availability Program Management; Business Impact Analyses; System Management Services; Managed Security Services; Managed Storage Services; Monitoring Services; Network Services; Hosting Infrastructure Services; End-User Recovery; Systems Recovery; Mobile Recovery; Software Tools; Shared SAN.
33 SunGard's AdvancedRecovery Solutions. SunGard AdvancedRecovery solutions offer a range of services to help ensure increased availability while minimizing cost and downtime. They provide the essential elements needed to recover more quickly after a disaster or business disruption. Services include secure remote replication or electronic vaulting technology, network recovery, imaging, back-up facilities and platforms, and experienced support to recover quickly.
34 North American Recovery Facilities: Calgary, CN; Quebec, CN; Montreal, CN; Toronto, CN (3); Honolulu, HI; Metepec, MX; Atlanta, GA (2); Austin, TX; Birmingham, AL; Boston, MA; Charlotte, NC; Chicago, IL (2); Cleveland, OH; Dallas, TX; DC/Metro, VA; Denver, CO (3); Detroit, MI; Indianapolis, IN; Los Angeles, CA; San Ramon, CA; Minneapolis, MN; Nashville, TN; New Jersey (3); New York, NY; Orlando, FL; Philadelphia, PA (2); Phoenix, AZ; Pittsburgh, PA (2); Portland, OR; Raleigh/Durham, NC; San Diego, CA; Scottsdale, AZ; Seattle, WA; St. Louis, MO (2); St. Paul, MN
35 Managed IT Locations [Map labels: Portland, San Diego, Scottsdale, Phoenix, Denver N, Denver S, Minneapolis, Austin, Chicago, St. Louis, Pittsburgh, Nashville, Atlanta, Philadelphia, Philadelphia, Marlborough, Northern New Jersey/NY, Raleigh, Charlotte, Atlanta, Southern New Jersey]
36 Why SunGard Availability Services: Experience matters. % Recovery Success Rate. Over 2,200 disasters declared. Our Methodology... Our Commitment: week testing methodology; crisis management process; technology exchange program; recovery scripts & procedures; comprehensive support; warranted 24/7 readiness; 3rd-party audit review; support hotline 24/7.
37 Closing: Q&A, Wrap-up. THANK YOU!! SunGard Account Manager: Adam Shorr. Contact Information:
