Storage Based Replications Miroslav Vraneš EMC Technology Group miroslav.vranes@emc.com 1
Protecting Information Is a Business Decision Recovery point objective (RPO): How recent is the point in time for your recovery? Recovery time objective (RTO): How fast can you restart a failed application? (RPO+RTO = Acceptable Business Risk) RPO RTO DAYS HRS MIN SEC EVENT SEC MIN HRS TIME DAYS 2
Balancing Business Requirements and Cost Cost Cost of Data Availability Cost of System Availability Cost of Data Loss Cost of System Downtime Ideal Solution Set HOURS RPO Hours of Lost Transactions TIME 0 HOURS RTO Hours Required to Resume Business 3
Typical Service Levels by Replication Technology RPO (Hours of lost transactions) EVENT RTO (Hours required to resume business) Tape Vaulting Daily Tape Backup Daily Disk Backup Asynchronous Disk Mirroring Synchronous Disk Mirroring Continuous Protection -36-24 -12 0 12 24 36 48 60 72 4
Typical Service Levels by Approach, Product Avamar/Data Domain Daily Backup: Recovery point every 24 hours TimeFinder/SnapView/SnapSure Snapshots/Clones: Recovery point every three hours SRDF/MirrorView/Replicator Disk Mirroring: Recovery point latest image replicated RecoverPoint Continuous Protection: Recovery to any significant point in time Unlimited recovery points, application bookmarks (T) TIME Checkpoint Pre-Patch Patch Post-Patch Cache Flush Hot Quarterly Checkpoint Backup Close 5
EMC Local Replication Spacesaving snapshot Clone s Snap Source LUN Clone Full-copy clones Snap Array-based software Snapshots: Space-saving views of production information Clones: Full copies of data Enhances productivity Tens of thousands of replicas for greater parallel processing Delivers jointly developed application integration For fast deployment, faster restore/restart S N A P S H O T S / C L O N E S 6
Storage Based Remote Replication synchronous replication asynchronous replication SRDF/STAR: advanced multi-site Zero RPO Provides advanced multi-site (2, 3, 4 sites) deployments Protects open systems and mainframes (SRDF) Provides coordinated and automated failover/restart of applications, servers, storage Synchronous, asynchronous, concurrent, and advanced multi-site configurations 7
Synchronous Mode Operations 4 1 R1 3 SRDF/S links R2 R2 2 R1 Primary/Secondary Secondary/Primary I/O write received from host/server into source cache I/O is transmitted to target cache Receipt acknowledgment is provided by target back to cache of source Ending status is presented to host/server 8
Asynchronous Logical Flow on Source Software moves I/O from the capture to transmit Delta Set cycle for transfer from source to target 1 2 3 4 WAN Primary Secondary Local application does not wait for any completion status before issuing next dependent write I/O 9
EMC MirrorView Cost-effective, multi-site remote replication for CLARiiON and VNX series MIRRORVIEW Synchronous or Asynchronous WAN WAN Integrates with Unisphere for setup and management Integrates with SnapView for local replication Works with RecoverPoint for local continuous data protection Provides bi-directional Replication Fan-in (n:1) and Fan-out (1:n) Multi-site Configurations D I S K M I R R O R I N G 10
LUN level Disaster Recovery MirrorView with Synchronous Cost effective synchronous block replication Primary E F J J O O R R Fracture Log 10 0 0 0 0 10 0 0 0 0 1 0 0 0 1 0 0 0 Secondary E E J J O O R R R E M O T E P R O T E C T I O N S U I T E Cost effective block replication Supports multi-site replication Tracks host writes while link to secondary is down Uses a bitmap to map the entire primary mirror When secondary is available again, sends only changed data Enables partial sync; avoids full resync Minimizes customer s exposure to out-of-sync data Integrates with MSFT Failover Clusters 11
Symmetrix Remote Data Facility (SRDF) Family Industry-leading remote replication SRDF Family SRDF/S Synchronous for zero data exposure SRDF/A Asynchronous for extended distances SRDF/DM Efficient Symmetrix-to- Symmetrix data mobility SRDF/Star Multi-site replication option SRDF/CE Cluster Enabler option SRDF/AR Automated Replication option SRDF/CG Consistency Groups Cascaded SRDF and SRDF/EDP Extended Distance Protection Concurrent SRDF Concurrent EMC offers choice and flexibility to meet any service level requirement Protects against local and regional disruptions Increases application availability by reducing downtime Minimizes/eliminates performance impact on applications and hosts Independent of hosts and operating systems, applications, and databases Improves recovery point objectives (RPOs) and recovery time objectives (RTOs) with automated restart solutions Mission-critical proven with numerous testimonials and references Tens of thousands of licenses shipped 12
SRDF: Most Widely Deployed Disaster Restart Solution R1 R2 SRDF links R2 R1 Primary/Secondary Secondary/Primary Mirror disk in real time between Symmetrix systems Primary (R1) to secondary (R2) architecture Supports bi-directional remote mirror operations Independent of hosts, operating system, applications, and databases 13
SRDF/S R1/R2 Swap and Application/Database Relocation Site A Site B Host outage occurs while synchronizatio n in progress Database services relocated to remote hosts for restart Primary SRDF/S link Source swapped to target Target swapped to source Secondary Faster availability with R1 to R2 initial or incremental synchronization in progress faster access to data 14
SRDF Timestamp for Suspend/Resume Timestamp gets updated when link status changes occur and will be reported on both the R1 and R2 R1 SRDF R2 Production Site Secondary Site Device Link Status :Not Ready (NR) Time of Last Device Link Status Change :Mon Oct 18 15:30:41 2010 Assists administrator in identifying that intersite network issues may exist and aids in determining data currency in event of site disaster 15
SRDF/A Delta Set Push Operation Source Target 1 2 N Capture N 1 Transmit 3 4 WAN 3 N 1 4 Receive N 2 Apply R2 SRDF/A write I/O cycle number assigned as part of capture cycle (N) SRDF/A write I/O acknowledged back to host as local write operation SRDF/A write I/O cycle number is part of transmit/receive cycle (N 1) SRDF/A write I/O acknowledged from target and removed from transmit cycle (N 1) on source Capture to transmit cycle switch initiated based on cycle switch time interval setting with N 1 and N 2 cycles completed 16
SRDF/A Link Resiliency Option N Capture Source N 1 Transmit WAN N 1 Receive Target N 2 Apply R2 Link Resiliency option allows SRDF/A sessions to survive transient link failures Capture cycle continues until link recovers or cache full condition occurs Avoids need for SRDF Automated Restart actions with resynchronization Continues applying data to R2 from N 1 receive cycle if transmit N 1 received in its entirety Large cache capabilities make it possible to have a Link Resiliency option 17
SRDF/A Delta Set Extension N Capture Source N 1 Transmit WAN N 1 Receive Target N 2 Apply Delta Set Extension Delta Set Extension R2 Offloads some or all of the cycle data to preconfigured disk within the Symmetrix Allows for additional flexibility during abnormal I/O flow in the SRDF/A data path Addresses issue of SRDF/A session drops that can be caused by temporary: Host workload increase Insufficient or unavailable link bandwidth Reductions in usable cache in either the R1 or R2 system SRDF/A Delta Set Extension alleviates cache-full conditions during temporary workload imbalances or non-transient SRDF link outages 18
SRDF Compression in Enginuity Enginuity-based SRDF compression, independent of remote adapter hardware and protocol (Fibre Channel or Gigabit Ethernet) Supported for use with SRDF/S, SRDF/A, and SRDF/DM Enabled or disabled at the SRDF group level SRDF primary and secondary Symmetrix systems must be at Enginuity 5874 Q4 2009 SR or higher Reduces intersite network costs associated with SRDF replication Requires less bandwidth, which makes additional network bandwidth available to other applications 19
SRDF/S and SRDF/A Mode Change Feature Increases heavy-i/o application performance Dynamic and consistent switch between SRDF/A and SRDF/S Balanced performance during I/O peaks Improved RPO versus Adaptive Copy Potential for reduced bandwidth Normal mode Scheduled/scripted SRDF/S switch to SRDF/A mode Normal mode SRDF/S 6:00 a.m. Mode Change 10:00 p.m. 20
EMC Replication Manager Automate application recovery Exchange EMC SnapSure SharePoint SQL Server SAP Oracle Database EMC Replication Manager Application Intelligence and Automation EMC Replicator EMC SnapView EMC RecoverPoint family DB2 UDB EMC TimeFinder family EMC Celerra Family EMC CLARiiON Family EMC Symmetrix Family EMC Unified Storage Single console Point-and-click replica management Easy-to-use wizard interface requires no scripting Automated Application-consistent replicas for backup and testing Application and storage discovery VM-consistent replicas with VMware 21
THANK YOU 22