Failures. Network Survivability. Failure Scenarios. Terminology. Classification - Protection and. Protection vs Restoration.

Similar documents
SDH and WDM A look at the physical layer

Network Design for Highly Available VoIP

SDH and WDM: a look at the physical layer

tyuiopasdfghjklzxcvbnmqwertyuiopas

WDM Passive Optical Networks: Protection and Restoration

Capacity planning and.

Failsafe Protection for Utility Critical Infrastructure

ASON for Optical Networks

Relationship between SMP, ASON, GMPLS and SDN

Iyad Katib and Deep Medhi DRCN 2011 Krakow, Poland October 2011

LOAD BALANCING IN WDM NETWORKS THROUGH DYNAMIC ROUTE CHANGES

Broadband Networks. Prof. Karandikar. Department of Electrical Engineering. Indian Institute of Technology, Bombay. Lecture - 26

WDM network management

The Economics of Cisco s nlight Multilayer Control Plane Architecture

Performance Management and Fault Management. 1 Dept. of ECE, SRM University

Determine: route for each connection and protect them if necessary to minimize total network cost (say wavelength-links).

MPLS Part II - Recovery

LOGICAL TOPOLOGY DESIGN Practical tools to configure networks

Policy-Based Fault Management for Integrating IP over Optical Networks

Recession-Proof Consulting Services with CWDM Network Design

A New Fault Tolerant Routing Algorithm For GMPLS/MPLS Networks

10G CWDM Conversion Technology

FIBER OPTIC COMMUNICATIONS. Optical Fibers

Course 12 Synchronous transmission multiplexing systems used in digital telephone networks

Recovery Modeling in MPLS Networks

SONET and DWDM: Competing Yet Complementary Technologies for The Metro Network

Demand-wise Shared Protection for Meshed Optical Networks

John Ragan Director of Product Management. Billy Wise Communications Specialist

Multiple Layer Traffic Engineering in NTT Network Service

OPNET SP GURU TRANSPORT PLANNER GRADUATE PROJECT WAN/ LAN PLANNIG AND DESIGN

Technical White Paper for Multi-Layer Network Planning

End-to-end GMPLS based Protection & Restoration on new generation router interface (Integrating Optical Transport into Router)

PROTECTION ALGORITHMS FOR BANDWIDTH GUARANTEED CONNECTIONS IN MPLS NETWORKS WONG SHEK YOON

Communication Networks. MAP-TELE 2011/12 José Ruela

Partial Path Protection for WDM Networks: End-to-End Recovery Using Local Failure Information

IRMA: Integrated Routing and MAC Scheduling in Multihop Wireless Mesh Networks

Integrated Backup Topology Control and Routing of Obscured Traffic in Hybrid RF/FSO Networks*

Computer Network. Interconnected collection of autonomous computers that are able to exchange information

Optical Layer Monitoring Schemes for Fast Link Failure Localization in All-Optical Networks

Leveraging Advanced Load Sharing for Scaling Capacity to 100 Gbps and Beyond

Single-Link Failure Detection in All-Optical Networks Using Monitoring Cycles and Paths

Introduction to Optical Networks

Partial Path Protection for WDM Networks: End-to-End Recovery Using Local Failure Information

Ring Span Bandwidth Management in RPR

Performance advantages of resource sharing in polymorphic optical networks

It explains the differences between the Plesiochronous Digital Hierarchy and the Synchronous Digital Hierarchy.

Lightpath Planning and Monitoring

Optical Network Survivability: Protection Techniques in the WDM Layer

Multi Protocol Label Switching (MPLS) is a core networking technology that

PART III. OPS-based wide area networks

Industrial Ethernet How to Keep Your Network Up and Running A Beginner s Guide to Redundancy Standards

Data Communication Networks

IP over Optical Networks - A Framework draft-ip-optical-framework-01.txt

A Tutorial on SONET/SDH Automatic Protection Switching (APS) Technology White Paper. Steve Gorshe Principal Engineer

Computer Networks. Definition of LAN. Connection of Network. Key Points of LAN. Lecture 06 Connecting Networks

Workshop - New Paradigms for Routing in Optical Networks

Optical Communication Networks. Transport networks

How To Set Up A Network On A Wdm Network

APPLICATION NOTE 211 MPLS BASICS AND TESTING NEEDS. Label Switching vs. Traditional Routing

Disaster Recovery Design Ehab Ashary University of Colorado at Colorado Springs

A Heuristic Approach to Service Restoration in MPLS Networks

Multiplexing. Multiplexing is the set of techniques that allows the simultaneous transmission of multiple signals across a single physical medium.

Network Design. Yiannos Mylonas

Attaching the PA-A1-ATM Interface Cables

A Novel Approach on DWDM based metropolitan ring network

Deploying Multiservice Applications Using RPR Over the Existing SONET Infrastructure

Chapter 4 Connecting to the Internet through an ISP

WHITEPAPER MPLS: Key Factors to Consider When Selecting Your MPLS Provider

Survivable MPLS Over Optical Transport Networks: Cost and Resource Usage Analysis

Fast Reroute Techniques in MPLS Networks. George Swallow

Design of Light-Tree Based Optical Inter-Datacenter Networks

Chapter 11: WAN. Abdullah Konak School of Information Sciences and Technology Penn State Berks. Wide Area Networks (WAN)

H3C SR8800 RPR Technology White Paper

Ring Protection: Wrapping vs. Steering

Module 5. Broadcast Communication Networks. Version 2 CSE IIT, Kharagpur

Objectives. Lecture 4. How do computers communicate? How do computers communicate? Local asynchronous communication. How do computers communicate?

Capacity Allocation and Contention Resolution in a Photonic Slot Routing All-Optical WDM Mesh Network

Dynamic Load Balancing in WDM Packet Networks With and Without Wavelength Constraints

MPLS: Key Factors to Consider When Selecting Your MPLS Provider Whitepaper

Increasing the Robustness of IP Backbones in the Absence of Optical Level Protection

subject to x dp = 1.0, d D (2a)

End-to-End Dedicated Protection in Multi-Segment Optical Networks

Module 15: Network Structures

CHAPTER 1 INTRODUCTION

Upgrading Path to High Speed Fiber Optic Networks

WAN. Introduction. Services used by WAN. Circuit Switched Services. Architecture of Switch Services

MPLS. Packet switching vs. circuit switching Virtual circuits

Introduction, Rate and Latency

Chapter 2 - The TCP/IP and OSI Networking Models

Dynamic Congestion-Based Load Balanced Routing in Optical Burst-Switched Networks

CTS2134 Introduction to Networking. Module 07: Wide Area Networks

IP Core Transport Network

Network Structure or Topology

PART II. OPS-based metro area networks

AN OVERVIEW OF QUALITY OF SERVICE COMPUTER NETWORK

Optimal Mixtures of Different Types of Recovery Schemes in Optical Networks

System Interconnect Architectures. Goals and Analysis. Network Properties and Routing. Terminology - 2. Terminology - 1

EECC694 - Shaaban. Transmission Channel

Extended Distance SAN with MC/ServiceGuard Opens New Disaster Recovery Opportunities

1 Which network type is a specifically designed configuration of computers and other devices located within a confined area? A Peer-to-peer network

Transcription:

Network Survivability Basic concepts Classification of survivability schemes Protection, restoration SONET ring protection UPSR, BLSR/2, BLSR/4 Optical ring protection OCh, OMS DPRing, SPRing two-fiber, four-fiber Optical mesh protection Shared line protection Dedicated path protection Shared path protection SRLG protection Failures Network survivability Ensure that network continues to carry traffic in the event of a failure Failure types Fiber cut Duct cut Transceiver failure Node failure Failure rates Cable-cut rate: 1-5 cuts/year/1000 km Tx failure rate: 10867 FIT Rx failure rate: 4311 FIT FIT = Failure In Time: average number of failures in 10 9 hours Repair time Equipment MTTR (mean time to repair): 2 hours Cable-cut MTTR: 12 hours Example: Cable with 100 strands of fiber Each fiber with 100 wavelengths Each wavelength at 10 Gbps (OC-192) 100 Tbps of data loss Other failures Software Higher layer equipment (IP router) Failure Scenarios Terminology Single link or node failure At most one failure occurs at a time Multiple link or node failure Multiple near-simultaneous failures occur Shared Risk Group (SRG) or Shared-Risk-Link Group (SRLG) Single risk factor affects multiple links or nodes e.g., multiple fibers in a cable or duct Restoration time Instant that a connection goes down to instant that traffic is restored Restoration success rate Ratio between number of successfully restored traffic streams and number of disrupted traffic streams after a failure Availability Probability that a system (connection) will be operating and available at some arbitrary time in the future e.g., five 9 s availability = 99.999% Reliability Probability that system (connection) will operate without disruption for predefined period of time Protection vs Restoration Classification - Protection and Restoration Protection Backup resources (routes and wavelengths) pre-computed and reserved in advance Faster recovery time Guaranteed recovery ability Restoration When a failure occurs, a backup route and a wavelength are discovered dynamically More resource efficient 1

Classification of Protection Schemes Protection layer IP (GMPLS), SONET, optical-layer (wavelength, waveband), fiber Path vs. line/link Path protection: end-to-end per-channel protection (GMPLS path, SONET circuit, lightpath) Line/link protection: handles all channels on given network link Dedicated vs. shared Dedicated: 1 to 1 correspondence between backup resources and working resources Easier management, faster protection switching More resource intensive Shared: given backup resources are shared by multiple working resources Better resource efficiency Complex management, slower protection switching Revertive vs. non-revertive Revertive: when fault is repaired, switch traffic back to primary resources Non-revertive: when fault is repaired, primary resources become back-up resources Ring vs. mesh Dedicated Path Protection 1+1 protection Each working path has link-disjoint backup path Data transmitted over both paths simultaneously Best signal selected at destination 1:1 protection Each working path has link-disjoint backup path Data transmitted only over working path until failure occurs Unused backup path resources may be used for unprotected traffic Requires additional signaling to notify source node 50% of network resources used for protection Shared Path Protection Link/Line Protection 1:N Protection resources shared by N working paths Reduces resource usage May not be able to handle multiple failures Protects all traffic on a failed link Handled by the two end nodes of the link Routes all traffic around the failed link Faster, but less resource efficient than path protection Dedicated: 1+1, 1:1 Shared: 1:N. Span Protection Static Protection vs. Dynamic Protection Span: Multiple fibers bundled in a conduit. Similar to link protection. Static All connection requests are available Find routing and allocate resources for working and backup paths/links Minimize resource usage Dynamic Connection requests arrive one at a time and each connection exists for only a finite duration For each request, find routes and allocate resources for working and backup paths/links Minimize blocking, minimize resource usage 2

SONET Ring Protection SONET Layers Automatic Protection Switching (APS) protocols Self-healing rings UPSR Unidirectional Path-Switched Ring 2 fibers Path-based Dedicated 1+1 BLSR/2 Bidirectional Line-Switched Ring with 2 fibers Line-based Shared BLSR/4 Bidirectional Line-Switched Ring with 4 fibers Line-based Shared Path End-to-end connection Line Multiplexing and synchronization Section Regenerator, error monitoring, framing, etc. Photonic Lasers, LEDs, etc. SONET Layers UPSR Unidirectional Path- Switched Ring Two Fibers clockwise working counterclockwise protection Path level protection 1+1 dedicated protection 50% network capacity required for protection Bi-directional connection requires 1 unit of bandwidth on all fibers of ring Popular for low speed local exchange and access network (OC-3 and OC-12) Simple UPSR SONET BLSR Working traffic in both directions Line layer protection Protection capacities can be shared 50% of network capacity for protection spatial reuse of bandwidth on ring Widely deployed in long-haul and inter-office network BLSR/2 2 fibers half the capacity of each fiber is reserved for protection Shortest path routing Complex signaling BLSR/4 4 fibers One working fiber and one protection fiber in each direction Shortest path routing Span switching on transmitter or receiver failure Ring switching on node failure or fiber cut Most complex signaling 3

BLSR/2 BLSR/4 Span Switching BLSR/4 Ring Switching Optical Layer Protection Optical Channel (OCh) Protection Operate on individual light path Path protection Cost may be higher than OMS protection Dedicated: OCh-DPRing Shared: OCh-SPRing Optical Multiplex Section (OMS) Protection Operate on all wavelengths in a fiber link Link/span protection Dedicated: OMS-DPRing Shared: OMS-SPRing Can use two or four fibers per link Similar to SONET UPSR OCh-DPRing OCh-SPRing Shared path protection OADM OADM OADM OADM 4

OMS-DPRing Dedicated link protection Not very practical Each wavelength supports at most one unidirectional lightpath OMS-SPRing Shared link protection Similar to SONET BLSR On 4 fiber configuration, both span switching and ring switching are supported (for single-fiber failure or entire-link failure) 4-Fiber OMS-SPRing Mesh Network vs Ring Network Why Mesh? More flexible Especially suitable for dense network Require complex management and signaling Two Variations: Line/Span Protection Utilize cycles for protection Node cover Ring cover P-cycle Path Protection Utilize disjoint backup path for protection Dedicated Shared Static Link/Span Protection Static Link/Span Protection Minimum Node Cover: One cycle covering all nodes. Protect any single link failure May have to go through a very long route Ring Cover: Every link is covered by two cycles (one in each direction) For planar graph, one solution is: one cycle on outer face of graph one cycle on each inner face of graph May not be resource efficient. 5

Static Link/Span Protection Static Link/Span Protection Ring Cover: P-cycle: Pre-configured cycles Extended from ring cover and minimum node cover Multiple cycles in the network Cycles may protect straddling links More resource efficient Dedicated Path Protection in Single Wavelength Networks Static Dedicated Path Protection Problem Two edge or/and node disjoint paths for each connection Requires 50% network capacity for protection Static problem Requests known in advance Satisfy all requests while minimizing resource usage Dynamic problem Requests arrive one at a time For each requests, find route and resources, while minimizing blocking All connection requests are available Routing and wavelength assignment problem must be solved for each connection request, both the primary paths and backup paths Objective: minimize resources NP-hard problem Solved by Integer Linear Programming (ILP) or through heuristics ILP for Static Dedicated Path Protection Heuristics for Static Dedicated Path Protection Given: set of connections, physical topology Find: set of working and protection lightpaths Objective: minimize resources used Constraints: For each connection request (s,d), there is exactly one primary path and one protection path exiting s and entering d Primary path and its protection path must be link/node disjoint Flow conservation: Number of paths entering a node must be equal to the ones exiting it. A wavelength on a link can only be used by one primary path or one backup path When a link i->j fails, the number of lightpaths (primary + protection) failed between source-destination pair (s, d) should not exceed the demand between them. Typically consider requests one at a time For each request, find primary and backup path Two-step approach: find primary first, then find backup One-step approach: find both primary and backup simultaneously 6

Alternate Path Algorithm Simple Two-Step Algorithm Fixed set of possible working paths for each s-d pair Fixed set of possible protection paths for each working path Choose working path and link disjoint protection path Example: W paths in working group, B paths in backup group. Paths within a group are not link disjoint. Paths in different groups are link disjoint. Run shortest path algorithm to find a working path Remove links on working path Run shortest path algorithm again to find protection path Two Step Algorithm Simple Two-Step Algorithm Problems: The combined cost of the two paths may not be minimal May not find a pair for some topologies trap topology Simple Two-Step Algorithm Iterative Simple Two-Step Algorithm One solution: Iterative Simple Two-Step Algorithm Take K shortest paths from s to d. Run the Two-Step Algorithm for each of the shortest path until certain criterion are met. 3 shortest paths from s to d : Red, Blue and Green Run Simple Two-Step Algorithm for each of them 7

Integrated Algorithm Suurballe s Integrated Algorithm Run shortest path algorithm from s to d Set the costs of the links along the shortest path to negative Reverse direction of links along the shortest path Run shortest path algorithm again Remove overlapped links and combine the remaining links Minimizes combined cost of working and protection paths Integrated Algorithm Integrated Algorithm Integrated Algorithm Integrated Algorithm 8

Dedicated Path Protection in Multi-Wavelength Networks Dedicated Path Protection in Multi-Wavelength Networks Now things get a bit more tricky Each link has more than one wavelength No wavelength converter Working path and protection path may be on different wavelengths e Suurballe s algorithm doesn t work. Simple Two-Step algorithm and Iterative Two-Step algorithm are OK but do not give the best results. It is a NP-complete problem! Numerous heuristics exist. The Simple Two-Step is one of them. s a b d e f s a b d f Dedicated Path Protection in Networks with Shared-Risk Link Groups (SRLGs) Dedicated Path Protection in Networks with SRLGs Another variation of tricky networks Single wavelength or multi-wavelength with wavelength converters Multiple links may fail simultaneously Examples: Lightpaths routed over same physical fiber Lightpaths routed over different fibers but all the fibers are bundled or routed over same terrain Shared-Risk Link Group: Set of physical or logical links that are affected by the same failure event Solutions exist for special topologies NP-Complete if risks are randomly distributed in the network One heuristic solution: Run Suurballe s algorithm first Run Simple Two-Step Algorithm for each of the two paths under the SRLG disjoint constraint Pick the pair with the minimal combined cost Static Shared Path Protection Problem ILP for Static Shared Path Protection All connection requests are available. Routing and wavelength assignment problem must be solved for each connection request, both the primary paths and backup paths Two primary paths may share resources on their backup paths as long as the primary paths are link disjoint NP-complete problem Solved by Integer Linear Programming (ILP) or through heuristics Given: set of connections, physical topology Find: set of working and protection lightpaths Objective: minimize resources used Constraints: Applicable primary protection constraints Two lightpaths protected by the same wavelength w on the same link p->q cannot go through the same link i -> j Wavelength w on link i->j can only be utilized by either a primary path or by protection paths A primary path should not have more than one protection path 9

Dynamic Shared Path Protection More bandwidth efficient More complex than dedicated protection Additional shared protection constraints: The number of shared protection paths on one wavelength of a link can not exceed the maximum. Two working paths that have common SRLGs can not share protection paths Set up working and protection paths to maximize shared capacity: Set the cost of the links on existing protection path to zero. Shared Path Protection Heuristics Centralized Central controller maintains for each wavelength on each link Available Reserved for working path Reserved for protection path Set of working paths that use this wavelength on this link for protection Distributed with full information Each node maintains information as in centralized case Information distributed to nodes via flooding (e.g., OSPF) Distributed with limited information Each node maintains for each wavelength on each link Available Reserved Each node maintains for each wavelength on its own links Available Reserved for working path Reserved for protection path Set of working paths that use this wavelength on this link for protection Route calculation done by source node Sharing and wavelength assignment decisions for protection path made on a hop-by-hop basis Other Protection Problems Shared protection with SRLG constraint Minimum-risk path problems Maximum survivability problems 10