MPLS Traffic Engineering with Auto-Bandwidth: Operational Experience and Lessons Learned. Mohan Nanduri Mark Kasten Naoki Kitajima Microsoft



Similar documents
MPLS Traffic Engineering with Auto-Bandwidth: Operational Experience and Lessons Learned. Mohan Nanduri Mark Kasten Naoki Kitajima

MPLS RSVP-TE Auto-Bandwidth: Practical Lessons Learned

MPLS RSVP-TE Auto-Bandwidth: Practical Lessons Learned. Richard A Steenbergen <ras@gt-t.net>

MAXIMIZE BANDWIDTH UTILIZATION WITH JUNIPER NETWORKS TE++

PCEP Extensions for MPLS-TE LSP Automatic Bandwidth Adjustment with Stateful PCE

MPLS for Dummies. Richard A Steenbergen <ras@nlayer.net> nlayer Communications, Inc.

SDN IN WAN NETWORK PROGRAMMABILITY THROUGH CENTRALIZED PATH COMPUTATION. 1 st September 2014

Juniper Networks NorthStar Controller

Managing Dynamic Configuration

Top 3 Issues and Questions (in Network Monitoring!) Developing a Network Monitoring Architecture! infotex. Dan Hadaway CRISC Managing Partner, infotex

Configuring WMI Performance Monitors

Task Manager. Tasks. Starting Task Manager CHAPTER

BFD. (Bidirectional Forwarding Detection) Does it work and is it worth it? Tom Scholl, AT&T Labs NANOG 45

MENTER Overview. Prepared by Mark Shayman UMIACS Contract Review Laboratory for Telecommunications Science May 31, 2001

CS 356 Lecture 25 and 26 Operating System Security. Spring 2013

VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5

GE Measurement & Control. Top 10 Cyber Vulnerabilities for Control Systems

Testing VoIP on MPLS Networks

Minder. simplifying IT. All-in-one solution to monitor Network, Server, Application & Log Data

pc resource monitoring and performance advisor

DMS Performance Tuning Guide for SQL Server

ALTIRIS Package Server

Investigation and Comparison of MPLS QoS Solution and Differentiated Services QoS Solutions

Managing Mobile Devices Over Cellular Data Networks

Ecom Infotech. Page 1 of 6

Northwestern University Dell Kace Patch Management

Voice, Video and Data Convergence > A best-practice approach for transitioning your network infrastructure. White Paper

Managed Service Plans

WhatsUp Gold v11 Features Overview

Performance Evaluation of Linux Bridge

Mike Canney Principal Network Analyst getpackets.com

Table Of Contents. - Microsoft Windows - WINDOWS XP - IMPLEMENTING & SUPPORTING MICROSOFT WINDOWS XP PROFESSIONAL...10

Chapter 28 Denial of Service (DoS) Attack Prevention

Accelerate Testing Cycles With Collaborative Performance Testing

Intelligent Policy Enforcement for LTE Networks

OptiView. Total integration Total control Total Network SuperVision. Network Analysis Solution. No one knows the value of an

OptiView. Total integration Total control Total Network SuperVision. Network Analysis Solution. No one knows the value of an

An Architecture for Application-Based Network Operations

Configuring NetFlow Secure Event Logging (NSEL)

Network Monitoring. RMON-Based vs. Localized Analysis. White paper. w w w. n i k s u n. c o m

HOW TO EVALUATE AND SELECT TOOL A HIGH-END LOAD TESTING. Marquis Harding Reality Test P R E S E N T A T I O N. Presentation. Bio

Performance Management in a Virtual Environment. Eric Siebert Author and vexpert. whitepaper

Caching SMB Data for Offline Access and an Improved Online Experience

ITIL A guide to Event Management

VI Performance Monitoring

Cox Business Internet Performance Management Reporting Quick Reference Guide

Better management of large-scale, heterogeneous networks toward a programmable management plane

Chapter 15: Advanced Networks

Common Core Network Readiness Guidelines Is your network ready? Detailed questions, processes, and actions to consider.

Picasso Recommendation

Protecting SQL Server Databases Software Pursuits, Inc.

White Paper. The Ten Features Your Web Application Monitoring Software Must Have. Executive Summary

Performance Tuning Guide for ECM 2.0

MySQL Enterprise Edition Most secure, scalable MySQL Database, Online Backup, Development/Monitoring Tools, backed by Oracle Premier Lifetime Support

How To Provide Qos Based Routing In The Internet

A Talari Networks White Paper. Turbo Charging WAN Optimization with WAN Virtualization. A Talari White Paper

Site2Site VPN Optimization Solutions

Comparing MPLS and Internet Links for Delivering VoIP Services

Using WebLOAD to Monitor Your Production Environment

Fundamentals of VoIP Call Quality Monitoring & Troubleshooting. 2014, SolarWinds Worldwide, LLC. All rights reserved. Follow SolarWinds:

Best Practices from Deployments of Oracle Enterprise Operations Monitor

High Availability for Internet Information Server Using Double-Take 4.x

Best Practices for NetFlow/IPFIX Analysis and Reporting

How To Manage A Network With A Network Management System (Qoe)

Distributed applications monitoring at system and network level

Smart Queue Scheduling for QoS Spring 2001 Final Report

Wireless Technologies for the 450 MHz band

Red Hat Network: Monitoring Module Overview

NETWORK ISSUES: COSTS & OPTIONS

Troubleshooting Common Issues in VoIP

REMOTE BACKUP-WHY SO VITAL?

INTERNET CONNECTIONS FOR VIEWPOINT CLOUD COMPUTING

Are Duplicate Packets Interfering with Network Monitoring? White Paper

Introduction to Performance Measurements and Monitoring. Vinayak

Overcoming Obstacles to Retail Supply Chain Efficiency and Vendor Compliance

Hands-On Microsoft Windows Server 2008

Migrating SaaS Applications to Windows Azure

Application-Centric Analysis Helps Maximize the Value of Wireshark

Web Security Log Server Error Reference

ERP on HANA Suite Migration. Robert Hernandez Director In-Memory Solutions SAP Americas

Consequences of Poorly Performing Software Systems

MANAGING NETWORK COMPONENTS USING SNMP

CS640: Introduction to Computer Networks. Why a New Service Model? Utility curve Elastic traffic. Aditya Akella. Lecture 20 QoS

HP LeftHand SAN Solutions

Troubleshooting VoIP and Streaming Video Problems

PRODUCT OVERVIEW SUITE DEALS. Combine our award-winning products for complete performance monitoring and optimization, and cost effective solutions.

Latency Analyzer (LANZ)

Production Balance Management

ITIL A guide to event management

GRAND ERIE DISTRICT SCHOOL BOARD

Transcription:

MPLS Traffic Engineering with Auto-Bandwidth: Operational Experience and Lessons Learned Mohan Nanduri Mark Kasten Naoki Kitajima Microsoft February 29, 2012

Agenda Offline vs. Auto-Bandwidth Auto-Bandwidth basics and overview Operational Experience Lessons Learned Configuration

MPLS LSP Bandwidth Adjustment Few ways to accomplish this: Offline calculation Calculating paths using offline tool Periodically adjusting bandwidth value based on measured 95 percentile traffic plus adding additional buffer for growth Auto-Bandwidth Rely on devices to measure traffic and adjust bandwidth

MPLS Auto-Bandwidth Terminology Adjust-interval Frequency at which LSP bandwidth is adjusted Adjust-threshold Change in LSP bandwidth utilization percentage that triggers new LSP calculation Minimum and Maximum Bandwidth Minimum and Maximum amount of bandwidth Overflow and Underflow Number of consecutive bandwidth overflow/underflow samples before triggering bandwidth adjustment Monitor Bandwidth Monitor usage of bandwidth without triggering adjustment

MPLS Auto-Bandwidth Overview Automates the process of monitoring and adjusting LSP bandwidth MPLS auto-bandwidth measures the traffic flows through the LSP, adjusting the bandwidth based on measured traffic and defined parameters on per-lsp basis LSPs are treated independently to adjust bandwidth without regard for any adjustments previously made or pending for other LSPs

MPLS Auto-Bandwidth Overview Auto-Bandwidth Calculation mechanism: Bandwidth adjustment is based on per-lsp measured traffic utilization and chooses the highest value during the adjust interval At the end of each interval the highest measured bandwidth value is compared with the existing LSP bandwidth value If the delta between existing LSP bandwidth and highest measured bandwidth value is equivalent or greater than adjust-threshold value, LSP tries to re-signal with the new bandwidth, otherwise the LSP bandwidth remains the same Make-before-break mechanism kicks into place and LSP resignals for the new bandwidth. Once it s successful in setting up the LSP, the maximum bandwidth value is reset to zero If the new bandwidth cannot be setup, LSP uses the existing bandwidth value The above process is repeated for the next adjust-interval

MPLS Auto-Bandwidth With Adjust-Interval Adjust-Interval Adjust-Interval Adjust-Interval Adjust-Interval Adjust-Interval Max Bw = 100 Mbps LSP Banwidth signaled during this period is 100 Mbps Max Bw = 150 Mbps LSP Banwidth signaled during this period is 150 Mbps Bandwidth in Mbps (X) Max Bw = 50 Mbps LSP Banwidth signaled during this period is 50 Mbps Max Bw = 30 Mbps LSP Banwidth signaled during this period is 30 Mbps Time (T)

MPLS Auto-Bandwidth With Adjust-Interval The highest measured bandwidth value is 100 Mbps for the given adjust-interval period, LSP is signaled with 100 Mbps and its reset to zero As the new adjust-interval starts, the highest measured bandwidth value is 50 Mbps. LSP is signaled with 50 Mbps if the adjust-threshold value is met

MPLS Auto-Bandwidth With Overflow Configured Adjust-Interval Overflow Counter = Y Adjust-threshold Bandwidth in Mbps (X) Measure BW = 100 Mbps LSP Banwidth signaled at 100 Mbps LSP BW = 80 Mbps Time (T) LSP signals with new BW before the configured adjust-interval: 1) If the delta between measured and LSP signaled BW adjust-threshold value 2) If it reaches the configured overflow count..

MPLS Auto-Bandwidth With Overflow If the highest measured bandwidth value is higher than the current LSP signaled bandwidth value The delta between highest measured bandwidth value and current LSP signaled bandwidth value is greater than or equal to configured adjust-threshold value The number of occurrences must reach the configured overflow count consecutively. Overflow counter resets to the configured overflow value if the bandwidth decreases during one the sampling cycles LSP is signaled with the new bandwidth value before the configured adjust-interval Adjust-interval timer is reset to configured value.

Benefits of Auto-Bandwidth Network can react faster to sudden burst of traffic in near real-time and not rely on manually intervention Effective use of bandwidth resources by minimizing the over-subscription/padding of LSP bandwidth Maximizes the usage of available bandwidth and optimizes the network effectively to use more preferred, shorter latency, paths first.

Why auto-bandwidth for us? Using offline bandwidth calculation based on historical traffic data was slow, cumbersome, and didn't support real time growth Varying services with different traffic profiles Lagging behind bandwidth adjustments Not reacting to sudden spikes in traffic shifts

Operational Experience Some skepticism initially letting the device make decisions and automatically adjust bandwidth Auto-bandwidth deployed since early 2009 Phased rollout of auto-bandwidth globally Extensive analysis was performed on network after auto-bandwidth was introduced to optimize Significant churn by smaller sized LSPs Tiered LSP model was introduce to reduce churn RSVP interface subscriptions were modified to account spike in traffic Sudden spike in traffic is still an issue Automatic LSP creation and deletion using event scripts introduced Overall Worked fairly well so far

Lessons Learned Significant amount of time was spent to understand internals, was tough to get information initially from vendor Many hours were spent testing and validating, scripts to validate saved lot of time Testing all the available knobs thoroughly helped, even though we did not plan to use them right away. We ran into multiple issues with these knobs Thorough planning and training made the deployment very smooth Post deployment correlation scripts to check traffic stats and MPLS bandwidth helped to check any anomalies

Lessons Learned Show commands lacks historical bandwidth adjustment information, have to glean information from multiple log files Multiple enhancements have been submitted to display information to make troubleshooting easier, to name a few: Show previous bandwidth adjusted values Minimum/maximum bandwidth ever used Counter to keep track of number of bandwidth adjustments Ran into some issues early on Smear timers not spacing LSPs properly Bloated traffic statistics used to setup LSPs Based on configured values, smaller LSPs were causing churn and analyzing a weeks worth of data showed more than 5000 events

Configuration groups { autobw_lsp_setup { protocols { mpls { label-switched-path <*> { ldp-tunneling; soft-preemption; admin-group include-any [ gold]; adaptive; auto-bandwidth { adjust-interval 1200; adjust-threshold 10; minimum-bandwidth 1m; maximum-bandwidth 2g; autobw_lsp_setup_1m_profile { protocols { mpls { label-switched-path <*> { ldp-tunneling; admin-group include-any [ gold ]; adaptive; auto-bandwidth { adjust-interval 1200; adjust-threshold 50; minimum-bandwidth 1m; maximum-bandwidth 50m; label-switched-path rtr1-to-rtr2 { apply-groups autobw_lsp_setup; to 10.1.1.2; admin-group exclude [bad]; label-switched-path rtr1-to-rtr3 { apply-groups autobw_lsp_setup_1m; to 10.1.1.3; admin-group exclude [bad]; mpls { statistics { file mpls_stats.log size 1m files 10; interval 60; auto-bandwidth;

Thank You