DHCP Failover: Requirements of a High-Performance System



Similar documents
DHCP Failover. Necessary for a secure and stable network. DHCP Failover White Paper Page 1

Configuration Notes 0215

Configuration Guide. DHCP Server. LAN client

Cost Savings Analysis of IP Address Management (IPAM) Software for Service Providers

Leveraging Best Practices for SolarWinds IP Address Manager

DNS must be up and running. Both the Collax server and the clients to be backed up must be able to resolve the FQDN of the Collax server correctly.

NMS300 Network Management System

Cisco Application Networking Manager Version 2.0

Avid inews Command Best Practices

Network Registrar Data Backup and Recovery Strategies

DHCPv6 Failover Update IETF85

vrealize Automation Load Balancing

Note: This case study utilizes Packet Tracer. Please see the Chapter 5 Packet Tracer file located in Supplemental Materials.

Administrator Guide VMware vcenter Server Heartbeat 6.3 Update 1

UserLock advanced documentation

Monitoring Your Network

Panorama High Availability

Setting Up the EntraPass Mirror Database and Redundant Server

Administering and Managing Log Shipping

Conquering the Challenges of IP Network Management with DHCP and DNS

DiskBoss. File & Disk Manager. Version 2.0. Dec Flexense Ltd. info@flexense.com. File Integrity Monitor

Disaster Recovery White Paper

CA ARCserve and CA XOsoft r12.5 Best Practices for protecting Microsoft SQL Server

User Manual. Onsight Management Suite Version 5.1. Another Innovation by Librestream

High Availability and Disaster Recovery Solutions for Perforce

High Availability. Palo Alto Networks. PAN-OS Administrator s Guide Version 6.0. Copyright Palo Alto Networks

Introduction. What is a Remote Console? What is the Server Service? A Remote Control Enabled (RCE) Console

Informix Dynamic Server May Availability Solutions with Informix Dynamic Server 11

orrelog Ping Monitor Adapter Software Users Manual

Whitepaper Continuous Availability Suite: Neverfail Solution Architecture

The Importance of a Resilient DNS and DHCP Infrastructure

BlackBerry Enterprise Server Version: 5.0. Monitoring Guide

NQA Technology White Paper

How To Manage My Smb Ap On Cwm On Pc Or Mac Or Ipad (Windows) On A Pc Or Ipa (Windows 2) On Pc (Windows 3) On An Ipa Or Mac (Windows 5) On Your Pc

Top 10 Reasons why MySQL Experts Switch to SchoonerSQL - Solving the common problems users face with MySQL

High Availability for Citrix XenApp

IP ADDRESS MANAGER 4.3 (IPAM)

Smart Tips. Enabling WAN Load Balancing. Key Features. Network Diagram. Overview. Featured Products. WAN Failover. Enabling WAN Load Balancing Page 1

FioranoMQ 9. High Availability Guide

Beyond Quality of Service (QoS) Preparing Your Network for a Faster Voice over IP (VoIP)/ IP Telephony (IPT) Rollout with Lower Operating Costs

Using HP Systems Insight Manager to achieve high availability for Microsoft Team Foundation Server

Privileged Access Management Upgrade Guide

High Availability Guide for Distributed Systems

Smart Business Architecture for Midsize Networks Network Management Deployment Guide

Lab Configure Syslog on AP

High Availability Solutions & Technology for NetScreen s Security Systems

INUVIKA TECHNICAL GUIDE

WhatsUp Gold v16.3 Installation and Configuration Guide

Snapt Redundancy Manual

Networking Guide Redwood Manager 3.0 August 2013

Cost-Savings Analysis of IP Address Management Software: A Guide for Service Providers

Coyote Point Systems White Paper

CA ARCserve Replication and High Availability for Windows

Windows clustering glossary

Using WhatsUp IP Address Manager 1.0

EventSentry Overview. Part I About This Guide 1. Part II Overview 2. Part III Installation & Deployment 4. Part IV Monitoring Architecture 13

Studio 5.0 User s Guide

Configuring SIP Trunk Failover in AOS

This guide provides detailed information on how to configure and use server redundancy on Yealink IP phones.

High Availability Configuration Guide

Microsoft Exchange 2003 Disaster Recovery Operations Guide

Redundant Servers. APPolo Redundant Servers User Guide. User Guide. Revision: 1.2 Last Updated: May 2014 Service Contact:

Configuring the Transparent or Routed Firewall

StarWind Virtual SAN Installation and Configuration of Hyper-Converged 2 Nodes with Hyper-V Cluster

Metalogix SharePoint Backup. Advanced Installation Guide. Publication Date: August 24, 2015

GLBP - Gateway Load Balancing Protocol

Transport and Network Layer

Cisco Active Network Abstraction Gateway High Availability Solution

SanDisk ION Accelerator High Availability

High Availability and Clustering

Guide to the LBaaS plugin ver for Fuel

Highly Available AMPS Client Programming

Chapter 12 Supporting Network Address Translation (NAT)

DNS Architecture Case Study: Resiliency and Disaster Recovery

This guide provides detailed information on how to configure and use server redundancy on Yealink IP phones.

Understanding IBM Lotus Domino server clustering

Chapter 4 Customizing Your Network Settings

Technical Bulletin 5844

Configuring the BIG-IP and Check Point VPN-1 /FireWall-1

Initial Access and Basic IPv4 Internet Configuration

Understanding DNS (the Domain Name System)

VERITAS Storage Foundation 4.3 for Windows

3.1 RS-232/422/485 Pinout:PORT1-4(RJ-45) RJ-45 RS-232 RS-422 RS-485 PIN1 TXD PIN2 RXD PIN3 GND PIN4 PIN5 T PIN6 T PIN7 R+ PIN8 R-

CA ARCserve Replication and High Availability for Windows

BlackBerry Enterprise Server

TECHNICAL PAPER. Veeam Backup & Replication with Nimble Storage

Understanding offline files

Troubleshooting PHP Issues with Zend Server Code Tracing

White Paper. The Ten Features Your Web Application Monitoring Software Must Have. Executive Summary

SIP-DECT Knowledge Base SIP-DECT System Update

Configuring Failover

Providing Patch Management With N-central. Version 7.2

Chapter 8 Router and Network Management

WARP 3.0 Table of Contents

SevOne NMS Download Installation and Implementation Guide

Executive Summary WHAT IS DRIVING THE PUSH FOR HIGH AVAILABILITY?

DATA CENTER. Best Practices for High Availability Deployment for the Brocade ADX Switch

Everything You Need to Know About Network Failover

Transcription:

DHCP Failover: Requirements of a High-Performance System A white paper by Incognito Software April, 2006 2006 Incognito Software Inc. All rights reserved. Page 1 of 6

DHCP Failover: Requirements of a High-Performance System Introduction...2 DHCP Failover Overview...2 Characteristics of Robust, High-Performance DHCP Implementations...3 Maximum Reliability with 1:1 Failover...3 Efficient IP Address Space Utilization and Server Communications...3 Ease of Server Configuration...5 Integration with Other Broadband Provisioning Functions...5 Comparison of System Functionality...5 Reliable DHCP systems are key to deploying successful VoIP services. The secondary server must react immediately and assume the role of assigning IP addresses. Introduction Reliable DHCP systems are key to deploying successful VoIP services. VoIP requires a minimum of 5-nines, 99.999% uptime, which is why DHCP failover has to work flawlessly. The IETF has released a DHCP Failover Protocol draft document, although it has not yet reached RFC status. The IETF draft represents a set of minimum requirements for DHCP failover. For maximum performance, broadband providers need to go beyond the draft specifications. Implementations of DHCP failover vary widely, and even subtle differences can have a huge impact on reliability. This paper discusses a high-performance implementation of DHCP failover, especially critical for broadband environments. DHCP Failover Overview DHCP failover involves both a primary DHCP server and a secondary backup DHCP server, which takes over DHCP duties should the primary fail. During normal operation, only the primary server performs IP address assignment while the secondary server is in standby mode. The primary server issues IP address leases, but the secondary does not. The primary server constantly transfers IP address dta to the secondary server to ensure data synchronization. Also, the two servers regularly poll each other, using stay-alive heartbeat messages, to determine their status. An interruption in this heartbeat is deemed a failure situation. If the primary server shuts down or stops communicating with the secondary server, failover mode is initiated. The secondary server must react immediately and assume the role of assigning IP addresses. The secondary server must also attempt to re-establish communications with the primary. When the primary comes back 2006 Incognito Software Inc. All rights reserved. Page 2 of 6

online, it should resume its role as primary after being updated by the secondary with the most recent IP address data. If the primary server loses communications with the secondary server, the primary continues to allocate IP addresses while trying to re-establish communications with the secondary. As a critical part of a broadband service infrastructure, DHCP failover implementations have to meet broadband needs for robustness and efficiency. Characteristics of Robust, High-Performance DHCP Implementations As a critical part of a broadband service infrastructure, DHCP failover implementations have to meet broadband needs for robustness and efficiency. There are several key features that differentiate a high-performance DHCP failover system: a 1:1 relationship between the primary and secondary servers, efficient use of IP address space and server communications, ease of server configuration, and integration with other device provisioning functions. Maximum Reliability with 1:1 Failover The IETF draft specifies n:1 failover and load-balancing, which involves one secondary as backup for multiple primary servers. The limitation with this approach is that the failure of multiple primary servers may have a wide-reaching, catastrophic impact on the entire network. A higher-performance system dedicates one secondary for each primary, a 1:1 failover relationship, where the secondary is an exact replica of the primary, including configuration information. This scheme is a true hot standby system, which ensures that no performance penalties occur during failover. A highperformance scheme uses 100% of the available IP address space for broadband service deployments. Efficient IP Address Space Utilization and Server Communications Part of the IETF draft discusses possible solutions to "two challenging scenarios that test the robustness of a DHCP failover system. However, these solutions significantly reduce a network s available IP address space due to the reservation of large blocks of IP addresses and MCLT restrictions, and also slow down communications. While the IETF draft ensures there will be no contention between servers, there is a large price to pay up to 50% of the IP addresses may have to be reserved specifically for the secondary server alone, where they sit idle most of the time, not available for broadband service deployments. An alternative, higher-performance scheme is possible it uses 100% of available IP addresses for broadband service deployments, offers high-water mark alerts to avoid IP address depletion, and leverages a faster communications channel for synchronization. 2006 Incognito Software Inc. All rights reserved. Page 3 of 6

A more efficient approach is to have both servers use the same address pools, but during failover mode, the secondary server processes addresses in an order opposite to that of the primary. Primary-Secondary Server Contention: IETF Challenging Scenario#1 DHCP servers can't communicate to each other and both are allocating IP addresses to clients. The primary and secondary servers may be functioning correctly but cannot communicate with each other due to an unrelated network failure. In this situation, both servers attempt to allocate the same IP addresses, a contention issue. To avoid this scenario, the IETF Draft suggests using disjoint IP address pools, which essentially cuts the number of available IP addresses in half by devoting specific address ranges to the secondary server. The IETF approach requires network operators to develop critical assumptions about the arrival rate of new DHCP clients during each failover period and the mean-time-to-restoration for the primary server. This determines the amount of IP address space to be reserved for the secondary server. Underestimating these parameters can result in the primary being offline and the secondary being unable to issue leases because its address pool is exhausted. Overestimating these parameters can create a persistent and very wasteful reservation of routable IP address space. A more efficient approach is to have both servers use the same address pools, but during failover mode, the secondary server processes addresses in an order opposite to that of the primary. In other words, the secondary always allocates the highest IP address possible (starting at the top of the range and working its way down to the bottom) and the primary always allocates the lowest IP address possible (bottom to top). This helps to ensure that the two servers do not allocate duplicate IP addresses. Every active IP address lease should also indicate which server the lease was allocated from. As well, if customizable high-water marks are available in the DHCP system, they can help prevent the network from running out of IP address space. When a user-specified threshold is reached, the system can trigger an emergency notification to network administrators via e-mail, SNMP traps, or online popup messages. Primary-Secondary Server Synchronization: IETF Challenging Scenario#2 primary server crashes before secondary is updated, and the requesting DHCP client goes offline and won t be discovered by a ping. The primary may crash before it can update the secondary with the most recent changes, resulting in a lack of data synchronization between servers. The approach presented in the IETF Draft has two performancerelated issues. The IETF draft suggests a single TCP connection for both data transfers and keep-alive heartbeat messages between primary and secondary servers, which can clog communications and cause false startup of DHCP failover. Also, the IETF uses a delay mechanism, called Maximum Client Lead 2006 Incognito Software Inc. All rights reserved. Page 4 of 6

To ensure the most up-to-date data synchronization between servers, there is a fast, efficient, low-overhead method that uses a less complex transport layer protocol, UDP, and a data format that has virtually zero overhead. A DHCP failover system should simplify server configuration with a GUI and built-in rules to prevent errors. To achieve the fastest return on investment, there needs to be tight integration between the DHCP failover system and additional services such as TFTP and DNS. Time (MCLT), which is the length of time an IP address lease may be renewed by either server without contacting the other. The longer this time is, the longer it will take the functioning server to recover IP addresses. The IETF approach reserves IP addresses for a prolonged period of time, thereby reducing the available IP address pool size. To ensure the most up-to-date data synchronization between servers, there is a faster, more efficient, lower-overhead method that uses a separate, less complex transport layer protocol, UDP, and a data format that has virtually zero overhead. UDP is also used for heartbeat messages but in a different thread to eliminate any blockages. (During initial synchronization of the servers, the primary server sends configuration information and IP address lease data to the secondary using a TCP connection.) Ease of Server Configuration To keep network operations efficient and productive, a DHCP failover system should offer an easy-to-use interface. It should simplify server configuration with a GUI and built-in rules to prevent configuration errors. It also needs to combine monitoring, reporting, and emergency alert features into a single user interface. Over 200 statistics may be monitored for failover operation including event times, errors, number of messages sent/received, and status. In contrast, the IETF document describes manual configuration of the secondary server using a basic command line interface. Integration with Other Broadband Provisioning Functions To achieve the fastest return on investment, there needs to be tight integration between the DHCP failover system and additional services such as TFTP, DNS, and ToD for end-to-end support of a wide variety of broadband provisioning requirements. Comparison of System Functionality The following table summarizes some differences in functionality between Incognito Software s DHCP failover system and other implementations. 2006 Incognito Software Inc. All rights reserved. Page 5 of 6

Feature Incognito Software Other Implementations Initial Setup Failover Monitoring Utilization of IP Address Space Failover Communications Changes to the DHCP service configuration on the secondary server 2 Steps with a GUI: (1) Enter the IP address of the secondary server, and (2) Click the Initiate Failover button. All further configuration and synchronization is automatic. A command line interface is also available. (1) GUI tree view, (2) emails to administrators, (3) SNMP traps, (4) notification to currently logged-in consoles, or (5) the command line interface. 100% utilization of IP address space is possible by using bottom-up allocation for the primary and top-down allocation for the secondary. Two levels of highwater marks are set on both servers to monitor address pool levels and avoid conflicts. TCP and multiple UDP threads for data synchronization and heartbeat information with virtually no overhead and no bottlenecks. Changes are permitted to the DHCP service configuration on the secondary server during failover or during normal operation. When the primary is ready and/or restored, all changes are automatically sent to the primary during synchronization. 10 steps with a command line interface: (1) Create a copy of the primary server s configuration file, (2) Transfer the file to the secondary server, (3) Import the file into the secondary, (4) Confirm the primary and secondary have identical configurations, (5) Compare text output, looking for discrepancies between the primary and secondary configurations, (6) If there are errors, resolve them and return to Step 1, or (7) Save the new configuration, (8) Reload the secondary server, (9) Wait several minutes, and (10) Verify that the primary and secondary servers are synchronized. (1) GUI menu options, (2) SNMP traps, or (3) the command line interface. Large blocks of IP address space are reserved only for the secondary server, where they go unused unless the secondary is issuing leases. MCLT reserve addresses for a particular period of time after their leases have expired. A single TCP connection for all data synchronization and heartbeat information, which can lead to delays and false failover. Not supported. Contact: Incognito Software Inc. Phone: 604.688.4332 or US/Canada toll free 800.877.1856 Fax: 604.688.4339 Email: sales@incognito.com Web: http://www.incognito.com 2006 Incognito Software Inc. All rights reserved. Page 6 of 6