A Task-Based Adaptive-TTL approach for Web Server Load Balancing *

Similar documents

How To Balance A Web Server With Remaining Capacity

Load Balancing of DNS-Based Distributed Web Server Systems with Page Caching

Global Server Load Balancing

Back-End Forwarding Scheme in Server Load Balancing using Client Virtualization

GLOBAL SERVER LOAD BALANCING WITH SERVERIRON

CS514: Intermediate Course in Computer Systems

International Journal of Combined Research & Development (IJCRD ) eissn: x; pissn: Volume: 2; Issue: 5; May -2014

DATA COMMUNICATOIN NETWORKING

LOAD BALANCING MECHANISMS IN DATA CENTER NETWORKS

Efficient DNS based Load Balancing for Bursty Web Application Traffic

Load Balancing of Web Server System Using Service Queue Length

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

FortiBalancer: Global Server Load Balancing WHITE PAPER

Lecture 8a: WWW Proxy Servers and Cookies

A JAVA TCP SERVER LOAD BALANCER: ANALYSIS AND COMPARISON OF ITS LOAD BALANCING ALGORITHMS

Lecture 8a: WWW Proxy Servers and Cookies

Web Caching and CDNs. Aditya Akella

DNS ROUND ROBIN HIGH-AVAILABILITY LOAD SHARING

Combining Global Load Balancing and Geo-location with Emissary TM

Load Balancing Web Applications

SHIV SHAKTI International Journal of in Multidisciplinary and Academic Research (SSIJMAR) Vol. 4, No. 3, June 2015 (ISSN )

Fair load-balance on parallel systems for QoS 1

HUAWEI OceanStor Load Balancing Technical White Paper. Issue 01. Date HUAWEI TECHNOLOGIES CO., LTD.

EVALUATION OF LOAD BALANCING ALGORITHMS AND INTERNET TRAFFIC MODELING FOR PERFORMANCE ANALYSIS. Arthur L. Blais

Content Delivery Networks

AN EFFICIENT LOAD BALANCING ALGORITHM FOR A DISTRIBUTED COMPUTER SYSTEM. Dr. T.Ravichandran, B.E (ECE), M.E(CSE), Ph.D., MISTE.,

Application-layer protocols

Alteon Global Server Load Balancing

Global Server Load Balancing

Optimization for QoS on Web-Service-Based Systems with Tasks Deadlines 1

Web Application Hosting Cloud Architecture

Fast Web Page Allocation On a Server Using Self- Organizing Properties of Neural Networks

Adaptable Load Balancing

MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?

Protagonist International Journal of Management And Technology (PIJMT)

Performance Comparison of Assignment Policies on Cluster-based E-Commerce Servers

Scalability of web applications. CSCI 470: Web Science Keith Vertanen

Citrix NetScaler Global Server Load Balancing Primer:

Internet Technologies. World Wide Web (WWW) Proxy Server Network Address Translator (NAT)

CONTENT of this CHAPTER

S y s t e m A r c h i t e c t u r e

Global Server Load Balancing (GSLB) Concepts

Abstract. 1. Introduction

LOAD BALANCING AS A STRATEGY LEARNING TASK

Content Distribu-on Networks (CDNs)

Communications and Networking

Load balancing using Remote Method Invocation (JAVA RMI)

Multicast-based Distributed LVS (MD-LVS) for improving. scalability and availability

DEPLOYMENT GUIDE Version 1.1. DNS Traffic Management using the BIG-IP Local Traffic Manager

Optimization of Cluster Web Server Scheduling from Site Access Statistics

CDN Brokering. Content Distribution Internetworking

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

CheckPoint Software Technologies LTD. How to Configure Firewall-1 With Connect Control

Content Delivery Networks

SiteCelerate white paper

Globule: a Platform for Self-Replicating Web Documents

How To Understand The Power Of A Content Delivery Network (Cdn)

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

Development of Software Dispatcher Based. for Heterogeneous. Cluster Based Web Systems

Distributed Systems LEEC (2005/06 2º Sem.)

Self Reconfigurable Distributed Load Balancing For Secure and Privacy-Preserving Information Brokering.

DNS records. RR format: (name, value, type, TTL) Type=NS

Creating Web Farms with Linux (Linux High Availability and Scalability)

Network Troubleshooting

Comparison of Load Balancing Strategies on Cluster-based Web Servers

How To Run A Web Farm On Linux (Ahem) On A Single Computer (For Free) On Your Computer (With A Freebie) On An Ipv4 (For Cheap) Or Ipv2 (For A Free) (For

Outline. Mariposa: A wide-area distributed database. Outline. Motivation. Outline. (wrong) Assumptions in Distributed DBMS

Application Latency Monitoring using nprobe

How To Load Balance On A Bgg On A Network With A Network (Networking) On A Pc Or Ipa On A Computer Or Ipad On A 2G Network On A Microsoft Ipa (Netnet) On An Ip

First Midterm for ECE374 02/25/15 Solution!!

ExamPDF. Higher Quality,Better service!

Advanced Networking Technologies

Virtual Technologies for Learning System. Chao-Wen Chan, Chih-Min Chen. National Taichung University of Science and Technology, Taiwan

Introduction to ServerIron ADX Application Switching and Load Balancing. Module 7: Global Server Load Balancing (GSLB) Revision 0310

Load Balancing a Cluster of Web Servers

LOAD BALANCING TECHNIQUES FOR RELEASE 11i AND RELEASE 12 E-BUSINESS ENVIRONMENTS

HTG XROADS NETWORKS. Network Appliance How To Guide: EdgeDNS. How To Guide

Web Server Software Architectures

The International Journal Of Science & Technoledge (ISSN X)

Keywords: Dynamic Load Balancing, Process Migration, Load Indices, Threshold Level, Response Time, Process Age.

LinkProof DNS Quick Start Guide

Using IPM to Measure Network Performance

How To Build A Large Scale Mail Transfer Protocol (Mta) System In A Distributed System

Topics. 1. What is load balancing? 2. Load balancing techniques 3. Load balancing strategies 4. Sessions 5. Elastic load balancing

Content Delivery Networks (CDN) Dr. Yingwu Zhu

Understanding Slow Start

IOS Server Load Balancing

Radware AppDirector and Juniper Networks Secure Access SSL VPN Solution Implementation Guide

Transcription:

A Task-Based Adaptive-TTL approach for Web Server Load Balancing * Devarshi Chatterjee Zahir Tari RMIT University School of Computer Science and IT Melbourne, Australia zahirt@cs cs.rmit.edu.au * Supported by ARC (ARC Discovery DP 0346545) and SUN Microsystems

Overview Motivation Technical Issues Basics & Limitations of LAN/WAN based approaches Our approach WAN-level (with HLW and server classification) LAN-level (with re-direction and queue estimation) Testing results

Why Load Balancing? Benefits Improve resource utilization Improve system performance Adding extra devices costly Architecture Independent mirrored-servers Manually selected by users Not scalable as they provide a non-transparent location binding Distributed web servers Appear as a single host. Authoritative DNS (A-DNS) Allow to control binding within a cluster, based on specific load balancing policies and information (such as TTL). Bottleneck at the A-DNS (with all the classical problems).

Load Characteristics Empirical measurements [Crovella 1998] show many short tasks, and fewer large tasks. E.g. 1998 World Soccer Cup [M. Arlitt et al., HP T.R. 1999] Traffic measurements 1.3 billions of requests in 3 month, with 90% static HTML, 2% dynamic requests. 0,08% of requests counts for 20% of data. Request distribution is heavy-tail (i.e. P[X>x] x -α ) Heavy-tailed distribution (e.g. Pareto distribution) Decreasing failure rate (i.e. the longer it runs, the longer to continue) Infinite variance Few (less than 1%) very large tasks take up more than 50% of the total task sizes

Task Distribution Heavy-tailed Pr{X>x} ~ x - α, 0 < α < 2 α 2 (moderate variability) α 1 (high variability) Simplest Heavy-tailed (Pareto( distribution) Pr{X x} = 1 (k/x) α Examples Process CPU consumption in Unix system, α 1 1 [Harchol[ 1997]. ν File transferred via HTTP, 1.1 α α 1.3 [Crovella[ 1997]. ν File transferred via FTP, 0.9 α α 1.1 [Paxson[ 1995].

Bounded Pareto distribution B(k,p,α) f(x) = (αk α / (1-(k/p) α ) x -α-1 k x p k,, the shortest possible job p,, the largest possible job 10 14 power law w/ exponent - α -1 10 13 10 8 k p 10 7 0.2 0.4 1 α 2 B(k,10 10, α)

Technical problems Estimation of the load [LAN]] Periodic updates, baysian models, stale- based approaches [Dahlin[ 2002] [WAN]] Difficult to do as DNS controls only a very tiny amount of requests. Server-based feedback crucial! Task assignment [LAN] request size distribution,, arrival rate, known/unknown size, divisible/indivisible task [WAN] request size distribution,, TTL=0 problem, client/server information Task migration/re-direction

LAN Metrics WaitingTime(T k ) FlowTime (T k ) = WaitingTime(T k ) + ProcessingTime(T k ) SlowDown (T k ) = WaitingTime(T k )/TaskSize TaskSize(T k ) WAN Cumulative Function (CF) maximum cluster utilization (e.g. the probability that the maximum cluster utilization is below a certain threshold).

WAN-based Approaches Facts De-centralized model (for routing requests) Caching at I-DNS (bypassing A-DNS) Depends on the focus Client: : uses Hidden Load Weight (HLW) [Colajanni et al., IEEE Computer 1999]. Server: similar to LAN-based, however less advance (e.g. sending load asynchronously [Yu et al., IEEE TPDS 1998]). DNS: uses A-DNS as a dispatcher [Colajanni et al., ACM Computing Survey 2002]. Much more scalable and efficient for geographically distributed web servers! Best solution is the INTEGRATION of the first two approaches within DNS-based approach.

DNS-based Architecture 5 4 Server IP Address 1 Client 1 3 Intermediate Name Servers DNS 2 KEY 1. Address request 2. Web Server selection (Address 1) 3. Address Mapping (URL Address 1) 4. Document Request (Address 1) 5. Document Response (Address 1) Server IP Address n

DNS-based Load Balancing TTL (Time To Live) Period for which a resource record (containing name to address translation) is cached. When it expires a new translation is sought. Algorithms Static All requests have the same TTL. High risk overloading servers. #types: stateless, client or sever based, combined Dynamic Varies depending on the requests, domains etc.

Stateless Various Random [Colajanni et al., Internet Computing, 1999] RR [Internet Software Consortium BIND]: BIND finds the next IP from a list of associated IP addresses. Limitations Does not take into server load and server availability Only deals with a small fractions of requests Ignores high variability in client s s requests (i.e. small # of very large tasks)

Static based Approaches Client-based Geographical location E.g. topological proximity [CISCO 1997], client-server link latency, round trip delays [Beck et al., WWW 1998] [CISCO DistributedDirectory, # of requests, using Hidden Load Weight (HLW) Basic: : RR2 [Yu et al., IEEE TPDS 1998] maintains two lists of server pools to balance Load-based: DAL (Dynamic Accumulated Load) [Cardellini et al., COMPSAC 1998], where A-DNS Kind of Load Index based on HLW bin@server (as cumulated HLW from various domains) lowest bin,, and increases the value by the HLW of the requests

Static based Approaches Server-based Compute load (in a certain way?!?) Send it synchronously (i.e. periodically, approx 8 or 16s) or asynchronously (with 2 alarms) to A-DNS, excluding overloaded servers. Requests routed in LLF-like at the same frequency the load is sent to A-DNS E.g. lbmnamed [Schemers, LISA 1995], however is limited as sets TTL to zero to avoid caching. Client & Server based Combination of HLW and alarms from overload servers E.g. CISCO s s DistributedDirectory uses proximity with server availability.

Dynamic based Approaches Idea Variability of TTL based on domain popularity Results in low TTL for clients making frequent queries and vice versa [Colajanni[ et al., WWW 1999] Limitations I-DNS may cache name-address mapping TTL=0 problem (I-DNS discard it, A-DNS bottleneck) Task size is not considered (Pareto distribution) TTL does not work on web browser caching

Principle Proposed Model Take into account client s s parameters Dynamic TTL assigned based on domain popularity Maximum time taken to download a document Processing capacity of servers HTTP redirection used as server side load balancing Implemented Assignment Table containing server info (computation of task size, Web server processing capability, Web server utilization) Updated every 15 seconds

Architecture Document Request / reply HTTPD Custom Logfile Task size calculator Load Collector Assignment Table Load Broadcaster Authoritative DNS Status information Web Server Name-Address Translation request Cookie Validator Web pages IP Address, Associated TTL

6 4 Server IP Address 1 Client 1 3 Intermediate Name Servers DNS 2 5 KEY 1 Address request 2 Web Server selection (Address 1) 3 Address Mapping (URL => Address 1) 4 Document Request (Address 1) 5 Server side check (cached?) 6 Document Response (Address 1) Server IP Address n

3 6 Server IP Address 1 Client 1 2 Intermediate Name Servers DNS 4 5 Server KEY 1. Address request (Intermediate Name Server) 2. Address Mapping (URL => Address 1) 3. Document Request (Cached - Address 1) 4. Web Server Selection 5. HTTP-Redirection 6. Document Response (Address 1) IP Address n

Various Testing TTL variation Response time Utilization of the cluster Sensitivity of update interval to Assignment table Effect of TTL TTL set to 15s, 0.99 probability of not overloading the cluster Cumulative function (e.g. probability of the maximum utilization is below a threshold) [Colajanni et al., WWW 1999]

CF with TTL = 15s

CF with TTL = 60s

CF with TTL = 120s

Client Perceived Delay High initial response time, better than the rest at 75% of cluster utilization 16% improvement over the rest at 80% cluster utilization

Sensitivity of Update Interval to Assignment Table Not highly dependent like dynamic TTL (DT) A high (> 60 seconds) interval caused independence to be affected

Conclusion Client-server algorithms have better performance DLB-TS insensitive to the change of TTL, whereas Adaptive-TTL fails at high TTL DLB-TS lowers the client perceived delay under high load