High volume Internet data centers. MPLS-based Request Routing. Current dispatcher technology. MPLS-based architecture



Similar documents
MPLS Based Web Switching

Web Switching (Draft)

How To Test Performance Of A Cdn Server

Web Caching and CDNs. Aditya Akella

How To Understand The Power Of A Content Delivery Network (Cdn)

DATA COMMUNICATOIN NETWORKING

CSC2231: Akamai. Stefan Saroiu Department of Computer Science University of Toronto

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

Content Delivery Networks. Shaxun Chen April 21, 2009

CDN Brokering. Content Distribution Internetworking

Content Delivery Networks (CDN) Dr. Yingwu Zhu

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

Demand Routing in Network Layer for Load Balancing in Content Delivery Networks

CDN and Traffic-structure

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

Internet Content Distribution

On the Use and Performance of Content Distribution Networks

CS514: Intermediate Course in Computer Systems

Investigation and Comparison of MPLS QoS Solution and Differentiated Services QoS Solutions

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

Advanced Networking Technologies

Content Distribu-on Networks (CDNs)

Web Application Hosting Cloud Architecture

FAQs for Oracle iplanet Proxy Server 4.0

The Measured Performance of Content Distribution Networks

Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Evaluating User-perceived Benefits of Content Distribution Networks

SiteCelerate white paper

DNS, CDNs Weds March Lecture 13. What is the relationship between a domain name (e.g., youtube.com) and an IP address?

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

Evaluating Cooperative Web Caching Protocols for Emerging Network Technologies 1

Distributed Systems 19. Content Delivery Networks (CDN) Paul Krzyzanowski

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Scalable Internet Services and Load Balancing

Global Server Load Balancing

Meeting Worldwide Demand for your Content

Hierarchical Content Routing in Large-Scale Multimedia Content Delivery Network

Content-Aware Load Balancing using Direct Routing for VOD Streaming Service

Internet Content Distribution

Data Center Content Delivery Network

How To Provide Qos Based Routing In The Internet

International Journal of Combined Research & Development (IJCRD ) eissn: x; pissn: Volume: 2; Issue: 5; May -2014

Week 3 / Paper 2. Bernhard Ager, Wolfgang Mühlbauer, Georgios Smaragdakis, Steve Uhlig ACM IMC 2010.

Content Delivery Networks

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

Open Issues in Content Distribution

The Domain Name Service, Etc. Jeff Chase Duke University, Department of Computer Science CPS 212: Distributed Information Systems

Efficient Parallel Distributed Load Balancing in Content Delivery Networks

An MPLS based architecture for differentiated Web service

GLOBAL SERVER LOAD BALANCING WITH SERVERIRON

A host-based firewall can be used in addition to a network-based firewall to provide multiple layers of protection.

Traffic delivery evolution in the Internet ENOG 4 Moscow 23 rd October 2012

- Multiprotocol Label Switching -

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

Server Load Balancing Using Adaptive Routing on an MPLS Enabled Network. Joseph Grzybowski

Design and implementation of an MPLS based load balancing architecture for Web switching

Computer Networks & Security 2014/2015

DESIGN AND VERIFICATION OF LSR OF THE MPLS NETWORK USING VHDL

LOAD BALANCING AS A STRATEGY LEARNING TASK

Advanced Computer Networks. Layer-7-Switching and Loadbalancing

DNS ROUND ROBIN HIGH-AVAILABILITY LOAD SHARING

ISSN Vol.06,Issue.05, August-2014, Pages:

Real-Time Analysis of CDN in an Academic Institute: A Simulation Study

How To Improve Performance On A Ccdn (Dns)

A Simulation Study of Effect of MPLS on Latency over a Wide Area Network (WAN)

CDN Brokering. Alexandros Biliris, Chuck Cranor, Fred Douglis, Michael Rabinovich, Sandeep Sibal, Oliver Spatscheck, and Walter Sturm

Superior Disaster Recovery with Radware s Global Server Load Balancing (GSLB) Solution

RA-MPLS VPN Services. Kapil Kumar Network Planning & Engineering Data. Kapil.Kumar@relianceinfo.com

Multi Protocol Label Switching (MPLS) is a core networking technology that

OpenFlow Based Load Balancing

Load balancing as a strategy learning task

Cisco Videoscape Distribution Suite Service Broker

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

The old Internet. Software in the Network: Outline. Traditional Design. 1) Basic Caching. The Arrival of Software (in the network)

Scalable Internet Services and Load Balancing

Proxy Server, Network Address Translator, Firewall. Proxy Server

Internetworking II: VPNs, MPLS, and Traffic Engineering

Single Pass Load Balancing with Session Persistence in IPv6 Network. C. J. (Charlie) Liu Network Operations Charter Communications

The Application Front End Understanding Next-Generation Load Balancing Appliances

APPLICATION NOTE 211 MPLS BASICS AND TESTING NEEDS. Label Switching vs. Traditional Routing

CDN Brokering. Alexandros Biliris, Chuck Cranor, Fred Douglis, Michael Rabinovich, Sandeep Sibal, Oliver Spatscheck, and Walter Sturm

Service Assurance Tools

Rapid IP redirection with SDN and NFV. Jeffrey Lai, Qiang Fu, Tim Moors December 9, 2015

Lecture 8b: Proxy Server Load Balancing

Analysing the impact of CDN based service delivery on traffic engineering

The Value of Content Distribution Networks Mike Axelrod, Google Google Public

FortiBalancer: Global Server Load Balancing WHITE PAPER

DNS records. RR format: (name, value, type, TTL) Type=NS

Application Delivery Networking

Domain Name System (DNS)

A Taxonomy of CDNs. Chapter Introduction. Mukaddim Pathan and Rajkumar Buyya

LARGE SCALE INTERNET SERVICES

Server Traffic Management. Jeff Chase Duke University, Department of Computer Science CPS 212: Distributed Information Systems

Computer Networks - CS132/EECS148 - Spring

Alteon Global Server Load Balancing

VPN over Satellite A comparison of approaches by Richard McKinney and Russell Lambert

Load Balancing in Mobile IPv6 s Correspondent Networks with Mobility Agents

How To Model The Content Delivery Network (Cdn) From A Content Bridge To A Cloud (Cloud)

Transcription:

MPLS-based Request Routing High volume Internet data centers MPLS-based Request Routing Arup Acharya, Anees Shaikh, Renu Tewari, Dinesh Verma IBM TJ Watson Research Center Web server cluster + front-end dispatcher direct requests based on server load, requested content, client identity, etc. dispatcher clients Next Generation Internet SoSe03 240 Next Generation Internet SoSe03 241 Current dispatcher technology MPLS-based architecture Layer-4 dispatchers route requests based on TCP/IP headers high-performance h/w implementations functionality limited to load balancing or simple affinity Layer-7 dispatchers use application information (e.g., HTTP headers) sophisticated functionality content-based routing, affinity, load-balancing scalability and performance limited by TCP connection termination Application-level gateway TCP splicing TCP connection handoff clients MPLS provides a circuit-switching service over a hop-by-hop IP network Architecture components MPLS network MPLS-enabled client-side proxy MPLS switch acting as dispatcher forward proxy MPLS-enabled network LSR LSP MPLS switch servers Desired solution sophisticated functions and flexibility high-performance control connection Next Generation Internet SoSe03 242 Next Generation Internet SoSe03 243

MPLS label stacking Label distribution to proxies label stacking: Labels typically used for expressing routing policies Use label stacking to push application-layer label Outer routing label used for switching in the network MPLS-enabled server network further improves performance MPLS-enabled network Persistent control connection between dispatcher and proxy content-based routing: URL label mapping load balancing: labels, weights, and policy client affinity: labels and timeout, start/stop URLs service differentiation: per-service class label set (e.g., gold, silver, bronze ) Dispatcher populates layer-2 label table IP pkt IP pkt L A IP pkt L A L R IP pkt L A IP pkt L A L R forward proxy ingress LSR egress LSR MPLS switch Next Generation Internet SoSe03 244 Next Generation Internet SoSe03 245 Deployment issues Summary Need wide MPLS deployment in core and edge supported by reports from large ISPs and IP service equipment vendors (e.g., for VPNs) Why install an MPLS-enabled proxy? data center and proxies in same administrative control (ISP with hosting service) ASPs with large enterprise customers and SLAs Intranet and extranet servers Scaling to many proxies and web sites limit proxy participation to high-volume client sites proxies may initiate with selected, popular sites Key advantages leverage growth of MPLS deployment in core and edge networks removes primary bottleneck of TCP termination realization in standard off-the-shelf switch hardware implements sophisticated request routing functions Requirements assign some request-routing functionality to proxies MPLS-aware proxies at the network edges implementation of control protocol for label distribution Next Generation Internet SoSe03 246 Next Generation Internet SoSe03 247

Literature and acknowledgements Content Distribution in the WWW MPLS-based Request Routing (R&D Synopsis) A. Acharya, A. Shaikh, R. Tewari, and D. Verma, Proc. Int'l Workshop on Web Caching and Content Distribution (WCW '01), June 2001. Extended version published as IBM Research Report RC 22275 http://www.research.ibm.com/people/a/aashaikh/papers/rc22275.pdf In MPLS World News http://www.mplsworld.com/archi_drafts/focus/analy-ibm.htm Motivation & Classification Web Caching Content Distribution Networks Techniques Performance Thanks to Anees Shaikh and Arup Acharya for providing their presentation! Next Generation Internet SoSe03 248 Next Generation Internet SoSe03 249 Content Distribution in the WWW: Motivation Content Distribution in the WWW WWW users use HTTP to retrieve web objects from a server Response time can be slow ( World wide wait ): Low-speed path causing low transmission delay One or more congested links cause queuing delay and packet drops Web server is overloaded Strategy: Replicate server content Direct client to best server Content distribution refers to mechanisms for: 1. Replicating content on multiple servers in the Internet 2. Providing end systems means to determine the servers with fastest response Large industry: Cisco, Lucent, Inktomi, CacheFlow etc.: provide hard-and software Akamai, AT&T etc.: provide content distribution services to providers such as CNN and Yahoo Classification: Web caching Content distribution networks P2p file sharing (extra lectures) Next Generation Internet SoSe03 250 Next Generation Internet SoSe03 251

Web caching Web caching: Motivation A web cache (proxy server) is a network entity that satisfies HTTP requests on the behalf of an origin server Cache is both a client (to the origin server) and a server (to the clients) client HTTP request HTTP response HTTP request HTTP response Proxy server HTTP request Origin server Reduce latency by avoiding slow links between client and origin server: low bandwidth links congested links Reduce traffic on links between institutional network and regional ISP. Reduce traffic on transoceanic links. Spread load of overloaded origin server to caches. An Internet dense with caches allows a content provider to offer highperformance distribution at low cost. Inexpensive server Low-bandwidth Internet connection client Origin server Next Generation Internet SoSe03 252 Next Generation Internet SoSe03 253 Design Techniques: Hierarchical Caching Cooperative Caching Regional ISP National ISP Origin servers Regional ISP Multiple sibling caches within a single ISP. One or more of the siblings could contain the requested object. Cooperation: ICP (Internet Cache Protocol): siblings send messages to each other to find a copy of object (Intercache communication) CARP (Cache Array Routing Protocol): URL space is partitioned (Hashbased Request Routing) Can have cooperating sibling caches in each ISP in each tier of a hierarchy. Each ISP can have a cache. ISPs higher in hierarchy have larger user populations higher hit rates Local ISP Local ISP Web cache Next Generation Internet SoSe03 254 Next Generation Internet SoSe03 255

Caching: Other caching terms Reverse proxy caching: Caches close to the origin server. Independent of client-side proxy caching Active Caching: Aims at caching dynamic content, e.g. personalized content Content Prefetching: Retrieve data from remote servers in anticipation of client requests Cache Digest: A summary of the contents of an Internet Object Caching Server Well-known cache systems: Harvest: introduced the idea of hierarchical caching (mainly for FTP) Squid: extended Harvest for HTTP, introduced ICP Next Generation Internet SoSe03 256 1000,000 other hosts September 11 with Web caching Web Server www.cnn.com request old content request User merlot.cis.udel.edu New Content WTC News! ISP 1000,000 other hosts - Congestion / Bottleneck - Caching Proxy Figure from J. R. Iyengar Content Distribution Networks CDNs vs. Web caches Content distribution networks (CDNs) are a mechanism to deliver content to end users on behalf of origin web sites. CDNs consist of a collection of surrogates (non-origin servers) that attempt to offload work from origin servers by delivering content on their behalf. A CDN can be regarded as a set of widely-dispersed caches but there are two major differences to web caches: Surrogates are coordinated by a mechanism that routes client requests to good surrogate Surrogates are potentially populated by other means than requests by clients For each request, the CDN locates a surrogate close to the client that serves the request. Different notions of close are: Network proximity Bandwidth availability Availability of content Low latency, e.g. choosing a lightly loaded server (or not heavily loaded server) Origin server in North America Surrogate in South America CDN distribution node Surrogate in Europe Surrogate in Asia Next Generation Internet SoSe03 258 Next Generation Internet SoSe03 259

September 11 with CDN CDN Techniques 1000,000 other users IL FL CA Web Server www.cnn.com WA request DE new content User merlot.cis.udel.edu New Content WTC News! MI NY MA 1000,000 other users Figure from J. R. Iyengar - Distribution Infrastructure - Surrogate There are two major techniques for redirecting client requests for objects served by the CDN to a particular CDN server (sometimes called Request Routing): 1. DNS redirection a. Full-site content delivery b. Partial-site content delivery 2. URL rewriting Other techniques are: Anycast (does not consider surrogate load) Transport-layer request routing: can be used in combination with DNS redirection. Next Generation Internet SoSe03 260 Next Generation Internet SoSe03 261 DNS Redirection DNS Redirection Normal DNS operation: Client: checks local cache Modified DNS: Client: checks local cache http://www.ieee.org/ http 199.172.136.40 http://www.ieee.org/ http Surrogate X Local DNS: IP-Adresse für www.ieee.org? www.ieee.org A 199.172.136.40 dns.ieee.org Local DNS: IP-Adresse für www.ieee.org? www.ieee.org Surrogate X Modified: chooses one surrogate dns.ieee.org NS 199.172.136.6 dns.ieee.org NS 199.172.136.6 Root- Nameserver Root- Nameserver: Knows dns.ieee.org Other NS that cannot resolve name Other NS that cannot resolve name Next Generation Internet SoSe03 262 Next Generation Internet SoSe03 263

DNS Redirection Full- and Partial-site Content Delivery Advantages: Simple no changes to existing protocols, clients or servers General works for all IP-based applications, independent of transport protocol used Full-site: All requests to the CDN are redirected via DNS Surrogates either serve content from their cache or forward requests to the origin server Used by Adero, Netcaching,.. Partial-site: Origin sites modifies the embedded URLs for objects (images) so that these URLs are resolved by the CDN s DNS server Actual syntax varies with the CDN. Speedera changes www.foo.com/bar.gif to foo.speedera.net/www.foo.com/bar.gif Next Generation Internet SoSe03 264 Next Generation Internet SoSe03 265 DNS redirect and TTL Problems with small TTL values DNS resource records contain Time to Live field that specifies how long a client may cache a resource record. RFC 1912 recommends TTL values of 1-5 days Nameservers typically use a TTL of 1 day. The DNS of CDNs have very small TTL values of 10-200 seconds Aim: better load balancing Small TTLs lead to problems: Clients must perform DNS lookups more frequently This can increase client latency Increased load on DNS It has been observed that in many cases the time between the HTTP GET request and the arrival of the first data packet accounts for 30-40% of the response time, the main reason is the bad performance of DNS Next Generation Internet SoSe03 266 Next Generation Internet SoSe03 267

Effectiveness of DNS-based Server Selection URL Rewriting Study by A. Shaikh, R. Tewari and M. Agrawal: Without careful TTL tuning, client latency can increase In particular, when web pages contain more embedded objects Typical client-nameserver distance is 8 or more hops. Furthermore clients and nameservers often have disjoint paths to surrogates. Latency to nameserver is poor indicator of latency to client URL Rewriting: Origin server rewrites URL links as part of dynamically generating pages to redirect clients to different servers. At resource access time, the page is dynamically rewritten with the IP address of one of the surrogates, avoiding the need for a DNS lookup. Problem: First request must be served from origin server Hybrid approach: Use URL rewriting to identify a particular server that might resolve to the IP address of another surrogate Next Generation Internet SoSe03 268 Next Generation Internet SoSe03 269 Performance of CDNs (1) Performance of CDNs (2) Study by Johnson et al. that evaluated performance of two CDNs (Akamai and Digital Island) 1. CDNs are able to succesfully provide services by avoiding significantly "bad" services as opposed to being able to pick the best ones. 2. CDN's occasionally make bad choices in picking servers for clients that have measured latencies worse than going to the original client thereby degrading service for client rather than improving them. 3. The use of CDN's actually does improve performance on average when considering both performance using the origin server as well as comparing the choice of server to other possible choices. Study by Krishnamurty et al.: CDNs offer much better performance than origin servers Significant differences in download times between different CDNs Compared the download time for a newly obtained surrogate to a fixed and the previous surrogate (i.e. effect of low TTL values): In almost all cases, the response time was better using the previous or fixed server Indicates that even worst-case client response time is generally not improved with a DNS lookup to find a new server Confirms the findings by Shaikh and Tewari that careful tuning of TTL values is important (and difficult) Next Generation Internet SoSe03 270 Next Generation Internet SoSe03 271

Literature B. Krishnamurthy, C. Wills, and Y. Zhang. On the Use and Performance of Content Distribution Networks Proceedings of SIGCOMM IMW 2001, California, November 2001. A. Shaikh, R. Tewari, M. Agrawal. On the Effectiveness of DNSbased Server Selection, Proc. IEEE INFOCOM 2001, April 2001. K. Johnson, J. Carr, M. Day, and M. F. Kaashoek. The measured performance of content distribution networks. 5th International Web Caching and Content Delivery Workshop, Lisbon, Portugal, May 2000. G. Barish and K. Obraczka. World Wide Web Caching: Trends and Techniques. IEEE Communications Magazine Internet Technology Series, May 2000. Next Generation Internet SoSe03 272