Reliable Distributed Systems



Similar documents
CSC2231: Akamai. Stefan Saroiu Department of Computer Science University of Toronto

How To Test Performance Of A Cdn Server

Web Caching and CDNs. Aditya Akella

How To Understand The Power Of A Content Delivery Network (Cdn)

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

DATA COMMUNICATOIN NETWORKING

Tema 5: Distribución de contenidos

Content Distribu-on Networks (CDNs)

Internet Content Distribution

Communications Software. CSE 123b. CSE 123b. Spring Lecture 13: Load Balancing/Content Distribution. Networks (plus some other applications)

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

CDN and Traffic-structure

From Internet Data Centers to Data Centers in the Cloud

Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

On the Use and Performance of Content Distribution Networks

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

Distributed Systems 19. Content Delivery Networks (CDN) Paul Krzyzanowski

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

Measuring CDN Performance. Hooman Beheshti, VP Technology

Global Server Load Balancing

Content Delivery Networks (CDN) Dr. Yingwu Zhu

CS514: Intermediate Course in Computer Systems

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

Content Delivery Network. Version 0.95

AKAMAI WHITE PAPER. Delivering Dynamic Web Content in Cloud Computing Applications: HTTP resource download performance modelling

The Measured Performance of Content Distribution Networks

High volume Internet data centers. MPLS-based Request Routing. Current dispatcher technology. MPLS-based architecture

Final for ECE374 05/06/13 Solution!!

Open Issues in Content Distribution

Homework 2 assignment for ECE374 Posted: 02/21/14 Due: 02/28/14

Content Delivery Networks

Traffic delivery evolution in the Internet ENOG 4 Moscow 23 rd October 2012

A Precise and Efficient Evaluation of the Proximity between Web Clients

First Midterm for ECE374 02/25/15 Solution!!

Content Delivery Networks. Shaxun Chen April 21, 2009

Akamai CDN, IPv6 and DNS security. Christian Kaufmann Akamai Technologies DENOG 5 14 th November 2013

Overview. Tor Circuit Setup (1) Tor Anonymity Network

Rapid IP redirection with SDN and NFV. Jeffrey Lai, Qiang Fu, Tim Moors December 9, 2015

The Value of Content Distribution Networks Mike Axelrod, Google Google Public

Internet Content Distribution

Front-End Performance Testing and Optimization

COMP 631: COMPUTER NETWORKS. How to distribute content without requiring centralized, heavy-duty servers? Peer-to-peer content distribution

Content Delivery Networks

Computer Networks - CS132/EECS148 - Spring

ICP. Cache Hierarchies. Squid. Squid Cache ICP Use. Squid. Squid

The Effectiveness of Request Redirection on CDN Robustness

DNS ROUND ROBIN HIGH-AVAILABILITY LOAD SHARING

Demand Routing in Network Layer for Load Balancing in Content Delivery Networks

OpenFlow Based Load Balancing

DNS, CDNs Weds March Lecture 13. What is the relationship between a domain name (e.g., youtube.com) and an IP address?

The Web History (I) The Web History (II)

GLOBAL SERVER LOAD BALANCING WITH SERVERIRON

Hashing in Networked Systems

Load Balancing in Distributed Web Server Systems With Partial Document Replication

Update logo and logo link on A Master. Update Date and Product on B Master

Computer Networking LAB 2 HTTP

If you are considering a cloud infrastructure, your first questions should be:

bbc Adobe LiveCycle Data Services Using the F5 BIG-IP LTM Introduction APPLIES TO CONTENTS

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

Dynamic Content Acceleration: Lightning-Fast Web Apps with Amazon CloudFront and Amazon Route 53

Advanced Computer Networks. Layer-7-Switching and Loadbalancing

LOAD BALANCING TECHNIQUES FOR RELEASE 11i AND RELEASE 12 E-BUSINESS ENVIRONMENTS

SiteCelerate white paper

EE 7376: Introduction to Computer Networks. Homework #3: Network Security, , Web, DNS, and Network Management. Maximum Points: 60

Akamai CDN, IPv6 and DNS security. Christian Kaufmann Akamai Technologies APNIC th August 2013

Web Page Response Time

Web Application Hosting Cloud Architecture

The Domain Name Service, Etc. Jeff Chase Duke University, Department of Computer Science CPS 212: Distributed Information Systems

Responsive Web Design: A Three-Screen Study in Site Performance

Load Balancing Web Applications

First Midterm for ECE374 03/24/11 Solution!!

DNS records. RR format: (name, value, type, TTL) Type=NS

The State of the Address. Mike Afergan Rob Beverly January 27, 2005

FortiBalancer: Global Server Load Balancing WHITE PAPER

Homework Assignment #3 Due 11/20 at 5:00pm EE122 Fall 2012

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

How To Load Balance On A Bgg On A Network With A Network (Networking) On A Pc Or Ipa On A Computer Or Ipad On A 2G Network On A Microsoft Ipa (Netnet) On An Ip

Week 3 / Paper 2. Bernhard Ager, Wolfgang Mühlbauer, Georgios Smaragdakis, Steve Uhlig ACM IMC 2010.

The Effect of Caches for Mobile Broadband Internet Access

COMP 361 Computer Communications Networks. Fall Semester Midterm Examination

Project #2. CSE 123b Communications Software. HTTP Messages. HTTP Basics. HTTP Request. HTTP Request. Spring Four parts

CDN Brokering. Content Distribution Internetworking

Handling Flash Crowds From Your Garage

First Midterm for ECE374 03/09/12 Solution!!

Drafting Behind Akamai (Travelocity-Based Detouring)

1 Introduction to Internet Content Distribution

Content Delivery Networks

1 Introduction: Network Applications

Global Server Load Balancing

CDN Brokering. Alexandros Biliris, Chuck Cranor, Fred Douglis, Michael Rabinovich, Sandeep Sibal, Oliver Spatscheck, and Walter Sturm

FortiGate Multi-Threat Security Systems I

Effect of anycast on K-root

An In-depth study of Mobile Browser Performance

How To Model The Content Delivery Network (Cdn) From A Content Bridge To A Cloud (Cloud)

Broadband Quality Test Plan

EECS 489 Winter 2010 Midterm Exam

Scaling with Zeus Global Load Balancer

Transcription:

Reliable Distributed ystems A glimpse into the world of ontent Distribution Networks Based on a slide set developed by Prof. Paul Francis

ontent Routing Principle (a.k.a. ontent Distribution Network) enter enter Backbone IP Backbone IP Backbone IP IP IP IP ite ites

ontent Routing Principle (a.k.a. ontent Distribution Network) enter enter O ontent Origin here at Origin erver Backbone IP Backbone IP Backbone IP IP ite IP IP ites ontent ervers distributed throughout the Internet

ontent Routing Principle (a.k.a. ontent Distribution Network) enter enter O Backbone Backbone Backbone IP IP IP IP IP IP ite ites ontent is served from content servers nearer to the client

Two basic types of DN: cached and pushed enter enter O Backbone Backbone Backbone IP IP IP IP IP IP ite ites

ached DN enter enter O 1. lient requests content. Backbone Backbone Backbone IP IP IP IP IP IP ite ites

ached DN enter enter O Backbone Backbone Backbone IP IP IP 1. lient requests content. 2. checks cache, if miss gets content from origin server. IP IP IP ite ites

ached DN enter Backbone Backbone Backbone IP IP IP enter O 1. lient requests content. 2. checks cache, if miss gets content from origin server. 3. caches content, delivers to client. IP IP IP ite ites

ached DN Backbone Backbone Backbone IP IP IP IP enter IP enter O IP ite ites 1. lient requests content. 2. checks cache, if miss gets content from origin server. 3. caches content, delivers to client. 4. Delivers content out of cache on subsequent requests.

Pushed DN enter enter O Backbone Backbone Backbone IP IP IP 1. Origin erver pushes content out to all s. IP IP IP ite ites

Pushed DN enter enter O Backbone Backbone Backbone IP IP IP 1. Origin erver pushes content out to all s. 2. Request served from s. IP IP IP ite ites

DN benefits ontent served closer to client Less latency, better performance Load spread over multiple distributed s More robust (to IP failure as well as other failures) Handle flashes better (load spread over IPs) But well-connected, replicated enters can do this too

DN costs and limitations ached DNs can t deal with dynamic/personalized content More and more content is dynamic lassic DNs limited to images Managing content distribution is non-trivial Tension between content lifetimes and cache performance Dynamic cache invalidation Keeping pushed content synchronized and current

DN example: Akamai Won huge market share of DN business late 90 s ached approach Now offers full web hosting services in addition to caching services alled edgesuite

Akamai caching services ARL: Akamai Resource Locator http://a620.g.akamai.net/7/620/16/259fdbf4ed29de/www.cnn.com/i/22.gif Host Part Akamai ontrol Part ontent URL /7/620/16/259fdbf4ed29de/ a620.g.akamai.net/ /www.cnn.com/i/22.gif Thanks to Ratul Mahajan, Univ of Wash, How Akamai Works

ARL: Akamai Resource Locator http://a620.g.akamai.net/7/620/16/259fdbf4ed29de/www.cnn.com/i/22.gif ontent Provider (P) selects which content will be hosted by Akamai. Akamai provides a tool that transforms this P URL into this ARL /7/620/16/259fdbf4ed29de/ a620.g.akamai.net/ /www.cnn.com/i/22.gif

ARL: Akamai Resource Locator http://a620.g.akamai.net/7/620/16/259fdbf4ed29de/www.cnn.com/i/22.gif This in turn causes the client to access Akamai s content server instead of the origin server. /7/620/16/259fdbf4ed29de/ a620.g.akamai.net/ /www.cnn.com/i/22.gif

ARL: Akamai Resource Locator http://a620.g.akamai.net/7/620/16/259fdbf4ed29de/www.cnn.com/i/22.gif If Akamai s content server doesn t have the content in its cache, it retrieves it using this URL. /7/620/16/259fdbf4ed29de/ a620.g.akamai.net/ /www.cnn.com/i/22.gif

ARL ontrol Part Type ode ustomer Number (different types (I.e. NN, Yahoo ) will have different contents)??? ontent hecksum (May be used for identifying changed content. May also validate content) /7/620/16/259fdbf4ed29de/ a620.g.akamai.net/ /www.cnn.com/i/22.gif http://a620.g.akamai.net/7/620/16/259fdbf4ed29de/www.cnn.com/i/22.gif

ARL Host Part But why such a complex domain name???? /7/620/16/259fdbf4ed29de/ a620.g.akamai.net/ /www.cnn.com/i/22.gif http://a620.g.akamai.net/7/620/16/259fdbf4ed29de/www.cnn.com/i/22.gif

ARL Host Part.net TLD Points to ~8 akamai.net DN servers (random ordering, TTL order of hours to days) akamai.net g.akamai.net a620.g.akamai.net Attempts to select ~8 g.akamai.net DN servers near client. (Using BGP? TTL order of 30 min 1 hour) Makes a very fine-grained load-balancing decision among local content servers. TTL order of 30 sec 1 min.

Other content routing mechanisms Dynamic HTML URL re-writing URLs in HTML pages re-written to point at nearby and non-overloaded content server In theory, finer-grained proximity decision Because know true client, not client s DN resolver In practice very hard to be fine-grained learway and Fasttide did this ould in theory put IP address in re-written URL, save a DN lookup But problem if user bookmarks page

Other content routing mechanisms HTTP 302 Redirect Directs client to another (closer, load balanced) server For instance, redirect image requests to distributed server, but handle dynamic home page from origin server ee draft-cain-known-request-routing-00.txt for good description of these issues But expired, so use Google to find archived copy

How well do DNs work? enter enter O Backbone Backbone Backbone IP IP IP IP IP IP ite ites

How well do DNs work? Backbone Backbone Backbone IP IP IP IP enter IP enter O IP ite ites Recall that the bottleneck links are at the edges. Even if s are pushed towards the edge, they are still behind the bottleneck link!

Elements of a web page Download DN round trip TP handshake (2 round trips) low-start ~8 round trips to fill DL pipe total 128K bytes ompare to 56 Kbytes for cnn.com home page Download finished before slow-start completes Total 11 round trips Example: U coast-to-coast propagation delay is ~15 ms 30 ms reduction in RTT means 330 ms total reduction ertainly noticeable o if we can reduce latency we can improve tcp (and thus http) performance

AT&T tudy Zhang, Krishnamurthy and Wills AT&T Labs Traces taken in ept. 2000 and Jan. 2001 ompared DNs with each other ompared DNs against non-dn sites

Methodology elected a bunch of DNs Akamai, peedera, Digital Island Note, most of these gone now! elected a number of non-dn sites for which good performance could be expected U.. and international origin U..: Amazon, Bloomberg, NN, EPN, MTV, NAA, Playboy, ony, Yahoo elected a set of images of comparable size for each DN and non-dn site ompare apples to apples Downloaded images from 24 NIMI machines Widely deployed measurement platforms

Response Time Results Including DN Lookup Time umulative Probability

Response Time Results Including DN Lookup Time About one second umulative Probability Author conclusion: DNs generally provide much shorter download time.

DNs out-performed non- DNs Why is this? Lets consider ability to pick good content servers using DN redirection Recall: short DN TTLs give DNs more control over which servers clients can use They compared time to download with a fixed IP address versus the IP address dynamically selected by the DN for each download

Effectiveness of DN load balancing

Effectiveness of DN load balancing Black: longer download time than fixed IP addr Blue: shorter download time, but total time longer because of DN lookup Green: same IP address chosen Red: shorter total time

DN load balancing not very effective

Other findings of study Each DN performed best for at least one (NIMI) client Why? Because of proximity? The best origin sites were better than the worst DNs DNs with more servers don t necessarily perform better Note that they don t know load on servers HTTP 1.1 improvements (parallel download, pipelined download) help a lot Even more so for origin (non-dn) cases Note not all origin sites implement pipelining

Ultimately a frustrating study Never actually says why DNs perform better, only that they do For all we know, maybe it is because DNs threw more money at the problem More server capacity and bandwidth relative to load

Another study Keynote ystems A Performance Analysis of 40 e-business Web ites Doing measurements since 1997 (All from one location, near as one can tell) Latest (publically available) measurement January 2001

Historical trend: lear improvement

Performance breakdown Basically says that smaller content leads to shorter download times (duh!) Average content size 12K bytes Average content size 44K bytes Average content size 99K bytes

Effect of DN: Positive (but again, we don t know why)

Most web sites not using DN (4-1) Note: non-dns can work well (DN not always better)

ummary DNs still used and still performing well On a par or better than best non-dn web sites DN usage not a huge difference We don t know why DNs perform well But could very well simply be server capacity Knowledge of client location valuable more for customized advertising than for latency Advertisements in right language