How To Understand The Power Of A Content Delivery Network (Cdn)



Similar documents
Overview. Tor Circuit Setup (1) Tor Anonymity Network

Communications Software. CSE 123b. CSE 123b. Spring Lecture 13: Load Balancing/Content Distribution. Networks (plus some other applications)

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

DATA COMMUNICATOIN NETWORKING

Web Caching and CDNs. Aditya Akella

CSC2231: Akamai. Stefan Saroiu Department of Computer Science University of Toronto

Content Distribu-on Networks (CDNs)

ICP. Cache Hierarchies. Squid. Squid Cache ICP Use. Squid. Squid

Distributed Systems 19. Content Delivery Networks (CDN) Paul Krzyzanowski

Content Delivery Networks (CDN) Dr. Yingwu Zhu

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

Internet Content Distribution

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

Akamai CDN, IPv6 and DNS security. Christian Kaufmann Akamai Technologies DENOG 5 14 th November 2013

Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

The Application Front End Understanding Next-Generation Load Balancing Appliances

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

Web Application Hosting Cloud Architecture

Reliable Distributed Systems

The Effectiveness of Request Redirection on CDN Robustness

Global Server Load Balancing

Akamai CDN, IPv6 and DNS security. Christian Kaufmann Akamai Technologies APNIC th August 2013

Content Delivery Networks. Shaxun Chen April 21, 2009

The Value of Content Distribution Networks Mike Axelrod, Google Google Public

Content Delivery Networks

Making the Internet fast, reliable and secure. DE-CIX Customer Summit Steven Schecter <schecter@akamai.com>

The Application Delivery Controller Understanding Next-Generation Load Balancing Appliances

High volume Internet data centers. MPLS-based Request Routing. Current dispatcher technology. MPLS-based architecture

THE MASTER LIST OF DNS TERMINOLOGY. v 2.0

FortiBalancer: Global Server Load Balancing WHITE PAPER

GLOBAL SERVER LOAD BALANCING WITH SERVERIRON

BGP and Traffic Engineering with Akamai. Caglar Dabanoglu Akamai Technologies AfPIF 2015, Maputo, August 25th

THE MASTER LIST OF DNS TERMINOLOGY. First Edition

Content Delivery Network. Version 0.95

From Internet Data Centers to Data Centers in the Cloud

COMP 631: COMPUTER NETWORKS. How to distribute content without requiring centralized, heavy-duty servers? Peer-to-peer content distribution

The Value of a Content Delivery Network

Internet Content Distribution

Teridion. Rethinking Network Performance. The Internet. Lightning Fast. Technical White Paper July,

Traffic delivery evolution in the Internet ENOG 4 Moscow 23 rd October 2012

CS514: Intermediate Course in Computer Systems

Building a Resilient World Wide Web

Rapid IP redirection with SDN and NFV. Jeffrey Lai, Qiang Fu, Tim Moors December 9, 2015

Open Issues in Content Distribution

Handling Flash Crowds From Your Garage

Getting Started with AWS. Hosting a Static Website

Data Center Content Delivery Network

Meeting Worldwide Demand for your Content

Testing & Assuring Mobile End User Experience Before Production. Neotys

Global Server Load Balancing

Creating "Origin Pull" on Akamai (1)

The old Internet. Software in the Network: Outline. Traditional Design. 1) Basic Caching. The Arrival of Software (in the network)

The Web History (I) The Web History (II)

Enabling Media Rich Curriculum with Content Delivery Networking

CDN Brokering. Content Distribution Internetworking

Measuring CDN Performance. Hooman Beheshti, VP Technology

CDN and Traffic-structure

The Effect of Caches for Mobile Broadband Internet Access

EdgeCast Networks Inc. Flash Media Streaming Administration Guide

Demand Routing in Network Layer for Load Balancing in Content Delivery Networks

The Critical Role of an Application Delivery Controller

Video Streaming Without Interruption

AKAMAI WHITE PAPER. Turbo-Charging Dynamic Web Sites with Akamai EdgeSuite SM

Real-Time Analysis of CDN in an Academic Institute: A Simulation Study

How To Model The Content Delivery Network (Cdn) From A Content Bridge To A Cloud (Cloud)

DNS, CDNs Weds March Lecture 13. What is the relationship between a domain name (e.g., youtube.com) and an IP address?

DNS ROUND ROBIN HIGH-AVAILABILITY LOAD SHARING

Citrix NetScaler Global Server Load Balancing Primer:

How To Create A Toecdn (Open Edge Content Delivery Network) From Scratch On A Microsoft Ipad Or Ipad (For Free) On A Pc Or Ipa (For A Free) With A Free Ipad) On An Ip

F5 and Infoblox DNS Integrated Architecture Offering a Complete Scalable, Secure DNS Solution

CSCI-1680 CDN & P2P Chen Avin

AKAMAI WHITE PAPER. Delivering Dynamic Web Content in Cloud Computing Applications: HTTP resource download performance modelling

Results-Oriented Application Acceleration with FastView Because Every Second Counts Whitepaper

Tema 5: Distribución de contenidos

Getting Started with AWS. Static Website Hosting

Experimentation with the YouTube Content Delivery Network (CDN)

Towards a Peer-to-Peer Extended Content Delivery Network

SiteCelerate white paper

DNS records. RR format: (name, value, type, TTL) Type=NS

Basheer Al-Duwairi Jordan University of Science & Technology

Week 3 / Paper 2. Bernhard Ager, Wolfgang Mühlbauer, Georgios Smaragdakis, Steve Uhlig ACM IMC 2010.

7 Easy Steps to Implementing Application Load Balancing For 100% Availability and Accelerated Application Performance

Front-End Performance Testing and Optimization

5 Easy Steps to Implementing Application Load Balancing for Non-Stop Availability and Higher Performance

CDN Brokering. Alexandros Biliris, Chuck Cranor, Fred Douglis, Michael Rabinovich, Sandeep Sibal, Oliver Spatscheck, and Walter Sturm

CDN Brokering. Alexandros Biliris, Chuck Cranor, Fred Douglis, Michael Rabinovich, Sandeep Sibal, Oliver Spatscheck, and Walter Sturm

A DNS Reflection Method for Global Traffic Management

CHAPTER 4 PERFORMANCE ANALYSIS OF CDN IN ACADEMICS

Global Load Balancing with Brocade Virtual Traffic Manager

Ensuring Business Continuity and Disaster Recovery with Coyote Point Systems Envoy

Dynamic Content Acceleration: Lightning-Fast Web Apps with Amazon CloudFront and Amazon Route 53

Web DNS Peer-to-peer systems (file sharing, CDNs, cycle sharing)

A Walk through Content Delivery Networks

Akamai s EdgePlatform for Application Acceleration

Transcription:

Overview 5-44 5-44 Computer Networking 5-64 Lecture 8: Delivering Content Content Delivery Networks Peter Steenkiste Fall 04 www.cs.cmu.edu/~prs/5-44-f4 Web Consistent hashing Peer-to-peer CDN Motivation Edge servers Content delivery Mapping Impact on Internet Some slides based on presentation by Patrick Gilmore Content Distribution Networks (CDNs) The content providers are the CDN customers. Content replication CDN company installs hundreds of CDN servers throughout Internet Close to users CDN replicates its customers content in CDN servers. When provider updates content, CDN updates servers origin server in North America CDN distribution node CDN server in S. America CDN server in Europe CDN server in Asia http://www.akamai.com/html/technology/nui/news/index.html 4

Case Study on Reliability and Scalability: The 000 Election What is the CDN? Customer Visits (Millions) 0 7 5 0 7 5 0 Crash Zone Without Akamai this site could not have served customers above their crash zone Time Edge Caches: work with ISP and networks everywhere to install edge caches Edge = close to customers Content delivery: getting content to the edge caches Content can be objects, video, or entire web sites Mapping: find the closest edge server for each user and deliver content from that server Network proximity not the same as geographic proximity Focus is on performance as observed by user (quality) 5-44 S'0 6 Potential Benefits Edge Caches Very good scalability Near infinite if deployed properly Good economies at large scales Infrastructure is shared efficiently by customers Statistical multiplexing: hot sites use more resources Avoids congestion and long latencies Through mapping to closest server Can be extremely reliable Very high degree of redundancy Can mitigate some DoS attacks Region set of caches managed as a cluster May have a specific function: http, streaming, Availability is a major concern in architecture design Redundancy at the network level See next slide Dealing with server failures Servers do fail occasionally Each server has a buddy which is constantly trading hellos If hellos stop, buddy starts to respond directly to requests for primary server Users in the middle of a download may have to hit reload No one else will notice any interruption 7 8

Example Configuration Content Delivery: Possible Bottlenecks Last Mile Problem First Mile Problem End User Peering Problem Internet Backbone Problem Host Server Process Flow Process Flow. User wants to download distributed web content. User is directed through Akamai s dynamic mapping to the closest edge cache

Process Flow Process Flow a a. Edge cache searches local hard drive for content b. If requested object is not on local hard drive, edge cache checks other edge caches in same region for object Process Flow Process Flow b b a a a a c b. If requested object is not cached or not fresh, edge cache sends an HTTP GET the origin server c. Origin server delivers object to edge cache over optimized connection 4

Process Flow Core Hierarchy Regions b a 4 a c 4. Edge server delivers content to end user. User requests content and is mapped to optimal edge Akamai server Core Hierarchy Regions Core Hierarchy Regions. If content is not present in the region, it is requested from most optimal core region. Core region makes one request back to origin server 5

Core Hierarchy Regions Core Hierarchy Features Reduces traffic back to origin server Reduces infrastructure needs of customer Provides best protection against flash crowds Especially important for large files (e.g. Operating System updates or video files) Improved end-user response time Core regions are well connected Optimized connection speeds object delivery 4. Core region can serve many edge regions with one request to origin server Mapping: Server Selection Application Based Which server? Lowest load to balance load on servers Best performance to improve client performance Based on Geography? RTT? Throughput? Load? Any alive node to provide fault tolerance How to direct clients to a particular server? As part of routing anycast, cluster load balancing Not covered As part of application HTTP redirect As part of naming DNS HTTP supports simple way to indicate that Web page has moved (0X responses) Server receives Get request from client Decides which server is best suited for particular client and object Returns HTTP redirect to that server Can make informed application specific decision May introduce additional overhead multiple connection setup, name lookups, etc. While good solution in general, but HTTP Redirect has some design flaws 4 6

Naming Based Mapping Algorithms Client does name lookup for service Name server chooses appropriate server address A-record returned is best one for the client What information can name server base decision on? Server load/location must be collected Information in the name lookup request Name service client typically the local name server for client Three main components to finding closest edge cache to end user from a Network point of view: Packet Loss + Throughput + Latency Listed in order of importance (roughly) Mapping also takes into account edge cache performance Does a server have an object on its hard drive? Uses consistent hashing algorithm Does the edge cache have CPU, RAM, bandwidth, etc. available to serve end-user? 5 Access to Web Site without CDN FreeFlow Akamai s Object Delivery Service HTTP request user enters standard URL Client s Servers Customer Web End User Server User enters Customer s Web User s browser Objects served standard URL Server returns requests embedded 4 with round trips www.customer.com HTML with objects from customer across the Internet embedded URLs Web server <img src=/images/logo.gif> HTTP request for embedded content Akamai Server Content Served Locally HTML code contains Akamai URLs (ARL) Example ARL: img src= a000.g.akamai.net/ /www.customer.com/images/logo.gif 7

Steps in Content Retrieval Resolution of Modified Name Clients fetch html document from primary server E.g. fetch index.html from cnn.com URLs for replicated content are replaced in html E.g. <img src= http://cnn.com/af/x.gif > replaced with <img src= http://a7.g.akamaitech.net/7//cnn.com/af/x.gif > Note that modified name includes original file name Drawback: must modify the content Client is forced to resolve a.g.akamaitech.net hostname Focus is on static content Can be freely cached Root server gives NS record for akamai.net Akamai.net name server returns NS record for g.akamaitech.net Name server chosen to be in region of client s name server TTL is large G.akamaitech.net nameserver chooses server in region Should try to chose server that has file in cache - How to choose? Uses a name and hash TTL is small why? 9 0 How Akamai Works Akamai Subsequent Requests cnn.com (content provider) DNS root server Akamai server cnn.com (content provider) DNS root server Akamai server Get index. html 4 End-user Get foo.jpg 5 6 7 8 9 0 Get /cnn.com/foo.jpg Akamai high-level DNS server Akamai low-level DNS server Nearby matching Akamai server Get index. html Akamai high-level DNS server 7 Akamai low-level DNS End-user 8 9 Get 0 /cnn.com/foo.jpg server Nearby matching Akamai server 8

EdgeSuite Akamai s Site Delivery Service Customer CNAME s (aliases) www.customer.com Anyone looking up www.customer.com will be redirected to an Akamai hostname - customer.d4p.net No, I do not know why we use d4p.net. customer.d4p.net is CNAME d to axxx.g.akamai.net Standard Akamai mapping magic sends returns the closest edge server for axxx.g.akamai.net EdgeSuite Akamai s Site Delivery Service End user never communicates with origin server Content retrieved from private hostname, e.g., origin.customer.com? High reliability Thousands of servers backing each other up If one geographic area is disabled, no other area is affected Mitigates some DoS attacks Uncacheable content is tunneled back to origin Can reduce need for tunneling with server-side scripts running on edge caches now very common Persistent TCP connections increase performance Helps with downloading of objects to end caches Overview Web Consistent hashing Peer-to-peer CDN Motivation Edge servers Content delivery Mapping Impact on Internet Change in Traffic with CDN Edge Caches Transit ($$$) ISP Z Transit ($$) Transit ($$ /) Peering Transit ($$$) ISP P ISP X Transit ($$) Transit ($$$) Transit ($$$) Peering Transit ($) Transit ($$) ISP Y ISP Q 5 6 9

Impact of CDN Growth Some Recent Trends Flattening of the Internet More content is served from the edge of the network Also see an increase in peering More traffic remains at edge reduced load on core Changes in the economic relationships Caches benefit users: better performance, reliability Happy customers benefit the CDN ISP benefits since more content is served locally Reduces traffic from provider direct economic benefit CDNs sometimes place caches in eye ball ISPs for free, but economic models change all the time If CDN s can deploy caches, why can t I? Content providers have started to deploy CDNs Reduce cost, assuming you are large enough Optimize caching to their specific requiremens Can still use CDNs, e.g., in certain regions,. Internet Service Providers also try deploy CDNs Sometimes difficult to build the business relationships with content owners too many ISPs! How about the know-how? Hybrid solutions are emerging, e.g., ISPs install hardware and license software from CDNs 7 9 Summary Caching improves web performance Caching only at client is only partial solution Not enough locality Content Delivery Networks move data closer to user, balance load, increase availability, Is having impact on structure of the Internet No longer just a solution for static content 40 0