Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014



Similar documents
Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

Distributed Systems 19. Content Delivery Networks (CDN) Paul Krzyzanowski

Web Caching and CDNs. Aditya Akella

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

DATA COMMUNICATOIN NETWORKING

How To Understand The Power Of A Content Delivery Network (Cdn)

Content Delivery Networks. Shaxun Chen April 21, 2009

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

Content Distribu-on Networks (CDNs)

Making the Internet fast, reliable and secure. DE-CIX Customer Summit Steven Schecter <schecter@akamai.com>

CDN and Traffic-structure

Dynamic Content Acceleration: Lightning-Fast Web Apps with Amazon CloudFront and Amazon Route 53

Data Center Content Delivery Network

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

DNS, CDNs Weds March Lecture 13. What is the relationship between a domain name (e.g., youtube.com) and an IP address?

Content Delivery Networks (CDN) Dr. Yingwu Zhu

From Internet Data Centers to Data Centers in the Cloud

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Content Delivery Networks

Azure Media Service Cloud Video Delivery KILROY HUGHES MICROSOFT AZURE MEDIA

Demand Routing in Network Layer for Load Balancing in Content Delivery Networks

CSC2231: Akamai. Stefan Saroiu Department of Computer Science University of Toronto

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

Internet Content Distribution

How To Model The Content Delivery Network (Cdn) From A Content Bridge To A Cloud (Cloud)

THE MASTER LIST OF DNS TERMINOLOGY. v 2.0

Comparative Performance Report

Overview. Tor Circuit Setup (1) Tor Anonymity Network

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

A Link Load Balancing Solution for Multi-Homed Networks

THE MASTER LIST OF DNS TERMINOLOGY. First Edition

AKAMAI WHITE PAPER. Delivering Dynamic Web Content in Cloud Computing Applications: HTTP resource download performance modelling

ExamPDF. Higher Quality,Better service!

Transport and Network Layer

Learning Management Redefined. Acadox Infrastructure & Architecture

Internet Firewall CSIS Packet Filtering. Internet Firewall. Examples. Spring 2011 CSIS net15 1. Routers can implement packet filtering

How To Understand The Power Of Icdn

high-quality steaming over the Internet

Getting Started with AWS. Hosting a Static Website

Chapter 4: Networking and the Internet

leaseweb cdn CDN Product Sheet - LeaseWeb - EN 1.06

The Effectiveness of Request Redirection on CDN Robustness

Single Pass Load Balancing with Session Persistence in IPv6 Network. C. J. (Charlie) Liu Network Operations Charter Communications

Global Server Load Balancing

ICP. Cache Hierarchies. Squid. Squid Cache ICP Use. Squid. Squid

Measuring CDN Performance. Hooman Beheshti, VP Technology

Using IPM to Measure Network Performance

Basheer Al-Duwairi Jordan University of Science & Technology

The Value of Content Distribution Networks Mike Axelrod, Google Google Public

Amazon Web Services Yu Xiao

Web DNS Peer-to-peer systems (file sharing, CDNs, cycle sharing)

Load Balancing and Sessions. C. Kopparapu, Load Balancing Servers, Firewalls and Caches. Wiley, 2002.

John S. Otto Fabián E. Bustamante

Question: 3 When using Application Intelligence, Server Time may be defined as.

Towards a Peer-to-Peer Extended Content Delivery Network

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

The content delivery network (CDN): Delivering the ultimate Web experience

Giving life to today s media distribution services

Session 11 Under the hood of a commercial website

Testing & Assuring Mobile End User Experience Before Production. Neotys

Rapid IP redirection with SDN and NFV. Jeffrey Lai, Qiang Fu, Tim Moors December 9, 2015

Load Balancing for Microsoft Office Communication Server 2007 Release 2

Analysing the impact of CDN based service delivery on traffic engineering

Demystifying CDNs: Building Video Enabled Networks That Maintain Your Network Integrity And Meet User Demand

Peer-to-Peer Networks. Chapter 6: P2P Content Distribution

CA106 Web Design. Dr. Dónal Fitzpatrick, School of Computing Room l2.48, Extension 8929,

The old Internet. Software in the Network: Outline. Traditional Design. 1) Basic Caching. The Arrival of Software (in the network)

Cisco Videoscape Distribution Suite Service Broker

Network: several computers who can communicate. bus. Main example: Ethernet (1980 today: coaxial cable, twisted pair, 10Mb 1000Gb).

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

BREAKING HTTPS WITH BGP HIJACKING. Artyom Gavrichenkov R&D Team Lead, Qrator Labs

architecture: what the pieces are and how they fit together names and addresses: what's your name and number?

Homework Assignment #3 Due 11/20 at 5:00pm EE122 Fall 2012

Transforming cloud infrastructure to support Big Data Ying Xu Aspera, Inc

VIDEO HOSTING AND STREAMING SUMMARY REPORT (STSC 18/4)

Dynamic Load Balancing and Node Migration in a Continuous Media Network

Content Delivery Networks

How To Load Balance On A Bgg On A Network With A Network (Networking) On A Pc Or Ipa On A Computer Or Ipad On A 2G Network On A Microsoft Ipa (Netnet) On An Ip

Building a Mobile App Security Risk Management Program. Copyright 2012, Security Risk Advisors, Inc. All Rights Reserved

CS 457 Lecture 19 Global Internet - BGP. Fall 2011

Accelerating Wordpress for Pagerank and Profit

Essential Ingredients for Optimizing End User Experience Monitoring

Denial of Service Attacks and Resilient Overlay Networks

Characterizing and Mitigating Web Performance Bottlenecks in Broadband Access Networks

LARGE SCALE INTERNET SERVICES

COMP 631: COMPUTER NETWORKS. How to distribute content without requiring centralized, heavy-duty servers? Peer-to-peer content distribution

ASPERA HIGH-SPEED TRANSFER SOFTWARE. Moving the world s data at maximum speed

BGP and Traffic Engineering with Akamai. Christian Kaufmann Akamai Technologies MENOG 14

Transcription:

Distributed Systems 25. Content Delivery Networks (CDN) Paul Krzyzanowski Rutgers University Fall 2014 November 16, 2014 2014 Paul Krzyzanowski 1

Motivation Serving web content from one location presents problems Scalability Reliability Performance Flash crowd problem What if everyone comes to your site at once? Cache content and serve requests from multiple servers at the network edge (close to the user) Reduce demand on site s infrastructure Provide faster service to users Content comes from nearby servers November 16, 2014 2014 Paul Krzyzanowski 2

Focus on Content Computing is still done by the site host s server(s) Offload the static parts they often make up the bulk of the bytes: Images Video CSS files Static pages November 16, 2014 2014 Paul Krzyzanowski 3

Serving & Consuming Content Web server cache Internet cache cache cache Every request goes to the server. Repeated requests from one client may be optimized by browser-based caching but that cached data is local to the browser November 16, 2014 2014 Paul Krzyzanowski 4

Caching Proxies Web server Internet Caching Proxy Caching Proxy Caching proxy in an organization. Take advantage of what others before you have recently accessed. November 16, 2014 2014 Paul Krzyzanowski 5

Load Balancing Web server Web server Web server Load Balancer Internet Increase capacity at the server. Internet connectivity can be a bottleneck + latency from client to server. November 16, 2014 2014 Paul Krzyzanowski 6

Internet End-to-End Packet Delivery Network edges: applications & hosts Local ISP Web server Tier 2 ISP Tier 1 ISP Tier 1 ISP Tier 2 ISP Local ISP Network core: routers November 16, 2014 2014 Paul Krzyzanowski 7

Multihoming Get network links from multiple ISPs Server has one IP address but multiple links Announce address to upstream routers via BGP: Provides clients with a choice of routes and fault tolerance for a server s ISP going down Local ISP Web server Tier 2 ISP Tier 1 ISP Tier 1 ISP Tier 2 ISP Local ISP November 16, 2014 2014 Paul Krzyzanowski 8

Mirroring (Replication) Synchronize multiple servers Use multiple ISPs: location-based load balancing, ISP & server fault tolerance Local ISP Web server Tier 2 ISP copy Tier 1 ISP Tier 1 ISP Web server Tier 2 ISP Local ISP November 16, 2014 2014 Paul Krzyzanowski 9

Improving scalability, availability, & performance Scalability Load balance among multiple servers Multiple ISPs if network congestion is a concern Availability Replicate servers Multiple data centers & ISPs Performance Mirror (replicate) servers for load balancing Cache content and serve requests from multiple servers at the network edge (close to the user) Reduce demand on site s infrastructure Provide faster service to users Content comes from nearby servers November 16, 2014 2014 Paul Krzyzanowski 10

But these approaches have problems! Local balancing Data center or ISP can fail Multihoming IP protocols (BGP) are often not quick to find new routes Mirroring at multiple sites Synchronization can be difficult Proxy servers Typically a client-side solution Low cache hit rates All require extra capacity and extra capital costs November 16, 2014 2014 Paul Krzyzanowski 11

Akamai Distributed Caching Company evolved from MIT research "Invent a better way to deliver Internet content" Tackle the "flash crowd" problem Akamai runs on 150,000 servers in 1,200 networks across 92 countries http://www.akamai.com/html/about/facts_figures.html November 16, 2014 2014 Paul Krzyzanowski 12

Akamai s goal Try to serve clients from servers likely to have the content Nearest: lowest round-trip time Available: server that is not too loaded Likely: server that is likely to have the data November 16, 2014 2014 Paul Krzyzanowski 13

Overlay Network The Internet is a collection of many autonomous networks Connectivity is based on business decisions peering agreements, not performance ISP s performance incentives are: Last-mile connectivity to end users Connectivity to servers on the ISP Overlay network Collection of caching servers at many, many ISPs All know about each other November 16, 2014 2014 Paul Krzyzanowski 14

Overlay Network 1. Domain name lookup Translated by mapping system to an edge server that can serve the content Use custom DNS servers Take requestor s address into into account to find the nearest edge 2. sends request to the given edge server Edge server may be able to serve content from its cache May need to contact the origin server via the transport system Origin server Transport system Edge servers Clients mapping system November 16, 2014 2014 Paul Krzyzanowski 15

Mapping: Domain Name Lookup Akamai uses Dynamic DNS servers Resolve a host name based on: user location (minimize network distance) server health server load network status load balancing Try to find an edge server at the customer s ISP November 16, 2014 2014 Paul Krzyzanowski 16

Akamai collects network performance data Map network topology Based on BGP and traceroute information Estimate hops and transit time Content servers report their load to a monitoring application Monitoring app publishes load reports to a local (Akamai) DNS server Akamai DNS server determines which IP addresses to return when resolving names Load shedding: If servers get too loaded, the DNS server will not respond with those addresses November 16, 2014 2014 Paul Krzyzanowski 17

Benefits of an overlay network CDN 1. Caching 2. Routing 3. Security November 16, 2014 2014 Paul Krzyzanowski 18

Caching Goal: Increase hit rate on edge servers Reduce hits on origin servers Static content can be served from caches Dynamic content still goes back to the origin Origin server Parent servers Two-level caching If edge servers don t have the data, check with parent servers Edge servers Clients November 16, 2014 2014 Paul Krzyzanowski 19

Types of content Static content Cached depending on original site's requirements (never to forever) Dynamic content Caching proxies cannot do this Akamai uses Edge Side Includes technology (www.esi.org) Assembles dynamic content on edge servers Similar to server-side includes Page is broken into fragments with independent caching properties Assembled on demand Streaming media Live stream is sent to an entry-point server in the Akamai network Stream is delivered from the entry-point server to multiple edge servers Edge servers serve content to end users. November 16, 2014 2014 Paul Krzyzanowski 20

Routing Route to parent servers or origin via the overlay network Origin server Routing decision factors: measured latency packet loss available bandwidth Results in ranked list of alternate paths from edge to origin Alt. Path 2 Direct path Alt. Path 3 Each intermediate node acts as a forwarder Keep TCP connections active for efficiency Edge servers November 16, 2014 2014 Paul Krzyzanowski 21

Security High capacity Overwhelm DDoS attacks Expertise Maintain systems and software Extra security software Hardened network stack Detect & defend attacks Shield the origin Attacks hit the CDN, not the origin November 16, 2014 2014 Paul Krzyzanowski 22

Other Things CDNs Do November 16, 2014 2014 Paul Krzyzanowski 23

Signed URLs in Amazon CloudFront Example: Amazon CloudFront CDN Similar in concept to Akamai Requests for content are routed to the nearest edge location Cached content with original located at origin servers Integrates with back-end Amazon services Private content: provide special URLs for restricted content Control access to content via a signed URL URL contains: policy or a reference to a policy Signature = encrypted hash URL cannot be modified Policies include: Validity: start time & expiration time Range of IP addresses that are allowed to access the object November 16, 2014 2014 Paul Krzyzanowski 24

Limelight Orchestrate Focus on video distribution and content management Video transcoding Encode video to a variety of formats Support playback on various devices: different formats & bitrates Ad insertion Integrate with ad servers (DoubleClick, LiveRail, Tremor, YuMe) Pre-roll, post-roll, mid-roll, overlay, etc. November 16, 2014 2014 Paul Krzyzanowski 25

LimeLight Orchestrate Transcoding Universal URL Publish Web site Transcode Content server Content server Content server f(player, device, encoding parameters) November 16, 2014 2014 Paul Krzyzanowski 26

Server-side Video Ad Insertion Example: Limelight Reach Ads Ad Server (3 rd party) Ad selection Ad call Web site Ad insertion Request context CDN Media with ad Request content November 16, 2014 2014 Paul Krzyzanowski 27

(off topic) Client-side Video Ad Insertion Example: Google Interactive Media Ads (IMA) SDK Flash, HTML5, ios, Android Access VAST ads VAST: Video Ad Serving Template XML schema for serving ads to video players IAB (Interactive Advertising Bureau) standard Supported by DoubleClick Google AdSense many others Ad Server Request ad CDN (web content, including markup for player) Request page CDN (media content) Request content Player November 16, 2014 2014 Paul Krzyzanowski 28

The End November 16, 2014 2014 Paul Krzyzanowski 29