Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

Similar documents
Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

Distributed Systems 19. Content Delivery Networks (CDN) Paul Krzyzanowski

Web Caching and CDNs. Aditya Akella

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

DATA COMMUNICATOIN NETWORKING

How To Understand The Power Of A Content Delivery Network (Cdn)

Content Delivery Networks. Shaxun Chen April 21, 2009

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

Content Distribu-on Networks (CDNs)

DNS, CDNs Weds March Lecture 13. What is the relationship between a domain name (e.g., youtube.com) and an IP address?

Making the Internet fast, reliable and secure. DE-CIX Customer Summit Steven Schecter <schecter@akamai.com>

Dynamic Content Acceleration: Lightning-Fast Web Apps with Amazon CloudFront and Amazon Route 53

Data Center Content Delivery Network

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

Content Delivery Networks (CDN) Dr. Yingwu Zhu

THE MASTER LIST OF DNS TERMINOLOGY. v 2.0

CDN and Traffic-structure

Demand Routing in Network Layer for Load Balancing in Content Delivery Networks

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

From Internet Data Centers to Data Centers in the Cloud

Internet Content Distribution

THE MASTER LIST OF DNS TERMINOLOGY. First Edition

Content Delivery Networks

Azure Media Service Cloud Video Delivery KILROY HUGHES MICROSOFT AZURE MEDIA

How To Understand The Power Of Icdn

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

Learning Management Redefined. Acadox Infrastructure & Architecture

A Link Load Balancing Solution for Multi-Homed Networks

Overview. Tor Circuit Setup (1) Tor Anonymity Network

CSC2231: Akamai. Stefan Saroiu Department of Computer Science University of Toronto

ExamPDF. Higher Quality,Better service!

How To Model The Content Delivery Network (Cdn) From A Content Bridge To A Cloud (Cloud)

Transport and Network Layer

Internet Firewall CSIS Packet Filtering. Internet Firewall. Examples. Spring 2011 CSIS net15 1. Routers can implement packet filtering

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

Comparative Performance Report

The Effectiveness of Request Redirection on CDN Robustness

AKAMAI WHITE PAPER. Delivering Dynamic Web Content in Cloud Computing Applications: HTTP resource download performance modelling

Getting Started with AWS. Hosting a Static Website

Global Server Load Balancing

Single Pass Load Balancing with Session Persistence in IPv6 Network. C. J. (Charlie) Liu Network Operations Charter Communications

Load Balancing for Microsoft Office Communication Server 2007 Release 2

The old Internet. Software in the Network: Outline. Traditional Design. 1) Basic Caching. The Arrival of Software (in the network)

Web DNS Peer-to-peer systems (file sharing, CDNs, cycle sharing)

Load Balancing and Sessions. C. Kopparapu, Load Balancing Servers, Firewalls and Caches. Wiley, 2002.

Basheer Al-Duwairi Jordan University of Science & Technology

leaseweb cdn CDN Product Sheet - LeaseWeb - EN 1.06

Question: 3 When using Application Intelligence, Server Time may be defined as.

The Value of Content Distribution Networks Mike Axelrod, Google Google Public

Towards a Peer-to-Peer Extended Content Delivery Network

ICP. Cache Hierarchies. Squid. Squid Cache ICP Use. Squid. Squid

Denial of Service Attacks and Resilient Overlay Networks

Rapid IP redirection with SDN and NFV. Jeffrey Lai, Qiang Fu, Tim Moors December 9, 2015

Demystifying CDNs: Building Video Enabled Networks That Maintain Your Network Integrity And Meet User Demand

Analysing the impact of CDN based service delivery on traffic engineering

The Effect of Caches for Mobile Broadband Internet Access

Cisco Videoscape Distribution Suite Service Broker

ASPERA HIGH-SPEED TRANSFER SOFTWARE. Moving the world s data at maximum speed

Internet Infrastructure Measurement: Challenges and Tools

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Using IPM to Measure Network Performance

Characterizing and Mitigating Web Performance Bottlenecks in Broadband Access Networks

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

The content delivery network (CDN): Delivering the ultimate Web experience

high-quality steaming over the Internet

Protocol Data Units and Encapsulation

High volume Internet data centers. MPLS-based Request Routing. Current dispatcher technology. MPLS-based architecture

Datacenter Transformation

Content Delivery Networks

Dynamic Load Balancing and Node Migration in a Continuous Media Network

Study of Flexible Contents Delivery System. With Dynamic Server Deployment

Giving life to today s media distribution services

CS 457 Lecture 19 Global Internet - BGP. Fall 2011

Session 11 Under the hood of a commercial website

BGP and Traffic Engineering with Akamai. Christian Kaufmann Akamai Technologies MENOG 14

Essential Ingredients for Optimizing End User Experience Monitoring

Peer-to-Peer Networks. Chapter 6: P2P Content Distribution

COMP 631: COMPUTER NETWORKS. How to distribute content without requiring centralized, heavy-duty servers? Peer-to-peer content distribution

Measuring CDN Performance. Hooman Beheshti, VP Technology

How To Create A Toecdn (Open Edge Content Delivery Network) From Scratch On A Microsoft Ipad Or Ipad (For Free) On A Pc Or Ipa (For A Free) With A Free Ipad) On An Ip

CA106 Web Design. Dr. Dónal Fitzpatrick, School of Computing Room l2.48, Extension 8929,

John S. Otto Fabián E. Bustamante

HIVE STREAMING. Mikael Högqvist Senior Research Engineer 07/05/14 1

The Application Front End Understanding Next-Generation Load Balancing Appliances

GLOBAL SERVER LOAD BALANCING WITH SERVERIRON

Building Content Distribution Platforms over Flexible Optical Networks

Homework Assignment #3 Due 11/20 at 5:00pm EE122 Fall 2012

Multihoming and Multi-path Routing. CS 7260 Nick Feamster January

Transcription:

Distributed Systems 23. Content Delivery Networks (CDN) Paul Krzyzanowski Rutgers University Fall 2015 November 17, 2015 2014-2015 Paul Krzyzanowski 1

Motivation Serving web content from one location presents problems Scalability Reliability Performance Flash crowd problem What if everyone comes to your site at once? Cache content and serve requests from multiple servers at the network edge (close to the user) Reduce demand on site s infrastructure Provide faster service to users Content comes from nearby servers November 17, 2015 2014-2015 Paul Krzyzanowski 2

Focus on Content Computing is still done by the site host s server(s) Offload the static parts they often make up the bulk of the bytes: Images Video CSS files Static pages November 17, 2015 2014-2015 Paul Krzyzanowski 3

cache cache cache cache Serving & Consuming Content Web server Internet Every request goes to the server. Repeated requests from one client may be optimized by browser-based caching but that cached data is local to the browser November 17, 2015 2014-2015 Paul Krzyzanowski 4

Caching Proxy Caching Proxy Caching Proxies Web server Internet Caching proxy in an organization. Take advantage of what others before you have recently accessed. November 17, 2015 2014-2015 Paul Krzyzanowski 5

Load Balancer Load Balancing Web server Web server Web server Internet Increase capacity at the server. Internet connectivity can be a bottleneck + latency from client to server. November 17, 2015 2014-2015 Paul Krzyzanowski 6

Internet End-to-End Packet Delivery Network edges: applications & hosts Local ISP Web server Tier 2 ISP Tier 1 ISP Tier 1 ISP Tier 2 ISP Local ISP Network core: routers November 17, 2015 2014-2015 Paul Krzyzanowski 7

Multihoming Get network links from multiple ISPs Server has one IP address but multiple links Announce address to upstream routers via BGP: Provides clients with a choice of routes and fault tolerance for a server s ISP going down Local ISP Web server Tier 2 ISP Tier 1 ISP Tier 1 ISP Tier 2 ISP Local ISP November 17, 2015 2014-2015 Paul Krzyzanowski 8

copy Mirroring (Replication) Synchronize multiple servers Use multiple ISPs: location-based load balancing, ISP & server fault tolerance Local ISP Web server Tier 2 ISP Tier 1 ISP Tier 1 ISP Web server Tier 2 ISP Local ISP November 17, 2015 2014-2015 Paul Krzyzanowski 9

Improving scalability, availability, & performance Scalability Load balance among multiple servers Multiple ISPs if network congestion is a concern Availability Replicate servers Multiple data centers & ISPs Performance Mirror (replicate) servers for load balancing Cache content and serve requests from multiple servers at the network edge (close to the user) Reduce demand on site s infrastructure Provide faster service to users Content comes from nearby servers November 17, 2015 2014-2015 Paul Krzyzanowski 10

But these approaches have problems! Local balancing Data center or ISP can fail Multihoming IP protocols (BGP) are often not quick to find new routes Mirroring at multiple sites Synchronization can be difficult Proxy servers Typically a client-side solution Low cache hit rates All require extra capacity and extra capital costs November 17, 2015 2014-2015 Paul Krzyzanowski 11

Akamai Distributed Caching Company evolved from MIT research "Invent a better way to deliver Internet content" Tackle the "flash crowd" problem Akamai runs on 200,000 servers in 1,400 networks across 110 countries Delivers 15-30% of all web traffic Reaching over 30 Terabits per second http://www.akamai.com/html/about/facts_figures.html 12

Akamai s goal Try to serve clients from servers likely to have the content Nearest: lowest round-trip time Available: server that is not too loaded Likely: server that is likely to have the data November 17, 2015 2014-2015 Paul Krzyzanowski 13

Overlay Network The Internet is a collection of many autonomous networks Connectivity is based on business decisions peering agreements, not performance An ISP s top performance incentives are: Last-mile connectivity to end users Connectivity to servers on the ISP The Overlay network Collection of caching servers at many, many ISPs All know about each other November 17, 2015 2014-2015 Paul Krzyzanowski 14

Overlay Network 1. Domain name lookup Translated by mapping system to an edge server that can serve the content Use custom DNS servers Take requestor s address into into account to find the nearest edge 2. sends request to the given edge server Edge server may be able to serve content from its cache May need to contact the origin server via the transport system Origin server Transport system Edge servers Clients mapping system November 17, 2015 2014-2015 Paul Krzyzanowski 15

Mapping: Domain Name Lookup Akamai uses Dynamic DNS servers Resolve a host name based on: user location (minimize network distance) server health server load network status load balancing Try to find an edge server at the customer s ISP November 17, 2015 2014-2015 Paul Krzyzanowski 16

Akamai collects network performance data Map network topology Based on BGP and traceroute information Estimate hops and transit time Content servers report their load to a monitoring application Monitoring app publishes load reports to a local (Akamai) DNS server Akamai DNS server determines which IP addresses to return when resolving names Load shedding: If servers get too loaded, the DNS server will not respond with those addresses November 17, 2015 2014-2015 Paul Krzyzanowski 17

Benefits of an overlay network CDN 1. Caching 2. Routing 3. Security November 17, 2015 2014-2015 Paul Krzyzanowski 18

Caching Goal: Increase hit rate on edge servers Reduce hits on origin servers Static content can be served from caches Dynamic content still goes back to the origin Origin server Parent servers Two-level caching If edge servers don t have the data, check with parent servers Edge servers Clients November 17, 2015 2014-2015 Paul Krzyzanowski 19

Types of content Static content Cached depending on original site's requirements (never to forever) Dynamic content Caching proxies cannot do this Akamai uses Edge Side Includes technology (www.esi.org) Assembles dynamic content on edge servers Similar to server-side includes Page is broken into fragments with independent caching properties Assembled on demand Streaming media Live stream is sent to an entry-point server in the Akamai network Stream is delivered from the entry-point server to multiple edge servers Edge servers serve content to end users. November 17, 2015 2014-2015 Paul Krzyzanowski 20

Alt. Path 2 Direct path Alt. Path 3 Routing Route to parent servers or origin via the overlay network Origin server Routing decision factors: measured latency packet loss available bandwidth Results in ranked list of alternate paths from edge to origin Each intermediate node acts as a forwarder Keep TCP connections active for efficiency Edge servers November 17, 2015 2014-2015 Paul Krzyzanowski 21

Security High capacity Overwhelm DDoS attacks Expertise Maintain systems and software Extra security software Hardened network stack Detect & defend attacks Shield the origin Attacks hit the CDN, not the origin November 17, 2015 2014-2015 Paul Krzyzanowski 22

Other Things CDNs Do November 17, 2015 2014-2015 Paul Krzyzanowski 23

Signed URLs in Amazon CloudFront Example: Amazon CloudFront CDN Similar in concept to Akamai Requests for content are routed to the nearest edge location Cached content with original located at origin servers Integrates with back-end Amazon services Private content: provide special URLs for restricted content Control access to content via a signed URL URL contains: policy or a reference to a policy Signature = encrypted hash URL cannot be modified Policies include: Validity: start time & expiration time Range of IP addresses that are allowed to access the object November 17, 2015 2014-2015 Paul Krzyzanowski 24

Limelight Orchestrate Focus on video distribution and content management Video transcoding Encode video to a variety of formats Support playback on various devices: different formats & bitrates Ad insertion Integrate with ad servers (DoubleClick, LiveRail, Tremor, YuMe) Pre-roll, post-roll, mid-roll, overlay, etc. November 17, 2015 2014-2015 Paul Krzyzanowski 25

LimeLight Orchestrate Transcoding Universal URL Publish Web site Transcode Content server Content server Content server f(player, device, encoding parameters) November 17, 2015 2014-2015 Paul Krzyzanowski 26

Server-side Video Ad Insertion Example: Limelight Reach Ads Ad Server (3 rd party) Ad selection Ad call Web site Ad insertion Request context CDN Media with ad Request content November 17, 2015 2014-2015 Paul Krzyzanowski 27

The End November 17, 2015 2014-2015 Paul Krzyzanowski 29