DATA COMMUNICATOIN NETWORKING

Similar documents
Content Distribu-on Networks (CDNs)

Web Caching and CDNs. Aditya Akella

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

How To Understand The Power Of A Content Delivery Network (Cdn)

Distributed Systems 19. Content Delivery Networks (CDN) Paul Krzyzanowski

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

CSC2231: Akamai. Stefan Saroiu Department of Computer Science University of Toronto

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

Communications Software. CSE 123b. CSE 123b. Spring Lecture 13: Load Balancing/Content Distribution. Networks (plus some other applications)

ICP. Cache Hierarchies. Squid. Squid Cache ICP Use. Squid. Squid

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Overview. Tor Circuit Setup (1) Tor Anonymity Network

GLOBAL SERVER LOAD BALANCING WITH SERVERIRON

The Effect of Caches for Mobile Broadband Internet Access

Global Server Load Balancing

How To Understand The Power Of The Internet

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

Reliable Distributed Systems

SiteCelerate white paper

Advanced Networking Technologies

Content Delivery Networks

The Critical Role of an Application Delivery Controller

EECS 489 Winter 2010 Midterm Exam

Advanced Computer Networks. Layer-7-Switching and Loadbalancing

THE MASTER LIST OF DNS TERMINOLOGY. First Edition

CS514: Intermediate Course in Computer Systems

Key Components of WAN Optimization Controller Functionality

DATA COMMUNICATOIN NETWORKING

Internet Content Distribution

Internet Content Distribution

High Availability HTTP/S. R.P. (Adi) Aditya Senior Network Architect

FortiBalancer: Global Server Load Balancing WHITE PAPER

Content Delivery Networks. Shaxun Chen April 21, 2009

The Application Front End Understanding Next-Generation Load Balancing Appliances

The Web History (I) The Web History (II)

Content Delivery Networks

THE MASTER LIST OF DNS TERMINOLOGY. v 2.0

The Effectiveness of Request Redirection on CDN Robustness

Accelerating Wordpress for Pagerank and Profit

Results-Oriented Application Acceleration with FastView Because Every Second Counts Whitepaper

Content Delivery Networks (CDN) Dr. Yingwu Zhu

Data Center Content Delivery Network

COMP 631: COMPUTER NETWORKS. How to distribute content without requiring centralized, heavy-duty servers? Peer-to-peer content distribution

Project #2. CSE 123b Communications Software. HTTP Messages. HTTP Basics. HTTP Request. HTTP Request. Spring Four parts

Global Server Load Balancing

Akamai CDN, IPv6 and DNS security. Christian Kaufmann Akamai Technologies DENOG 5 14 th November 2013

Rapid IP redirection with SDN and NFV. Jeffrey Lai, Qiang Fu, Tim Moors December 9, 2015

High volume Internet data centers. MPLS-based Request Routing. Current dispatcher technology. MPLS-based architecture

<Insert Picture Here> Oracle Web Cache 11g Overview

Implementing Reverse Proxy Using Squid. Prepared By Visolve Squid Team

From Internet Data Centers to Data Centers in the Cloud

Week 3 / Paper 2. Bernhard Ager, Wolfgang Mühlbauer, Georgios Smaragdakis, Steve Uhlig ACM IMC 2010.

DEPLOYMENT GUIDE Version 1.1. Deploying F5 with IBM WebSphere 7

The Value of Content Distribution Networks Mike Axelrod, Google Google Public

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

CDN Brokering. Content Distribution Internetworking

Meeting Worldwide Demand for your Content

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

Reverse Proxy with SSL - ProxySG Technical Brief

Application Layer -1- Network Tools

A Task-Based Adaptive-TTL approach for Web Server Load Balancing *

Measuring CDN Performance. Hooman Beheshti, VP Technology

B6: GET /started/with/ HTTP Analysis

Computer Networks - CS132/EECS148 - Spring

CS 188/219. Scalable Internet Services Andrew Mutz October 8, 2015

Information- Centric Networks. Section # 3.2: DNS Issues Instructor: George Xylomenos Department: Informatics

Scalable Internet Services and Load Balancing

Flash Crowds & Denial of Service Attacks

First Midterm for ECE374 02/25/15 Solution!!

CDN and Traffic-structure

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

AKAMAI WHITE PAPER. Delivering Dynamic Web Content in Cloud Computing Applications: HTTP resource download performance modelling

How To Model The Content Delivery Network (Cdn) From A Content Bridge To A Cloud (Cloud)

Scalable Internet Services and Load Balancing

GoToMyPC Corporate Advanced Firewall Support Features

Scalability of web applications. CSCI 470: Web Science Keith Vertanen

Single Pass Load Balancing with Session Persistence in IPv6 Network. C. J. (Charlie) Liu Network Operations Charter Communications

Cisco Application Networking for IBM WebSphere

DEPLOYMENT GUIDE Version 1.2. Deploying the BIG-IP System v10 with Microsoft IIS 7.0 and 7.5

Combining Global Load Balancing and Geo-location with Emissary TM

Working With Virtual Hosts on Pramati Server

Deployment Guide. AX Series with Microsoft Office SharePoint Server

Web Application Hosting Cloud Architecture

Zscaler Internet Security Frequently Asked Questions

CHAPTER 4 PERFORMANCE ANALYSIS OF CDN IN ACADEMICS

Load Balancing and Sessions. C. Kopparapu, Load Balancing Servers, Firewalls and Caches. Wiley, 2002.

Demand Routing in Network Layer for Load Balancing in Content Delivery Networks

Learning Management Redefined. Acadox Infrastructure & Architecture

Going Cloud, Going Mobile: Will Your Network Drag You Down? Wes Morgan, IBM / 2 Feb 2016

ZEN LOAD BALANCER EE v3.02 DATASHEET The Load Balancing made easy

Citrix NetScaler Global Server Load Balancing Primer:

The Measured Performance of Content Distribution Networks

F5 and Oracle Database Solution Guide. Solutions to optimize the network for database operations, replication, scalability, and security

HTTP. Internet Engineering. Fall Bahador Bakhshi CE & IT Department, Amirkabir University of Technology

Configuring Citrix NetScaler for IBM WebSphere Application Services

Adding Advanced Caching and Replication Techniques to the Apache Web Server

Transcription:

DATA COMMUNICATOIN NETWORKING Instructor: Ouldooz Baghban Karimi Course Book: Computer Networking, A Top-Down Approach, Kurose, Ross Slides: - Course book Slides - Slides from Princeton University COS461 Spring 2012 offering, Jennifer Rexford

Course Overview Basics of Computer Networks Internet & Protocol Stack Application Layer Transport Layer Network Layer Data Link Layer Advanced Topics Case Studies of Computer Networks Internet Applications Network Management Network Security Advanced Topics 2

Content Distribution Networks Single server Single point of failure Easily overloaded Far from most clients Popular content Popular site Flash crowd (aka Slashdot effect ) Denial of Service attack Advanced Topics 3

Skewed Popularity of Web Traffic Zipf or power-law distribution Characteristics of WWW Client-based Traces Carlos R. Cunha, Azer Bestavros, Mark E. Crovella, BU-CS-95-01 Advanced Topics 4

Proxy Caches Reactively replicates popular content Smaller round-trip times to clients client Proxy server origin server Reduces load on origin servers Reduces network load, and bandwidth costs client Maintain persistent TCP connections Advanced Topics 5

Forward Proxy Cache close to the client Improves client performance Reduces network provider s costs client Proxy server Explicit proxy Requires configuring browser Implicit proxy Service provider deploys an on path proxy that intercepts and handles Web requests client Advanced Topics 6

Reverse Proxy origin server Proxy server Cache close to server Improve client performance Reduce content provider cost Load balancing, content assembly, transcoding, etc. Directing clients to the proxy Map the site name to the IP address of the proxy origin server Advanced Topics 7

Reverse Proxy origin server Proxy server Cache close to server Improve client performance Reduce content provider cost Load balancing, content assembly, transcoding, etc. Directing clients to the proxy Map the site name to the IP address of the proxy origin server Advanced Topics 8

Limitations of Web Caching Much content is not cacheable Dynamic data Stock prices, scores, web cams CGI scripts R Results depend on parameters Cookies Results may depend on passed data SSL Encrypted data is not cacheable Analytics Owner wants to measure hits Stale data Or overhead of refreshing the cached data Advanced Topics 9

Content Distribution Network Proactive content replication Content provider (e.g., CNN) contracts with a CDN CDN replicates the content On many servers spread throughout the Internet origin server in North America CDN distribution node Updating the replicas Advanced Topics Updates pushed to replicas when the content changes CDN server in S. America CDN server in Europe CDN server in Asia 10

Server Selection Policy Live server For availability Requires continuous monitoring of liveness, load, and performance Lowest load To balance load across the servers Closest Nearest geographically, or in round-trip time Best performance Throughput, latency, Cheapest bandwidth, electricity, Advanced Topics 11

Server Selection Mechanism Application HTTP redirection Advantages Fine-grain control Selection based on client IP address GET Redirect GET OK Disadvantages Extra round-trips for TCP connection to server Overhead on the server Advanced Topics 12

Server Selection Mechanism Routing Anycast routing 1.2.3.0/24 Advantages No extra round trips Route to nearby server Disadvantages 1.2.3.0/24 Does not consider network or server load Different packets may go to different servers Used only for simple request-response apps Advanced Topics 13

Server Selection Mechanism Naming DNS-based server selection 1.2.3.4 Advantages Avoid TCP set-up delay DNS caching reduces overhead Relatively fine control DNS query Disadvantage Based on IP address of local DNS server Hidden load effect DNS TTL limits adaptation 1.2.3.5 local Advanced Topics 14

Example: Distributed servers Servers: ~61,000 Networks: ~1,000 Countries: ~70 Many customers Apple, BBC, FOX, GM IBM, MTV, NASA, NBC, NFL, NPR, Puma, Red Bull, Rutgers, SAP, Client requests Hundreds of billions per day Half in the top 45 networks 15-20% of all Web traffic worldwide Advanced Topics 15

& DNSs cnn.com (content provider) DNS root server GET index. html http://cache.cnn.com/foo.jpg 1 2 HTTP HTTP global regional End user Nearby Advanced Topics 16

& DNSs cnn.com (content provider) DNS root server GET index. html HTTP HTTP DNS lookup cache.cnn.com http://cache.cnn.com/foo.jpg 1 2 3 ALIAS: 4 g.akamai.net global regional End user Nearby Advanced Topics 17

& DNSs cnn.com (content provider) DNS root server GET index. html HTTP HTTP DNS lookup cache.cnn.com http://cache.cnn.com/foo.jpg 5 1 2 3 ALIAS: 4 6 g.akamai.net ALIAS a73.g.akamai.net DNS lookup g.akamai.net global regional End user Nearby Advanced Topics 18

& DNSs cnn.com (content provider) DNS root server GET index. html HTTP HTTP http://cache.cnn.com/foo.jpg 5 1 2 3 ALIAS: 4 6 g.akamai.net ALIAS a73.g.akamai.net End user DNS lookup cache.cnn.com Address 1.2.3.4 DNS lookup g.akamai.net global regional Nearby Advanced Topics 19 7 8

& DNSs cnn.com (content provider) DNS root server GET index. html HTTP HTTP http://cache.cnn.com/foo.jpg 5 1 2 3 ALIAS: 4 6 g.akamai.net ALIAS a73.g.akamai.net End user DNS lookup cache.cnn.com Address 1.2.3.4 GET /foo.jpg Host: cache.cnn.com 9 DNS lookup g.akamai.net global regional Nearby Advanced Topics 20 7 8

HTTP & DNSs cnn.com (content provider) DNS root server GET index. html HTTP http://cache.cnn.com/foo.jpg 5 1 2 3 ALIAS: 4 6 g.akamai.net ALIAS a73.g.akamai.net End user GET foo.jpg DNS lookup 11 cache.cnn.com 12 Address 1.2.3.4 GET /foo.jpg Host: cache.cnn.com 9 DNS lookup g.akamai.net global regional Nearby Advanced Topics 21 7 8

HTTP & DNSs cnn.com (content provider) DNS root server GET index. html HTTP http://cache.cnn.com/foo.jpg 5 1 2 3 ALIAS: 4 6 g.akamai.net ALIAS a73.g.akamai.net End user GET foo.jpg DNS lookup 11 cache.cnn.com 12 Address 1.2.3.4 GET /foo.jpg Host: cache.cnn.com 9 DNS lookup g.akamai.net 10 global regional Nearby Advanced Topics 22 7 8

HTTP & DNSs cnn.com (content provider) DNS root server GET index. html HTTP http://cache.cnn.com/foo.jpg 5 1 2 3 ALIAS: 4 6 g.akamai.net ALIAS a73.g.akamai.net End user GET foo.jpg DNS lookup 11 cache.cnn.com 12 Address 1.2.3.4 GET /foo.jpg Host: cache.cnn.com 9 DNS lookup g.akamai.net 10 global regional Nearby Advanced Topics 23 7 8

& DNSs cnn.com (content provider) DNS root server server GET index. html 1 2 high-level How Works: 8 9 Cache Hit 10 24 End user low-level DNS server Nearby hash-chosen server Advanced Topics 24 7 GET /cnn.com/foo.jpg

Mapping System Equivalence classes of IP addresses IP addresses experiencing similar performance Quantify how well they connect to each other Collect and combine measurements Ping, traceroute, BGP routes, server logs E.g., over 100 TB of logs per days Network latency, loss, and connectivity Map each IP class to a preferred server Based on performance, health, etc. Updated roughly every minute Map client request to a server in the Load balancer selects a specific server E.g., to maximize the cache hit rate Advanced Topics 25

Adapting to Failures Failing hard drive on a server Suspends after finishing in progress requests Failed server Another server takes over for the IP address Low-level map updated quickly Failed High-level map updated quickly Failed path to customer s origin server Route packets through an intermediate node Advanced Topics 26

Transport Optimizations Bad Internet routes Overlay routing through an intermediate server Packet loss Sending redundant data over multiple paths TCP connection set-up/teardown Pools of persistent connections TCP congestion window and round-trip time Estimates based on network latency measurements Advanced Topics 27

Application Optimizations Slow download of embedded objects Prefetch when HTML page is requested Large objects Content compression Slow applications Moving applications to edge servers E.g., content aggregation and transformation E.g., static databases (e.g., product catalogs) E.g. batching and validating input on Web forms Advanced Topics 28

Conclusion Content distribution is hard Many, diverse, changing objects Clients distributed all over the world Reducing latency Contribution distribution solutions Reactive caching Proactive content distribution networks Advanced Topics 29