Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

Similar documents

How To Understand The Power Of A Content Delivery Network (Cdn)

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

Real-Time Analysis of CDN in an Academic Institute: A Simulation Study

Content Delivery Networks (CDN) Dr. Yingwu Zhu

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

Request Routing, Load-Balancing and Fault- Tolerance Solution - MediaDNS

ICP. Cache Hierarchies. Squid. Squid Cache ICP Use. Squid. Squid

GLOBAL SERVER LOAD BALANCING WITH SERVERIRON

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

CS 188/219. Scalable Internet Services Andrew Mutz October 8, 2015

From Internet Data Centers to Data Centers in the Cloud

DATA COMMUNICATOIN NETWORKING

Web DNS Peer-to-peer systems (file sharing, CDNs, cycle sharing)

Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Understanding Slow Start

Internet Content Distribution

CDN and Traffic-structure

Intelligent Content Delivery Network (CDN) The New Generation of High-Quality Network

HUAWEI OceanStor Load Balancing Technical White Paper. Issue 01. Date HUAWEI TECHNOLOGIES CO., LTD.

Cisco Videoscape Distribution Suite Service Broker

Lecture 2 CS An example of a middleware service: DNS Domain Name System

Domain Name System :49:44 UTC Citrix Systems, Inc. All rights reserved. Terms of Use Trademarks Privacy Statement

Overview. Tor Circuit Setup (1) Tor Anonymity Network

CS312 Solutions #6. March 13, 2015

Distributed Systems 19. Content Delivery Networks (CDN) Paul Krzyzanowski

Content Delivery Networks. Shaxun Chen April 21, 2009

CHAPTER 4 PERFORMANCE ANALYSIS OF CDN IN ACADEMICS

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

Web Caching and CDNs. Aditya Akella

Load Balancing using MS-Redirect Mechanism

How To - Configure Virtual Host using FQDN How To Configure Virtual Host using FQDN

IERG 4080 Building Scalable Internet-based Services

Data Center Content Delivery Network

High volume Internet data centers. MPLS-based Request Routing. Current dispatcher technology. MPLS-based architecture

Globule: a Platform for Self-Replicating Web Documents

CSC2231: Akamai. Stefan Saroiu Department of Computer Science University of Toronto

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

TECHNOLOGY WHITE PAPER Jun 2012

DNS, CDNs Weds March Lecture 13. What is the relationship between a domain name (e.g., youtube.com) and an IP address?

Making the Internet fast, reliable and secure. DE-CIX Customer Summit Steven Schecter <schecter@akamai.com>

Computer Networks & Security 2014/2015

Transport and Network Layer

Web Application Hosting Cloud Architecture

Getting Started with AWS. Hosting a Static Website

Introduction to Network Operating Systems

Demand Routing in Network Layer for Load Balancing in Content Delivery Networks

Configuring DNS. Finding Feature Information

Server Traffic Management. Jeff Chase Duke University, Department of Computer Science CPS 212: Distributed Information Systems

A Taxonomy of CDNs. Chapter Introduction. Mukaddim Pathan and Rajkumar Buyya

Computer Networks - CS132/EECS148 - Spring

ExamPDF. Higher Quality,Better service!

ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy

Public Cloud Partition Balancing and the Game Theory

Content Distribu-on Networks (CDNs)

THE MASTER LIST OF DNS TERMINOLOGY. v 2.0

The old Internet. Software in the Network: Outline. Traditional Design. 1) Basic Caching. The Arrival of Software (in the network)

3. The Domain Name Service

Identifying More Efficient Ways of Load balancing the Web (http) Requests.

Cloud Computing Disaster Recovery (DR)

Caching Techniques on CDN Simulated Frameworks

Computer Networks 1 (Mạng Máy Tính 1) Lectured by: Dr. Phạm Trần Vũ MEng. Nguyễn CaoĐạt

Outline. Definition. Name spaces Name resolution Example: The Domain Name System Example: X.500, LDAP. Names, Identifiers and Addresses

CSIS CSIS 3230 Spring Networking, its all about the apps! Apps on the Edge. Application Architectures. Pure P2P Architecture

1 Introduction: Network Applications

THE MASTER LIST OF DNS TERMINOLOGY. First Edition

DEPLOYMENT GUIDE Version 1.1. DNS Traffic Management using the BIG-IP Local Traffic Manager

Enabling Media Rich Curriculum with Content Delivery Networking

Availability Digest. Redundant Load Balancing for High Availability July 2013

Application Note. Active Directory Federation Services deployment guide

Domain Name System (DNS)

The Effect of Caches for Mobile Broadband Internet Access

Towards a Peer-to-Peer Extended Content Delivery Network

FortiBalancer: Global Server Load Balancing WHITE PAPER

FAQ: BroadLink Multi-homing Load Balancers

Content Delivery Networks

Topics. 1. What is load balancing? 2. Load balancing techniques 3. Load balancing strategies 4. Sessions 5. Elastic load balancing

Introduction to Computer Networks

Barracuda Load Balancer Online Demo Guide

Concept of Cache in web proxies

TESTING & INTEGRATION GROUP SOLUTION GUIDE

Evaluating Cooperative Web Caching Protocols for Emerging Network Technologies 1

ZEN LOAD BALANCER EE v3.02 DATASHEET The Load Balancing made easy

Internet Content Distribution

Measuring CDN Performance. Hooman Beheshti, VP Technology

NET0183 Networks and Communications

Testing & Assuring Mobile End User Experience Before Production. Neotys

Naming. Name Service. Why Name Services? Mappings. and related concepts

Understanding DNS (the Domain Name System)

How Comcast Built An Open Source Content Delivery Network National Engineering & Technical Operations

Getting Started with AWS. Static Website Hosting

Transcription:

Department of Computer Science Institute for System Architecture, Chair for Computer Networks Caching, Content Distribution and Load Balancing

Motivation Which optimization means do exist? Where should means for optimization be applied? First Mile? Web Last Mile? Middle Mile? Web documents recently published containing important news Within one point in time several thousands of requests have to be handled ( flash crowd problem) Globally distributed users First Mile Middle Mile Last Mile 2

Outline Web caching and associated protocols Internet Cache Protocol (ICP) Cache consistency mechanisms Content Distribution Networks (CDNs) Architecture of a CDN Content replication mechanisms Akamai Load Balancing Proxy-based Load Balancing Elastic Load Balancing 3

Web Caches Mechanism for temporary storage of Web resources to reduce bandwidth usage, load, and perceived lag A Web cache stores copies of documents passing through it; subsequent requests may be satisfied from the cache Caches are often deployed by Internet Service Providers (ISPs) to cache content for own users by Web Providers to avoid regeneration of content for different users ISPs and Web s run various caches in parallel which store subsets of entirely cached documents Interaction between caches is important for cooperatively answer requests Request ISP Cache ISP Cache ISP Cache Is a request to the original necessary? Web Does any cache contain the content? 4

Internet Cache Protocol Request / reply protocol used for coordinating (hierarchical) Web caches and for exchanging data between caches on demand Specified in Request for Comments 2186 and 2187 Goals: to enable efficient inter-cache communication and to minimize the number of remote requests to the originating Most important protocol message semantics: ICP_OP_QUERY: Request a resource from another cache ICP_OP_HIT: Indicate that a cache offers a specific resource ICP_OP_MISS: Indicate that a requested resource is not available in a cache Requested Web resource is locally not available User Datagram Protocol (UDP) is used as transport layer protocol HTTP GET-Request ISP Cache ICP_OP_QUERY ICP_OP_HIT ISP Cache HTTP GET-Request HTTP Reply Web resource is delivered to Web client and cached locally HTTP Reply 5

Internet Cache Protocol Using configuration options in cache systems, cache hierarchies can be built Sample configuration file used, e.g., in the ICP-enabled proxy implementation Squid (path in Unix-based systems: /etc/squid/squid.conf) TCP port for HTTP communication with cache UDP port for ICP communication cache_peer parentcache.example.com parent 3128 3130 cache_peer childcache2.example.com sibling 3128 3130 cache_peer childcache3.example.com sibling 3128 3130 parentcache.example.com parent cache ISP Cache access Origin Web ISP Cache ISP Cache ISP Cache sibling cache systems childcache1.example.com (holding above configuration file) childcache2.example.com childcache3.example.com 6

Hierarchical Cache with ICP Caches first try to resolve requests on the same level and thus request sibling caches for a Web resource not available locally If the Web resource is not available or if the validity of a Web resource has expired in all sibling caches, the parent cache is accessed For minimizing interaction, only parent caches are allowed to fetch Web resources from the origin HTTP GET Request 1 8 HTTP Reply Specific flag ICP_FLAG_HIT_OBJ is set in query, thus the object is directly returned in the response (ICP_OB_HIT_OBJ) HTTP 7 ISP Cache GET Request HTTP Reply ICP_OB_HIT_OBJ ICP_OP_MISS 3 ICP_OP_QUERY 4 ISP Cache ISP Cache ICP_OP_QUERY 2 ISP Cache 5 6 Origin Web 7

Cache Consistency Mechanism Consistency Categories Strong consistency Delta consistency Weak consistency The cached versions are consistent with a content provider s version at all times An update of the version of the content provider will propagate through the system and all replicas will be consistent after a fixed time period (=configuration parameter) The version available in a cache may or may not be consistent with the version of a content provider To provide cache consistency, three categories of solutions exist: Server-driven consistency: The content provider notifies the caches every time the content changes (e.g. Web Content Distribution Protocol) Client-driven consistency: The clients request updates managed Web resources in regular intervals (polling) Leases approach: A lease is a promise by the that it will push updates only a specified time period 8

Web Content Distribution Protocol HTTP and XML-based protocol for updating replicated content Supports strong, delta, and weak consistency Example for strong consistency mechanism: Replication / Cache Server Register request content_group = 12345 consistency_level = strong Register response Consistency_level = strong ( confirmation) Invalidation notification request invalidation_id (object id) = 9876 immediate-refresh commit = yes Origin Server Object in content group 12345 changes Origin determines all subscribers Replace old version by new version Fetch object from Invalidation notification response Commit request Commit response Server waits until all subscribers have sent a response Before a commit request is received by the replication / cache, every request to the invalidated Web object is forwarded to the origin 9

Web Content Distribution Protocol Example for an invalidation request sent by the origin to the replication cache : POST /invalidate_request HTTP/1.1 Content Length: 512 <?xml version= 1.0 > <!DOCTYPE InvRequest wcdp.dtd > <invalidation sequence_number=100> <identifier> <object_invalidation_id=9876/> <object_invalidation_group_id=12345/> </identifier> <action> <invalidate=immediate/> <refresh=yes/> </action> <type atomic=no/> <consistency require_commit=yes/> </invalidation> HTTP header Sequence number used, e.g., in later commit request Identification of the invalidated object Due to strong consistency, the invalidation should be done immediately atomic = no: a single object and not the whole group is updated Commit is sent after all subscribers have sent a response 10

Leases-based Caching Request Get resource + lease request Replication / Cache Server Reply + lease information Invalidation of resource time < lease_time Origin Server Server issues lease on first request and sends notification until expiry Clients need to renew lease upon expiry time if necessary Smooth tradeoff between state and message exchange Zero lease duration: polling, infinite leases: -push Lease duration may be determined based on various factors: Age-based leases: granting short leases to frequently modify objects and long leases to long lived objects Renewal-frequency-based leases: proxies at which an object is popular get longer leases Server load based leases: shorter leases are issued during heavy load 11

Regular Web Caches - Problems Caching proxies are mainly deployed by Internet Service Providers (ISP) in order to handle requests close to users Central Problems: Proxies serve either only ISP clients or Web s they have been deployed for No network-wide optimization to avoid congestion or to improve latency issues Content is not cached proactively only Web resources requested in regular intervals are located in proxy caches System is still vulnerable to flash crowd problem Origin Server (www.example.com) Internet Service Provider 1 1 3 Internet Service Provider 2 Proxy cache Proxy 2 2 intercepters intercepters cache www.example.com/index.htm www.example.com/index.htm www.example.com/index.htm 1 2 4 3 12

Development of CDNs Increased functional abilities of Internet and Web Applications, improved performance, amount of content increases Pre-CDN period First Generation CDNs Second Generation CDNs Hierarchical caching Static and dynamic content Video on Demand, media streaming Communitybased CDNs Combination of CDNs with peerto-peer-based infrastructures Server farms Caching proxy deployment Efficiency improved Web minor technological gap These in practice very important topics are mainly covered in this lecture Paradigm change: Responsibility shifts further to end-users (convergence to peerto-peer-based solutions) Beginning of 90 s Late 90 s 2005 today When have approaches been developed? future? 13

Content Distribution Networks A Content Distribution Network (alternative: Content Delivery Network) is a system of computers containing replicated data placed at various nodes of a network Intended to improve access to data by increasing access bandwidth, redundancy and reducing access latency Uses techniques for content replication (=proactive placement of content) and content caching Comparison to Proxy Servers: Feature Proxy Server CDN Key practice Web caching Content replication and caching Cached content Dynamically changes; content requested by users of Website / ISP Predefined content distributed by the CDN-supported content providers Scalability Low to medium high Performance Vulnerable to flash crowd events Stable; suitable for resourcehungry applications (e.g. streaming media) 14

Architectural Components of a CDN Web s delivering content to end users via a CDN Origin Server 1 Origin Server 2 Origin Server n 7 Billing Organization Content Distribution Network 3 1 Monitoring and Distribution System Accounting System 6 Request Routing System 2 6 4 Replication Server 1 Replication Server 2 Replication Server n 5 5 Edge Servers offering content replicated from origin s Web User 1 Web User 2 Web User n 15

Architectural Components of a CDN 1 2 3 4 5 6 7 Origin s replicate their content proactively to the Content Distribution Network Content is forwarded to different replication s via the Distribution Subsystem Information about the placement of content is provided to the Request Routing Subsystem Requests of users for specific Web resources are directed to the Request Routing Subsystem which selects an appropriate Replication Server based on proximity and current network characteristics (load, latency, ) After forwarding the user requests to an appropriate Replication Server, the requested Web resource is delivered to the user The amount of content is transmitted for billing purposes to the Accounting System The customers are billed based on the amount of transferred data and usage of the CDN 16

Comparison between P2P networks and CDNs Category Constitution Main goal Integrity Consistency CDNs A collection of networked computers spanning the Internet Distribution of replication s to the edge of the Internet Reducing latency during content delivery in the Web Integrity should be guaranteed between replication s Strong cache consistency between cache s Peer-to-Peer- Networks File retrieval networks formed by ad-hoc aggregation of resources Collaboration among peers File sharing among peers Not addressed Weak consistency between cached content Autonomy None Autonomous peers Administration Individual companies, proprietary in nature Self-interested end users / peers 17

Combining replication and caching in CDNs Replication Server (Storage capacity: c + r = 100%) Cached Content in Dynamic Cache c% Replicated Content in Static Cache r% Filled by classical cache replacement policies such as: Least Recently Used (LRU) Least Frequently Used (LFU) Filled by replication strategy such as il2p Total storage capability of a replication has to be distributed to the dynamic and static cache Central question: Which percentage c of the total storage capacity should be distributed to the dynamic cache and which percentage r to the static cache? 18

Combining replication and caching in CDNs By combining a huge amount of statically replicated content with a solid amount of content adapted to user behavior, very good performance is achieved Hit ratio 1,0 0,8 0,6 0,4 Mean response Time (s) 1,2 1,0 0,2 100%r 0% c 80%r 20% c 50%r 50% c 20%r 80% c Replication vs. Caching percentage 0% r 100% c 0,8 0,6 0,4 0,2 100%r 0% c 80%r 20% c 50%r 50% c 20%r 80% c Replication vs. Caching percentage 0% r 100% c Configuration turns system into a pure proxy cache with all drawbacks regarding scalability and vulnerability to flash crowd events 19

Exemplary Replication strategy: il2p Replication s cooperate with each other?? Latency and load object placement (il2p) = Cooperative push-based replication strategy taking into account network s latency and the Web resources load Problem of optimal replication is divided into two sub-problems: Selection of the best replication for each object (phase 1) Prioritizing the potential replications by taking the object popularity into account (phase 2) Phase 1 and 2 are repeated until all objects are distributed and the replication s do not have any free storage space left Rep. Rep. Phase 1 tries to find optimal object/replicatio n pairs minimizing the network latency Rep. Rep. Phase 2 selects from the results of phase 1 the object which produces the highest network load and place it on the By taking the popularity of the Web objects in the 2 nd phase into account, it can be avoided to place many popular objects on one thus increasing the probability to overload this Rep. Rep. 20

il2p Phase 1 The first phase determines a distribution of objects to replication s that minimizes the overall latency Goal: Minimize the function cost(x) where x defines the placement of an object cost ( x) N K (( p * / N k i i 1 k 1 j 1 )* D j ik ( x)) i, j = replication s k = object (Web resource) i p k = request rate for replication i = probability that a user will request object k Distance D to a replica of object k from replication i under placement x; as distance measure the latency is used The cost of a placement is high if a has got a high request rate, the probability that a user will request an object or the distance from many replication s to the object is high During every iteration the parameters for calculating the cost are adapted 21

il2p Phase 2 For every placement determined in phase 1, a utility value is calculated which takes the load of an object into account: Utility _ value load * latency where latency k = the latency that the object k produces if it is replicated to the replication which has been determined by the previous step load k = total load of object k access_rate k = number of accesses of object k per unit time s k = the size of object k load access _ rate * s k k k Only the object with the highest utility value is replicated to the selected in phase 1, afterwards phase 1 starts again During the next iteration, s already containing an object are excluded from the optimal placement selection in phase 1 for this object As a result of the algorithm objects with high loads may be placed to various replication s k k k Potential result: Objects with high loads are assigned to various replication s Rep. Rep. Rep. These replication s are selected reducing the latency during object access 22

Excursus: Domain Name System The Domain Name System (DNS) is a distributed, hierarchical database enabling the translation of domain names to IP addresses Every name contains a set of databases ( zone files ) describing translation rules ( resource records ) for domain names it is assigned for Various types of resource records are available, such as: Type A: describes a mapping between a domain name and an IPv4 address Type NS: maps a domain name to another name assigned for this domain Type CNAME: specifies that the domain name is an alias of another - the DNS lookup will continue by retrying the lookup with the new name Example resource records: mail.example.com. A 141.76.40.23 ftp.example.com. A 141.76.40.24 www.example.com. CNAME www.example.com.edgesuite.net The CDN Akamai uses the CNAME mechanism to route requests into the Akamai network and thus to Akamai replication s 23

Akamai Akamai is the largest CDN worldwide with more than 15.000 replication s deployed in 60 different countries Contains CNAME entry for www.example.com Query: www.example.com.edgesuite.net Remote DNS Server 2 Query: www.example.com Local DNS Server Remote DNS Server 3 Local network 4 a1312.g.akamai.net 5 za.akamaitech.net 6 n0g.akamai.net Return IP address of replication 7 Akamai root DNS Akamai highlevel DNS Akamai low-level DNS Akamai network 1 http://www.example.com 8 Replication / Edge Replication / Edge Replication / Edge 24

Akamai 1 2 3 4 7 to For resolving domain names to IP addresses, a local domain is accessed in a first step The request is forwarded by the local DNS to a on a higher hierarchical level which maps the requested domain to another domain (due to a CNAME entry in the resource records) The newly received domain (www.example.com.edgesuite.net) is resolved via a further name that directs the request to a domain deployed by Akamai 6 The Akamai domain forward the request to a domain of a sub-network assigned responsibility to serve the request The IP address of the responsible replication is returned to the local DNS thus the user client can send a request to this in step 8 25

Request Routing System in Akamai Forwarding requests to a replication is done in a two-stop-approach for scalability purposes: Specific IP address range After selecting an appropriate cluster close to the user / his local DNS, a concrete replication is determined within this cluster Selection of cluster is based on IP address range of user / local DNS and various historic and real-time data gathered from a monitoring system (network load, log files, ) and based on configuration options Selection of a concrete replication is done on historic and real-time monitoring data plus configuration options Local DNS Server Server Infrastructure + Web resources Mapping to Akamai cluster Akamai infrastructure 1 2 Request Akamai highlevel DNS Routing System 5 6 10 Selected Cluster Akamai low-level DNS Continuous monitoring Continuous monitoring 4 Cluster Monitoring System Global Monitoring System Continuous updates Configuration Options 7 Mapping to Akamai replication 9 26

Community Networks Community networks are networks mainly formed by decentralized users contributing to the services and the content available in such a network In community networks, the decentralized characteristics of the Internet are explored and means are applied to enable a direct interaction of end user systems For this purpose peer-to-peer-based protocols are used Thus, in sum network infrastructures result which combine CDN systems with peer-to-peer infrastructures Internet Cellular network with mobile users Operator cache / replication Mobile Node Mobile Node Mobile Node Content Distribution Network Peer-to-Peer (P2P) protocols are used Stationary Node Stationary Node Stationary Node Stationary Node 27

Load Balancing To ensure high availability and performance of Web applications, load balancing between replicated instances of a Web is applied Example for easily applicable approach: DNS Round Robin: IP addresses of redundant s are published as A records for one domain; DNS delivers addresses in changing order ( round robin ) Resolver Resolver www.example.com. 2400 IN A 141.76.40.2 www.example.com. 2400 IN A 141.76.40.8 Request 1 A 141.76.40.2 A 141.76.40.8 Request 2 A 141.76.40.8 A 141.76.40.2 DNS- Server Selected drawbacks if DNS-based approach is applied without extension: Does not enable dynamic delegation to s depending on their current load For every redundant, a public IP address has to be provided DNS replies are cached on client side and thus follow-up requests are always sent to the same Drawbacks are avoided by applying proxy-based approaches 28

Proxy-based Load Balancing Approach: Software proxy ( front end ) intercepts requests and delegates them to one out of a group of identical s ( back ends ) running the Web application Front end Server 1 Load Balancing Proxy 141.76.40.210 Has public IP Balancing algorithm determines which to select (e.g. round robin or priority-based selection) Delegation is realized using Network Address Translation (NAT) or on application level Back ends 192.168.0.10 Server 2 Web Application Server 3 Web Application 192.168.0.11 Physical or virtual machines Run identical copies of the Web application Have only private IP addresses Example configuration for front end and back ends based on syntax of HAProxy (http://www.haproxy.org/) frontend www bind *:80 acl url_blog path_beg /blog use_backend blog-backend if url_blog default_backend web-backend If document path within URL starts with string /blog, requests are delegated to the s defined in the back end blog-backend backend blog-backend balance roundrobin mode http blog1 192.168.0.10:80 check blog2 192.168.0.11:80 check backend web-backend Balancing strategy Health of is continuously checked by proxy 29

Virtual machines report in regular 1 intervals selected load/ usage statistics (e.g. memory) Elastic Load Balancing Elastic Load Balancing / autoscaling enables automatic creation or termination of virtual machines (VMs) based on the system s or application s state Implementation examples: Delegation of requests to virtual machines is done via Load Balancing Proxy Amazon Elastic Compute Cloud OpenStack Heat Autoscaling System Resource Monitor CPU: 42% Memory:3,1 GB VM 1 Web Application VM 2 Web Application Physical Machine 1 2 CPU: 70% Memory: 4,8 GB Autoscaling Conditions Decision Unit Load Balancing Proxy CPU: 30% Memory: 2,1 GB VM 3 Web Application Typically provided via configuration file 3 VM 4 Web Application Physical Machine 2 VM Management/ Scheduler Create/delete virtual machine 4 Update proxy configuration 30

Conclusion Caching proxy deployment Hierarchical caching Various protocols are available for inter-cache communication and Web content distribution, e.g.: Internet Cache Protocol (ICP) Web Content Distribution Protocol (WCDP) Possible consistency guarantees: Strong consistency Delta consistency Weak consistency Central problems regarding caching solutions: Caches are only responsible for selected Web s or serve only customers of one Internet Service Provider Content is not cached proactively only Web resources requested in regular intervals are located in proxy caches Content Distribution Networks solve this problem Content Distribution Networks e.g. Akamai Core mechanism: Redirection of users requests to one of the replication s (e.g. by using CNAME entries in DNS s) Mechanism applies as well for streaming CDNs CDNs combine strategies of caching content and replicating it proactively For achieving high availability and performance of Web applications, Proxybased Load Balancing can be applied and potentially extended by automatic scaling mechanisms (Elastic Load Balancing) 31

References RFCs of Internet Cache Protocol: Website of the Web Cache Squid : Book about CDNs: Paper about il2p: Website of Akamai Technologies, Inc.: Amazon EC2 Load Balancing: OpenStack Heat: http://tools.ietf.org/html/rfc2186 http://tools.ietf.org/html/rfc2187 http://www.squid-cache.org/ Content Delivery Networks, Lecture Notes in Electrical Engineering, Vol. 9 Buyya, Rajkumar; Pathan, Mukaddim; Vakali, Athena (Eds.), 2008 George Pallis, Konstantinos Stamos, Athena Vakali, Dimitrios Katsaros, Antonis Sidiropoulos, Yannis Manolopoulos. "Replication based on Objects Load under a Content Distribution Network", In Proceedings of the 2nd IEEE International Workshop on Challenges in Web Information Retrieval and Integration (WIRI) (In conjunction with ICDE'06), IEEE Press, Atlanta, Georgia, USA, April 3rd, 2006. http://www.akamai.com/ http://aws.amazon.com/de/documentation/elasticloadbalancing/ https://wiki.openstack.org/wiki/heat 32