An Analysis of Facebook Photo Caching

Similar documents
Finding a needle in Haystack: Facebook s photo storage IBM Haifa Research Storage Systems

How To Understand The Power Of Icdn

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

The Value of a Content Delivery Network

Testing & Assuring Mobile End User Experience Before Production. Neotys

Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Overlay Networks. Slides adopted from Prof. Böszörményi, Distributed Systems, Summer 2004.

Traffic delivery evolution in the Internet ENOG 4 Moscow 23 rd October 2012

Content Delivery Networks. Shaxun Chen April 21, 2009

Measuring CDN Performance. Hooman Beheshti, VP Technology

ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective

Content Management in Planet-Scale Video CDNs. Kianoosh Mokhtarian

Migration of Virtual Machines for Better Performance in Cloud Computing Environment

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

How To Understand The Power Of A Content Delivery Network (Cdn)

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

ECE6130 Grid and Cloud Computing

Object Request Reduction in Home Nodes and Load Balancing of Object Request in Hybrid Decentralized Web Caching

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

QoS Provision in a Cloud-Based Multimedia Storage System

STeP-IN SUMMIT June 2014 at Bangalore, Hyderabad, Pune - INDIA. Mobile Performance Testing

Making the Internet fast, reliable and secure. DE-CIX Customer Summit Steven Schecter <schecter@akamai.com>

Data Center Use Cases and Trends

Netflix Open Connect. 2 Terabits from 6 racks

Storage I/O Control: Proportional Allocation of Shared Storage Resources

Data Center Content Delivery Network

The old Internet. Software in the Network: Outline. Traditional Design. 1) Basic Caching. The Arrival of Software (in the network)

Hierarchical Content Routing in Large-Scale Multimedia Content Delivery Network

Internet Traffic Evolution

AKAMAI WHITE PAPER. Delivering Dynamic Web Content in Cloud Computing Applications: HTTP resource download performance modelling

MCSE SYLLABUS. Exam : Managing and Maintaining a Microsoft Windows Server 2003:

Scalable Internet Services and Load Balancing

Evolution of OpenCache: an OpenSource Virtual Content Distribution Network (vcdn) Platform

Evaluating Cooperative Web Caching Protocols for Emerging Network Technologies 1

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

Azure Media Service Cloud Video Delivery KILROY HUGHES MICROSOFT AZURE MEDIA

Lecture Outline. Topology Design: I. Topologies and Reliability. Another Case: Mobile Performance Optimization. Jeremiah Deng.

The Classical Architecture. Storage 1 / 36

Week 3 / Paper 2. Bernhard Ager, Wolfgang Mühlbauer, Georgios Smaragdakis, Steve Uhlig ACM IMC 2010.

Overview of recent changes in the IP interconnection ecosystem

WAVE: Popularity-based and Collaborative In-network Caching for Content-Oriented Networks

Scalable Internet Services and Load Balancing

AKAMAI WHITE PAPER. The Challenges of Connecting Globally in the Pharmaceutical Industry

Wikimedia Architecture Doing More With Less. Asher Feldman Ryan Lane Wikimedia Foundation Inc.

Performance In The Cloud

CDN and Traffic-structure

Wikimedia architecture. Mark Bergsma Wikimedia Foundation Inc.

Mobile Performance Testing Approaches and Challenges

Improving Content Delivery by Exploiting the Utility of CDN Servers

Configuring Citrix NetScaler for IBM WebSphere Application Services

Teridion. Rethinking Network Performance. The Internet. Lightning Fast. Technical White Paper July,

Monetizing Networks in the Enterprise and Content Ecosystems. C. Kaplan VP Global Consulting Ciena Corporation

APPLICATION NOTE. Elastic Scalability. for HetNet Deployment, Management & Optimization

The Importance of High Customer Experience

A Proposed Service Broker Strategy in CloudAnalyst for Cost-Effective Data Center Selection

Novel Systems. Extensible Networks

Distributed Systems. 24. Content Delivery Networks (CDN) 2013 Paul Krzyzanowski. Rutgers University. Fall 2013

KEMP LoadMaster. Enabling Hybrid Cloud Solutions in Microsoft Azure

Building a Resilient World Wide Web

Berlin Storage, Backup and Disaster Recovery in the Cloud AWS Customer Case Study: HERE Maps for Life

Going Cloud, Going Mobile: Will Your Network Drag You Down? Wes Morgan, IBM / 2 Feb 2016

LARGE SCALE INTERNET SERVICES

Evaluating Impact of Storage on Smartphone Energy Efficiency

On efficient delivery of web content

Timing of a Disk I/O Transfer

leaseweb cdn CDN Product Sheet - LeaseWeb - EN 1.06

Distributed Systems. REK s adaptation of Prof. Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 1

Q: What is the difference between the other load testing tools which enables the wan emulation, location based load testing and Gomez load testing?

Performance of vsphere Flash Read Cache in VMware vsphere 5.5

Load Balancing in Distributed Web Server Systems With Partial Document Replication

THE CHALLENGE OF AVAILABILITY & PERFORMANCE OF YOUR SITES

The Cloud Trade Off IBM Haifa Research Storage Systems

The Effect of Caches for Mobile Broadband Internet Access

Content Delivery Networks (CDN) Dr. Yingwu Zhu

Where Do You Tube? Uncovering YouTube Server Selection Strategy

Comparative Performance Report

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, UC Berkeley, Nov 2012

HOW IS WEB APPLICATION DEVELOPMENT AND DELIVERY CHANGING?

The Requirement for a New Type of Cloud Based CDN

Web Caching and CDNs. Aditya Akella

Research on AWS. Genomics Research Statistical Analysis Mathematical Modeling Computational Fluid Dynamics Grid/Cluster/High Performance Computing

Experimentation with the YouTube Content Delivery Network (CDN)

Carrier-grade VoIP platform with Kamailio at 1&1

Fault-Tolerant Framework for Load Balancing System

Connectivity in a Cloud-Enabled World

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

Content Delivery Networks

Bit-Rate and Application Performance in Ultra BroadBand Networks

DATA COMMUNICATOIN NETWORKING

Pavlo Baron. Big Data and CDN

IPv6 deployment status & Migration Strategy

Finding a needle in Haystack: Facebook s photo storage

Deploying ArcGIS for Server Using Esri Managed Services

Exploiting Locality of Interest in Online Social Networks

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Enhancing Performance Test Strategy for Mobile Applications

Information- Centric Networks. Section # 3.2: DNS Issues Instructor: George Xylomenos Department: Informatics

DATA COMMUNICATOIN NETWORKING

Advanced Farm Administration with XenApp Worker Groups

Tools and Methods for Testing the QoS of Internet services

Transcription:

An Analysis of Facebook Photo Caching Qi Huang, Ken Birman, Robbert van Renesse, Wyatt Lloyd, Sanjeev Kumar, Harry C. Li Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author. Copyright is held by the Owner/Author(s). SOSP 13, Nov. 3 6, 2013, Farmington, Pennsylvania, USA. ACM 978-1-4503-2388-8/13/11. http://dx.doi.org/10.1145/2517349.2522722

The problem 250 billion photos on Facebook in 2013 350 million more uploaded daily High availability and low latency required for delivery infrastructure.

Facebook's Photo-Serving Stack Client Browser cache Facebook storage and caches Akamai CDN

Facebook's Photo-Serving Stack Client Browser cache Facebook storage and caches Akamai CDN Focus exclusively on Facebook stack

Edge Caches (FIFO) Client Browser cache PoP Edge cache tens of independent edge caches across the US

Edge Caches (FIFO) Client Browser cache PoP Edge cache Purpose: reduce latency reduce bandwidth utilization between clients and datacenters

Global Origin Cache (FIFO) Data center Client Browser cache PoP Edge cache hash(id) Origin cache Data center a single global cache requests routed to specific data centers based on hash of photo id

Global Origin Cache (FIFO) Data center Client Browser cache PoP Edge cache hash(id) Origin cache Data center Purpose: traffic sheltering for I/O bound backend

Haystack backend Data center Client Browser cache PoP Edge cache hash(id) Origin cache Backend (Haystack) Backend (Haystack) Data center located in the same data centers as origin servers

Resizers Resizer Data center Client Browser cache PoP Edge cache hash(id) Origin cache R R Backend (Haystack) Backend (Haystack) Data center photos are served in different sizes to different users resized photos are treated as distinct objects by the cache layer

Methodology

Stack instrumentation Client PoP Data center Browser cache Edge cache Origin cache R Backend (Haystack) Instrumentation scope desktop clients only data for browser cache inferred by counting requests seen at browsers and edge caches

Sampling method request-based sampling: collect X% of requests object-based sampling: collect X% of objects

Sampling method request-based sampling: collect X% of requests object-based sampling: collect X% of objects by deterministic test on photo id

Object-based Sampling better coverage of unpopular photos simplified cross-stack analysis

Analysis

The collected data One-month long trace 77M requests from 13.2M user browsers 1.3M unique photos (2.6M photo objects)

Traffic sheltering Client PoP Data center Browser cache Edge cache Origin cache R Backend (Haystack) 77.2M 26.6M 11.2M 7.6M 65.5% 58% 31.8%

Popularity Distribution approximately Zipfian

Traffic share by photo popularity

Geographic Traffic Distribution

Client To Edge Cache Traffic Atlanta

Client To Edge Cache Traffic 5% New York 5% California 35% Washington D.C. 5% Dallas 20% Atlanta 10% Miami

Client To Edge Cache Traffic routing algorithm considers latency, edge capacity and ISP peering cost substantial remote traffic is normal

Edge Cache to Origin Cache Traffic Traffic flow based on content, not locality.

Cross-Region Traffic at Backend Caused by data migrations and hardware failures Only 0.2% of Origin Cache to Backend traffic

Potential Improvements

Cache Simulation replay the trace use the first 25% to warm the cache evaluate using the remaining 75% of the trace

Object-hit Ratios for Edge Cache S4LRU improves 9% at x over FIFO Still a lot of room for improvement

S4LRU Cache space L3 L2 L1 L0 Four queues are maintained at levels 0 to 3

S4LRU Cache space L3 L2 L1 L0 Missed objects inserted into the head of level 0 queue

S4LRU Cache space L3 L2 L1 L0 On a cache hit, the item is moved to the head of the next higher queue

S4LRU Cache space L3 L2 L1 L0 items evicted from the tail of a queue to the head of the next lower queue

Byte-hit Ratios for Edge Cache LFU fails to reduce bandwidth utilization over the current solution

Origin cache S4LRU improves Origin slightly more than Edge (14%)

Social-Network Analysis

Content Age Effect

Traffic share for non-profile photos by age

Traffic share for social activity groups. Number of followers