Swarms: First Class Citizens in the Future Internet. Don Towsley UMass - Amherst

Similar documents
The Internet is Flat: A brief history of networking over the next ten years. Don Towsley UMass - Amherst

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. File Sharing

The Role and uses of Peer-to-Peer in file-sharing. Computer Communication & Distributed Systems EDA 390

RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT

Computer Networks & Security 2014/2015

PEER TO PEER FILE SHARING USING NETWORK CODING

The BitTorrent Protocol

Peer-to-peer filetransfer protocols and IPv6. János Mohácsi NIIF/HUNGARNET TF-NGN meeting, 1/Oct/2004

QoE-Aware Multimedia Content Delivery Over Next-Generation Networks

P2P File Sharing: BitTorrent in Detail

Final for ECE374 05/06/13 Solution!!

Peer-to-Peer Networks. Chapter 2: Initial (real world) systems Thorsten Strufe

D. SamKnows Methodology 20 Each deployed Whitebox performs the following tests: Primary measure(s)

N6Lookup( title ) Client

Network Applications

VIA CONNECT PRO Deployment Guide

Common P2P Examples. Peer to Peer Networks. Client-Server Architecture. Uses of P2P. Napster Morpheus Gnutella Freenet BitTorrent Skype

Computer Networks and the Internet

Transport and Network Layer

Application Description

Resource Pooling across the Internet. Mark Handley, UCL

An Introduction to VoIP Protocols

CSCI-1680 CDN & P2P Chen Avin

Peer-to-Peer Networks 02: Napster & Gnutella. Christian Schindelhauer Technical Faculty Computer-Networks and Telematics University of Freiburg

Note! The problem set consists of two parts: Part I: The problem specifications pages Part II: The answer pages

An Introduction to Peer-to-Peer Networks

PERFORMANCE OF MOBILE AD HOC NETWORKING ROUTING PROTOCOLS IN REALISTIC SCENARIOS

Multicast vs. P2P for content distribution

Inbound Load Balance. User Manual

VIA COLLAGE Deployment Guide

Principles of Network Applications. Dr. Philip Cannata

Computer Networks. Examples of network applica3ons. Applica3on Layer

Datagram-based network layer: forwarding; routing. Additional function of VCbased network layer: call setup.

Peer-to-Peer Networks Organization and Introduction 1st Week

IT4405 Computer Networks (Compulsory)

P2P: centralized directory (Napster s Approach)

A Scalable Multi-Server Cluster VoIP System

Distributed Systems. 2. Application Layer

Unit 3 - Advanced Internet Architectures

FS2You: Peer-Assisted Semi-Persistent Online Storage at a Large Scale

Information Searching Methods In P2P file-sharing systems

SE4C03: Computer Networks and Computer Security Last revised: April Name: Nicholas Lake Student Number: For: S.

Modern Multi-factor and Remote Access Technologies

Applications & Application-Layer Protocols: The Domain Name System and Peerto-Peer

Professor Yashar Ganjali Department of Computer Science University of Toronto.

Converging P2P with clouds towards advanced real time media distribution architectures.

Introduction to Computer Networks

Topics in basic DBMS course

Network Security. Vorlesung Kommunikation und Netze SS 10 E. Nett

A Link Load Balancing Solution for Multi-Homed Networks

The Value of Content Distribution Networks Mike Axelrod, Google Google Public

How to Setup and Connect to an FTP Server Using FileZilla. Part I: Setting up the server

Decentralized Peer-to-Peer Network Architecture: Gnutella and Freenet

Network Simulation Traffic, Paths and Impairment

Technical Overview Simple, Scalable, Object Storage Software

EE627 Lecture 22. Multihoming Route Control Devices

Trace Driven Analysis of the Long Term Evolution of Gnutella Peer-to-Peer Traffic

An Adaptive Load Balancing to Provide Quality of Service

DNS and P2P File Sharing

LIVE VIDEO STREAMING USING ANDROID

Communications Software. CSE 123b. CSE 123b. Spring Lecture 13: Load Balancing/Content Distribution. Networks (plus some other applications)

CDN and Traffic-structure

Wide-area Network Acceleration for the Developing World. Sunghwan Ihm (Princeton) KyoungSoo Park (KAIST) Vivek S. Pai (Princeton)

16/5-05 Datakommunikation - Jonny Pettersson, UmU 2. 16/5-05 Datakommunikation - Jonny Pettersson, UmU 4

Internet, Part 2. 1) Session Initiating Protocol (SIP) 2) Quality of Service (QoS) support. 3) Mobility aspects (terminal vs. personal mobility)

CSE 3461 / 5461: Computer Networking & Internet Technologies

Systems for Fun and Profit

Peer-to-Peer: an Enabling Technology for Next-Generation E-learning

1. The Web: HTTP; file transfer: FTP; remote login: Telnet; Network News: NNTP; SMTP.

Traditional Telephony IP Telephony Voice, Data, Video Integration. Media Convergence. IP Router Server IP Router. Video Distribution Fax

Chapter 7 outline. 7.5 providing multiple classes of service 7.6 providing QoS guarantees RTP, RTCP, SIP. 7: Multimedia Networking 7-71

Layered protocol (service) architecture

Executive Brief for Sharing Sites & Digital Content Providers. Leveraging Hybrid P2P Technology to Enhance the Customer Experience and Grow Profits

Multihoming and Multi-path Routing. CS 7260 Nick Feamster January

04201 Abstracts Collection Content Distribution Infrastructures. Dagstuhl Seminar

Week 4 / Paper 1. Open issues in Interdomain Routing: a survey

PacketStorm Communications, Inc. was founded in November 1998 by a group of engineers from the prestigious Bell Laboratories.

Review: Lecture 1 - Internet History

How To Understand The Power Of A Content Delivery Network (Cdn)

Limitations of Packet Measurement

Microsoft Office 365 vs Google Apps for Business

RTP / RTCP. Announcements. Today s Lecture. RTP Info RTP (RFC 3550) I. Final Exam study guide online. Signup for project demos

F5 Silverline DDoS Protection Onboarding: Technical Note

PEER-TO-PEER NETWORK

Peer-to-Peer Systems: "A Shared Social Network"

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

Boosting mobility performance with Multi-Path TCP

CS268 Exam Solutions. 1) End-to-End (20 pts)

Client/server and peer-to-peer models: basic concepts

Internet Traffic Evolution

Need for Signaling and Call Control

Network Address Translation (NAT)

CSCI Topics: Internet Programming Fall 2008

Hadoop and Map-Reduce. Swati Gore

Introduction, Rate and Latency

Advanced Peer to Peer Discovery and Interaction Framework

Network Level Multihoming and BGP Challenges

ITU-T Kaleidoscope 2010 Beyond the Internet? - Innovations for future networks and services

Interconnection of Heterogeneous Networks. Internetworking. Service model. Addressing Address mapping Automatic host configuration

The Challenges of Stopping Illegal Peer-to-Peer File Sharing

Transcription:

Swarms: First Class Citizens in the Future Internet Don Towsley UMass - Amherst 11

Vision Information organized in one or more swarms End hosts efficiently find, obtain information, content using simple local policies Q: what should network provide? 22

Outline r a traffic-centric history of the Internet r swarms r an architecture to support swarms r summary 33

Academic Internet: 1993 r 1993 academic traffic news email WWW ftp other 1993 44

The web: 1995 r 1993 academic traffic r 1995 web appears other news ftp WWW news email 1995 1993 55

Rise of the Web: 2000 r 1995 web shows up r 2000 almost 75% web traffic m Napster, Gnutella just starting ftp other email streaming WWW 2000 1995 66

Rise of P2P: 2004 r 2000 almost 75% web traffic r 2004 67% P2P traffic r over half BitTorrent (BT) m BT canonical swarm P2P other WWW 2004 2000 77

Swarm (BT) Tutorial (Cohen) r large file (~GBs), divided into small pieces (256KB). r peers initially given random set (50+) peers/servers peer swarm servers peer r peers fetch from 4 or more random peers/servers peer 88

Peer reselection r peer connected to four peers at a time r every 30 seconds m drops peer with lowest download rate m selects new peer (out of 50+) randomly r multi-purpose mechanism m bootstraps new peers m connectivity m balances loads among peers lower delays 99

Peer reselection r peer connected to four peers at a time r every 30 seconds m drops peer with lowest download rate m select new peer (out of 50-100) randomly r multi-purpose mechanism m bootstraps new clients m connectivity m balances loads among peers lower delays m reduces need for traffic engineering 1010

Ingredients to success of BT r multiple connections / active downloads r random peer reselection Robust to failures, changes in traffic patterns 1111

Multi-source data transport r provide requestor set of sources r requestor balances load across sources/ paths r exhibits desirable load balancing properties r BT transport mode m TCP (connection) rate 1212

Benefits of multi-source transport r capacity increases with number of sources per requestor r two sources + random resampling achieves same capacity as using many sources r BT uses 4 sources + resampling maximum network capacity many sources 3 2sources sources single source 2 sources + resampling 1313

Rise of on demand video: 2007 r P2P traffic shrinks from 67% to 37% r fraction of swarm traffic swarm fraction shrinks P2P other WWW Why? r availability of inexpensive on demand video thru WWW r 10% YouTube traffic other news -groups 2007 2004 1414

Current trends r swarm traffic growing in absolute terms r shrinking in relative terms r rest of traffic single source, path r trend more prominent in developed countries % swarm traffic ~2006 How can we reap the benefits 1515 of swarms? ti m e

YouTube model 1616

Can YouTube benefit from swarming? Click to edit Master subtitle style Without introducing BT? 1717

Joint routing/rate control Consider source-destination pair r provide session set of paths YouTube r session balances load across paths r well studied within Kelly 1818

Joint rate/route control r similar load balancing properties to multisource r increasing no. paths increases capacity, robustness r two paths with random replacement of worst as good as using all paths 1919

Netflix, YouTube: multipath 2020

but YouTube replicates 2121

can add multi-source 2222

and multi-path 2323

Multisource + multipath r capacity increases with number of m paths per session m sources per session r two sources + random source/path resampling achieves same capacity as many sources, paths r high tolerance to maximum network capacity multisource + multisource multipath two sources + resampling 2 sources two + sources multipath single srce/path source 2 sources + src/path resampling 2424

Issues r controller design r path/source selection r scheduling r locating content r where, when, what to replicate r security 2525

Controller design r falls within Kelly optimization-based framework r many designs m different fairness criteria m convergence, stability, r receiver oriented design r IETF efforts on multipath rate control 2626

Source/path selection r sources m tied to finding content defer to later r paths m multihoming, increasingly common m supported by IPv6 2727

Scheduling Q: what to obtain from which source? A: coding m divide content into chunks m fetch random linear combinations of chunks from any source m chunks solve set of linear equations m combinations precoded, coded on the fly 2828

Naming, addressing r name is the address (Jacobsen) m Named Data Networking (NDN) project (Zhang, Jacobsen) r avoids need for name-address translation r handles mobility r hierachical names m (e.g., levels of interest, replication) r route setup content search 2929

Routing functionality (Jacobsen) name content storage /utube.com/bailey.mpg/v3 data forwarding table prefix /utube.com/ 0,1 port content pending search table prefix /utube.com/bailey.mp g/v3 0 port 2 port 0 port 1 port 2 get /utube.com/bailey.mpg/ v3 data get /utube.com/bailey.mpg/ v3 3030

IP router port 0 port 1 prefix forwarding table port port 2 3131

Finding content r simple swarm-like techniques m random walks m expanding ring search r third party search engines m google m bing 3232

Random walk search r propagate query randomly through network r halt when found, or hit max. TTL Scalability in network size: r O(1) communication overhead 3333

Random walk performance Many replicas Few replicas Popular overhead O(1), search time O(1) overhead O(n), search time O(n) Unpopular overhead O(1), search time O(1) overhead O(1), search time O(n) Long delays, high overhead for less replicated content 3434

Multiple RW search r initiate k random walks r halt once content found r tune k to popularity, replication 3535

Multiple random walk performance Many replicas Few replicas Popular overhead O(1), search time O(1) overhead O(n), search time O(log2 n) Unpopular overhead O(1), search time O(1) overhead O(1), search time O(log2 n) How do we reduce overhead? 3636

Search times r couple no. copies to popularity m popular content many copies and/or r introduce breadcrumbs m entries in forwarding table 3737

Breadcrunbs f search f sear ch f f 3838

Search: research questions Replication r when? m always? randomly? r partial replication? m random coding + multisource permits this 3/4 1.5 3/4 3939

Content search: summary r swarming techniques m well defined searches m handles most searches r Google model m complex searches m added value searches m third party ($$$) r hybrids? 4040

Security r secure content not connection r convert randomization into security dynamic secrets r code so that requestor must obtain pieces from a minimum number sources 4141

A network swarm architecture End hosts m network control swarms m simple, robust, local randomized policies Networks m rich connectivity m content replication m sets of paths as needed 4242

Parting thoughts r swarms simplify network management/ control m network responsibility: rich connectivity m infrastructure to support swarms r applies to m real-time, multimedia m mobile, wireless 4343

Parting thoughts r details to work out m technology for swarms m infrastructure to support swarms m content replication, searching m coding m security r economics of swarms r rich set of PhD research problems considerable latitude for architecture 4444 and

The end Thanks and questions Slides (soon) available at http://gaia.cs.umass.edu/towsley/inria-11.pdf 4545