The Other 50% of Internet Traffic. Craig Labovitz labovit@deepfield.net

Similar documents
Internet Traffic Evolution

Internet Traffic and Content Consolidation

Craig Labovitz, Scott Iekel-Johnson, Danny McPherson Arbor Networks Jon Oberheide, Farnam Jahanian University of Michigan

ATLAS Internet Observatory 2009 Annual Report

ATLAS Internet Observatory Bandwidth Bandwagon: An ISOC Briefing Panel November 11, 2009, Hiroshima, Japan

Measuring the Web: Part I - - Content Delivery Networks. Prof. Anja Feldmann, Ph.D. Dr. Ramin Khalili Georgios Smaragdakis, PhD

Trends in Internet Traffic Patterns Darren Anstee, EMEA Solutions Architect

Internet Inter-Domain Traffic

CDN and Traffic-structure

DNSFlow- Enabling netflowlike telemetry for DNS

IPv6 The Big Picture. Rob Evans, Janet

Internet Traffic Trends A View from 67 ISPs

MarkScan. India New Delhi. Digital Content Protection Movie Case Study

DNS, CDNs Weds March Lecture 13. What is the relationship between a domain name (e.g., youtube.com) and an IP address?

Three short case studies

How the Netflix ISP Speed Index Documents Netflix Congestion Problems

Current Trends in IXPs, Hosting, and Cloud Computing

Cisco Videoscape Distribution Suite Service Broker

Analysing the impact of CDN based service delivery on traffic engineering

Why VoIP Peer to Peer and Social Networking providers cannot ignore Legacy Telecoms! Or why Peer to Peer VoIP needs Voip-Pal IP to succeed!

Uncovering the Big Players of the Web

VPN Configuration Guide. Linksys (Belkin) LRT214 / LRT224 Gigabit VPN Router

The Internet Peering Playbook: Connecting to the Core of the Internet Second Edition Written by William B. Norton

The forces behind the changing Internet: IXPs, content delivery, and virtualization

Data Center Content Delivery Network

Symantec RuleSpace Data Sheet

The Challenges of Stopping Illegal Peer-to-Peer File Sharing

How using Google Analytics can improve your website performance and social campaigns

How Router Technology Shapes Inter-Cloud Computing Service Architecture for The Future Internet

Lab Diagramming External Traffic Flows


Regional Interconnection Strategy for Africa. Regional Peering and Interconnection Economics

The Role IXPs and Peering Play in the Evolution of the Internet

It s stated goal is to give people the power to share and make the world more open and connected.

Executive Summary. Internet Traffic and Capacity

CFP Broadband Working Group: Broadband Usage Cost Recovery

Distributed Systems. 25. Content Delivery Networks (CDN) 2014 Paul Krzyzanowski. Rutgers University. Fall 2014

Decoding DNS data. Using DNS traffic analysis to identify cyber security threats, server misconfigurations and software bugs

Peer-to-peer filetransfer protocols and IPv6. János Mohácsi NIIF/HUNGARNET TF-NGN meeting, 1/Oct/2004

SE4C03: Computer Networks and Computer Security Last revised: April Name: Nicholas Lake Student Number: For: S.

Distributed Systems. 23. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2015

Session 2 Generating Value from 'Big Data' Mark T. Bain

VPN Configuration Guide WatchGuard Fireware XTM

Internet Peering as a Cloud enabler for Enterprises

LiveAction Visualization, Management, and Control for Cisco IWAN Overview

Teach Yourself VISUALLY Search Engine Optimization (SEO)

How To Improve Interconnection

SEO for Profit. A Wordtracker Masterclass in search engine optimization. Mark Nunney

The Role and uses of Peer-to-Peer in file-sharing. Computer Communication & Distributed Systems EDA 390

Netflix Open Connect Network. PTT Forum December 2012 Flavio Amaral (South America Network Strategy)


Copyright Infringement and Enforcement in the US

Adaptive HTTP streaming and HTML5. 1 Introduction. 1.1 Netflix background. 1.2 The need for standards. W3C Web and TV Workshop, 8-9 February 2011

What is. LDeX MEDIA PLATFORM?

Web Hosting 101. with Patrick McNeil

How To Detect An Advanced Persistent Threat Through Big Data And Network Analysis

The State Of The CDN Market: Video Pricing, Contract, Volume and Market Sizing Trends

Should Internet Service Providers Fear Peer-Assisted Content Distribution?

Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall.

SEO Basics for Starters

VPN Configuration Guide. Dell SonicWALL

Measure. Analyze. Optimize. Search Engine Optimization. Prepared for: Onstar Pest Control. Date: March 30th, 2015

Indirection. science can be solved by adding another level of indirection" -- Butler Lampson. "Every problem in computer

The Broadband Service Optimization Handbook Chapter 3

Pushing CDN-ISP Collaboration to the Limit

Visualization, Management, and Control for Cisco IWAN

Whose IP Is It Anyways: Tales of IP Reputation Failures

Software Defined Networks

MC-FTP (Multicast File Transfer Protocol): Implementation and Comparison with

Pushing CDN-ISP Collaboration to the Limit

TELEGEOGRAPHY REPORT EXECUTIVE SUMMARY. Executive Summary

Getting Started with PRTG Network Monitor 2012 Paessler AG

Cloud Security: Evaluating Risks within IAAS/PAAS/SAAS

A complete website solution for ownership and hosted on your own server.

Transcription:

The Other 50% of Internet Traffic Craig Labovitz labovit@deepfield.net 1

Earlier In previous work, focused on HyperGiants 50% of Internet traffic due to 150 companies i.e. Google, MSFT, CDNs, large consumer This talk looks at the other 50%... 2

Mysteries Earlier work raised number of questions Easy to understand role of Google, Facebook, etc. But harder to understand role of a handful of companies that kept showing up in the data 3

Mysteries Same ASN show up again and again AS16265 AS16276 AS20473 AS39572 and more Same CIDR ranges and, often, same data centers (or at least traceroute paths) Massive share of global Internet traffic Asking around, many theories, but no one completely sure 4

Mysteries Same ASN show up again and again AS16265 AS16276 AS20473 AS39572 and more Same CIDR ranges and, often, same data centers (or at least traceroute paths) Massive share of global Internet traffic Asking around, many theories, but no one completely sure (or at least had any data) 5

Mysteries So, where are these terabytes of traffic going? Services usually behind URL load balancer Netflow provides no clues... Other tools come up empty (whois, curl, DNS) Intentional service obfuscation (e.g. short lived DNS) Sometimes mocking You are too stupid. This site is not for you. 6

Some Words about the Data Different data than earlier work Population discovery algorithms combined with passive / active DNS, large scale crawling, flow, third-party public / licensed data sources, some machine learning and a bit of cleverness Plus large numbers of VMs around the word And a few months of big data crunching 7

About this Work This work a best-effort (really a side / sparetime research project) Data is still preliminary, but is reasonably representative and covers very large datasets... And finally, focus of work is on the infrastructure and economics (not the content, morality, legality, etc.) Complete report / research paper in progress 8

Expectations Conventional wisdom* is that adult sites, P2P (trackers and seed) and file sharing are distributed across a huge swath of the Internet. Basically everywhere. * as explained by Slashdot 9

Expectations Conventional wisdom* is that adult sites, P2P (trackers and seed) and file sharing are distributed across a huge swath of the Internet. Basically everywhere. No. * as explained by Slashdot 10

This Talk A brief tour of a small number of companies that play a massive role in Internet traffic Specifically, File Sharing, P2P, and Adult Particular focus on how this infrastructure is built and the business models that support it 11

File Sharing First have to define what we mean All file storage sites (well, except MegaUpload) look identical... Similar graphics and sales messaging Similar DMCA notices and warnings So we define file sharing sites by industry self-classification 12

Industry Self-Classification Many dozens of pay-for-link sites offer the same set of file locker sites And generally (never) do not include dropbox, box.net, etc. 13

File Sharing Distinct ecosystems each with unique infrastructure, advertising, payment and analytics partners Multiple different business models and niche target markets Pay for search ($10 / month) Pay for storage ($10 / month) Advertisement supported 14

File Sharing Averages 5-10% of all consumer traffic And 1-2% of all consumers Peak percentage of active subscribers using file sharing service in a one hour period Similar traffic patterns to P2P i.e. highly diurnal and peaks around 1am local time 15

FileSharing (January 18, 2012) Lines are proportional to traffic percentages 16

FileSharing (January 19, 2012) Lines are proportional to traffic percentages 17

File Sharing Topology Hundreds of distinct domain names (though many actually the same site and owned by same organization) 10 of these sites contribute 70% of all Internet file sharing traffic And 4 colo / hosting companies (across ~8 locations) contribute 85% of all file sharing traffic Megavideo (Carpathia / Leaseweb) quickly replaced (with Putlocker a big winner) 18

Adult DRM / billing / control uses specialized hosting Almost traffic comes from CDNs Adult sites share red-light neighborhood Hosting similarly segment infrastructure Several small CDNs specialize in only adult content (e.g. Swift) Some CDNs have decided porn is not part of their business (e.g. Akamai) 19

Adult Others cater to the market with specialized pricing, SLAs and marketing This means we do not ask what you stream nor do we judge what you stream, how you stream or when you stream. Specialized hosting, payment and advertising 20

Internet Adult Traffic Lines are proportional to traffic percentages 21

The Rise and Fall (and Rise) of P2P P2P used to be mainly between dorm and consumer PCs Slow, unreliable, throttled by ISPs And home IPs pursued by RIAA agents So P2P declined as a percentage of Internet traffic (see my last talks)... 22

P2P Today But then came the cloud (and HD movies) Enter hosted cloud seed boxes GigE interfaces (great ratios) No throttling Latest torrent code pre-installed Conveniently located outside US (and RIAA) jurisdiction Still small percentage of P2P (1-5%), but growing rapidly 23

P2P Infrastructure Seedbox / Tracker Hosting Percentages OVH LeaseWeb Softlayer SingleHop FDCServers 0 3.75 7.5 11.25 15 http://torrent-invites.com/seedbox-discussions/182440-ovh-vs-softlayer.html 24

A Very Brief Tour Large Russian File Sharing Hosting Provider 25

A Very Brief Tour Large UK File Sharing Hosting Provider 26

Final Thoughts File Sharing, P2P and Adult consume large proportion of Internet traffic (up to 30%) Tendency towards centralized / common infrastructure network effect specialized market needs targeted sales strategy P2P and file sharing staggeringly inefficient (US -> Europe) Dynamic market with shifting demands and potentially significant impact on carrier peering / IXP decisions 27

Questions? labovit@deepfield.net 28