Web tracking technologies & price discrimina5on in e- commerce Nataliia Bielova INDES team

Similar documents
Web Tracking for You. Gregory Fleischer

Table of Contents. Open-Xchange Authentication & Session Handling. 1.Introduction...3

Hack Yourself First. Troy troyhunt.com

Is preventing browser fingerprinting a lost cause?

A more detailed list of types of fingerprinting is included below. A similar definition is provided by [[RFC6973]].

The Web: some jargon. User agent for Web is called a browser: Web page: Most Web pages consist of: Server for Web is called Web server:

Privacy leakage vs. Protection measures: the growing disconnect

Checking Browser Settings, and Basic System Requirements for QuestionPoint

CIS 551 / TCOM 401 Computer and Network Security. Spring 2007 Lecture 20

Cookie Same Origin Policy

Demystifying cache. Kristian Lyngstøl Product Specialist Varnish Software AS

WompMobile Technical FAQ

Crying Wolf? On the Price Discrimination of Online Airline Tickets

1. When will an IP process drop a datagram? 2. When will an IP process fragment a datagram? 3. When will a TCP process drop a segment?

Leonardo Hotels Group Page 1

Load testing with. WAPT Cloud. Quick Start Guide

A guide to affilinet s tracking technology

P&WC Portal Settings. 1) Portal Language Setting:

Network Technologies

The Risks of Client-Side Data Storage From cookie to database

Weird New Tricks for Browser Fingerprinting. yan ToorCon 2015

Protocolo HTTP. Web and HTTP. HTTP overview. HTTP overview

CSE 135 Server Side Web Languages Lecture # 7. State and Session Management

HTTP. Internet Engineering. Fall Bahador Bakhshi CE & IT Department, Amirkabir University of Technology

1.5.5 Cookie Analysis

CS 188/219. Scalable Internet Services Andrew Mutz October 8, 2015

Web24 Supported Software

How To Test Your Web Site On Wapt On A Pc Or Mac Or Mac (Or Mac) On A Mac Or Ipad Or Ipa (Or Ipa) On Pc Or Ipam (Or Pc Or Pc) On An Ip

Checking IE Settings, and Basic System Requirements for QuestionPoint

Host Fingerprinting and Tracking on the Web: Privacy and Security Implications

RMFT Web Client User Guide

CONTENT of this CHAPTER

Lifesize Cloud Privacy Statement

Internet Technologies. World Wide Web (WWW) Proxy Server Network Address Translator (NAT)

Contents Using Jive Anywhere

Obfuscation for and against device fingerprinting Position Paper for Symposium on Obfuscation New York University, February 15, 2014

An Insight into Cookie Security

Outline Definition of Webserver HTTP Static is no fun Software SSL. Webserver. in a nutshell. Sebastian Hollizeck. June, the 4 th 2013

Working With Virtual Hosts on Pramati Server

Using Webmail. Document Updated: 9/08. Technical Manual: User Guide. The Webmail Window. Displaying and Hiding the Full Header.

HTML5. Eoin Keary CTO BCC Risk Advisory.

Preparing for the Cross Site Request Forgery Defense

Student ANGEL FAQs. General Issues: System Requirements:

New World Construction FTP service User Guide

Webmail User Guide. The Webmail Window. Logging In to Webmail. Displaying and Hiding the Full Header. Printing an . Composing a New

MAXA-COOKIE-MANAGER - USER MANUAL - SW-Release V 5.0 / Document Rev. 1.1

Index. AdWords, 182 AJAX Cart, 129 Attribution, 174

Introduction to the AirWatch Browser Guide

Downloading and Installing Interactive Reporting (Hyperion) Web Client

Web application security

Error Code Quick Reference Guide Updated 01/28/2015

Ad Tech for Privacy Counsel November 5, Marc M. Groman, President & CEO Network Advertising Initiative

CEFNS Web Hosting a Guide for CS212

IMRG Peermap API Documentation V 5.0

Login with Amazon. Getting Started Guide for Websites. Version 1.0

MassTransit 6.0 Enterprise Web Configuration for Macintosh OS 10.5 Server

Instructions for Configuring Your Browser Settings and Online Security FAQ s. ios8 Settings for iphone and ipad app

BUSINESS CHICKS, INC. Privacy Policy

CS640: Introduction to Computer Networks. Applications FTP: The File Transfer Protocol

Novell Filr 1.0.x Mobile App Quick Start

Check list for web developers

CTIS 256 Web Technologies II. Week # 1 Serkan GENÇ

HSBCnet FX AND MM TRADING. Troubleshooting and Minimum System Requirements

Using Webmail. Document Updated: 11/10. Technical Manual: User Guide. The Webmail Window. Logging In to Webmail. Displaying and Hiding the Full Header

A Practical Attack to De Anonymize Social Network Users

Frequently Asked Questions for logging in to Online Banking

Outlook Data File navigate to the PST file that you want to open, select it and choose OK. The file will now appear as a folder in Outlook.

DAY OF TESTING GUIDE. Contact Information. Websites. Please do not bookmark the Proctor Interface or the Student Interface.

Web Conferencing Version 8.3 Troubleshooting Guide

KUB Website Troubleshooting

Adobe Marketing Cloud Marketing Cloud Cookies

CGI Programming. Examples

Using Foundstone CookieDigger to Analyze Web Session Management

Frequently Asked Questions for the USA TODAY e-newspaper

Eucalyptus User Console Guide

Protecting Browser State from Web Privacy Attacks. Collin Jackson, Andrew Bortz, Dan Boneh, John Mitchell Stanford University

Relax Everybody: HTML5 Is Securer Than You Think

How-to setup a proxy in the cloud

DNS REBINDING DENIS BARANOV, POSITIVE TECHNOLOGIES

Optimal Browser Settings for Internet Explorer Running on Microsoft Windows

1 Introduction: Network Applications

Acunetix Website Audit. 5 November, Developer Report. Generated by Acunetix WVS Reporter (v8.0 Build )

Investment Management System. Connectivity Guide. IMS Connectivity Guide Page 1 of 11

DDoS Protecion Total AnnihilationD. DDoS Mitigation Lab

WatchGuard QMS End User Guide

COMP 112 Assignment 1: HTTP Servers

tpischeduler tpischeduler TotalFBO tpischeduler TotalFBO Initial Installation tpischeduler TotalFBO tpischeduler

Website Optimization Tips for Speed

The Hyper-Text Transfer Protocol (HTTP)

Transcription:

Web tracking technologies & price discrimina5on in e- commerce Nataliia Bielova INDES team 11 February 2016

Back in 1993 2

Today 3

Today Web Tracking google- analyhcs.com doubleclick.com Bigger browsing profiles = increased value for trackers = reduced privacy for users quantserve.com (HypotheHcal tracking relahonships only.) Courtesy of Franziska Roesner 4

AdverHsement Ad network AdverHsers Ad space of doubleclick doubleclick.com Real- Hme- bidding (RTB) 5

Price discriminahon 6

More evidence 7

In this talk Web Tracking How does it work? How can you protect yourself from being tracked? Is it legal? Price discriminahon Airline Hckets study How to get a bever price? Is it legal? 8

How does Web Tracking work? 9

HTTP protocol is stateless Web browser URL path: bbc.co.uk/news Parameters Method: GET HTTP request Web server HTTP response Status: 200 OK Content: HTML page Set- cookies: session- id=2082787201l & - Rendering pages - ExecuHng scripts/plugins (JavaScript) - Launching new HTTP requests - Mapping requests to apps - ContacHng DBs - ConstrucHng responses 10

HTTP protocol is stateless Web browser URL path: bbc.co.uk/news Parameters Method: GET HTTP request Web server Cookie Database bbc.co.uk/news: session-id=2082787201l HTTP response Status: 200 OK Content: HTML page Set- cookies: session- id=2082787201l & 11

HTTP protocol is stateless Web browser URL path: bbc.co.uk/news... Method: GET Cookies: session- id=2082787201l & HTTP request Web server Cookie Database bbc.co.uk/news: session-id=2082787201l 12

Mechanisms Required By Trackers Ability to store/create user idenhty in the browser HTTP cookies HTTP headers browser storages device fingerprinHng: browser properhes OS properhes IP address Stateful tracking Stateless tracking Ability to communicate user idenhty back to tracker HTTP request headers JavaScript J. Mayer, J. Mitchell Third- party web tbracking: Policy and Technology IEEE SSP 12 Nataliia ielova 13

Stateful tracking TRACKING VIA COOKIES 14

Within- Site Tracking First- party cookies are used to track repeat visits to a site. hvp://site1.com google- analyhcs.com Cookie Database site1.com: ga_id=123 <script src=googleanalytics.com/ script.js> </src> script googleanalytics.com/track? ga_id=123& site=site1.com processing engine 2:52pm: user 123 visited site1.com logs Based on the slide of Franziska Roesner 15

Within- Site Tracking First- party cookies are used to track repeat visits to a site. hvp://site1.com google- analyhcs.com Cookie Database site1.com: ga_id=123 <script src=googleanalytics.com/ script.js> </src> script googleanalytics.com/track? ga_id=123& site=site1.com processing engine 2:52pm: user 123 visited site1.com logs Based on the slide of Franziska Roesner 16

Cookie stealing Access cookies: document.cookie Script that sends cookies script // google-analytics.com/script.js var url = http://google-analytics.com/track?ga_id= + encodeuri(document.cookie) + &site= + encodeuri(document.location); document.write('<img src=' + url + '/>'); 17

First- party cookie selng 18

First- party cookies benefits Keep the session through different windows/tabs Website owners can evaluate website stahshcs popularity of certain pages popularity of links selected and copied phrases 19

Origin Cookies: first- & third- party Origin ny5mes.com b.scorecardresearch.com nyhmes.com <script src=facebook.com> First- party can read/write cookies of ny5mes.com </src> JavaScript 1 <iframe src=b.scorecardresearch.com> Html page + JavaScript 2 Third- party can read/write cookies of b.scorecardresearch.com </iframe> Nataliia Bielova 20

Cross- site Tracking Third- party cookies are used by trackers included in other sites to create profiles. tracker.com hvp://site1.com hvp://site2.com Cookie Database tracker.com: id=789 cookie: id=789 processing engine <iframe src=tracker.com/ ad.html> ad cookie: id=789 9:30am: user 789 visited site1.com 9:31am: user 789 visited site2.com logs </iframe> Roesner etal DetecHng and Defending Against Third- Party Tracking on the Web NSDI 2012 Courtesy of Franziska Roesner 21

PracHcal protechon: Third- party cookies blocking 22

PracHcal protechon: Third- party cookies blocking Does not influence your browsing experience Does not adjust adverhsements for you So why are third- party cookies s5ll there? It s a business of adverhsement companies 23

Personal Cross- Site Tracking hvp://facebook.com facebook.com processing engine Cookie Database facebook.com: fb_id=522 First- party site hvp://site.com <iframe src=faceboook.com /button.html> cookie: fb_id=522 logs 2:34pm: user 522 visited facebook.com 2:35pm: user 522 visited site.com </iframe> 24

Third- party cookie blocking problem Important detail: In most browsers, third- party cookie blocking ophon doesn t block sending the cookies Privacy problems: If a tracker can ever set a cookie, third- party cookie blocking is rendered ineffec5ve. The user can be tracked just because a site she visits contains a social buton 25

ShareMeNot Cookie Database facebook.com: fb_id=522 hvp://facebook.com First- party site hvp://site.com <iframe src=faceboook.com /button.html> facebook.com processing engine logs 2:34pm: user 522 visited facebook.com 2:35pm: user 522 visited site.com </iframe> Now is a part of Privacy Badger hvps://www.eff.org/privacybadger 26

Cookie respawning Cookies can respawn even if the user has deleted them KISSmetrics and Hulu.com lawsuits HTML5 localstorage (across sessions only) Flash LSOs (across sessions and web browsers) HTTP headers: Etag, LastModified A. Soltani hvp://ashkansoltani.org/2011/08/11/respawn- redux- flash- cookies/ August 2011 27

Respawning - local storages KissMetrics lawsuit: HTML5 localstorage (across sessions) User leaves the page tracker.com hvp://site1.com Cookie Database tracker.com: id=789 <iframe src=tracker.com/ ad.html> ad cookie: id=789 processing engine 9:30am: user 789 visited site1.com HTML5 localstorage </iframe> logs tracker.com: id=789 28

Respawning - local storages KissMetrics lawsuit: HTML5 localstorage (across sessions) User deletes all the cookies! tracker.com Cookie Database tracker.com: id=789 HTML5 localstorage hvp://site2.com <iframe src=tracker.com/ ad.html> ad </iframe> cookie: id=789 processing engine 9:30am: user 789 visited site1.com 9:31am: user 789 visited site2.com logs tracker.com: id=789 29

Respawning via Etag header Etag header is a caching mechanism GET: A.com A.com Resource cache A.com: token = x234dff (html body) 200 OK Etag: x234dff (html body) GET: A.com If- None- Match: x234dff 304 Not Modified Etag: x234dff A.com has not changed 30

Respawning via Etag header INITIAL REQUEST HEADER: GET /i.js HTTP/1.1 Host: i.kissmetrics.com INITIAL RESPONSE HEADER: Etag: "Z9iGGN1n1-zeVqbgzrlKkl39hiY" Expires: Sun, 12 Dec 2038 01:19:31 GMT Last-Modified: Wed, 27 Jul 2011 00:19:31 GMT Set-Cookie: _km_cid=z9iggn1n1-zevqbgzrlkkl39hiy; expires=sun, 12 Dec 2038 01:19:31 GMT;path=/; SUBSEQUENT REQUEST HEADER (PRIVATE BROWSING MODE WITH ALL COOKIES BLOCKED): GET /i.js HTTP/1.1 Host: i.kissmetrics.com If-None-Match: "Z9iGGN1n1-zeVqbgzrlKkl39hiY" KissMetrics lawsuit, August 2011 31

Not Respawning, but Tracking Important detail: If Etag header, HTML5 localstorage, or Flash LSO didn t store a copy of cookies => tracking would not be detected! Privacy problem: All of these storages can be used for tracking without cookies 32

ProtecHon from stateful tracking Browser selng: block third- party cookies Protects from tracking (purely) via cookies Does not protect from cookie respawning Does not protect from tracking via other storages Browser extension: block scripts/requests only from known adver5sement/tracking companies Does not protect from tracking by other companies Does not protect form tracking by the main website 33

Research soluhons Dynamic Informa5on Flow Control Monitors JavaScript to prevent cookie stealing Strong formal guarantee secret data sources (cookies) do not flow into public data sinks (servers, storages) Several implementahons: Enhanced web browser FlowFox [De Groef et al. CCS 12] FireFox plugin ZaphodFacets [AusHn & Flanagan POPL 12] 34

Do- not- track and EU eprivacy direchve 35

Do- Not- Track (DNT) 36

Do- Not- Track (DNT) Tracking preference expression New HTTP request header DNT:1 OpHonal HTTP response header Tk:1 (server is compliant) How the web servers should enforce DNT? do- not- track à do- not- target do not target the users based on collected data but shll allow data to be collected Did anything actually change? IE 10 adds DNT:1 by default, Yahoo! and Apache ignore it. 37

EU eprivacy DirecHve 95/46 w.r.t. Stateful tracking Actual Regula5on 2002/58/EC: users should be able to refuse to have info stored in their browser Interpreta5on EU states: users can change their cookie seyngs 2009/136/EC: users should give a consent to have info stored in their browser Some EU states: cookie selng is an implicit consent Most of other EU states: no, we need other standard with explicit consent 38

Thanks to EU eprivacy DirecHve 39

Stateless Web Tracking WEB- BASED DEVICE FINGERPRINTING 40

Stateless tracking Web- based device fingerprin5ng Allows track users without the need of cookies of any other stateful client- side idenhfier Hidden from users Hard to avoid/opt- out PanopHclick demonstrates Certain avributes of your browser environment can be used to accurately track you Based on a slide of Nick Nikiforakis 41

ProperHes fingerprinted by PanopHclick PanopHclick hvps://panophclick.eff.org 42

ResulHng fingerprints Browser property User Agent (browser name and version, OS version, etc) HTTP_ACCEPT header Browser plugin details Time zone Screen size and color depth System fonts Cookies enabled? Supercookies test Source HTTP JavaScript HTTP JavaScript JavaScript JavaScript Flash/Java HTTP JavaScript JavaScript 83.6% of users could be uniquely idenhfied 94.2% of users with Flash/Java could be uniquely idenhfied Plugins and fonts are the most idenhfying metrics! Nataliia Bielova 46

Very hard to opt- out Even if you delete all the cookies you clean all the storages (HTML5, Flash) you use browser private mode your fingerprint remains the same! 47

Browser extensions are worse than nothing Acar et al. reviewed 11 different browser extensions that spoof a browser s user- agent All of them had one or more of the following: Incomplete coverage of the navigator object Impossible configurahons Mismatch between UA header and UA property Problem: When installing these, a user becomes more visible and more fingerprintable than before Courtesy f Nick Nikiforakis Acar et al. FPDetecHve: DusHng the Woeb for Fingerprinters, CCS 13 48

Browser extensions are worse than nothing Acar et Fingerprintable a l. r eviewed 11 different browser extensions Surface that spoof a browser s user- agent Extension_A All of them had one or m ore of the following: Incomplete coverage of the navigator object Impossible configurahons Mismatch between UA header and UA property Extension_C Extension_B Problem: installing these, a user becomes more visible and When more fingerprintable than before Courtesy f Nick Nikiforakis Acar et al. FPDetecHve: DusHng the Woeb for Fingerprinters, CCS 13 49

In this talk Web Tracking How does it work? How can you protect yourself from being tracked? Is it legal? Price discriminahon Airline Hckets study How to get a bever price? Is it legal? 50

Price DiscriminaHon in e- commerce 51

52

53

What is price discriminahon? SituaHon where two consumers are charged differently for the same product Based on how much they are able/willing to pay Not possible in real life, but possible on the internet! 54

Why price discriminahon is possible? Massive amounts of user- data gathered on the web is natural fit to determine how much the user is willing to pay Is the user a big spender? Has the user searched for this before? Is the user achvely comparing products (potenhally cross- site)? How bad does he want/need the product? 55

How does it work? Main discrimina5ve factors: Geographical locahon IP address Device and OS UserAgent HTTP header, JavaScript Previous search queries 1 st - party tracking (cookies) Consumer profile (willingness to spend) 3 rd - party tracking (cookies, storages, fingerprinhng) 56

Does the EU law protect us? The EU law is prohibits Different treatment of two clients based on their na5onality and residence. DiscriminaHon based on gender, religion or race DirecHve 2006/123/EC of the European Parliament and of the Council of 12 December 2006 on services in the internal market 57

58

Does the EU law protect us? The EU law is prohibits Different treatment of two clients based on their na5onality and residence. DiscriminaHon based on gender, religion or race But nothing prohibits discrimina5on based on device, search history or cookies! DirecHve 2006/123/EC of the European Parliament and of the Council of 12 December 2006 on services in the internal market 59

General retailers study Real- user study based on plugin DiscriminaHve factors: country + device Results: Amazon, Staples: different price for different country Shoplet: smaller price for ``budget profiles (based on cookies) $heriff plugin compares prices for you Mikians, GyarmaH, Erramilli, Laoutaris Detec?ng Price and Search Discrimina?on in the Internet" in Proc. of ACM HotNets'12. 60

Airline Hckets study Vissers, Nikiforakis, Bielova, Joosen. Crying Wolf? On the Price DiscriminaHon of 61 Online Airline Tickets. Hot Topics in Privacy Enhancing Technologies (HotPETs 2014)

DiscriminaHve factors: Device and OS Hypothesis Consumer profile (willingness to spend) Previous search queries Geographical locahon Hypothesis If airlines return different prices according to we should observe a systema?c price difference between emulated users who differ in those characteris?cs. 62

Large- scale analysis: Overview Numbers: 25 airlines, twice per day 3 weeks 66 user profiles 2 geographical locahons 130,000+ search queries How? A web scraper Scrape all airlines with all different user profiles EmulaHng real user interachons CasperJS, PhantomJS 63

Large- scale analysis: User profiles Browser and OS profiles User- Agent string and JS navigator object IE 9 on Win7, Safari 6 on OSX 10.7, Consumer profiles Gathering cookies from relevant websites Affluent, Budget and Flight Comparer Cookie selng profiles AccepHng/Blocking certain cookies own cookies, with(out) consumer profile cookies, no third- party cookies Geographical profile Scraper in runs in parallel on different locahons New York and Leuven 64

Results No consistent price discriminahon based on the analyzed used profiles! Analyzed top- 25 companies Dataset: hvp://people.cs.kuleuven.be/~thomas.vissers/data/price_discriminahon.zip 65

General retailers, hotel and car rental study Large- scale study: 16 websites Web scraper + Real user tests real users pay more than machines! Results: Expedia and Hotels.com steered users toward more expensive hotels based on search history Priceline: users who clicked on or reserved low- price hotel rooms received slightly different results in a different order Hannak, Soeller, Lazer, Mislove, Wilson Measuring Price Discrimina?on and Steering 66 on E- commerce Web Sites, ACM/USENIX IMC 14

How to avoid discriminahon? Block 3 rd party cookies: Not always effechve Does block other tracking mechanisms Mask IP address: Tor browser is effechve but slow Privoxy proxy may be effechve, but risky? 67

Summary Web Tracking Stateful and stateless: cookies, storages, fingerprinhng No effechve protechon from all EU law: user consent Price discriminahon Studies: Airline Hckets, General retailers, Hotels and cars Turn off some tracking, use $eriff to find a bever price EU law: only prohibits discriminahon based on geolocahon and nahonality 68