1 Understanding and Defending Big Data Presented to International Association of Privacy Professionals Princeton, New Jersey KnowledgeNet December 5, 2012 Mark S. Melodia, Paul Bond,
2 Understanding and Defending Big Data What is Big Data? Why is it under attack? What has the result been to date?
3 What is Big Data? Roughly 90% of all the digital data in the world has been created in the last two years. These data sets are: very large, even by modern standards, diverse, encompassing many data elements, often relating to different topics, and unstructured frequently, the data sets are really data dumps.
4 What is Big Data? Databases often includes non-traditional types of information. Data is collected and analyzed in closer-toreal-time than ever before. Companies are adopting new tools and processes.
5 What is Big Data? Better analytics have made data sets more valuable. Investors demand granular details on what information assets are held see Facebook IPO and other 2012 offerings. Per IBM s 2012 report, Analytics: The Real- World Use of Data, 75% of companies use or intend to use Big Data solutions.
6 What is Big Data? For example, when consumers drive the Ford Focus, an electric car, they generate driving data used by: Ford engineers to improve car design, Utilities to decide on choosing locations for charging stations, Traffic planners to speed up traffic flow.
7 What is Big Data? Other use cases: Health insurance plans use big data to make better coverage decisions and improve patient outcomes Epidemiologists can track the outbreak of diseases by monitoring online activity Use of Big Data can greatly improve the effectiveness of fraud detection, acting in real-time and drawing on multiple channels Studios track social media minute-by-minute as people react to movies playing on opening weekend.
8 What is Big Data? Other use cases: Improving the effectiveness of power grids by monitoring of smart meters Better tailoring advertisements to consumers: Online Over the phone (determining which upsell to try based on market segment information) And, for example, in-store (changing digital displays based on facial recognition) Better understanding customers and potential customers
9 Where Does Big Data Come From? Social media, online game worlds, and other self-expressive activity Other online activity that leaves a trail Offline activity that is tied into information streams Sensors and the Internet of Things.
10 Examples of Big Data Sources Online, Intentionally Expressive Conduct People feed into Big Data by what they do and say in online communities Through social network aggregation platforms, there is a multiplier effect of communications Social networks and game worlds capture a large amount of diverse interaction.
11 Examples of Big Data Sources Online, Intentional Use of Social and Game Networks Users don t just speak, they also: Say which brands they like Show which ideas they agree with Map out networks of friends and family Indicate social status Volunteer where they come from Make purchases Send gifts Calendar important upcoming dates in their lives.
12 Examples of Big Data Sources Online, Intentionally Expressive Conduct The wide variety of social media sites show the diversity of information captured, as well as how unstructured such information can be. Social media sites can be; General Use (e.g., Facebook, MySpace, Google+) Professional (e.g., LinkedIn) Mostly text-based (e.g., Twitter) Mostly visual (e.g., Pinterest, Instagram) Location-based (e.g., FourSquare) Question and answer (e.g., Quora) Debate (e.g., Debate.org) Online game worlds (e.g., World of Warcraft)
13 Examples of Big Data Sources Online, Intentional Use of Other Expressive Outlets In addition to participating in social media, consumers are creating more substantive, personal content: Multi-media (e.g., YouTube) Extended commentary (e.g., Tumblr, Reddit) Reviews (e.g., imdb, Amazon, goodreads, Yelp) Blogging and news aggregation.
14 Examples of Big Data Sources Conduct Not Intended to Be Expressive Internet searches (many of which are now performed through social media) Online activity: creating information stored on server logs generating information stored in cookies resulting in the user s receipt of a pixel or web beacon
15 In Addition, People Engage in Online Conduct Not Intended to Be Expressive, But Which Is Telling Purchases online can result in the creation of device prints for security Purchases online that can be processed by payment card networks Internet browsing activity can be logged Pulling up pages with social media plug-ins may tell social media sites where you are Accessing communications over other digital channels may create a trail: Cable viewing habits Streaming of online music or other multi-media
16 Offline Conduct Can Be Swept Into Big Data Location activity can be tracked with GPS-enabled devices Purchases with payment cards, or use of automatic payment devices Having a smart meter monitor energy consumption over time Filing court records or property records Walking past a monitor capable of facial recognition Driving past a monitor capable of license plate recognition Having an implanted device that sends information back to its manufacturer
17 Why is Big Data Under Attack? Anxiety Past failures of anonymization Concerns regarding how the profits of Big Data will be shared Additional concerns
18 Fear of Big Data And An Effort to Ward It Off
19 Facebook Privacy Notice Hoax This notice is a hoax Every so often, it spreads like wildfire over Facebook Even after being debunked month after month People want to believe there is a simple solution.
20 Failures of Anonymization Privacy compliance concerns can chill adoption of Big Data solutions. Privacy focuses on personal information. So to get out of the privacy box, many companies rely on anonymization or psuedonomyziation. This approach offers significant comfort, but is not fail-safe.
21 Anonymization/Psuedonymization: Offers Significant Comfort in Principle GLBA excludes aggregate information or blind data from its privacy rules HIPAA has well-developed de-identification procedures In its report, Protecting Consumer Privacy in an Era of Rapid Change, the FTC also excluded such information: as long as (1) a given data set is not reasonably identifiable, (2) the company publicly commits not to re-identify it, and (3) the company requires any downstream users of the data to keep it in de-identified form, that data will fall outside the scope of the framework.
22 AOL Records Re-Identified Based On The Content of Searches Controls can fail. AOL released 20 million search records to the academic community in pseudonymous form. Journalists and researchers quickly re-identified individual users by name, using tells in searches. The release had already been mirrored all around the world, and could not be pulled back. This privacy event prompted a class action suit against AOL that lasted from 2006 to 2010.
23 Netflix Records Re-identified Based On Public Data At Other Film Websites Netflix released pseudonymous viewing records and reviews, as part of a contest to see who can improve search algorithms. Some academics compared the Netflix data with information on the Internet Movie Database. Since many movie lovers are on both, the researchers were able to deduce to the name of many Netflix users and see their viewing habits. Like AOL s release, the result was a class action against the releasing party, and a reduced public belief in psuedonymization.
24 Touched By A Flash Cookie: Tracking Litigation Adobe Flash video works, in part, but putting a Flash cookie onto the user s computer. That Flash cookie is not deleted when the user deletes browser cookies. Ad networks quickly realized the Flash cookie could be used to identify a device uniquely, and in a way that the user would be unlikely to address. Class actions followed against ad networks, resulting in settlements. Next, plaintiffs attorneys sued both ad networks and brands who used them in the same action resulting in a quick dismissal of the brands with prejudice.
25 Touched By A Flash Cookie: Tracking Litigation A rash of class action lawsuits have been filed against national brands in Missouri and Arkansas without naming the ad networks or third party trackers as defendants at all. The Complaints allege generally that the defendants collect information wrongfully through Flash cookies, and share it with third parties. In this and other litigation, the plaintiff s class action bar considers it irrelevant whether a name is connected with any cookie-generated information: Plaintiffs assume any data subject will be re-identified in the end, by Big Data tools And Plaintiffs assert that even a device-linked identifier is enough to trigger privacy protection requirements.
26 Opposition to Big Data From A Consumer Protection Point of View Many suits have been brought by groups of individuals who feel that companies shouldn t profit from personal information. Those individuals have sought compensation for the value of their data as a marketing or analytics tool. So far, such suits for the value of personal information have not gained much ground, as described in the slides below. Despite the failure of such litigation, regulatory pressure continues unabated.
27 Additional Concerns: Web-lining The Bipartisan Congressional Privacy Caucus posed a pointed question to 12 supposed data brokers earlier this year, asking for: Detailed information of what information each collected, stored, and use, potentially extend far beyond rage, race, and sex Whether that information is used to score individuals for purposes of offering access to education, healthcare, employment, and other economic opportunities The term the Caucus used for this scoring is web-lining, with explicit comparison to the prohibited, discriminatory practice of redlining that has caused so much litigation against mortgage companies and brokers.
28 Additional Concerns: Civil Liberties Governments and private industry routinely share information and cooperate with analytic projects There can be a negative public reaction to such projects, as evidenced by the costly NSA/Wiretap litigation In a recent Supreme Court case involving the use of GPS trackers, the Court noted that the device had sent more than 2,000 pages of locational data over 28 days, a level of surveillance that was not reasonable. United States v. Antoine Jones, 132 S. Ct. 945 (2012). The volume of information at issue may start to be one index of the reasonableness of government action.
29 Additional Concerns: Rights To Access And Duty To Destroy Because big data is large, diverse, and unstructured, it is correspondingly more difficult to search. Much of the data stored still consists of icebergs blocks of data from which there is no practical way to draw intelligence. Yet, there is a growing push to give information subjects access rights to all information about them on demand. The right to access, joined with the nascent right to be forgotten, will become all the more complex to accomplish in the Big Data era.
30 What Has The Result Been To Date?: For the most part, Courts have not stymied the collection, use, and re-use of information key to Big Data Objections that consumers have a right to more directly benefit from personal information have been largely unsuccessful. But litigation continues, and regulatory attention is at an all-time high.
31 In re Jetblue Airways Corp. Privacy Litigation, 379 F.Supp.2d 299 (E.D.N.Y. 2005). In general, information maintained by a company about an individual is not considered an economic asset of the individual. In Jetblue, airline passengers sued an airline for unlawfully transferring their personal information for use in a federally-funded study on military base security. Id. at 303. The court found that there was no support for the proposition that an individual passenger's personal information has or had any compensable value in the economy at large, and hence plaintiffs stated no actual damages.
32 In re DoubleClick Inc. Privacy Litigation, 154 F.Supp.2d 497, 525 (S.D.N.Y. 2001). Courts have also resisted claims that consumers should be compensated for the value of information about their online activity. [A]lthough demographic information is valued highly the value of its collection has never been considered a economic loss to the subject. Demographic information is constantly collected on all consumers by marketers, mail-order catalogues and retailers. However, we are unaware of any court that has held the value of this collected information constitutes damage to consumers or unjust enrichment to collectors.
33 CVS Caremark and Walgreens Cases There have even been suits about the resale of properly deidentified data sets, mainly involving pharmacy-level prescription data. These, too, have been rebuffed. Under the circumstances the plaintiffs could have no reasonable expectation of being compensated for the information related to that transaction [filling a prescription], because that information carries with it no compensable value at the individual level. Steinberg v. CVS Caremark Corp., 2012 WL at *9-10 (E.D. Pa. 2012). the sale of de-identified prescription data does not carry a compensable value to consumers, and thus plaintiff had not shown that he was harmed by defendants actions. Todd Murphy v. Walgreen Corporation, et al., docketed as CU- BT-CTL in the Superior Court of California, San Diego County (Minute Order of May 9, 2012).
34 Sorrell v. IMS Health Inc. United States Supreme Court case striking down a law designed to preclude certain analysis of Big Data sets Several states passed laws limiting the use of data about the prescription-writing habits of physicians. The laws prohibited companies like IMS from buying that information to derive actionable marketing intelligence for outreach to physicians. The Supreme Court found that these laws violated the First Amendment by limiting speech in a way that hinged on the content of the speech. In so doing, it gave short shrift to claims that outreach violated physician privacy.
35 However Privacy class actions continue to be filed en masse: Many of them presupposing that personal information has economic value Many of them presupposing that all information collected will eventually be re-linked to a person The FTC continues to evidence an interest in more fully regulating major platform providers and information brokers, as well as casting a wider net with existing statutes like the FCRA and COPPA The actual implementation of Big Data analysis raises a host of legal questions contracting, tax, cross-border transfer, insurance, anti-trust as to which very little has yet been settled.
NBA Math Hoops Privacy Statement and Children s Privacy Statement Updated October 17, 2013. This Privacy Statement applies to the web sites mobile applications provided by Learn Fresh Education Co. (collectively,
How to Write a LinkedIn Profile Use LinkedIn to build a powerful personal brand and generate new business. David Adams Copyright 2015 Published by Revenue Wise, Inc. 1000 Fourth Street, Suite 595 San Rafael,
What You Need to Know Before Distributing Your Infographic Improve your audience outreach efforts by learning why and how to use the best social, owned and earned platforms available. Targeting specific
Talen Energy Corporation Website Privacy Notice Talen Energy Corporation and its affiliates (collectively referred to in this notice as Talen Energy, we, us, our and other similar pronouns), have developed
Big Issues with Big Data Jeff N. Jensen Office of Legal Affairs September 29, 2016 Big Data is Everywhere Entertainment industry uses big data to create programming to meet customer interests Brokers gather
Video Marketing for Restaurants Restaurants are Using Online Video to Reach More Consumers and Boost Brand Impact Online video is becoming ever more present in consumer life therefore businesses have started
Central Ohio s Small Business Resource Who is @Tonya Wilson? Program Coordinator for the Ohio SBDC at Columbus State Manage SBDC marketing, branding, outreach, digital media & events AAS Multimedia Production
Data Use Policy Revision 1.1 03/09/2014 Ramos M. Mays, Chief Technology Officer Table of Contents 1. Information Sources... 3 2. Information we receive... 3 3. How we use information... 4 4. How long we
Zep Inc.: Global Online Privacy Notice Effective Date: March 26, 2015 We at Zep Inc., along with our affiliates (collectively, Zep ), respect your concerns about privacy. This Global Online Privacy Notice
Database Marketing Fall 2016 Web analytics (incl real-time data) Collaborative filtering Facebook advertising Mobile marketing Slide set 8 1 Web analytics: Data Collected via the Internet Customers can
Presented by Spyros Mygdanis & Giorgos Kalaitzis About this presentation 1. Websites Elements of a successful website 2. Content 3. Internet Marketing Channels 4. Building your online strategy 1. Websites
Healthcare meets Social Networking 2014 The Good, Bad & Difficult What s going on in 2014. Pinkman Creative Paul Pinkman, owner of Paul Pinkman Creative Design 20 + years in online media development Focus
Before the Federal Trade Commission Washington, DC 20580 In the Matter of ) ) DoubleClick Inc. ) ) Complaint and Request for Injunction, Request for Investigation and for Other Relief INTRODUCTION 1. This
Online and Mobile Privacy Notice ( Privacy Notice ) Introduction This Privacy Notice applies to the operations of Cigna Global Health Benefits and its affiliated companies listed at the end of this Privacy
Big Data Analytics Empowering SME s to think and act Author: Haricharan Mylaraiah Chief Operating Officer 1320 Greenway Drive, Irving, TX 75038 Contents Executive Summary... 3 Introduction... 4 What Big
Big Data & The Law Will Landecker & John Rake Portland Eugene Salem harrang.com 800.315.4172 2014 Harrang Long Gary Rudnick P.C. Introduction Clients need data to solve problems or predict results. Attorneys
IBM Software White Paper Financial Services Smarter digital banking with big data Transform customer relationships and improve profitability 2 Smarter digital banking with big data Contents 2 Introduction
Demystifying Digital Digital Marketing 101 Mal Chia Digital Account Director @malchia @communikateetal Slides will be emailed after the session 2 Current state (and where have we come from) 15.7 million
Qualcomm Retail Solutions Inc. GIMBAL PLATFORM DIGITAL INSIGHTS INTO THE PHYSICAL WORLD The Advantages of Gimbal for Retailers, Brands and Application Developers Revision 1 November 2013 1 Table of Contents
Social media importance in Businesses Social media networks were a novelty 5 years ago, but today their importance is no longer debated. Yes, businesses have definitely realized the power of social media
General Statement Lotlinx has two separate marketing/advertising functions. Firstly, we market our customers (auto dealers) inventory to shoppers currently searching the Internet with the intent to purchase,
Technology & Applications Three Technology Must-Haves to Improve Sales Effectiveness and Boost Win Rates Executive Summary To drive sales excellence, sales professionals need to monitor their objectives,
Why is BIG Data Important? March 2012 1 Why is BIG Data Important? A Navint Partners White Paper May 2012 Why is BIG Data Important? March 2012 2 What is Big Data? Big data is a term that refers to data
Social Media And the Workplace Scott Patterson Labor and Employment Attorney Butzel Long POTENTIAL ISSUES Employee productivity Harassment and discrimination Too much information Legal issues POTENTIAL
Connecting you to the voice of the market Contacts www.decisyon.com Corporate Headquarters 795 Folsom Street, 1st Floor San Francisco, CA 94107 1 844-329-3972 European Office Viale P. L. Nervi Directional
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Social Business Presented at FSCI By Paul Barron Telling the digital story to create in-real-life profits Presentation by Paul Barron 2 Founder Digital Branding & Social Consumer Analytics Author The changing
Leading Safe Cities Setting the Standard for Safe City Projects in the United States Edge360 is a provider of Safe City solutions to State & Local governments, helping our clients ensure they have a secure,
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Chapter 2 The Elements of Internet Marketing Everyone is concerned about how to market their business in the new age of the internet. In the early years, just having a website was enough. Soon it became
DEVELOPING A SOCIAL MEDIA STRATEGY Creating a social media strategy for your business 2 April 2012 Version 1.0 Contents Contents 2 Introduction 3 Skill Level 3 Video Tutorials 3 Getting Started with Social
5 KEYS TO CREATING AWESOME CONTENT FOR SOCIAL CAMPAIGNS PRESENTED BY Satisfying the consumer s need for useful content requires an authentic, organic, and dedicated voice that leverages the reputations
Social Marketing & Liability Fred E. Karlinsky, Esq. Co-Chair, Insurance Regulatory & Transactions Practice Shareholder, Greenberg Traurig Louisiana Insurers Conference Insurance Compliance Seminar August
Inbound Digital Marketing Proposal Webfactories Prepared For Sample Company SAMPLE COMPANY Created By Richie Parsotan Webfactories firstname.lastname@example.org http://www.webfactories.biz Proposal Letter Sample
Marketing and Promoting Your Cooperative Through Social Media How social media can be a success for your housing cooperative The History of Social Media First email sent Geocities, first social network
Social Intelligence Report Adobe Digital Index Q2 2015 Key Insights Paid Social Cost per click (CPC) rates for Facebook are flat YoY while impressions fell by half and click through rates doubled 51% of
The Matrix Reloaded: Cybersecurity and Data Protection for Employers Jodi D. Taylor Why Talk About This Now? Landscape is changing Enforcement by federal and state governments on the rise Legislation on
Setting the Record Straight: Press Releases that Stand Out in the Digital Age Press releases are among the most widely used tools that PR professionals have in their tool kit, helping their clients win
The Decline of Online Privacy And the rise of personal identity management in the age of big data A uknow White Paper by Tim Woda, co founder of uknow.com, Inc. Overview There is a sense of fear and powerlessness
Privacy Statement Kelly Services, Inc. and its subsidiaries ("Kelly Services" or Kelly ) respect your privacy and we acknowledge that you have certain rights related to any personal information we collect
MARKETING AND MEDIA RELATIONS Social Media Guidelines Emporia State University Last Updated: July 09, 2011 Introduction What is Social Media? Social media consists of web-based tools used to interact with
Ad Tech for Privacy Counsel November 5, 2014 Marc M. Groman, President & CEO Network Advertising Initiative Why? What We ll Cover This Morning: Basic Client Server Transactions Maintaining State What this
2012 Legal Marketing Survey Report Greetings and welcome to the 2012 Legal Marketing Survey Report. Avvo and LexBlog have partnered to publish their first annual review of the legal marketing industry.
1 Index Subject Page Management Summary 3 Role within the organization 4 For what purposes do you use social media? 5 What social channels does the company use? 5 Does your organization use a blog? 6 How
Target Marketing 102 What s Next? What your customers say about you matters 14 % of people trust ads 78% of people trust consumer recommendations Source: Neilson Global Trust in advertising survey, 2007
ENTERTAINMENT, MEDIA & ADVERTISING MARKET RESEARCH HANDBOOK 2015-2016 Published by Richard K. Miller & Associates December 2014; 355 pages; ISBN # 9781577832010 PART I: MARKET SUMMARY 1 MARKET ASSESSMENT:
TIP SHEET HOW TO GET SOCIAL MEDIA INFLUENCERS TALKING ABOUT YOU A social media influencer is someone who talks positively about your brand and products on Twitter, Facebook, Google+, Pinterest or any other
This article first appeared in the October 2010 issue of The Corporate Counselor. Insurance Coverage In Consumer Class Actions John W. McGuinness and Justin F. Lavella The business world is an increasingly
Universal Banking Solution System Integration Consulting Business Process Outsourcing The relevance of online analytics to banking In its 2010 report on the state of online banking in the United States,
Cookies Compliance Advisory Note: this is an advisory notice that summarises the current position of the Article 29 Working Group and makes suggestions as to how organisations might practically achieve
A FROST & SULLIVAN EXECUTIVE SUMMARY Driving Relevant Business Insights From Social Media In Collaboration With: View the ondemand version of the ebroadcast: www.frost.com/insights Even the most successful
Does Big Data spell Big Trouble for Lawyers? Paul Karlzen Director HR Information & Analytics April 1, 2015 Presented to Colorado Bar Association, Communications & Technology Law Section Denver, Colorado