Big Data & Analytics @ Netflix. Paul Ellwood February 9th, 2015



Similar documents
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

The 4 Pillars of Technosoft s Big Data Practice

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Understanding Your Customer Journey by Extending Adobe Analytics with Big Data

Student Project 1 - Explorative Data Analysis with Hadoop and Spark

Sunnie Chung. Cleveland State University

Big Data Analytics Nokia

Big Data Executive Survey

The Need for Training in Big Data: Experiences and Case Studies

Integrating a Big Data Platform into Government:

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Getting Started Practical Input For Your Roadmap

Customer Case Study. Sharethrough

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Data Challenges in Telecommunications Networks and a Big Data Solution

Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate

Big Data for everyone Democratizing big data with the cloud. Steffen Krause Technical

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

How To Learn To Use Big Data

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Analytics: A Next Generation Roadmap

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

UNIFY YOUR (BIG) DATA

Why Big Data Analytics?

The Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics

This Symposium brought to you by

The Directors Cut. The power of data: What directors need to know about Big Data, analytics and the evolution of information.

Best Practices for Hadoop Data Analysis with Tableau

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Outline. What is Big data and where they come from? How we deal with Big data?

Native Connectivity to Big Data Sources in MSTR 10

BIG DATA & DATA SCIENCE

Big Data: Key Concepts The three Vs

BIG DATA TRENDS AND TECHNOLOGIES

Building a data analytics platform with Hadoop, Python and R

Michelle Wallace, Product Marketing. The Simple Data Strategy that Helped LinkedIn Boost Business- Services Revenue by 85%

Introduction to Big Data! with Apache Spark" UC#BERKELEY#

International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May ISSN BIG DATA: A New Technology

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

QUICK FACTS. Delivering a Unified Data Architecture for Sony Computer Entertainment America TEKSYSTEMS GLOBAL SERVICES CUSTOMER SUCCESS STORIES

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Building Your Big Data Team

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

ESS event: Big Data in Official Statistics

QUICK FACTS. Implementing a Big Data Solution on Behalf of a Media House TEKSYSTEMS GLOBAL SERVICES CUSTOMER SUCCESS STORIES

Ganzheitliches Datenmanagement

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Capturing Meaningful Competitive Intelligence from the Social Media Movement

Teradata Unified Big Data Architecture

Workday Big Data Analytics

Questionnaire about the skills necessary for people. working with Big Data in the Statistical Organisations

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Expedia

Reference Architecture, Requirements, Gaps, Roles

Big Data a threat or a chance?

Making Sense of Your Organization s Big Data. A Guide for Small and Medium Businesses

Kingdom Big Data & Analytics Summit 28 FEB 1 March 2016 Agenda MASTERCLASS A 28 Feb 2016

2014 STATE OF SELF-SERVICE BI REPORT

Doing Multidisciplinary Research in Data Science

actionable big data. maximum roi. Making Analytics Make Actionable Sense: PART 2

6 Steps to Faster Data Blending Using Your Data Warehouse

BIG DATA TECHNOLOGY. Hadoop Ecosystem

GROW YOUR ANALYTICS MATURITY

locuz.com Big Data Services

PARC and SAP Co-innovation: High-performance Graph Analytics for Big Data Powered by SAP HANA

Building Scalable Big Data Infrastructure Using Open Source Software. Sam William

Cisco IT Hadoop Journey

Big Data Infrastructure at Spotify

ANALYTICS CENTER LEARNING PROGRAM

Senior Business Intelligence/Engineering Analyst

Big Data Use Case: Business Analytics

5 Big Data Use Cases to Understand Your Customer Journey CUSTOMER ANALYTICS EBOOK

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Tap into Hadoop and Other No SQL Sources

Big Data Efficiencies That Will Transform Media Company Businesses

Choosing A Customer Data Platform

Using Data Mining and Machine Learning in Retail

The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn

Developing a Business Analytics Roadmap

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Transcription:

Big Data & Analytics @ Netflix Paul Ellwood February 9th, 2015

Who Am I? Director, Data Science & Engineering Also Leader, DataKind San Francisco chapter Formerly: Director, Product Analytics @ Netflix Formerly: Associate Partner, Analytics & Optimization @ Rosetta MS in Predictive Analytics from Northwestern BS in Systems & Control Eng. from CWRU

2007!!!

The Future of TV is Finally Here

Competition Winning the Moment of Truth

Stay focused and run fast

Culture of Freedom & Responsibility Context, not control Highly aligned, loosely coupled No brilliant jerks Professional sports team No time or expense policies

Netflix Expense Policy Act in Netflix s Best Interest

QA Team?

Accept things WILL break

Safety Nets

Give folks what they need

Eliminate Rules & Process

Tell vs. Ask

Conrad Gessner The modern world overwhelms people with data and this overabundance is both "confusing and harmful" to the mind.

Francis Bacon If a man will begin with certainties, he shall end in doubts; but if he will be content to begin with doubts, he shall end in certainties.

Innovation Cycle Formulate Hypothesis Productize Offline Experiment A/B Test

Analytics Teams Data Science & Engineering Science & Algorithms Business Stakeholders Consumer Insights Financial Planning & Analysis

(Mostly Cloud) Data Platform Source Source Systems Source Systems Source Systems Systems S3 Sting

Data Science & Engineering Enabling a data-driven culture

Metrics What does success look like?

Data Model How will people use it?

Data Integration How can we stitch it all together?

Data Visualization How should we present it?

Insights What does it mean?

Vertically Aligned Product Streaming Content Marketing Finance Messaging Content Delivery Content Buying Digital Customer Service Sign-up Flow Device Partners Originals TV Billing Social Streaming Supply Chain Out-of-Home Payments Search Infrastructure Consumer Device UIs Insights Rec Algos

Cross-functional Teams Horizontal Roles Skills Technologies Insights Data Scientists Data Analysts Metrics, Analysis, Data Mining, Modeling SQL, R, Python Data Visualization Data Visualization Engineers Design & Build Reports, Interactive Dashboards Microstrategy, Tableau, D3 Data Engineering Data Engineers Data Modeling, ETL Amazon S3, Hadoop, MapReduce, Pig, Python, Hive, Teradata, Redshift, SQL

Data Privacy VPPA EU DPA Judge Robert Bork (1927-2012) 1988 Video Privacy Protection Act:...A video tape service provider who knowingly discloses, to any person, personally identifiable information concerning any consumer shall be liable for - Protecting personal data is a fundamental right - Personal data can only be kept in the context of the legitimate ordinary business activities - Right to be forgotten :

Marketing City-level metrics LTV modeling Media mix modeling Pioneering use of Google s digital stack Audience Targeting Originals Marketing Real Time Bidding (RTB) Optimization Looking for a new leader!

Content Content performance on service Target price for content Catalog coverage for current customers Catalog coverage for future customers Digital Supply Chain

Product Multi-channel Messaging Sign-up flow optimization Deeper personalization of recommendations New UI designs Kids Impact of social Search

Streaming Optimizing the Streaming experience ISP content delivery performance Device partnerships

FinOps Fraud Reduction Global payment processing Gift Cards Bill on Behalf Of Talent & Recruiting Analytics Customer Service -> Insights

Open Source Analytic Tools ScorePMML Time Series Anomaly Detection coming

Questions?

APPENDIX

Star Ratings Row Order Row Ranking Over 50% of hours from homepage Evidence

Smart TV

Extract, Transform, Load (ETL) Logging (Raw) Searches/clicks/plays (Clean Fact) Sums, Avgs (Clean Aggregate) Signals (Join Aggregate) Offline Analysis & Signal Discovery:

A/B Testing: Randomly allocate users to offline strategies Schedule daily job to update model(s) data Measure changes in key metrics over time vs. production Primary: Retention, streaming hours Secondary: Hours from titles discovered in search Genie: Lipstick: Quinto:

Determine what data is good enough Expect the unexpected (edge cases) Be vigilant (everyone involved with A/B test is looking at website/ui) Teamwork highly aligned, loosely coupled Data-driven decision making

The Challenge of Choice

Personalized Recommendations

Adaptive Row Ordering

Predicted Ratings

My List

Facebook Integration

User Experience

Price Tiers

Often product decisions are made by the most passionate person in the room

Sheer volume can get in the way of making the right decisions

Core Metrics What are we trying to accomplish?

Our Core Metrics Non-member Conversion rates Rate of paying for second month Member Retention now revenue weighted Streaming hours a great proxy for retention

Hypothesis Formulation Back to that Freedom & Responsibility thing

Qualitative Studies Voice of the Customer

Data Exploration Digging deep

Time Investment Data Visualization Tool Trade-offs Custom Web Products Sting Robustness

Handling the raw data

Analytics What does it mean?

Example Requests Launch Reporting & Analysis Reach Projections Usage Patterns Viewing Patterns Feature Mining for Algorithmic Models Predictive Models

Hypothesis Evaluation

Large Scale A/B Testing Let the customer decide

Product Strat Meetings A meeting, really??