Demystifying Data Science and Big Data. Natasha Balac, Ph.D. September, 2016

Similar documents
Demystifying The Data Scientist

Challenges of Analytics

Elke Rundensteiner

Introduction To Predictive Analytics And Data Mining!

Proposal for New Program: Minor in Data Science: Computational Analytics

Data Mining and Analytics in Realizeit

Why your business decisions still rely more on gut feel than data driven insights.

Statistics Meets Big Data 統 計 遇 見 大 數 據

SASA: Strategic planning for the future

UNDERSTAND YOUR CLIENTS BETTER WITH DATA How Data-Driven Decision Making Improves the Way Advisors Do Business

Chapter 2 Big Data Panel at SIGDSS Pre-ICIS Conference 2013: A Swiss-Army Knife? The Profile of a Data Scientist

PHPM 672 Data Science for HSR PHPM 677 Data Science in Public Health

Predictive Analytics Certificate Program

Worldwide Advanced and Predictive Analytics Software Market Shares, 2014: The Rise of the Long Tail

CONNECTING DATA WITH BUSINESS

The Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Extend your analytic capabilities with SAP Predictive Analysis

I D C T E C H N O L O G Y S P O T L I G H T

Cutting Through The Hype: What You Need To Know About Big Data

BIG DATA WITHIN THE LARGE ENTERPRISE 9/19/2013. Navigating Implementation and Governance

Strategies For Setting Up Your Organisation For Success With Big Data. Kevin Long Business Development Director Teradata

Keynote Speaker. Jay Schnitzer, MD, PhD MITRE Corporation Director Biomedical Sciences

Analytics Centre of Excellence: Roles, Responsibilities and Challenges

Understanding Big Data Analytics for Research

LARGE-SCALE DATA-DRIVEN DECISION- MAKING: THE NEXT REVOLUTION FOR TRADITIONAL INDUSTRIES

An Overview of Predictive Analytics for Practitioners. Dean Abbott, Abbott Analytics

FIVE YEAR REVIEWS OF HEALTH SCIENCES ORGANIZED RESEARCH UNITS UNIVERSITY OF CALIFORNIA, SAN DIEGO Supplement to UCSD ORU Policy & Procedures, May 2010

Fundamental Statistical Analysis for a Future with Big Data

PACE Predictive Analytics Center of San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.

Multichannel Customer Listening and Social Media Analytics

Community Partnerships Strategic Plan

2015 Global Identity and Access Management (IAM) Market Leadership Award

Jay Buckingham Dynamic Signal

SMART MINDS + SMART CITIES

Statistics for BIG data

Strategy Consulting at Accenture

STEM Mathematics Major:

Empowering the Masses with Analytics

Smarter Analytics. Barbara Cain. Driving Value from Big Data

THE KEY TO EXECUTIVE DECISIONS!

Real World Application and Usage of IBM Advanced Analytics Technology

BIG DATA & DATA SCIENCE

Teaching Business Statistics through Problem Solving

Big Data & Netflix. Paul Ellwood February 9th, 2015

Easy Execution of Data Mining Models through PMML

Big Data Analytics. David Dietrich, EMC Education Services. April 4, 2013

Insight for Informed Decisions

6. UNC Degree Program Establishments... Courtney Thornton

Synergic Partners: Spanish big-data pioneer

Using Big Data & Analytics to Increase PR & Marketing Brand Awareness

Eric Collins Connected Holdings. Market Makers: Accelerate Your IoT Business with Connected Holdings!

The CS Principles Project 1

College of Agriculture, Engineering and Science INSPIRING GREATNESS

a Data Science Univ. Piraeus [GR]

Welcome. Host: Eric Kavanagh. The Briefing Room. Twitter Tag: #briefr

Elevating the Customer Experience in the Mobile World

The Use of Open Source Is Growing. So Why Do Organizations Still Turn to SAS?

The Canadian Realities of Big Data and Business Analytics. Utsav Arora February 12, 2014

Family Engineering! For Parents & Elementary-Aged Children

Master Specialization in Knowledge Engineering

INDEX. Introduction Page 3. Methodology Page 4. Findings. Conclusion. Page 5. Page 10

Why Big Data Analytics?

Analytics For Everyone - Even You

INTRODUCTION TO DATA SCIENCE USING R

Discovering, Not Finding. Practical Data Mining for Practitioners: Level II. Advanced Data Mining for Researchers : Level III

What Tech Buyers Think TM

Location Analytics for Financial Services. An Esri White Paper October 2013

IDC FutureScape: Worldwide Big Data & Analytics 2016 Predictions. IDC Web Conference November 2015

Big Data Executive Survey

@LogiAnalytics #SelfServiceBI

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

Understanding Today s Chief Data Scientist

POSTGRADUATE PROGRAMS IN APPLIED DATA ANALYTICS

Data Science Certificate Program

Strategic Decisions Supported by SAP Big Data Solutions. Angélica Bedoya / Strategic Solutions GTM Mar /2014

Transcription:

Demystifying Data Science and Big Data Natasha Balac, Ph.D. September, 2016

Background Over 25 Years of Experience in Data Mining Ph.D. in Machine Learning with emphasis on Big Data and Mobile Robots Director of Interdisciplinary Center for Data Science(ICData) at the Qualcomm Institute(CaliIT2) at UCSD Founder and CEO of Data Insight Discovery, Inc. Lecturer UCSD MAS in Data Science and Engineering UCSD Extension Data Mining Certificate Coursera Big Data Specialization

University of California, San Diego UCSD CalIT2 Qualcomm Institute Calit2 is taking ideas beyond theory into practice, accelerating innovation and shortening the time to product development and job creation. Where the university traditionally has focused on education and research, Calit2 extends that focus to include development and deployment of prototype infrastructure for testing new solutions in a real-world context.

Interdisciplinary Center for Data Science ICData Bridge the Industry and Academia Gap Data Science across discipline and verticals Open Standards and Methodology ICData is a non-profit, public academic organization oto promote, educate and innovate in the area of Data Science Inform, Educate and Train Interdisciplinary Center of for Data Science Research and Collaboration oto utilize the power of Data Science for social good Big Data Technologies oto develop innovative, practical curriculum to broaden participation in the field of Data Science

Data Insight Discovery Founded in January 2014 San Diego, CA Women Owned Service Offerings Include Data Driven solutions Predictive Analytics Services Business Intelligence and Analytics Condition Based Maintenance Digital Marketing Systems and IoT Integration Services Big Data Technologies Key Strengths include - Numerous successfully deployed Predictive Analytics Projects - Over 25 years of experience and expertise

Interdisciplinary Data Science Research and Collaborations Data Science across discipline and verticals Fraud Detection Modeling Behaviors UCSD s World-renowned Microgrid Bridge the Industry and Academia Gap Interdisciplinary Center of for Data Science Open Standards and Methodology Biomedical Informatics Smart Grid Analytics Anomaly detection Smart City Generates 92% of campus electricity $8 Million+ in annual savings One of the world s most advanced microgrids Inform, Educate and Train Research and Collaboration Sport Analytics Big Data Technologies IoT Population Health, mhealth Nano-engineering White House Big Data Event: Data to Knowledge to Action Launch Partners Award

What is Big Data? Extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions IBM, 2012

What is Big Data? Wikipedia: an all-encompassing term for any collection of data sets so large and complex that it becomes difficult to process using on-hand data management tools or traditional data processing applications Oxford English Dictionary: data of a very large size, typically to the extent that its manipulation and management present significant logistical challenges

How Big is Big Data?

Gartner Hype Cycle for Emerging Technologies 7/16

Growing Internet of Things Data

What Is The Value In Big Data?

Transforming Data Into Insight For Making Better Decisions Gartner, 2013

What is Data Mining? A set of technologies that uncovers relationships and patterns within large volumes of data that can be used to predict future behavior and events Predictive Analytics is technology that learns form experience to predict the future outcomes in order to drive better business decisions Extracting / Mining Information/Meaning from data Interesting knowledge (rules, regularities, patterns, constraints) from raw data Implicit, previously unknown and unexpected, potentially extremely useful information from data

Terminology Data Science Data Mining Big Data Machine Learning Predictive Analytics Advanced Analytics

Predictive Analytics Process Explore Data Find Patterns Perform Prediction s

Analytics Maturity Levels

Data scientist: The hot new gig in tech Data Scientist: The Sexiest Job of the 21st Century The next sexy job in next 10 years will be statistician Hal Varian, Google Chief Economist Geek Chic Wall Street Journal new cool kids on campus Gartner in 2012 said there would be a shortage of 100,000 data scientists in the United States by 2020 McKinsey Global Institute Big data Report in 2011 By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions - demand that s 60 percent greater than supply The human expertise to capture and analyze big data is both the most expensive and the most constraining factor for most organizations pursuing big data initiatives Thomas Davenport

Data Science Job Growth Not enough Data Scientist Crowdflower survey report: A full 83% of respondents said there weren t enough data scientists to go around By 2018 shortage of 140-190,000 predictive analysts and 1.5M managers / analysts in the US Gartner says the current demand for data scientist exceeds the current supply by factor of three

Data Scientist Skill and Characteristics Intellectual curiosity, Intuition Find needle in a haystack Ask the right questions value to the business Communication and engagements Presentation skills Let the data speak but tell a story Story teller drive business value not just data insights Creativity Guide further investigation Business Savvy Discovering patterns that identify risks and opportunities Measure

Strata Survey Skills

Scoop.it World the Data Science Tools

Citizen Data Scientist This is Bob, our new Citizen Data Scientist. He previously worked as a citizen dentist and a citizen pilot. This cartoon was ably drawn by Jon Carter.

Thank you! natasha@datainsightdiscovery.com Natasha Balac