Finding Value and Being Valuable in the Trough of Disillusionment

Finding Value and Being Valuable in the Trough of Disillusionment Jordan McIver uk.linkedin.com/in/jmcdatascience February 2016

Agenda Why you should stay awake What will it take Give me a break

Agenda Why you should stay awake

Gartner Hype Cycle - 2014

Gartner Hype Cycle - 2015

Mo Data Mo Problems Incremental Change Brain.Contents %<% rm BS %<% insert Terminology.Consensus Fish Sticks instead of Fish Poles

Opportunity for Them

Opportunity for You
55 per cent of data scientists have fewer than three years of experience in the discipline
84% of CIOs believe that their organization can analyze data in real time; only 42% of developers agree with that statement
http://www.sas.com/content/dam/sas/en_gb/image/other1/events/wmagds/datascientist-survey-report-web%20final.pdf
https://voltdb.com/sites/default/files/real-time-data-report.pdf

Opportunity for You
Wings Shell Gills
https://www.dezyre.com/article/type-a-data-scientist-vs-type-b-data-scientist/194

Job Role: Data Scientist
The following experience:
Working in a technology business developing new products, at least 4 years' experience in developing and delivering analytical solutions
Familiar with at least two industries: Banking & Capital Markets, Retail & Consumer Goods, Utilities, Telecom, Healthcare, High Tech Manufacturing
Experience in at least two analytics applications: Machine Data utilization, Internet of Things, Process analytics, Supply Chain Analytics
Proficiency in R and Python and at least two years' working experience with the following tools: Map Reduce (Java or other language), Mahout, Hive or Pig, Graph Databases
A solid understanding of how software components can be integrated to form a solution architecture, pros and cons of different technologies etc.
Key Technical Requirements:
Strong experience relating to predictive modeling, data mining, data exploration etc.
Design and development of modeling data marts (feature engineering)
Development of precise requirements
SQL
Documentation & communication skills
The candidate must have an outstanding academic background, at least a 2:1 degree and a Master's degree or equivalent in either Maths, Statistics, Economics, Finance
The successful candidate will have at least 4 years' experience in a cutting edge technology business
Previous experience in building and explaining sophisticated models to senior management and incorporating feedback in model development

Agenda What will it take

"If you just do analytics, if you just do scripting, if you just make a model and this model does not go into production then at the end of the day you just did research, but your company is not going to profit"
http://www.computing.co.uk/ctg/news/2433095/a-lot-of-companies-will-stop-hiring-data-scientists-when-they-realise-that-the-majority-bring-no-value-says-data-scientist

Hypothesis led experimentation over predetermined solutions
Actionable response to events over data reporting
Building production ready prototypes over comprehensive IT strategies
Stream processing over relational databases

What do you get. Quicker, cheaper, better.. Manage uncertainty.. Business change!!!!!!!! ANALOGIES

Agile Data.. not Data Agility Approaches, tools, ethos, science, engineering, analysis

Agile Data Science
http://www.datasciencemanifesto.org/
Solving problems, not models or algorithms
All validation of data, hypotheses and performance should be tracked, reviewed and automated
Prior to building a model, construct an evaluation framework with end-to-end business focused acceptance criteria
A product needs a pool of measures to evaluate its quality. A single number cannot capture the complexity of reality
Even research can be broken down into clearly defined tasks; the smallest of iterations should be preferred in acquiring, integrating and correcting knowledge
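The "evaluation framework" and "pool of measures" points can be illustrated with a small sketch. This is only one possible reading, not the manifesto's or the speaker's implementation: the measure names, the ACCEPTANCE thresholds and the helper functions are all hypothetical, and a real project would derive its criteria from the business problem.

```python
from typing import Callable, Dict, List, Tuple

Labels = List[int]  # binary labels: 1 = positive, 0 = negative


def confusion(y_true: Labels, y_pred: Labels) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) counts for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def accuracy(y_true: Labels, y_pred: Labels) -> float:
    tp, fp, fn, tn = confusion(y_true, y_pred)
    total = tp + fp + fn + tn
    return (tp + tn) / total if total else 0.0


def precision(y_true: Labels, y_pred: Labels) -> float:
    tp, fp, _, _ = confusion(y_true, y_pred)
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(y_true: Labels, y_pred: Labels) -> float:
    tp, _, fn, _ = confusion(y_true, y_pred)
    return tp / (tp + fn) if (tp + fn) else 0.0


# A pool of measures rather than a single number.
MEASURES: Dict[str, Callable[[Labels, Labels], float]] = {
    "accuracy": accuracy,
    "precision": precision,
    "recall": recall,
}

# Hypothetical acceptance criteria, agreed before any model is built.
ACCEPTANCE: Dict[str, float] = {"accuracy": 0.7, "precision": 0.6, "recall": 0.6}


def evaluate(y_true: Labels, y_pred: Labels) -> Dict[str, float]:
    """Score predictions against every measure in the pool."""
    return {name: measure(y_true, y_pred) for name, measure in MEASURES.items()}


def accepted(scores: Dict[str, float], criteria: Dict[str, float] = ACCEPTANCE) -> bool:
    """A model passes only if every criterion is met."""
    return all(scores[name] >= threshold for name, threshold in criteria.items())
```

Because the criteria are fixed up front, any candidate model can be scored with evaluate() and gated with accepted(), and the same check can be tracked and automated on every iteration.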

Agile Data Architecture Russell Jurney

Agile Data Development
Cloud
EDA vs./into production
Notebooks (ipython) vs. code editors (PyCharm, Intellij, Eclipse)
R testthat, RUnit, quickcheck, svUnit
Continuous Integration - Jenkins / Hudson
Sam Savage
Refactoring
TDD/BDD
https://www.linkedin.com/pulse/agile-data-scientists-do-scale-sam-savage
http://columbia-applied-data-science.github.io/pages/lowclass-python-style-guide.html
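As a rough illustration of the refactoring and TDD points, here is a minimal sketch of a helper lifted out of exploratory notebook code and covered by unit tests. The slide lists R test frameworks; Python's standard unittest module is used here instead, and the standardise function is a hypothetical example rather than anything from the talk.

```python
import statistics
import unittest
from typing import List


def standardise(values: List[float]) -> List[float]:
    """Rescale values to zero mean and unit (population) standard deviation."""
    if not values:
        raise ValueError("values must not be empty")
    mean = statistics.fmean(values)
    spread = statistics.pstdev(values)
    if spread == 0:
        raise ValueError("values are constant; cannot standardise")
    return [(v - mean) / spread for v in values]


class StandardiseTest(unittest.TestCase):
    """Tests written alongside the function, in the spirit of TDD."""

    def test_result_has_zero_mean_and_unit_spread(self):
        scaled = standardise([1.0, 2.0, 3.0, 4.0])
        self.assertAlmostEqual(statistics.fmean(scaled), 0.0)
        self.assertAlmostEqual(statistics.pstdev(scaled), 1.0)

    def test_constant_input_is_rejected(self):
        with self.assertRaises(ValueError):
            standardise([5.0, 5.0, 5.0])

    def test_empty_input_is_rejected(self):
        with self.assertRaises(ValueError):
            standardise([])
```

Saved as a module, the tests can be run with `python -m unittest`, which is also the kind of command a continuous integration job would call on every commit.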

EXAMPLE!!!!

Proposed Timeline (Using a Test & Learn Process)
[Timeline figure: axis in weeks (0, 2, 4, 6, 8); labels "Showcase" and "2 Weeks"]

Agenda Give me a break

The first step in extracting features is to look at the data
Initial Insights
Class 1 and Class 2 are the most difficult to separate (confirmed by classification performance results)
Elevation could be important to create new features as structure is evident in some plots
Some values are zero / missing
Many variable combinations do not indicate any interesting structures
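Two of these insights, zero/missing values and elevation as a source of new features, can be sketched in a few lines. The slide does not show the data set or the features actually built, so the row layout, the "slope" column, the 500-unit band width and both function names below are assumptions made purely for illustration.

```python
from typing import Dict, List, Optional

Row = Dict[str, Optional[float]]  # hypothetical layout: column name -> value


def missing_counts(rows: List[Row], zero_is_missing: bool = True) -> Dict[str, int]:
    """Count missing values per column, optionally treating 0 as missing."""
    counts: Dict[str, int] = {}
    for row in rows:
        for name, value in row.items():
            is_missing = value is None or (zero_is_missing and value == 0)
            counts[name] = counts.get(name, 0) + int(is_missing)
    return counts


def elevation_band(elevation: Optional[float], width: float = 500.0) -> Optional[int]:
    """Derive a coarse categorical feature by binning elevation into bands."""
    if elevation is None:
        return None
    return int(elevation // width)


# Hypothetical sample rows, not taken from the talk's data set.
rows: List[Row] = [
    {"elevation": 2596.0, "slope": 3.0},
    {"elevation": 0.0, "slope": None},
    {"elevation": 3100.0, "slope": 0.0},
]
bands = [elevation_band(row["elevation"]) for row in rows]
```

Whether a zero really means "missing" depends on the variable, which is why it is a switch here rather than a fixed rule.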

OUT OF DATE..