Big Data: Opportunities for the Dental Benefits Industry

Similar documents
Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Big Data Big Data/Data Analytics & Software Development

Chapter 11 Map-Reduce, Hadoop, HDFS, Hbase, MongoDB, Apache HIVE, and Related

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Getting to Know Big Data

All You Wanted to Know About Big Data Projects Chida Jan 2014

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

How To Scale Out Of A Nosql Database

BIG DATA, MAPREDUCE & HADOOP

Hadoop: Distributed Data Processing. Amr Awadallah Founder/CTO, Cloudera, Inc. ACM Data Mining SIG Thursday, January 25 th, 2010

Age of Big data. Presented by: Mohammad Iqbal BCM -2014

BIG DATA What it is and how to use?

How To Use Big Data For Telco (For A Telco)

BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics

Reference Architecture, Requirements, Gaps, Roles

Hadoop implementation of MapReduce computational model. Ján Vaňo

Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop

Big Data and Apache Hadoop s MapReduce

Application Development. A Paradigm Shift


Hadoop IST 734 SS CHUNG

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 14

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Interactive data analytics drive insights

Hadoop Ecosystem B Y R A H I M A.

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Hadoop Distributed File System. Jordan Prosch, Matt Kipps

A Brief Introduction to Apache Tez

What is Big Data? Concepts, Ideas and Principles. Hitesh Dharamdasani

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Big Data and Hadoop with components like Flume, Pig, Hive and Jaql

Introduction to Hadoop

Hadoop Distributed File System. T Seminar On Multimedia Eero Kurkela

Tap into Hadoop and Other No SQL Sources

Open source software framework designed for storage and processing of large scale data on clusters of commodity hardware

BIG DATA USING HADOOP

CSE-E5430 Scalable Cloud Computing Lecture 2

Big Data Zurich, November 23. September 2011

Large scale processing using Hadoop. Ján Vaňo

Big Data With Hadoop

Architectures for Big Data Analytics A database perspective

Jeffrey D. Ullman slides. MapReduce for data intensive computing

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Like what you hear? Tweet it using: #Sec360

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

A Brief Outline on Bigdata Hadoop

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

BigMemory and Hadoop: Powering the Real-time Intelligent Enterprise

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

MapReduce with Apache Hadoop Analysing Big Data

Big Data: Are You Ready? Kevin Lancaster

Introduction to Hadoop

Customized Report- Big Data

Changing the Equation on Big Data Spending

Distributed File Systems

Prepared By : Manoj Kumar Joshi & Vikas Sawhney

WA2192 Introduction to Big Data and NoSQL EVALUATION ONLY

Load Rebalancing for File System in Public Cloud Roopa R.L 1, Jyothi Patil 2

Dominik Wagenknecht Accenture

Accelerating and Simplifying Apache

DATA MINING WITH HADOOP AND HIVE Introduction to Architecture

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

INDUS / AXIOMINE. Adopting Hadoop In the Enterprise Typical Enterprise Use Cases

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

Talend Big Data. Delivering instant value from all your data. Talend

BIG DATA TOOLS. Top 10 open source technologies for Big Data

CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop)

Data-Intensive Programming. Timo Aaltonen Department of Pervasive Computing

Hadoop and Map-Reduce. Swati Gore

Approaches for parallel data loading and data querying

Ubuntu and Hadoop: the perfect match

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Integrating a Big Data Platform into Government:

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Introduc)on to the MapReduce Paradigm and Apache Hadoop. Sriram Krishnan

Synopsis of Big Data Technologies

Workshop on Hadoop with Big Data

WHITE PAPER. Four Key Pillars To A Big Data Management Solution

Community Driven Apache Hadoop. Apache Hadoop Basics. May Hortonworks Inc.

EXECUTIVE REPORT. Big Data and the 3 V s: Volume, Variety and Velocity

Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN

Programming Hadoop 5-day, instructor-led BD-106. MapReduce Overview. Hadoop Overview

Chase Wu New Jersey Ins0tute of Technology

INTRODUCTION TO CASSANDRA

NoSQL and Hadoop Technologies On Oracle Cloud

Big Data Analysis and HADOOP

Hadoop Big Data for Processing Data and Performing Workload

Big Data and Data Science: Behind the Buzz Words

Big Data Course Highlights

An Improved Version of Big Data Classification and Clustering using Graph Search Technique

BIG DATA: STORAGE, ANALYSIS AND IMPACT GEDIMINAS ŽYLIUS

Transcription:

Big Data: Opportunities for the Dental Benefits Industry Joel Reichert - VP, Data Strategy Herschel Reich - VP, Payer Consulting September 16, 2014 Big Data: Opportunities for the Dental Benefits Industry Joel Reichert - VP, Data Strategy Herschel Reich - VP, Payer Consulting 1

Agenda Big Data 101 Use Cases For Big Data Analytics in Insurance & Healthcare Opportunities for the Dental Benefits Market Challenges for Implementing Big Data Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 3 From Big Data to Big Dental The Old Paradigm: Big Dreams Big Deficits Big Disappointments The New Paradigm Big Data Big Drilling Big Dental Big Disclaimer Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 4 2

How I Learned about Big Data! I watched Captain America: The Winter Solider 3 times on an eleven hour flight In Captain America's latest big screen adventure, his arch-enemy isn't a diabolical super villain or space alien. It's big data. "The 21st century is a digital book," the captain is told. "Your bank records, medical histories, voting patterns, emails, phone calls, your damn SAT scores! [Our] algorithm evaluates people's past to predict their future. While most of the people in this room won t go as far as Hydra in eliminating the competition, we can appreciate it s business context. I never did see Moneyball but the concept of using analytics to win in sports is only getting more extreme! Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 5 What influenced the development of Big Data solutions? Data Growing data volumes and sources Structured (claims, eligibility, etc.) & Un-Structured (social media, images, etc.) Platform A new scalable solution to manage the data and processing (Hadoop & GFS) Storage New ways to store data in various formats (Text, NoSQL, Columnar, etc.) Processing New ways of processing the data (Batch, Stream, In-Memory, etc.) Analytics New Insights Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 6 3

Big Data Quiz Which of the following is not one of the 4 V s of Big Data? 1. Volume 2. Veracity 3. Variety 4. Victory 5. Velocity Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 7 What is Big Data?. Big data is high-volume, -velocity and -variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making. Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 8 4

History of Big Data 2002 2003 The world just wanted a better open-source search engine. Doug Cutting and Mike Cafarella set out to build it. They called their project Nutch. Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung release paper on GFS (Google File System) 2004 Sanjay Ghemawat and Jeffery Dean Google release paper on MapReduce 2005 Cutting and Cafarella built up the underlying file systems and processing framework that would become Hadoop (in Java, notably, whereas Google s MapReduce used C++) and ported Nutch on top of it 2006 Cutting goes to Yahoo, to build open source technologies based on GFS and MapReduce. Launches Hadoop as an open-source Apache Software Foundation project. 2007 Christophe Bisciglia, a Google engineer, begins teaching a computer science course at the University of Washington focusing on web-scale data. Google s MapReduce code was proprietary, so adopts Hadoop as a proxy. 2008 2009 Slow march of horizontal scalability, growing Hadoop s capabilities from the single digits of nodes into the thousands, every factor of 2 or 1.5 required re-engineering. With Google s blessing, Bisciglia incorporated a company called Cloudera in March 2008 and closed its first funding round in April 2009. 2011 Yahoo spins off Hortonworks as a separate company in June of 2011. Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 9 What is Big Data? The characteristics of Big Data (aka Hadoop) Scalable Fault-tolerant Open Source Distributed system for Data Storage Processing Commodity Hardware The components of Hadoop Distributed File System (HDFS) the storage manager MapReduce the programming model YARN the resource manager Common libraries and utilities http://en.wikipedia.org/wiki/hadoop Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 10 5

The Old Data Management We are leaving the old age of data storage and processing with finite and absolute use cases. Storage limited to files and relational data stores Processing options were limited to scripting, ETL tools and in database processing Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 11 Age of New Data Management Data storage and processing are no longer finite and absolute. The new goal is matching storage and processing, which continue to expand. Spectrum of storage options: ASCII File ProtocolBuffers, Thrift and Avro serialization Sequence File RecordColumnar File ORC (Optimized RC) File Key-Value Column Family Document Property Graph RDF/Triple Stores Row Oriented Column Oriented Spectrum of processing options: Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 12 6

Proliferation of New Data Management Tools Courtesy of Knowledgent (http://knowledgent.com) Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 13 The New Data Management The aspiration is to match storage, processing and analytics to create new value. Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 14 7

Non-Techie Spin on the New World Data, Storage, Processing and Analytics are all increasing rapidly and not necessarily sync Storage has evolved to handle non-structured data in non-relational databases In many cases, Big Data Analytics are being used today even on non- Big Data assets Actuaries need to evolve or will be replaced by AI beings with better personalities! Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 15 Big Data Industry Use Cases Courtesy of Knowledgent (http://knowledgent.com) Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 16 8

Use Cases in Healthcare Clinical Risk Helping providers understand the risk profile of their members using medical records. Member Health Profiles Helping insurance payers understand the risk profile of their members administrative data (claims, labs, etc.) Provider Payer Value Based Reimbursement Helping insurance payers understand and measure provider quality and efficiency. Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 17 Examples of Big Data Applications for the Dental Space Provider Profiling & Fraud Detection in Real Time Customer Service Integrated Cross-Platform / Cross-Product Views Optimized work assignment for underwriters and claim adjudicators Risk Evaluations based upon Claims, HRA and EHR Data Predictive and Prioritized Network Recruiting Guided Customized Product Enrollments Outcomes Based Reimbursement Claim Review performed by AI Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 18 9

Big Data Adoption Gartner Survey Reveals That 64 Percent of Organizations Have Invested or Plan to Invest in Big Data in 2013 Big data investments in 2013 continue to rise, with 64 percent of organizations investing or planning to invest in big data technology. However, less than eight percent of survey respondents have actually deployed. Of the 64 percent of organizations investing or planning to invest in big data technology in 2013, 30 percent have already invested in big data technology, 19 percent plan to invest within the next year, and an additional 15 percent plan to invest within two years. Industries leading big data investments in 2013 are media and communications, banking, and services. Thirty-nine percent of media and communications organizations said that they have already invested in big data, followed by 34 percent of banking organizations and 32 percent of services firms. Planned investments during the next two years are highest for transportation (50 percent), healthcare (41 percent) and insurance (40 percent). However, every vertical industry again shows big data investment and planned investment. http://www.gartner.com/newsroom/id/2593815 Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 19 Big Data & The Hype Big Data is approaching the peak of inflated expectations. Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 20 20 10

The Pending Trough? In a study of more than 100 data scientists.. 71 percent of data scientists believe their jobs have grown more difficult as a result of a multiplying variety of data sources, not just data volume. 1 Only 48 percent of respondents said they had used Hadoop or Spark for their work and 76 percent felt Hadoop is too slow, takes too much effort to program or has other limitations 1 Hadoop has been unrealistically hyped as a universal, disruptive big data solution, noting that it is not a viable solution for some use cases that require complex analytics. 1 Twenty-two percent of the data scientists surveyed said Hadoop and Spark were not well-suited to their analytics. [It was] also found that 35 percent of data scientists who tried Hadoop or Spark have stopped using it. 1 1. http://www.computerworld.com/article/2489767/big-data/data-scientists-frustrated-by-data-variety--find-hadoop-limiting.html Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 21 Big Data Challenges Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 22 11

Non-Techie Translation on the Big Data Challenges If you haven t put this to good use yet, you re in good company The tools are still evolving If you want to succeed, you need a good consultant! Confidential property of Optum. Do not distribute or reproduce without express permission from Optum. 23 Thank you. 12

http://nadpconverge.org/evaluations 13