Big Data in Healthcare: Myth, Hype, and Hope



Similar documents
So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

Information Technology The Challenges of New Asset Management Models. Filipe Frasquilho Director of Information Technology REFER Telecom

Big Data a threat or a chance?

Medical Big Data Interpretation

A Strategic Approach to Unlock the Opportunities from Big Data

BIG DATA STRATEGY. Rama Kattunga Chair at American institute of Big Data Professionals. Building Big Data Strategy For Your Organization

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Statistical Challenges with Big Data in Management Science

THE REAL-TIME OPERATIONAL VALUE OF BIG DATA MATT DAVIES

HOW WILL BIG DATA AFFECT RADIOLOGY (RESEARCH / ANALYTICS)? Ronald Arenson, MD

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

Big Data Analytics in Space Exploration and Entrepreneurship

Data Centric Computing Revisited

From Data to Foresight:

Data Analytics in Organisations and Business

Applications for Business Intelligence, Predictive Analytics and Big Data

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith /

Test Data Management in the New Era of Computing

Big Data Analytics- Innovations at the Edge

BIG DATA FUNDAMENTALS

Introduction to the Mathematics of Big Data. Philippe B. Laval

Big Data and Analytics

BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics

BIG DATA: PROMISE, POWER AND PITFALLS NISHANT MEHTA

Big Data Analytics: 14 November 2013

12 th World Telecommunication/ICT Indicators Symposium (WTIS-14)

1. Understanding Big Data

Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

A Case Study on the Use of Unstructured Data in Healthcare Analytics. Analysis of Images for Diabetic Retinopathy

Digital Catapult. The impact of Big Data in a Connected Digital Economy Future of Healthcare. Mark Wall Big Data & Analytics Leader.

Big Analytics: A Next Generation Roadmap

Introduction to Engineering Using Robotics Experiments Lecture 17 Big Data

How Big Is Big Data Adoption? Survey Results. Survey Results Big Data Company Strategy... 6

NETWORKING SOLUTIONS FOR THE HEALTHCARE INDUSTRY

EMC DOCUMENTUM CONTENT ENABLED EMR Enhance the value of your EMR investment by accessing the complete patient record.

Big Data, Integration and Governance: Ask the Experts

Fujitsu Big Data Software Use Cases

GE Healthcare. Centricity PACS and PACS-IW with Universal Viewer* Where it all comes together

Big Data :: Big Demand

Big Data: Study in Structured and Unstructured Data

Intel IT s Big Data Transformation. Aziz Safa VP, GM Enterprise Applications & Strategy November, 2014

TECHNIQUES FOR OPTIMIZING THE RELATIONSHIP BETWEEN DATA STORAGE SPACE AND DATA RETRIEVAL TIME FOR LARGE DATABASES

Doing Multidisciplinary Research in Data Science

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg

DGE /DG Connect

Big Data Integration: A Buyer's Guide

Big Data and Analytics: Challenges and Opportunities

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

CUSTOMER RELATIONSHIP MANAGEMENT AND BUSINESS ANALYTICS: A LEAD NURTURING APPROACH

A New Era Of Analytic

Outline. What is Big data and where they come from? How we deal with Big data?

Research Note What is Big Data?

Open & Big Data for Life Imaging Technical aspects : existing solutions, main difficulties. Pierre Mouillard MD

COMP9321 Web Application Engineering

Data Isn't Everything

BIG DATA. Value 8/14/2014 WHAT IS BIG DATA? THE 5 V'S OF BIG DATA WHAT IS BIG DATA?

Big Health Data the challenges and connections

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

General overview, and sources and uses of Big Data for urban and regional analysis

IBM - Fueling the Oil & Gas Industry

Find your future in the history

Big Data Analytics: Today's Gold Rush November 20, 2013

Intelligent Tools For A Productive Radiologist Workflow: How Machine Learning Enriches Hanging Protocols

The Big Data Paradigm Shift. Insight Through Automation

Big Data Introduction, Importance and Current Perspective of Challenges

Big Impacts from Big Data UNION SQUARE ADVISORS LLC

IBM Software Integrating and governing big data

How Using Big Data in Security Helps (and Hurts) Us

Cutting Through The Hype: What You Need To Know About Big Data

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013

So What s the Big Deal?

Healthcare data analytics. Da-Wei Wang Institute of Information Science

What happens when Big Data and Master Data come together?

Big Data Analytics. The Hype and the Hope* Dr. Ted Ralphs Industrial and Systems Engineering Director, Laboratory

Big Data: Image & Video Analytics

Big Data-Challenges and Opportunities

HIPAA and Big Data Twenty Third National HIPAA Summit. March 17, 2015 Mitchell W. Granberg, Optum Chief Privacy Officer

Transcription:

Big Data in Healthcare: Myth, Hype, and Hope Woojin Kim, MD Insert Organization Logo Here or Remove

Disclosure Co-founder/Shareholder Montage Healthcare Solutions, Inc Consultant Infiniti Medical, LLC Advisory board Zebra Medical Vision Ltd

http://globalnews.ca/news/696445/research-in-big-data-analytics-working-to-save-lives-of-premature-babies/

Objectives What is Big Data?

What is Big Data? Popular term for any collection of data sets (both structured and unstructured)

What is Big Data? Popular term for any collection of data sets (both structured and unstructured) so BIG and complex

What is Big Data? Popular term for any collection of data sets (both structured and unstructured) so BIG and complex that it is difficult to process using traditional database processing techniques

Volume Velocity Variety 3D Data Management: Controlling Data Volume, Velocity, and Variety by Doug Laney, Feb 2001

Volume Velocity Variety Variability Value Veracity http://en.wikipedia.org/wiki/big_data

Volume Velocity Variety Variability Value Veracity http://en.wikipedia.org/wiki/big_data

Google processes over 40K search queries/sec, over 3.5 billion in a single day http://www.internetlivestats.com/google-search-statistics/

http://mozy.com/blog/wp-uploads/2013/06/interactive_infographic/?int_cid=us-blog-txt-petabyte

http://mozy.com/blog/wp-uploads/2013/06/interactive_infographic/?int_cid=us-blog-txt-petabyte

http://mozy.com/blog/wp-uploads/2013/06/interactive_infographic/?int_cid=us-blog-txt-petabyte

http://mozy.com/blog/wp-uploads/2013/06/interactive_infographic/?int_cid=us-blog-txt-petabyte

What about radiology?

12 million reports from 1988 to 2015 22 GB

12 million reports from 1988 to 2015 22 GB

12 million reports from 1988 to 2015 22 GB

12 million reports from 1988 to 2015 22 GB PACS archive? 250 TB

12 million reports from 1988 to 2015 22 GB PACS archive? 250 TB

http://articles.mercola.com/sites/articles/archive/2009/08/01/how-large-is-a-petabyte.aspx

Volume Velocity Variety Variability Value Veracity http://en.wikipedia.org/wiki/big_data

Your eyes take about 400 ms to blink once

Your eyes take about 400 ms to blink once Programs that use trading algorithms can make a trade based on fast-moving market data in about 0.7 ms

What will happen? How? Prescriptive Analytics Foresight Why? Predictive Analytics Value What happened? Diagnostic Analytics Insight Descriptive Analytics Hindsight Difficulty Adapted from Gartner (http://timoelliott.com/blog/2013/02/gartnerbi-emea-2013-part-1-analytics-moves-to-the-core.html)

What will happen? How? Prescriptive Analytics Foresight Why? Predictive Analytics Value What happened? Diagnostic Analytics Insight Descriptive Analytics Hindsight Difficulty Adapted from Gartner (http://timoelliott.com/blog/2013/02/gartnerbi-emea-2013-part-1-analytics-moves-to-the-core.html)

What Wal-Mart Knows About Customers' Habits The New York Times (November 14, 2004)

What will happen? How? Prescriptive Analytics Foresight Why? Predictive Analytics Value What happened? Diagnostic Analytics Insight Descriptive Analytics Hindsight Difficulty Adapted from Gartner (http://timoelliott.com/blog/2013/02/gartnerbi-emea-2013-part-1-analytics-moves-to-the-core.html)

What is Analytics 3.0?

Three eras of analytics Analytics 1.0 Analytics 2.0 Analytics 3.0 Data Type Small, structured Large, unstructured All types combined Primary Analytics Type Descriptive Descriptive, predictive Prescriptive Davenport, TH. Big Data @ Work

Three eras of analytics Analytics 1.0 Analytics 2.0 Analytics 3.0 Data Type Small, structured Large, unstructured All types combined Primary Analytics Type Descriptive Descriptive, predictive Prescriptive Davenport, TH. Big Data @ Work

Three eras of analytics Analytics 1.0 Analytics 2.0 Analytics 3.0 Data Type Small, structured Large, unstructured All types combined Primary Analytics Type Descriptive Descriptive, predictive Prescriptive Davenport, TH. Big Data @ Work

Three eras of analytics Analytics 1.0 Analytics 2.0 Analytics 3.0 Data Type Small, structured Large, unstructured All types combined Primary Analytics Type Descriptive Descriptive, predictive Prescriptive Davenport, TH. Big Data @ Work

The Eight Levels of the Analytics Adoption Model http://www.healthcatalyst.com/healthcare-analytics-adoption-model/

http://compass.ups.com/ups-driver-avoid-left-turns/

Saving 1 min per driver/day $14.5 million/year

Saving 1 mile per driver/day $30 million/year

What about radiology?

http://www.imagingbiz.com/portals/medical-imaging-review/day-life-radiology-taking-advantage-data-evidence-based-decisions

What about healthcare?

https://www.healthcatalyst.com/predictive-analytics-healthcare-lessons

@woojinrad THANK YOU

Big Data David Piraino, M.D.

Conflict of Interest AGFA advisory Board Royalties on Stereotactic Patents I m an old guy and don t like change

Big Data Yes, we have it in imaging Yes, we can discover knowledge from it No, we don t have to tools to discover knowledge

Big Data Definition data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, curation, search, sharing, storage, transfer, visualization, and http://en.wikipedia.org/wiki/big_data information privacy. an accumulation of data that is too large and complex for processing by traditional database management tools http://www.merriam-webster.com/dictionary/big%20data

Volume Cleveland Clinic Definition The quantity of data that is generated is very important in this context. It is the size of the data which determines the value and potential of the data under consideration and whether it can actually be considered Big Data or not. The name Big Data itself contains a term which is related to size and hence the characteristic. New Data Per DAY Cleveland Clinic Radiology 50 GB Databases 1.5 TB images 1+ TB log files 0.5 TB other? Sensor data Total 3-4 TB That is pretty BIG http://en.wikipedia.org/wiki/big_data

Volume US Definition The quantity of data that is generated is very important in this context. It is the size of the data which determines the value and potential of the data under consideration and whether it can actually be considered Big Data or not. The name Big Data itself contains a term which is related to size and hence the characteristic. http://en.wikipedia.org/wiki/big_data New Data Per DAY US (Projecting from Cleveland Clinic data) 35 TB Databases 1+ PB images 800 TB log files 400 TB other? Sensor data Total 2-3 PB per day That is BIG How BIG is the rest of imaging? PACS data per year estimate of 0.6 to 1.2 Exabytes per year as 50% of total medical data. http://blogs.sas.com/content/hls/2011/10/21/how-big-is-big-data-in-healthcare/

Variety Definition This means that the category to which Big Data belongs to is also a very essential fact that needs to be known by the data analysts. This helps the people, who are closely analyzing the data and are associated with it, to effectively use the data to their advantage and thus upholding the importance of the Big Data. Images, structured text, unstructured text, wave forms, scanned documents, video etc Categories Image CT MG Etc. Text Reports Structured data Breast Lung screening Log files Video Wave forms Etc. http://en.wikipedia.org/wiki/big_data

Velocity Definition The term velocity in the context refers to the speed of generation of data or how fast the data is generated and processed to meet the demands and the challenges which lie ahead in the path of growth and development. It s getting faster We can generate image data and log data quickly Turn around times are decreasing ED under 30 minutes Average within a few hours As we provide more real time interpretation, we need more real time analysis. From EMR data can we predict what resources we need in the next few minutes, hours, and days From population data can we predict resources over months to years. From outcome data, personal health information and genomic data can we predict the best exam for each patient http://en.wikipedia.org/wiki/big_data

Variability Definition This is a factor which can be a problem for those who analyse the data. This refers to the inconsistency which can be shown by the data at times, thus hampering the process of being able to handle and manage the data effectively. Variable and unstructured text and images Unstructured reports Structured reports Unstructured ordering and clinical information Etc http://en.wikipedia.org/wiki/big_data

Veracity Definition The quality of the data being captured can vary greatly. Accuracy of analysis depends on the veracity of the source data. Radiologists are always right So no problem here Well we can work on this There are definitions of truth like surgery and pathology as well as standard outcomes that can be used http://en.wikipedia.org/wiki/big_data

Complexity Definition Data management can become a very complex process, especially when large volumes of data come from multiple sources. These data need to be linked, connected and correlated in order to be able to grasp the information that is supposed to be conveyed by these data. This situation, is therefore, termed as the complexity of Big Data. Correlation is possible but not easy Cross document and cross patient linkages are available in most cases Correlation is possible but not easy Prediction is much harder http://en.wikipedia.org/wiki/big_data

We should approach as a Data Science problem Data analysis hierarchy Descriptive: summarize without further interpretation Exploratory: search for trends, correlations, or relationships to generate new ideas Inferential: will the exploratory analysis hold true for other similar data but not how or why Predictive: subset of data is used to predict an outcome on a single person or unit Causal: identifies magnitude and direction of relationships. Usually only possible with randomized trials Mechanistic: one measurement always leads to a specific behavior like gravity The Elements of Data Analytic Style A guide for people who want to analyze data. Jeff Leek http://leanpub.com/datastyle

Correlation: % CT Utilization to Mean Temperature (0.81)