So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI




Disclosures
James E. Tcheng, MD, FACC, FSCAI
Affiliations / Financial Relationships / Other RWI
- ACC: Chair, Informatics and Health IT Task Force
- Philips Healthcare: consultant (compensated)
- Epic Systems: consultant (not compensated)
- MedStreaming: consultant (not compensated)
- GE Healthcare: consultant (not compensated)
- Lumedx: consultant (not compensated)

Question 1
A megabyte (MB) is to a gigabyte (GB) as a terabyte (TB) is to a:
1. Exabyte (EB)
2. Petabyte (PB)
3. Yottabyte (YB)
4. Zettabyte (ZB)

Question 2
A typical large tertiary care hospital generates ~100 TB of data per year. The printed (text) content of the Library of Congress is estimated to be:
1. 10 TB
2. 100 TB
3. 1000 TB (= 1 PB)
4. 10,000 TB (= 10 PB)
"640K ought to be enough for anybody."

Question 3
One gram of DNA can theoretically hold how much data?
1. ½ Terabyte (TB)
2. ½ Petabyte (PB)
3. ½ Exabyte (EB)
4. ½ Zettabyte (ZB)

Big Data Defined
- Volume. Volume. Volume: megabyte, gigabyte, terabyte, petabyte, exabyte (= 10^9 GB), zettabyte, yottabyte
- Velocity: faster and faster!
- Variety: both structured data and unstructured information (e.g., analog text / audio / video)
- Variability: quality / veracity, quantity, timing
- Value: targets operational insight (e.g., individual patient care)
Laney D, META Group. 3D Data Management: Controlling Data Volume, Velocity, and Variety. Feb 2001.

Big Volume
Scale: KB, MB, TB, PB, EB, ZB, YB
- Library of Congress: ~10 TB of text (20 PB of audio, video)
- Duke Heart Center: ~30 TB of clinical data / yr = Facebook in 1 day; CERN in 1 second
- Google: processes 24 PB of data / day
- All words ever spoken by humans: 5 EB of text
- 1 gram of DNA: 455 EB (~1/2 ZB)
- 2015 worldwide IP traffic: 966 EB (61% Internet video)
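The unit ladder behind these figures can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming decimal (SI) units in which each step up is a factor of 1000; the helper name `convert` and the variable names are illustrative and not part of the original slides.

```python
# Decimal (SI) byte units: each step up the ladder is a factor of 1000.
# (Assumption: the slides use decimal rather than binary prefixes.)
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB", "ZB", "YB"]
SIZE = {unit: 1000 ** i for i, unit in enumerate(UNITS)}


def convert(value, from_unit, to_unit):
    """Convert a quantity between two decimal byte units."""
    return value * SIZE[from_unit] / SIZE[to_unit]


# One exabyte expressed in gigabytes (the slide's "= 10^9 GB").
eb_in_gb = convert(1, "EB", "GB")

# MB -> GB is one step up the ladder, and so is TB -> PB.
same_step = SIZE["GB"] // SIZE["MB"] == SIZE["PB"] // SIZE["TB"]

# Figures quoted on the slide, compared against each other.
# Google's 24 PB/day measured in multiples of ~10 TB of Library of Congress text:
loc_texts_per_google_day = convert(24, "PB", "TB") / 10
# One gram of DNA (455 EB) expressed in zettabytes:
dna_gram_in_zb = convert(455, "EB", "ZB")

print(eb_in_gb, same_step, loc_texts_per_google_day, dna_gram_in_zb)
```

Under these assumptions, 455 EB comes out just under half a zettabyte, which matches the slide's "~1/2 ZB".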

High Velocity

Endless Variety
- Genomic
- Other Omics
- Imaging
- Phenotypic
- Exposure
- Clinical

Huge Variability #headache
- 95% of the world's data is unstructured: text, images, video, voice, etc.
- Most healthcare data is unstructured
- New data types are emerging: messaging, social media, sensor data

Big Data Versus Romance

So Why Big Data?
- Big volume isn't new (stock exchanges)
- Real-time velocity isn't new (railway management)
- Variety and variability of data is increasing (and increasingly difficult to manage)
- Requires changing the data ownership paradigm
- VALUE: the potential for insight and understanding

Turning Big Data into Value
- Data-fication of the World: documentation, events, procedures, billing, images, registries, social media, omics, sensors, etc.
- Volume, Velocity, Variety, Variability
- Analyzing Big Data: natural language processing, text analytics, information extraction, data mining, predictive modelling, inferential analysis, comparative effectiveness, etc.
- Visualizing Big Data: infographics, advanced data visualization, interactive data, contextual modelling, etc.
- Value

Link Between Overall Guidelines Adherence and Mortality
Every 10% increase in guidelines adherence is associated with a 10% decrease in mortality: OR 0.90 (0.84-0.97, P<.001)
Peterson ED et al., JAMA 2006;295:1912-1920
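To see what an odds ratio of 0.90 means in terms of risk, the arithmetic can be sketched as follows. This is an illustration only: the 5% baseline mortality is a hypothetical value chosen for the example and does not come from the cited study, and `apply_odds_ratio` is a made-up helper name.

```python
def apply_odds_ratio(baseline_risk, odds_ratio):
    """Return the risk obtained by scaling the baseline odds by an odds ratio."""
    odds = baseline_risk / (1.0 - baseline_risk)   # probability -> odds
    new_odds = odds * odds_ratio                   # apply the odds ratio
    return new_odds / (1.0 + new_odds)             # odds -> probability


# HYPOTHETICAL baseline mortality, for illustration only (not from the study).
BASELINE_MORTALITY = 0.05
# Odds ratio quoted on the slide for each 10% increase in adherence.
OR_PER_10_PERCENT = 0.90

new_risk = apply_odds_ratio(BASELINE_MORTALITY, OR_PER_10_PERCENT)
relative_reduction = 1.0 - new_risk / BASELINE_MORTALITY

print(f"risk: {BASELINE_MORTALITY:.4f} -> {new_risk:.4f}")
print(f"relative reduction: {relative_reduction:.1%}")
```

With a low baseline risk such as the one assumed here, the relative reduction in risk lands close to, but slightly below, the 10% reduction in odds, because an odds ratio only approximates a risk ratio when the outcome is uncommon.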

Big Data in Healthcare: 6 Key High-Risk, High-Cost Use Cases
1. High-cost patients (identify early, intensively manage)
2. High-probability readmissions (match resources to risk)
3. Triage (assessment in context; posterior probabilities)
4. Clinical decompensation (signal : noise, address alarm fatigue)
5. Adverse events (e.g., AKI, infection, adverse drug events)
6. Treatment optimization for complex, multi-organ disease (multisite longitudinal registries, e.g., PCORnet)
Bates DW et al., Health Affairs 2014;33:1123

Big Data in Healthcare
- 6/09 to 3/10: 826 million tweets, 146 million geo-located, across 1300 counties
- Evaluated for stress and hostility: health, attractiveness, job, curse words
- Combined with CDC and US Census data
- STRONGER correlation than 10 classic predictors, including education, SES, smoking, hypertension (r = .42 vs. .36, p = .049)
Eichstaedt JC et al., Psychological Science 2015;26:159

Implications: Big Data in Cardiology
- What analytics approach? Start at the end, and perfection is the enemy of good
- What measurement sources? Cardiology is already data-rich, including non-traditional data
- How to make predictions actionable? Not just reductionist CDS; think heuristic medical decision making
- Similarity of development cohort to use case? Model (clinical trial) populations versus registries versus reality
- Tailoring the intervention to the individual: actualization and delivery can be high-cost

Big Trends in Health IT
- We are falling well short of achieving meaningful use of HIT
- EHR certification criteria, etc., are stifling innovation
- We must move from closed systems to open software architectures, enabling Big Data analytics, not just document transport
- Big Data vision of an interoperable health data infrastructure
- Using patient-reported, omic, embedded / wireless sensors, open sources
- New discovery likely dependent on shared access to international health data
JASON Report, AHRQ 2014; 10 Year HIT Infrastructure, ONC 2014