Predictive Analytics for Demand Forecasting and Planning Managers A Big Data Challenge Hans Levenbach, Delphus, Inc.

Similar documents
Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Introduction to Predictive Analytics. Dr. Ronen Meiri

ANALYTICS IN BIG DATA ERA

Big Data Hope or Hype?

Big Data / FDAAWARE. Rafi Maslaton President, cresults the maker of Smart-QC/QA/QD & FDAAWARE 30-SEP-2015

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

New Jersey Big Data Alliance

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

Leveraging Predictive Analytic and Artificial Intelligence Technology for Financial and Clinical Performance

Doing Multidisciplinary Research in Data Science

Using Predictive Analytics to Reduce COPD Readmissions

BIG DATA, BIOBANKS AND PREDICTIVE ANALYTICS FOR A BETTER CLINICAL OUTCOME

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage

The Rise of Industrial Big Data

Certified Professional in Demand Forecasting Workshop

Drivers to support the growing business data demand for Performance Management solutions and BI Analytics

CoolaData Predictive Analytics

How To Use Big Data In Healthcare

Information Management course

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

Embracing SaaS: A Blueprint for IT Success

Survey of Big Data Benchmarking

Digital Earth: Big Data, Heritage and Social Science

Using Ultra-Large Data Sets in Healthcare New Questions-New Answers

Big Data Challenges in Bioinformatics

Session 3: The World of Big Data from the Evolving Digital World Laura Jenkins Jirele, Jeff Kirsch, Tom Jirele

NITRD and Big Data. George O. Strawn NITRD

Big Data: Beyond the Hype

Big Data Analytics & Electronics System Design

Medical Big Data Interpretation

Data Mining and Machine Learning in Bioinformatics

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Definition of Computers. INTRODUCTION to COMPUTERS. Historical Development ENIAC

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router

Hadoop. Sunday, November 25, 12

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

ANALYTICS CENTER LEARNING PROGRAM

Data Centric Computing Revisited

Global SME Big Data Market

Big Analytics: A Next Generation Roadmap

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER

Dimensionalizing Big Data. WA State vs. peers. Building on strengths CONTENTS. McKinsey & Company 1

Hadoop Cluster Applications

Big Data. George O. Strawn NITRD

Foundations of Business Intelligence: Databases and Information Management

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

Introducing Big Data. Abstract. with Small Changes. Agenda. Big Data in the News. Bits and Bytes

The Internet of Things

Coverage Analysis: The Cornerstone of Clinical Research Billing Presented by: Mary L. Veazie, CPA, MBA, CHC, CHRC Executive Director, Clinical

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Outline. What is Big data and where they come from? How we deal with Big data?

Advanced Analytics for Call Center Operations

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP

Big Data-Challenges and Opportunities

Introduction to Analytics and Big Data - Hadoop. Rob Peglar EMC Isilon

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University

Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014

BUY BIG DATA IN RETAIL

Managing big data for smart grids and smart meters

Interactive Data Visualization with Focus on Climate Research

Big Data: Study in Structured and Unstructured Data

Big Data. Fast Forward. Putting data to productive use

Vinay Parisa 1, Biswajit Mohapatra 2 ;

PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY

Big Data Services at DKRZ

Healthcare Delivery. Transforming. through Mobility Solutions. A Solution White Paper - version 1.0

redesigning the data landscape to deliver true business intelligence Your business technologists. Powering progress

Taking A Proactive Approach To Loyalty & Retention

Cloudian The Storage Evolution to the Cloud.. Cloudian Inc. Pre Sales Engineering

Biomedical Informatics Applications, Big Data, & Cloud Computing

Data-Driven Decisions: Role of Operations Research in Business Analytics

TUT NoSQL Seminar (Oracle) Big Data

Big Data: Image & Video Analytics

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Model Hospital Management System. Integrate Every Aspect of Patient Care with a Hospital Management System

Transcription:

Predictive Analytics for Demand Forecasting and Planning Managers A Big Data Challenge Hans Levenbach, Delphus, Inc. ISF2013, KAIST College of Business, Seoul, Korea

Agenda Role of Big Data in Small Predictive Analytics Big Data Applications Hospital Patient Care Activity: Geography, Payer Class & Product Lines Multiple Competitor & Product Mix Forecasting Multi-State & Migration Perspectives Physician Activity & Multiple Physician Ranking Hospital Closing and Merger Simulations A Big Data and Reporting Framework For Hospital Management A Modeling Pathway For Demand Forecasters/Planners The PEER Process. Preparing Healthy Data Executing Analytic Modeling Methodologies Evaluating Performance Diagnostics Reconciling Multiple Modeling Pathways 2

Big Data Usage Examples (in petabytes) The world's effective capacity to exchange information through two-way telecom networks was 281 petabytes of (optimally compressed) information in 1986, 471 petabytes in 1993, 2,200 petabytes in 2000, and 65,000 (optimally compressed) petabytes in 2007 - this is the informational equivalent to every person exchanging 6 newspapers per day! Internet: Google processes about 24 petabytes of data per day Telecoms: AT&T transfers about 30 petabytes of data through its networks each day Physics: The experiment in the Large Hadron Collider produce about 15 petabytes of data per year, which will be distributed over the LHC Computing grid Neurology: It is estimated that the human s brain's ability to store memories is equivalent to about 2.5 petabytes of binary data Climate science: The German Climate Computing Centre (DKRZ) has a storage capacity of 60 petabytes of climate data ] Archives: The internet archive contains about 5.8 petabytes of data as of December 2010. It was growing at the rate of about 100 terabytes per month in March 2009 Film: The 2009 movie Avatar is reported to have taken over 1 petabyte of local storage at Weta Dogital for the rendering of the 3D CGI effects In August 2011, IBM was reported to have built the largest storage array ever, with a capacity of 120 petabytes In January 2012, Cray began construction of the Blue Water Supercomputer, which will have a capacity of 500 petabytes making it the largest storage array ever if realized!

How Big Is a Petabyte? A petabyte (derived from the SI prefix peta) is a unit of information equal to one quadrillion bytes, or 1024 terabytes. The unit symbol for the petabyte is PB. The prefix peta (P) indicates the fifth power to 1000. According to IBM: Everyday, we create 2.5 quintillion bytes of data so much that 90% of the data in the world today has been created in the last two years alone

Predictive Analytics For Healthcare Applications Predictive analytics encompasses a variety of techniques from data mining, statistics, and game theory that analyze current and historical facts to make predictions about future events. Although healthcare industry has lagged behind sectors like retail and banking in the use of big data partly because of concerns about patient confidentiality it could soon catch up We have to bring the science of management back into healthcare Donald Berwick, MD Institute of Healthcare Improvement McKinsey Company Donald Berwick, MD Institute of Healthcare Improvement

Step 1: Preparing P Healthy Data: Centralized Data Source for NY Hospital Applications - SPARCS The annual OUTPATIENT.ZIP file from SPARCS is 700 MB s and expands to 18 gigabytes in a text file format. The annual INPATIENT.ZIP file from SPARCS is 500 MB and expand to 7 GB in a text flat file format. How many hospitals?: New York 268, Connecticut over 30, New Jersey over 140 How many patient records per year?: NY inpatient Records over 2.5 million How many lowest level records in the demand table if we had all hospitals SQL with SPARCS data?

Typical Database Sizes in Supply Chain Organizations Wal-Mart has 5,000 Stores, 100,000 items, and if a typical store carries a full line of items, then you can then potentially expect 500,000,000 lowest level records If an online Store has 6 Distribution Centers and 35,000 Items, and each distribution center carries a full line of items, one can then expect 210,000 lowest level records For NY Hospitals, with 5 years of history and approximately 30 competitors, 15 geographic areas, 4 financial classes, 6 services and 23 products we end up with 220,000 lowest level records in the demand table

Step 2: Executing E Analytic Modeling:Methodologies Predictive Analytic Methods Tend To Be Data-Driven The use of current and past data, in conjunction with statistical, structural or other analytical models and methods, to determine the likelihood of certain future events As you move up the steps, the complexity of the approaches increases while the volume of data decreases FORECASTING DECISION TREE CLUSTERING Data framework tends to be structured and connected as DATA QUALITY drives credibility in modeling results

Big Data Techniques Cluster Analysis Cluster analysis or clustering is the assignment of a set of observations into subsets (called clusters) so that observations in the same cluster are similar in some sense. Clustering is a method of unsupervised learning, and a common technique for statistical data analysis used in many fields, including machine learning, data mining, pattern recognition, image analysis and bioinformation.

Decision Trees: Progressive Class Distinction All Patients Descend tree using any available differentiating attributes; natural, derived or inferred; separately or in combination Primary Tumor Site Lung Prostate Colon Colon: Classification by Stage Blood Adenoma C1 D Normal > 100 product line/service combinations B1 Met

Patient-Centric Demand Forecasting For Hospital Management Patient Product/Service Demand Planning Patient Classification with Co- Morbidities & Clinical Pathways Supply Components

Applications in Healthcare Understanding medical prevalence, we can predict Admissions --> Co-Morbidities --> Physician Work Flow --> Product/Service Mix --> Supply Chain requirements Patient Care Activity: Geography, Payer Class & Product Lines Multiple Competitor & Product Mix Forecasting Physician Activity & Multiple Physician Ranking Multi-State & Migration Perspectives Berger Commission Type Simulations e.g. Hospital Closings and Mergers

Product Mix and Service Need Service Mix for First Year Service Mix for Forecast Year Surgery 31% Maternity 14% Psychiatry 11% Surgery 30% Maternity 15% Psychiatry 9% Medicine 44% Medicine 46% Maternity Psychiatry Medicine Surgery

Replenishment Planning White Plains Occupancy 300 290 280 270 260 250 240 230 220 210 200 Jan-04 May-04 Sep-04 Jan-05 May-05 Sep-05 Jan-06 May-06 Sep-06 Jan-07 May-07 Sep-07 Jan-08 May-08 Avg. Beds Occupied

Step 3: Evaluating E Performance Diagnostics Predictive Analytics: Forecasting Service Needs Maternity (History, Trend-Cycle & Prediction Limits 4000 3500 3000 Admissions 2500 2000 1500 1000 500 Data Lower PL Upper PL Trend-Cycle 0 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 Time (Months) 8000 Surgery (History, Trend-Cycle and Prediction Limits) Admissions 7000 6000 5000 4000 3000 2000 1000 Data Lower PL Upper PL Trend-Cycle 0 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 85 89 Time (Months)

Predictive Analytics Development of Methods The use of current and past data, in conjunction with statistical, structural or other analytical models and methods, to determine the likelihood of certain future events Forecasting and Strategic Planning methods cover a range from relatively simple time series models to more advanced techniques such as simulation and advising REPORTING & ADVISING SIMULATION & SCENARIO PLANNING FORECASTING HL/PS, ISF2010 San Diego, CA

Step 4: Reconciling R Multiple Modeling Pathways Predictive Analytics - Complexity Data Scale versus Complexity of Methods The use of current and past data, in conjunction with statistical, structural or other analytical models and methods, to determine the likelihood of certain future events Predictive methods cover a range from relatively simple classification through forecasting to more advanced techniques such as simulation and advising MONITORING & ADVISING SIMULATION & SCENARIO PLANNING T/S FORECASTING DECISION TREES CLUSTERING As you move up the steps, the complexity of the approaches increases while the volume of data decreases

A Modeling Framework for Hospital Management (In the Cloud Software-as-a-Service Model) Data Resource Multi-year Multihospital Information Mining & Analytics Product Line Trends Spatial Analysis Market Shares Budget PEER Planner Dashboard Forecasting & Collaborative Planning Competitive Simulation Model Interactive

Implications for Researchers and Practitioners Demand Forecasting Research Opportunities Curing Unhealthy Structured Data Intermittent Missing Transitioning new product intros Consistent,... Demand Modeling with Structured Relational Datasets Product/Service Hierarchies 23 Product & 6 Service Summary Levels Place Segmentations Geo Regions, Payor Class, Period Granularities Months, Weeks, Days, Hours New Challenges for Forecasting Decision Support Systems scalable methods for very large datasets

Questions? Contact: Hans Levenbach Email: hlevenbach@delphus.com URL: www/delphus.com and www.cpdftraining.org