Process Mining Data Science in Action



Similar documents
Process Mining The influence of big data (and the internet of things) on the supply chain

Using Process Mining to Bridge the Gap between BI and BPM

Chapter 4 Getting the Data

Summary and Outlook. Business Process Intelligence Course Lecture 8. prof.dr.ir. Wil van der Aalst.

Process Mining. ^J Springer. Discovery, Conformance and Enhancement of Business Processes. Wil M.R van der Aalst Q UNIVERS1TAT.

Data Science. Research Theme: Process Mining

Chapter 12 Analyzing Spaghetti Processes

Model Discovery from Motor Claim Process Using Process Mining Technique

Process Mining and Visual Analytics: Breathing Life into Business Process Models

Business Process Modeling

Process Mining: Making Knowledge Discovery Process Centric

BIS 3106: Business Process Management. Lecture Two: Modelling the Control-flow Perspective

Business Intelligence and Process Modelling

Process Modelling from Insurance Event Log

ProM 6 Exercises. J.C.A.M. (Joos) Buijs and J.J.C.L. (Jan) Vogelaar {j.c.a.m.buijs,j.j.c.l.vogelaar}@tue.nl. August 2010

Mercy Health System. St. Louis, MO. Process Mining of Clinical Workflows for Quality and Process Improvement

Dotted Chart and Control-Flow Analysis for a Loan Application Process

Supporting the Workflow Management System Development Process with YAWL

Feature. Applications of Business Process Analytics and Mining for Internal Control. World

CHAPTER 1 INTRODUCTION

Supporting the BPM lifecycle with FileNet

Towards Cross-Organizational Process Mining in Collections of Process Models and their Executions

Business Process Discovery

Intelligent Process Management & Process Visualization. TAProViz 2014 workshop. Presenter: Dafna Levy

Process simulation. Enn Õunapuu

Towards a Software Framework for Automatic Business Process Redesign Marwa M.Essam 1, Selma Limam Mansar 2 1

Process Mining Tools: A Comparative Analysis

3TU.BSR: Big Software on the Run

Implementing Heuristic Miner for Different Types of Event Logs

Title: Basic Concepts and Technologies for Business Process Management

WoPeD - An Educational Tool for Workflow Nets

Process Mining. Data science in action

Application of Process Mining in Healthcare A Case Study in a Dutch Hospital

Analysis of Service Level Agreements using Process Mining techniques

MTAT Business Process Management (BPM) Lecture 6 Quantitative Process Analysis (Queuing & Simulation)

Using Trace Clustering for Configurable Process Discovery Explained by Event Log Data

Combination of Process Mining and Simulation Techniques for Business Process Redesign: A Methodological Approach

Analytics for Performance Optimization of BPMN2.0 Business Processes

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining. Data Analysis and Knowledge Discovery

Trace Clustering in Process Mining

SOFTWARE PROCESS MINING

Process Mining and Monitoring Processes and Services: Workshop Report

Technology Implications of an Instrumented Planet presented at IFIP WG 10.4 Workshop on Challenges and Directions in Dependability

INCREASE REVENUE PER SQUARE METER WITH ACTIONABLE INSIGHTS. Powered by

BPIC 2014: Insights from the Analysis of Rabobank Service Desk Processes

Web Load Stress Testing

Professional Organization Checklist for the Computer Science Curriculum Updates. Association of Computing Machinery Computing Curricula 2008

FileNet s BPM life-cycle support

Handling Big(ger) Logs: Connecting ProM 6 to Apache Hadoop

MS-10750: Monitoring and Operating a Private Cloud with System Center Required Exam(s) Course Objectives. Price. Duration. Methods of Delivery

Introduction to Business Model Simulation ActiveVOS Designer

WebSphere Business Monitor V7.0 Business space dashboards

Modeling and Analysis of Incoming Raw Materials Business Process: A Process Mining Approach

Structural Detection of Deadlocks in Business Process Models

Reducing or increasing the number of people assigned to an activity. A sudden increase of the number of instances flowing through the process.

Maximize Warehouse Management Systems with Process Mining Technology

Process mining challenges in hospital information systems

Radiation Oncology Patient & Family Guide

Big Data Analytics in Mobile Environments

TAYLOR II MANUFACTURING SIMULATION SOFTWARE

Nr.: Fakultät für Informatik Otto-von-Guericke-Universität Magdeburg

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot

A Biologically Inspired Approach to Network Vulnerability Identification

Flexible Web Visualization for Alert-Based Network Security Analytics

Workflow Support for the Healthcare Domain

WebSphere Business Monitor V6.2 Business space dashboards

Software Requirements Specification. Schlumberger Scheduling Assistant. for. Version 0.2. Prepared by Design Team A. Rice University COMP410/539

Business Process Management: A personal view

EDIminer: A Toolset for Process Mining from EDI Messages

Process Mining Using BPMN: Relating Event Logs and Process Models

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Business Process Quality Metrics: Log-based Complexity of Workflow Patterns

SIMULATION STANDARD FOR BUSINESS PROCESS MANAGEMENT. 15 New England Executive Park The Oaks, Clews Road

Cisco Change Management: Best Practices White Paper

Discovering Stochastic Petri Nets with Arbitrary Delay Distributions From Event Logs

BPMN and Simulation. L. J. Enstone & M. F. Clark The Lanner Group April 2006

10g versions followed on separate paths due to different approaches, but mainly due to differences in technology that were known to be huge.

Improving Decision Making and Managing Knowledge

Process Mining and Fraud Detection

Transcription:

Process Mining Data Science in Action Wil van der Aalst Scientific director of the DSC/e Dutch Data Science Summit, Eindhoven, 4-5-2014.

Process Mining Data Science in Action https://www.coursera.org/course/procmin

statistics data mining machine learning stochastics process mining databases algorithms data science large scale distributed computing industrial engineering visualization visual analytics behavioral/ social sciences privacy domain knowledge

statistics data mining machine learning stochastics process mining databases algorithms data science large scale distributed computing industrial engineering visualization visual analytics behavioral/ social sciences privacy domain knowledge

business process management business process reengineering process science statistics stochastics data mining machine learning process mining databases algorithms data science large scale distributed computing industrial engineering visualization visual analytics behavioral/ social sciences privacy domain knowledge model checking formal methods concurrency Petri nets BPMN

Internet of Events

Internet of Events: 4 sources of event data Internet of Events

Internet of Events: 4 sources of event data Internet of Content Big Data Internet of Events

Internet of Events: 4 sources of event data Internet of Content Internet of People Big Data social Internet of Events

Internet of Events: 4 sources of event data Internet of Content Internet of People Internet of Things Big Data social cloud Internet of Events

Internet of Events: 4 sources of event data Internet of Content Internet of People Internet of Things Internet of Places Big Data social cloud mobility Internet of Events

Starting point for process mining: Event data student name course name exam date mark Peter Jones Business Information systems 16-1-2014 8 Sandy Scott Business Information systems 16-1-2014 5 Bridget White Business Information systems 16-1-2014 9 John Anderson Business Information systems 16-1-2014 8 Sandy Scott BPM Systems 17-1-2014 7 Bridget White BPM Systems 17-1-2014 8 Sandy Scott Process Mining 20-1-2014 5 Bridget White Process Mining 20-1-2014 9 John Anderson Process Mining 20-1-2014 8 case id activity name timestamp other data every row is an event (here: an exam attempt)

Another event log: order handling order number activity timestamp user product quantity 9901 register order 22-1-2014@09.15 Sara Jones iphone5s 1 9902 register order 22-1-2014@09.18 Sara Jones iphone5s 2 9903 register order 22-1-2014@09.27 Sara Jones iphone4s 1 9901 check stock 22-1-2014@09.49 Pete Scott iphone5s 1 9901 ship order 22-1-2014@10.11 Sue Fox iphone5s 1 9903 check stock 22-1-2014@10.34 Pete Scott iphone4s 1 9901 handle payment 22-1-2014@10.41 Carol Hope iphone5s 1 9902 check stock 22-1-2014@10.57 Pete Scott iphone5s 2 9902 cancel order 22-1-2014@11.08 Carol Hope iphone5s 2 case id activity name timestamp resource other data

Another event log: patient treatment patient activity timestamp doctor age cost 5781 make X-ray 23-1-2014@10.30 Dr. Jones 45 70.00 5541 blood test 23-1-2014@10.18 Dr. Scott 61 40.00 5833 blood test 23-1-2014@10.27 Dr. Scott 24 40.00 5781 blood test 23-1-2014@10.49 Dr. Scott 45 40.00 5781 CT scan 23-1-2014@11.10 Dr. Fox 45 1200.00 5833 surgery 23-1-2014@12.34 Dr. Scott 24 2300.00 5781 handle payment 23-1-2014@12.41 Carol Hope 45 0.00 5541 radiation therapy 23-1-2014@13.57 Dr. Jones 61 140.00 5541 radiation therapy 23-1-2014@13.08 Dr. Jones 61 140.00 case id activity name timestamp resource other data

Let's play Case Activity Timestamp Resource 432 register travel request (a) 18-3-2014:9.15 John 432 get support from local manager (b) 18-3-2014:9.25 Mary 432 check budget by finance (d) 19-3-2014:8.55 John 432 decide (e) 19-3-2014:9.36 Sue 432 accept request (g) 19-3-2014:9.48 Mary Play-In Play-Out Replay start register travel request (a) get support from local manager (b) get detailed motivation letter (c) check budget by finance (d) reinitiate request (f) decide (e) accept request (g) reject request (h) end

Play-Out Case Activity Timestamp Resource 432 register travel request (a) 18-3-2014:9.15 John 432 get support from local manager (b) 18-3-2014:9.25 Mary 432 check budget by finance (d) 19-3-2014:8.55 John 432 decide (e) 19-3-2014:9.36 Sue 432 accept request (g) 19-3-2014:9.48 Mary get support from local manager (b) register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reject request (h) end reinitiate request (f)

Play Out: A possible scenario a b d e g XORsplit get support from local manager (b) XORjoin start register travel request (a) XORjoin ANDsplit get detailed motivation letter (c) check budget by finance (d) reinitiate request (f) decide (e) accept request (g) reject request (h) ANDjoin XORsplit XORjoin end Case Activity Timestamp Resource 432 register travel request (a) 18-3-2014:9.15 John 432 get support from local manager (b) 18-3-2014:9.25 Mary 432 check budget by finance (d) 19-3-2014:8.55 John 432 decide (e) 19-3-2014:9.36 Sue 432 accept request (g) 19-3-2014:9.48 Mary

Play Out: Another scenario get support from local manager (b) start register travel request (a) get detailed motivation letter (c) check budget by finance (d) reinitiate request (f) decide (e) accept request (g) reject request (h) end a d c e f b d e h

Play Out: Process model allows for many more scenarios get support from local manager (b) adcefcdefbdefbdeg adceg adbeh adbeh abdeg acdefcdefbdeh abcefbdeh acdefcdefbdeh acbefbdeg abdeg abdeg acbefbdeh acdefcdefbdeh adbeh adceh acbefbdeg adcefcdefbdefbdeg adceh adcefcdefbdefbdeg abdeg start register travel request (a) get detailed motivation letter (c) check budget by finance (d) reinitiate request (f) decide (e) accept request (g) reject request (h) end

Case Activity Timestamp Resource 432 register travel request (a) 18-3-2014:9.15 John 432 get support from local manager (b) 18-3-2014:9.25 Mary 432 check budget by finance (d) 19-3-2014:8.55 John 432 decide (e) 19-3-2014:9.36 Sue 432 accept request (g) 19-3-2014:9.48 Mary Play-In get support from local manager (b) register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reject request (h) end reinitiate request (f)

Loesje van der Aalst desire line

Play In: Simple process allowing for 4 traces abdeg adbeg adbeg adbeh abdeh abdeg abdeh abdeh abdeh abdeh adbeh adbeh adbeh get support from local manager (b) accept request (g) register travel request (a) decide (e) start check budget by finance (d) reject request (h) end

Play In: Process allowing for more traces adcefcdefbdefbdeg abdeg adcefcdefbdefbdeg abcefbdeh acbefbdeg acdefcdefbdeh adceg adbeh adbeh adcefcdefbdefbdeg abdeg abdeg abdeg acbefbdeh acdefcdefbdeh acbefbdeg adceh adbeh adceh acdefcdefbdeh get support from local manager (b) register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reject request (h) end reinitiate request (f)

No modeling needed!

Example Process Discovery (Dutch housing agency, 208 cases, 5987 events)

Example process discovery for hospital (627 gynecological oncology patients, 24331 events)

Replay Case Activity Timestamp Resource 432 register travel request (a) 18-3-2014:9.15 John 432 get support from local manager (b) 18-3-2014:9.25 Mary 432 check budget by finance (d) 19-3-2014:8.55 John 432 decide (e) 19-3-2014:9.36 Sue 432 accept request (g) 19-3-2014:9.48 Mary get support from local manager (b) start register travel request (a) get detailed motivation letter (c) check budget by finance (d) reinitiate request (f) decide (e) accept request (g) reject request (h) end

process model event data

desire line very safe system

Replay a c d e g get support from local manager (b) register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reject request (h) end reinitiate request (f)

Replay a c get support from local manager (b) e g? check budget (d) is missing! register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reject request (h) end reinitiate request (f)

Replay a c h d e g get support from local manager (b)? reject request (h) is impossible register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reject request (h) end reinitiate request (f)

Conformance Checking (WOZ objections Dutch municipality, 745 objections, 9583 event, f= 0.988)

Replay with timestamps a 9.15 c 9.20 d 9.35 e 10.15 g 11.30 start 9.15 register travel request (a) get support from local manager (b) 9.20 5 55 get detailed motivation letter (c) check budget by finance (d) 20 40 9.35 reinitiate request (f) 10.15 decide (e) 75 11.30 accept request (g) reject request (h) end

Replay with timestamps for many traces frequencies of paths frequencies of activities get support from local manager (b) waiting times and other delays between activities register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reinitiate request (f) durations of activities reject request (h) end

Performance Analysis Using Replay (WOZ objections Dutch municipality, 745 objections, 9583 event, f= 0.988)

Overview world business processes people machines components organizations models analyzes Play-Out supports/ controls specifies configures implements analyzes software system records events, e.g., messages, transactions, etc. (process) model discovery conformance Play-In event logs enhancement Replay

Process mining toolbox

examine thoroughly register request examine casually decide pay compensation start check ticket reject request end reinitiate request Process models can be seen as "process maps"

What we can learn from maps abstraction: leaving out insignificant roads and towns aggregation: smaller entities are amalgamated into larger ones (suburbs and cities) layout: positioning of elements has a clear meaning size and color: highlight more important entities (e.g. highways have a different color)

Compare process models to maps get support from local manager (b) start register travel request (a) abstraction? get detailed motivation letter (c) check budget by finance (d) reinitiate request (f) decide (e) accept request (g) reject request (h) size and color? end b aggregation? start A a register request c1 c2 examine thoroughly A c examine casually d check ticket c3 c4 e decide f M c5 reinitiate request g pay compensation h reject request end layout?

Can we see what matters most? get support from local manager (b) metropolis or village? register travel request (a) get detailed motivation letter (c) decide (e) accept request (g) start check budget by finance (d) reject request (h) end reinitiate request (f) highway or dirt road?

"the map" does not exist

Zoom

Subway map

Bicycle map

a map is a view on reality map reality same for process models

Model provides a view on reality (event data), just like a map!

Multiple views depending on purpose (performance, compliance, training, etc.).

breathing life into process models otherwise they end up in some drawer

Project on maps: traffic jams real estate for sale location of trucks/trains crime rates Project on process models: bottlenecks deviations costs

Examples

Not that new Charles Minard's 1869 chart showing the number of men in Napoleon s 1812 Russian campaign army, their movements, as well as the temperature they encountered on the return path. 422.000 175.000 100.000 10.000 24.000

Actively using process models

What can we lean from navigation devices? detect prediction recommendation

Driven by maps, historic information, and current information. Flexible: Adapts to circumstances and does not force the driver to take a particular route. Can your information system do this?

Conclusion Process models are like maps! Connecting event data and process models! better models live models

Positioning process mining process model analysis (simulation, verification, optimization, gaming, etc.) performanceoriented questions, problems and solutions process mining complianceoriented questions, problems and solutions data-oriented analysis (data mining, machine learning, business intelligence)

data science process science