Process Mining. Luigi Pontieri Istituto di Calcolo e Reti ad Alte Prestazioni ICAR-CNR Via Bucci 41c, Rende (CS) pontieri@icar.cnr.

Similar documents
ProM Framework Tutorial

ProM 6 Tutorial. H.M.W. (Eric) Verbeek mailto:h.m.w.verbeek@tue.nl R. P. Jagadeesh Chandra Bose mailto:j.c.b.rantham.prabhakara@tue.

Process Mining. ^J Springer. Discovery, Conformance and Enhancement of Business Processes. Wil M.R van der Aalst Q UNIVERS1TAT.

Process Mining Tools: A Comparative Analysis

Model Discovery from Motor Claim Process Using Process Mining Technique

Combination of Process Mining and Simulation Techniques for Business Process Redesign: A Methodological Approach

Process Mining: Using CPN Tools to Create Test Logs for Mining Algorithms

Chapter 12 Analyzing Spaghetti Processes

Process Mining and Network Analysis

Decision Mining in Business Processes

Process Mining Framework for Software Processes

BUsiness process mining, or process mining in a short

CHAPTER 1 INTRODUCTION

Master Thesis September 2010 ALGORITHMS FOR PROCESS CONFORMANCE AND PROCESS REFINEMENT

Life-Cycle Support for Staff Assignment Rules in Process-Aware Information Systems

Process Modelling from Insurance Event Log

Mercy Health System. St. Louis, MO. Process Mining of Clinical Workflows for Quality and Process Improvement

Federico Rajola. Customer Relationship. Management in the. Financial Industry. Organizational Processes and. Technology Innovation.

Dept. of IT in Vel Tech Dr. RR & Dr. SR Technical University, INDIA. Dept. of Computer Science, Rashtriya Sanskrit Vidyapeetha, INDIA

Discovering User Communities in Large Event Logs

Feature. Applications of Business Process Analytics and Mining for Internal Control. World

Data Science. Research Theme: Process Mining

Handling Big(ger) Logs: Connecting ProM 6 to Apache Hadoop

Business Process Modeling

Process Mining The influence of big data (and the internet of things) on the supply chain

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining. Data Analysis and Knowledge Discovery

Business process measurement - data mining. enn@cc.ttu.ee

EMiT: A process mining tool

The ProM framework: A new era in process mining tool support

Using Process Mining to Bridge the Gap between BI and BPM

Data Science Betere processen en producten dankzij (Big) data. Wil van der Aalst

Configuring IBM WebSphere Monitor for Process Mining

Summary and Outlook. Business Process Intelligence Course Lecture 8. prof.dr.ir. Wil van der Aalst.

Workflow Simulation for Operational Decision Support

Process Mining and Monitoring Processes and Services: Workshop Report

A Software Framework for Risk-Aware Business Process Management

Curriculum Vitae et Studiorum

Process Mining. Data science in action

Dotted Chart and Control-Flow Analysis for a Loan Application Process

Analysis of Service Level Agreements using Process Mining techniques

ProM 6 Exercises. J.C.A.M. (Joos) Buijs and J.J.C.L. (Jan) Vogelaar {j.c.a.m.buijs,j.j.c.l.vogelaar}@tue.nl. August 2010

CUSTOMER Presentation of SAP Predictive Analytics

Supporting the Workflow Management System Development Process with YAWL

USING PROCESS MINING FOR ITIL ASSESSMENT: A CASE STUDY WITH INCIDENT MANAGEMENT

Translating Message Sequence Charts to other Process Languages using Process Mining

«Software Open Source come fattore abilitante dei Progetti per le Smart Cities»

Analyzer: An Mining Plug-in for the ProM Framework

Business Process Mining: An Industrial Application

Contents. Dedication List of Figures List of Tables. Acknowledgments

Process Mining Online Assessment Data

Process Mining A Comparative Study

Mining Process Models with Non-Free-Choice Constructs

Catalogo Corsi 2013/2014

SOFTWARE PROCESS MINING

Article. Abstract. This is a pre-print version. For the printed version please refer to

Chapter 4 Getting the Data

Process Mining Of Invabyte And Event Logs

The Open University s repository of research publications and other research outputs

Generation of a Set of Event Logs with Noise

Application of Process Mining in Healthcare A Case Study in a Dutch Hospital

Towards a Software Framework for Automatic Business Process Redesign Marwa M.Essam 1, Selma Limam Mansar 2 1

Workflow Analysis and Design

Dal PDM al PLM, architettura tradizionale e piattaforma Cloud : l'integrazione facilitata dalla nuova tecnologia

UNIVERSITY OF ROME TOR VERGATA

LME - Law Making Environment. Model driven semantic drafting and annotation of legislative sources

Corso: Core Solutions of Microsoft Skype for Business 2015 Codice PCSNET: MSKY-5 Cod. Vendor: Durata: 5

Source code security testing

EDIminer: A Toolset for Process Mining from EDI Messages

Formal Modeling and Analysis by Simulation of Data Paths in Digital Document Printers

Applicazione del Web Semantico alle Imprese

Business Process Management Demystified: A Tutorial on Models, Systems and Standards for Workflow Management

Corso: Microsoft Project Server 2010 Technical Boot Camp Codice PCSNET: AAAA-0 Cod. Vendor: - Durata: 5

Methods and Technologies for Business Process Monitoring

Lezione 10 Introduzione a OPNET

KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery

Process Mining for Electronic Data Interchange

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION

Process Mining in Big Data Scenario

Relational XES: Data Management for Process Mining

UNIVERSITÀ DI PISA Department of Computer Science. Master s degree in Business Informatics (2 years, 120 ECTS)

Nirikshan: Process Mining Software Repositories to Identify Inefficiencies, Imperfections, and Enhance Existing Process Capabilities

BPIC 2014: Insights from the Analysis of Rabobank Service Desk Processes

Diagramming Techniques:

Process-Aware Information Systems: Design, Enactment, and Analysis

CURRICULUM VITAE. WORK EXPERIENCE (Academic) EDUCATIONAL QUALIFICATIONS. Maria Giovanna Confetto

Process Aware Host-based Intrusion Detection Model

CS 565 Business Process & Workflow Management Systems

Process Mining: A Two-Step Approach using Transition Systems and Regions

Test Suite Generation for Services

A Generic Import Framework For Process Event Logs

UNIVERSITY OF ROME TOR VERGATA

Modeling and Analysis of Incoming Raw Materials Business Process: A Process Mining Approach

Business Process Analysis in Healthcare Environments: a Methodology based on Process Mining

Traffic Prediction in Wireless Mesh Networks Using Process Mining Algorithms

PLG: a Framework for the Generation of Business Process Models and their Execution Logs

Information Management course

Cognos + IBM Customer Experience Suite:

Facilitating Business Process Discovery using Analysis

Business Process Mining for Internal Fraud Risk Reduction: Results of a Case Study

Process Mining Data Science in Action

Transcription:

Process Mining Luigi Pontieri Istituto di Calcolo e Reti ad Alte Prestazioni ICAR-CNR Via Bucci 41c, Rende (CS) pontieri@icar.cnr.it

Argomenti Caratteristiche generali delle tecniche di Process Mining (PM) Il PM come approccio all analisi (ex-post) di processi organizzativi Caratteristiche dei processi e dei dati (log) oggetto dell analisi Obiettivi, potenzialità e problematiche correlate al Process Mining Inquadramento del PM nel ciclo vita dei processi organizzativi Classificazione degli approcci di Process Mining Analysis perspectives: Control-flow, Case, Performances Tasks: Discovery, Extension, Conformance testing Approfondimento su alcune tecniche e di workflow discovery Induzione di Control Flow graphs: algoritmo di base Uno sguardo ad alcuni approcci classici (α-algorithm, HeuristicMiner, Multi-phase, Fuzzy) 2

Argomenti (2) Valutazione e validazione dei modelli (scoperti o pre-esistenti) Conformance Checking Log-based property verification Altri task e metodi di PM Induzione di modelli organizzativi e di social networks (cenni) Tecniche clustering-based per la scoperta di schemi di processo gerarchici/tassonomici Tecniche per l estensione di un modello di processo Ulteriori linee di sviluppo del PM Scoperta di istanze di esecuzione anomale Integrazione del PM con ontologie di processo e di dominio 3

Organizzazione Lezioni Teoria di base Strumenti SW per il Process Mining Esempi di uso della suite open-source ProM Casi di studio Esercitazioni Esercizi sui concetti appresi nelle lezioni Analisi di alcuni dataset di esempio con ProM 4

Materiale didattico Lezioni (slide MS PowerPoint): http://www.icar.cnr.it/pontieri/didattica/pm/slides/ Riferimenti bibliografici I. Witten, E. Frank, Data Mining: Practical Machine Learning Tools with Java Implementation. Morgan Kaufman, 1999 Una serie di articoli scientifici disponibili all indirizzo http://www.icar.cnr.it/pontieri/didattica/pm/papers/ 5

Outline Part I Introduction to Process Mining Context, motivation and goal General characteristics of the analyzed processes and logs Classification of Process Mining approaches Part II Workflow discovery Induction of basic Control Flow graphs Other techniques (α-algorithm, Heuristic Miner, Fuzzy mining) Part III Beyond control-flow mining Organizational mining Social net discovery Extension of workflow models Part IV Evaluation and validation of discovered models Conformance Check Log-based property verification Part V Clustering-based Process Mining Discovery of hierarchical workflow models Discovery of process taxonomies Outlier detection 6

Process Mining Part I Introduction and Basic Concepts Context, motivations, goals Characteristics of the analyzed data Classification of Process Mining approaches Based on slides by Prof. Wil van der Aalst and Dr. Ana Karla A. de Medeiros

Outline Part I Introduction to Process Mining Context, motivation and goal General characteristics of the analyzed processes and logs Classification of Process Mining approaches Part II Workflow discovery Induction of basic Control Flow graphs Other techniques (α-algorithm, Heuristic Miner, Fuzzy mining) Part III Beyond control-flow mining Organizational mining Social net discovery Extension of workflow models Part IV Evaluation and validation of discovered models Conformance Check Log-based property verification Part V Clustering-based Process Mining approaches Discovery of hierarchical workflow models Discovery of process taxonomies Outlier detection 8

Process Mining: basic idea Aims to discover process knowledge based on historical execution data Logs register what happened along past process enactments, and are maintained by diverse kinds of transactional IS (WfMS, ERP, CRM, ) Process Design Implementation / Configuration process enactment Process Knowledge (e.g., Process Models, Business Rules, Execution Patterns) process mining verification abcdfg abcfd abcdfe. 9

Process Mining: basic idea The focus is on the real behavior of the process, rather than on its expected/prescribed behavior Run-time diagnosis Design-time process enactment - process mining Process Discovery/Extension Conformance Testing Log-based Verification implementation/ configuration process design - verification -validation -performance analysis (ex-ante) 10

PM vs. Design-time Workflow Analysis Validation bases on comparing models with requirements/expectations Validating real models is hard, and requires some reflection of reality Verification concerns the correctness/soundness of the model typically used to answer qualitative questions Is there a deadlock possible? Is it possible to successfully handle a specific case? Will all cases terminate eventually? It is possible to execute two tasks in any order? Ex-ante performance analysis Typically regard quantitative aspects How many cases can be handled in 1 hour? What is the average flow time? Common approaches: Simulation, queuing theory Markovian analysis (based on abstraction)

Process Mining vs. Design-time Analysis Process mining uses historical event logs as a reflection of reality behavioral models are linked to real log events Reduces the abstraction gap between model and reality 12

Classification of Process Mining approaches Process Mining can support different kinds of analysis tasks Different kinds of knowledge on process execution can be found Control flow perspective: What is the typical flow of work for the handling of orders? What s the procedure (combination of tasks) followed for orders above 10K? Case perspective: Was the invoice 1203 paid on time? How regular and rush orders differ in the execution flow? Organizational perspective: Which people appear to be working together closely? 13

Process Mining tasks: Discovery Start Workflow Model Register order Prepare shipment (Re)send bill Ship goods Contact customer Receive payment Archive order End Organizational Model Social Network 14

Discovery: an example control-flow model 15

Process Mining tasks: Conformance Check Compliance Process Model Start Register order Prepare shipment (Re)send bill Ship goods Contact customer Receive payment Archive order End Auditing/Security 16

Process Mining tasks: Extension Start Register order Bottlenecks/ Business Rules Process Model Prepare shipment (Re)send bill Ship goods Contact customer Receive payment Archive order End Performance Analysis 17

Extension: example of decision point analysis builds a decision tree for each choice 18

Process Mining vs Data Mining Process Mining is a specialization of Data Mining with a strong business process viewpoint Some traditional DM techniques can be used in the context of PM New techniques have been specifically developed for process mining e.g. the discovery of workflow models 19

Process Mining tools Open-source tools available at www.processmining.org ProM ProMimport 20

ProM architecture 21

ProM 22

Some questions ProM can help answer to What an extent the cases (proc. inst.) comply with a process model? Where are the problems? How frequent is the (non-)compliance? How are the cases actually being executed? Statistics on the execution paths of a given model What is the most frequent path? What is the distribution of all cases over the different paths through the process? What are the routing probabilities for each split node? Statistics on execution performances What is the average/minimum/maximum throughput time of cases? Which paths take too much time on average? How many cases follow these routings? What are the critical sub-paths for these paths? What is the average service time for each task? How much time was spent between any two tasks in the process model? 23

Some questions ProM can help answer to (2) Identification and verification of Business rules What are the business rules in the process model? Are the rules indeed being obeyed? Interaction among people What is the communication structure and dependencies among people? How many transfers happen from one role to another role? Who are important people in the communication flow? Who subcontract work to whom? Who work on the same tasks? 24

ProM workflow management systems case handling / CRM systems ERP systems Outlook Staffware FLOWer SAP R/3 CPN Tools InConcert Vectus BaaN ARIS/ARIS PPM MQ Series Siebel Peoplesoft YAWL Caramba ProM framework common XML format for storing/ exchanging workflow logs Core input/output visualization analysis... Plugins alpha algorithm genetic algorithm Tsinghua alpha algorithm Multi phase algorithms social network miner case data extraction property verifier... External Tools NetMiner Viscovery... 25

Representation of log data: the MXML format task label 26

Event Logs: the MXML format (2) 27

Event Logs: the MXML format (3) Compulsory fields! Fields relevant to the organizational perspective Which fields are useful for case-based analyses? 28

Toy example: paper reviewing Event log: processes process instances events Per event: activity name (event type) (originator) (timestamp) (data) 29

attributes of an event end of activity start of activity start of process instance 30

An equivalent (relational) schema for log events ProM Import allows to convert data from such a database into an MXML file 31