Predicting Deadline Transgressions Using Event Logs
|
|
|
- Emma Foster
- 10 years ago
- Views:
Transcription
1 Predicting Deadline Transgressions Using Event Logs Anastasiia Pika 1, Wil M. P. van der Aalst 2,1, Colin J. Fidge 1, Arthur H. M. ter Hofstede 1,2, and Moe T. Wynn 1 1 Queensland University of Technology, Brisbane, Australia {a.pika,c.fidge,a.terhofstede,m.wynn}@qut.edu.au 2 Eindhoven University of Technology, Eindhoven, The Netherlands {w.m.p.v.d.aalst}@tue.nl Abstract. E ective risk management is crucial for any organisation. One of its key steps is risk identification, but few tools exist to support this process. Here we present a method for the automatic discovery of a particular type of process-related risk, the danger of deadline transgressions or overruns, based on the analysis of event logs. We define a set of time-related process risk indicators, i.e., patterns observable in event logs that highlight the likelihood of an overrun, and then show how instances of these patterns can be identified automatically using statistical principles. To demonstrate its feasibility, the approach has been implemented as a plug-in module to the process mining framework ProM and tested using an event log from a Dutch financial institution. 1 Introduction E ective risk management is crucial for organisations. ISO Guide 73:2009 defines risk as the e ect of uncertainty on objectives where e ect is a deviation from the expected positive and/or negative [3]. One of the most important aspects of risk management is risk identification [7]. Traditional risk management approaches o er only high-level guidance about risk identification methods and rely on the knowledge of domain experts [7]. Accordingly, our goal is to show how the data recorded in event logs by contemporary workflow management systems can be exploited for the purpose of risk identification. Various approaches for predicting timeliness have been proposed in the literature [8, 9] and serve as a starting point for our work. Van der Aalst et al. s approach [8] builds an annotated transition system and remaining process time is then predicted based on the average of earlier cases visiting the same state. Van Dongen et al. s approach [9] predicts the remaining cycle time of a case by using non-parametric regression based on case-related data as the predictor variables. A framework for identification and analysis of the operational risks associated with single business process activities, as well as a whole process, was proposed by Jallow et al. [4]. Wickboldt et al. proposed a framework that makes use of a process model and process execution data from historical records for risk prediction [10]. The use of process mining for the identification of transactional Page 77
2 fraud risk was proposed by Jans et al. [5]. Overall, our approach di ers from previous work in that: it does not require as an input risk indicators defined by experts or pre-classified data [4, 10]; it is not restricted to transactional fraud risk [5]; and it focuses on identifying the risk of not meeting a deadline rather than estimating the remaining cycle time of a case [8, 9]. Since our approach is based on actual data in event logs, it focuses on processrelated risks only. We refer to a risk as process-related if its root cause is any combination of process behaviour (notably the activities performed and their sequence), resource behaviour (e.g., resource availability, capabilities and interaction patterns) or case-related data. Process-related risks can jeopardise the achievement of process goals in terms of cost, timeliness or the quality of outputs [4]. In this paper we consider only one type of risk, the likelihood that cases do not meet their deadline, however our general strategy is not restricted to time-related risks. Our approach consists of three steps: 1) definition of Process Risk Indicators (PRIs); 2) devising a way to identify instances of risk patterns in a log; and 3) defining a predictor function that characterises the risk of a case failing (from its local characteristics only). 2 Risk Identification Method Before introducing our Process Risk Indicators (PRIs), we first introduce some notations. Let denote a run of a process model. Random variable X denotes a case s outcome in terms of timeliness per run. In this paper, we assume that X takes one of two possible values: 1 if a case is delayed and 0 if it is completed in time. Per run there is cumulative distribution function F such that F (x) =P (X apple x) for X. In this way the risk of case delay can be quantified. Function F captures both impact and likelihood. Assuming that a process is in a steady state there exists such a function F for all runs. Our goal is to define a function G that predicts the value of X, i.e., we would like to minimize the expected value of the di erence X G. Function G is based on a few local characteristics of. Let E denote the set of all possible events. A trace is a sequence of events 2E. An event log L is a set of traces. We assume that each event has the following attributes: an activity name, atime stamp, aresource and a transaction type (including start and complete). Each case is described by a trace 2 L which can be related to a process model run. Using indicators for risk monitoring is a common practice in areas such as safety and fraud detection, so we use risk indicators for identification of process-related risks. We define a Process Risk Indicator as a pattern observable in an event log whose presence indicates a higher likelihood of some processrelated risk. In this paper we consider only the risk of a case overrun. Our aim is to identify domain-independent indicators that can be identified by analysing event logs and do not require any additional information, e.g. a process model. We have defined five time-related PRIs. PRI 1: Abnormal activity execution time. A case contains an activity whose duration is significantly higher than its normal duration. Page 78
3 PRI 2: Abnormal waiting time. Activity execution is not started for an abnormally long period of time after it has been enabled. PRI 3: Multiple activity repetitions. An activity is repeated multiple times in a case. PRI 4: Atypical activities. A case contains an activity that has not been performed often previously. PRI 5: Multiple resource involvement. The number of resources involved in a case significantly exceeds the norm. Our method for PRI discovery is based on unsupervised statistical techniques for outlier identification. They have the advantage of not requiring pre-classified data samples for learning. We use the sample standard deviations approach for outlier detection which assumes that the sampled values follow a normal distribution. A cut-o threshold for a normally distributed population is usually defined as µ ± 2 (for a 95% confidence interval). Observations whose values are outside this range are considered outliers. If a sample contains extreme outliers a cut-o threshold defined by the mean x and standard deviation s is often unnecessarily biased, so for a normally distributed population the median x is a robust estimator for x and a robust estimator for s is 1.483MAD [6]. Our method for PRI identification consists of two steps: (1) Identify a cut-o threshold by analysing the given event log; and (2) For a given case (represented by a trace) identify outliers using the learned threshold. For each trace 2 L we introduce attributes for each risk indicator n, denoted PRI n. These attributes are used by the risk identification method to store information about the indicators found in a trace. Attribute PRI n is 1 if indicator n is found, and 0 otherwise. Following Zhang et al. [11], we assume that activity durations follow a lognormal distribution, therefore logarithms of activity durations approximately follow a normal distribution. To identify the presence of PRI 1 in a trace belonging to a run of the process model, the following procedure is followed. For every activity a occurring in at least one trace corresponding to : create a sample x of logarithms of the durations of all occurrences of a in traces corresponding to (di erence between complete and start events); calculate a cut-o threshold t = x +2s; for a given activity instance compare logarithm of its duration with the threshold t and if it exceeds the threshold set the value of the corresponding case s attribute PRI 1 = 1. A similar procedure is followed for other PRIs. For PRI 2 we also assume that waiting times follow a log-normal distribution [11]. The waiting time is calculated as the di erence between the end time and the start time of two consecutive activities in a log. Importantly, this assumption may not always be true. For PRI 3 and PRI 5 we assume that the number of activity executions in a case and the number of resources involved in a case follow a normal distribution. An activity is considered atypical (PRI 4 ) if it has been executed in fewer than a certain number of cases in the log. The threshold t is an input parameter that represents the fraction of cases where a particular activity has been executed. We define a predictor function G that estimates the risk level of a case based on the risk indicators it exhibits. Thus binary function G predicts a delay if Page 79
4 any of the indicators is found in a case. We have also defined a function Score that returns a suspicion score based on the number of identified indicators for each case. A high suspicion score means that many indicators were found in a case, and can be used to calibrate risk alert levels. Let be a trace that represents a given case, (PRI n ) denote the value of attribute PRI n of trace, {PRI 1,...,PRI k } be a set of k PRIs, and w i denote the weight chosen for indicator PRI i : G( )= k W i=1 P (PRI i ); Score( )= k w i (PRI i ) In our current implementation once a risk indicator is identified we update the corresponding attribute of a trace. Functions G and Score are calculated for each complete trace and the values are compared with actual case durations to evaluate the performance of the functions. i=1 3 Experimental Results Our approach has been implemented as a plug-in of the process mining framework ProM 6. Its main functionality is to identify occurrences of our five PRIs in a given log and to thus predict the likelihood of a case being delayed. Predicted values are then compared with the actual outcome of a case to evaluate the performance of the predictor functions. In order to isolate traces corresponding to di erent process model runs the plug-in uses either the existing ProM 6 replay plug-in [1] or the trace clustering plug-in [2] (if the process model is not available). We evaluated our approach using an event log which represents the application process for a personal loan or overdraft from a Dutch financial institution given for the BPI Challenge The log contains 13,087 traces in total and we first filtered this log to produce 934 traces suited to our experimental purposes. The plug-in that uses the trace clustering was applied. The filtered log was grouped into 12 clusters with the total number of traces in each cluster ranging from 20 to 206. After clustering, the traces in each cluster were put into either a training set (used to learn cut-o thresholds) or a test set. For each cluster within the training set we estimated the normal case duration as x MAD. Cases whose durations exceeded this value were considered to be delayed. Table 1 shows the experimental results for the test set of 462 traces. To evaluate the quality of predictions we used the mean absolute error (MAE). This is calculated as 1 n P n i=1 p i r i for both delayed cases (yielding the MAE for false negatives) and for cases that are in time (yielding the MAE for false positives), where n is the number of cases in each category and p i and r i denote predicted and real values respectively. We calculated the MAE separately for delayed cases and cases that are on time, because it is often important to distinguish between di erent types of errors, both false-negatives and false-positives, as their impact 3 BPI Challenge doi: /uuid:3926db30-f aebc e91f Page 80
5 Table 1. Experimental results showcasing the predictive value of five process risk indicators (PRIs) on the test set of the BPI Challenge event log. 5PRIs PRI 1 PRI 2 PRI 3 PRI 4 PRI 5 Delayed In Time TN FN FP TP TN FP TN FP TN FP TN FP TN FP Traces % 76% 24% 51% 49% 28% 27% 0% 4% 66% 28% 0% 0.5% 10% 6% Legend: TN True Negatives; FN False Negatives; FP False Positives; TP True Positives on business performance can be very di erent. We can observe that the MAE for delayed cases with 5 PRIs is 0.24, i.e., the predictor function estimated correctly the outcome of 76% of delayed cases ( True Negatives in Table 1). On the other hand, the MAE for the cases that are not delayed is 0.51 ( False Positives in Table 1). From further analysis, we observed that 74% of the 221 cases that were falsely predicted as delayed have durations that are very close to the cuto threshold (the di erence is lower than 5% of assumed normal case duration). From the individual PRI results, we can see that for this particular log almost all predicted problems ( True Negatives in Table 1) are based on observations of PRIs 1, 3 and 5. We have also analysed the ability of PRIs to provide operational support. For this particular event log, we were able to identify the presence of PRIs 1, 3 and 4 early during a case s execution, while PRIs 2 and 5 for most of the cases could only be discovered after half of the normal case duration for the run corresponding to that case had passed. Table 1 focussed on the results from our first predictor function, G. We also tested the weighted Score function (with w i = 1 for all PRIs) and found that for most of the cases predicted as delayed just one of the indicators was discovered (64% of correctly predicted cases and 76% of falsely predicted cases). This reveals that the suspicion attached to these poor results of G was actually very low. After examining the BPI Challenge event log we noted certain log characteristics that may have influenced the presented results and discovered opportunities for the improvement of the risk identification method. The durations of the cases assigned to a cluster did not significantly deviate from the cut-o thresholds, thus there were very few outlier cases. Also, the number of traces in some clusters were too small to get statistically significant results. Many activities have very small durations compared to the total case duration. Discarding durations whose values are lower than some predefined threshold may help to filter out false positive predictions. The event log used does not contain start events recorded for all activities. To be able to work with the event logs that do not contain start events we can use an indicator PRI 6: Abnormal sub-process duration that considers both activity service and waiting time (sub-process durations are calculated as the time di erence between two consecutive complete events). Applying PRI 6 and PRI 3 v.2 (that considers the absolute values of repetition durations) we were able to correctly estimate the outcome of 86% of delayed cases and 30% of cases in time were falsely predicted as delayed. Page 81
6 4 Conclusions We have presented a new approach for predicting whether or not a case will meet its deadline. We first defined relevant Process Risk Indicators and then used statistical methods to identify their presence in event logs. Our initial results indicate that further work is needed to properly calibrate the analysis, perhaps on a process-specific basis, to minimise the annoyance of false-positive warnings and the more serious threat of false-negative alert failures. (As noted above, the data set available to us for experimentation was not well-suited to our purposes. We have recently obtained a larger data set from an Insurance Company and will use it for experiments.) Although we only focused on the risk of case overruns in this paper, we believe that the overall strategy is suitable for any quantifiable type of risk, such as financial losses or low-quality outputs. Acknowledgement. This research is funded by the ARC Discovery Project Risk-aware Business Process Management (DP ). References 1. A. Adriansyah, B.F. van Dongen, and W.M.P. van der Aalst. Conformance checking using cost-based fitness analysis. In Enterprise Distributed Object Computing Conference (EDOC), th IEEE International, pages55 64.IEEE, R. Bose and W.M.P. van der Aalst. Context aware trace clustering: Towards improving process mining results. In Proceedings of the SIAM International Conference on Data Mining, SDM, pages , International Organization for Standardization. Risk management: vocabulary = Management du risque: vocabulaire (ISO guide 73). Geneva, A.K. Jallow, B. Majeed, K. Vergidis, A. Tiwari, and R. Roy. Operational risk analysis in business processes. BT Technology Journal, 25(1): , M. Jans, N. Lybaert, K. Vanhoof, and J.M. van der Werf. A business process mining application for internal transaction fraud mitigation. Expert Systems with Applications, 38(10): , P.J. Rousseeuw. Robust estimation and identifying outliers. In Handbook of Statistical methods for Engineers and Scientists, chapter 16. McGraw-Hill, New York, Standards Australia and Standards New Zealand. Risk management: principles and guidelines (AS/NZS ISO 31000:2009). Sydney, NSW, Wellington, NZ, 3rd edition, W.M.P. van der Aalst, M.H. Schonenberg, and M. Song. Time prediction based on process mining. Information Systems, 36(2): , B. van Dongen, R. Crooy, and W.M.P. van der Aalst. Cycle time prediction: When will this case finally be finished? On the Move to Meaningful Internet Systems: OTM 2008, pages , J.A. Wickboldt, L.A. Bianchin, R.C. Lunardi, L.Z. Granville, L.P. Gaspary, and C. Bartolini. A framework for risk assessment based on analysis of historical information of workflow execution in it systems. Computer Networks, 55(13): , P. Zhang and N. Serban. Discovery, visualization and performance analysis of enterprise workflow. Computational statistics & data analysis, 51(5): , Page 82
Profiling Event Logs to Configure Risk Indicators for Process Delays
Profiling Event Logs to Configure Risk Indicators for Process Delays Anastasiia Pika 1, Wil M. P. van der Aalst 2,1, Colin J. Fidge 1, Arthur H. M. ter Hofstede 1,2, and Moe T. Wynn 1 1 Queensland University
Towards Cross-Organizational Process Mining in Collections of Process Models and their Executions
Towards Cross-Organizational Process Mining in Collections of Process Models and their Executions J.C.A.M. Buijs, B.F. van Dongen, W.M.P. van der Aalst Department of Mathematics and Computer Science, Eindhoven
Mercy Health System. St. Louis, MO. Process Mining of Clinical Workflows for Quality and Process Improvement
Mercy Health System St. Louis, MO Process Mining of Clinical Workflows for Quality and Process Improvement Paul Helmering, Executive Director, Enterprise Architecture Pete Harrison, Data Analyst, Mercy
Dotted Chart and Control-Flow Analysis for a Loan Application Process
Dotted Chart and Control-Flow Analysis for a Loan Application Process Thomas Molka 1,2, Wasif Gilani 1 and Xiao-Jun Zeng 2 Business Intelligence Practice, SAP Research, Belfast, UK The University of Manchester,
Combination of Process Mining and Simulation Techniques for Business Process Redesign: A Methodological Approach
Combination of Process Mining and Simulation Techniques for Business Process Redesign: A Methodological Approach Santiago Aguirre, Carlos Parra, and Jorge Alvarado Industrial Engineering Department, Pontificia
Process Mining and Monitoring Processes and Services: Workshop Report
Process Mining and Monitoring Processes and Services: Workshop Report Wil van der Aalst (editor) Eindhoven University of Technology, P.O.Box 513, NL-5600 MB, Eindhoven, The Netherlands. [email protected]
Mining Configurable Process Models from Collections of Event Logs
Mining Configurable Models from Collections of Event Logs J.C.A.M. Buijs, B.F. van Dongen, and W.M.P. van der Aalst Eindhoven University of Technology, The Netherlands {j.c.a.m.buijs,b.f.v.dongen,w.m.p.v.d.aalst}@tue.nl
Application of Process Mining in Healthcare A Case Study in a Dutch Hospital
Application of Process Mining in Healthcare A Case Study in a Dutch Hospital R.S. Mans 1, M.H. Schonenberg 1, M. Song 1, W.M.P. van der Aalst 1, and P.J.M. Bakker 2 1 Department of Information Systems
Process Modelling from Insurance Event Log
Process Modelling from Insurance Event Log P.V. Kumaraguru Research scholar, Dr.M.G.R Educational and Research Institute University Chennai- 600 095 India Dr. S.P. Rajagopalan Professor Emeritus, Dr. M.G.R
Trace Clustering in Process Mining
Trace Clustering in Process Mining M. Song, C.W. Günther, and W.M.P. van der Aalst Eindhoven University of Technology P.O.Box 513, NL-5600 MB, Eindhoven, The Netherlands. {m.s.song,c.w.gunther,w.m.p.v.d.aalst}@tue.nl
Supporting the BPM lifecycle with FileNet
Supporting the BPM lifecycle with FileNet Mariska Netjes Hajo A. Reijers Wil. M.P. van der Aalst Outline Introduction Evaluation approach Evaluation of FileNet Conclusions Business Process Management Supporting
Feature. Applications of Business Process Analytics and Mining for Internal Control. World
Feature Filip Caron is a doctoral researcher in the Department of Decision Sciences and Information Management, Information Systems Group, at the Katholieke Universiteit Leuven (Flanders, Belgium). Jan
Process Mining by Measuring Process Block Similarity
Process Mining by Measuring Process Block Similarity Joonsoo Bae, James Caverlee 2, Ling Liu 2, Bill Rouse 2, Hua Yan 2 Dept of Industrial & Sys Eng, Chonbuk National Univ, South Korea jsbae@chonbukackr
PM 2 : a Process Mining Project Methodology
PM 2 : a Process Mining Project Methodology Maikel L. van Eck, Xixi Lu, Sander J.J. Leemans, and Wil M.P. van der Aalst Eindhoven University of Technology, The Netherlands {m.l.v.eck,x.lu,s.j.j.leemans,w.m.p.v.d.aalst}@tue.nl
Implementing Heuristic Miner for Different Types of Event Logs
Implementing Heuristic Miner for Different Types of Event Logs Angelina Prima Kurniati 1, GunturPrabawa Kusuma 2, GedeAgungAry Wisudiawan 3 1,3 School of Compuing, Telkom University, Indonesia. 2 School
Using Trace Clustering for Configurable Process Discovery Explained by Event Log Data
Master of Business Information Systems, Department of Mathematics and Computer Science Using Trace Clustering for Configurable Process Discovery Explained by Event Log Data Master Thesis Author: ing. Y.P.J.M.
Process mining challenges in hospital information systems
Proceedings of the Federated Conference on Computer Science and Information Systems pp. 1135 1140 ISBN 978-83-60810-51-4 Process mining challenges in hospital information systems Payam Homayounfar Wrocław
Discovering Stochastic Petri Nets with Arbitrary Delay Distributions From Event Logs
Discovering Stochastic Petri Nets with Arbitrary Delay Distributions From Event Logs Andreas Rogge-Solti 1 and Wil M.P. van der Aalst 2 and Mathias Weske 1 1 Business Process Technology Group, Hasso Plattner
Performance Measures in Data Mining
Performance Measures in Data Mining Common Performance Measures used in Data Mining and Machine Learning Approaches L. Richter J.M. Cejuela Department of Computer Science Technische Universität München
Business Process Modeling
Business Process Concepts Process Mining Kelly Rosa Braghetto Instituto de Matemática e Estatística Universidade de São Paulo [email protected] January 30, 2009 1 / 41 Business Process Concepts Process
Using Process Mining to Bridge the Gap between BI and BPM
Using Process Mining to Bridge the Gap between BI and BPM Wil van der alst Eindhoven University of Technology, The Netherlands Process mining techniques enable process-centric analytics through automated
CCaaS: Online Conformance Checking as a Service
CCaaS: Online Conformance Checking as a Service Ingo Weber 1, Andreas Rogge-Solti 2, Chao Li 1, and Jan Mendling 2 1 NICTA, Sydney, Australia [email protected] 2 Wirtschaftsuniversität Wien,
Discovering User Communities in Large Event Logs
Discovering User Communities in Large Event Logs Diogo R. Ferreira, Cláudia Alves IST Technical University of Lisbon, Portugal {diogo.ferreira,claudia.alves}@ist.utl.pt Abstract. The organizational perspective
Process Mining. ^J Springer. Discovery, Conformance and Enhancement of Business Processes. Wil M.R van der Aalst Q UNIVERS1TAT.
Wil M.R van der Aalst Process Mining Discovery, Conformance and Enhancement of Business Processes Q UNIVERS1TAT m LIECHTENSTEIN Bibliothek ^J Springer Contents 1 Introduction I 1.1 Data Explosion I 1.2
Model Discovery from Motor Claim Process Using Process Mining Technique
International Journal of Scientific and Research Publications, Volume 3, Issue 1, January 2013 1 Model Discovery from Motor Claim Process Using Process Mining Technique P.V.Kumaraguru *, Dr.S.P.Rajagopalan
Relational XES: Data Management for Process Mining
Relational XES: Data Management for Process Mining B.F. van Dongen and Sh. Shabani Department of Mathematics and Computer Science, Eindhoven University of Technology, The Netherlands. B.F.v.Dongen, [email protected]
ProM 6 Tutorial. H.M.W. (Eric) Verbeek mailto:[email protected] R. P. Jagadeesh Chandra Bose mailto:j.c.b.rantham.prabhakara@tue.
ProM 6 Tutorial H.M.W. (Eric) Verbeek mailto:[email protected] R. P. Jagadeesh Chandra Bose mailto:[email protected] August 2010 1 Introduction This document shows how to use ProM 6 to
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
Generation of a Set of Event Logs with Noise
Generation of a Set of Event Logs with Noise Ivan Shugurov International Laboratory of Process-Aware Information Systems National Research University Higher School of Economics 33 Kirpichnaya Str., Moscow,
BPIC 2014: Insights from the Analysis of Rabobank Service Desk Processes
BPIC 2014: Insights from the Analysis of Rabobank Service Desk Processes Bruna Christina P. Brandão, Guilherme Neves Lopes, Pedro Henrique P. Richetti Department of Applied Informatics - Federal University
Process Mining in Big Data Scenario
Process Mining in Big Data Scenario Antonia Azzini, Ernesto Damiani SESAR Lab - Dipartimento di Informatica Università degli Studi di Milano, Italy antonia.azzini,[email protected] Abstract. In
Eventifier: Extracting Process Execution Logs from Operational Databases
Eventifier: Extracting Process Execution Logs from Operational Databases Carlos Rodríguez 1, Robert Engel 2, Galena Kostoska 1, Florian Daniel 1, Fabio Casati 1, and Marco Aimar 3 1 University of Trento,
Analysis of Service Level Agreements using Process Mining techniques
Analysis of Service Level Agreements using Process Mining techniques CHRISTIAN MAGER University of Applied Sciences Wuerzburg-Schweinfurt Process Mining offers powerful methods to extract knowledge from
Business Process Measurement in small enterprises after the installation of an ERP software.
Business Process Measurement in small enterprises after the installation of an ERP software. Stefano Siccardi and Claudia Sebastiani CQ Creativiquadrati snc, via Tadino 60, Milano, Italy http://www.creativiquadrati.it
Business Process Mining for Internal Fraud Risk Reduction: Results of a Case Study
Business Process Mining for Internal Fraud Risk Reduction: Results of a Case Study Mieke Jans, Nadine Lybaert, and Koen Vanhoof Hasselt University, Agoralaan Building D, 3590 Diepenbeek, Belgium http://www.uhasselt.be/
BUsiness process mining, or process mining in a short
, July 2-4, 2014, London, U.K. A Process Mining Approach in Software Development and Testing Process: A Case Study Rabia Saylam, Ozgur Koray Sahingoz Abstract Process mining is a relatively new and emerging
SAS Fraud Framework for Banking
SAS Fraud Framework for Banking Including Social Network Analysis John C. Brocklebank, Ph.D. Vice President, SAS Solutions OnDemand Advanced Analytics Lab SAS Fraud Framework for Banking Agenda Introduction
Configuring IBM WebSphere Monitor for Process Mining
Configuring IBM WebSphere Monitor for Process Mining H.M.W. Verbeek and W.M.P. van der Aalst Technische Universiteit Eindhoven Department of Mathematics and Computer Science P.O. Box 513, 5600 MB Eindhoven,
Aligning Event Logs and Declarative Process Models for Conformance Checking
Aligning Event Logs and Declarative Process Models for Conformance Checking Massimiliano de Leoni, Fabrizio M. Maggi, and Wil M. P. van der Aalst Eindhoven University of Technology, Eindhoven, The Netherlands
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be
Chapter 12 Analyzing Spaghetti Processes
Chapter 12 Analyzing Spaghetti Processes prof.dr.ir. Wil van der Aalst www.processmining.org Overview Chapter 1 Introduction Part I: Preliminaries Chapter 2 Process Modeling and Analysis Chapter 3 Data
Learning Business Rules for Adaptive Process Models
Learning Business Rules for Adaptive Process Models Hans Friedrich Witschel, Tuan Q. Nguyen, Knut Hinkelmann Fachhochschule Nordwestschweiz FHNW Olten, Switzerland [email protected], [email protected],
PLG: a Framework for the Generation of Business Process Models and their Execution Logs
PLG: a Framework for the Generation of Business Process Models and their Execution Logs Andrea Burattin and Alessandro Sperduti Department of Pure and Applied Mathematics University of Padua, Italy {burattin,sperduti}@math.unipd.it
Process Mining Using BPMN: Relating Event Logs and Process Models
Noname manuscript No. (will be inserted by the editor) Process Mining Using BPMN: Relating Event Logs and Process Models Anna A. Kalenkova W. M. P. van der Aalst Irina A. Lomazova Vladimir A. Rubin Received:
Mining of Agile Business Processes
Artificial Intelligence for Business Agility Papers from the AAAI 2011 Spring Symposium (SS-11-03) Mining of Agile Business Processes Simon Brander 1, Knut Hinkelmann 1, Andreas Martin 1, Barbara Thönssen
Prediction of DDoS Attack Scheme
Chapter 5 Prediction of DDoS Attack Scheme Distributed denial of service attack can be launched by malicious nodes participating in the attack, exploit the lack of entry point in a wireless network, and
CHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION 1.1 Research Motivation In today s modern digital environment with or without our notice we are leaving our digital footprints in various data repositories through our daily activities,
Replaying History. prof.dr.ir. Wil van der Aalst www.processmining.org
Replaying History prof.dr.ir. Wil van der Aalst www.processmining.org Growth of data PAGE 1 Process Mining: Linking events to models PAGE 2 Where did we apply process mining? Municipalities (e.g., Alkmaar,
Simulation-based Evaluation of Workflow Escalation Strategies
Simulation-based Evaluation of Workflow Escalation Strategies Ka-Leong Chan 1, Yain-Whar Si 1, Marlon Dumas 2 1 Faculty of Science and Technology, University of Macau 2 Institute of Computer Science, University
Merging Event Logs with Many to Many Relationships
Merging vent Logs with Many to Many Relationships Lihi Raichelson and Pnina Soffer Department of Information Systems, University of Haifa, Haifa 31905, Israel [email protected], [email protected]
CSC574 - Computer and Network Security Module: Intrusion Detection
CSC574 - Computer and Network Security Module: Intrusion Detection Prof. William Enck Spring 2013 1 Intrusion An authorized action... that exploits a vulnerability... that causes a compromise... and thus
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
Compliance Guide: ASD ISM OVERVIEW
Compliance Guide: ASD ISM OVERVIEW Australian Information Security Manual Mapping to the Principles using Huntsman INTRODUCTION In June 2010, The Australian Government Protective Security Policy Framework
Local outlier detection in data forensics: data mining approach to flag unusual schools
Local outlier detection in data forensics: data mining approach to flag unusual schools Mayuko Simon Data Recognition Corporation Paper presented at the 2012 Conference on Statistical Detection of Potential
Process Mining-based Understanding and Analysis of Volvo IT s Incident and Problem Management Processes The BPI Challenge 2013
Process Mining-based Understanding and Analysis of Volvo IT s Incident and Problem Management Processes The BPI Challenge 2013 Chang Jae Kang 2, Young Sik Kang *,1, Yeong Shin Lee 1, Seonkyu Noh, Hyeong
EMiT: A process mining tool
EMiT: A process mining tool B.F. van Dongen and W.M.P. van der Aalst Department of Technology Management, Eindhoven University of Technology P.O. Box 513, NL-5600 MB, Eindhoven, The Netherlands. [email protected]
Intrusion Detection via Machine Learning for SCADA System Protection
Intrusion Detection via Machine Learning for SCADA System Protection S.L.P. Yasakethu Department of Computing, University of Surrey, Guildford, GU2 7XH, UK. [email protected] J. Jiang Department
Summary and Outlook. Business Process Intelligence Course Lecture 8. prof.dr.ir. Wil van der Aalst. www.processmining.org
Business Process Intelligence Course Lecture 8 Summary and Outlook prof.dr.ir. Wil van der Aalst www.processmining.org Overview Chapter 1 Introduction Part I: Preliminaries Chapter 2 Process Modeling and
Towards a Software Framework for Automatic Business Process Redesign Marwa M.Essam 1, Selma Limam Mansar 2 1
ACEEE Int. J. on Communication, Vol. 02, No. 03, Nov 2011 Towards a Software Framework for Automatic Business Process Redesign Marwa M.Essam 1, Selma Limam Mansar 2 1 Faculty of Information and Computer
Process Mining in Healthcare: Data Challenges when Answering Frequently Posed Questions
Process Mining in Healthcare: Data Challenges when Answering Frequently Posed Questions R.S. Mans 1, W.M.P. van der Aalst 1, R.J.B. Vanwersch 1, A.J. Moleman 2 1 Department of Information Systems, Eindhoven
Supporting the Workflow Management System Development Process with YAWL
Supporting the Workflow Management System Development Process with YAWL R.S. Mans 1, W.M.P. van der Aalst 1 Department of Mathematics and Computer Science, Eindhoven University of Technology, P.O. ox 513,
Benford s Law and Digital Frequency Analysis
Get M.A.D. with the Numbers! Moving Benford s Law from Art to Science BY DAVID G. BANKS, CFE, CIA September/October 2000 Until recently, using Benford s Law was as much of an art as a science. Fraud examiners
3TU.BSR: Big Software on the Run
Summary Millions of lines of code - written in different languages by different people at different times, and operating on a variety of platforms - drive the systems performing key processes in our society.
Using Predictive Analytics to Detect Fraudulent Claims
Using Predictive Analytics to Detect Fraudulent Claims May 17, 211 Roosevelt C. Mosley, Jr., FCAS, MAAA CAS Spring Meeting Palm Beach, FL Experience the Pinnacle Difference! Predictive Analysis for Fraud
Five steps to Enterprise Risk Management
risk decisions 2011 Five steps to Enterprise Risk Management by Val Jonas CEO Risk Decisions Group www.riskdecisions.com management solutions Val Jonas: Five steps to Enterprise Risk Management Five steps
B. Majeed British Telecom, Computational Intelligence Group, Ipswich, UK
The current issue and full text archive of this journal is available at wwwemeraldinsightcom/1463-7154htm A review of business process mining: state-of-the-art and future trends A Tiwari and CJ Turner
Eliminating Over-Confidence in Software Development Effort Estimates
Eliminating Over-Confidence in Software Development Effort Estimates Magne Jørgensen 1 and Kjetil Moløkken 1,2 1 Simula Research Laboratory, P.O.Box 134, 1325 Lysaker, Norway {magne.jorgensen, kjetilmo}@simula.no
Utility-Based Fraud Detection
Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Utility-Based Fraud Detection Luis Torgo and Elsa Lopes Fac. of Sciences / LIAAD-INESC Porto LA University of
Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit
Statistics in Retail Finance Chapter 7: Fraud Detection in Retail Credit 1 Overview > Detection of fraud remains an important issue in retail credit. Methods similar to scorecard development may be employed,
Robustness of a Spoken Dialogue Interface for a Personal Assistant
Robustness of a Spoken Dialogue Interface for a Personal Assistant Anna Wong, Anh Nguyen and Wayne Wobcke School of Computer Science and Engineering University of New South Wales Sydney NSW 22, Australia
Unsupervised learning: Clustering
Unsupervised learning: Clustering Salissou Moutari Centre for Statistical Science and Operational Research CenSSOR 17 th September 2013 Unsupervised learning: Clustering 1/52 Outline 1 Introduction What
Process Mining Data Science in Action
Process Mining Data Science in Action Wil van der Aalst Scientific director of the DSC/e Dutch Data Science Summit, Eindhoven, 4-5-2014. Process Mining Data Science in Action https://www.coursera.org/course/procmin
Flow-based detection of RDP brute-force attacks
Flow-based detection of RDP brute-force attacks Martin Vizváry [email protected] Institute of Computer Science Masaryk University Brno, Czech Republic Jan Vykopal [email protected] Institute of Computer
EFFECTIVE CONSTRUCTIVE MODELS OF IMPLICIT SELECTION IN BUSINESS PROCESSES. Nataliya Golyan, Vera Golyan, Olga Kalynychenko
380 International Journal Information Theories and Applications, Vol. 18, Number 4, 2011 EFFECTIVE CONSTRUCTIVE MODELS OF IMPLICIT SELECTION IN BUSINESS PROCESSES Nataliya Golyan, Vera Golyan, Olga Kalynychenko
Stock Market Forecasting Using Machine Learning Algorithms
Stock Market Forecasting Using Machine Learning Algorithms Shunrong Shen, Haomiao Jiang Department of Electrical Engineering Stanford University {conank,hjiang36}@stanford.edu Tongda Zhang Department of
Decision Mining in Business Processes
Decision Mining in Business Processes A. Rozinat and W.M.P. van der Aalst Department of Technology Management, Eindhoven University of Technology P.O. Box 513, NL-5600 MB, Eindhoven, The Netherlands {a.rozinat,w.m.p.v.d.aalst}@tm.tue.nl
Handling Big(ger) Logs: Connecting ProM 6 to Apache Hadoop
Handling Big(ger) Logs: Connecting ProM 6 to Apache Hadoop Sergio Hernández 1, S.J. van Zelst 2, Joaquín Ezpeleta 1, and Wil M.P. van der Aalst 2 1 Department of Computer Science and Systems Engineering
Monte Carlo analysis used for Contingency estimating.
Monte Carlo analysis used for Contingency estimating. Author s identification number: Date of authorship: July 24, 2007 Page: 1 of 15 TABLE OF CONTENTS: LIST OF TABLES:...3 LIST OF FIGURES:...3 ABSTRACT:...4
Process Mining: Making Knowledge Discovery Process Centric
Process Mining: Making Knowledge Discovery Process Centric Wil van der alst Department of Mathematics and Computer Science Eindhoven University of Technology PO Box 513, 5600 MB, Eindhoven, The Netherlands
An Outlook on Semantic Business Process Mining and Monitoring
An Outlook on Semantic Business Process Mining and Monitoring A.K. Alves de Medeiros 1,C.Pedrinaci 2, W.M.P. van der Aalst 1, J. Domingue 2,M.Song 1,A.Rozinat 1,B.Norton 2, and L. Cabral 2 1 Eindhoven
