Nearest Neighbour Algorithms for Forecasting Call Arrivals in Call Centers
|
|
|
- Ethel White
- 10 years ago
- Views:
Transcription
1 Nearest Neighbour Algorithms for Forecasting Call Arrivals in Call Centers Sandjai Bhulai, Wing Hong Kan, and Elena Marchiori Vrije Universiteit Amsterdam Faculty of Sciences De Boelelaan 1081a 1081 HV Amsterdam The Netherlands {sbhulai, wing, Abstract In this paper we study a nearest neighbour algorithm for forecasting call arrivals to call centers. The algorithm does not require an underlying model for the arrival rates and it can be applied to historical data without pre-processing it. We show that this class of algorithms provides a more accurate forecast when compared to the conventional method that simply takes averages. The nearest neighbour algorithm with the Pearson correlation distance function is also able to take correlation structures, that are usually found in call center data, into account. Numerical experiments show that this algorithm provides smaller errors in the forecast and better staffing levels in call centers. The results can be used for a more flexible workforce management in call centers. 1 Introduction A call center is a collection of resources providing a telephony interface between a service provider and its customers. The resources consist, among others, of customer service representatives, who talk to customers over the telephone. The service representatives are usually supported by elaborate information and communication (ICT) equipment. Due to the rapid growth of e-commerce, the contact between the service provider and its customers is additionally subject to other channels, such as and the Internet. This general interface between a service provider and its customers is now often referred to as a contact center. Most organizations with customer contact have a call center nowadays, or hire specialized firms to handle their communications with customers through call centers. Current trends are towards an increase in economic scope and workforce (see Gans et al. [8]). Hence, there is an enormous financial interest in call centers that leads to the importance of efficient management of call centers. The efficiency relates to the efficient use of the workforce, since the costs in a call center are dominated by personnel costs. The basis of efficient workforce management in call centers is the well-known Erlang- C model (see Erlang [7]). This queueing model, also referred to as an M/M/c model, 1
2 assumes that the process of incoming calls forms a Poisson process with parameter λ, that the service duration is exponentially distributed with parameter µ, and that there are c service representatives. Based on these assumptions, the model allows one to calculate the waiting time distribution under a given load R = λ/µ of the call center. The result is used to compute the minimum number of service representatives needed to meet a target service level β, i.e., the waiting time objective. Halfin and Whitt [13] suggest the square-root safety-staffing principle, recommending the number of representatives to be c R + β R, where R is the offered load and β R is safety staffing to compensate for stochastic variability. The Erlang-C model is perhaps the simplest model in call center circles. For many applications, however, the model is an over-simplification due to the assumption of a constant arrival rate over the whole day. Common call center practice is to use the stationary independent period-by-period (SIPP) approximation (see Green et al. [12]). The SIPP approximation uses the average arrival rate over a period of 15 or 30 minutes, based on historical data, and the number of service representatives in that period as input to the stationary Erlang-C model to approximate performance in that period. The pointwise stationary approximation (PSA) (see Green and Kolesar [11]) is the limiting version of the SIPP approximation when the period length approaches zero. The stationary models implicitly assume that the time required for the system to relax is small when compared to the length of the period. However, abrupt changes in the arrival rate, or overload situations during one or more periods lead to non-stationary behaviour that must be accounted for. Yoo [19] and Ingolfsson et al. [15] present methods to calculate the transient behaviour by numerically solving the Chapman-Kolmogorov forward equations for the time-varying M t /M/c t queueing system. Yoo [19] and Ingolfsson et al. [14] approximate the continuously varying parameters with small, discrete intervals and use the uniformization method (see Grassmann [10]) to calculate the change in system occupancy from one period to the next. Experiments show that the method of randomization yields results that are close to the exact solutions in general. While the non-stationary models perform well and can be used for workforce management, they assume that the overall arrival rate is known. In practice, the arrival rate is predicted from historical data and is not known with certainty in advance. The risk involved in ignoring this uncertainty can be substantial. A call center that is planning to operate at 95% utilization, can experience an actual utilization of 99.75% with exploding waiting times when the arrival rate turns out to be 5% higher than planned (see Gans et al. [8]). A natural way to deal with this uncertainty is to increase the safety staffing β R, but it remains to be studied how much the increase exactly should be. Call centers generate a vast amount of data from which the history of each call that enters the system can be reconstructed. Statistical data analysis on the records of individual calls reveals structural properties that can be used in determining the call arrival function. Brown et al. [4] have found that there is a significant correlation in call arrivals across different time periods in the same planning horizon. Moreover, the first few hours of a day often provide significant information about the call volumes for the remainder of the day. These findings motivate general arrival rate models such as those suggested by Avradimis et al. [2], Jongbloed and Koole [16], and Steckley et al. [18]. 2
3 The rise in computational power and large call center databases bring forth new forecasting techniques that increase the accuracy of the forecasts. In this paper we present a K-nearest neighbour algorithm for forecasting the arrival rate function dynamically. The method is based on comparing arrival rate functions on different days with the observed pattern so far. The K arrival rate functions that are closest, with respect to some distance function, to the observed pattern are used to forecast the call arrival rate function for the rest of the planning horizon. The algorithm is able to take into account the correlations in call center data found by Brown et al. [4]. Moreover, the algorithm does not require a model and can be applied without pre-processing the historical data. The resulting forecast of the K-nearest neighbour algorithm can be used to create more accurate calculations of the number of representatives that is needed to meet the service level (e.g., by using the non-stationary models). This leads to greater flexibility in the management of the workforce. Next to their job of handling calls, customer representatives also have administrative tasks, meetings, and trainings. These tasks could be re-scheduled or cancelled if the forecast shows a surge in calls that was unprecedented for. Representatives could also attend to other channels in periods that are predicted to be less busy. Similarly, inbound and outbound calls could be mixed in such periods (call blending), so that the workforce is used efficiently (see, e.g., Bhulai and Koole [3] and Gans and Zhou [9]). The outline of the paper is as follows. In Section 2 we give the exact problem formulation. We then continue to present the K-nearest neighbour algorithm, that will be used to solve the problem, in Section 3. A case study with the results of applying this algorithm are presented in Section 4. Finally, Section 5 concludes the paper by summarizing the main results. 2 The Forecasting Problem Consider a call center whose statistics are stored in a large database. Typically, the statistics contain a lot of information on each individual call, such as the starting time, the end time, the waiting time, the handling agent, and much more. In practice, many call centers store summarized historical data only, due to the historically high cost of maintaining and storing large databases even if these reasons are no longer prohibitive. Let us therefore assume that the data is aggregated over a period of length t minutes. Note that when t is sufficiently small, we get the information on individual calls back. For our purpose of forecasting the call arrival rate function, we concentrate on the number of calls in a period of length t. The length of a period t together with the opening hours of the call center define n periods over the day, which we denote by t 1,..., t n. Assume that the database contains m records, i.e., data on aggregated call arrivals of m days. Then we can represent the database by an m n matrix H, where entry h d,i represents the number of calls that occurred on day d in period t i of length t for d = 1,..., m and i = 1,..., n. Data analysis (see, e.g., Brown et al. [4] and Steckley et al. [18]) shows that the shape of the call arrival rate function on a particular day of the week is usually the same over different weeks. In case of an unusual event, e.g., a holiday, the shape differs significantly. 3
4 Therefore, in practice, the data is cleaned first by removing records for weeks containing unusual events, and the matrix H is divided into submatrices H (1),..., H (7) per day of the week. Conventional forecasting algorithms base their forecast on this data by taking the mean number of call arrivals for that period based on the historical values of that day. Thus, the forecast Fi CV for the number of calls in period t i on day j of the week is given by F CV i = 1 k k d=1 h (j) d,i, for i = 1,..., n, where k is the number of records in the submatrix H (j). The conventional algorithm ignores additional structure that is present in the call arrival pattern. From data analysis it is known that there is a significant correlation in call arrivals across different time periods in the same planning horizon. Typically, the first few hours of a day often provide significant information about the call volumes for the remainder of the day. The models suggested in Avradimis et al. [2], Jongbloed and Koole [16], and Steckley et al. [18] use this structure to determine the total number of calls over the day and distribute the calls according to the correlated structure over the different periods. These models, however, mostly focus on estimation rather than on prediction and also rely on the time-consuming data analysis and preparation to determine parameters of the models. The discussion above shows that there is still a need for efficient computational algorithms that do not require the tedious data analysis and use the correlation structures in the call arrivals to yield accurate forecasts. Consequently, the algorithm should also be able to update the forecast as soon as new information on call arrivals is available. A possible way to do this is to compare arrival rate functions of different days with the observed pattern so far. The arrival rate functions that are closest, with respect to some distance function, to the observed pattern can then be used to forecast the call arrival rate function for the rest of the planning horizon. This leads to the class of nearest neighbour algorithms. The distance function can be used to capture the correlation structures in call arrivals. Moreover, the algorithm does not require a model and can be applied without pre-processing or analyzing the historical data. In the next section we explain a version of the K-nearest neighbour algorithm, that we will use to generate forecasts, in greater detail. 3 The K-Nearest Neighbour Algorithm The K-nearest neighbour algorithm (see, e.g., Cover and Hart [5]) is a machine learning technique that belongs to the class of instance based learners. Given a training set, a similarity measure over patterns, and a number K, the algorithm predicts the output of a new instance pattern by combining (e.g., by means of weighted average) the known outputs of its K most similar patterns in the training set. The training data is stored in memory and only used at run time for predicting the output of new instances. Advantages of this technique are the (implicit) construction of a local model for each new instance pattern, and its robustness to the presence of noisy training patterns (cf., e.g., Mitchell [17]). 4
5 Nearest neighbour algorithms have been successfully applied to many pattern recognition problems, such as scene analysis (Duda and Hart [6]) and robot control (Atkeson et al. [1]). Application of K-nearest neighbour algorithms to our forecasting problem in call centers translates into the prediction of the arrival rate function until the end of the planning horizon, based on matching K arrival rate functions to the pattern observed so far. More formally, suppose that we have observed the call arrival rates r 1,..., r x on a specific day in periods t 1,..., t x where x < n. Let us call this information the reference trace r x = (r 1,..., r x ). The nearest neighbour algorithm compares the reference trace to historical data h d,x = (h d,1,..., h d,x ) from the matrix H for d = 1,..., m. The comparison is based on a distance function D( r x, h d,x ), which is defined by a norm on the space of the traces. The K nearest traces with respect to the distance function D are used to generate a forecast for the arrival rates ˆr x+1,..., ˆr n in periods t x+1,..., t n. The result depends on the value of K as well as on the choice of the distance measure D. In the next subsections we will describe two distance functions, the Euclidean distance and the Pearson correlation distance, that we will use in the numerical experiments. The former distance function can be seen as the conventional forecasting algorithm when restricted to the submatrices H (1),..., H (7). The latter distance function is our novel approach to forecasting the call arrival rate function. Each subsection will motivate the choice of the distance function, and describe how to generate forecasts. Euclidean Distance (ED) The Euclidean distance is the most widely used and the most natural distance function to use. It is defined by the Euclidean norm ED( r x, [ x h d,x ) = (r i h d,i ) 2] 1/2. i=1 The effect of using this distance function in the nearest neighbour algorithm is that traces are considered to be near when the arrival rates in the historical data almost exactly match the observed arrival rates in the different periods. By selecting the K nearest traces, say on days d 1,..., d K, the forecast for period t i is given by F ED i = 1 K [ hd1,i + + h dk,i], for i = x + 1,..., n. Note that by restricting to a specific day j of the week, i.e., to the submatrix H (j), we get the conventional forecast Fi CV. Also observe that the distance function is sensitive to trends and days with special events. Hence, this distance function actually requires cleaned historical data without trends and days with special events. Therefore, the forecast needs to be adjusted for these situations. 5
6 Pearson Correlation Distance (PD) The Pearson Correlation distance is based on the correlation coefficient between two vectors. A correlation value close to zero denotes little similarity, whereas a value close to one signifies a lot of similarity. Hence, the definition of the distance function is given by PD( r x, h d,x ) = 1 correlation( r x, h d,x ) = 1 (x 1) x ( i=1 ri mean( r x ) ) ( h d,i mean( h d,x ) ) ( ri mean( r x ) ) 2 x ( i=1 hd,i mean( h d,x ) ) 2, x i=1 where mean( r x ) = 1 x x i=1 r i and mean( h d,x ) = 1 x x i=1 h d,i. Note that the function looks at similarities in the shape of two traces rather than the exact values of the data. Consequently, this function answers the need to capture correlation structures in the call rates. Moreover, it does not require cleaned data, because it is less sensitive to trends and special events. When the K nearest traces have been selected, say on days d 1,..., d K, the forecast for period t i cannot be generated as in the case of the Euclidean distance. Since the distance function selects traces based on the shape, the offset of the trace for day d j needs to be adjusted by c dj = 1 [ x ] (r i h dj,i). x i=1 Therefore, the forecast for period t i is given by F PD i = 1 K [ (hd1,i + c d1 ) + + (h dk,i + c dk ) ], for i = x + 1,..., n. In the next section we will use the K-nearest neighbour algorithm to forecast call arrival functions using these two distance functions based on real historical data. 4 Numerical Experiments In this section we illustrate the nearest neighbour algorithm using real call center data of an Israeli bank. The call center data with documentation are freely available from The data contains records during a period of a year. The call center is staffed from 7 am to midnight from Sunday to Thursday, it closes at 2 pm on Friday, and reopens at 8 pm on Sunday. In our experiments we take for practical staffing purposes t to be equal to 15 minutes. Thus, taking the opening hours on Sunday until Thursday into account, we have 68 periods for which the number of arriving calls are aggregated per 15 minutes. Given these 5 days, we therefore have 5 52 records, which are used to construct the matrix H and the submatrices H (1),..., H (5). In order to evaluate the quality of the forecasts produced by the nearest neighbour algorithm, we pick a reference day and assume that the call arrival function r x up to period 6
7 40 35 Mean Development of each Working Day SUNDAY MONDAY TUESDAY WEDNESDAY THURSDAY 30 Number of Calls I II III IV Interval Figure 1: Forecast update moments over the day. x {1,..., 68} is known. The nearest neighbour algorithm then selects K traces that are nearest to r x and generates forecasts as explained in the previous section. Typically, historical data will be used, i.e., data only prior to the reference day selected. This is not prohibitive in practice, since call centers usually have enough data to work with. In our case, however, this could result in too few traces when a day early in the year is picked. Since the call center data has a negligible trend in the number of calls, we use all traces available for the matching procedure. Thus, we treat traces that are not prior to the reference day as historical data gathered from the previous year. This also enables us to make better comparisons, since every reference day can be compared with 51 records when using H (j) and 259 records when using H. In principle, the nearest neighbour algorithm can be run every time new data of the reference trace is available. In practice, reacting to every new forecast might lead to many different staffing configurations which might not be manageable. Therefore, we choose to update the forecast only at specific moments of the day. The call arrival function of call centers typically has two peaks, with relatively more calls arriving as compared to other periods, during a day. Depending on the services and opening hours of the call center, the first peak usually occurs in the morning, and the second in the afternoon. It is important to have staffed enough service representatives to meet the service level and not to build up a backlog of waiting calls, which might affect the service level in the periods to come. Therefore, having a better forecast before and after such peaks can result in flexible solutions, as described in Section 1, to prevent violation of the service level. Based on 7
8 180 (Seperated Working Days) WEDNESDAY Period3 Conventional Euclidian Pearson Error K Figure 2: Forecast errors for CV, ED, and PD under SWD as a function of K. the data of the call center, see Figure 1, we identify three moments over the day at which the forecast is updated. Figure 1 shows the mean arrival rates for the different days of the week, and identifies 9 am (start of period 9), 1 pm (start of period 25), and 6 pm (start of period 45) as update moments. Having described the setup of the experiments, we continue to describe the forecasting algorithms that we will compare in the experiments. We shall compare the conventional way of forecasting (CV), described in Section 2, and the K nearest neighbour algorithm with the Euclidean distance (ED) and the Pearson correlation distance (PD), described in Section 3. Note that the conventional algorithm CV is equivalent to ED when K is set equal to the total number of records available. These algorithms will also be compared by forecasting based on separated days of the week H (1),..., H (5) (SWD) and combined days of the week H (CWD). Note that at the start of period 1, the CV is the only applicable algorithm, since there are no traces available for the nearest neighbour algorithm. Evaluation from a statistical perspective In this subsection we will compare the algorithms CV, ED, and PD based on SWD and CWD, resulting in 6 algorithms. Since the presentation of these algorithms for five days of the week with three update moments in a day (giving 15 combinations) is too extensive, we only present the Wednesday with 1 pm as the update moment as a representative case. We evaluate the forecast at the start of period 25 and we define the error of the forecast as the average over the absolute differences between the forecasted number of calls and the actual realization in periods 25,, 44. It is not necessary to take the other periods into account, since at the start of period 45 the algorithm will update the forecast again. We start by comparing the different algorithms by studying the error as a function of the size of the neighbourhood K. Figures 2 and 3 show the error for the CV, ED, and PD 8
9 200 (Combined Working Days) WEDNESDAY Period3 Conventional Euclidian Pearson Error K Figure 3: Forecast errors for CV, ED, and PD under CWD as a function of K. algorithms under both SWD and CWD as a function of K. Note that the CV algorithm does not depend on the value of K, and hence results in a horizontal line in the figures. Also note that when K increases the ED algorithm resembles the CV algorithm more, and consequently has the same performance as the CV algorithm for large K. Both figures consistently show that the PD algorithm performs better than the CV and the ED algorithms. Also, the PD algorithm does not need data preparation and actually profits from working with the full dataset that is not split up for every day of the week. The results under CWD seem to be more stable for different values of K. Moreover, under CWD already moderate values for K suffice to run the PD algorithm. The quality measure used to compare the performance of the considered algorithms is an average of the errors obtained by taking every Wednesday out of the 52 Wednesdays as a reference day. Therefore, it is interesting to analyze the variability in the errors of these forecasts. Figures 4 and 5 show the spread of the error under SWD for the ED and PD algorithms illustrated by boxplots. The variance of the CV algorithm is obtained by looking at the graph for the ED algorithm with K = 51. The figures show that the errors for the nearest neighbour algorithms are denser around the lower error values. This strengthens the conclusion that the nearest neighbour algorithm outperforms the conventional forecasts. More formally, one can statistically test which of the algorithms performs better by using the Wilcoxon rank test. This test determines if for each pair of methods, with each using its best value of K, the median of the first method is significantly lower than that of the second one. By carrying out the test for every pair of methods (CV, ED, and PD), for every dataset (SWD H (1),..., H (5), and CWD H), and for every update moment we obtained the significance values for all the data. For illustration we give the significance values in Table 1 for the comparison of ED and CV under SWD. 9
10 Sunday Monday Tuesday Wednesday Thursday II III IV Table 1: Significance values for ED and CV under SWD. 300 (Seperated Working Days) WEDNESDAY Period3 ED 300 (Seperated Working Days) WEDNESDAY Period3 PD Error 150 Error K K Figure 4: Spread of the errors for CV and PD under SWD as a function of K. 300 (Combined Working Days) WEDNESDAY Period3 ED 300 (Combined Working Days) WEDNESDAY Period3 PD Error 150 Error K K Figure 5: Spread of the errors for CV and PD under CWD as a function of K. The table shows, for each interval (see Figure 1) and day combination, if the ED algorithm performs better than the CV algorithm. With a significance level of α = 5%, the table shows that for interval III the ED is significantly better than CV. With the exception of Tuesday and Wednesday, it is also better in interval II. Interval IV is somewhat unclear; Monday and Wednesday are predicted more accurately. In general we can conclude that intervals II and III are forecasted more accurately when the nearest neighbour algorithm is used. Preference is given to the Pearson correlation distance, due to lower forecasting errors and less variability in the spread of the errors. For interval IV there was no clear indication that any method was significantly better that the others. However, the PD algorithm performed best in most cases. 10
11 Evaluation from a staffing perspective In the previous subsection we studied the forecasting algorithms from a statistical perspective. However, it is not apparent how the conclusions of the previous subsection translate to improvements in staffing. Therefore, in this subsection we evaluate the results of the algorithms in the context of staffing in call centers. We first illustrate the effect of the nearest neighbour forecasting algorithm on staffing when the SIPP approach is used (see Green et al. [12]). Next, we study the effect on staffing based on transient models (see Ingolfsson et al. [14]). To illustrate the SIPP approach, we model the call center as an M/M/c queue in every period. Furthermore, we assume that the call center has a service level target specifying that 80% of the customers should have a service representative on the line within 20 seconds. >From the call center data, we can derive that the average service time of the representatives is 2.4 minutes per call. The staffing can now be based on the Erlang-C formula (see Erlang [7]), using the forecasts of the call arrivals, such that the service level is met in every period. We analyze three scenarios in Table 2 for staffing on a specific Wednesday in interval III (i.e., periods 25 to 44). The first scenario uses the real call arrival rates h i to staff the minimum number of service representatives n i such that the attained service level SL i is above 80%. The second scenario uses the conventional forecasting method to derive estimates h CV i of the call arrival rate. Based on these forecasts, the number of representatives n CV i is determined such that the service level is met. However, in reality the service level experienced under the real call arrival rates is given by SL CV i. The last scenario is similar to the second with the exception that the forecast is based on the nearest neighbour algorithm with the Pearson correlation distance. Table 2 shows that the conventional method systematically overestimates the call arrivals. This results in planning 137 representatives and an average weighted service level i h i SL i / i h i of The nearest neighbour algorithm does not meet the service level in every period. It staffs 116 representatives with an average weighted service level of It is close to the optimal staffing level of 108 with an average weighted service level of In general, numerical experiments show that the nearest neighbour algorithm produces results closer to the optimal staffing levels and the optimal average weighted service level. On special days, i.e., days with relatively high or low call volumes, the service level of the nearest neighbour algorithm is generally also better when viewed at each interval as compared to the service levels generated with the conventional method. To illustrate the effect of the forecasting methods on staffing with transient models, we model the call center as an M t /M/c t queueing system. The forecasts in scenarios 2 and 3 are updated at the beginning of interval II, III, and IV. As in the previous case, scenario 2 uses the conventional forecasting algorithm. In scenario 3, the best forecasting algorithm is used in each interval, i.e., the forecasting algorithm that gives the lowest errors in each interval for Wednesdays. Thus, the conventional forecast is used in interval I, the nearest neighbour forecasting algorithm PD is used in interval II and III, and for interval IV the nearest neighbour algorithm ED is used. Due to the computational complexity of this experiment, we take t to be one hour, with an additional half hour at the end of the day for handling all remaining calls in the system. For the same reason, the service level is set 11
12 h i n i SL i h CV i n CV i SL CV i h i n i SL i Table 2: Numerical experiments for SIPP staffing. to have an average speed of answer (ASA) of at most 30 seconds. Table 3 shows the results for the three scenarios for several days (hence the index d instead of i). When we compare scenario 2 and 3 with each other we cannot draw firm conclusions. Out of the 50 Wednesdays that we examined, the nearest neighbour algorithm performed better in 26 cases, worse in 17 cases, and had similar performance in 7 cases. The variability in performance is caused by the estimate in interval I. The staffing based on this estimate can result in an ASA far from 30. This result has a big influence on the staffing in the next intervals, since the staffing levels will be adjusted to correct the average speed of answer. Hence, dynamic forecasting in combination with transient models should be done more carefully. 5 Conclusions We have investigated the effectiveness of K-nearest neighbour algorithms for forecasting call volumes in call centers, based on two distance functions (the Euclidean distance and the Pearson correlation distance). From a statistical point of view, the nearest neighbour algorithm yields significantly more accurate predictions than the conventional method 12
13 n d SL d n CV d SL CV d n d SL d Table 3: Numerical experiments for staffing with transient models. that simply takes average over the historical data. Depending on the characteristics of the call arrival function in a specific interval, preference is given to one of the distance functions, resulting in a hybrid forecasting algorithm that mixes forecasting methods over the intervals. Additionally, the nearest neighbour algorithm does not require data preparation and captures the correlation structures typically found in call center data. When the forecasting algorithms are viewed upon from a staffing perspective, the method of staffing and the way the service level is calculated is relevant for the performance. When the staffing is based on independent periods, the average waited service level under the hybrid nearest neighbour algorithm performs very well. When the staffing is based on a transient model, however, no firm conclusion can be drawn. The error in the forecast in earlier periods have a big influence on the staffing in later periods. Hence, dynamic forecasting when using transient models warrants more research. References [1] C.G. Atkeson, A.W. Moore, and S. Schaal. Locally weighted learning. Artificial Intelligence Review, 11:11 73, [2] A.N. Avradimis, A. Deslauriers, and P. L Ecuyer. Modeling daily arrivals to a telephone call center. Management Science, 50: , [3] S. Bhulai and G.M. Koole. A queueing model for call blending in call centers. IEEE Transactions on Automatic Control, 48: , [4] L. Brown, N. Gans, A. Mandelbaum, A. Sakov, H. Shen, S. Zeltyn, and L. Zhao. Statistical analysis of a telephone call center: A queueing-science perspective. Submitted, [5] T.M. Cover and P.E. Hart. Nearest neighbour pattern classification. IEEE Transactions on Information Theory, 13:21 27, [6] R.O. Duda and P.E. Hart. Pattern Classification and Scene Analysis. Wiley, [7] A.K. Erlang. Solutions of some problems in the theory of probabilities of significance in automatic telephone exchanges. Electroteknikeren, 13:5 13,
14 [8] N. Gans, G. Koole, and A. Mandelbaum. Telephone call centers: Tutorial, review, and research prospects. Manufacturing and Service Operations Management, 5:79 141, [9] N. Gans and Y.-P. Zhou. A call-routing problem with service-level constraints. Operations Research, 51: , [10] W.K. Grassmann. Transient solutions in Markovian queueing systems. Computers & Operations Research, 4:47 53, [11] L. Green and P. Kolesar. The pointwise stationary approximation for queues with non-stationary arrivals. Management Science, 37:84 97, [12] L. Green, P. Kolesar, and J. Soares. Improving the SIPP approach for staffing service systems that have cyclic demands. Operations Research, 49: , [13] S. Halfin and W. Whitt. Heavy-traffic limits for queues with many exponential servers. Operations Research, 29: , [14] A. Ingolfsson, E. Cabral, and X. Wu. Combining integer programming and the randomization method to schedule employees. Working paper, [15] A. Ingolfsson, M.A. Haque, and A. Umnikov. Accounting for time-varying queueing effects in workforce scheduling. European Journal of Operational Research, 139: , [16] G. Jongbloed and G.M. Koole. Managing uncertainty in call centers using Poisson mixtures. Applied Stochastic Models in Business and Industry, 17: , [17] T. Mitchell. Machine Learning. McGraw Hill, [18] S.G. Steckley, S.G. Henderson, and V. Mehrotra. Service system planning in the presence of a random arrival rate. Submitted, [19] J. Yoo. Queueing models for staffing service operations. PhD thesis, University of Maryland,
Appendix: Simple Methods for Shift Scheduling in Multi-Skill Call Centers
MSOM.1070.0172 Appendix: Simple Methods for Shift Scheduling in Multi-Skill Call Centers In Bhulai et al. (2006) we presented a method for computing optimal schedules, separately, after the optimal staffing
Rule-based Traffic Management for Inbound Call Centers
Vrije Universiteit Amsterdam Research Paper Business Analytics Rule-based Traffic Management for Inbound Call Centers Auteur: Tim Steinkuhler Supervisor: Prof. Dr. Ger Koole October 7, 2014 Contents Preface
An Overview of Routing and Staffing Algorithms in Multi-Skill Customer Contact Centers. Submitted version
An Overview of Routing and Staffing Algorithms in Multi-Skill Customer Contact Centers Ger Koole & Auke Pot Department of Mathematics, Vrije Universiteit Amsterdam, The Netherlands Submitted version 6th
This paper introduces a new method for shift scheduling in multiskill call centers. The method consists of
MANUFACTURING & SERVICE OPERATIONS MANAGEMENT Vol. 10, No. 3, Summer 2008, pp. 411 420 issn 1523-4614 eissn 1526-5498 08 1003 0411 informs doi 10.1287/msom.1070.0172 2008 INFORMS Simple Methods for Shift
Simple Methods for Shift Scheduling in Multi-Skill Call Centers
Simple Methods for Shift Scheduling in Multi-Skill Call Centers Sandjai Bhulai, Ger Koole & Auke Pot Vrije Universiteit, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands Final version Abstract This
Modeling and Optimization Problems in Contact Centers
Modeling and Optimization Problems in Contact Centers Pierre L Ecuyer Département d Informatique et de Recherche Opérationnelle Université de Montréal, C.P. 6128, Succ. Centre-Ville Montréal, H3C 3J7,
FORECASTING CALL CENTER ARRIVALS: A COMPARATIVE STUDY
FORECASTING CALL CENTER ARRIVALS: A COMPARATIVE STUDY by Rouba Ibrahim and Pierre L Ecuyer Department of Computer Science and Operations Research University of Montreal {ibrahiro, lecuyer}@iro.umontreal.ca
Flexible Workforce Management System for Call Center: A case study of public sector
Asia Pacific Management Review (2007) 12(6), 338-346 Flexible Workforce Management System for Call Center: A case study of public sector Jun Woo Kim a, Sang Chan Park a,* a Department of Industrial Engineering,
Supplement to Call Centers with Delay Information: Models and Insights
Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290
Optimal shift scheduling with a global service level constraint
Optimal shift scheduling with a global service level constraint Ger Koole & Erik van der Sluis Vrije Universiteit Division of Mathematics and Computer Science De Boelelaan 1081a, 1081 HV Amsterdam The
Managing uncertainty in call centers using Poisson mixtures
Managing uncertainty in call centers using Poisson mixtures Geurt Jongbloed and Ger Koole Vrije Universiteit, Division of Mathematics and Computer Science De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands
Single item inventory control under periodic review and a minimum order quantity
Single item inventory control under periodic review and a minimum order quantity G. P. Kiesmüller, A.G. de Kok, S. Dabia Faculty of Technology Management, Technische Universiteit Eindhoven, P.O. Box 513,
PARTIAL CROSS TRAINING IN CALL CENTERS WITH UNCERTAIN ARRIVALS AND GLOBAL SERVICE LEVEL AGREEMENTS. D. J. Medeiros
Proceedings of the 07 Winter Simulation Conference S. G. Henderson, B. Biller, M.-H. Hsieh, J. Shortle, J. D. Tew, and R. R. Barton, eds. PARTIAL CROSS TRAINING IN CALL CENTERS WITH UNCERTAIN ARRIVALS
A Logarithmic Safety Staffing Rule for Contact Centers with Call Blending
MANAGEMENT SCIENCE Vol., No., Xxxxx, pp. issn 25-199 eissn 1526-551 1 INFORMS doi 1.1287/xxxx.. c INFORMS A Logarithmic Safety Staffing Rule for Contact Centers with Call Blending Guodong Pang The Harold
Telephone Traffic Queues in a Customer Call Center. A Thesis. Presented for the. Master of Science Degree. The University of Tennessee at Chattanooga
Telephone Traffic Queues in a Customer Call Center A Thesis Presented for the Master of Science Degree The University of Tennessee at Chattanooga Patrick Todd April 2009 Copyright 2009 by Patrick Todd
A Comparative Study of the Pickup Method and its Variations Using a Simulated Hotel Reservation Data
A Comparative Study of the Pickup Method and its Variations Using a Simulated Hotel Reservation Data Athanasius Zakhary, Neamat El Gayar Faculty of Computers and Information Cairo University, Giza, Egypt
AS-D1 SIMULATION: A KEY TO CALL CENTER MANAGEMENT. Rupesh Chokshi Project Manager
AS-D1 SIMULATION: A KEY TO CALL CENTER MANAGEMENT Rupesh Chokshi Project Manager AT&T Laboratories Room 3J-325 101 Crawfords Corner Road Holmdel, NJ 07733, U.S.A. Phone: 732-332-5118 Fax: 732-949-9112
FORECASTING CALL CENTER ARRIVALS: FIXED-EFFECTS, MIXED-EFFECTS, AND BIVARIATE MODELS
FORECASTING CALL CENTER ARRIVALS: FIXED-EFFECTS, MIXED-EFFECTS, AND BIVARIATE MODELS Rouba Ibrahim Desautels Faculty of Management McGill University Pierre L Ecuyer Department of Operations Research University
Component Ordering in Independent Component Analysis Based on Data Power
Component Ordering in Independent Component Analysis Based on Data Power Anne Hendrikse Raymond Veldhuis University of Twente University of Twente Fac. EEMCS, Signals and Systems Group Fac. EEMCS, Signals
Service-Level Variability and Impatience in Call Centers
S er v i c el e v el Va r i a bi l i t ya nd I mpa enc ei nca l l Cent er s Al e xroubos Service-Level Variability and Impatience in Call Centers Roubos, Alex, 1985 Service-Level Variability and Impatience
Modeling and Simulation of a Pacing Engine for Proactive Campaigns in Contact Center Environment
Modeling and Simulation of a Pacing Engine for Proactive Campaigns in Contact Center Environment Nikolay Korolev, Herbert Ristock, Nikolay Anisimov Genesys Telecommunications Laboratories (an Alcatel-Lucent
COPING WITH TIME-VARYING DEMAND WHEN SETTING STAFFING REQUIREMENTS FOR A SERVICE SYSTEM
COPING WITH TIME-VARYING DEMAND WHEN SETTING STAFFING REQUIREMENTS FOR A SERVICE SYSTEM by Linda V. Green Peter J. Kolesar Ward Whitt Graduate School of Business Graduate School of Business IEOR Department
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
How To Manage A Call Center
THE ROLE OF SIMULATION IN CALL CENTER MANAGEMENT Roger Klungle AAA Michigan Introduction With recent advances in technology and the changing nature of business, call center management has become a rapidly
Analysis of call centre arrival data using singular value decomposition
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY Appl. Stochastic Models Bus. Ind., 2005; 21:251 263 Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/asmb.598 Analysis
Performance Analysis of Session-Level Load Balancing Algorithms
Performance Analysis of Session-Level Load Balancing Algorithms Dennis Roubos, Sandjai Bhulai, and Rob van der Mei Vrije Universiteit Amsterdam Faculty of Sciences De Boelelaan 1081a 1081 HV Amsterdam
AS-D2 THE ROLE OF SIMULATION IN CALL CENTER MANAGEMENT. Dr. Roger Klungle Manager, Business Operations Analysis
AS-D2 THE ROLE OF SIMULATION IN CALL CENTER MANAGEMENT Dr. Roger Klungle Manager, Business Operations Analysis AAA Michigan 1 Auto Club Drive Dearborn, MI 48126 U.S.A. Phone: (313) 336-9946 Fax: (313)
Workforce scheduling with logical constraints: theory and applications in call centers
Workforce scheduling with logical constraints: theory and applications in call centers Gábor Danó This thesis was supervised by Sandjai Bhulai and Ger Koole Department of Mathematics Vrije Universiteit
USING SIMULATION TO EVALUATE CALL FORECASTING ALGORITHMS FOR INBOUND CALL CENTER. Guilherme Steinmann Paulo José de Freitas Filho
Proceedings of the 2013 Winter Simulation Conference R. Pasupathy, S.-H. Kim, A. Tolk, R. Hill, and M. E. Kuhl, eds USING SIMULATION TO EVALUATE CALL FORECASTING ALGORITHMS FOR INBOUND CALL CENTER Guilherme
The Analysis of Dynamical Queueing Systems (Background)
The Analysis of Dynamical Queueing Systems (Background) Technological innovations are creating new types of communication systems. During the 20 th century, we saw the evolution of electronic communication
Creating operational shift schedules for third-level IT support: challenges, models and case study
242 Int. J. Services Operations and Informatics, Vol. 3, Nos. 3/4, 2008 Creating operational shift schedules for third-level IT support: challenges, models and case study Segev Wasserkrug*, Shai Taub,
Deployment of express checkout lines at supermarkets
Deployment of express checkout lines at supermarkets Maarten Schimmel Research paper Business Analytics April, 213 Supervisor: René Bekker Faculty of Sciences VU University Amsterdam De Boelelaan 181 181
Performance Analysis of a Telephone System with both Patient and Impatient Customers
Performance Analysis of a Telephone System with both Patient and Impatient Customers Yiqiang Quennel Zhao Department of Mathematics and Statistics University of Winnipeg Winnipeg, Manitoba Canada R3B 2E9
LECTURE - 3 RESOURCE AND WORKFORCE SCHEDULING IN SERVICES
LECTURE - 3 RESOURCE AND WORKFORCE SCHEDULING IN SERVICES Learning objective To explain various work shift scheduling methods for service sector. 8.9 Workforce Management Workforce management deals in
A Stochastic Programming Model for Scheduling Call Centers with Global Service Level Agreements
A Stochastic Programming Model for Scheduling Call Centers with Global Service Level Agreements Working Paper Thomas R. Robbins Terry P. Harrison Department of Supply Chain and Information Systems, Smeal
Fluid Approximation of a Priority Call Center With Time-Varying Arrivals
Fluid Approximation of a Priority Call Center With Time-Varying Arrivals Ahmad D. Ridley, Ph.D. William Massey, Ph.D. Michael Fu, Ph.D. In this paper, we model a call center as a preemptive-resume priority
A joint control framework for supply chain planning
17 th European Symposium on Computer Aided Process Engineering ESCAPE17 V. Plesu and P.S. Agachi (Editors) 2007 Elsevier B.V. All rights reserved. 1 A joint control framework for supply chain planning
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Analysis of Various Forecasting Approaches for Linear Supply Chains based on Different Demand Data Transformations
Institute of Information Systems University of Bern Working Paper No 196 source: https://doi.org/10.7892/boris.58047 downloaded: 16.11.2015 Analysis of Various Forecasting Approaches for Linear Supply
STRATEGIC CAPACITY PLANNING USING STOCK CONTROL MODEL
Session 6. Applications of Mathematical Methods to Logistics and Business Proceedings of the 9th International Conference Reliability and Statistics in Transportation and Communication (RelStat 09), 21
SYNERGISTIC MODELING OF CALL CENTER OPERATIONS
SYNERGISTIC MODELING OF CALL CENTER OPERATIONS DENNIS C. DIETZ AND JON G. VAVER Received 14 July 2005; Revised 30 November 2005; Accepted 9 December 2005 We synergistically apply queueing theory, integer
Simulation of a Claims Call Center: A Success and a Failure
Proceedings of the 1999 Winter Simulation Conference P. A. Farrington, H. B. Nembhard, D. T. Sturrock, and G. W. Evans, eds. SIMULATION OF A CLAIMS CALL CENTER: A SUCCESS AND A FAILURE Roger Klungle AAA
Using Data Mining for Mobile Communication Clustering and Characterization
Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer
Automatic Calibration of an In-vehicle Gaze Tracking System Using Driver s Typical Gaze Behavior
Automatic Calibration of an In-vehicle Gaze Tracking System Using Driver s Typical Gaze Behavior Kenji Yamashiro, Daisuke Deguchi, Tomokazu Takahashi,2, Ichiro Ide, Hiroshi Murase, Kazunori Higuchi 3,
8. Time Series and Prediction
8. Time Series and Prediction Definition: A time series is given by a sequence of the values of a variable observed at sequential points in time. e.g. daily maximum temperature, end of day share prices,
Call Center Metrics: Glossary of Terms
Call Center Metrics: Glossary of Terms A. abandoned call. A call or other type of contact that has been offered into a communications network or telephone system but is terminated by the person originating
The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy
BMI Paper The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy Faculty of Sciences VU University Amsterdam De Boelelaan 1081 1081 HV Amsterdam Netherlands Author: R.D.R.
Performance Indicators for Call Centers with Impatience
Performance Indicators for Call Centers with Impatience Oualid Jouini 1, Ger Koole 2 & Alex Roubos 2 1 Ecole Centrale Paris, Laboratoire Génie Industriel, Grande Voie des Vignes, 9229 Châtenay-Malabry,
STATISTICAL ANALYSIS OF CALL-CENTER OPERATIONAL DATA: FORECASTING CALL ARRIVALS, AND ANALYZING CUSTOMER PATIENCE AND AGENT SERVICE
STATISTICAL ANALYSIS OF CALL-CENTER OPERATIONAL DATA: FORECASTING CALL ARRIVALS, AND ANALYZING CUSTOMER PATIENCE AND AGENT SERVICE HAIPENG SHEN Department of Statistics and Operations Research, University
Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY
TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY Before we begin: Turn on the sound on your computer. There is audio to accompany this presentation. Audio will accompany most of the online
Multivariate Analysis of Ecological Data
Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology
How To Find The Optimal Base Stock Level In A Supply Chain
Optimizing Stochastic Supply Chains via Simulation: What is an Appropriate Simulation Run Length? Arreola-Risa A 1, Fortuny-Santos J 2, Vintró-Sánchez C 3 Abstract The most common solution strategy for
A Regression Approach for Forecasting Vendor Revenue in Telecommunication Industries
A Regression Approach for Forecasting Vendor Revenue in Telecommunication Industries Aida Mustapha *1, Farhana M. Fadzil #2 * Faculty of Computer Science and Information Technology, Universiti Tun Hussein
BANGKOK TRAFFIC MONITORING SYSTEM
BANGKOK TRAFFIC MONITORING SYSTEM Sorawit NARUPITI Associate Professor Transportation Research Laboratory, Department of Civil Engineering, Chulalongkorn University Phayathai Road, Bangkok 10330 Thailand
Accurate Forecasting: The Heart of Call Center Success
Accurate Forecasting: The Heart of Call Center Success Accurate Forecasting: The Heart of Call Center Success Page 2 Overview In times of economic crisis and dwindling profits, it is more important than
A REVIEW OF WORKFORCE CROSS-TRAINING IN CALL CENTERS FROM AN OPERATIONS MANAGEMENT PERSPECTIVE. O. Zeynep Akşin. Fikri Karaesmen.
A REVIEW OF WORKFORCE CROSS-TRAINING IN CALL CENTERS FROM AN OPERATIONS MANAGEMENT PERSPECTIVE O. Zeynep Akşin Fikri Karaesmen and E. Lerzan Örmeci Graduate School of Business Koç University 34450, Istanbul
Call Center Measurements, Data Models and Data Analysis
Service Engineering http://ie.technion.ac.il/serveng200 Call Center Measurements, Data Models and Data Analysis Adapted from: Telephone Call Centers: Tutorial, Review, and Research Prospects By Noah Gans
Measurement and Modelling of Internet Traffic at Access Networks
Measurement and Modelling of Internet Traffic at Access Networks Johannes Färber, Stefan Bodamer, Joachim Charzinski 2 University of Stuttgart, Institute of Communication Networks and Computer Engineering,
Optimizing Call Center Staffing using Simulation and Analytic Center Cutting Plane Methods
Optimizing Call Center Staffing using Simulation and Analytic Center Cutting Plane Methods Júlíus Atlason, [email protected] Marina A. Epelman, [email protected] Shane G. Henderson, [email protected]
Analysis of algorithms of time series analysis for forecasting sales
SAINT-PETERSBURG STATE UNIVERSITY Mathematics & Mechanics Faculty Chair of Analytical Information Systems Garipov Emil Analysis of algorithms of time series analysis for forecasting sales Course Work Scientific
A Brief Study of the Nurse Scheduling Problem (NSP)
A Brief Study of the Nurse Scheduling Problem (NSP) Lizzy Augustine, Morgan Faer, Andreas Kavountzis, Reema Patel Submitted Tuesday December 15, 2009 0. Introduction and Background Our interest in the
Demand forecasting & Aggregate planning in a Supply chain. Session Speaker Prof.P.S.Satish
Demand forecasting & Aggregate planning in a Supply chain Session Speaker Prof.P.S.Satish 1 Introduction PEMP-EMM2506 Forecasting provides an estimate of future demand Factors that influence demand and
Generating Personnel Schedules in an Industrial Setting Using a Tabu Search Algorithm
Generating Personnel Schedules in an Industrial Setting Using a Tabu Search Algorithm Pascal Tellier 1 and George White 2 1 PrairieFyre Software Inc., 555 Legget Dr., Kanata K2K 2X3, Canada [email protected]
Analysis of a Production/Inventory System with Multiple Retailers
Analysis of a Production/Inventory System with Multiple Retailers Ann M. Noblesse 1, Robert N. Boute 1,2, Marc R. Lambrecht 1, Benny Van Houdt 3 1 Research Center for Operations Management, University
Standardization and Its Effects on K-Means Clustering Algorithm
Research Journal of Applied Sciences, Engineering and Technology 6(7): 399-3303, 03 ISSN: 040-7459; e-issn: 040-7467 Maxwell Scientific Organization, 03 Submitted: January 3, 03 Accepted: February 5, 03
Server Load Prediction
Server Load Prediction Suthee Chaidaroon ([email protected]) Joon Yeong Kim ([email protected]) Jonghan Seo ([email protected]) Abstract Estimating server load average is one of the methods that
CHAPTER 3 CALL CENTER QUEUING MODEL WITH LOGNORMAL SERVICE TIME DISTRIBUTION
31 CHAPTER 3 CALL CENTER QUEUING MODEL WITH LOGNORMAL SERVICE TIME DISTRIBUTION 3.1 INTRODUCTION In this chapter, construction of queuing model with non-exponential service time distribution, performance
Evaluating the Lead Time Demand Distribution for (r, Q) Policies Under Intermittent Demand
Proceedings of the 2009 Industrial Engineering Research Conference Evaluating the Lead Time Demand Distribution for (r, Q) Policies Under Intermittent Demand Yasin Unlu, Manuel D. Rossetti Department of
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL STATIsTICs 4 IV. RANDOm VECTORs 1. JOINTLY DIsTRIBUTED RANDOm VARIABLEs If are two rom variables defined on the same sample space we define the joint
Cluster Analysis for Evaluating Trading Strategies 1
CONTRIBUTORS Jeff Bacidore Managing Director, Head of Algorithmic Trading, ITG, Inc. [email protected] +1.212.588.4327 Kathryn Berkow Quantitative Analyst, Algorithmic Trading, ITG, Inc. [email protected]
Introduction to Engineering System Dynamics
CHAPTER 0 Introduction to Engineering System Dynamics 0.1 INTRODUCTION The objective of an engineering analysis of a dynamic system is prediction of its behaviour or performance. Real dynamic systems are
http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions
A Significance Test for Time Series Analysis Author(s): W. Allen Wallis and Geoffrey H. Moore Reviewed work(s): Source: Journal of the American Statistical Association, Vol. 36, No. 215 (Sep., 1941), pp.
M. Sugumaran / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 2 (3), 2011, 1001-1006
A Design of Centralized Meeting Scheduler with Distance Metrics M. Sugumaran Department of Computer Science and Engineering,Pondicherry Engineering College, Puducherry, India. Abstract Meeting scheduling
Local outlier detection in data forensics: data mining approach to flag unusual schools
Local outlier detection in data forensics: data mining approach to flag unusual schools Mayuko Simon Data Recognition Corporation Paper presented at the 2012 Conference on Statistical Detection of Potential
