Bayesian Analysis of Call Center Arrival Data
|
|
- Maryann Reynolds
- 8 years ago
- Views:
Transcription
1 Bayesian Analysis of Call Center Arrival Data Fabrizio Ruggeri 1, Refik Soyer 2 and Murat Tarimcilar 2 1 Istituto di Matematica Applicata e Tecnologie Informatiche Consiglio Nazionale delle Ricerche Via Bassini 15, I-20133, Milano, Italy fabrizio@mi.imati.cnr.it fabrizio/ 2 George Washington University Washington D.C., USA
2 OUTLINE OF THE TALK General issues on call centers data Some models for call centers data Efficacy of advertising campaigns Bayesian models Example Future research
3 CALL CENTER Centralised hub aimed to make or get calls to/from (prospective) customers (Often) primary point of contact between customers and businesses Major investment for many organisations 2.86 million operator positions in over 50,000 call centers in the US, with some locations employing over 1000 agents Not only businesses, but also governments, etc. Some citations from Weinberg, Brown and Stroud, 2006
4 Different functions CALL CENTER OPERATIONS Only inbound calls (e.g. requests for assistance) Only outbound calls (e.g. promotions) Both of them (possibly, outbound calls when idle from inbound ones, including answers to previous inbound calls) Different structures All issues handled by equally trained agents Various levels, e.g. automated answers to simple issues contact with lowly trained agents for ordinary issues contact with highly trained agents (e.g. supervisors) for the most complex issues
5 Different technologies CALL CENTER OPERATIONS Direct phone calls to agents Phone calls to a computerised system, routing calls, e.g., to automated answers and different levels of agents Computer-phone integration, allowing for identification of customers and immediate availability of his/her data (personal and past communications) Contact center: Computer supporting agents via a full range of media ( , fax, web pages, chat) Different policies No customer must be lost No premium customer must be lost Trade between losses due to customer losses and center staffing
6 Qualitative measures, e.g. CALL CENTER QUALITY Customer satisfaction about user-friendly system length of hold-in-line effectiveness of answers Company image affecting future businesses Quantitative measures, e.g. Abandonment, as fraction of customers leaving the queue before service Retrial, as average number of calls needed to solve a problem Waiting, as its average or some percentiles of the waiting-time distribution Profit
7 CALL CENTER DATA CLASSIFICATION Operational data Typically, aggregated data over some periods (minutes/daily/weekly/yearly) from history of each call, e.g. total number of calls served or abandoned average waiting time agents utilisation level Marketing data Combination of phone data with customer s profile and past history Euro-figures for past sales and future marketing targets Psychological data Surveys of customers, agents and managers about subjective perception of service level and working environment Classification taken from Koole and Mandelbaum, 2001
8 Customer level CALL CENTER DATA USE Future marketing target based on past transactions of existing customers Identification of possible (and profitable) new customers Company level Quality of service (lost calls, waiting times, etc.) Staffing of call center Monitoring of agents performance (might be unlawful) Training of agents
9 (Statistical) interest in CALL CENTER Forecasting demand in different periods (e.g. from hourly to yearly) Customer loss Optimal number of agents (possibly time-dependent) Customers (both current and perspective) profiles etc.
10 MODELS FOR CALL CENTER ARRIVAL DATA Customer level Customer profile, using CRM (Customer Relations Management) and Data Mining Degree of satisfaction, mostly via surveys
11 MODELS FOR CALL CENTER ARRIVAL DATA Company level Time series (especially ARIMA processes) arrivals over periods of time detection of different patterns in different periods (e.g. before and after Xmas) normalised (w.r.t. total daily calls) arrivals during each day Queueing models arrival time system availability (optimal) number of channels service policy (e.g. premium customers)
12 MODELS FOR CALL CENTER ARRIVAL DATA Poisson processes with time dependent arrival rates Doubly stochastic Poisson models Nonhomogeneous Poisson processes (NHPPs) Focus on forecasting models for optimal scheduling and staffing of telephone operators in a call center, using aggregate arrival data
13 BACKGROUND Consumer Electronics Producer Limited variety of products Long life cycle Aging products, sporadic upgrades Targeted advertising Sales Average sale is around $ Almost all the sales are through the sales call center Call return is very low if first time the customer is denied the call Advertisement Many media venues used, although print media is majority Each ad is targeted and urging customers to place calls
14 OBJECTIVES Evaluation of different advertisement policies from a marketing viewpoint through call center arrival data Analysis of effects of relevant covariates, e.g. Media expense (in $ s) Venue type (monthly magazine, daily newspaper, etc.) Ad format (full page, half page, colour, etc.) Offer type (free shipment, payment schedule, etc. ) Seasonal indicators Prediction of calls volume generated by specific ad over any interval of interest Arrival data are not aggregated
15 Typical call arrival data for an ad DATA Time interval (in days) Number of calls (0, 1] 6 (1, 2] 5 (2, 3] 1 (3, 4] 3 (4, 5] 2 (5, 6] 2 (6, 7] 3 (7, 8] 0 (8, 9] 2 (9, 10] 2 (10, 18] 0 Number of calls for ad in unit intervals Features of the ad (media, cost, etc.) Corresponding ad known for each call in earlier work
16 GOALS OF THE ONGOING RESEARCH Handling calls unassigned to any ad Different models for covariates Bayesian nonparametrics
17 C 1,...,C m : m campaigns (ad) NOTATIONS T 1,...,T m : campaigns starting times, T 1 T 2... T m Calls recorded as number of arrivals in I j = (t j 1,t j ], j = 1,n (starting times coincide with one of the endpoints of the intervals I j ) n ij : number of calls in I j related to campaign C i, i = 1,m, j = 1,n u j : number of unallocated calls in I j (0 in earlier work) (in each I j consider only campaigns C i with T i t j 1 )
18 NONHOMOGENEOUS POISSON PROCESS N t,t 0 # events by time t N(y,s) # events in (y,s] Λ(t) = EN t mean value function Λ(y,s) = Λ(s) Λ(y) expected # events in (y,s] N t,t 0, NHPP with intensity function λ(t) iff 1. N 0 = 0 2. independent increments 3. P{# events in (t,t+h) 2} = o(h) 4. P{# events in (t,t+h) = 1} = λ(t)h+o(h) P{N(y,s) = k} = Λ(y,s)k k! e Λ(y,s), k N
19 NONHOMOGENEOUS POISSON PROCESS λ(t) λ t HPP λ(t): intensity function of N t P{N(t,t+ ] 1} λ(t) := lim, t 0 0 µ(t) := dλ(t) dt : Rocof (rate of occurrence of failures) Property 3. µ(t) = λ(t) a.e. Λ(y,s) = s y λ(t)dt
20 MODULATED NHPP MODEL NHPP s with intensity dependent on covariates λ i (t,z i ) = λ 0 (t)exp{γ Z i } λ 0 (t) baseline intensity and γ parameter λ i(t,z i ) λ j (t,z j ) = exp{γ (Z i Z j )} Proportional Intensities Model Cox, 1972
21 MODEL Call center arrival data modeled as NHPP dependent on the campaign (i.e. covariates Z i ) dependent on the starting point T i for campaign C i λ i (t) = λ 0 (t T i )exp{γ Z i }I [Ti, )(t) Λ i (t) = Λ 0 (t T i )exp{γ Z i }I [Ti, )(t)
22 MODEL Superposition Theorem Sum of independent NHPP with intensity functions λ i (t) is still a NHPP with intensity function λ(t) = λ i (t)
23 MODEL Number of calls decreasing to zero Power Law Process (PLP) λ(t) = Mβt β 1 Λ(t) = Mt β M,β,t > 0 but here 0 < β < 1 λ(t) 0 for t Alternative: λ(t) = β 0 log(1+β 1 t) (1+β 1 t) (increasing from 0 and then decreasing to 0)
24 MODEL Cumulative number of arrivals approximates Λ PLP logλ(t) = logm +βlogt plot of logλ(t) v.s. logt to check appropriateness of PLP +**+%*;++ % * "+1
25 MODEL WITH PERFECT LINKAGE OF CALLS No further details on the previous model to avoid repetitions with the model with unallocated calls Random effects model, accounting for differences unexplained by covariates Model similar to the previous one, but NHPPs with Λ i (t) = M i t β logm i = θ +φ i, φ i random effect terms φ i s conditional independent N(0,1/τ) and τ G(a τ,b τ )
26 DIGRESSION: MODEL FOR UNASSIGNED CALLS Unassigned calls define a new process with Λ u (t) = δ u Mt β δ u r.v. rescaling the baseline mean value function Model describes arrivals of unallocated calls but not their assignment to campaigns and, consequently, these data are less useful to measure effectiveness of campaigns
27 A POSSIBLE (BUT INCONVENIENT) MODEL Arrivals in the interval I j, j = 1,n: n ij related to campaign C i, i = 1,m u j unallocated Interest in P(n 1j C 1,...,n mj C m,u j I j ) All possible allocations: u j = {u 1j C 1,...,u mj C m }, with m l=1 u lj = u j Specify a distribution P(u j ) (e.g. multinomial) on the allocation Compute P(n 1j C 1,...,n mj C m,u j I j ) = all u j P(n 1j +u 1j C 1,...,n mj +u mj C m u j )P(u j )
28 ASSUMPTIONS Latent variables Y j = (Y 1j,...,Y mj ) Mult(u j,p 1j,...,p mj ), for each I j unknown number of unallocated calls assigned to each campaign Drop the notations C i from the probabilities ij = Λ i (t j ) Λ i (t j 1 ), for each i,j For each interval I j, P(n 1j +Y ij,...,n mj +Y mj,y j,u j ) = = P(n 1j +Y ij,...,n mj +Y mj Y j,u j )P(Y j u j )P(u j ) { m n ij+y ij }{ ( ij u ) m } = n ij +Y ij! e ij j p Y ij ij P(u j ) Y 1j,...,Y mj i=1 No interest in P(u j ) partial likelihood i=1
29 For any NHPP n j=1( u j Y 1j,...,Y mj ) { m i=1 For a PLP PARTIAL LIKELIHOOD } n ij +Y ij ij n ij +Y ij! e ij py ij ij = e m i=1 Λ(t n) n j=1 ( u j Y 1j,...,Y mj ) { m i=1 m i=1 n ij = n j (j = 1,n), n j=1 n ij = N i and n j=1 Y ij = U i (i = 1,m) n j=1 n j = N and n j=1 Y j = U } n ij +Y ij ij n ij +Y ij! py ij ij M N+U e M m { m i=1 (t n T i ) β e γ Zi m i=1 i=1 n e γ Z i (N i +U ) i j=1 { [(tj T i ) β (t j 1 T i ) β ] } n ij +Y ij n ij +Y ij! ( u ) j Y 1j,...,Y mj p Y ij ij }
30 PRIORS Intervals I j,j = 1,n Independent p j Dir(α 1j,...,α mj ) α ij = αe δ(t j T ) i I (Ti, )(t j ) α ij = 0 for unstarted campaigns ( degenerate Dirichlet) decreasing α ij from an interval to next ones (calls unlikely from older ads: Ep ij = α ij m ) l=1 α lm α,δ: either known or prior M G(a,b) any π(β) (nothing convenient for simulations) any π(γ) (nothing convenient for simulations)
31 POSTERIORS n (j) = (n 1j,...,n mj ), for interval I j, j = 1,n n = (n (1),...,n (n) ) p j Y j,n Dir(α 1j +Y 1j,...,α mj +Y mj ), j = 1,n γ M,n,β,Y j : π(γ)e m i=1 { M(tn T i ) β (N i +U i )} e γ Z i
32 POSTERIORS M β,n,γ G(a+N +U,b+ m i=1 (t n T i ) β e γ Z i ) β M,n,γ,Y j π(β)e M m i=1 (t n T i ) β e γ Z i nj=1 mi=1 [(t j T i ) β (t j 1 T i ) β ] n ij+y ij P(Y j β,m,γ,n) ( n ) ij +Y ij u j ni=1 ij Y 1j,...,Y mj (n ij +Y ij )! p ijy ij ij, j = 1,n MCMC simulation (Gibbs with Metropolis steps within)
33 EXAMPLE Weekly time intervals 10 campaigns Cost of the campaign (in $1000) as unique covariate Actual data considered and missing links randomly assigned Interest in interval 4 with 3 active campaigns and 4 missing links
34 EXAMPLE Posterior probabilities Y ij Ad1 Ad2 Ad Posterior means and actual values Ad1 Ad2 Ad3 Mean Actual Predictive distribution of calls in future intervals
35 EXAMPLE Posterior density of γ (covariate coefficient) density.default(x = gnsfr[200:2272]) Density N = 2073 Bandwidth = Z = 0 for campaign cost above a threshold and Z = 1 under it High probability of negative γ higher λ(t) for expensive campaign more calls for it
36 EXAMPLE Posterior density of M (PLP parameter: λ(t) = Mt β ) density.default(x = Ms[200:2272]) Density N = 2073 Bandwidth =
37 EXAMPLE Posterior density of β (PLP parameter: λ(t) = Mt β ) density.default(x = bns[200:2272]) Density N = 2073 Bandwidth =
38 FUTURE RESEARCH More detailed data analysis Other models including covariates Nonparametric approach
39 HIERARCHICAL NHPP MODELS m campaigns C i m PLPs with parameters (M i,β i ), i = 1,m Possible priors β i ρ,δ G(ρexp{X T i δ},ρ) M i ν,σ G(νexp{X T i σ},ν) ρ,ν,δ,σ Priors Empirical Bayes
40 NONPARAMETRIC APPROACH # events in [T 0,T 1 ] P(Λ[T 0,T 1 ]),Λ[T 0,T 1 ] = Λ(T 1 ) Λ(T 0 ) Parametric case: Λ[T 0,T 1 ] = T 1 T 0 λ(t)dt Nonparametric case: Λ[T 0,T 1 ] G(, ) = Λ d.f. of the random measure M Notation: µb := µ(b) Definition 1 Let α be a finite, σ-additive measure on (S,S). The random measure µ follows a Standard Gamma distribution with shape α (denoted by µ GG(α,1)) if, for any family {S j,j = 1,...,k} of disjoint, measurable subsets of S, the random variables µs j are independent and such that µs j G(αS j,1), for j = 1,...,k. Definition 2 Let β be an α-integrable function and µ GG(α,1). The random measure M = βµ, s.t. βµ(a) = β(x)µ(dx), A S, follows a A Generalised Gamma distribution, with shape α and scale β (denoted by M GG(α,β)).
41 Consequences: NONPARAMETRIC APPROACH µ P α,1,p α,1 unique p.m. on (Ω,M), space of finite measures on (S,S), with these finite dimensional distributions M P α,β, weighted random measure, with P α,β p.m. induced by P α,1 EM = βα, i.e. Ω M(A)P α,β(dm) = A β(x)α(dx), A S
42 NONPARAMETRIC APPROACH Theorem 1 Let ξ = (ξ 1,...,ξ n ) be n Poisson processes with intensity measure M. If M GG(α,β) a priori, then M GG(α + n i=1 ξ i,β/(1 + nβ)) a posteriori. Data: {y ij,i = 1...k j } n j=1 from ξ = (ξ 1,...,ξ n ) Bayesian estimator of M: measure M s.t., S S, MS = S β(x) 1+nβ(x) α(dx)+ n k j j=1 i=1 β(y ij ) 1+nβ(y ij ) I S(y ij ) Constant β = MS = β 1+nβ [αs + n k j j=1 i=1 I S (y ij )]
43 NONPARAMETRIC APPROACH Data (calls) recorded as number of events in disjoint intervals Comparison between parametric and nonparametric models Update of the Gamma process
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Refik Soyer * Department of Management Science The George Washington University M. Murat Tarimcilar Department of Management Science
More informationBasics of Statistical Machine Learning
CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar
More informationA Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails
12th International Congress on Insurance: Mathematics and Economics July 16-18, 2008 A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails XUEMIAO HAO (Based on a joint
More information7.1 The Hazard and Survival Functions
Chapter 7 Survival Models Our final chapter concerns models for the analysis of data which have three main characteristics: (1) the dependent variable or response is the waiting time until the occurrence
More informationMATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...
MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................
More informationα α λ α = = λ λ α ψ = = α α α λ λ ψ α = + β = > θ θ β > β β θ θ θ β θ β γ θ β = γ θ > β > γ θ β γ = θ β = θ β = θ β = β θ = β β θ = = = β β θ = + α α α α α = = λ λ λ λ λ λ λ = λ λ α α α α λ ψ + α =
More informationExam C, Fall 2006 PRELIMINARY ANSWER KEY
Exam C, Fall 2006 PRELIMINARY ANSWER KEY Question # Answer Question # Answer 1 E 19 B 2 D 20 D 3 B 21 A 4 C 22 A 5 A 23 E 6 D 24 E 7 B 25 D 8 C 26 A 9 E 27 C 10 D 28 C 11 E 29 C 12 B 30 B 13 C 31 C 14
More informationINSURANCE RISK THEORY (Problems)
INSURANCE RISK THEORY (Problems) 1 Counting random variables 1. (Lack of memory property) Let X be a geometric distributed random variable with parameter p (, 1), (X Ge (p)). Show that for all n, m =,
More informationInference of Probability Distributions for Trust and Security applications
Inference of Probability Distributions for Trust and Security applications Vladimiro Sassone Based on joint work with Mogens Nielsen & Catuscia Palamidessi Outline 2 Outline Motivations 2 Outline Motivations
More informationApplication of Bayesian Methods in Detection of Healthcare Fraud
A publication of CHEMICAL ENGINEERING TRANSACTIONS VOL. 33, 2013 Guest Editors: Enrico Zio, Piero Baraldi Copyright 2013, AIDIC Servizi S.r.l., ISBN 978-88-95608-24-2; ISSN 1974-9791 The Italian Association
More information1 Prior Probability and Posterior Probability
Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which
More informationBayesian Statistics in One Hour. Patrick Lam
Bayesian Statistics in One Hour Patrick Lam Outline Introduction Bayesian Models Applications Missing Data Hierarchical Models Outline Introduction Bayesian Models Applications Missing Data Hierarchical
More informationA LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA
REVSTAT Statistical Journal Volume 4, Number 2, June 2006, 131 142 A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA Authors: Daiane Aparecida Zuanetti Departamento de Estatística, Universidade Federal de São
More informationPortfolio Using Queuing Theory
Modeling the Number of Insured Households in an Insurance Portfolio Using Queuing Theory Jean-Philippe Boucher and Guillaume Couture-Piché December 8, 2015 Quantact / Département de mathématiques, UQAM.
More informationNotes on the Negative Binomial Distribution
Notes on the Negative Binomial Distribution John D. Cook October 28, 2009 Abstract These notes give several properties of the negative binomial distribution. 1. Parameterizations 2. The connection between
More informationNonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk
More informationPerformance Analysis of a Telephone System with both Patient and Impatient Customers
Performance Analysis of a Telephone System with both Patient and Impatient Customers Yiqiang Quennel Zhao Department of Mathematics and Statistics University of Winnipeg Winnipeg, Manitoba Canada R3B 2E9
More informationGaussian Conjugate Prior Cheat Sheet
Gaussian Conjugate Prior Cheat Sheet Tom SF Haines 1 Purpose This document contains notes on how to handle the multivariate Gaussian 1 in a Bayesian setting. It focuses on the conjugate prior, its Bayesian
More informationExponential Distribution
Exponential Distribution Definition: Exponential distribution with parameter λ: { λe λx x 0 f(x) = 0 x < 0 The cdf: F(x) = x Mean E(X) = 1/λ. f(x)dx = Moment generating function: φ(t) = E[e tx ] = { 1
More informationPart 2: One-parameter models
Part 2: One-parameter models Bernoilli/binomial models Return to iid Y 1,...,Y n Bin(1, θ). The sampling model/likelihood is p(y 1,...,y n θ) =θ P y i (1 θ) n P y i When combined with a prior p(θ), Bayes
More informationSTATISTICAL ANALYSIS OF CALL-CENTER OPERATIONAL DATA: FORECASTING CALL ARRIVALS, AND ANALYZING CUSTOMER PATIENCE AND AGENT SERVICE
STATISTICAL ANALYSIS OF CALL-CENTER OPERATIONAL DATA: FORECASTING CALL ARRIVALS, AND ANALYZING CUSTOMER PATIENCE AND AGENT SERVICE HAIPENG SHEN Department of Statistics and Operations Research, University
More informationStatistical Machine Learning from Data
Samy Bengio Statistical Machine Learning from Data 1 Statistical Machine Learning from Data Gaussian Mixture Models Samy Bengio IDIAP Research Institute, Martigny, Switzerland, and Ecole Polytechnique
More informationCourse 4 Examination Questions And Illustrative Solutions. November 2000
Course 4 Examination Questions And Illustrative Solutions Novemer 000 1. You fit an invertile first-order moving average model to a time series. The lag-one sample autocorrelation coefficient is 0.35.
More information3. Regression & Exponential Smoothing
3. Regression & Exponential Smoothing 3.1 Forecasting a Single Time Series Two main approaches are traditionally used to model a single time series z 1, z 2,..., z n 1. Models the observation z t as a
More informationSupplement to Call Centers with Delay Information: Models and Insights
Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290
More informationSpatial Statistics Chapter 3 Basics of areal data and areal data modeling
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data
More informationCredit Risk Models: An Overview
Credit Risk Models: An Overview Paul Embrechts, Rüdiger Frey, Alexander McNeil ETH Zürich c 2003 (Embrechts, Frey, McNeil) A. Multivariate Models for Portfolio Credit Risk 1. Modelling Dependent Defaults:
More informationImperfect Debugging in Software Reliability
Imperfect Debugging in Software Reliability Tevfik Aktekin and Toros Caglar University of New Hampshire Peter T. Paul College of Business and Economics Department of Decision Sciences and United Health
More informationPast and present trends in aggregate claims analysis
Past and present trends in aggregate claims analysis Gordon E. Willmot Munich Re Professor of Insurance Department of Statistics and Actuarial Science University of Waterloo 1st Quebec-Ontario Workshop
More informationExamining credit card consumption pattern
Examining credit card consumption pattern Yuhao Fan (Economics Department, Washington University in St. Louis) Abstract: In this paper, I analyze the consumer s credit card consumption data from a commercial
More informationBayes and Naïve Bayes. cs534-machine Learning
Bayes and aïve Bayes cs534-machine Learning Bayes Classifier Generative model learns Prediction is made by and where This is often referred to as the Bayes Classifier, because of the use of the Bayes rule
More informationPS 271B: Quantitative Methods II. Lecture Notes
PS 271B: Quantitative Methods II Lecture Notes Langche Zeng zeng@ucsd.edu The Empirical Research Process; Fundamental Methodological Issues 2 Theory; Data; Models/model selection; Estimation; Inference.
More informationValidation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT
Validation of Software for Bayesian Models using Posterior Quantiles Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT Abstract We present a simulation-based method designed to establish that software
More informationUsing the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes JunXuJ.ScottLong Indiana University August 22, 2005 The paper provides technical details on
More informationBayesian Statistics: Indian Buffet Process
Bayesian Statistics: Indian Buffet Process Ilker Yildirim Department of Brain and Cognitive Sciences University of Rochester Rochester, NY 14627 August 2012 Reference: Most of the material in this note
More informationRobust Staff Level Optimisation in Call Centres
Robust Staff Level Optimisation in Call Centres Sam Clarke Jesus College University of Oxford A thesis submitted for the degree of M.Sc. Mathematical Modelling and Scientific Computing Trinity 2007 Abstract
More informationSurvival Distributions, Hazard Functions, Cumulative Hazards
Week 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution of a
More informationSOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS
SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS Copyright 005 by the Society of Actuaries and the Casualty Actuarial Society
More informationStatistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
More informationR 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models
Faculty of Health Sciences R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models Inference & application to prediction of kidney graft failure Paul Blanche joint work with M-C.
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationM/M/1 and M/M/m Queueing Systems
M/M/ and M/M/m Queueing Systems M. Veeraraghavan; March 20, 2004. Preliminaries. Kendall s notation: G/G/n/k queue G: General - can be any distribution. First letter: Arrival process; M: memoryless - exponential
More informationAnalysis of Call Center Data
University of Pennsylvania ScholarlyCommons Wharton Research Scholars Journal Wharton School 4-1-2004 Analysis of Call Center Data Yu Chu Cheng University of Pennsylvania This paper is posted at ScholarlyCommons.
More informationProfit-oriented shift scheduling of inbound contact centers with skills-based routing, impatient customers, and retrials
Noname manuscript No. (will be inserted by the editor) Profit-oriented shift scheduling of inbound contact centers with skills-based routing, impatient customers, and retrials Stefan Helber and Kirsten
More informationA Latent Variable Approach to Validate Credit Rating Systems using R
A Latent Variable Approach to Validate Credit Rating Systems using R Chicago, April 24, 2009 Bettina Grün a, Paul Hofmarcher a, Kurt Hornik a, Christoph Leitner a, Stefan Pichler a a WU Wien Grün/Hofmarcher/Hornik/Leitner/Pichler
More informationRule-based Traffic Management for Inbound Call Centers
Vrije Universiteit Amsterdam Research Paper Business Analytics Rule-based Traffic Management for Inbound Call Centers Auteur: Tim Steinkuhler Supervisor: Prof. Dr. Ger Koole October 7, 2014 Contents Preface
More informationThe Exponential Distribution
21 The Exponential Distribution From Discrete-Time to Continuous-Time: In Chapter 6 of the text we will be considering Markov processes in continuous time. In a sense, we already have a very good understanding
More informationFuzzy Differential Systems and the New Concept of Stability
Nonlinear Dynamics and Systems Theory, 1(2) (2001) 111 119 Fuzzy Differential Systems and the New Concept of Stability V. Lakshmikantham 1 and S. Leela 2 1 Department of Mathematical Sciences, Florida
More informationCall Admission Control for Voice/Data Integration in Broadband Wireless Networks
Call Admission Control for Voice/Data Integration in Broadband Wireless Networks Majid Ghaderi, Student Member, IEEE, and Raouf Boutaba, Senior Member, IEEE School of Computer Science University of Waterloo
More informationAnalysis of window-observation recurrence data
Iowa State University Digital Repository @ Iowa State University Graduate Theses and Dissertations Graduate College Analysis of window-observation recurrence data Jianying Zuo Iowa State University Follow
More informationOn the modeling and forecasting of call center arrivals
On the modeling and forecasting of call center arrivals Rouba Ibrahim, Han Ye, Pierre L Ecuyer, Haipeng Shen To cite this version: Rouba Ibrahim, Han Ye, Pierre L Ecuyer, Haipeng Shen. On the modeling
More information1.5 / 1 -- Communication Networks II (Görg) -- www.comnets.uni-bremen.de. 1.5 Transforms
.5 / -- Communication Networks II (Görg) -- www.comnets.uni-bremen.de.5 Transforms Using different summation and integral transformations pmf, pdf and cdf/ccdf can be transformed in such a way, that even
More informationData Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression
Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction
More informationReject Inference in Credit Scoring. Jie-Men Mok
Reject Inference in Credit Scoring Jie-Men Mok BMI paper January 2009 ii Preface In the Master programme of Business Mathematics and Informatics (BMI), it is required to perform research on a business
More information11. Time series and dynamic linear models
11. Time series and dynamic linear models Objective To introduce the Bayesian approach to the modeling and forecasting of time series. Recommended reading West, M. and Harrison, J. (1997). models, (2 nd
More informationEstimating the Degree of Activity of jumps in High Frequency Financial Data. joint with Yacine Aït-Sahalia
Estimating the Degree of Activity of jumps in High Frequency Financial Data joint with Yacine Aït-Sahalia Aim and setting An underlying process X = (X t ) t 0, observed at equally spaced discrete times
More informationPARTIAL CROSS TRAINING IN CALL CENTERS WITH UNCERTAIN ARRIVALS AND GLOBAL SERVICE LEVEL AGREEMENTS. D. J. Medeiros
Proceedings of the 07 Winter Simulation Conference S. G. Henderson, B. Biller, M.-H. Hsieh, J. Shortle, J. D. Tew, and R. R. Barton, eds. PARTIAL CROSS TRAINING IN CALL CENTERS WITH UNCERTAIN ARRIVALS
More informationBayesX - Software for Bayesian Inference in Structured Additive Regression
BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich
More information2. Illustration of the Nikkei 225 option data
1. Introduction 2. Illustration of the Nikkei 225 option data 2.1 A brief outline of the Nikkei 225 options market τ 2.2 Estimation of the theoretical price τ = + ε ε = = + ε + = + + + = + ε + ε + ε =
More informationUseful Recent Trends in Simulation Methodology
Useful Recent Trends in Simulation Methodology Shane Henderson Cornell University Thanks: NSF DMI 0400287 1 e.g., Inbound call centre staffing Min staffing costs s/t customers happy enough Min c T x s/t
More informationStatistical Analysis of Life Insurance Policy Termination and Survivorship
Statistical Analysis of Life Insurance Policy Termination and Survivorship Emiliano A. Valdez, PhD, FSA Michigan State University joint work with J. Vadiveloo and U. Dias Session ES82 (Statistics in Actuarial
More informationWeb-based Supplementary Materials for. Modeling of Hormone Secretion-Generating. Mechanisms With Splines: A Pseudo-Likelihood.
Web-based Supplementary Materials for Modeling of Hormone Secretion-Generating Mechanisms With Splines: A Pseudo-Likelihood Approach by Anna Liu and Yuedong Wang Web Appendix A This appendix computes mean
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationParametric Survival Models
Parametric Survival Models Germán Rodríguez grodri@princeton.edu Spring, 2001; revised Spring 2005, Summer 2010 We consider briefly the analysis of survival data when one is willing to assume a parametric
More informationLecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
More informationARMA, GARCH and Related Option Pricing Method
ARMA, GARCH and Related Option Pricing Method Author: Yiyang Yang Advisor: Pr. Xiaolin Li, Pr. Zari Rachev Department of Applied Mathematics and Statistics State University of New York at Stony Brook September
More informationModels for Product Demand Forecasting with the Use of Judgmental Adjustments to Statistical Forecasts
Page 1 of 20 ISF 2008 Models for Product Demand Forecasting with the Use of Judgmental Adjustments to Statistical Forecasts Andrey Davydenko, Professor Robert Fildes a.davydenko@lancaster.ac.uk Lancaster
More informationSampling for Bayesian computation with large datasets
Sampling for Bayesian computation with large datasets Zaiying Huang Andrew Gelman April 27, 2005 Abstract Multilevel models are extremely useful in handling large hierarchical datasets. However, computation
More informationStatistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
More informationKnowledge Management in Call Centers: How Routing Rules Influence Expertise and Service Quality
Knowledge Management in Call Centers: How Routing Rules Influence Expertise and Service Quality Christoph Heitz Institute of Data Analysis and Process Design, Zurich University of Applied Sciences CH-84
More informationThe Proportional Odds Model for Assessing Rater Agreement with Multiple Modalities
The Proportional Odds Model for Assessing Rater Agreement with Multiple Modalities Elizabeth Garrett-Mayer, PhD Assistant Professor Sidney Kimmel Comprehensive Cancer Center Johns Hopkins University 1
More informationMarkov Chain Monte Carlo Simulation Made Simple
Markov Chain Monte Carlo Simulation Made Simple Alastair Smith Department of Politics New York University April2,2003 1 Markov Chain Monte Carlo (MCMC) simualtion is a powerful technique to perform numerical
More informationPrivate Equity Fund Valuation and Systematic Risk
An Equilibrium Approach and Empirical Evidence Axel Buchner 1, Christoph Kaserer 2, Niklas Wagner 3 Santa Clara University, March 3th 29 1 Munich University of Technology 2 Munich University of Technology
More informationOverview of Monte Carlo Simulation, Probability Review and Introduction to Matlab
Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?
More informationAn Internal Model for Operational Risk Computation
An Internal Model for Operational Risk Computation Seminarios de Matemática Financiera Instituto MEFF-RiskLab, Madrid http://www.risklab-madrid.uam.es/ Nicolas Baud, Antoine Frachot & Thierry Roncalli
More informationMicro-Level Loss Reserving Models with Applications in Workers Compensation Insurance
Micro-Level Loss Reserving Models with Applications in Workers Compensation Insurance Xiaoli Jin University of Wisconsin-Madison Abstract Accurate loss reserves are essential for insurers to maintain adequate
More information4 The M/M/1 queue. 4.1 Time-dependent behaviour
4 The M/M/1 queue In this chapter we will analyze the model with exponential interarrival times with mean 1/λ, exponential service times with mean 1/µ and a single server. Customers are served in order
More informationModern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh
Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh Peter Richtárik Week 3 Randomized Coordinate Descent With Arbitrary Sampling January 27, 2016 1 / 30 The Problem
More informationLecture 7: Continuous Random Variables
Lecture 7: Continuous Random Variables 21 September 2005 1 Our First Continuous Random Variable The back of the lecture hall is roughly 10 meters across. Suppose it were exactly 10 meters, and consider
More informationA random point process model for the score in sport matches
IMA Journal of Management Mathematics (2009) 20, 121 131 doi:10.1093/imaman/dpn027 Advance Access publication on October 30, 2008 A random point process model for the score in sport matches PETR VOLF Institute
More informationChapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
More informationMarkov chain models of a telephone call center with call blending
Markov chain models of a telephone call center with call blending Alexandre Deslauriers Juta Pichitlamken Pierre L Ecuyer Athanassios N. Avramidis GERAD and Département d Informatique et de Recherche Opérationnelle
More informationInterpretation of Somers D under four simple models
Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms
More informationENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL
ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL by Ward Whitt Department of Industrial Engineering and Operations Research Columbia University, New York, NY 10027 Abstract An algorithm is developed to
More informationPacific Journal of Mathematics
Pacific Journal of Mathematics GLOBAL EXISTENCE AND DECREASING PROPERTY OF BOUNDARY VALUES OF SOLUTIONS TO PARABOLIC EQUATIONS WITH NONLOCAL BOUNDARY CONDITIONS Sangwon Seo Volume 193 No. 1 March 2000
More informationPortfolio Distribution Modelling and Computation. Harry Zheng Department of Mathematics Imperial College h.zheng@imperial.ac.uk
Portfolio Distribution Modelling and Computation Harry Zheng Department of Mathematics Imperial College h.zheng@imperial.ac.uk Workshop on Fast Financial Algorithms Tanaka Business School Imperial College
More informationStochastic Models for Inventory Management at Service Facilities
Stochastic Models for Inventory Management at Service Facilities O. Berman, E. Kim Presented by F. Zoghalchi University of Toronto Rotman School of Management Dec, 2012 Agenda 1 Problem description Deterministic
More informationSTRATEGIC CAPACITY PLANNING USING STOCK CONTROL MODEL
Session 6. Applications of Mathematical Methods to Logistics and Business Proceedings of the 9th International Conference Reliability and Statistics in Transportation and Communication (RelStat 09), 21
More informationA General Approach to Variance Estimation under Imputation for Missing Survey Data
A General Approach to Variance Estimation under Imputation for Missing Survey Data J.N.K. Rao Carleton University Ottawa, Canada 1 2 1 Joint work with J.K. Kim at Iowa State University. 2 Workshop on Survey
More informationBayesian logistic betting strategy against probability forecasting. Akimichi Takemura, Univ. Tokyo. November 12, 2012
Bayesian logistic betting strategy against probability forecasting Akimichi Takemura, Univ. Tokyo (joint with Masayuki Kumon, Jing Li and Kei Takeuchi) November 12, 2012 arxiv:1204.3496. To appear in Stochastic
More informationMicroclustering: When the Cluster Sizes Grow Sublinearly with the Size of the Data Set
Microclustering: When the Cluster Sizes Grow Sublinearly with the Size of the Data Set Jeffrey W. Miller Brenda Betancourt Abbas Zaidi Hanna Wallach Rebecca C. Steorts Abstract Most generative models for
More informationExample: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
More informationSimple Markovian Queueing Systems
Chapter 4 Simple Markovian Queueing Systems Poisson arrivals and exponential service make queueing models Markovian that are easy to analyze and get usable results. Historically, these are also the models
More informationProbability Theory. Florian Herzog. A random variable is neither random nor variable. Gian-Carlo Rota, M.I.T..
Probability Theory A random variable is neither random nor variable. Gian-Carlo Rota, M.I.T.. Florian Herzog 2013 Probability space Probability space A probability space W is a unique triple W = {Ω, F,
More informationFluid Approximation of a Priority Call Center With Time-Varying Arrivals
Fluid Approximation of a Priority Call Center With Time-Varying Arrivals Ahmad D. Ridley, Ph.D. William Massey, Ph.D. Michael Fu, Ph.D. In this paper, we model a call center as a preemptive-resume priority
More informationM1 in Economics and Economics and Statistics Applied multivariate Analysis - Big data analytics Worksheet 1 - Bootstrap
Nathalie Villa-Vialanei Année 2015/2016 M1 in Economics and Economics and Statistics Applied multivariate Analsis - Big data analtics Worksheet 1 - Bootstrap This worksheet illustrates the use of nonparametric
More informationAuxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
More informationBayesian Methods in Applied Econometrics, or, Why Econometrics Should Always and Everywhere Be Bayesian
Bayesian Methods in Applied Econometrics, or, Why Econometrics Should Always and Everywhere Be Bayesian Christopher A. Sims Princeton University sims@princeton.edu August 6, 2007 Introduction The first
More information5. Linear Regression
5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4
More informationTwo Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering Department of Industrial Engineering and Management Sciences Northwestern University September 15th, 2014
More information