Useful Recent Trends in Simulation Methodology



Similar documents
Lecture 3: Linear methods for classification

STORM: Stochastic Optimization Using Random Models Katya Scheinberg Lehigh University. (Joint work with R. Chen and M. Menickelly)

Optimal shift scheduling with a global service level constraint

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering

6.231 Dynamic Programming and Stochastic Control Fall 2008

PARTIAL CROSS TRAINING IN CALL CENTERS WITH UNCERTAIN ARRIVALS AND GLOBAL SERVICE LEVEL AGREEMENTS. D. J. Medeiros

Basics of Statistical Machine Learning

Stochastic Inventory Control

An Introduction to Machine Learning

Rule-based Traffic Management for Inbound Call Centers

Supplement to Call Centers with Delay Information: Models and Insights

Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations

IEOR 6711: Stochastic Models, I Fall 2012, Professor Whitt, Final Exam SOLUTIONS

2WB05 Simulation Lecture 8: Generating random variables

CHAPTER 3 CALL CENTER QUEUING MODEL WITH LOGNORMAL SERVICE TIME DISTRIBUTION

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

Exponential Distribution

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

Maximum Likelihood Estimation

Two-Stage Stochastic Linear Programs

Optimal order placement in a limit order book. Adrien de Larrard and Xin Guo. Laboratoire de Probabilités, Univ Paris VI & UC Berkeley

Monte Carlo Methods in Finance

2.3 Convex Constrained Optimization Problems

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

Competitive Analysis of QoS Networks

Decision Theory Rational prospecting

Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs

Optimizing Call Center Staffing using Simulation and Analytic Center Cutting Plane Methods

Generating Random Variables and Stochastic Processes

BRAESS-LIKE PARADOXES FOR NON-COOPERATIVE DYNAMIC LOAD BALANCING IN DISTRIBUTED COMPUTER SYSTEMS

MAS2317/3317. Introduction to Bayesian Statistics. More revision material

Appendix: Simple Methods for Shift Scheduling in Multi-Skill Call Centers

Model-based Synthesis. Tony O Hagan

Stochastic Models for Inventory Management at Service Facilities

UNIT 2 QUEUING THEORY

Exploratory Data Analysis

The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method

MAS108 Probability I

ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL

Is Truck Queuing Productive? Study of truck & shovel operations productivity using simulation platform MineDES

Principle of Data Reduction

Probabilistic Models for Big Data. Alex Davies and Roger Frigola University of Cambridge 13th February 2014

SIMULATION-BASED CUTTING PLANE METHODS FOR OPTIMIZATION OF SERVICE SYSTEMS

AI: A Modern Approach, Chpts. 3-4 Russell and Norvig

Process simulation. Enn Õunapuu

HT2015: SC4 Statistical Data Mining and Machine Learning

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

A Stochastic Programming Model for Scheduling Call Centers with Global Service Level Agreements

1 Sufficient statistics

Chapter 1. Introduction

A joint chance-constrained programming approach for call center workforce scheduling under uncertain call arrival forecasts

CITY UNIVERSITY LONDON. BEng Degree in Computer Systems Engineering Part II BSc Degree in Computer Systems Engineering Part III PART 2 EXAMINATION

Machine Learning.

The Real Business Cycle model

To give it a definition, an implicit function of x and y is simply any relationship that takes the form:

The Master s Degree with Thesis Course Descriptions in Industrial Engineering

Lecture 7: Continuous Random Variables

Static Program Transformations for Efficient Software Model Checking

CHAPTER 7 SECTION 5: RANDOM VARIABLES AND DISCRETE PROBABILITY DISTRIBUTIONS

TIETS34 Seminar: Data Mining on Biometric identification

Statistics 104: Section 6!

Hydrodynamic Limits of Randomized Load Balancing Networks

Chapter 8 Detailed Modeling

Notes on Continuous Random Variables

Monte Carlo testing with Big Data

Generating Random Variables and Stochastic Processes

Analysis of a Production/Inventory System with Multiple Retailers

Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh

Master s Theory Exam Spring 2006

Load Balancing and Switch Scheduling

Real-Time Scheduling 1 / 39

Section 3.7. Rolle s Theorem and the Mean Value Theorem. Difference Equations to Differential Equations

Gambling and Data Compression

Designing a learning system

PS 271B: Quantitative Methods II. Lecture Notes

1 The EOQ and Extensions

CSCI567 Machine Learning (Fall 2014)

Numerisches Rechnen. (für Informatiker) M. Grepl J. Berger & J.T. Frings. Institut für Geometrie und Praktische Mathematik RWTH Aachen

Java Modules for Time Series Analysis

Homework set 4 - Solutions

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

constraint. Let us penalize ourselves for making the constraint too big. We end up with a

Nearest Neighbour Algorithms for Forecasting Call Arrivals in Call Centers

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)

LECTURE 16. Readings: Section 5.1. Lecture outline. Random processes Definition of the Bernoulli process Basic properties of the Bernoulli process

Machine Learning in Spam Filtering

Finding Liveness Errors with ACO

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Transcription:

Useful Recent Trends in Simulation Methodology Shane Henderson Cornell University Thanks: NSF DMI 0400287 1 e.g., Inbound call centre staffing Min staffing costs s/t customers happy enough Min c T x s/t g i (x) 0, i = 1,, num periods Max min i g i (x) s/t c T x B From Simulation Model Simulation Optimization 2 1

Inbound Call Centre Staffing What is the arrival rate of customers? Input Uncertainty Conversations with HR folks suggests dependence of call volumes between periods Dependence Modeling 3 Outline Part I: Simulation Optimization Discuss, argue, argue Part II: Input Uncertainty Discuss, argue, argue Part III: Dependence Modeling Run out of time / discuss, argue, argue Caveat Emptor: I will be giving my views on these subjects, which are diametrically opposed to common sense and good judgement. 4 2

Part I Simulation Optimization 5 The Generic Problem Min f(x) s/t g(x) 0 Here f(x) ) = E f(x, Y) g(x) ) = E g(x, Y) Example 1: Newsvendor Y = demand x = amount to stock f(x, Y) ) = cx - s min(y, x) v max(x Y,, 0) 6 3

Example 2: Call Center Min staff costs, s/t satisfactory service x i = number of staff working shift i Y = interarrival,, service, abandonment times etc for one day s operation f(x) ) = c T x = staffing cost g i (x, Y) ) = Num happy customers 0.8 Num arrivals 7 Example 3, 4 Example 3: Ambulance base location x = vector of ambulance base locations Y = call locations, times, scene times etc over time horizon f(x, Y) ) = fraction of calls reached in 10 mins Example 4: Ambulance redeployment x = parameters of redeployment policy Y,, f = same as in Example 3 8 4

Sample Average Approximation AKA external sampling, sample path opt, stochastic counterpart. Strong links with retrospective optimization Replace E( ) with sample average of Fix the random samples beforehand Apply a deterministic optimization algorithm Doesn t always work some weird counterexamples 9 Example: Newsvendor f(x, Y) ) = cx - s min(y, x) v max(x Y,, 0) Generate demands Y 1,, Y n f n is piecewise linear and convex Can apply a convex optimization code 10 5

SAA General Results Lots known about x n, the optimal solution Under reasonable conditions, including unique minimum: If multiple optima, related results hold 11 Gradient Estimation Derivatives of f n (x) ) boost computation Same thing as IPA derivatives Sometimes need extra smoothing when f n is not differentiable Easily found for newsvendor. What about harder problems? 12 6

Example 3: Base Location (Mason) f n not continuous, but is piecewise differentiable E f n f But does that matter? 13 Stochastic Approximation Time-honoured approach. Dates to 50 s Let G(x) ) be estimate of f(x) x n+1 = x n a n G(x n ) {a n } is gain (step length) sequence Essentially steepest descent Works well only if choose a n s carefully Can prove lots of theorems, but Use only if SAA isn t efficient 14 7

Selecting the Best System Finite number of x s Allows us to make statements like P(x n is truly best of x s visited) 0.95 Use after search procedure is complete Clean up afterwards Additional runs will probably be needed 15 Bird s-eye View Apply SAA if good optimization algorithms available for SAA problem Usually need structured problem IPA (true) gradients help a lot, not essential Requires careful control of random number streams Samples don t need to be iid can apply variance reduction methods If deterministic problems are tough, then anything goes. (Random search, metaheuristics) Selection of the best can clean up afterwards 16 8

SAA Where to Find Out More Alex Shapiro (GA Tech), esp. MC sampling methods chapter in Handbook of Stochastic Programming, and Linderoth,, Shapiro and Wright paper Atlason, Epelman,, Henderson - Annals of OR, Man Sci submission discrete problems hard! Gradient estimation: Glasserman,, Fu and Hu books, L Ecuyer papers, reviews by Michael Fu Selecting the best: Chapter by Kim and Nelson in forthcoming Handbook of Simulation 17 Part II Input Uncertainty 18 9

The Bruce Lee bakery Open from 6am till 3pm every day Service times i.i.d., exp(µ) How many staff are needed so that 90% of customers wait 1 minute in queue? Poisson arrival process, rate Λ i on day i (Λ : i i 0) i.i.d. G(α, β) 19 The Bruce Lee bakery Similar for E S 1 so standard approach works 20 10

The Steve Russell wine store Open from 11am till 8pm every day Service times i.i.d., exp(µ) How many staff are needed so that 90% of customers wait 1 minute in queue? Poisson arrival process, rate λ Don t know λ Uncertainty in λ well modeled as G(α, β) 21 The Steve Russell wine store Performance measure still E S 1 / E N 1 E N 1 = 9 λ =? Which value of λ to use? Same problem with E S 1 What should we do? 22 11

Terminology Simulation model needs input distributions and parameters Model uncertainty: which family? Parameter uncertainty: what parameters? Input uncertainty: Both problems 23 Why do we care? M/M/1 /1 queue, arrival rate λ 0 = 9 (unknown), service rate µ 0 = 10 Expected time in system is f (λ 0 ) where True performance = f (λ ) = 1 0 24 12

Estimating λ 0 Assume this holds exactly Possibly negative ignore Possibly bigger than µ 0 Assume perfect simulation that reports 25 The induced distribution of 26 13

Back to the wine store By ignoring input uncertainty, we can make one of two errors: Overestimate λ 0 : Hire too many staff Super service, expensive Underestimate λ 0 : Hire too few staff Poor service, cheap Not so bad at a wine store, but what about at a 911 call center? 27 What should we do about it? Any method should be Transparent to users Statistically valid Easily implemented Computationally efficient Approach 0: Do nothing OK so long as users understand that the simulation analyzes the chosen system a sense of model bias is obtained Sensitivity analysis, uncertainty analysis 28 14

Cheng and Holland 03 Suppose θ = (θ( 1, θ 2 ) (e.g., arrival, service rates) θ 2 Simulate here (L b, U b ) f increases And here (L s, U s ) θ 1 (L s, U b ) 29 Bayesian methods Chick (1997, 1999, 2000, 2001) Chick and Ng (2002) Ng and Chick (2001) Zouaoui and Wilson (2001a, 2001b, 2003a, 2003b) Lots of work on Bayesian Model Average 30 15

Induced distribution methods Study distribution of Distribution function: Lee (1998), Lee and Glynn (1999, 2003) Mean: Andradottir and Glynn (2003) Density: Steckley and Henderson (2003), Steckley (2005) Helton: G(x; m, θ) ) = P(X 1 (m, θ) x) Look at distribution of 31 Bird s Eye View and More Info Cheng and Holland for small numbers of parameters Induced distribution if you can get your head around the idea Bayesian model average once the tools are automated and in software In the meantime: sensitivity and uncertainty analysis More info: Henderson tutorial, WSC 03 32 16

Part III Modeling Dependence 33 Dependence Some of the random variables in the simulation are related Can dramatically alter performance, decisions based on model # arrivals in morning vs afternoon in call centre Wind fields in yacht match race simulation Travel speeds of ambulances 34 17

Modeling Philosophy What needs to be modeled? Use simple models to help decide. 1. Capture the physics 2. Use reasonable joint distribution with well understood pluses and minuses 3. Use partially specified distributions 35 Call Centre Arrivals Palm-Khintchine thm suggests arrivals follow a nonhomogeneous Poisson process # arrivals in a period is Poisson different time intervals are independent Call centre HR folks and data: # arrivals has a bigger variance than Poisson busier than usual at 9am busier at 10am Uh oh 36 18

Randomly Varying Arrival Rates Vijay Mehrotra,, Sam Steckley One explanation: Arrival rate function is random Once arrival rate function realized, arrivals follow a NHPP with that rate function Why is this at least plausible? Mechanism generating arrivals can vary due to weather, promotions, etc Trying to capture the unknown physics 37 Yacht Match Race Simulation Wind direction Andy Philpott,, Hamish Sheild,, U of Auckland 2 yachts Design & tactical Qs Model wind speed and direction on a grid Spatial/temporal dependence We use a vector AR Well-understood joint distribution 38 19

Vector AR Process 39 Ambulance Travel Speeds Call locations are random So are travel speeds! Traffic lights Weather (visibility, road conditions) Congestion Should we explicitly model random travel speeds? 40 20

41 Responding From Base Randomness in travel speeds can improve performance! 42 21

What About Queueing Effects? Service time of a call = travel + nontravel Travel time = distance / speed and speed depends on macro and micro factors Macro factors affect all calls, so average of service time changes Micro factors influence individual calls, so variability of service time changes Looked at M/G/1 using transform inversion and M/G/ models 43 Fraction on time Size Matters 5 ambulances 30 ambulances 80% Utilization 44 22

Random Travel Speeds Help a bit if calls are clustered near the boundary of reachable calls Dependence structure matters: One form increases variability. The other increases mean service time Hurt heavily utilized fleets more than lightly-utilized fleets 45 More Info Henderson tutorial at WSC 05 discusses these examples in more depth Biller and Ghosh chapter in forthcoming Handbook of Simulation Prediction: Spatial/temporal models will become more common need some convenient model classes 46 23

Last Words Are you a tourist wanting to hook up with simulation folks? INFORMS Simulation Society business meeting tonight 6:15pm in the Taylor room Are you a colleague who thinks I misrepresented things? Speak now! Are you a researcher looking for a new area? Sim Opt in an application setting is a great area! 47 24