MIN-MAX CONFIDENCE INTERVALS

Size: px
Start display at page:

Download "MIN-MAX CONFIDENCE INTERVALS"

Transcription

1 MIN-MAX CONFIDENCE INTERVALS Johann Christoph Strelen Rheinische Friedrich Wilhelms Universität Bonn Römerstr. 164, Bonn, Germany July 2004

2 STOCHASTIC SIMULATION Random input = random output two different runs of the same model different output. Due to the stochastic nature of the simulation results, careful statistic analysis must be done for the correct interpretation of calculated values. If this is omitted, there is a significant probability of making erroneous inferences about the system under study. 2

3 CONFIDENCE INTERVALS Unknown parameter θ is to be estimated from an output process X 1, X 2,..., X n Confidence interval I(X 1,..., X n ) = [L(X 1,..., X n ), U(X 1,..., X n )] such that P {θ I(X 1,..., X n )} = 1 α Confidence level 1 α where the probability α, small, is given. Width U(X 1,..., X n ) L(X 1,..., X n ) should be small. 3

4 ESTIMATORS Functions T (X 1,..., X n ) for the estimation of the unknown parameter θ such that given an output x 1,..., x n one may expect T (x 1,..., x n ) θ E[T (X 1,..., X n )] = θ: T unbiased lim n E[T (X 1,..., X n )] = θ: T asymptotically unbiased 4

5 Statistical Theory For the construction of the confidence interval, probability distributions of the interval bounds, U(X 1,..., X n ) and L(X 1,..., X n ), are determined. Usual assumptions: the X 1,..., X n are independent random variables they are identically distributed often: they are normally distributed 5

6 CLASSICAL CONFIDENCE INTERVALS Ȳ ± t n 1,1 α/2 S2/n Sample (Y 1,..., Y n ) of independent, normally distributed random variables Confidence level 1 α, 0 < α < 1 Sample mean Ȳ = (Y Y n )/n Sample variance S2 = (Y Y 2 n )/(n 1) n n 1Ȳ 2 (1 α/2)-quantile of the Student distribution with n 1 degrees of freedom t n 1,1 α/2 But: In simulation, mostly the assumptions of the statistical theory are not fulfilled 6

7 Resort Central limit theorem: Y Y n is nearly normally distributed if the sample Y 1,..., Y n is IID and if n is large Independent replications of the simulation with different random number streams - the estimators in these runs are independent Grouping consecutive results of a long simulation run into batches - considered to be (nearly) independent Evaluating only the steady state phase of each simulation run - ignoring the transient phase 7

8 ACCURACY Inaccurate confidence intervals not unusual in simulation, e.g. assumed confidence level 90%, coverage only 80%. This means: In many different simulations, only approximately 80% of the confidence intervals contain the real value. Comparative numerical studies: More elaborated techniques (regenerative method, autoregressive processes, spectral estimation method, standardized time series method) may be less accurate than batch-means method and replication/deletion method. Median confidence intervals may be even more accurate. In long simulation runs, the accuracy of the confidence intervals is better. 8

9 Min-Max Confidence Intervals (MMCI), Median Confidence Intervals (MCI) A new confidence interval (CI) technique for simulation results Easy to apply Accurate Generally applicable 9

10 Main Features Easy to obtain: w independent replications (simulation runs) or a single simulation run with w subsequent phases for batches of data typically w = 5 or 6. The variance of the estimator is not used, correlated output is implicitly considered. Hence, a serious problem is omitted which usually arises when confidence intervals for simulation results are derived. Even if the variance does not exist, an MCI can be constructed whereas a classical CI cannot. Sequential procedure: If a median confidence interval (MCI) is too wide, given a confidence level, it can be narrowed: Each of the replications are augmented, beginning with the last state. Similar for batches. 10

11 If a measure is estimated with a function of some estimators, an MCI can be given. Example: Λ(n) estimates the throughput of a queue and W (n) the mean waitig time. Then the product Λ(n)W (n) estimates the mean number of customers in the queue (Littles Formula). For some samples of independent random variables, e.g. normally distributed, we found MCIs which are sometimes slightly wider than usual CIs. But such simple statistic occurs seldom in simulation. Here the output is usually dependent, and the distribution is unknown. Under these circumstances, classical CIs are usually too narrow, the confidence level is not realistic, the CIs too often do not cover the real unknown value, they are only approximate. MCIs are more accurate. 11

12 The MCI technique is exact when the median and the mean of an estimator coincide. This holds for symmetrical distributions the most important one in simulation is the normal distribution. Due to the central limit theorem and long simulation runs with fast computers many estimators are nearly normally distributed. But in principle, the MCI technique is not restricted to the case median = mean. In this general case, one must know a single value of the estimator distribution function F θ (x), namely the probability F = F θ (θ) where θ is the unknown parameter. Not each confidence level is possible, only the values 1 F w (1 F ) w, w = 2, 3,... Here, w is the number of independent replications or of batches of data. In the special case median = mean, F = 0.5 holds, and the possible confidence levels are 50%, 75%, 87.5%, 93.75%, %, 98.4%, 99.2%, 99.6%, 99.8%, 99.9%,... 12

13 The Basic Principle Sample X 1,1,..., X 1,m of random variables, one run of a steady-state simulation or of n terminating runs. θ unknown parameter to estimate. T (X 1,1,..., X 1,m ) estimator, distribution function F θ (x). Novel kind of confidence interval [ T min, T max ) (1) where and T min = min 1 i w T i, T max = max 1 i w T i, T i = T (X i,1,..., X i,m ), i = 1,..., w estimators for w independent replications X i,1,..., X i,m of the sample X 1,1,..., X 1,m. 13

14 Theorem 1 The interval (1) is a confidence interval for the parameter θ with the confidence level 1 F w (1 F ) w, i.e. P {T min θ < T max } = 1 F w (1 F ) w holds where F = F θ (θ), the value of the estimator distribution function at θ. The Most Important Special Case: Mean = Median Here, the unknown parameter is the median of the estimator, F θ (θ) = 1/2 This holds for unbiased estimators and symmetrical distributions, e.g. the estimator is normally distributed. Then for the confidence interval P {T min θ < T max } = w 1 holds, and the possible confidence levels are w 1, w = 2, 3,... 14

15 Batch Median Confidence Intervals for steady state statistics. We applied the idea of the batch means method: Grouping output data into batches and assuming these batches to being independent. A single simulation run: First the transient phase, then w phases for w batches of output data. From each batch one obtaines an estimate ˆT i, i = 1,..., w. The batch mean confidence interval (BMCI) is [ min 1 i w ˆT i, max 1 i w ˆT i ). 15

16 Interesting application where F can be calculated: Order statistics as estimates for quantiles. Consider samples X 1,..., X n and the according ordered sequence X (1),..., X (n), X (i) X (j) if i < j, where the X i are IID with the strictly increasing distribution function F (x). The q-quantile θ = x q, q (0, 1), F (x q ) = q, is estimated by X (r), r {1, 2,..., n}. Let F θ (x) denote the distribution function of the estimator, namely X (r). 16

17 Here, F = F θ (x) is known: Theorem 2 If the q-quantile x q is estimated by X (r), the min-max confidence interval (1) has precisely the confidence level of theorem 1 with F = n i=r n i q i (1 q) n i. (2) Remarks 1. Here the value F = F θ (x q ) is independent of the actual distribution function of the sample elements X i. 2. Theorem 2 is not useful for the simulation of the extremes, q = 0 or q = 1. Here one gets the confidence level Usually, k qn is chosen. 17

18 Corollary If the sample size n is odd, r = n/2 and q = 0.5, i.e. the median is estimated, F = 0.5 holds. 18

19 Confidence Intervals in Simulation are Usually Approximate Assumptions are not satisfied, in general What means approximate confidence The distribution of the estimator (normal, interval? If for a parameter of a simulation Student) Independency of the r.v. in the sample For some methods other assumptions For median confidence intervals the assumptions are weaker: Only symmetry of the distribution of the estimator model, many confidence intervals are calculated in many simulations, the real value lies in some of them, in the others it does not. The coverage C is the fraction of runs where it is within. If the limit of this coverage equals the confidence level CL = 1 α, the confidence interval technique is exact, otherwise approximate: The confidence level is not reached, Independency of the replications, not of CL C. the r.v. within them 19

20 Numerical Experience Many simulation studies. Comparison of classical confidence interval methods with median confidence intervals or with batch median confidence intervals. Each Study: Many independent simulation experiments for the estimation of the coverage of each considered confidence interval technique. Each simulation experiment: w= 5 independent replications for median confidence intervals (MCI) and for the replication/deletion method or w= 5 batches for batch median confidence intervals (BMCI) and for the batch means method. w= 5 implies a confidence level CL = 93.75% for the MCIs and BMCIs. Measure for the accuracy: The error CL C = confidence level observed coverage. 20

21 1. M/M/1 Queueing System: Waiting Times (Delays) Law and Kelton comparative study for different well known methods for confidence Batch Means intervals. Standardized Time Series Utilization 0.8; known to be statistically Spectrum Analysis difficult. Autoregressive Method independent simulation experiments for Regenerative Method Classical each run length n and each CI method, n Regenerative Method Jackknife = 2560 delays e.g. coverage C. We conducted an according simulation Batch Median Confidence Intervals Errors CL C study with the same model and the same run lengths including batch median confidence intervals (BMCI). Error means confidence level CL = 90%, observed coverage C = 75.5%, e.g. 21

22 2. M/M/1 Queueing System Comparison of the replication/deletion method (RD) and median confidence intervals (MCI). Low and high utilization (ρ = 0.25 and 0.8). Short and long simulation runs. ρ Run replication/deletion median confidence intervals Short Long Short Long Errors CL C Long runs: Both methods good Short runs: MCIs slightly better 22

23 3. M/M/1 Queueing System, Ratios of Estimators The same M/M/1-model as before. Comparison of jackknife intervals and median confidence intervals for the mean delay, Ŵ (r), as ratio of ˆQ/ˆλ of the mean number of jobs in the waiting room and the mean throughput. ρ Run RD, Jackknife Median Confidence Intervals Short Long Short Long Errors CL C Median confidence intervals are more accurate. 23

24 4. Pareto distribution We are interested in parameters of heavy-tailed Pareto distributions, F (x) = 1 x a, 0 < a <= 2, x 1, with expectation a/(a 1) for a > 1, median 2 1/a, the variance does not exist simulation experiments, each with sample size n = The classical confidence interval for the expectation does not exist. Median confidence intervals for the expectation: a CL C good bad 1.1 not acceptable Median confidence intervals for the order statistic for the median: a Confidence Interval Median CI Errors CL C

25 5. Reliability Model The model consists of three components and will function as long as component 1 works and either component 2 or 3 works. G i is the time to failure of component i, i = 1, 2, 3, and G = min{g 1, max{g 2, G 3 }} the time to failure of the whole system. The random variables G i are independent, and each G i has a Weibull distribution F (x) = 1 exp( x), x > 0. The estimator of the expectation of G has a very skewed and nonnormal distribution, all confidence intervals are quite inaccurate, for small sample sizes simulation experiments, each with sample size n = 5 or 40. n Classical CI Median CI Errors CL C 25

26 Potential Further Development of the Technique The assumption of symmetry of the estimator distribution can be omitted, even the estimator may be biased, only F = F θ (θ), the value of the estimator distribution function at θ, the unknown parameter, must be known. Then we speak of min-max confidence intervals (MMCI). They are exact if the w replications are independent, their confidence level is CL = 1 F w (1 F ) w. Crucial problem: This value F θ (θ). We do not know an adequate method for estimating it efficiently. 26

27 But this MMCI idea works, we tried a brute-force procedure: Very long and expensive simulations for an empirical distribution of the r.v. G of example 5, then the distribution function ˆF θ (x) of the estimator with convolution, and with an estimation of the unknown parameter, ˆθ, we obtained ˆF = ˆF θ (ˆθ) and an estimate ĈL. n Coverage ĈL Coverages and Estimated Confidence Levels Accurate, isn t it? But so not practicable 27

Confidence Intervals for Cp

Confidence Intervals for Cp Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

The Variability of P-Values. Summary

The Variability of P-Values. Summary The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 boos@stat.ncsu.edu August 15, 2009 NC State Statistics Departement Tech Report

More information

How To Find The Optimal Base Stock Level In A Supply Chain

How To Find The Optimal Base Stock Level In A Supply Chain Optimizing Stochastic Supply Chains via Simulation: What is an Appropriate Simulation Run Length? Arreola-Risa A 1, Fortuny-Santos J 2, Vintró-Sánchez C 3 Abstract The most common solution strategy for

More information

Non Parametric Inference

Non Parametric Inference Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Point and Interval Estimates

Point and Interval Estimates Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Online Appendices to the Corporate Propensity to Save

Online Appendices to the Corporate Propensity to Save Online Appendices to the Corporate Propensity to Save Appendix A: Monte Carlo Experiments In order to allay skepticism of empirical results that have been produced by unusual estimators on fairly small

More information

From the help desk: Bootstrapped standard errors

From the help desk: Bootstrapped standard errors The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution

More information

Aachen Summer Simulation Seminar 2014

Aachen Summer Simulation Seminar 2014 Aachen Summer Simulation Seminar 2014 Lecture 07 Input Modelling + Experimentation + Output Analysis Peer-Olaf Siebers pos@cs.nott.ac.uk Motivation 1. Input modelling Improve the understanding about how

More information

GENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS

GENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS GENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS Feras Nassaj Johann Christoph Strelen Rheinische Friedrich-Wilhelms-Universitaet Bonn Institut fuer Informatik IV Roemerstr. 164, 53117 Bonn, Germany

More information

Java Modules for Time Series Analysis

Java Modules for Time Series Analysis Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series

More information

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails 12th International Congress on Insurance: Mathematics and Economics July 16-18, 2008 A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails XUEMIAO HAO (Based on a joint

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

1. (First passage/hitting times/gambler s ruin problem:) Suppose that X has a discrete state space and let i be a fixed state. Let

1. (First passage/hitting times/gambler s ruin problem:) Suppose that X has a discrete state space and let i be a fixed state. Let Copyright c 2009 by Karl Sigman 1 Stopping Times 1.1 Stopping Times: Definition Given a stochastic process X = {X n : n 0}, a random time τ is a discrete random variable on the same probability space as

More information

Confidence Intervals for Cpk

Confidence Intervals for Cpk Chapter 297 Confidence Intervals for Cpk Introduction This routine calculates the sample size needed to obtain a specified width of a Cpk confidence interval at a stated confidence level. Cpk is a process

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

Need for Sampling. Very large populations Destructive testing Continuous production process

Need for Sampling. Very large populations Destructive testing Continuous production process Chapter 4 Sampling and Estimation Need for Sampling Very large populations Destructive testing Continuous production process The objective of sampling is to draw a valid inference about a population. 4-

More information

Statistical Rules of Thumb

Statistical Rules of Thumb Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

5.1 Identifying the Target Parameter

5.1 Identifying the Target Parameter University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

More information

Review of basic statistics and the simplest forecasting model: the sample mean

Review of basic statistics and the simplest forecasting model: the sample mean Review of basic statistics and the simplest forecasting model: the sample mean Robert Nau Fuqua School of Business, Duke University August 2014 Most of what you need to remember about basic statistics

More information

Exact Confidence Intervals

Exact Confidence Intervals Math 541: Statistical Theory II Instructor: Songfeng Zheng Exact Confidence Intervals Confidence intervals provide an alternative to using an estimator ˆθ when we wish to estimate an unknown parameter

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

AUTOMATING DISCRETE EVENT SIMULATION OUTPUT ANALYSIS AUTOMATIC ESTIMATION OF NUMBER OF REPLICATIONS, WARM-UP PERIOD AND RUN LENGTH.

AUTOMATING DISCRETE EVENT SIMULATION OUTPUT ANALYSIS AUTOMATIC ESTIMATION OF NUMBER OF REPLICATIONS, WARM-UP PERIOD AND RUN LENGTH. Proceedings of the 2009 INFORMS Simulation Society Research Workshop L.H. Lee, M. E. Kuhl, J. W. Fowler and S.Robinson, eds. AUTOMATING DISCRETE EVENT SIMULATION OUTPUT ANALYSIS AUTOMATIC ESTIMATION OF

More information

Towards a Reliable Statistical Oracle and its Applications

Towards a Reliable Statistical Oracle and its Applications Towards a Reliable Statistical Oracle and its Applications Johannes Mayer Abteilung Angewandte Informationsverarbeitung Universität Ulm mayer@mathematik.uni-ulm.de Abstract It is shown how based on the

More information

1 Simulating Brownian motion (BM) and geometric Brownian motion (GBM)

1 Simulating Brownian motion (BM) and geometric Brownian motion (GBM) Copyright c 2013 by Karl Sigman 1 Simulating Brownian motion (BM) and geometric Brownian motion (GBM) For an introduction to how one can construct BM, see the Appendix at the end of these notes A stochastic

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Bootstrapping Big Data

Bootstrapping Big Data Bootstrapping Big Data Ariel Kleiner Ameet Talwalkar Purnamrita Sarkar Michael I. Jordan Computer Science Division University of California, Berkeley {akleiner, ameet, psarkar, jordan}@eecs.berkeley.edu

More information

Supplement to Call Centers with Delay Information: Models and Insights

Supplement to Call Centers with Delay Information: Models and Insights Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290

More information

Chapter 6: Point Estimation. Fall 2011. - Probability & Statistics

Chapter 6: Point Estimation. Fall 2011. - Probability & Statistics STAT355 Chapter 6: Point Estimation Fall 2011 Chapter Fall 2011 6: Point1 Estimat / 18 Chap 6 - Point Estimation 1 6.1 Some general Concepts of Point Estimation Point Estimate Unbiasedness Principle of

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

Statistics 104: Section 6!

Statistics 104: Section 6! Page 1 Statistics 104: Section 6! TF: Deirdre (say: Dear-dra) Bloome Email: dbloome@fas.harvard.edu Section Times Thursday 2pm-3pm in SC 109, Thursday 5pm-6pm in SC 705 Office Hours: Thursday 6pm-7pm SC

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

Uncertainty quantification for the family-wise error rate in multivariate copula models

Uncertainty quantification for the family-wise error rate in multivariate copula models Uncertainty quantification for the family-wise error rate in multivariate copula models Thorsten Dickhaus (joint work with Taras Bodnar, Jakob Gierl and Jens Stange) University of Bremen Institute for

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk

More information

Confidence Intervals for Spearman s Rank Correlation

Confidence Intervals for Spearman s Rank Correlation Chapter 808 Confidence Intervals for Spearman s Rank Correlation Introduction This routine calculates the sample size needed to obtain a specified width of Spearman s rank correlation coefficient confidence

More information

Analysis of a Production/Inventory System with Multiple Retailers

Analysis of a Production/Inventory System with Multiple Retailers Analysis of a Production/Inventory System with Multiple Retailers Ann M. Noblesse 1, Robert N. Boute 1,2, Marc R. Lambrecht 1, Benny Van Houdt 3 1 Research Center for Operations Management, University

More information

Stochastic Inventory Control

Stochastic Inventory Control Chapter 3 Stochastic Inventory Control 1 In this chapter, we consider in much greater details certain dynamic inventory control problems of the type already encountered in section 1.3. In addition to the

More information

Internet Appendix to False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas

Internet Appendix to False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas Internet Appendix to False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas A. Estimation Procedure A.1. Determining the Value for from the Data We use the bootstrap procedure

More information

Life Table Analysis using Weighted Survey Data

Life Table Analysis using Weighted Survey Data Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using

More information

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators... MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................

More information

The Assumption(s) of Normality

The Assumption(s) of Normality The Assumption(s) of Normality Copyright 2000, 2011, J. Toby Mordkoff This is very complicated, so I ll provide two versions. At a minimum, you should know the short one. It would be great if you knew

More information

Computational Statistics and Data Analysis

Computational Statistics and Data Analysis Computational Statistics and Data Analysis 53 (2008) 17 26 Contents lists available at ScienceDirect Computational Statistics and Data Analysis journal homepage: www.elsevier.com/locate/csda Coverage probability

More information

How To Price Garch

How To Price Garch 2011 3rd International Conference on Information and Financial Engineering IPEDR vol.12 (2011) (2011) IACSIT Press, Singapore A Study on Heston-Nandi GARCH Option Pricing Model Suk Joon Byun KAIST Business

More information

LOGIT AND PROBIT ANALYSIS

LOGIT AND PROBIT ANALYSIS LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y

More information

Panel Data Econometrics

Panel Data Econometrics Panel Data Econometrics Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans University of Orléans January 2010 De nition A longitudinal, or panel, data set is

More information

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010 Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte

More information

Gambling and Data Compression

Gambling and Data Compression Gambling and Data Compression Gambling. Horse Race Definition The wealth relative S(X) = b(x)o(x) is the factor by which the gambler s wealth grows if horse X wins the race, where b(x) is the fraction

More information

Chapter 7 Section 1 Homework Set A

Chapter 7 Section 1 Homework Set A Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the

More information

Population Mean (Known Variance)

Population Mean (Known Variance) Confidence Intervals Solutions STAT-UB.0103 Statistics for Business Control and Regression Models Population Mean (Known Variance) 1. A random sample of n measurements was selected from a population with

More information

Multivariate Logistic Regression

Multivariate Logistic Regression 1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation

More information

The Normal distribution

The Normal distribution The Normal distribution The normal probability distribution is the most common model for relative frequencies of a quantitative variable. Bell-shaped and described by the function f(y) = 1 2σ π e{ 1 2σ

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA

ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA m ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA Jiahe Qian, Educational Testing Service Rosedale Road, MS 02-T, Princeton, NJ 08541 Key Words" Complex sampling, NAEP data,

More information

Estimation and Confidence Intervals

Estimation and Confidence Intervals Estimation and Confidence Intervals Fall 2001 Professor Paul Glasserman B6014: Managerial Statistics 403 Uris Hall Properties of Point Estimates 1 We have already encountered two point estimators: th e

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Simple Random Sampling

Simple Random Sampling Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for

More information

Alessandro Birolini. ineerin. Theory and Practice. Fifth edition. With 140 Figures, 60 Tables, 120 Examples, and 50 Problems.

Alessandro Birolini. ineerin. Theory and Practice. Fifth edition. With 140 Figures, 60 Tables, 120 Examples, and 50 Problems. Alessandro Birolini Re ia i it En ineerin Theory and Practice Fifth edition With 140 Figures, 60 Tables, 120 Examples, and 50 Problems ~ Springer Contents 1 Basic Concepts, Quality and Reliability Assurance

More information

Lecture 8. Confidence intervals and the central limit theorem

Lecture 8. Confidence intervals and the central limit theorem Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of

More information

Random access protocols for channel access. Markov chains and their stability. Laurent Massoulié.

Random access protocols for channel access. Markov chains and their stability. Laurent Massoulié. Random access protocols for channel access Markov chains and their stability laurent.massoulie@inria.fr Aloha: the first random access protocol for channel access [Abramson, Hawaii 70] Goal: allow machines

More information

Analysis of a production-inventory system with unreliable production facility

Analysis of a production-inventory system with unreliable production facility Analysis of a production-inventory system with unreliable production facility Katrien Ramaekers Gerrit K Janssens Transportation Research Institute Hasselt University - Campus Diepenbeek Wetenschapspark

More information

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium

More information

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Chapter 4: Vector Autoregressive Models

Chapter 4: Vector Autoregressive Models Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...

More information

Maximum likelihood estimation of mean reverting processes

Maximum likelihood estimation of mean reverting processes Maximum likelihood estimation of mean reverting processes José Carlos García Franco Onward, Inc. jcpollo@onwardinc.com Abstract Mean reverting processes are frequently used models in real options. For

More information

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x, Computing: an indispensable tool or an insurmountable hurdle? Iain Currie Heriot Watt University, Scotland ATRC, University College Dublin July 2006 Plan of talk General remarks The professional syllabus

More information

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9

More information

M/M/1 and M/M/m Queueing Systems

M/M/1 and M/M/m Queueing Systems M/M/ and M/M/m Queueing Systems M. Veeraraghavan; March 20, 2004. Preliminaries. Kendall s notation: G/G/n/k queue G: General - can be any distribution. First letter: Arrival process; M: memoryless - exponential

More information

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses Michael R. Powers[ ] Temple University and Tsinghua University Thomas Y. Powers Yale University [June 2009] Abstract We propose a

More information

An Introduction to Extreme Value Theory

An Introduction to Extreme Value Theory An Introduction to Extreme Value Theory Petra Friederichs Meteorological Institute University of Bonn COPS Summer School, July/August, 2007 Applications of EVT Finance distribution of income has so called

More information

Universally Accepted Lean Six Sigma Body of Knowledge for Green Belts

Universally Accepted Lean Six Sigma Body of Knowledge for Green Belts Universally Accepted Lean Six Sigma Body of Knowledge for Green Belts The IASSC Certified Green Belt Exam was developed and constructed based on the topics within the body of knowledge listed here. Questions

More information

Probability and statistics; Rehearsal for pattern recognition

Probability and statistics; Rehearsal for pattern recognition Probability and statistics; Rehearsal for pattern recognition Václav Hlaváč Czech Technical University in Prague Faculty of Electrical Engineering, Department of Cybernetics Center for Machine Perception

More information

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091) Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091) Magnus Wiktorsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February

More information

A COMPARISON OF STATISTICAL METHODS FOR COST-EFFECTIVENESS ANALYSES THAT USE DATA FROM CLUSTER RANDOMIZED TRIALS

A COMPARISON OF STATISTICAL METHODS FOR COST-EFFECTIVENESS ANALYSES THAT USE DATA FROM CLUSTER RANDOMIZED TRIALS A COMPARISON OF STATISTICAL METHODS FOR COST-EFFECTIVENESS ANALYS THAT U DATA FROM CLUSTER RANDOMIZED TRIALS M Gomes, E Ng, R Grieve, R Nixon, J Carpenter and S Thompson Health Economists Study Group meeting

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution James H. Steiger November 10, 00 1 Topics for this Module 1. The Binomial Process. The Binomial Random Variable. The Binomial Distribution (a) Computing the Binomial pdf (b) Computing

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Testing against a Change from Short to Long Memory

Testing against a Change from Short to Long Memory Testing against a Change from Short to Long Memory Uwe Hassler and Jan Scheithauer Goethe-University Frankfurt This version: January 2, 2008 Abstract This paper studies some well-known tests for the null

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

DIRECT REDUCTION OF BIAS OF THE CLASSI- CAL HILL ESTIMATOR

DIRECT REDUCTION OF BIAS OF THE CLASSI- CAL HILL ESTIMATOR REVSTAT Statistical Journal Volume 3, Number 2, November 2005, 113 136 DIRECT REDUCTION OF BIAS OF THE CLASSI- CAL HILL ESTIMATOR Authors: Frederico Caeiro Universidade Nova de Lisboa, FCTDM) and CEA,

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

CPC/CPA Hybrid Bidding in a Second Price Auction

CPC/CPA Hybrid Bidding in a Second Price Auction CPC/CPA Hybrid Bidding in a Second Price Auction Benjamin Edelman Hoan Soo Lee Working Paper 09-074 Copyright 2008 by Benjamin Edelman and Hoan Soo Lee Working papers are in draft form. This working paper

More information

Stationary random graphs on Z with prescribed iid degrees and finite mean connections

Stationary random graphs on Z with prescribed iid degrees and finite mean connections Stationary random graphs on Z with prescribed iid degrees and finite mean connections Maria Deijfen Johan Jonasson February 2006 Abstract Let F be a probability distribution with support on the non-negative

More information

Probability and Random Variables. Generation of random variables (r.v.)

Probability and Random Variables. Generation of random variables (r.v.) Probability and Random Variables Method for generating random variables with a specified probability distribution function. Gaussian And Markov Processes Characterization of Stationary Random Process Linearly

More information

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties

More information

Testing against a Change from Short to Long Memory

Testing against a Change from Short to Long Memory Testing against a Change from Short to Long Memory Uwe Hassler and Jan Scheithauer Goethe-University Frankfurt This version: December 9, 2007 Abstract This paper studies some well-known tests for the null

More information

Tests for exponentiality against the M and LM-classes of life distributions

Tests for exponentiality against the M and LM-classes of life distributions Tests for exponentiality against the M and LM-classes of life distributions B. Klar Universität Karlsruhe Abstract This paper studies tests for exponentiality against the nonparametric classes M and LM

More information

Hydrodynamic Limits of Randomized Load Balancing Networks

Hydrodynamic Limits of Randomized Load Balancing Networks Hydrodynamic Limits of Randomized Load Balancing Networks Kavita Ramanan and Mohammadreza Aghajani Brown University Stochastic Networks and Stochastic Geometry a conference in honour of François Baccelli

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

A credibility method for profitable cross-selling of insurance products

A credibility method for profitable cross-selling of insurance products Submitted to Annals of Actuarial Science manuscript 2 A credibility method for profitable cross-selling of insurance products Fredrik Thuring Faculty of Actuarial Science and Insurance, Cass Business School,

More information