MIN-MAX CONFIDENCE INTERVALS
|
|
- Gwen Gaines
- 7 years ago
- Views:
Transcription
1 MIN-MAX CONFIDENCE INTERVALS Johann Christoph Strelen Rheinische Friedrich Wilhelms Universität Bonn Römerstr. 164, Bonn, Germany July 2004
2 STOCHASTIC SIMULATION Random input = random output two different runs of the same model different output. Due to the stochastic nature of the simulation results, careful statistic analysis must be done for the correct interpretation of calculated values. If this is omitted, there is a significant probability of making erroneous inferences about the system under study. 2
3 CONFIDENCE INTERVALS Unknown parameter θ is to be estimated from an output process X 1, X 2,..., X n Confidence interval I(X 1,..., X n ) = [L(X 1,..., X n ), U(X 1,..., X n )] such that P {θ I(X 1,..., X n )} = 1 α Confidence level 1 α where the probability α, small, is given. Width U(X 1,..., X n ) L(X 1,..., X n ) should be small. 3
4 ESTIMATORS Functions T (X 1,..., X n ) for the estimation of the unknown parameter θ such that given an output x 1,..., x n one may expect T (x 1,..., x n ) θ E[T (X 1,..., X n )] = θ: T unbiased lim n E[T (X 1,..., X n )] = θ: T asymptotically unbiased 4
5 Statistical Theory For the construction of the confidence interval, probability distributions of the interval bounds, U(X 1,..., X n ) and L(X 1,..., X n ), are determined. Usual assumptions: the X 1,..., X n are independent random variables they are identically distributed often: they are normally distributed 5
6 CLASSICAL CONFIDENCE INTERVALS Ȳ ± t n 1,1 α/2 S2/n Sample (Y 1,..., Y n ) of independent, normally distributed random variables Confidence level 1 α, 0 < α < 1 Sample mean Ȳ = (Y Y n )/n Sample variance S2 = (Y Y 2 n )/(n 1) n n 1Ȳ 2 (1 α/2)-quantile of the Student distribution with n 1 degrees of freedom t n 1,1 α/2 But: In simulation, mostly the assumptions of the statistical theory are not fulfilled 6
7 Resort Central limit theorem: Y Y n is nearly normally distributed if the sample Y 1,..., Y n is IID and if n is large Independent replications of the simulation with different random number streams - the estimators in these runs are independent Grouping consecutive results of a long simulation run into batches - considered to be (nearly) independent Evaluating only the steady state phase of each simulation run - ignoring the transient phase 7
8 ACCURACY Inaccurate confidence intervals not unusual in simulation, e.g. assumed confidence level 90%, coverage only 80%. This means: In many different simulations, only approximately 80% of the confidence intervals contain the real value. Comparative numerical studies: More elaborated techniques (regenerative method, autoregressive processes, spectral estimation method, standardized time series method) may be less accurate than batch-means method and replication/deletion method. Median confidence intervals may be even more accurate. In long simulation runs, the accuracy of the confidence intervals is better. 8
9 Min-Max Confidence Intervals (MMCI), Median Confidence Intervals (MCI) A new confidence interval (CI) technique for simulation results Easy to apply Accurate Generally applicable 9
10 Main Features Easy to obtain: w independent replications (simulation runs) or a single simulation run with w subsequent phases for batches of data typically w = 5 or 6. The variance of the estimator is not used, correlated output is implicitly considered. Hence, a serious problem is omitted which usually arises when confidence intervals for simulation results are derived. Even if the variance does not exist, an MCI can be constructed whereas a classical CI cannot. Sequential procedure: If a median confidence interval (MCI) is too wide, given a confidence level, it can be narrowed: Each of the replications are augmented, beginning with the last state. Similar for batches. 10
11 If a measure is estimated with a function of some estimators, an MCI can be given. Example: Λ(n) estimates the throughput of a queue and W (n) the mean waitig time. Then the product Λ(n)W (n) estimates the mean number of customers in the queue (Littles Formula). For some samples of independent random variables, e.g. normally distributed, we found MCIs which are sometimes slightly wider than usual CIs. But such simple statistic occurs seldom in simulation. Here the output is usually dependent, and the distribution is unknown. Under these circumstances, classical CIs are usually too narrow, the confidence level is not realistic, the CIs too often do not cover the real unknown value, they are only approximate. MCIs are more accurate. 11
12 The MCI technique is exact when the median and the mean of an estimator coincide. This holds for symmetrical distributions the most important one in simulation is the normal distribution. Due to the central limit theorem and long simulation runs with fast computers many estimators are nearly normally distributed. But in principle, the MCI technique is not restricted to the case median = mean. In this general case, one must know a single value of the estimator distribution function F θ (x), namely the probability F = F θ (θ) where θ is the unknown parameter. Not each confidence level is possible, only the values 1 F w (1 F ) w, w = 2, 3,... Here, w is the number of independent replications or of batches of data. In the special case median = mean, F = 0.5 holds, and the possible confidence levels are 50%, 75%, 87.5%, 93.75%, %, 98.4%, 99.2%, 99.6%, 99.8%, 99.9%,... 12
13 The Basic Principle Sample X 1,1,..., X 1,m of random variables, one run of a steady-state simulation or of n terminating runs. θ unknown parameter to estimate. T (X 1,1,..., X 1,m ) estimator, distribution function F θ (x). Novel kind of confidence interval [ T min, T max ) (1) where and T min = min 1 i w T i, T max = max 1 i w T i, T i = T (X i,1,..., X i,m ), i = 1,..., w estimators for w independent replications X i,1,..., X i,m of the sample X 1,1,..., X 1,m. 13
14 Theorem 1 The interval (1) is a confidence interval for the parameter θ with the confidence level 1 F w (1 F ) w, i.e. P {T min θ < T max } = 1 F w (1 F ) w holds where F = F θ (θ), the value of the estimator distribution function at θ. The Most Important Special Case: Mean = Median Here, the unknown parameter is the median of the estimator, F θ (θ) = 1/2 This holds for unbiased estimators and symmetrical distributions, e.g. the estimator is normally distributed. Then for the confidence interval P {T min θ < T max } = w 1 holds, and the possible confidence levels are w 1, w = 2, 3,... 14
15 Batch Median Confidence Intervals for steady state statistics. We applied the idea of the batch means method: Grouping output data into batches and assuming these batches to being independent. A single simulation run: First the transient phase, then w phases for w batches of output data. From each batch one obtaines an estimate ˆT i, i = 1,..., w. The batch mean confidence interval (BMCI) is [ min 1 i w ˆT i, max 1 i w ˆT i ). 15
16 Interesting application where F can be calculated: Order statistics as estimates for quantiles. Consider samples X 1,..., X n and the according ordered sequence X (1),..., X (n), X (i) X (j) if i < j, where the X i are IID with the strictly increasing distribution function F (x). The q-quantile θ = x q, q (0, 1), F (x q ) = q, is estimated by X (r), r {1, 2,..., n}. Let F θ (x) denote the distribution function of the estimator, namely X (r). 16
17 Here, F = F θ (x) is known: Theorem 2 If the q-quantile x q is estimated by X (r), the min-max confidence interval (1) has precisely the confidence level of theorem 1 with F = n i=r n i q i (1 q) n i. (2) Remarks 1. Here the value F = F θ (x q ) is independent of the actual distribution function of the sample elements X i. 2. Theorem 2 is not useful for the simulation of the extremes, q = 0 or q = 1. Here one gets the confidence level Usually, k qn is chosen. 17
18 Corollary If the sample size n is odd, r = n/2 and q = 0.5, i.e. the median is estimated, F = 0.5 holds. 18
19 Confidence Intervals in Simulation are Usually Approximate Assumptions are not satisfied, in general What means approximate confidence The distribution of the estimator (normal, interval? If for a parameter of a simulation Student) Independency of the r.v. in the sample For some methods other assumptions For median confidence intervals the assumptions are weaker: Only symmetry of the distribution of the estimator model, many confidence intervals are calculated in many simulations, the real value lies in some of them, in the others it does not. The coverage C is the fraction of runs where it is within. If the limit of this coverage equals the confidence level CL = 1 α, the confidence interval technique is exact, otherwise approximate: The confidence level is not reached, Independency of the replications, not of CL C. the r.v. within them 19
20 Numerical Experience Many simulation studies. Comparison of classical confidence interval methods with median confidence intervals or with batch median confidence intervals. Each Study: Many independent simulation experiments for the estimation of the coverage of each considered confidence interval technique. Each simulation experiment: w= 5 independent replications for median confidence intervals (MCI) and for the replication/deletion method or w= 5 batches for batch median confidence intervals (BMCI) and for the batch means method. w= 5 implies a confidence level CL = 93.75% for the MCIs and BMCIs. Measure for the accuracy: The error CL C = confidence level observed coverage. 20
21 1. M/M/1 Queueing System: Waiting Times (Delays) Law and Kelton comparative study for different well known methods for confidence Batch Means intervals. Standardized Time Series Utilization 0.8; known to be statistically Spectrum Analysis difficult. Autoregressive Method independent simulation experiments for Regenerative Method Classical each run length n and each CI method, n Regenerative Method Jackknife = 2560 delays e.g. coverage C. We conducted an according simulation Batch Median Confidence Intervals Errors CL C study with the same model and the same run lengths including batch median confidence intervals (BMCI). Error means confidence level CL = 90%, observed coverage C = 75.5%, e.g. 21
22 2. M/M/1 Queueing System Comparison of the replication/deletion method (RD) and median confidence intervals (MCI). Low and high utilization (ρ = 0.25 and 0.8). Short and long simulation runs. ρ Run replication/deletion median confidence intervals Short Long Short Long Errors CL C Long runs: Both methods good Short runs: MCIs slightly better 22
23 3. M/M/1 Queueing System, Ratios of Estimators The same M/M/1-model as before. Comparison of jackknife intervals and median confidence intervals for the mean delay, Ŵ (r), as ratio of ˆQ/ˆλ of the mean number of jobs in the waiting room and the mean throughput. ρ Run RD, Jackknife Median Confidence Intervals Short Long Short Long Errors CL C Median confidence intervals are more accurate. 23
24 4. Pareto distribution We are interested in parameters of heavy-tailed Pareto distributions, F (x) = 1 x a, 0 < a <= 2, x 1, with expectation a/(a 1) for a > 1, median 2 1/a, the variance does not exist simulation experiments, each with sample size n = The classical confidence interval for the expectation does not exist. Median confidence intervals for the expectation: a CL C good bad 1.1 not acceptable Median confidence intervals for the order statistic for the median: a Confidence Interval Median CI Errors CL C
25 5. Reliability Model The model consists of three components and will function as long as component 1 works and either component 2 or 3 works. G i is the time to failure of component i, i = 1, 2, 3, and G = min{g 1, max{g 2, G 3 }} the time to failure of the whole system. The random variables G i are independent, and each G i has a Weibull distribution F (x) = 1 exp( x), x > 0. The estimator of the expectation of G has a very skewed and nonnormal distribution, all confidence intervals are quite inaccurate, for small sample sizes simulation experiments, each with sample size n = 5 or 40. n Classical CI Median CI Errors CL C 25
26 Potential Further Development of the Technique The assumption of symmetry of the estimator distribution can be omitted, even the estimator may be biased, only F = F θ (θ), the value of the estimator distribution function at θ, the unknown parameter, must be known. Then we speak of min-max confidence intervals (MMCI). They are exact if the w replications are independent, their confidence level is CL = 1 F w (1 F ) w. Crucial problem: This value F θ (θ). We do not know an adequate method for estimating it efficiently. 26
27 But this MMCI idea works, we tried a brute-force procedure: Very long and expensive simulations for an empirical distribution of the r.v. G of example 5, then the distribution function ˆF θ (x) of the estimator with convolution, and with an estimation of the unknown parameter, ˆθ, we obtained ˆF = ˆF θ (ˆθ) and an estimate ĈL. n Coverage ĈL Coverages and Estimated Confidence Levels Accurate, isn t it? But so not practicable 27
Confidence Intervals for Cp
Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process
More informationHow To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
More informationThe Variability of P-Values. Summary
The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 boos@stat.ncsu.edu August 15, 2009 NC State Statistics Departement Tech Report
More informationHow To Find The Optimal Base Stock Level In A Supply Chain
Optimizing Stochastic Supply Chains via Simulation: What is an Appropriate Simulation Run Length? Arreola-Risa A 1, Fortuny-Santos J 2, Vintró-Sánchez C 3 Abstract The most common solution strategy for
More informationNon Parametric Inference
Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable
More informationCONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE
1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationPoint and Interval Estimates
Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number
More information4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
More informationOnline Appendices to the Corporate Propensity to Save
Online Appendices to the Corporate Propensity to Save Appendix A: Monte Carlo Experiments In order to allay skepticism of empirical results that have been produced by unusual estimators on fairly small
More informationFrom the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
More informationAachen Summer Simulation Seminar 2014
Aachen Summer Simulation Seminar 2014 Lecture 07 Input Modelling + Experimentation + Output Analysis Peer-Olaf Siebers pos@cs.nott.ac.uk Motivation 1. Input modelling Improve the understanding about how
More informationGENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS
GENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS Feras Nassaj Johann Christoph Strelen Rheinische Friedrich-Wilhelms-Universitaet Bonn Institut fuer Informatik IV Roemerstr. 164, 53117 Bonn, Germany
More informationJava Modules for Time Series Analysis
Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series
More informationA Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails
12th International Congress on Insurance: Mathematics and Economics July 16-18, 2008 A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails XUEMIAO HAO (Based on a joint
More informationPermutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
More information1. (First passage/hitting times/gambler s ruin problem:) Suppose that X has a discrete state space and let i be a fixed state. Let
Copyright c 2009 by Karl Sigman 1 Stopping Times 1.1 Stopping Times: Definition Given a stochastic process X = {X n : n 0}, a random time τ is a discrete random variable on the same probability space as
More informationConfidence Intervals for Cpk
Chapter 297 Confidence Intervals for Cpk Introduction This routine calculates the sample size needed to obtain a specified width of a Cpk confidence interval at a stated confidence level. Cpk is a process
More informationMATHEMATICAL METHODS OF STATISTICS
MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS
More informationNeed for Sampling. Very large populations Destructive testing Continuous production process
Chapter 4 Sampling and Estimation Need for Sampling Very large populations Destructive testing Continuous production process The objective of sampling is to draw a valid inference about a population. 4-
More informationStatistical Rules of Thumb
Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN
More informationGood luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:
Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours
More information2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
More information5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
More informationReview of basic statistics and the simplest forecasting model: the sample mean
Review of basic statistics and the simplest forecasting model: the sample mean Robert Nau Fuqua School of Business, Duke University August 2014 Most of what you need to remember about basic statistics
More informationExact Confidence Intervals
Math 541: Statistical Theory II Instructor: Songfeng Zheng Exact Confidence Intervals Confidence intervals provide an alternative to using an estimator ˆθ when we wish to estimate an unknown parameter
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationAUTOMATING DISCRETE EVENT SIMULATION OUTPUT ANALYSIS AUTOMATIC ESTIMATION OF NUMBER OF REPLICATIONS, WARM-UP PERIOD AND RUN LENGTH.
Proceedings of the 2009 INFORMS Simulation Society Research Workshop L.H. Lee, M. E. Kuhl, J. W. Fowler and S.Robinson, eds. AUTOMATING DISCRETE EVENT SIMULATION OUTPUT ANALYSIS AUTOMATIC ESTIMATION OF
More informationTowards a Reliable Statistical Oracle and its Applications
Towards a Reliable Statistical Oracle and its Applications Johannes Mayer Abteilung Angewandte Informationsverarbeitung Universität Ulm mayer@mathematik.uni-ulm.de Abstract It is shown how based on the
More information1 Simulating Brownian motion (BM) and geometric Brownian motion (GBM)
Copyright c 2013 by Karl Sigman 1 Simulating Brownian motion (BM) and geometric Brownian motion (GBM) For an introduction to how one can construct BM, see the Appendix at the end of these notes A stochastic
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationBootstrapping Big Data
Bootstrapping Big Data Ariel Kleiner Ameet Talwalkar Purnamrita Sarkar Michael I. Jordan Computer Science Division University of California, Berkeley {akleiner, ameet, psarkar, jordan}@eecs.berkeley.edu
More informationSupplement to Call Centers with Delay Information: Models and Insights
Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290
More informationChapter 6: Point Estimation. Fall 2011. - Probability & Statistics
STAT355 Chapter 6: Point Estimation Fall 2011 Chapter Fall 2011 6: Point1 Estimat / 18 Chap 6 - Point Estimation 1 6.1 Some general Concepts of Point Estimation Point Estimate Unbiasedness Principle of
More informationQuantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
More informationStatistics 104: Section 6!
Page 1 Statistics 104: Section 6! TF: Deirdre (say: Dear-dra) Bloome Email: dbloome@fas.harvard.edu Section Times Thursday 2pm-3pm in SC 109, Thursday 5pm-6pm in SC 705 Office Hours: Thursday 6pm-7pm SC
More informationBiostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to
More informationVariables Control Charts
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables
More informationUncertainty quantification for the family-wise error rate in multivariate copula models
Uncertainty quantification for the family-wise error rate in multivariate copula models Thorsten Dickhaus (joint work with Taras Bodnar, Jakob Gierl and Jens Stange) University of Bremen Institute for
More informationNonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk
More informationConfidence Intervals for Spearman s Rank Correlation
Chapter 808 Confidence Intervals for Spearman s Rank Correlation Introduction This routine calculates the sample size needed to obtain a specified width of Spearman s rank correlation coefficient confidence
More informationAnalysis of a Production/Inventory System with Multiple Retailers
Analysis of a Production/Inventory System with Multiple Retailers Ann M. Noblesse 1, Robert N. Boute 1,2, Marc R. Lambrecht 1, Benny Van Houdt 3 1 Research Center for Operations Management, University
More informationStochastic Inventory Control
Chapter 3 Stochastic Inventory Control 1 In this chapter, we consider in much greater details certain dynamic inventory control problems of the type already encountered in section 1.3. In addition to the
More informationInternet Appendix to False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas
Internet Appendix to False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas A. Estimation Procedure A.1. Determining the Value for from the Data We use the bootstrap procedure
More informationLife Table Analysis using Weighted Survey Data
Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using
More informationMATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...
MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................
More informationThe Assumption(s) of Normality
The Assumption(s) of Normality Copyright 2000, 2011, J. Toby Mordkoff This is very complicated, so I ll provide two versions. At a minimum, you should know the short one. It would be great if you knew
More informationComputational Statistics and Data Analysis
Computational Statistics and Data Analysis 53 (2008) 17 26 Contents lists available at ScienceDirect Computational Statistics and Data Analysis journal homepage: www.elsevier.com/locate/csda Coverage probability
More informationHow To Price Garch
2011 3rd International Conference on Information and Financial Engineering IPEDR vol.12 (2011) (2011) IACSIT Press, Singapore A Study on Heston-Nandi GARCH Option Pricing Model Suk Joon Byun KAIST Business
More informationLOGIT AND PROBIT ANALYSIS
LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y
More informationPanel Data Econometrics
Panel Data Econometrics Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans University of Orléans January 2010 De nition A longitudinal, or panel, data set is
More informationGenerating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010
Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte
More informationGambling and Data Compression
Gambling and Data Compression Gambling. Horse Race Definition The wealth relative S(X) = b(x)o(x) is the factor by which the gambler s wealth grows if horse X wins the race, where b(x) is the fraction
More informationChapter 7 Section 1 Homework Set A
Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the
More informationPopulation Mean (Known Variance)
Confidence Intervals Solutions STAT-UB.0103 Statistics for Business Control and Regression Models Population Mean (Known Variance) 1. A random sample of n measurements was selected from a population with
More informationMultivariate Logistic Regression
1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation
More informationThe Normal distribution
The Normal distribution The normal probability distribution is the most common model for relative frequencies of a quantitative variable. Bell-shaped and described by the function f(y) = 1 2σ π e{ 1 2σ
More informationMultivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
More informationESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA
m ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA Jiahe Qian, Educational Testing Service Rosedale Road, MS 02-T, Princeton, NJ 08541 Key Words" Complex sampling, NAEP data,
More informationEstimation and Confidence Intervals
Estimation and Confidence Intervals Fall 2001 Professor Paul Glasserman B6014: Managerial Statistics 403 Uris Hall Properties of Point Estimates 1 We have already encountered two point estimators: th e
More informationWeek 4: Standard Error and Confidence Intervals
Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.
More informationSimple Random Sampling
Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for
More informationAlessandro Birolini. ineerin. Theory and Practice. Fifth edition. With 140 Figures, 60 Tables, 120 Examples, and 50 Problems.
Alessandro Birolini Re ia i it En ineerin Theory and Practice Fifth edition With 140 Figures, 60 Tables, 120 Examples, and 50 Problems ~ Springer Contents 1 Basic Concepts, Quality and Reliability Assurance
More informationLecture 8. Confidence intervals and the central limit theorem
Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of
More informationRandom access protocols for channel access. Markov chains and their stability. Laurent Massoulié.
Random access protocols for channel access Markov chains and their stability laurent.massoulie@inria.fr Aloha: the first random access protocol for channel access [Abramson, Hawaii 70] Goal: allow machines
More informationAnalysis of a production-inventory system with unreliable production facility
Analysis of a production-inventory system with unreliable production facility Katrien Ramaekers Gerrit K Janssens Transportation Research Institute Hasselt University - Campus Diepenbeek Wetenschapspark
More informationThe VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.
Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium
More informationBias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes
Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA
More informationE3: PROBABILITY AND STATISTICS lecture notes
E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................
More informationChapter 4: Vector Autoregressive Models
Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...
More informationMaximum likelihood estimation of mean reverting processes
Maximum likelihood estimation of mean reverting processes José Carlos García Franco Onward, Inc. jcpollo@onwardinc.com Abstract Mean reverting processes are frequently used models in real options. For
More informationGLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,
Computing: an indispensable tool or an insurmountable hurdle? Iain Currie Heriot Watt University, Scotland ATRC, University College Dublin July 2006 Plan of talk General remarks The professional syllabus
More informationOverview of Monte Carlo Simulation, Probability Review and Introduction to Matlab
Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?
More informationChapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
More informationChapter 7. One-way ANOVA
Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks
More informationDescriptive Statistics
Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9
More informationM/M/1 and M/M/m Queueing Systems
M/M/ and M/M/m Queueing Systems M. Veeraraghavan; March 20, 2004. Preliminaries. Kendall s notation: G/G/n/k queue G: General - can be any distribution. First letter: Arrival process; M: memoryless - exponential
More informationA Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University
A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses Michael R. Powers[ ] Temple University and Tsinghua University Thomas Y. Powers Yale University [June 2009] Abstract We propose a
More informationAn Introduction to Extreme Value Theory
An Introduction to Extreme Value Theory Petra Friederichs Meteorological Institute University of Bonn COPS Summer School, July/August, 2007 Applications of EVT Finance distribution of income has so called
More informationUniversally Accepted Lean Six Sigma Body of Knowledge for Green Belts
Universally Accepted Lean Six Sigma Body of Knowledge for Green Belts The IASSC Certified Green Belt Exam was developed and constructed based on the topics within the body of knowledge listed here. Questions
More informationProbability and statistics; Rehearsal for pattern recognition
Probability and statistics; Rehearsal for pattern recognition Václav Hlaváč Czech Technical University in Prague Faculty of Electrical Engineering, Department of Cybernetics Center for Machine Perception
More informationMonte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)
Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091) Magnus Wiktorsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February
More informationA COMPARISON OF STATISTICAL METHODS FOR COST-EFFECTIVENESS ANALYSES THAT USE DATA FROM CLUSTER RANDOMIZED TRIALS
A COMPARISON OF STATISTICAL METHODS FOR COST-EFFECTIVENESS ANALYS THAT U DATA FROM CLUSTER RANDOMIZED TRIALS M Gomes, E Ng, R Grieve, R Nixon, J Carpenter and S Thompson Health Economists Study Group meeting
More informationThe Binomial Distribution
The Binomial Distribution James H. Steiger November 10, 00 1 Topics for this Module 1. The Binomial Process. The Binomial Random Variable. The Binomial Distribution (a) Computing the Binomial pdf (b) Computing
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationTesting against a Change from Short to Long Memory
Testing against a Change from Short to Long Memory Uwe Hassler and Jan Scheithauer Goethe-University Frankfurt This version: January 2, 2008 Abstract This paper studies some well-known tests for the null
More informationConfidence Intervals for the Difference Between Two Means
Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means
More informationDIRECT REDUCTION OF BIAS OF THE CLASSI- CAL HILL ESTIMATOR
REVSTAT Statistical Journal Volume 3, Number 2, November 2005, 113 136 DIRECT REDUCTION OF BIAS OF THE CLASSI- CAL HILL ESTIMATOR Authors: Frederico Caeiro Universidade Nova de Lisboa, FCTDM) and CEA,
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More informationCPC/CPA Hybrid Bidding in a Second Price Auction
CPC/CPA Hybrid Bidding in a Second Price Auction Benjamin Edelman Hoan Soo Lee Working Paper 09-074 Copyright 2008 by Benjamin Edelman and Hoan Soo Lee Working papers are in draft form. This working paper
More informationStationary random graphs on Z with prescribed iid degrees and finite mean connections
Stationary random graphs on Z with prescribed iid degrees and finite mean connections Maria Deijfen Johan Jonasson February 2006 Abstract Let F be a probability distribution with support on the non-negative
More informationProbability and Random Variables. Generation of random variables (r.v.)
Probability and Random Variables Method for generating random variables with a specified probability distribution function. Gaussian And Markov Processes Characterization of Stationary Random Process Linearly
More informationSIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?
SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties
More informationTesting against a Change from Short to Long Memory
Testing against a Change from Short to Long Memory Uwe Hassler and Jan Scheithauer Goethe-University Frankfurt This version: December 9, 2007 Abstract This paper studies some well-known tests for the null
More informationTests for exponentiality against the M and LM-classes of life distributions
Tests for exponentiality against the M and LM-classes of life distributions B. Klar Universität Karlsruhe Abstract This paper studies tests for exponentiality against the nonparametric classes M and LM
More informationHydrodynamic Limits of Randomized Load Balancing Networks
Hydrodynamic Limits of Randomized Load Balancing Networks Kavita Ramanan and Mohammadreza Aghajani Brown University Stochastic Networks and Stochastic Geometry a conference in honour of François Baccelli
More informationTutorial 5: Hypothesis Testing
Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................
More informationA credibility method for profitable cross-selling of insurance products
Submitted to Annals of Actuarial Science manuscript 2 A credibility method for profitable cross-selling of insurance products Fredrik Thuring Faculty of Actuarial Science and Insurance, Cass Business School,
More information