More on the correct use of omnibus tests for normality

Similar documents
Monitoring Frequency of Change By Li Qin

A MOST PROBABLE POINT-BASED METHOD FOR RELIABILITY ANALYSIS, SENSITIVITY ANALYSIS AND DESIGN OPTIMIZATION

Frequentist vs. Bayesian Statistics

The predictability of security returns with simple technical trading rules

On the predictive content of the PPI on CPI inflation: the case of Mexico

The risk of using the Q heterogeneity estimator for software engineering experiments

Effect Sizes Based on Means

A Multivariate Statistical Analysis of Stock Trends. Abstract

A Simple Model of Pricing, Markups and Market. Power Under Demand Fluctuations

Risk and Return. Sample chapter. e r t u i o p a s d f CHAPTER CONTENTS LEARNING OBJECTIVES. Chapter 7

Stochastic Derivation of an Integral Equation for Probability Generating Functions

Normally Distributed Data. A mean with a normal value Test of Hypothesis Sign Test Paired observations within a single patient group

Two-resource stochastic capacity planning employing a Bayesian methodology

Beyond the F Test: Effect Size Confidence Intervals and Tests of Close Fit in the Analysis of Variance and Contrast Analysis

The Fundamental Incompatibility of Scalable Hamiltonian Monte Carlo and Naive Data Subsampling

Softmax Model as Generalization upon Logistic Discrimination Suffers from Overfitting

Managing specific risk in property portfolios

An important observation in supply chain management, known as the bullwhip effect,

Risk in Revenue Management and Dynamic Pricing

DAY-AHEAD ELECTRICITY PRICE FORECASTING BASED ON TIME SERIES MODELS: A COMPARISON

Concurrent Program Synthesis Based on Supervisory Control

Failure Behavior Analysis for Reliable Distributed Embedded Systems

Web Application Scalability: A Model-Based Approach

The impact of metadata implementation on webpage visibility in search engine results (Part II) q

Financial Time Series Analysis (FTSA) Lecture 1: Introduction

Large-Scale IP Traceback in High-Speed Internet: Practical Techniques and Theoretical Foundation

Multiperiod Portfolio Optimization with General Transaction Costs

CABRS CELLULAR AUTOMATON BASED MRI BRAIN SEGMENTATION

Comparing Dissimilarity Measures for Symbolic Data Analysis

Pressure Drop in Air Piping Systems Series of Technical White Papers from Ohio Medical Corporation

Forensic Science International

Machine Learning with Operational Costs

Variables Control Charts

Safety evaluation of digital post-release environment sensor data interface for distributed fuzing systems

Synopsys RURAL ELECTRICATION PLANNING SOFTWARE (LAPER) Rainer Fronius Marc Gratton Electricité de France Research and Development FRANCE

An inventory control system for spare parts at a refinery: An empirical comparison of different reorder point methods

An Introduction to Risk Parity Hossein Kazemi

Analysis of Effectiveness of Web based E- Learning Through Information Technology

NEWSVENDOR PROBLEM WITH PRICING: PROPERTIES, ALGORITHMS, AND SIMULATION

Probabilistic models for mechanical properties of prestressing strands

c 2009 Je rey A. Miron 3. Examples: Linear Demand Curves and Monopoly

High Quality Offset Printing An Evolutionary Approach

Implementation of Statistic Process Control in a Painting Sector of a Automotive Manufacturer

CFRI 3,4. Zhengwei Wang PBC School of Finance, Tsinghua University, Beijing, China and SEBA, Beijing Normal University, Beijing, China

Drinking water systems are vulnerable to

The Lognormal Distribution Engr 323 Geppert page 1of 6 The Lognormal Distribution

ENFORCING SAFETY PROPERTIES IN WEB APPLICATIONS USING PETRI NETS

From Simulation to Experiment: A Case Study on Multiprocessor Task Scheduling

Characterizing and Modeling Network Traffic Variability

Stability Improvements of Robot Control by Periodic Variation of the Gain Parameters

Business Development Services and Small Business Growth in Bangladesh

STATISTICAL CHARACTERIZATION OF THE RAILROAD SATELLITE CHANNEL AT KU-BAND

INFERRING APP DEMAND FROM PUBLICLY AVAILABLE DATA 1

Chapter 2 - Porosity PIA NMR BET

Piracy and Network Externality An Analysis for the Monopolized Software Industry

IMPROVING NAIVE BAYESIAN SPAM FILTERING

Compensating Fund Managers for Risk-Adjusted Performance

Moving Objects Tracking in Video by Graph Cuts and Parameter Motion Model

6.042/18.062J Mathematics for Computer Science December 12, 2006 Tom Leighton and Ronitt Rubinfeld. Random Walks

Re-Dispatch Approach for Congestion Relief in Deregulated Power Systems

A Note on Integer Factorization Using Lattices

Computational Finance The Martingale Measure and Pricing of Derivatives

COST CALCULATION IN COMPLEX TRANSPORT SYSTEMS

Alpha Channel Estimation in High Resolution Images and Image Sequences

Fluent Software Training TRN Solver Settings. Fluent Inc. 2/23/01

THE RELATIONSHIP BETWEEN EMPLOYEE PERFORMANCE AND THEIR EFFICIENCY EVALUATION SYSTEM IN THE YOTH AND SPORT OFFICES IN NORTH WEST OF IRAN

A highly-underactuated robotic hand with force and joint angle sensors

POISSON PROCESSES. Chapter Introduction Arrival processes

PRIME NUMBERS AND THE RIEMANN HYPOTHESIS

THE HEBREW UNIVERSITY OF JERUSALEM

The Economics of the Cloud: Price Competition and Congestion

Interaction Expressions A Powerful Formalism for Describing Inter-Workflow Dependencies

FREQUENCIES OF SUCCESSIVE PAIRS OF PRIME RESIDUES

Guideline relating the. Solactive Global Oil Equities Net Total Return Index (Solactive Global Oil Equities)

Load Balancing Mechanism in Agent-based Grid

Project Finance as a Risk- Management Tool in International Syndicated Lending

Applications of Regret Theory to Asset Pricing

X How to Schedule a Cascade in an Arbitrary Graph

2D Modeling of the consolidation of soft soils. Introduction

Large firms and heterogeneity: the structure of trade and industry under oligopoly

Dynamics of Open Source Movements

Rummage Web Server Tuning Evaluation through Benchmark

Sage HRMS I Planning Guide. The HR Software Buyer s Guide and Checklist

The Online Freeze-tag Problem

Pinhole Optics. OBJECTIVES To study the formation of an image without use of a lens.

Static and Dynamic Properties of Small-world Connection Topologies Based on Transit-stub Networks

F inding the optimal, or value-maximizing, capital

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

OVERVIEW OF THE CAAMPL EARLY WARNING SYSTEM IN ROMANIAN BANKING

Methods for Estimating Kidney Disease Stage Transition Probabilities Using Electronic Medical Records

Index Numbers OPTIONAL - II Mathematics for Commerce, Economics and Business INDEX NUMBERS

BUBBLES AND CRASHES. By Dilip Abreu and Markus K. Brunnermeier 1

NAVAL POSTGRADUATE SCHOOL THESIS

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Red vs. Blue - Aneue of TCP congestion Control Model

Service Network Design with Asset Management: Formulations and Comparative Analyzes

On Software Piracy when Piracy is Costly

IEEM 101: Inventory control

CANADIAN WATER SECURITY ASSESSMENT FRAMEWORK: TOOLS FOR ASSESSING WATER SECURITY AND IMPROVING WATERSHED GOVERNANCE

Joint Production and Financing Decisions: Modeling and Analysis

Transcription:

Economics Letters 90 (2006) 304 309 www.elsevier.com/locate/econbase More on the correct use of omnibus tests for normality Geoffrey Poitras Simon Fraser University, Burnaby, B.C., Canada VSA ls6 Received 29 January 2005; received in revised form 1 July 2005; acceted 16 August 2005 Available online 24 January 2006 Abstract This Monte Carlo study comares the small samle roerties of some commonly used omnibus and directional tests, based on the standardized third and fourth moments, for assessing the normality of random variables: the omnibus D Agostino K 2 test and the directional comonents, and three versions of the Jarque Bera test. D 2005 Elsevier B.V. All rights reserved. Keywords: Normality test; Omnibus test JEL classification: C10; C20 1. Introduction Few subjects in alied statistics have received as much attention as the roblem of testing whether a samle is from a normal oulation. For more than a century, the subject has attracted the attention of leading figures in statistics. The absence of exact solutions for the samling distributions generated a large number of simulation studies exloring the ower of these statistics as both directional and omnibus tests, e.g., D Agostino and Pearson (1973), and Bowman and Shelton (1975). Desite convincing evidence from these studies that convergence of the samling distributions to asymtotic results was very slow, esecially for b 2, many studies of normality testing in econometrics have emhasized the asymtotic Jarque Bera test (Jarque and Bera, 1980, 1987). Some confusion over the aroriate method of secifying the samling distribution ersists, e.g., Urzúa (1996), and the E-mail address: oitras@sfu.ca. URL: www.sfu.ca/~oitras. 0165-1765/$ - see front matter D 2005 Elsevier B.V. All rights reserved. doi:10.1016/j.econlet.2005.08.016

G. Poitras / Economics Letters 90 (2006) 304 309 305 aroriate testing rocedure to use in secific situations remains unclear, e.g., Atwood et al. (2003). In articular, when emloying and b2 tests, the finite samle imlications of using a secific omnibus or directional test are generally unrecognized. 2. Moment and b2 tests The aroriate construction of and b2 tests for normality has gradually been recognized in econometrics, e.g., Dufour et al. (1998), and Önder and Zaman (2005). Under general circumstances, the ower of tests in this category comare favorably with emirical distribution function tests and correlation tests based on the order statistics (Thode, 2002). In ractice, the null hyothesis of normality is usually secified in comosite form where l and r are unknown. Secification of the alternative hyothesis can involve: a secific distribution, e.g., the uniform; a class of distributions with identifiable shae roerties, e.g., fat tailed symmetric; or a general non-normal alternative. While an omnibus test combining the information in and b2 is indicated for a general non-normal alternative, increases in ower can be obtained where the class of alternatives can be narrowed. Where a secific alternative distribution is indicated, the use of an aroriately secified likelihood ratio test makes it is ossible that a uniformly most owerful, location and scale invariant test may be available. However, due to analytical comlexity of the solutions, the number of cases where such directional tests are available for ractical alication is limited. In cases where some information about the alternative distribution is available, a directional test using only or b2 is indicated. Considerable attention has been given to the construction of omnibus tests that combine information from and b2, e.g., Thode (2002,.54 58), D Agostino and Stehens (1986, ch. 7 and 9), and Urzúa (1996). A number of aroaches have been suggested. The simlest construction is found with the Jarque Bera (JB) test where the asymtotic normal values for skewness and kurtosis are used to construct a v 2 (2) test involving the first two moments of the asymtotic distributions: " 2 JB ¼ T 6 ð þ b 2 3Þ 2 24 # fv 2 ðþ 2 where T is the samle size, ¼ m3 = ðm 2 Þ 3=2 ; b 2 ¼ m 4 = ðm 2 Þ 2 and the central moments are defined as m j = P (x j m 1 ) j /T and m 1 is the samle mean. Urzúa (1996) imroves on this formulation by using results for the mean and variance of the samling distribution to formulate a chi-squared test with better small samle roerties. Even with this imrovement, neither formulation recognizes that the samling distributions may not be close to normal in finite samles. Due to the slow convergence to the asymtotic results, this form of the omnibus test will have roblems even in moderately sized samles, e.g., T =100. The samling distribution roblems associated with the JB test are confirmed by various simulation studies that have found the JB test to be incorrectly sized in small and moderate samles, e.g., Poitras (1992), and Dufour et al. (1998). The size distortions are so significant that various studies recommend against the use of the JB test, in favor of other omnibus moment tests such as D Agostino s K 2 test. The Monte Carlo results given below confirm this result and demonstrate that, while having slightly imroved size comared to JB, the Urzúa form of the test still is incorrectly sized. An obvious solution to this roblem is to use Monte Carlo methods (e.g., Dufour and Khalaf, 2001) to correctly size the JB test

306 G. Poitras / Economics Letters 90 (2006) 304 309 by simulating the correct critical values. Being formed from the location and scale invariant statistics, and b2, makes the JB test an excellent candidate for such resizing. This raises the question about the ower of the correctly sized JB test relative to alternative omnibus and b2 tests. Presumably, the samling distribution roblems that led to the incorrect sizing of the JB test will ersist when the test is correctly sized. This intuition is not suorted in the Monte Carlo simulations reorted below where the size-corrected JB test is found to have marginally higher ower than the K 2 test against a range of (true) alternative distributions with fat-tails. The asymtotic omnibus aroach emloyed in econometrics is in stark contrast to the aroach develoed in statistics where the distributional roerties of the directional tests were formulated well before efforts were exended on an omnibus test. Early theoretical work determined the moments of the distribution of oulation kurtosis (and skewness) in normal samles as well as the associated roerties of the samling distribution. Series solutions were used to fit moment aroximations to the actual distributions. In order to accurately aroximate the finite samle distributions for and b2, information from the third and fourth moment of the samling distributions for these statistics was incororated. From this oint, one line of research led to the tabulation of critical values of the distributions for (Pearson and Hartley, 1966) and b2 (D Agostino and Pearson, 1973). Knowledge of critical values ermitted roerly sized directional tests to be imlemented. Subsequent results determined the aroriate transformation of the samling distributions to achieve normality of and b 2 in finite samles. In articular, while the distributions for and b2 are asymtotically normal, large samle sizes are needed before the distributions are well behaved. D Agostino (1970) determined that Johnson s unbounded SU curve was aroriate for transforming. Similar work on kurtosis led eventually to an effective and simle transform of b 2 to normality based on the Wilson Hilferty transformation (Anscombe and Glynn, 1983). The omnibus D Agostino s K 2 test combines the aroriately transformed directional comonents and b2, e.g., Thode (2002, ch. 3). 3. Directional versus omnibus testing: Monte Carlo results While it would seem desirable to test against as many ossible alternative distributions as ossible, it is well known that this can result in a significant loss of ower. For examle, at a given significance level (a) omnibus and b2 tests would require higher values of to reject a skewed alternative than would be required for a directional test using just at the same a. This issue is comlicated for omnibus and b2 tests because, though and b2 are uncorrelated, the samling distributions are not indeendent. Even for a (true) symmetric non-normal alternative, will eventually reject the null, just as b 2 will eventually reject a (true) skewed distribution. As such, the Monte Carlo results given in Table 1 rovide evidence on the loss of ower associated with choosing an omnibus test over a directional test at a secific finite samle size, as well as the seed of convergence to the asymtotic results. In addition, because the samling distributions are not well-behaved in finite samles, it is an oen question as to whether a size-corrected JB test will have the same finite samle ower as the D Agostino K 2 statistic due to the reliance on the asymtotic normality of and b2 to formulate the v 2 (2) test. This issue is also addressed in Table 1. The Monte Carlo simulations in Table 1 reort the ower of seven normality test statistics for nine different distributional secifications of the oulation residuals. The tests are: the D Agostino s K 2 test and the directional comonents and b2 ; three forms of the Jarque Bera test, the asymtotic version

G. Poitras / Economics Letters 90 (2006) 304 309 307 Table 1 Tabulated critical values for, b2 taken from Bowman and Shelton (ch. 7 in D Agostino and Stehens, 1986) b 2 K 2 SR JB JB* JBSC Estimated normality test owers (a =10%) for oulation residuals when the null hyothesis of normality is true: T =25, 50 T = 25.099.098.098.101.039.074.100 T = 50.098.092.096.098.048.068.100 Estimated normality test owers (a = 10%) for oulation residuals when the alternative hyothesis is true: T = 25, 50 True H 1 : stable (av=1.9, d =0) T = 25.228.209.228.184.167.216.227 T = 50.646.651.657.640.638.665.671 True H 1 : stable (av=1.9, d =1) T = 25.255.218.251.164.182.237.253 T = 50.677.656.672.624.653.672.687 True H 1 : stable (av=1.6, d =0) T = 25.526.544.562.491.499.577.573 T = 50.837.888.893.862.882.898.901 True H 1 : stable (av=1.6, d =1) T = 25.649.522.603.327.542.594.637 T = 50.952.876.934.776.514.717.939 True H 1 : stable (av=1.0, d =0) T = 25.848.926.930.839.916.945.941 T = 50.959.999.999.990.999.999.999 True H 1 : stable (av=1.0, d =l) T = 25.984.895.968.477.954.964.978 T = 50 1.000.998 1.000.863 1.000 1.000 1.000 True H 1 : uniform T = 25.018.600.407.701.000.001.047 T = 50.502.969.938.991.513.549.749 True H 1 : lognormal T = 25.971.736.914.240.870.910.962 T = 50 1.000.962 1.000.710.999 1.000 1.000 Critical values for the studentized range (SR) rovided by Pearson and Stehens (1964). Size-corrected JB test critical values are (2.645 for T =25, 3.155 for T =50). The number of iterations at each true alternative (null) samle size are 7000 (21,000) for T =25, 3500 (10,500) for T =50. (JB), the Urzúa (1996) version that adjusts for the samling distribution of the mean and variance (JB*), and a version of the test that uses size-adjusted critical values (JBSC). Results are also reorted for the studentized range, the most owerful scale and location invariant test for normality against a uniformly distributed alternative. The nine distributions examined are the normal, uniform and lognormal distributions, together with six versions of the stable distribution. The eight alternative distributions selected all feature rominently in emirical studies. The uniform distribution arises in testing situations involving, for examle, unit roots (e.g., Goldman and Tsurumi, 2003) and robability integral transforms (e.g., Clements, 2004). In addition to being widely used to model security rices, the lognormal has been used for the distribution of other variables such as firm size (e.g., Cabral and Mata, 2003). Stable distributions have a long history in both theoretical and alied distribution theory involving variables as diverse as stock rices and motion icture rofits (e.g., De Vany and Walls, 2004). The stable distributions are characterized by four arameters; in addition to location and scale arameters, there is a

308 G. Poitras / Economics Letters 90 (2006) 304 309 disersion arameter, the characteristic exonent (av: 0V av V 2), and a skewness arameter (d: 1Vd V1). The class of stable distributions include the normal (av=2) and Cauchy (av=1) as secial cases. The stable distributions are well suited to simulation studies of normality tests due to the ability to recisely control variation in disersion and skewness. Each cell in the table reorts the rejection frequency. When the null is true, small deviations from the critical value of a =0.100 can be due to Monte Carlo samling variation of the statistics. Larger deviations are due to the incorrect sizing associated with a significantly biased statistic. The Monte Carlo results confirm an incorrect sizing of the JB and JB* tests severe enough to warrant a recommendation that these tests should not be used without aroriately sized critical values. While the correction rovided by Urzúa (1996) does artially alleviate the size distortion of the asymtotic test, the imrovement is insufficient to warrant use of the test in ractice. In contrast, the size-corrected version of the Jarque Bera test (JBSC) has decided strengths. The JBSC has marginally higher ower than the D Agostino K 2 against all the alternative distributions with oulation kurtosis larger than for the normal distribution. Only a directional test has higher ower against (true) skewed alternatives. This romising erformance for JBSC is undermined by the strikingly oor erformance when the thintailed uniform distribution is the true alternative. The loss in ower comared to D Agostino s K 2 is still substantial even for samles of T = 50. If the alternative hyothesis is a general non-normal alternative, where there is no rior knowledge as to whether the distribution is thin- or fat tailed, then the results for the uniform argue against the use of JBSC as an omnibus test, in favor of D Agostino K 2. Comaring JBSC with K 2, the loss in ower for the fat tailed alternatives is slight comared to overwhelming gains when the thin-tailed alternative is true. A directional test based on b 2 does comare well against all tests excet the directional SR when the uniform alternative is true, but there is a loss in ower comared to D Agostino s K 2 for the other alternative distributions. On balance, for normality testing where there is some rior information that the alternative distribution may be thin-tailed, then D Agostino s K 2 is the referred omnibus test. Acknowledgements The author would like to thank the anonymous referee, Peter Kennedy, Michael Stehens, John Heaney and members of the Mathematical Statistics worksho at SFU for useful comments. References Anscombe, F., Glynn, W., 1983. Distribution of the kurtosis statistic b 2 for normal statistics. Biometrika 70, 227 234. Atwood, J., Shaik, S., Watts, M., 2003. Are cro yields normally distributed? A reexamination. American Journal of Agricultural Economics 85, 888 901. Bowman, K., Shelton, L., 1975. Omnibus test contours for deartures from normality based on and b2. Biometrika 62, 243 250. Cabral, L., Mata, J., 2003. On the evolution of the firm size distribution: facts and theory. American Economic Review 93, 1075 1091. Clements, M., 2004. Evaluating the bank of England density forecasts of inflation. Economic Journal 114, 844 877. D Agostino, R., 1970. Transformation to normality of the null distribution of g 1. Biometrika 58, 679 681. D Agostino, R., Pearson, E., 1973. Tests for deartures from normality. Emirical results for the distribution of and b2. Biometrika 60, 613 622.

G. Poitras / Economics Letters 90 (2006) 304 309 309 D Agostino, R., Stehens, M., 1986. Goodness-of-fit Techniques. Marcel Dekker, New York. De Vany, A., Walls, D., 2004. Motion icture rofit, the stable Paretian hyothesis, and the curse of the suerstar. Journal of Economic Dynamics and Control 28, 1035 1058. Dufour, J., Khalaf, L., 2001. Monte Carlo test methods in econometrics. In: Baltagi, Badi (Ed.), Comanion to Theoretical Econometrics. Blackwell, Oxford, UK. Chater 23. Dufour, J., Farhat, A., Gardiol, L., Khalaf, L., 1998. Simulation-based finite samle normality tests in linear regressions. Econometrics Journal 1, C154 C173. Goldman, E., Tsurumi, H., 2003. Asymtotic distribution of a unit root rocess under double truncation. Communications in Statistics. Theory and Methods 32, 2059 2072. Jarque, C., Bera, A., 1980. Efficient tests for normality, homoskedasticity and serial indeendence of regression residuals. Economics Letters 6, 255 259. Jarque, C., Bera, A., 1987. A test for normality of observations and regression residuals. International Statistical Review 55, 163 172. Önder, A., Zaman, A., 2005. Robust tests for normality of errors in regression models. Economics Letters 86, 63 68. Pearson, E., Hartley, H., 1966. Tables for statisticians, 3rd ed. Biometrika, vol. I. Cambridge University Press, Cambridge, UK. Pearson, E., Stehens, M., 1964. The ratio of range to standard deviation in the same normal samle. Biometrika 51, 484 487. Poitras, G., 1992. Testing regression disturbances for normality with stable alternatives: further Monte Carlo evidence. Journal of Statistical Comutation and Simulation 41, 109 123. Thode, H., 2002. Testing for Normality. Marcel Dekker, New York. Urzúa, C., 1996. On the correct use of omnibus tests for normality. Economics Letters 53, 247 251.