A Comparison of Hypothesis Testing Methods for the Mean of a Log-Normal Distribution

Similar documents

Chapter 14 Nonparametric Statistics

The Binomial Multi- Section Transformer

Hypothesis testing. Null and alternative hypotheses

Maximum Likelihood Estimators.

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

One-sample test of proportions

ECONOMICS. Calculating loan interest no

Properties of MLE: consistency, asymptotic normality. Fisher information.

Output Analysis (2, Chapters 10 &11 Law)

Ant Colony Algorithm Based Scheduling for Handling Software Project Delay

Confidence Intervals for One Mean

I. Chi-squared Distributions

Distributed Storage Allocations for Optimal Delay

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

1. C. The formula for the confidence interval for a population mean is: x t, which was

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Impacts of the Collocation Window on the Accuracy of Altimeter/Buoy Wind Speed Comparison A Simulation Study. Ge Chen 1,2

CDAS: A Crowdsourcing Data Analytics System

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Empirical Study on the Second-stage Sample Size

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

GSR: A Global Stripe-based Redistribution Approach to Accelerate RAID-5 Scaling

A probabilistic proof of a binomial identity

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

CHAPTER 4: NET PRESENT VALUE

arxiv: v2 [math.pr] 13 Oct 2009

PSYCHOLOGICAL STATISTICS

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

Practice Problems for Test 3

5: Introduction to Estimation

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Determining the sample size

1 Computing the Standard Deviation of Sample Means

Lesson 17 Pearson s Correlation Coefficient

Chapter 7 Methods of Finding Estimators

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Normal Distribution.

Modified Line Search Method for Global Optimization

An Efficient Polynomial Approximation of the Normal Distribution Function & Its Inverse Function

Sampling Distribution And Central Limit Theorem

1 Correlation and Regression Analysis

Soving Recurrence Relations

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

LECTURE 13: Cross-validation

Investigation of Atwood s machines as Series and Parallel networks

THE TWO-VARIABLE LINEAR REGRESSION MODEL

Lesson 15 ANOVA (analysis of variance)

Statistical inference: example 1. Inferential Statistics

Infinite Sequences and Series

Measures of Spread and Boxplots Discrete Math, Section 9.4

Now here is the important step

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS

Math C067 Sampling Distributions

How To Calculate Stretch Factor Of Outig I Wireless Network

Confidence Intervals

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

Research Article Analyzing Big Data with the Hybrid Interval Regression Methods

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

Section 11.3: The Integral Test

Descriptive Statistics

Chapter 7: Confidence Interval and Sample Size

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

BASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1)

Department of Computer Science, University of Otago

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

The Computational Rise and Fall of Fairness

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

SOLAR POWER PROFILE PREDICTION FOR LOW EARTH ORBIT SATELLITES

CHAPTER 3 THE TIME VALUE OF MONEY

Digital Interactive Kanban Advertisement System Using Face Recognition Methodology

A zero one programming model for RNA structures with arc length 4

New exact solutions for the combined sinh-cosh-gordon equation

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

Overview of some probability distributions.

Theorems About Power Series

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.

OMG! Excessive Texting Tied to Risky Teen Behaviors

GIS and analytic hierarchy process for land evaluation

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

ARTICLE IN PRESS. Statistics & Probability Letters ( ) A Kolmogorov-type test for monotonicity of regression. Cecile Durot

TO: Users of the ACTEX Review Seminar on DVD for SOA Exam MLC

Convexity, Inequalities, and Norms

Quadrat Sampling in Population Ecology

Plug-in martingales for testing exchangeability on-line

A modified Kolmogorov-Smirnov test for normality

Exploratory Data Analysis

Using a Packet Sniffer to Analyze the Efficiency and Power of Encryption Techniques Used to Protect Data Over a Computer Network

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

Learning objectives. Duc K. Nguyen - Corporate Finance 21/10/2014

Throughput and Delay Analysis of Hybrid Wireless Networks with Multi-Hop Uplinks

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling

Transcription:

World Applied Scieces Joural (6): 845-849 ISS 88-495 IDOSI Publicatios A Copariso of Hypothesis Testig ethods for the ea of a og-oral Distributio 3 F. egahdari K. Abdollahezhad ad A.A. Jafari Islaic Azad iversity eyriz Brach Ira Departet of Statistics Golesta iversity Gorga Ira 3 Departet of Statistics Yazd iversity Yazd Ira Abstract: This paper deals with testig the ea of a log-oral populatio. We apply a ewly developed Coputatioal Approach Test (CAT) which is essetially a paraetric bootstrap ethod. A advatage of the CAT is that it does ot require the explicit kowledge of the saplig distributio of the test statistic. The CAT is the copared with three accepted tests- Cox ethod odified Cox ethod ad geeralized p-value ethod with ote Carlo siulatios. Our detailed studies idicate soe iterestig results icludig the facts that the size ad power of CAT is better tha other ethods. sig real data we have illustrated our ethod. Key words:coputatioal Approach test og-oral distributio Geeralized p-value Testig hypothesis ITRODCTIO For exaple icoe data ca ofte be cosidered to be log-oral. Oe way of aalyzig such data is to logthe Accessibility coputatioal resources has trasfor the origial variable X ad to base the iferece cotributed to the fudaetal researches to be o the trasfored variable Y I(X). This eas the carried out i ay areas of atheatical scieces. distributio fro which our data eerges ca be Coplex theoretical results ca ow be better approxiated with a log-oral distributio. I this paper realized through uerical coputatios ad/or we have discussed the hypothesis tests of the arithetic ote Carlo siulatios well before they ca be ea value of X i a log-oral distributio. It is true that verified aalytically. Recetly [] developed a siple the edia is ofte used to describe the average of coputatioal techique called Coputatioal skewed distributios like icoe distributios. However Approach Test (CAT) for hypothesis testig there are situatios whe the arithetic ea is a probles. The CAT looks siilar to paraetric paraeter of iterest. bootstrap but has soe great differeces-especially et X deote the rado variable that follows a logthe way it exploits the uisace paraeter(s). Aog oral distributio with probability desity fuctio the oteworthy aspects of this ethod it is observed that: (i) the CAT is a paraetric ethod which does ot require fx ( x) exp{ (l x ) } the kowledge of the exact saplig distributios of the x paraeter estiators; ad as a result (ii) the CAT ca be used eve for a very coplicated paraetric odel which E( X ) exp( + ). We let Y deote the logtrasfored ofte relies heavily o the asyptotic results oly. [] applied the CAT for Behres-Fisher proble ad orally distributed variable Y I(X) that capared it with four test ethods ad [3] showed that has ea value µ ad variace. for the oe-way AOVA the CAT ca provide alost as This article is orgaized as follows. I Sectio we uch power as the classical F-test uder the propose a CAT for testig hypothesis for log-oral hooscedastic oral odel. ea. I Sectio 3 we illustrate our approach usig a real I applied statistics classes we soeties coe exaple. The size of our ethod is copared with other across data that eed to be trasfored prior to aalysis. existig ethods i Sectio 4. Correspodig Author: A.A. Jafari Departet of Statistics Yazd iversity Yazd Ira E-ail: aajafari4@yahoo.co. 845

World Appl. Sci. J. (6): 845-849 ethods for Testig ea: Assue that X i i... is a idepedet rado saple fro log-oral populatio i.e. X i ~ log-oral (µ ) where µ ad are ukow.based o the above idepedet saples the our iterested test is exp( + ) equivalet to the followig test: where. ote that the above test is A H : vs. H : (< or >) Cox ethod ad odified Cox ethod: Deote the saple ea ad saple variace of Y with Y ad S u respectively where i u i i i Y Y S ( Y Y). A estiatio for is ad a 4 u Su H : vs. H : (< or >) A where + l( ). ow set Y i I(X i) i... The Y i ~ (µ ). et be a paraeter of () iterest ad let the uisace paraeter There are several ethods for testig. The ethods iclude a ave ethod based o trasforatio; a ethod proposed by Cox; a odified versio of the Cox ethod; a ethod otivated by large-saple theory; ad a ethod based o geeralized p-value. Accordig to siulatio results the Cox ethod odified Cox ethod ad geeralized p-value are better tha other ethods. Therefore we cosider these three ethods ad copare the with our proposed CAT. estiatio for the variace of Y + S u S +. Cox has suggested that a test statistic for ( ) hypothesis test ca be derive as[4 5] 4 Z. S S + ( ) is give by A approxiate distributio for this statistics is stadard oral distributio. Therefore we ca reject H if Z > z a/ where z a/ is the upper -level cut-off poit stadard oral distributio. Also we ca use t-studet with - degrees of freedo. Therefore we reject H if Z > t a/- where t a/- is the upper -level cut-off poit t--distributio. Geeralized P-value: Geeralized p-value ca be used for iferece about paraeters whe there is uisace paraeter. [6] suggested the followig procedure for coputig a geeralized p-value for the log-oral ea; For a give dataset x i...x set yi I (x i) i... ad calculate y ad s u fro the data. For j to Geerate Z ~ () ad ~ ( ). Set u s ( ) u s Tj y Z +. et I j if > otherwise I j Set p j T j I j the () ( ) {p -p } is a ote Carlo i estiate of the geeralized p-value for testig. The Cat for ea: et X X...X is a rado saple fro desity f(x ) where the fuctioal for of f is assued to be kow. The paraeter is partitioed as () () ( ) where if available is the uisace paraeter ad is the paraeter of iterest. The ethodology of the CAT for testig H : at a desired level a is give through the followig steps (for ore detail see []. Step : Obtai Step : the E of Assue that H is true i.e. set H :. The () fid the E of fro the data agai. Call this as () the `restricted E of ' uder H deoted by () R. 846

World Appl. Sci. J. (6): 845-849 Geerate artificial saple X X...X i.i.d. fro desity f( x ) a large uber of ties (say l () R ties). For each of these replicated saples () recalculated the E of ( ) (pretedig that were ukow). Retai oly the copoet that is relevat for et these recalculated E values of be. et... ()... ( ) Step 3: For testig H agaist : H A : < (if such a alterative is eaigful) at level a defie ( ). Reject H if <. Alteratively calculate the p-value as: (uber of ( )'s < ) l p I. < ) : ( l ( l ) For testig H agaist : > at (( ) ) H A level a defie. Reject H if >. Alteratively calculate the p-value as: (uber of ( )'s > ) l p I. > ) I.. (( /) ) (uber of ( l) 's < ) p ad be the ordered values of ( l ( l ) () For testig H : agaist H : defie the cut-off poits as ad Reject H if (( /) ) < <. Alteratively the p-value is coputed as: p i(p p ) where (uber of ( l) 's > ) p. The followig steps give the ipleetatio of our proposed CAT for hypothesis testig of log-oral proble. Step : Get the Es of the paraeters as where () Y + S b S b Step : ( l ) i b i i i Y Y S ( Y Y). Assue that is true i.e. The H Yi ~ ( () () () ) where is ukow. The Es of the paraeter () () which are called the restricted E' is () R + + yi i ( ). Geerate artificial saple Y...Y i.i.d. fro ( () () R R ) a large uber of ties (say ties). For each of these replicated saples recalculated the E of. et these recalculated E values of. be... ( l Yl + S ). bl et ()... be the ordered values of ( ) l. Step 3: The sae as Step 3 above with the exceptio that 's are used istead of ad is used i place of. ( l ) The size ad power coputatio for log-oral is doe through the followig stages. () For fixed ad geerate iid observatios of ( ) size fro () () where () +. Get () ad. Set ( value) ad get the restricted E () R H () of as. 847

World Appl. Sci. J. (6): 845-849 Table : Carbo ooxide levels at a oil refiery i Califoria. Date 9//99 /4/99 /3/99 /3/99 //99 CO level.5 4 5 Date 8/6/99 9//99 9//99 3/3/993 CO level 7 5 5 Table : The actual size of tests whe the oial level is ---------------------------------------------------- µ Test 5 7 5 Cox.73.3.3.9 odified Cox.6.95.88.76 geeralized 56.5.5.48 CAT.5.33.4.4.5 Cox.69.9.7.89 odified Cox..9.83.74 geeralized.58.54.54.54 CAT.5.33.4.47 Cox 6.3 86 odified Cox.4.87.79.7 geeralized.58.53.5.47 CAT.5.3.38.4 Cox 3 8.94.8 odified Cox.74.7.67.66 geeralized.57.5.5.5 CAT.9.9.39.4 µ Test 5 3 5 Cox 79 77.67.6 odified Cox.67.66.59.54 geeralized.5.5.49.45 CAT.45.49.46.44.5 Cox 79.78.69.6 odified Cox.69.67.6.55 geeralized.5.53.53.49 CAT.45.5.5.47 Cox.76.7 65.59 odified Cox.63.6.58.53 geeralized.5.53.48.48 CAT.47.49.46.45 Cox 66.65 65.53 odified Cox.55.55.56.49 geeralized.54.5.5.5 CAT.48.45.47.5 ow geerate Y (Y...Y ) i.i.d. fro ( () () R R ) I... Retai oly the E values of be.... Order these E values of as.... Get ( ) () ( ) (( ) ) get I I( ). ad (these are the lower ad upper a % cut-off poits). ow brig the fro the above Step- ad Repeat the above Step- through Step-5 a large uber of ties (say ties) ad get the I values as I I...I. Fially the power is approxiated by Table 3: the power of tests whe the oial level is with. --------------------------------------------------- µ Test 5 7 5 Cox.4.98 74.5 odified Cox.93.7.57.43 geeralized.76.7.74.77 CAT.49.55.63.69.5 Cox..9.69.5 odified Cox.75.6.53.4 geeralized.79.77.79.85 CAT.5.58.64.75 Cox..84 6 44 odified Cox.7.6.46.35 geeralized.89.89.94.3 CAT.57.65.78.9 Cox 94.67.46.43 odified Cox.5.4.8.5 geeralized.4.3.37.6 CAT.75.95..43 µ Test 5 3 5 Cox.45.4.4.48 odified Cox.37.34.3.4 geeralized.87.96.98.3 CAT.8.89.9.8.5 Cox.39.38.4.59 odified Cox.3.3.33.5 geeralized.97.4.7.4 CAT.89.96..37 Cox.38.39.47.76 odified Cox.9.3.35.66 geeralized..3.35.75 CAT..4.6.68 Cox.58.8.96 odified Cox.38.6.79.76 geeralized.9.8.33.337 CAT.68..6.3 CAT I i i Real Exaple: The data i Table are ie easureets of carbo ooxide levels i the air. The easureets were ade close to a Califoria oil refiery i 99-993. We will use these data to tests the ea carbo ooxide level. Iitial ivestigatios of these data ad of other siilar datasets idicate that a log-oral odel ay be appropriate. The data are posted at lib.stat.cu.edu/das/. The p-values for Cox ethod odified Cox ethod geeralized p-value ethod ad CAT for testig H : 3 vs H : 3 are.76.379.685 ad.878 respectively. Therefore The four ethods do ot reject H. Siulatio Study: A siulatio study is perfored to copare size ad power test of four ethods; i) the Cox ethod ii) the odified Cox ethod iii) the geeralized p-value ethod ad iv) the CAT. 848

World Appl. Sci. J. (6): 845-849 Table 4: The power of tests whe the oial level is with ----------------------------------------------------- µ Test 5 7 5 Cox.8.75.5.34 odified Cox.76.55.39.4 geeralized.4.7.6.33 CAT.73.86...5 Cox.8.69.45.3 odified Cox.68.48.34. geeralized.6.5.34.54 CAT.84.99.6.4 Cox.97.6.39.3 odified Cox.58.4.7.8 geeralized.33.45.58.89 CAT.98.9.39.7 Cox.7.49.47.85 odified Cox.36.8..44 geeralized.88.5.58.33 CAT.39.76.4.34 µ Test 5 3 5 Cox.35.45.53. odified Cox.6.3.39.7 geeralized.54.79.95.7 CAT.45.68.85.63.5 Cox.39.56.7.63 odified Cox.5.39.5.45 geeralized.83.7.33.37 CAT.7.3.3.37 Cox.48.77..3 odified Cox.9.53.78.8 geeralized.6.66.9.43 CAT.9.5.76.4 Cox.39.6.83.5 odified Cox.94.66.4.494 geeralized.386.456.58.69 CAT.358.433.488.679 For this propose we geerated saples with sizes 5 7 5 5 3 5 fro a log-oral distributio with paraeters µ.5 ad ( - µ) 5 replicatio were used. We cosider the test H : 3 vs. H A : 3. The actual size of tests are give i Table ad the power of tests for 3.54 are give i Tables 3 ad 4 respectively. It ca be observed fro the below tables that the actual size of test of CAT ethod is always less tha the oial level however this ca ot be happe i the other ethods. I the geeralized p-value ethod the actual size is as good as the CAT ethod but its actual size is ofte ore tha the oial level. The other two ethods are very liberal. oreover i all ethods the actual size will be close to the oial level as the saple size icreases (Table ). REFERECES. Pal. W.K. i ad C.H. ig 7. A coputatioal approach to statistical ifereces J. Appl. Probab. Statist. : 3-35.. Chag C.H. ad. Pal 8. A Revisit to the Behres-Fisher Proble: Copariso of Five Test ethods Couicatios i statistics-siulatio ad Coputatio 37: 64-85. 3. Chag C.H.. Pal W. i ad J.J. i. Coparig several populatio eas: a paraetric bootstrap ethod ad its copariso with usual AOVA F test as well as AO Coputatioal Statistics 5: 7-95. 4. Zhou X.H. ad S. Gao 997. Cofidece itervals for the log-oral ea Statistics i ed. 6: 783-79. 5. Zhou X.H. S. Gao ad S.. Hui 997. ethods for coparig the eas of two idepedet log-oral saples Bioetrics 53: 9-35. 6. Krishaoorthy K. ad T. athew 3. Ifereces o the eas of logoral distributios usig geeralized p-values ad geeralized cofidece itervals J. Statistical Plaig ad Iferece 5: 3-. 849