MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST
|
|
|
- Lora Higgins
- 9 years ago
- Views:
Transcription
1 MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST Zahayu Md Yusof, Nurul Hanis Harun, Sharipah Sooad Syed Yahaya & Suhaida Abdullah School of Quantitative Sciences College of Arts and Sciences Universiti Utara Malaysia [email protected], [email protected], [email protected], [email protected] ABSTRACT Classical parametric test such as ANOVA often used in testing the equality of central tendency since this method provide a good control of Type I error and generally more powerful than other statistical methods. However, ANOVA is known to be adversely affected by non-normality, heteroscedasticity, and unbalanced design. Type I error and power rates are substantially affected when these problems occur simultaneously. Continuously using ANOVA under the influence of these problems eventually will result in unreliable findings. Normality and homogeneity are two assumptions that need to be fulfilled when dealing with classical parametric test and not all data encompassed with these assumptions. Thus, this study proposed a robust procedure that insensitive to these assumptions namely Parametric Bootstrap (PB) with a popular robust estimator, MAD n. The p-value produced by modified PB was then compared with the p-value of ANOVA and Kruskal-Wallis test. The finding showed that modified PB able to produce significant result in testing the equality of central tendency measure. Field of Research: robust statistics, education Introduction ANOVA is used to determine the mean equality for more than two groups while independent samples t-test is frequently used when researchers want to make inferences about two independent groups by using the sample mean. However, a characteristic of these procedures is the fact that making inference depends on certain assumptions that need to be fulfilled. There are three main assumptions that need to be fulfilled before making inference on the classical parametric test namely: (a) collecting data from independents groups, (b) the data are normally distributed and (c) variances in the groups are equal (homoscedasticity). However, the specific interest of this study will focus only for the assumptions of normality and equality of variances in the groups since these assumptions are rarely met in real data. Violation of normality and equality of variances assumptions can have drastic effect on the result of classical parametric test especially on Type I error and Type II error (Erceg-Hurn & Mirosevich, 2008). Type I error occur when the true null hypothesis is rejected while Type II error occur when the null hypothesis is failed to reject even though it is false. The probability that Type II error will not occur is considered the power of a test. Failure to meet normality and equality of variances assumptions can cause the Type I error rate to distort. For example, the actual probability of Type I error rate should be in between of and if the significant level (α) is set at This brings the meaning that the probability of Type I error must be within the level of significant bound when the null hypothesis is assumed to be true. However, violation of any of these assumptions can lead to inflated the Type I error rates and consequently will 2013, Langkawi, Malaysia. (e-isbn ). Organized by WorldConferences.net 353
2 make the Type I error contained outside the level of significant bound when the null hypothesis is assumed to be true (Wilcox & Keselman, 2010). Classical parametric test are often used in testing the equality of central tendency such as mean, mode and median. However, classical parametric test have underlying assumptions that need to be fulfilled before analyzing the data. Recently, many published articles have shown that violation in classical parametric test assumptions can give biased results (Wilcox, 2002; Lix & Keselman, 1998; Micceri, 1989). This situation attracted researchers in finding a test statistic that can control the Type I error much better under the condition of non-normality distributions and unequal variances. Thus, robust statistics was introduced as an alternative approach in handling the violation of normality distribution and equality of variances assumptions. According to Hampel (2001), robust statistics is the stability theory of statistical procedures. It means that the statistical procedures insensitive to the violation of non-normality and unequal variances and hence will provide a good controlling of Type I error rates. Several procedures have been introduced and recommended for analyzing the data when the assumptions of normality distribution and equality of variances are violated (Krishnamoorthy, Lu & Mathew, 2007; Md Yusof, Abdullah & Yahaya, 2012; Fan & Hancock, 2012). Among the earlier procedures used by researchers are Welch test (1951), James test (1951) and Box test (1953). These three tests appear to be the most prevalent in controlling Type I error rate and providing competitive power under varying variances heterogenous conditions. However, these tests can be biased when the data are both unequal variances and non-normal distribution especially when the group design is unequal. In term of robustness of these alternative procedures, no one approach is the best in all the situations. To overcome this problem, many researchers had contributed in the development of alternative approach such as robust procedures. Robust procedures involve replacing the original mean and variances with robust measures of location. For example, some researchers proposed using trimmed mean and Winsorized variances when applying alternatives approaches such as James test, Welch test and Parametric Bootstrap test (Lix & Keselman, 1998: Keselman, et al., 2002: Md Yusof, Abdullah, Syed Yahaya & Othman, 2011) since robust procedures can improve robustness. The applying test intended to provide much better Type I error control when computed with trimmed means and Winsorized variance (Lix & Keselman, 1998). Among the latest procedures in detecting differences between location measures or assessing the effects of a treatment variable across groups is a statistic known as PB which was proposed by Krishnamoorthy et al., (2007). An interesting characteristic of this statistic is that no trimming needs to be done on the data when they are skewed. It is the primary goal of this paper to investigate the robustness of this statistic towards non-normality and heteroscedasticity by combining the statistic with some other alternatives scale estimators in controlling the Type I error rates and at the same time trying to increase the power of the test. 2. Methods In this section, we discussed on the modified PB method, which combines PB statistic with one of the scale estimators suggested by Rousseuw and Croux (1993) and the approximation of the unknown sampling distribution of the modified PB was done by using the bootstrap percentile method. 2013, Langkawi, Malaysia. (e-isbn ). Organized by WorldConferences.net 354
3 2.1 PB Statistic This study will use Parametric Bootstrap procedure as test statistic. Krishnamoorthy et al. (2007) proposed Parametric Bootstrap test as a relatively new statistics for comparing the equality of central tendency such as means of independent under unequal variances in the groups. Parametric Bootstrap test involves generating sample statistics from parametric model where the parameters are replaced by their estimates using bootstrap method. The objective of Krishnamoorthy et al. (2007) study is to compare the proposed Parametric Bootstrap test with three other tests such as Welch test, James test and generalized F (GF) test. Based on the results obtained, Parametric Bootstrap intended to provide a good Type I error control and more powerful than the original Welch test, James test and the generalized F (FG) test. Parametric Bootstrap test also can provide a good controlling of Type I error even for small sample sizes and the number of groups was large where the sample test statistics, T N0 is compute using the following formula: ² T N0 = - / / However, according to Cribbie et al. (2010), Welch test using robust estimators (trimmed means and winsorized variance) provided an excellent Type I error control compared to Parametric Bootstrap test under the presence of non-normal data and unequal variances. In addition, the power for the Welch test using trimmed means and Winsorizes variances always more powerful than the original Welch test, James test and Parametric Bootstrap test. To further reducing the effect of non-normality, Cribbie et al. (2012) demonstrated the Parametric Bootstrap test using trimmed means and Winsorized variances provide better Type I error control compared to the original Welch test and Parametric Bootstrap test when comparing the central tendencies of groups with non-normal distributions and unequal variances. Apart from that, Parametric Bootstrap test with trimmed mean is a test statistic that was shown to provide a good controlled of Type I error even for small sample sizes and there are many groups (Cribbie et al., 2012). Bootstrap can be carried out by parametric approach and nonparametric approach. According to Lee (1994), Parametric Bootstrap result may be more accurate than their nonparametric version. Hence, this study will not consider the Nonparametric Bootstrap. 2.2 Scale estimator Let X = x, x,..., x ) be a random sample from any distribution and let the sample median be ( 1 2 n denoted by med i x i MADn MADn is median absolute deviation about the median. Given by with b as a constant, this scale estimator is very robust with best possible breakdown point and bounded influence function. Huber (1981) identified MADn as the single most useful ancillary estimate of scale due to its high breakdown property. MADn is simple and easy to compute. The constant b is needed to make the estimator consistent for the parameter of interest. For example if the observations are randomly sampled from a normal distribution, by including b = , the MADn will estimateσ, the standard deviation. With constant b = 1, MADn will estimate 0.75σ, and this is known as MAD. 2013, Langkawi, Malaysia. (e-isbn ). Organized by WorldConferences.net 355
4 2.3 Bootstrap Method Since the sampling distribution of PB is intractable, and its asymptotic null distribution may not be of much use for practical sample sizes, the bootstrap method is considered to give a better approximation. Therefore, to assess statistical significance in this study, percentile bootstrap method (see, e.g. Efron and Tibshirani, 1993) was used. According to Babu, Padmanabhan and Puri (1999), the bootstrap method is known to give a better approximation than the one on the normal approximation theory and this method is attractive, especially when the samples are of moderate size. Bootstrap was introduced by Efron (1979) as a computer-based method for estimating the standard error of an estimator, θˆ. This method has gained a great deal of popularity in empirical research. The word bootstrap is used to indicate that the observed data are used not only to obtain an estimate of the parameter but also to generate new samples from which many more estimates may be obtained, and hence an idea of the variability of the estimate (Staude & Sheather, 1990). The basic idea is that in the absence of any other information about a population, the values in a random sample are the best guide to the distribution, and resampling the sample is the best guide to what can be expected from resampling the population. To obtain the p-value, the percentile bootstrap method is used as follows, (1) Calculate PB based on the available data. (2) Generate bootstrap samples by randomly sampling with replacement n j observations from the * * * jth group yielding Y Y,... Y. 1 j, 2 j, n j j (3) Each if the sample points in the bootstrapped groups must be centered at their respective estimated medians. (4) Use the bootstrap sample to compute the PB statistic denoted by PB *. (5) Repeat Step 2 to Step 4 B times yielding PB situations when n 12 (Wilcox, 2005). * (6) Calculate the p-value as (# of PB 1B > PB1) / B * * * 11,PB12,..., PB1 B. B = 599 appears sufficient in most Type I error and power of test corresponding to each method will be determined and compared. 3. Analysis on Real Data The performance of the modified PB method was demonstrated on real data. Four classes (groups) of Decision Analysis (2 nd Semester 2010/2011) conducted by 4 different lecturers were chosen at random. The final marks were recorded and tested for the equality between the classes. The sample sizes for Class 1, 2, 3 and 4 were 33, 32, 19 and 20 respectively. The descriptive statistics for each of the groups and the results of the test in the form of p-values are given in Table 1 and Table 2 respectively. 2013, Langkawi, Malaysia. (e-isbn ). Organized by WorldConferences.net 356
5 Table 1: Descriptive statistics for each group Lecturer Sample size (N) Mean of the marks Std. Deviatio n Std. Error 95% Confidence Interval for Mean Minimum Maximum Lower Bound Upper Bound Total Table 2: Real Data Groups Data We employed the Shapiro-Wilk test in order to determine the normality of data analysis since the small sample sizes was used. Based on Shapiro-Wilk test, group 2, group 3 and group 4 were found out to be non-normally distributed with the p-values of 0.130, and Table 3: Results of the test using different methods Methods p-value ANOVA Kruskall Wallis PB with MADn For comparison, the data were tested using all the three procedures mentioned in this study namely ANOVA, Kruskall Wallis and the modified PB. As can be observed in Table 2, when testing using ANOVA, the result fails to reject the null hypothesis such that the performance for all groups is equal. On contrary, when using Kruskall Wallis and modified PB method, the tests show significant result (reject the null hypothesis). The former result indicates that ANOVA fails to detect the difference which exists between the groups. Both the non parametric (Kruskall Wallis) and robust methods (modified PB) show better detection. Even though Kruskall Wallis shows stronger significance (p = ) as compared to the modified PB (p = ), but Kruskall Wallis in general only gave a brief information on the data. Thus, misrepresentation of the result could occur. 4. Conclusion The goal of this paper is to find the alternative procedures in testing location parameter for skewed distribution by simultaneously controlling the Type I error and power rates. Classical method such as ANOVA is not robust to nonnormality and heteroscedasticity. When these problem occur at the same 2013, Langkawi, Malaysia. (e-isbn ). Organized by WorldConferences.net 357
6 time, the Type I error will inflate causing spurious rejections of the null hypotheses and power of test can be substantially reduced from theoretical values, which will result in differences going undetected. Realizing the need of a good statistic in addressing these problems, we integrate the PB statistic by Krishnamoorthy et al., (2007) with the high breakdown scale estimators of Rousseuw and Croux (1993) and these new method are known as the modified PB method. This paper has shown some improvement in the statistical solution of detecting differences between location parameters. In controlling the Type I error rate, the study reported in this paper leads us to formulate the following conclusions and recommendations. When symmetry is suspect, we can avoid trimming the observations by using the PB with MADn suggested in our study. Acknowledgement The authors would like to acknowledge the work that led to this paper, which was partially funded by the RAGS provided by Ministry of Education. References Babu, G.J., Padmanabhan, A.R and Puri, M.L. (1999). Robust one-way ANOVA under possibly non regular conditions. Biometrical Journal, 41: Box, G. E. P. (1953). Non-normality and tests on variances. Biometrika,10, Cribbie, R. A., Fiksenbaum, L., Keselman, H. J., & Wilcox, R. R. (2012). Effect of nonnormality on test statistics for one-way independent groups designs. British Journal of Mathematical and Statistical Psychology, 65, Efron, B. (1979). Bootstrap methods: another look at the jackknife. Annals of Statistics. 7: 1-26 Efron, B and Tibshirani, R.J. (1993). An Introduction to the Bootstrap. Chapman & Hall Inc. Erceg-Hurn, D. M., & Mirosovich, V. M. (2008). Modern robust statistical methods: An easy way to maximize the accuracy and power of your research. American Psychologist, 63, Fan, W., & Hancock, G. R. (2012). Robust means modeling: an alternative for hypothesis testing of independent means under variance heterogeneity and nonnormality. Journal of Educational and Behavioral Statistics, 37, Hampel, F. R. (2001). Robust statistics: A brief introduction and overview. Huber, P. J. (1981). Robust statistics. New York: Wiley. James, G. S. (1951). The comparison of several groups of observations when the ratios of the population variances are unknown. Biometrika, 38: Keselman, H. J., Wilcox, R. R., Othman, A. R., & Fradette, K. (2002). Trimming, transforming statistics, and bootstrapping: circumventing the biasing effects of heteroscedasticity and nonnormality. Journal of Modern Applied Statistical Methods, 1, Krishnamoorthy, K., Lu, F., & Mathew, T. (2007). A parametric bootstrap approach for ANOVA with unequal variances: Fixed and random models. Computational Statistics and Data Analysi, 51, , Langkawi, Malaysia. (e-isbn ). Organized by WorldConferences.net 358
7 Lee, S.M.N. (1994). Optimal choice between parametric and nonparametric bootstrap estimates. Lix, L. M., & Keselman, H. J. (1998). To trim or not to trim: Tests of location equality under heteroscedasticity and non-normality. Educational and Psychological Measurement. Math. Proc. Cambridge Philosophical Society, 115, Md Yusof, Z. M., Abdullah, S., Syed Yahaya, S. S., & Othman, A. R. (2011). Testing the equality of central tendency measures using various trimming strategies. African Journal of Mathematics and Computer Science Research, 4(1), Md Yusof, Z. M., Abdullah, S., & Syed Yahaya, S. S. (2012). Type I error rates of parametric, robust and nonparametric methods for two group cases. World Applied Sciences Journal, 16(12), Micceri, T. (1989). The unicorn, the normal curve, and other improbable creatures. Psychological Bulletion, Rousseeuw, P.J.and Croux, C. (1993). Alternatives to the median absolute deviation. Journal of the American Statistical Association, 88: Staudte, R.G. and Sheather, S.J. (1990). Robust Estimation and Testing. John Wiley & Sons, Inc., New York Welch, B.L. (1951). On the comparison of several mean values: an alternative approach. Biometrika, 38: Wilcox, R. R. (2002). Understanding the practical advantages of modern ANOVA methods. Journal of Clinical Child and Adolescent Psychology, 31, Wilcox, R. R. (2005). Introduction to robust estimation and hypothesis testing (2nd Ed.). San Diego, CA: Academic Pres. Wilcox, R. R., & Keselman, H. J. (2010). Modern robust data analysis methods: Measures of central tendency , Langkawi, Malaysia. (e-isbn ). Organized by WorldConferences.net 359
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
Permutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
Non Parametric Inference
Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable
13: Additional ANOVA Topics. Post hoc Comparisons
13: Additional ANOVA Topics Post hoc Comparisons ANOVA Assumptions Assessing Group Variances When Distributional Assumptions are Severely Violated Kruskal-Wallis Test Post hoc Comparisons In the prior
1 Nonparametric Statistics
1 Nonparametric Statistics When finding confidence intervals or conducting tests so far, we always described the population with a model, which includes a set of parameters. Then we could make decisions
UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST
UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
Rank-Based Non-Parametric Tests
Rank-Based Non-Parametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs
THE KRUSKAL WALLLIS TEST
THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON
Introduction to Analysis of Variance (ANOVA) Limitations of the t-test
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only
UNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
Robust t Tests. James H. Steiger. Department of Psychology and Human Development Vanderbilt University
Robust t Tests James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 29 Robust t Tests 1 Introduction 2 Effect of Violations
How To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
individualdifferences
1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,
Exact Nonparametric Tests for Comparing Means - A Personal Summary
Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola
Variables Control Charts
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables
SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
Lecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
Study Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
The Assumption(s) of Normality
The Assumption(s) of Normality Copyright 2000, 2011, J. Toby Mordkoff This is very complicated, so I ll provide two versions. At a minimum, you should know the short one. It would be great if you knew
Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
Biostatistics: Types of Data Analysis
Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics [email protected] http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
How To Test For Significance On A Data Set
Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.
Quantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
Parametric and non-parametric statistical methods for the life sciences - Session I
Why nonparametric methods What test to use? Rank Tests Parametric and non-parametric statistical methods for the life sciences - Session I Liesbeth Bruckers Geert Molenberghs Interuniversity Institute
Tutorial 5: Hypothesis Testing
Tutorial 5: Hypothesis Testing Rob Nicholls [email protected] MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................
Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
Impact of Skewness on Statistical Power
Modern Applied Science; Vol. 7, No. 8; 013 ISSN 1913-1844 E-ISSN 1913-185 Published by Canadian Center of Science and Education Impact of Skewness on Statistical Power Ötüken Senger 1 1 Kafkas University,
Standard Deviation Estimator
CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Two-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
The Variability of P-Values. Summary
The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 [email protected] August 15, 2009 NC State Statistics Departement Tech Report
Modern Robust Statistical Methods
Modern Robust Statistical Methods An Easy Way to Maximize the Accuracy and Power of Your Research David M. Erceg-Hurn Vikki M. Mirosevich University of Western Australia Government of Western Australia
UNIVERSITY OF NAIROBI
UNIVERSITY OF NAIROBI MASTERS IN PROJECT PLANNING AND MANAGEMENT NAME: SARU CAROLYNN ELIZABETH REGISTRATION NO: L50/61646/2013 COURSE CODE: LDP 603 COURSE TITLE: RESEARCH METHODS LECTURER: GAKUU CHRISTOPHER
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
Statistical tests for SPSS
Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly
Statistics 3202 Introduction to Statistical Inference for Data Analytics 4-semester-hour course
Statistics 3202 Introduction to Statistical Inference for Data Analytics 4-semester-hour course Prerequisite: Stat 3201 (Introduction to Probability for Data Analytics) Exclusions: Class distribution:
T test as a parametric statistic
KJA Statistical Round pissn 2005-619 eissn 2005-7563 T test as a parametric statistic Korean Journal of Anesthesiology Department of Anesthesia and Pain Medicine, Pusan National University School of Medicine,
MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES
MASTER COURSE SYLLABUS-PROTOTYPE THE PSYCHOLOGY DEPARTMENT VALUES ACADEMIC FREEDOM AND THUS OFFERS THIS MASTER SYLLABUS-PROTOTYPE ONLY AS A GUIDE. THE INSTRUCTORS ARE FREE TO ADAPT THEIR COURSE SYLLABI
CHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing
Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing
Introduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
Recall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
Chapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
t-test Statistics Overview of Statistical Tests Assumptions
t-test Statistics Overview of Statistical Tests Assumption: Testing for Normality The Student s t-distribution Inference about one mean (one sample t-test) Inference about two means (two sample t-test)
HYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420
BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test
1.5 Oneway Analysis of Variance
Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments
Introduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
Nonparametric Statistics
Nonparametric Statistics References Some good references for the topics in this course are 1. Higgins, James (2004), Introduction to Nonparametric Statistics 2. Hollander and Wolfe, (1999), Nonparametric
Non-Parametric Tests (I)
Lecture 5: Non-Parametric Tests (I) KimHuat LIM [email protected] http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent
Error Type, Power, Assumptions. Parametric Tests. Parametric vs. Nonparametric Tests
Error Type, Power, Assumptions Parametric vs. Nonparametric tests Type-I & -II Error Power Revisited Meeting the Normality Assumption - Outliers, Winsorizing, Trimming - Data Transformation 1 Parametric
Skewed Data and Non-parametric Methods
0 2 4 6 8 10 12 14 Skewed Data and Non-parametric Methods Comparing two groups: t-test assumes data are: 1. Normally distributed, and 2. both samples have the same SD (i.e. one sample is simply shifted
Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish
Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)
3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test
Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric
Chapter 2. Hypothesis testing in one population
Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance
research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other
1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric
Research Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
NCSS Statistical Software. One-Sample T-Test
Chapter 205 Introduction This procedure provides several reports for making inference about a population mean based on a single sample. These reports include confidence intervals of the mean or median,
Chapter 12 Nonparametric Tests. Chapter Table of Contents
Chapter 12 Nonparametric Tests Chapter Table of Contents OVERVIEW...171 Testing for Normality...... 171 Comparing Distributions....171 ONE-SAMPLE TESTS...172 TWO-SAMPLE TESTS...172 ComparingTwoIndependentSamples...172
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
MEASURES OF LOCATION AND SPREAD
Paper TU04 An Overview of Non-parametric Tests in SAS : When, Why, and How Paul A. Pappas and Venita DePuy Durham, North Carolina, USA ABSTRACT Most commonly used statistical procedures are based on the
Chapter 7. One-way ANOVA
Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks
Comparing Means in Two Populations
Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we
NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)
NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions
Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015
Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation
From the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
The Wilcoxon Rank-Sum Test
1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We
Name: Date: Use the following to answer questions 3-4:
Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.
Robust Tests for the Equality of Variances Author(s): Morton B. Brown and Alan B. Forsythe Source: Journal of the American Statistical Association, Vol. 69, No. 346 (Jun., 1974), pp. 364-367 Published
Descriptive Statistics
Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9
STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
Chapter G08 Nonparametric Statistics
G08 Nonparametric Statistics Chapter G08 Nonparametric Statistics Contents 1 Scope of the Chapter 2 2 Background to the Problems 2 2.1 Parametric and Nonparametric Hypothesis Testing......................
There are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month
5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
How Far is too Far? Statistical Outlier Detection
How Far is too Far? Statistical Outlier Detection Steven Walfish President, Statistical Outsourcing Services [email protected] 30-325-329 Outline What is an Outlier, and Why are
Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test
Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely
SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?
SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties
p ˆ (sample mean and sample
Chapter 6: Confidence Intervals and Hypothesis Testing When analyzing data, we can t just accept the sample mean or sample proportion as the official mean or proportion. When we estimate the statistics
Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
Sample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS
Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical
Two-sample inference: Continuous data
Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As
Inference for two Population Means
Inference for two Population Means Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison October 27 November 1, 2011 Two Population Means 1 / 65 Case Study Case Study Example
4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
