The Concept of the Null Hypothesis

Similar documents
LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Philosophical argument

Hypothesis testing - Steps

1.5 Oneway Analysis of Variance

Topic #6: Hypothesis. Usage

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

Science and Scientific Reasoning. Critical Thinking

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

DEVELOPING HYPOTHESIS AND

The Null Hypothesis. Geoffrey R. Loftus University of Washington

Some Essential Statistics The Lure of Statistics

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

HYPOTHESIS TESTING: POWER OF THE TEST

Effects of CEO turnover on company performance

CHAPTER 14 NONPARAMETRIC TESTS

2 Sample t-test (unequal sample sizes and unequal variances)

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Framing And Testing Hypotheses

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Statistical tests for SPSS

Cosmological Arguments for the Existence of God S. Clarke

The Mann-Whitney U test. Peter Shaw

Likelihood: Frequentist vs Bayesian Reasoning

Sample Size and Power in Clinical Trials

Unit 26 Estimation with Confidence Intervals

Online 12 - Sections 9.1 and 9.2-Doug Ensley

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

A Short Guide to Significant Figures

Background Biology and Biochemistry Notes A

CHANCE ENCOUNTERS. Making Sense of Hypothesis Tests. Howard Fincher. Learning Development Tutor. Upgrade Study Advice Service

HOW TO WRITE A LABORATORY REPORT

3.4 Statistical inference for 2 populations based on two samples

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable

Introduction to Hypothesis Testing OPRE 6301

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

How To Test For Significance On A Data Set

Descriptive Statistics

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Vieta s Formulas and the Identity Theorem

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Crosstabulation & Chi Square

Testing Research and Statistical Hypotheses

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

The Public Policy Process W E E K 1 2 : T H E S C I E N C E O F T H E P O L I C Y P R O C E S S

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Testing Hypotheses About Proportions

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen.

StatCrunch and Nonparametric Statistics

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Comparing Means in Two Populations

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Week 4: Standard Error and Confidence Intervals

Simple Regression Theory II 2010 Samuel L. Baker

Part 2: Analysis of Relationship Between Two Variables

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

The Science of Biology

Statistics 2014 Scoring Guidelines

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Lesson 9 Hypothesis Testing

NPV Versus IRR. W.L. Silber We know that if the cost of capital is 18 percent we reject the project because the NPV

Science and Religion

Scientific Method Worksheet

Mind on Statistics. Chapter 12

WISE Power Tutorial All Exercises

22. HYPOTHESIS TESTING

CALCULATIONS & STATISTICS

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Fairfield Public Schools

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

Pre-Algebra Lecture 6

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Lecture Notes Module 1

Classroom Management Plan

How To Understand The Relation Between Simplicity And Probability In Computer Science

Handouts for teachers

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Unit 27: Comparing Two Means

II. DISTRIBUTIONS distribution normal distribution. standard scores

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Nonye Azih and B.O. Nwosu Department of Business Education, Ebonyi State University, Abakaliki, Nigeria

Multiple-Comparison Procedures

The Human Genome. Genetics and Personality. The Human Genome. The Human Genome 2/19/2009. Chapter 6. Controversy About Genes and Personality

Unit 3 Handout 1: DesJardin s Environmental Ethics. Chapter 6 Biocentric Ethics and the Inherent Value of Life

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Study Guide for the Final Exam

Observing and describing the behavior of a subject without influencing it in any way.

Analysis and Interpretation of Clinical Trials. How to conclude?

Transcription:

The Concept of the Null Hypothesis NRC 601 Research Concepts in Natural Resources Department of Natural Resources Conservation University of Massachusetts Amherst Fall 2009

Null: - definition in Webster s dictionary (1993). - from the Latin nullus, ne- not + ullus any. - 1. having no binding force: invalid; 2. amounting to nothing: nil; 3. having no value. - hypothesis is singular; hypotheses is plural. Null Hypothesis: - a statement of no difference. - a starting point of a scientific investigation. - a way of simplifying our approach and conforming to the concept of falsification.

Francis Bacon: Promote negative instances... Try to disprove an idea... Cannot prove beyond a doubt Karl Popper: Our belief in any natural law cannot have a safer basis than our unsuccessful attempts to refute it.?? Charles Romesburg: Use the H-D method or else...

Sir William of Ockham (c 1285-1349) - English scholar and Franciscan monk - taught at Oxford University 1310-1324 - an empiricist; questioned current philosophies and the power of the pope charged with heresy because of his Master s thesis - thought to have died of the bubonic plague Believed that unnecessary complexity was vein and insulting to God... Occam s razor: "What can be done with fewer [assumptions] is done in vain with more." aka The Principle of Parsimony

Null hypotheses do two things: (a) Restructure our question into the falsification framework, as proposed by Bacon, Popper, et al.; (b) Try to account for patterns in the data in the simplest way possible, which usually means...... that we attribute variation in the data to randomness or measurement error.

For example: - We want to know if snowmobiles result in increase stress in wolves. - We compare the hormone levels in blood of wolves exposed to snowmobiles to hormone levels in blood of wolves not exposed to snowmobiles. - The means of the 2 groups are not likely to be exactly equivalent... The question is, is the difference enough to be due to some external force (i.e., snowmobiles) other than random variation alone?

The null hypothesis is paired with an alternative: (a) The null states that there is no pattern ; i.e., no difference between (2) or among (>2) groups, or no relationship between two continuous variables. (b) The alternative states that a pattern does exist; i.e., there are distinct differences between or among groups, or a clear relationship exists between two continuous variables. (c) If the alternative is the case, then you must ask how such patterns relate to the scientific hypothesis you are testing.

Statistical Hypotheses versus Scientific Hypotheses: (a) The statistical hypothesis tests for a pattern, and helps you decide, based on the test statistic, if there is no pattern (NULL) or there is a pattern (ALTERNATIVE) at some specific level of probability. (b) A scientific hypothesis is your candidate explanation for how or why that pattern exists... [... keeping in mind that we can, and should, have multiple competing hypotheses.]

Intermediate Disturbance Hypothesis = biodiversity will be greatest where disturbance is intermediate, and lower at the two extremes of very low disturbance and very high disturbance. Does this apply to human development? Hypothesize that biodiversity of dragonflies will be highest at intermediate development levels. H o : B HIGH = B MED = B LOW H A1 : B HIGH B MED B LOW H A2 : B HIGH < B MED > B LOW

A sidebar: -- more structural complexity, and thus more niches; -- more diversity in food resources; -- more competition among predators; -- less opportunity for habitat specialists... Etc. The Intermediate Disturbance Hypothesis might more accurately be described as a prediction, i.e., we predict that biodiversity will be greatest where disturbance is intermediate, and lower at the two extremes of very low disturbance and very high disturbance. Our hypotheses, i.e., our candidate explanations for this phenomenon, could be the following... In areas with intermediate levels of disturbance, there is...

The procedure: - Set up H o (the NULL) and H A (the ALTERNATIVE). - Conduct an appropriate statistical test and produce the test statistic, which is the numerical result of the test, and an associated probability value (or P-value). - If the test statistic is sufficiently large, and the associated P-value sufficiently small, you reject the null of no difference and conclude that, in our case, the diversity of dragonflies is different among the 3 levels of human disturbance.

Thus, we rejected H o in favor of H A. If our test statistic was low and P was high, we would NOT accept the null,... we would fail to reject the null. We would state that there is no evidence that level of disturbance is related to diversity of species in our study,... but we would not be able to say that this is never the case.

P-values: Traditionally, we pre-select some level of alpha (i.e., P) to indicate significance Usually, P 0.05... But in field biology, we sometimes use P 0.10 Now, if we reject the null: H o : B HIGH = B MED = B LOW and simply report, biodiversity differed among areas with different levels of disturbance (P 0.05) not very informative. But if we report means and variances (SE) for each area, and the test statistic, degrees of freedom, and exact P-value much more informative.

P-values, more: The lower the P-value, the more confidence you can be: P 0.01, you might say the difference is highly significant P 0.05, you might say the difference is significant P 0.10, you might say the difference is suggestive (although I would probably say significant) What you would want to report (and not report): P = 0.04 --- the exact P-value. P = 0.04 or maybe P 0.039 --- limit significant digits. P = 0.04, not P =.04 --- no naked decimals. And always include the means and some measure of variability, and the test statistic and degrees of freedom.

P-values, still more: Our primary goal is to reject the null, but we don t want to say there is a difference (i.e., reject H o ) if there really is not: H o : B HIGH = B MED = B LOW is really true (all are equal) but we say they are not equal If we did this, we would be making a TYPE I ERROR. Some researchers refer to this as spurious results. This hurts our credibility as good scientists, so we set alpha low (e.g., 0.05) to avoid this particular problem.

P-values, and still more: However, if there really is a difference, we don t want to miss it: H o : B HIGH = B MED = B LOW is really not true (all are not equal), but we say they are equal This would be a TYPE II ERROR, which could be bad if some treatment (e.g., a pollutant, clearcutting) truly hurts an endangered species. Studies with low sample sizes (n) are especially prone to Type II errors... How might you try to counteract this?