# COMP6053 lecture: Hypothesis testing, t-tests, p-values, type-I and type-II errors

## Transcription


2 The t-test This lecture introduces the t-test -- our first real statistical test -- and the related t-distribution. The t-test is used for such things as:

- determining the likelihood that a sample comes from a population with a specified mean;
- deciding whether two samples come from the same population or not, i.e., do their means appear to be significantly different?

The t-test is not mysterious; its logic follows on from the sampling experiments we discussed last time.

3 Hypothesis testing Before getting into the details of the t-test, we need to place it in the wider context of statistical hypothesis testing. You may already know the terms "null hypothesis" and "alternative hypothesis". These terms fit into the pattern of statistical inference we discussed right at the start of the module:

- suppose that the world works in a certain way;
- calculate the chances of seeing our data in a world that really did work that way;
- use the result of the calculation to reflect on how much we like our original supposition.

4 Hypothesis testing Consider a forensic example: a person has died and we're not sure whether it was natural causes (H null) or poisoning (H alternative). If we suppose it was natural causes, we can ask what the probability is of a person of that age, fitness, medical history, etc., having their heart stop without warning. If that probability is very small, we will naturally look more favourably on the poisoning hypothesis.

5 Hypothesis testing: investment example Another example: you inherit some money, and you ask a friend who knows about the stock market to invest it for you. One year later, your friend tells you all the money is gone.

6 Hypothesis testing: investment example How angry are you? Do you trust your friend? The null hypothesis: your friend simply had an unlucky run of investments. It could have happened to anyone. The alternative hypothesis: your friend has cheated you.

7 Hypothesis testing: investment example What we really want to know is whether your friend has cheated you, of course. We might express that as "what's the probability that he cheated me, given that he's claiming to have lost all the money?" (The Bayesian version of the question.) We could also say "he either cheated me or he didn't: which one should I believe given that he's claiming to have lost all the money?" (The frequentist version of the question.) But there may be no direct way to know the answer to those questions.

8 Hypothesis testing: investment example We could turn the question around and say "He is either unlucky (H null) or a cheat (H alt). Under each of those hypotheses, what are the chances of seeing a loss of all the money?" Consider H alt. Assume he's cheating you; what are the chances that he'd report a total loss of the money after one year? This is quite hard to answer: it depends on just how sophisticated a cheat he is. Consider H null. What are the chances that an honest investor would lose the money in the market as it's been over the last year? A more tractable question...

9 Hypothesis testing: investment example You could ask some independent experts just how tough the market has been that year. You could simulate a range of investment strategies using historical market data. You could look empirically at how many people out of the wider population of investors lost all their money over the last year.

10 Hypothesis testing: investment example Using one or all of these methods, let's say you find that it's been a very tough year, and in fact there's a 50% chance of an honest investor having lost all their money. It's therefore hard to rule out the null hypothesis. You're forced to conclude something like "He may well be honest." But let's say you find that it's been a great year, and that only 1 honest investor in 1000 lost money. If you want to hang onto the null hypothesis (honesty) under these circumstances, you have to accept that a very unlikely thing has happened.

11 Hypothesis testing: investment example So because of the small probability of the observed data (total loss) given the hypothesis (honesty) you are nudged towards the conclusion that the alternative hypothesis (cheating) is likely to be true. Let's say you're in this situation all the time: you run a hedge fund and you have many traders working for you, any of whom might decide to cheat you one year. You could choose to adopt some threshold for p(total loss given assumption of honesty) that means you'll reject the honesty assumption.

12 Hypothesis testing: investment example Let's say you decide (somewhat arbitrarily) to go with 1% as your threshold. If someone loses money, and the probability of an honest trader losing their funds is 1% or lower, you conclude that the person is not honest. This is not a bad decision rule, but note that it can't be perfect. Sometimes an honest person will get extremely unlucky and then be unfairly accused by you, and sometimes a cheat will steal from you but p(loss | honesty) will be above the 1% threshold.

13 Statistical tests In the terms of our investment example, a statistical test is just a procedure for calculating p(loss | honesty) or its equivalent. The t-test is one such test. In general we want to know p(observed data | H null). This is all a "p-value" is. Because of a throwaway remark by Ronald Fisher, the threshold for rejecting H null has been set at p ≤ 5% in many fields. There is nothing magical about this value, however.

14 Logic of the one-sample t-test We found in previous lectures that if we take repeated samples from a population, even of quite small size, the distribution of the means of those samples quickly approximates the bell curve of the normal distribution. If we're dealing with big sample sizes, the distribution of the sample means is as close as makes no difference to being the normal distribution. But the match is not perfect for small samples. This is where the t-distribution comes in.

15 Fictional data exercise We can demonstrate the logic of the t-test by working through an exercise in sampling from a fictional distribution. Let's say we have some kind of IQ measure where the true distribution is normal, with a mean of 100 and a standard deviation of 10.


17 Particular sample: size 2, mean = 97.6

18 Particular sample: size 6, mean = 98.9

19 What is a sample? What are we doing when we take a sample of size N and calculate the mean? We know that there's a "meta-distribution" that describes the mean and standard deviations of the sample means. (Think of the green histograms.) The mean of this meta-distribution is the original population mean, and the standard deviation is the population standard deviation divided by the square root of the sample size (i.e., the standard error). So when we calculate the mean of one sample, we are drawing a random variate from this meta-distribution.
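The claim about the meta-distribution can be checked by simulation. The sketch below is not the lecture's own code: the function name `sample_means`, the seed and the number of repeats are illustrative choices, and it assumes the fictional IQ population (normal, mean 100, standard deviation 10) introduced earlier.

```python
import math
import random
import statistics


def sample_means(pop_mean, pop_sd, n, repeats=10_000, seed=0):
    """Means of `repeats` random samples of size n from a normal population."""
    rng = random.Random(seed)
    return [
        statistics.fmean(rng.gauss(pop_mean, pop_sd) for _ in range(n))
        for _ in range(repeats)
    ]


for n in (2, 6, 30):
    means = sample_means(100, 10, n)
    observed_sd = statistics.stdev(means)   # spread of the simulated sample means
    predicted_se = 10 / math.sqrt(n)        # population SD / sqrt(N)
    print(f"N={n:2d}  mean of means={statistics.fmean(means):6.2f}  "
          f"SD of means={observed_sd:5.2f}  predicted SE={predicted_se:5.2f}")
```

If the reasoning in the slide holds, the mean of the simulated means should sit close to 100 and their spread should track the predicted standard error for each sample size.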


23 Distribution of the sample means This distribution of the sample means "tightens up" as the sample size gets bigger. We've seen this before. We can characterize this meta-distribution in terms of its own mean and standard deviation, although of course we need to estimate those from our sample in real situations. Calculating the mean of a sample equates to drawing one variate from the meta-distribution. Therefore we can ask how often we're likely to see extreme values of the sample mean, i.e., values that lie in the tails of the meta-distribution.

24 How to calculate the probability of extreme sample means? If our sample size was big enough, we could use the normal distribution to make this calculation. We would ask how many standard deviations away from the overall mean our particular sample mean was: this is called a Z-score if the distribution is normal. In our case we're going to calculate basically the same thing and call it a t-score because we can't assume normality.

25 Calculating a t-score We want to know how many standard deviations away from the overall mean of the sampling distribution our one particular sample is. Let's flesh out the example: we take our IQ testing scheme to a new country, and we want to know whether the people here are any smarter or dumber than they were at home. This gives us our null and alternative hypotheses. The null hypothesis is that there's no difference in the IQ scores between the two countries: the mean is 100 in both cases.

26 Calculating a t-score Our alternative hypothesis is simply the converse, that there is some difference in IQ scores between the countries. Note that we usually don't have a commitment about whether the difference, if there is one, will be positive or negative. H null : that μ = 100. You also see "H 0: μ 0 = 100". We collect a sample of 6 people, and give them IQ tests. They score: 101, 112, 100, 107, 94, 104.

27 Calculating a t-score The mean of the six scores is 103. This is higher than the null hypothesis suggests, but should we get excited? The standard deviation of the six scores is 5.66. But we don't want the plain SD; we want the sample standard deviation (division by N-1) because we're trying to estimate the population standard deviation. Remember we have to work with what we have. In this case, that's the tiny sample of 6 scores.

28 Calculating a t-score So the sample standard deviation is 6.20. The standard error is going to be 6.20 / sqrt(6), which is 2.53. We now have our best guess at the meta-distribution of the sample-of-size-six means: in the absence of any other information, we'd say that its mean is 103 and its standard deviation is 2.53. However, our null hypothesis is that our six numbers come from a distribution with a mean of 100, i.e., the same as back home.
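The arithmetic for the six scores can be reproduced with Python's standard library. This is a minimal sketch rather than the lecture's own code; the variable names are illustrative.

```python
import math
import statistics

iq_scores = [101, 112, 100, 107, 94, 104]
n = len(iq_scores)

sample_mean = statistics.fmean(iq_scores)    # 103.0
plain_sd = statistics.pstdev(iq_scores)      # divides by N
sample_sd = statistics.stdev(iq_scores)      # divides by N-1
standard_error = sample_sd / math.sqrt(n)    # estimated SD of the sample means

print(f"mean = {sample_mean:.1f}")
print(f"plain SD = {plain_sd:.2f}, sample SD = {sample_sd:.2f}")
print(f"standard error = {standard_error:.2f}")
```

The gap between `pstdev` and `stdev` is the N versus N-1 distinction described above; it matters most for tiny samples like this one.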

29 Calculating a t-score We might have helped ourselves to the assumption that the standard deviation of IQ scores in this new country is 10, the same as at home. But we're not going to do that: who is to say that IQ doesn't have a different spread here? So our null hypothesis says: let's imagine that our sample mean comes from a distribution of sample means with mean of 100 and standard deviation of 2.53. This gives us our t-statistic:

$$t = \frac{\bar{x} - \mu_0}{s / \sqrt{N}}$$

where $\bar{x}$ is the sample mean, $\mu_0$ is the mean under H null, $s$ is the sample standard deviation and $N$ is the sample size.

30 Calculating a t-score So a t-score is a lot like a z-score: it's essentially measuring the number of standard deviations from an expected mean that some measurement is. In our case, the t-score is (103 - 100) / 2.53 = 1.18. That's not a great distance from the mean: recall the 1.96 threshold for z-scores that equates to the most extreme 5% of the distribution. Similarly, it turns out that a t-score of ±1.18, or a more extreme value, happens 28.9% of the time (this is our p-value). So we're not motivated to reject H null.
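The t-score and its p-value can be computed directly. The sketch below assumes SciPy is installed and uses `scipy.stats.t.sf`, the upper-tail probability of the t-distribution; the variable names are illustrative. Without intermediate rounding the t-score comes out at about 1.186, which the slide quotes as 1.18.

```python
import math
import statistics

from scipy import stats

iq_scores = [101, 112, 100, 107, 94, 104]
mu_0 = 100                      # mean under the null hypothesis
n = len(iq_scores)

standard_error = statistics.stdev(iq_scores) / math.sqrt(n)
t_score = (statistics.fmean(iq_scores) - mu_0) / standard_error

# Two-tailed p-value: probability of a t-score at least this extreme
# in either direction, using N-1 degrees of freedom.
p_two_tailed = 2 * stats.t.sf(abs(t_score), df=n - 1)

print(f"t = {t_score:.3f}, two-tailed p = {p_two_tailed:.3f}")
```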

31 Linking a t-score to a p-value In the old days you would look up a table of critical t-values for the t-distribution with an appropriate number of degrees of freedom. Degrees of freedom come up a lot in statistics. It's just a measure of how many free parameters something has. For one-sample t-tests, the degrees of freedom are N-1, where N is the sample size. This is because to get a particular value of t, the last score in the sample is not free to vary.

32 Linking a t-score to a p-value We need to specify whether we're interested in a one-tailed or a two-tailed test. A two-tailed test is the default option. This means that we have no strong commitment on whether the sample mean is likely to be higher or lower than the mean specified in the null hypothesis. Thus we include both extreme tails of the distribution when figuring out our p-value. If for some reason we only cared about evidence for an alternative hypothesis that the mean score was (e.g.) higher, we could use a one-tailed test and look at only one side of the t-distribution in figuring out p.
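The difference between one-tailed and two-tailed tests comes down to which part of the t-distribution is counted. The helper below is a hypothetical illustration (the name `p_value_from_t` and the `tail` labels are not from the lecture) built on SciPy's `stats.t.sf` and `stats.t.cdf`.

```python
from scipy import stats


def p_value_from_t(t_score, df, tail="two-sided"):
    """Convert a t-score to a p-value for the chosen alternative hypothesis."""
    if tail == "two-sided":
        # Both extreme tails: no commitment about the direction of the effect.
        return 2 * stats.t.sf(abs(t_score), df)
    if tail == "greater":
        # Only evidence that the mean is higher than the null value counts.
        return stats.t.sf(t_score, df)
    if tail == "less":
        # Only evidence that the mean is lower than the null value counts.
        return stats.t.cdf(t_score, df)
    raise ValueError(f"unknown tail: {tail!r}")


# The IQ example: t of roughly 1.186 with 6 - 1 = 5 degrees of freedom.
for tail in ("two-sided", "greater", "less"):
    print(tail, round(float(p_value_from_t(1.186, 5, tail)), 3))
```

For a positive t-score, the "greater" p-value is half the two-tailed one, which is why the choice of tail should be made before looking at the data.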

33 Linking a t-score to a p-value In Python, use `from scipy import stats` and then `stats.ttest_1samp(iqscores, 100)`. In R, `t.test(iqscores, mu=100)`. Or we could do it "empirically" through simulating the sampling process...
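One way the "empirical" route might look is sketched below, using only the standard library. It is not the lecture's own simulation code: the function names, seed and repeat count are illustrative, and it assumes the population is normal. Under that assumption the distribution of the t-statistic does not depend on the population standard deviation, so `assumed_sd` can be any positive value.

```python
import math
import random


def t_statistic(sample, mu_0):
    """One-sample t-statistic: (sample mean - mu_0) / standard error."""
    n = len(sample)
    mean = sum(sample) / n
    sample_var = sum((x - mean) ** 2 for x in sample) / (n - 1)
    return (mean - mu_0) / math.sqrt(sample_var / n)


def empirical_p_value(observed, mu_0, assumed_sd=10.0, repeats=20_000, seed=0):
    """Fraction of null-world samples whose |t| is at least the observed |t|."""
    rng = random.Random(seed)
    t_obs = abs(t_statistic(observed, mu_0))
    n = len(observed)
    extreme = 0
    for _ in range(repeats):
        simulated = [rng.gauss(mu_0, assumed_sd) for _ in range(n)]
        if abs(t_statistic(simulated, mu_0)) >= t_obs:
            extreme += 1
    return extreme / repeats


iq_scores = [101, 112, 100, 107, 94, 104]
p_sim = empirical_p_value(iq_scores, 100)
print(f"simulated two-tailed p = {p_sim:.3f}")
```

The simulated value should land near the p-value reported by `stats.ttest_1samp`, with some run-to-run noise that shrinks as `repeats` grows.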


37 The t-distribution

38 The t-distribution summarized With modest sample sizes, we need to make a correction for the fact that our distribution of sample means is not actually normal. The t-distribution achieves this. It's really a family of distributions, one for each sample size. It has a lower peak and fatter tails than the normal distribution (especially so for really small sample sizes, such as 2) to capture the fact that small samples will produce extremely inaccurate estimates more often.
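The "fatter tails" can be made concrete by asking how much probability lies beyond ±1.96, the familiar 5% cutoff for the normal distribution. This is a small sketch assuming SciPy; the chosen degrees of freedom are illustrative (a sample of size 2 has 1 degree of freedom).

```python
from scipy import stats

cutoff = 1.96

# Probability of a value beyond +/-1.96 under the standard normal.
normal_tail = float(2 * stats.norm.sf(cutoff))

# The same probability under t-distributions with various degrees of freedom.
tail_by_df = {df: float(2 * stats.t.sf(cutoff, df)) for df in (1, 5, 30, 1000)}

print(f"normal: {normal_tail:.3f}")
for df, tail in tail_by_df.items():
    print(f"t, df={df:4d}: {tail:.3f}")
```

The tail probability shrinks towards the normal value as the degrees of freedom grow, which is the sense in which the t-distribution is a family indexed by sample size.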

39 Some history and a gratuitous link to beer The t-test is also known as "Student's t-test". Why?

40 Some history and a gratuitous link to beer It was devised by William Sealy Gosset, a chemist working for the Guinness brewery in Dublin. Gosset devised the t-test as a way of cheaply monitoring the quality of batches of beer by taking small samples from those batches. Gosset published the test in 1908, but was forced to use a pseudonym ("Student") by Guinness, who regarded their use of statistics as a trade secret.

41 Other kinds of t-tests The one-sample t-test is what we've covered so far. The one-sample test can be used to deal with simple experimental designs in which we measure something before and after an intervention. For example, does drug X lower blood pressure, or does diet Y lead to weight loss? For each case in the study, we subtract the "before" score from the "after" score to get a difference. We can then examine the null hypothesis that the mean of the differences is zero, i.e., that the intervention makes no difference.
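The before-and-after design can be sketched as a one-sample test on the differences. The blood pressure readings below are invented purely for illustration; the code assumes SciPy and also calls `stats.ttest_rel`, SciPy's paired test, to show that the two routes agree.

```python
from scipy import stats

# Hypothetical systolic blood pressure for six patients (made-up numbers).
before = [150, 142, 160, 155, 148, 139]
after = [144, 140, 151, 150, 149, 133]

# Subtract "before" from "after" for each case, as described above.
differences = [a - b for a, b in zip(after, before)]

# Null hypothesis: the mean of the differences is zero.
one_sample = stats.ttest_1samp(differences, 0)

# The paired t-test is the same calculation under another name.
paired = stats.ttest_rel(after, before)

print(f"one-sample on differences: t = {one_sample.statistic:.3f}, p = {one_sample.pvalue:.3f}")
print(f"paired test:               t = {paired.statistic:.3f}, p = {paired.pvalue:.3f}")
```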

42 The two-sample t-test The two-sample t-test is an extension of the same idea. It is used to test the null hypothesis that two different samples in an experiment are drawn from the same population, i.e., that they have the same mean. For example, does drug A work any better or worse than drug B in reducing blood pressure? Do men and women systematically differ on their IQ scores?

43 The two-sample t-test There are some mathematical complications based on whether or not the sample sizes are the same and whether or not we can assume equal variances across the two samples. However: in practice you're unlikely to do many two-sample t-tests. You are more likely to use an analysis of variance (ANOVA) or a regression.
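In SciPy, the equal-variance question surfaces as the `equal_var` argument of `stats.ttest_ind`. The sketch below uses invented blood pressure reductions for two drugs, with unequal group sizes, to show both variants; it is an illustration, not a recommendation of one over the other.

```python
from scipy import stats

# Hypothetical reductions in blood pressure (made-up numbers, unequal group sizes).
drug_a = [12, 9, 15, 7, 11, 10]
drug_b = [8, 6, 9, 4, 13, 5, 7, 6]

# Classic Student's two-sample test: assumes both groups share one variance.
student = stats.ttest_ind(drug_a, drug_b, equal_var=True)

# Welch's variant: does not assume equal variances.
welch = stats.ttest_ind(drug_a, drug_b, equal_var=False)

print(f"Student: t = {student.statistic:.3f}, p = {student.pvalue:.3f}")
print(f"Welch:   t = {welch.statistic:.3f}, p = {welch.pvalue:.3f}")
```

In both cases the null hypothesis is that the two samples come from populations with the same mean.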

44 Type-I and type-II errors

| | There's an effect in the real world | There is in fact no effect |
| --- | --- | --- |
| Statistical test finds something, H null rejected | A hit; you've found something | False alarm, type I error |
| Statistical test is negative, can't reject H null | Missed a real effect, type II error | Correct to remain sceptical of H alt |

45 Type-I and type-II errors The key idea in statistics is to calculate p(data | H null) and then reject H null if this p-value is very low. But what counts as "very low"? What's the right threshold for rejecting H null? There's no right answer. Thresholds of 0.05 and 0.01 have been adopted in some quarters. It really depends on what the consequences of different kinds of errors might be.

46 Type-I and type-II errors By setting your p-value threshold for rejecting H null, also known as an "alpha level", you can directly control your type-I error rate. How much does it bother you to believe that something is true when it isn't? (This is a type-I error.) If you're in the business of building aircraft navigation systems, you probably want to make sure you don't fall into this kind of error, and so you'll set your alpha level very low.

47 Type-I and type-II errors The difficulty is that by adopting a very conservative type-I error rate, you necessarily increase your type-II error rate. So you may now miss some things that are actually true, e.g., you dismiss the idea of adopting a new part that could have slightly improved the performance of your system. If you are in the venture capital business, perhaps you're OK with making lots of type-I errors (backing companies that won't do well) but want to make sure you don't miss out on the chance to buy into the next Google. So you'd use a generous alpha level (p < 0.1) to minimize your type-II error rate.
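The trade-off can be seen in a simulation. The sketch below assumes NumPy and SciPy; the function name `error_rates`, the sample size of 20, and the "real effect" of a true mean of 105 are all hypothetical choices made for illustration. It runs many one-sample t-tests in a world where the null is true and in a world where it is false, then counts each kind of error at several alpha levels.

```python
import numpy as np
from scipy import stats


def error_rates(alpha, true_mean, null_mean=100.0, sd=10.0, n=20, reps=5000, seed=0):
    """Estimate (type-I rate, type-II rate) for a one-sample t-test at a given alpha."""
    rng = np.random.default_rng(seed)

    # World 1: the null hypothesis is true; any rejection is a false alarm.
    null_samples = rng.normal(null_mean, sd, size=(reps, n))
    p_null = stats.ttest_1samp(null_samples, null_mean, axis=1).pvalue
    type_i = float(np.mean(p_null < alpha))

    # World 2: there is a real effect; any failure to reject is a miss.
    alt_samples = rng.normal(true_mean, sd, size=(reps, n))
    p_alt = stats.ttest_1samp(alt_samples, null_mean, axis=1).pvalue
    type_ii = float(np.mean(p_alt >= alpha))

    return type_i, type_ii


rates = {alpha: error_rates(alpha, true_mean=105.0) for alpha in (0.1, 0.05, 0.001)}
for alpha, (type_i, type_ii) in rates.items():
    print(f"alpha={alpha:<6} type-I rate={type_i:.3f}  type-II rate={type_ii:.3f}")
```

The type-I rate should track alpha itself, while the type-II rate climbs as alpha is made stricter. How fast it climbs depends on the assumed effect size and sample size.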

48 Additional material A great video lecture on thinking critically about p-values (Geoff Cumming, La Trobe University). An argument that simulation allows us to determine p-values empirically and that we shouldn't be obsessed with choosing the right statistical test (Allen Downey). The Python code for the graphs and simulations in this lecture.


SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

### An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

### Lecture Notes Module 1

Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

### DDBA 8438: The t Test for Independent Samples Video Podcast Transcript

DDBA 8438: The t Test for Independent Samples Video Podcast Transcript JENNIFER ANN MORROW: Welcome to The t Test for Independent Samples. My name is Dr. Jennifer Ann Morrow. In today's demonstration,

### Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus

### fifty Fathoms Statistics Demonstrations for Deeper Understanding Tim Erickson

fifty Fathoms Statistics Demonstrations for Deeper Understanding Tim Erickson Contents What Are These Demos About? How to Use These Demos If This Is Your First Time Using Fathom Tutorial: An Extended Example

### Statistiek I. t-tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. John Nerbonne 1/35

Statistiek I t-tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://wwwletrugnl/nerbonne/teach/statistiek-i/ John Nerbonne 1/35 t-tests To test an average or pair of averages when σ is known, we

### I. Basics of Hypothesis Testing

Introduction to Hypothesis Testing This deals with an issue highly similar to what we did in the previous chapter. In that chapter we used sample information to make inferences about the range of possibilities

### Lesson 9 Hypothesis Testing

Lesson 9 Hypothesis Testing Outline Logic for Hypothesis Testing Critical Value Alpha (α) -level.05 -level.01 One-Tail versus Two-Tail Tests -critical values for both alpha levels Logic for Hypothesis

### Machine Learning. Topic: Evaluating Hypotheses

Machine Learning Topic: Evaluating Hypotheses Bryan Pardo, Machine Learning: EECS 349 Fall 2011 How do you tell something is better? Assume we have an error measure. How do we tell if it measures something

### A Logic of Prediction and Evaluation

5 - Hypothesis Testing in the Linear Model Page 1 A Logic of Prediction and Evaluation 5:12 PM One goal of science: determine whether current ways of thinking about the world are adequate for predicting

### Chapter 26: Tests of Significance

Chapter 26: Tests of Significance Procedure: 1. State the null and alternative in words and in terms of a box model. 2. Find the test statistic: z = observed EV. SE 3. Calculate the P-value: The area under

### 13 Two-Sample T Tests

www.ck12.org CHAPTER 13 Two-Sample T Tests Chapter Outline 13.1 TESTING A HYPOTHESIS FOR DEPENDENT AND INDEPENDENT SAMPLES 270 www.ck12.org Chapter 13. Two-Sample T Tests 13.1 Testing a Hypothesis for

### Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

### Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

### Study Guide for the Final Exam

Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

### Sampling Distribution of the Mean & Hypothesis Testing

Sampling Distribution of the Mean & Hypothesis Testing Let s first review what we know about sampling distributions of the mean (Central Limit Theorem): 1. The mean of the sampling distribution will be

### Two-sample hypothesis testing, II 9.07 3/16/2004

Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,

### WISE Power Tutorial All Exercises

ame Date Class WISE Power Tutorial All Exercises Power: The B.E.A.. Mnemonic Four interrelated features of power can be summarized using BEA B Beta Error (Power = 1 Beta Error): Beta error (or Type II

### Let s explore SAS Proc T-Test

Let s explore SAS Proc T-Test Ana Yankovsky Research Statistical Analyst Screening Programs, AHS Ana.Yankovsky@albertahealthservices.ca Goals of the presentation: 1. Look at the structure of Proc TTEST;

### Section 12.2, Lesson 3. What Can Go Wrong in Hypothesis Testing: The Two Types of Errors and Their Probabilities

Today: Section 2.2, Lesson 3: What can go wrong with hypothesis testing Section 2.4: Hypothesis tests for difference in two proportions ANNOUNCEMENTS: No discussion today. Check your grades on eee and

### Sampling and Hypothesis Testing

Population and sample Sampling and Hypothesis Testing Allin Cottrell Population : an entire set of objects or units of observation of one sort or another. Sample : subset of a population. Parameter versus

### AMS 5 HYPOTHESIS TESTING

AMS 5 HYPOTHESIS TESTING Hypothesis Testing Was it due to chance, or something else? Decide between two hypotheses that are mutually exclusive on the basis of evidence from observations. Test of Significance

### General Procedure for Hypothesis Test. Five types of statistical analysis. 1. Formulate H 1 and H 0. General Procedure for Hypothesis Test

Five types of statistical analysis General Procedure for Hypothesis Test Descriptive Inferential Differences Associative Predictive What are the characteristics of the respondents? What are the characteristics

### Comparing Two or more than Two Groups

CRJ 716 Using Computers in Social Research Comparing Two or more than Two Groups Comparing Means, Conducting T-Tests and ANOVA Agron Kaci John Jay College Chapter 9/1: Comparing Two or more than Two Groups

### Selected Nonparametric and Parametric Statistical Tests for Two-Sample Cases 1

Selected Nonparametric and Parametric Statistical Tests for Two-Sample Cases The T-statistic is used to test differences in the means of two groups. The grouping variable is categorical and data for the

### Chi-Square Distribution. is distributed according to the chi-square distribution. This is usually written

Chi-Square Distribution If X i are k independent, normally distributed random variables with mean 0 and variance 1, then the random variable is distributed according to the chi-square distribution. This

### How to Conduct a Hypothesis Test

How to Conduct a Hypothesis Test The idea of hypothesis testing is relatively straightforward. In various studies we observe certain events. We must ask, is the event due to chance alone, or is there some

### The Basics of a Hypothesis Test

Overview The Basics of a Test Dr Tom Ilvento Department of Food and Resource Economics Alternative way to make inferences from a sample to the Population is via a Test A hypothesis test is based upon A

### Nonparametric tests, Bootstrapping

Nonparametric tests, Bootstrapping http://www.isrec.isb-sib.ch/~darlene/embnet/ Hypothesis testing review 2 competing theories regarding a population parameter: NULL hypothesis H ( straw man ) ALTERNATIVEhypothesis

### ANOVA MULTIPLE CHOICE QUESTIONS. In the following multiple-choice questions, select the best answer.

ANOVA MULTIPLE CHOICE QUESTIONS In the following multiple-choice questions, select the best answer. 1. Analysis of variance is a statistical method of comparing the of several populations. a. standard

### SPSS on two independent samples. Two sample test with proportions. Paired t-test (with more SPSS)

SPSS on two independent samples. Two sample test with proportions. Paired t-test (with more SPSS) State of the course address: The Final exam is Aug 9, 3:30pm 6:30pm in B9201 in the Burnaby Campus. (One

### COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared. jn2@ecs.soton.ac.uk

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared jn2@ecs.soton.ac.uk Relationships between variables So far we have looked at ways of characterizing the distribution

### HypoTesting. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Name: Class: Date: HypoTesting Multiple Choice Identify the choice that best completes the statement or answers the question. 1. A Type II error is committed if we make: a. a correct decision when the

### Chapter 2 Drawing Statistical Conclusions

Chapter 2 Drawing Statistical Conclusions STAT 3022 School of Statistic, University of Minnesota 2013 spring 1 / 50 Population and Parameters Suppose there s a population of interest that we wish to study: