Barry Kissane & Marian Kemp

Similar documents
Independent samples t-test. Dr. Tom Pierce Radford University

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Week 4: Standard Error and Confidence Intervals

Sample Size and Power in Clinical Trials

Unit 26 Estimation with Confidence Intervals

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Fairfield Public Schools

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Hypothesis Testing for Beginners

MEASURES OF VARIATION

Using Excel for inferential statistics

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Binomial Sampling and the Binomial Distribution

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Projects Involving Statistics (& SPSS)

Descriptive Statistics and Measurement Scales

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

Simple Regression Theory II 2010 Samuel L. Baker

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

5.1 Identifying the Target Parameter

How To Check For Differences In The One Way Anova

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

4. Continuous Random Variables, the Pareto and Normal Distributions

12: Analysis of Variance. Introduction

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

II. DISTRIBUTIONS distribution normal distribution. standard scores

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

Chapter 7 Section 7.1: Inference for the Mean of a Population

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Lecture Notes Module 1

Confidence Intervals for the Difference Between Two Means

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Two-sample inference: Continuous data

Hypothesis testing - Steps

Name: Date: Use the following to answer questions 3-4:

HYPOTHESIS TESTING WITH SPSS:

Chapter 7 Section 1 Homework Set A

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Lesson 9 Hypothesis Testing

Session 7 Bivariate Data and Analysis

Introduction to Hypothesis Testing

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Confidence Intervals for One Standard Deviation Using Standard Deviation

Means, standard deviations and. and standard errors

AN INTRODUCTION TO HYPOTHESIS-TESTING AND EXPERIMENTAL DESIGN

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

Step 6: Writing Your Hypotheses Written and Compiled by Amanda J. Rockinson-Szapkiw

Comparing Means in Two Populations

Continuing, we get (note that unlike the text suggestion, I end the final interval with 95, not 85.

CALCULATIONS & STATISTICS

2 Sample t-test (unequal sample sizes and unequal variances)

6.4 Normal Distribution

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

DATA INTERPRETATION AND STATISTICS

Lesson 4 Measures of Central Tendency

Physics Lab Report Guidelines

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

Reflections on Probability vs Nonprobability Sampling

Week 3&4: Z tables and the Sampling Distribution of X

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

Suitable for: Beginners with absolutely no previous experience. Beginners who appear particularly shy or nervous.

Motor and Household Insurance: Pricing to Maximise Profit in a Competitive Market

Unit 7: Normal Curves

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Lab 11. Simulations. The Concept

3.4 Statistical inference for 2 populations based on two samples

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Measures of Central Tendency and Variability: Summarizing your Data for Others

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

Guide to Writing a Project Report

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared.

3. Mathematical Induction

Descriptive Statistics

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

What is the Probability of Pigging Out

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes

Premaster Statistics Tutorial 4 Full solutions

Simple linear regression

Using R for Linear Regression

Types of Error in Surveys

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

of course the mean is p. That is just saying the average sample would have 82% answering

Chapter 23 Inferences About Means

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Chapter 3. Sampling. Sampling Methods

Tutorial 5: Hypothesis Testing

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Introduction to Quantitative Methods

Testing Research and Statistical Hypotheses

Understanding Options: Calls and Puts

Transcription:

Confidence intervals for a mean* As noted earlier, samples are used in practice to infer information about a population. Provided a sample is taken at random, statistics offers some powerful procedures to allow us to state results with some precision. One of these concerns the idea of a confidence interval. You will be able to use the calculator directly to construct and use these. We will illustrate the ideas and procedures with an example. Suppose a plant nursery was concerned with how high a particular bush will grow from the seedlings that it sells, so that they can advise their customers accordingly. They selected at random a sample of 20 seedlings and grew them carefully according to the directions and under controlled conditions. They measured the final heights of the bushes (in centimetres) and obtained the following results: 163, 165, 166, 170, 171, 174, 175, 175, 175, 177, 178, 180, 182, 184, 185, 188, 189, 189, 193, 201 How might this information be used to inform their customers? Enter the data into List 1 of your calculator. One possibility is to consider the mean and standard deviation of the heights, as shown below. The sample mean is 179 cm, and the histogram shows that there was a spread of bush heights. (The width of the histogram intervals is 5 cm). The standard deviation of the bush heights was about 9.69 cm. What information might these convey about the population of bush heights (grown under ideal conditions)? The population here comprises the bush heights to which all their seedlings are likely to grow, not just the 20 in their sample. It is clear that the seedlings will grow to different heights (not all the same height), so our interest is in the mean height of the bushes, which is probably described to customers as the average height. Remarkably, when samples are chosen at random, statistical theory tells us about the distribution of sample means, noted in the previous section. There is not space in this module to describe the details, but a statistician will be able to make a prediction for the population from just one sample, and also suggest how likely the prediction is to be correct! The distribution is more varied (and thus less accurate) for a smaller sample than a larger sample. But here we have only one sample mean, since the nursery has been able to obtain only one sample (of size 20). Presumably, if they were to take another sample, the results would not be the same. One way of making an inference about the population is to describe a confidence interval, suggesting that the mean of the population is likely to be within a certain interval, based on the sample information. The calculator has been programmed in advance to allow you to obtain a confidence interval, provided you know enough about the data concerned. In this case, the single best estimate for the population mean is the sample mean of 179 cm. But, when random sampling is involved, another sample of size 20 is likely to produce a different sample mean, so having only a single value of the mean is of limited use. The critical idea here is that the sample has been chosen at random and that we don't know anything about the population except the sample information. The first of the following calculator screens shows that INTR is one of the menu choices. 99

Module 8: Advanced data analysis Tap INTR (r) to select the confidence interval commands to see that there are two kinds of confidence interval available (z and t). To construct a confidence interval in this case, statistical theory tells us to use the t-distribution, since we do not know the population standard deviation. (It is very rare to know the population standard deviation, in fact, so you will almost always use t instead of z. Choose z only if you do know the population standard deviation.) The final choice involves whether you have one sample (1-S) or two samples (2-S); in this case, clearly there is only one sample, so tap 1-S (q) to see the screen below. Set up your calculator with the choices shown here. The screen records that the data are in the form of a list and they are located in List 1. The confidence level has been entered as 0.9 to give a 90% confidence interval for the mean. Then move the cursor to the Execute command and choose CALC (q) to construct the confidence interval, as shown above on the right. The 90% confidence interval is described in the screen above as (175.16, 182.84). This is suggesting that the population mean is between the two values of 175.16 cm and 182.84 cm. It is called a 90% confidence interval because statistical theory tells us that if we were to repeat this entire process many times, 90% of the intervals constructed in this way would actually include the population mean (and, of course, 10% of the intervals would not include the population mean). We do not know what the population mean is, and it may or may not be within the confidence interval (175.16, 182.84), but the theory and the calculator together have given us more information than we had from just the sample. The choice of a 90% confidence interval is quite arbitrary. You can choose any value you wish between 0% and 100%. A higher value is sensible, since of course the process is then more likely to capture the population mean, but will result in a larger interval. For comparison, a 95% confidence interval is shown below. Notice that the interval this time is (174.35, 183.65), a bit wider than before, with a process that is a bit more likely to capture the population mean. Incidentally, you might also notice that the screens above show the standard deviation calculated from the random sample as 9.9366; this is the so-called unbiased standard deviation (reported also above in the original sample statistics). It is the single best estimate of the population standard deviation. The sample standard deviation of 9.69 is sometimes called the biased standard deviation, Learning Mathematics with Graphics Calculators 100

as it is not the best available estimate of the population standard deviation, as it a bit too small. In fact, the only information needed to construct a confidence interval about a population mean in this way is the mean, (unbiased) standard deviation and size of the sample. So, if the actual data are not available, you can use the calculator to construct the confidence interval by entering these values directly. The screens below show this for the case of the 90% confidence interval here, when Variable is selected rather than List. Of course, the results for the confidence interval are the same. Hypothesis testing Another way of using a sample to make inferences about a population involves testing an hypothesi. An hypothesis is like a careful guess, that is specially made so that its likelihood of being correct can be investigated. In the case of the seedlings described above, suppose that the nursery staff had been informed by the original seed suppliers that the bushes would grow on average to 182 cm in height. The sample data can be used to test whether this seems to be a reasonable description of the seedlings. Our hypothesis in this case is that the mean height to which the bushes grow is 182 cm. Our interest is in whether or not this seems reasonable; we can test this hypothesis by taking our sample data. The hypothesis we are testing is called the null hypothesis, represented by the symbol H 0. We have seen already that the mean of a sample of data is not necessarily the same as the population mean, so we are not surprised that it is not exactly 182 cm. The statistical question is whether it is reasonable to get a sample like the one above from such a population, or, alternatively, whether the population mean seems not to be 182 cm. Our hypothesis test will result in either rejecting the null hypothesis (in favour of its alternative) or in not rejecting it (i.e., accepting that it does not seem unreasonable). Symbolically, our null hypothesis is: H 0 : µ = 182. Our alternative hypothesis is H 1 : µ 182. Again, further details of the statistical theory are omitted here, assuming you will study them elsewhere. We consider only how the calculator performs the necessary test. In the window shown below, note that one of the menu items is TEST (e) Selecting this menu reveals that several statistical tests are available, as shown. In this case, we choose the t-tests and, specifically, the one-sample t-test, similar to our choices above for confidence intervals. (The choice of t instead of z is because we do not have assumed information about the population standard deviation.) 101

Module 8: Advanced data analysis The next screen shows how the test is communicated to the calculator, using 1-S (q). The null hypothesis value for the population mean is 182 (cm) and the data are in List 1. The screen below also shows that there is are one-tailed tests available as well as the two-tailed test used here. (The details of these one-tailed tests are left to you, after you have learned the necessary theory.) When the test is executed, you will have a choice of a calculation or a drawing (or both) to show the details of the test. The drawings below give the shape of the t-distribution, in this case very similar to that of a normal distribution. Selecting P (w) shows the probability associated with the sample mean, assuming the hypothesised mean of 182 is correct. Selecting T (q) shows the associated t-values instead of the probability. Use $ and! to move from one to the other. The sampling distribution graphshows t-statistics associated with samples of size 20 (for which the number of degrees of freedom are 20 1 = 19) when the null hypothesis is true. The mean of the theoretical distribution is zero, but non-zero values are likely in practice, due to random variation. In the present case, the actual sample mean (of 179 cm) is 3 cm away from the hypothesised mean (of 182 cm); the shaded parts of the graph indicate that values this far away from the mean (in either direction) or even further can occur at random 0.19281 or 19.3% of the time. If you choose the CALC (q) option instead of DRAW (u), the t-statistics are shown. These are the same as those on the graphs, of course. It is usually a good idea to look at the graph as well as the statistics, to help interpret the results. Many people would say that something that happens around 19% of the time is not especially unusual, and is not strong enough evidence to reject the null hypothesis. If the percentage were quite small (say 5% or less), we may be more inclined to reject the null hypothesis as an unlikely description of reality and instead accept its alternative. But this is not the case here. So the result of our test is to accept the null hypothesis, µ = 182 cm (i.e., there is insufficient evidence to reject the null hypothesis). So the claim of the seed suppliers seems reasonable, given these sample data. Learning Mathematics with Graphics Calculators 102

The numerical information shown above should be interpreted in a similar way. The t-statistic of -1.3501 provides the means of conducting the test. The number p = 0.19281 expresses the probability of getting a value this far from the mean t-value (which is zero) when the null hypothesis is true. This number is sometimes referred to as a p-value. You can also evaluate the probability of t-statistics between ±1.3501 using the t-distribution directly. Return to the data screen and choose DIST (y) and then t (w). The screens below show that 80.72% of the time, the t-values with 19 degrees of freedom are between the two values, so that 19.18% of the time they are outside this range, consistent with the above calculations. It is common practice to decide in advance of looking at any data what will be regarded as too unusual to accept the null hypothesis and thus will be grounds for rejecting the null hypothesis. A common choice is p 5%. Another common choice is p 1%. When the null hypothesis is rejected on the grounds that p < 5%, we say that the result is statistically significant (or just significant) at the 5% level. One meaning of this is that there is still a 5% chance that the null hypothesis is actually true and we merely have an unusually deviant sample from the population. The hypothesis test described here has been conducted using the actual data in a list. As for confidence intervals, you can also use the calculator to conduct the test based on the summary information from the random sample. You need to know only the sample size, the mean and the unbiased standard deviation (which are commonly reported in statistical research reports, while the actual data are frequently not available). The screen at left below shows how the information is entered into the calculator, after first selecting Variable rather than List as the data source. The results are the same as before, as expected. A graphical representation is also available. *This material has been reproduced, with permission, from Module 8 (Advanced data analysis), of the publication: Kissane, B. & Kemp, M. (2014) Learning Mathematics with Graphics Calculators. Tokyo, Japan: CASIO. (pp 99-103) The entire publication can be freely downloaded from the CASIO Worldwide Education Website at: http://edu.casio.com/education/activity/ 103