STATISTICS PROJECT: Hypothesis Testing




See my comments in red. Scoring last page.

INTRODUCTION

My topic is the average tuition cost of a 4-yr. public college. Since I will soon be transferring to a 4-yr. college, I thought this topic would be perfect. "The College Board" says that the average tuition cost of college is $5836 per year. I will be researching online the costs of different public colleges to test this claim. I will be using the t-test for a mean, since my sample size is going to be less than 30 and the population standard deviation is unknown. I will also use the Chi-Square Test of Independence.

HYPOTHESIS

I think the average cost of tuition is lower than the average stated by The College Board.

H0: μ ≥ $5836
H1: μ < $5836 (claim)

DATA ANALYSIS

I collected my data from various college websites. I looked up the cost of tuition per year and the number of students enrolled. Here is what I came up with:

College                          Tuition   Number of Students
Central Washington University    $4392     10,200
University of Washington         $5985     25,469
Washington State University      $5888     18,432
Western Washington University    $4356     13,000
Evergreen State University       $4590     4,400
Eastern Washington University    $5904     10,000
Peninsula College                $3639     10,120
University of Oregon             $6174     20,394
Portland State University        $5208     24,284
Oregon State University          $5604     19,362
Southern Oregon University       $5233     5,000
Eastern Oregon University        $4500     3,000
Western Oregon University        $5763     4,500
University of Idaho              $4410     11,739
Idaho State University           $4400     13,000

There weren't really any large gaps or outliers in the data that I collected. There was a gap between 5,000 and 10,000 students, but the rest was mostly consistent. The lowest tuition was $3639 at Peninsula College and the highest was $6174 at the University of Oregon. On some of the websites it was hard to find the information I wanted, but I eventually found it. Some of the websites were specific as to undergraduate or graduate students and some probably included both. I should have done further research to make sure that my numbers only include undergraduates and not graduates. So, that is one possible mistake in the data collection.

Instructor comment: You've explained your strengths and weaknesses in collection well.

HYPOTHESIS TESTING

T-Test for a Mean

Step 1: State the hypotheses and identify the claim.
I claim that the average cost of college tuition is less than the $5836 per year reported by The College Board. At α = .025, can it be concluded that the average is less than $5836 based on a sample of 15 colleges?
H0: μ ≥ $5836
H1: μ < $5836 (claim)

Step 2: Find the critical value.
At α = .025 and d.f. = 14, the critical value for this left-tailed test is -2.145.

Step 3: Compute the sample test value.
mean = 5069.73, s = 787.80
t = (5069.73 - 5836) / (787.80 / sqrt(15)) = -3.767

Step 4: Make the decision to reject or not reject the null hypothesis.
Reject the null hypothesis, since -3.767 falls in the critical region (it is less than -2.145).

Step 5: Summarize the results.
I reject the null hypothesis: there is enough evidence to support the claim that the average cost of tuition is less than $5836 per year.
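The five t-test steps above can be re-checked with a short script. This is a minimal sketch using only the Python standard library; the variable names are illustrative, not part of the project, and the critical value -2.145 is copied from the text rather than computed.

```python
import math
import statistics

# Tuition figures for the 15 colleges, in the order listed in the data table.
tuition = [4392, 5985, 5888, 4356, 4590, 5904, 3639, 6174,
           5208, 5604, 5233, 4500, 5763, 4410, 4400]

mu0 = 5836                    # average claimed by The College Board
n = len(tuition)              # sample size (15)
mean = statistics.mean(tuition)
s = statistics.stdev(tuition) # sample standard deviation (n - 1 divisor)

# One-sample t statistic: (sample mean - hypothesized mean) / standard error
t_value = (mean - mu0) / (s / math.sqrt(n))

# Left-tailed critical value for alpha = .025, d.f. = 14, taken from the text.
critical_value = -2.145
reject_h0 = t_value < critical_value

print(f"mean = {mean:.2f}, s = {s:.2f}, t = {t_value:.3f}, reject H0: {reject_h0}")
```

Run as written, it should reproduce the mean, standard deviation, and test value reported in Step 3, and the same decision as Step 4.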

Chi-Square Independence Test

Step 1: State the hypotheses and identify the claim.
I claim that there is a relationship between the number of students at a college and the cost of tuition per year. Here is the data that I collected:

                  Number of Students
Cost of Tuition   3000-9,999   10,000-16,999   17,000-23,999   24,000-30,999   Total
$3500-4500        1            5               0               0               6
$4501-5500        2            0               0               1               3
$5501-6500        1            1               3               1               6
Total             4            6               3               2               15

At α = .025, can we conclude that the cost of tuition is dependent on the number of students?
H0: The cost of tuition is independent of the number of students that attend the college. (χ² = 0)
H1: The cost of tuition is dependent on the number of students that attend the college. (claim) (χ² > 0)

Step 2: Find the critical value.
The critical value is 14.449, since the degrees of freedom are (3-1)(4-1) = 6.

Step 3: Compute the test value.
First we have to find the expected values, E(row, column) = (row total)(column total) / (grand total):

E1,1 = (6)(4)/15 = 1.6    E2,1 = (3)(4)/15 = .8    E3,1 = (6)(4)/15 = 1.6
E1,2 = (6)(6)/15 = 2.4    E2,2 = (3)(6)/15 = 1.2   E3,2 = (6)(6)/15 = 2.4
E1,3 = (6)(3)/15 = 1.2    E2,3 = (3)(3)/15 = .6    E3,3 = (6)(3)/15 = 1.2
E1,4 = (6)(2)/15 = .8     E2,4 = (3)(2)/15 = .4    E3,4 = (6)(2)/15 = .8

Instructor comment: I did some spot checking and it looks good. I'm going to trust you on chi-square.

The completed table, with expected values in parentheses, is shown:

                  Number of Students
Cost of Tuition   3000-9,999   10,000-16,999   17,000-23,999   24,000-30,999   Total
$3500-4500        1 (1.6)      5 (2.4)         0 (1.2)         0 (.8)          6
$4501-5500        2 (.8)       0 (1.2)         0 (.6)          1 (.4)          3
$5501-6500        1 (1.6)      1 (2.4)         3 (1.2)         1 (.8)          6
Total             4            6               3               2               15
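The expected counts and the Step 3 test value can be re-derived from the observed table alone. This is a minimal sketch using only the Python standard library; the names are illustrative, and the critical value 14.449 is copied from Step 2 rather than computed.

```python
# Observed counts: rows are tuition bands, columns are enrollment bands,
# in the same order as the contingency table above.
observed = [
    [1, 5, 0, 0],   # $3500-4500
    [2, 0, 0, 1],   # $4501-5500
    [1, 1, 3, 1],   # $5501-6500
]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

# Expected count for each cell: (row total)(column total) / (grand total)
expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

# Chi-square test value: sum of (O - E)^2 / E over all cells
chi_square = sum(
    (o - e) ** 2 / e
    for obs_row, exp_row in zip(observed, expected)
    for o, e in zip(obs_row, exp_row)
)

degrees_of_freedom = (len(observed) - 1) * (len(observed[0]) - 1)
critical_value = 14.449  # alpha = .025, d.f. = 6, taken from the text
reject_h0 = chi_square > critical_value

for row in expected:
    print([round(e, 1) for e in row])
print(f"chi-square = {chi_square:.3f}, d.f. = {degrees_of_freedom}, reject H0: {reject_h0}")
```

The printed expected counts can be compared cell by cell with the values in parentheses in the completed table.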

Then the test value is

χ² = Σ (O-E)²/E
   = (1-1.6)²/1.6 + (5-2.4)²/2.4 + (0-1.2)²/1.2 + (0-.8)²/.8
   + (2-.8)²/.8 + (0-1.2)²/1.2 + (0-.6)²/.6 + (1-.4)²/.4
   + (1-1.6)²/1.6 + (1-2.4)²/2.4 + (3-1.2)²/1.2 + (1-.8)²/.8
   = 13.333

Step 4: Make the decision to reject or not to reject the null hypothesis.
Do not reject the null hypothesis, since 13.333 is less than 14.449.

Step 5: Summarize the results.
There is not enough evidence to support the claim that the cost of tuition is dependent on the number of students that attend the college.

SUMMARY

My first hypothesis, that the tuition cost of 4-year universities is less than the stated average, was supported. The College Board said that the average tuition was $5836 per year. I thought that was a little high. The average tuition of the fifteen colleges that I researched was $5069.73. Maybe if I had researched colleges all around the country instead of just our surrounding states I would have come up with different numbers. Another thing that may have caused this test to be a little off was that when I was collecting data, some of the costs of tuition may have included other fees and some may not. When I looked them up, some fees were listed separately and some were not. This could have led to a Type I error, where the null hypothesis was true and it was rejected.

Instructor comment: Good note on the possibility of a Type I error.

My second hypothesis, that the cost of tuition is dependent on the number of students that attend the college, was not supported. I thought that the fewer students attend a specific college, the cheaper tuition would be, but that wasn't the case. One main problem I can see with collecting my data is that, for the number of students, some of the college websites said "over" or "approximately". So, these weren't the exact numbers of students enrolled. Also, as stated earlier, some of the students could be undergraduates or graduates. Some of the websites didn't list them separately. Tuition is higher for graduates, so they should not have been included in this study, and it would have thrown off the number of students. So, these may have affected the outcome a little, but I don't think enough to change the conclusion.

Instructor comment: Small differences, including rounding, can really throw off your test value in this type of test.

It would have also been interesting to test whether tuition is higher in urban areas, where more people live, versus rural areas, where there are not as many people. I would be inclined to say that this is true, but it would need to be tested further to say for sure. It would also be interesting to do this same testing for private colleges to see if they have the same results. I thought it was fun to come up with our own hypotheses and try to prove ourselves right or wrong using what we have learned all quarter. It was a good test of our skills, and it gave me a better understanding of how the formulas really work rather than just doing the homework examples in the book.

Instructor comment: Excellent summary/critique and project overall. I've been teaching this class for over 8 years and your project is one of the best. Can I please use this as a sample?

GRADING RUBRIC: Statistics Project, Spring 2007

Levels: Objective Met (C, B-), Exceeded (B, B+), Outstanding (A-, A)

Hypothesis and Proposal (10 points). Score: 10
  Met: Statement of hypothesis in words or symbols, type of test you plan to use.
  Exceeded: Statement of hypothesis in words and symbols. Description of test you'll use and how you plan to implement it.
  Outstanding: Statement of hypothesis in words and symbols, including motivation. Type of hypothesis test and plans for implementation.

Written Project: Data Analysis (15 points). Score: 15
  Met: Brief description of the data set including outliers, gaps, and other observations.
  Exceeded: Description and interpretation of data set (what and why).
  Outstanding: Description and interpretation of data set. Discussion of potential problems including how data could be improved.

Written Project: Hypothesis Testing and Confidence Intervals (30 points). Score: 29
  Met: Conduct a mathematically accurate hypothesis test.
  Exceeded: Conduct two mathematically correct hypothesis tests or one test and one confidence interval.
  Outstanding: Conduct two (or more) mathematically correct hypothesis tests that allow you to draw meaningful conclusions.

Written Project: Summary and Self-Critique (25 points). Score: 25
  Met: Re-iterate outcome of hypothesis testing. Identify areas of strength and weakness.
  Exceeded: Explain outcome of hypothesis testing, discuss possibility of Type I or Type II errors. Explore areas of strength and weakness by making suggestions for improvement and proposing further research.
  Outstanding: Interpret outcome of hypothesis testing and possibility of Type I or II errors. Explain errors. Draw connections between hypothesis tests, data collection. Discuss areas of strength and weakness.

Overall Subjective Component (10 points). Score: 10
  Met: Project is mathematically and grammatically correct. Displays adequate understanding of hypothesis testing.
  Exceeded: Project is accurate, interesting, and well-presented. Displays mastery of hypothesis testing.
  Outstanding: Project is accurate, creative, and well-presented. Hypothesis testing done at mastery level with meaningful connections throughout and a thorough summary and recommendation (I was "wowed").

Draft and Edits (10 points). Score: 10
  Met: Draft in by due date = 5 points.
  Outstanding: Draft submitted and constructive review of one other classmate's done by due dates = 10 points.

Overall Score: 99

Written by Angela Redmon. Last updated 07/12/06.