Elementary Statistics Sample Exam #3



Similar documents
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Final Exam Practice Problem Answers

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

One-Way Analysis of Variance (ANOVA) Example Problem

Regression Analysis: A Complete Example

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

Part 2: Analysis of Relationship Between Two Variables

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

Chapter 23. Two Categorical Variables: The Chi-Square Test

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

2. Simple Linear Regression

STAT 350 Practice Final Exam Solution (Spring 2015)

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

2013 MBA Jump Start Program. Statistics Module Part 3

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DATA INTERPRETATION AND STATISTICS

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Additional sources Compilation of sources:

Chapter 7: Simple linear regression Learning Objectives

SPSS Guide: Regression Analysis

Analysing Questionnaires using Minitab (for SPSS queries contact -)

Is it statistically significant? The chi-square test

STATISTICS FOR PSYCHOLOGISTS

Example: Boats and Manatees

How To Run Statistical Tests in Excel

One-Way Analysis of Variance

Univariate Regression

Premaster Statistics Tutorial 4 Full solutions

Regression step-by-step using Microsoft Excel

Name: Date: Use the following to answer questions 2-3:

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Multiple Linear Regression

12: Analysis of Variance. Introduction

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Analysis of Variance. MINITAB User s Guide 2 3-1

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

CHAPTER 13. Experimental Design and Analysis of Variance

Simple Regression Theory II 2010 Samuel L. Baker

Data Analysis Tools. Tools for Summarizing Data

5. Linear Regression

2 Sample t-test (unequal sample sizes and unequal variances)

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Using Excel for inferential statistics

Exercise 1.12 (Pg )

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Factors affecting online sales

Study Guide for the Final Exam

1.5 Oneway Analysis of Variance

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Pearson's Correlation Tests

Pearson s Correlation

Recall this chart that showed how most of our course would be organized:

MTH 140 Statistics Videos

Predictor Coef StDev T P Constant X S = R-Sq = 0.0% R-Sq(adj) = 0.

Descriptive Statistics

Section 13, Part 1 ANOVA. Analysis Of Variance

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

10. Analysis of Longitudinal Studies Repeat-measures analysis

GLM I An Introduction to Generalized Linear Models

Simple Linear Regression Inference

Statistical Models in R

Introduction to General and Generalized Linear Models

Chapter 23. Inferences for Regression

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

1 Simple Linear Regression I Least Squares Estimation

PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS INTRODUCTION TO STATISTICS MATH 2050

Multiple Regression. Page 24

Math 108 Exam 3 Solutions Spring 00

Association Between Variables

General Regression Formulae ) (N-2) (1 - r 2 YX

11. Analysis of Case-control Studies Logistic Regression

An analysis method for a quantitative outcome and two categorical explanatory variables.

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

August 2012 EXAMINATIONS Solution Part I

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA

Statistical Models in R

1 Theory: The General Linear Model

This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.

RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE MATH 111H STATISTICS II HONORS

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

1.1. Simple Regression in Excel (Excel 2010).

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Two-sample hypothesis testing, II /16/2004

List of Examples. Examples 319

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

Example G Cost of construction of nuclear power plants

Fairfield Public Schools

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

3.4 Statistical inference for 2 populations based on two samples

Simple Methods and Procedures Used in Forecasting

Transcription:

Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to be valid if each of the expected cell frequencies is A. greater than 0. B. less than 5. C. between 0 and 5. D. at most 1. E. at least 5. Use the following to answer the next three questions: A physiologist is interested in determining the proportion of algae samples from a local rivulet that belong to a particular phyla, and he believes they should be uniformly distributed. A random sample of 60 alga were obtained, and each was categorized as being Rhodophyta, Chlorophyta, or Heterokontophyta. The observed counts were 25, 25, and 10, respectively. 2. The chi square statistic is A. 0. B. 7.50. C. 20. D. 150. 3. When determining the significance of the chi square statistic, the physiologist would use A. 1 degree of freedom. B. 2 degrees of freedom. C. 3 degrees of freedom. D. 4 degrees of freedom. 4. The p value of chi square goodness of fit test in this case is A. greater than 0.10. B. below 0.10 but above 0.05. C. below 0.05 but above 0.01. D. below 0.01. Page 1 of 7

5. In a χ 2 test for independence, the statistic based on a contingency table with 6 rows and 5 columns will have how many degrees of freedom? A. 30 B. 24 C. 5 D. 20 E. 25 Use the following to answer the following two questions: A fisheries biologist is interested in studying the relationship between width and weight in horseshoe crabs. She collects a random sample of such crabs and cross classifies them based on these variables as given below. Width (in cm) Weight 0 5 5 10 10 15 15 20 < 1.8kg 39 36 29 18 > 1.8kg 11 14 21 32 6. Suppose we wish to test the null hypothesis that there is no association between their width and weight. Under the null hypothesis, what is the expected number of crabs in the low-weight class and widest width class? A. 18.0 B. 25.0 C. 30.5 D. 50.0 7. Which hypotheses are being tested by the chi square test? A. The null hypothesis is that width and weight are independent, and the alternative is that they are dependent. B. The null hypothesis is that the mean number of crabs that are in the low weight-class is the same for each of the four width classes, and the alternative is that these means are different. C. The null hypothesis is that the distributions of the number of crabs that are in the low and high weight classes are the same for the four widths. The alternative says the distributions are different. D. The null hypothesis is that the distributions of the total number of crabs sampled in each of the four widths are the same. The alternative is that these distributions are different. Page 2 of 7

8. The line described by the regression equation attempts to A. pass through as many points as possible. B. pass through as few points as possible. C. minimize the number of points it touches. D. minimize the squared distance from the points. 9. The fraction of the variation in the values of a response y that is explained by the least-squares regression of y on x is A. the correlation coefficient. B. the slope of the least-squares regression line. C. the square of the correlation coefficient. D. the intercept of the least-squares regression line. 10. Given the bivariate sample, (x 1, y 1 ), (x 2, y 2 ),, (x n, y n ), suppose x = 2, ȳ = 3 s x = 1, r = 1 3, s y = 6. Which of the following is the regression line? A. y = 2x 4 B. y = x 2 C. y = 2x 1 D. y = x 6 11. The regression equation for predicting number of speeding tickets (Y ) from information about driver age (X) is Y = 0.065(X) + 5.57. How many tickets would you predict for a twenty year old? A. 6 B. 4.27 C. 5.57 D. 1 12. A clinical psychologist finds the relationship between the number of weeks spent in a therapy hospital (X = HOSPITAL) and number of seizures per week (Y = SEIZURES) is described by the following equation: Ŷ = 14.09 0.91X. This is based on a sample size of 50 patients and is associated with r = 0.93. The proportion of variance in SEIZURES accounted for by HOSPITAL (i.e., the coefficient of determination) is Page 3 of 7

A. 0.93 B. -0.93 C. 0.86 D.. -0.86 E. 14.09 13. Suppose a straight line is fit to data having response variable y and explanatory variable x. Predicting values of y for values of x outside the spread of the observed data is called A. contingency. B. extrapolation. C. causation. D. correlation. 14. Changing the units of measurements on the Y variable will affect all but which one of the following? A. The estimated intercept parameter. B. The estimated slope parameter. C. The total sum of squares for the regression. D. R squared for the regression. E. The estimated standard errors. 15. Which of the following statistical techniques is used when values of more than one variable are used to predict the value of another variable? A. Multiple regression B. ANOVA C. ANCOVA D. MANCOVA 16. A portion of an ANOVA summary table is shown below. Source Sum of Squares Degrees of Freedom Between 19 3 Within (error) 37 4 Total 56 The Mean Square Error, MSE, is A. 9.25 Page 4 of 7

B. 12.33 C. 18.50 D. 37.0 Use the following information for the next six questions How much corn should be planted per acre for a farmer to get the highest yield? Too few plants will give a low yield, while too many plants will compete with each other for moisture and nutrients, resulting in a lower yield. Four levels of planting density are to be studied: 12,000, 16,000, 20,000, and 24,000 plants per acre. The experimenters had 12 acres available for the study, and three acres were assigned at random to each of the planting densities. The data follows: Plants (per acre) Yield (bushels per acre) 12,000 150.1 113.0 118.4 16,000 166.9 120.7 135.2 20,000 165.3 130.1 139.6 24,000 134.7 138.4 156.1 Assume the data are four independent SRSs, one from each of the four populations of planting densities, and that the distribution of the yields is normal. A partial ANOVA table produced by MINITAB follows, along with the means and standard deviation of the yields for the four groups. One-way ANOVA: yield versus density Source DF SS MS F P Density 589 Error 356 Total Density N Mean StDev 12,000 3 127.17 20.04 16,000 3 140.93 23.63 20,000 3 145.00 18.21 24,000 3 143.07 11.44 17. The degrees of freedom for density (group) are A. 2. B. 3. Page 5 of 7

C. 8. D. 11. 18. The degrees of freedom for error are A. 2. B. 3. C. 8. D. 11. 19. The null hypothesis for the ANOVA is that the population mean yield A. is the same for all four planting densities. B. is increasing as the planting density gets larger. C. is decreasing as the planting density gets larger. D. first increases and then decreases as the planting density gets larger. 20. The sum of squares for error is A. 196. B. 356. C. 589. D. 2848. 21. The pooled standard deviation is A. 11.44. B. 18.87. C. 23.63. D. 22.48. 22. The value of the F statistic is A. 0.55. B. 4.73 C. 1.82. D. 4.83. 23. A researcher is performing a two way ANOVA using two factors. The first factor has 6 levels and the second factor has 9 levels. In the ANOVA Summary Table, the degrees of freedom for the interaction between the first and second factors will be A. 40 Page 6 of 7

B. 45 C. 48 D. 54 Page 7 of 7