Practice Final Exam Multiple-Choice and True-False Questions

Similar documents
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Final Exam Practice Problem Answers

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

SPSS Guide: Regression Analysis

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

University of Chicago Graduate School of Business. Business 41000: Business Statistics

Fairfield Public Schools

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Simple Regression Theory II 2010 Samuel L. Baker

Chapter 7: Simple linear regression Learning Objectives

Regression Analysis: A Complete Example

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Study Guide for the Final Exam

Name: Date: Use the following to answer questions 3-4:

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Multiple Linear Regression

2013 MBA Jump Start Program. Statistics Module Part 3

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Solución del Examen Tipo: 1

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

STAT 350 Practice Final Exam Solution (Spring 2015)

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

ch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Chapter 5 Analysis of variance SPSS Analysis of variance

Example: Boats and Manatees

2. Simple Linear Regression

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

Hypothesis testing - Steps

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Estimation of σ 2, the variance of ɛ

17. SIMPLE LINEAR REGRESSION II

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

11. Analysis of Case-control Studies Logistic Regression

One-Way Analysis of Variance (ANOVA) Example Problem

Chapter 23. Inferences for Regression

Linear Models in STATA and ANOVA

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Math 108 Exam 3 Solutions Spring 00

Statistical Functions in Excel

table to see that the probability is (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: = 1.

Introduction to Quantitative Methods

Introduction to Linear Regression

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Violent crime total. Problem Set 1

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

Coefficient of Determination

International Statistical Institute, 56th Session, 2007: Phil Everson

Section 13, Part 1 ANOVA. Analysis Of Variance

Statistics 151 Practice Midterm 1 Mike Kowalski

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Introduction to Regression and Data Analysis

Statistical tests for SPSS

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Rockefeller College University at Albany

Factors affecting online sales

Using Excel for Statistical Analysis

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Lets suppose we rolled a six-sided die 150 times and recorded the number of times each outcome (1-6) occured. The data is

2. What is the general linear model to be used to model linear trend? (Write out the model) = or

August 2012 EXAMINATIONS Solution Part I

12: Analysis of Variance. Introduction

The correlation coefficient

Using R for Linear Regression

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Part 2: Analysis of Relationship Between Two Variables

2. Linear regression with multiple regressors

Simple Linear Regression Inference

Pearson's Correlation Tests

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Descriptive Statistics

Regression step-by-step using Microsoft Excel

4. Continuous Random Variables, the Pareto and Normal Distributions

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Week 4: Standard Error and Confidence Intervals

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Statistics 2014 Scoring Guidelines

Correlation and Simple Linear Regression

Point Biserial Correlation Tests

Simple linear regression

5. Linear Regression

1.5 Oneway Analysis of Variance

Population Mean (Known Variance)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

1 Simple Linear Regression I Least Squares Estimation

Transcription:

Practice Final Exam Multiple-Choice and True-False Questions 1) We are using a regression model to make a height prediction for a child of a specified age. If we wish to generate an interval for our regression line that covers our prediction of the height of our next pick of a child with 95% certainty, we would use a: a) confidence interval b) prediction interval c) neither d) either Answer: b 2) T or F: For a given sample, a 50% confidence interval for the mean is narrower than a 90% confidence interval for the mean. Answer: T 3) T or F: In the context of linear regression, R 2 tells us the proportion of total variability that can be explained by our model. Answer: T 4) T or F: The p-value is the probability of rejecting your null hypothesis. Answer: F 5) T or F: If we randomly select 5 dorms in which to set up security cameras, and we see that crime has decreased in all 5 dorms, we can conclude that the security cameras caused the decrease in crime. Answer: F 6) Consider our studies of confidence intervals. Fill in the blanks with random or fixed. The parameter is _Fixed. The statistic is Random. The interval is _Random. 7) Which of the following would be expected to result in a larger standard error of the mean? (A) a larger sample size (B) a smaller sample size (C) a smaller population standard deviation (D) a larger population standard deviation (E) Choices (B) and (D) Answer: (E) Choices (B) and (D) (a smaller sample size; a larger population standard deviation)

Given the formula for the standard error of the mean, increasing the numerator or decreasing the denominator will both result in a larger standard error. A more variable population will result in more variable sample means, and a smaller sample size will also result in more variable sample means, in both cases resulting in a larger sample error. 8) A nutritionist has conducted a multiple linear regression predicting the number of calories in breakfast cereals based on the amount of fat, sugar, and fiber in grams. Unfortunately her printer is broken and some of the R output has been blocked out. Which of the following is a correct statement the nutritionist can conclude based on the visible R output (and without looking at any tables, etc.)? (A) The coefficient of determination is approximately 19.8 (B) The overall F-test is not statistically significant at the α = 0.10 level (C) The adjusted R-squared is between 0.71 and 1.00 (D) The coefficient for the intercept has a p-value greater than 0.05 (E) The coefficient for the variable Fiber has a p-value less than 0.10 Answer: (E) The coefficient for the variable Fiber has a p-value less than 0.10 We can conclude this by noting that the t value for the Fiber variable is 6.030/1.992 = 3.03, which gives us a p-value well below 0.10. The other responses are based on misinterpretations of the output. 9) Which of the following is/are true about the p value? (Circle all that apply)

A. Indicates the probability of seeing the observed result, and results more extreme, by chance alone (given that the null hypothesis is true). B. Indicates the probability that the null hypothesis is true. C. Rules out the role of bias and/or confounding D. Indicates that the results observed are of medical or public health significance Answer: A 10) If you observe a significant association in a study, which of the following is the least likely alternative explanation of the association? (choose only one) A. Bias B. Confounding C. Lack of power D. A and B E. B and C Answer: C) 11) In a hypothesis testing about a population mean, the p value is found to be 0.04. Which of the following is/are true about the population mean? Assume that the population mean given the null hypothesis is µ o. Circle all that apply. A. The 95% confidence interval includes the µ o B. The 99% confidence interval includes the µ o C. The 90% confidence interval includes the µ o D. All of the above are true E. None of the above is true. Answer: B 12) Which of the following is/are the assumptions of linear models: A. The response variable is normally distributed B. The residuals are normally distributed C. All the observed units are independent from each other. D. The relationship between the response variable and the predictors are linear E. All of the above F. None of the above Answer: B, C, D

13) The confidence interval at the 95% level of confidence for the true population proportion was reported to be (0.750, 0.950). Which of the following is a possible 90% confidence interval from the same sample? a) (0.766, 0.934) b) (0.777, 0.900) c) (0.731, 0.969) d) (0.050, 0.250) Answer: a). Since we are decreasing the amount of confidence, the size of the interval must also decrease. Answer A is the only option available that is smaller than the reported interval. 14) Which of the following statements about the Central Limit Theorem (CLT) is correct? a) The CLT states that the sample mean x is always equal to the population mean, m. b) The CLT states that the sampling distribution of the sample mean x is approximately normal for large sample sizes ( n > 30 ). c) The CLT states that the sample mean x is equal to the population mean m, provided that n > 30. d) The CLT states that the sampling distribution of the population mean m is approximately normal, provided that n > 30 Answer: b) 15) You have measured the systolic blood pressure of a random sample of 30 employees of a company. A 95% confidence interval for the mean systolic blood pressure for the employees is computed to be (122, 138). Which of the following statements gives a valid interpretation of this interval? Answer: d) a) 95% of the sample of employees has a systolic blood pressure between 122 and 138. b) 95 % of the employees in the company have a systolic blood pressure between 122 and 138. c) If the sampling procedure were repeated 100 times, then approximately 95 of the sample means would be between 122 and 138. d) If the sampling procedure were repeated 100 times, then approximately 95 of the resulting 100 confidence intervals would contain the true mean systolic blood pressure for all employees of the company. e) We are 95% confident the sample mean is between 122 and 138.

1.64). 1.96). 16) Sixty-five percent of all divorce cases cite incompatibility as the underlying reason. If four couples file for a divorce, what is the probability that no couples will state incompatibility as the reason? a) 0.015 b) 0.05 c) 0.18 d) 0.31 e) 0.35 Answer: a). P(None incompatible) =(1-.65)^4 = 0.015 17) A house cleaning service claims that it can clean a four-bedroom house in less than 2 hours. A sample of n = 36 houses is taken and the sample mean is found to be 1.97 hours and the sample standard deviation is found to be 0.1 hours. Using a 0.05 level of significance the correct conclusion is: a) reject the null because the test statistic (-1.8) is < the critical value (-1.64). b) do not reject the null because the test statistic (-1.8) is < the critical value (- c) reject the null because the test statistic (-1.8) is > the critical value (-1.96). d) do not reject the null because the test statistic (-1.8) is > the critical value (- Answer: a). This is a one- sided test, so the critical value is - 1.64, instead of - 1.96 18) A hypothesis test is done in which the alternative hypothesis is that more than 10% of a population is left-handed. The p-value for the test is calculated to be 0.25. Which statement is correct? a) We can conclude that more than 10% of the population is left-handed. b) We can conclude that more than 25% of the population is left-handed. c) We can conclude that exactly 25% of the population is left-handed. d) We cannot conclude that more than 10% of the population is left-handed. Answer: d) Since the p- value is large, we cannot reject the null hypothesis. 19) As the degrees of freedom for the t distribution increase, the distribution approaches a) value of zero for the mean. b) The t distribution c) The normal distribution. d) The binomial distribution.

20) Which statement is NOT true about hypothesis tests? a) Hypothesis tests are only valid when the sample is representative of the population for the question of interest. b) Hypotheses are statements about the population represented by the samples. c) Hypotheses are statements about the sample (or samples) from the population. d) Conclusions are statements about the population represented by the samples. 21) In regression analysis, if the coefficient of determination (R 2 ) is 1.0, then: a. SSE (error sum of squares) must be 1.0 b. SSR (regression sum of squares) must be 1.0 c. SSE must be 0.0 d. SSR must be 0.0 22) What do residuals represent in the simple linear regression model? a) The difference between the actual Y values and the mean of Y. b) The difference between the actual Y values and the predicted Y values. c) The square root of the slope. d) The predicted value of Y for the average X value e) None of the above. Answer: b) 23) The probability that a region prone to hurricanes will be hit by a hurricane in any single year is 0.1. What is the expected number of hurricanes to hit the area in the next 90 years? a) 9 b) 3 c) 8.1 d) 2.85 e) None of the above Answer: a)

24) For which of the following hypotheses tests above would the p-value be the same whether the sample mean is 44 or 46? a) I. b) I. and IV. c) II. and III. d) IV. Answer: I 25) We are told that a 95% prediction interval for a response variable, y, is (23.2, 35.6) from a simple regression on a sample of n = 100 observations at x* = 10. Which of the following is a reasonable estimate for the confidence interval for µ y at x* = 10? a) (13.2, 45.6) b) (24.2, 27.8) c) (28.6, 30.2) d) (33.2, 45.6)