Regression. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.
|
|
|
- Amelia Johnson
- 9 years ago
- Views:
Transcription
1 Class: Date: Regression Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Given the least squares regression line y8 = 5 2x: a. the relationship between x and y is positive. b. the relationship between x and y is negative. c. as x decreases, so does y. 2. A regression analysis between sales (in $1,000) and advertising (in $1,000) resulted in the following least squares line: y8 = x. This implies that: a. as advertising increases by $1,000, sales increases by $5,000. b. as advertising increases by $1,000, sales increases by $80,000. c. as advertising increases by $5, sales increases by $ Which of the following techniques is used to predict the value of one variable on the basis of other variables? a. Correlation analysis b. Coefficient of correlation c. Covariance d. Regression analysis 4. The residual is defined as the difference between: a. the actual value of y and the estimated value of y b. the actual value of x and the estimated value of x c. the actual value of y and the estimated value of x d. the actual value of x and the estimated value of y 5. Testing whether the slope of the population regression line could be zero is equivalent to testing whether the: a. sample coefficient of correlation could be zero b. standard error of estimate could be zero c. population coefficient of correlation could be zero d. sum of squares for error could be zero 6. A regression line using 25 observations produced SSR = and SSE = The standard error of estimate was: a b c If all the points in a scatter diagram lie on the least squares regression line, then the coefficient of correlation must be: a. 1.0 b. 1.0 c. either 1.0 or 1.0 d
2 8. If the coefficient of correlation is 0.60, then the coefficient of determination is: a b c d The standard error of estimate s ε is given by: a. SSE / (n 2) b. SSE / (n 2) c. SSE / (n 2) d. SSE / n If the standard error of estimate s ε = 20 and n = 10, then the sum of squares for error, SSE, is: a. 400 b. 3,200 c. 4,000 d. 40, In regression analysis, the coefficient of determination R 2 measures the amount of variation in y that is: a. caused by the variation in x. b. explained by the variation in x. c. unexplained by the variation in x. 12. In a regression problem, if the coefficient of determination is 0.95, this means that: a. 95% of the y values are positive. b. 95% of the variation in y can be explained by the variation in x. c. 95% of the y values are predicted correctly by the model. 13. The sample correlation coefficient between x and y is It has been found out that the p-value is when testing H 0 : ρ = 0 against the two-sided alternative H 1 : ρ 0. To test H 0 : ρ = 0 against the one-sided alternative H 1 : ρ > 0 at a significant level of 0.193, the p-value will be equal to a b c d In simple linear regression, which of the following statements indicates there is no linear relationship between the variables x and y? a. Coefficient of determination is 1.0. b. Coefficient of correlation is 0.0. c. Sum of squares for error is
3 15. In simple linear regression, the coefficient of correlation r and the least squares estimate b 1 of the population slope β 1 : a. must be equal. b. must have the same sign. c. are not related. 16. If the coefficient of correlation is 0.90, then the percentage of the variation in the dependent variable y that is explained by the variation in the independent variable x is: a. 90% b. 81% c. 95% 17. The width of the confidence interval estimate for the predicted value of y depends on a. the standard error of the estimate b. the value of x for which the prediction is being made c. the sample size d. All of these choices are true. 18. In a multiple regression model, the mean of the probability distribution of the error variable ε is assumed to be: a. 1.0 b. 0.0 c. k, where k is the number of independent variables included in the model. 19. In a multiple regression analysis involving 6 independent variables, the total variation in y is 900 and SSR = 600. What is the value of SSE? a. 300 b c For a multiple regression model the following statistics are given: Total variation in y = 250, SSE = 50, k = 4, and n = 20. Then, the coefficient of determination adjusted for the degrees of freedom is: a b c d A multiple regression model has the form: y8 = x 1 + 6x 2. As x 2 increases by one unit, holding x 1 constant, then the value of y will increase by: a. 2 units b units c. 6 units on average d. None of these choices 3
4 22. If all the points for a multiple regression model with two independent variables were right on the regression plane, then the coefficient of determination would equal: a. 0. b. 1. c. 2, since there are two independent variables. 23. For a multiple regression model, the total variation in y can be expressed as: a. SSR + SSE. b. SSR SSE. c. SSE SSR. d. SSR / SSE. 24. A multiple regression equation includes 5 independent variables, and the coefficient of determination is The percentage of the variation in y that is explained by the regression equation is: a. 81% b. 90% c. 86% d. about 16% 25. In a multiple regression model, the value of the coefficient of determination has to fall between a. 1 and +1. b. 0 and +1. c. 1 and 0. True/False Indicate whether the statement is true or false. 26. In a simple linear regression problem, the least squares line is y8 = x, and the coefficient of determination is The coefficient of correlation must be A zero correlation coefficient between a pair of random variables means that there is no linear relationship between the random variables. 28. In reference to the equation y8 = x x 2, the value 0.80 is the y-intercept. 29. In multiple regression, the standard error of estimate is defined by s ε = sample size and k is the number of independent variables. SSE / (n k), where n is the 30. One of the consequences of multicollinearity in multiple regression is biased estimates on the slope coefficients. 31. Multicollinearity is present when there is a high degree of correlation between the independent variables included in the regression model. 4
5 Short Answer Car Speed and Gas Mileage An economist wanted to analyze the relationship between the speed of a car (x) and its gas mileage (y). As an experiment a car is operated at several different speeds and for each speed the gas mileage is measured. These data are shown below. Speed Gas Mileage {Car Speed and Gas Mileage Narrative} Estimate the gas mileage of a car traveling 70 mph. 33. A scatter diagram includes the following data points: x y Two regression models are proposed: (1) y8 = x, and (2) y8 = 4.0x. Using the least squares method, which of these regression models provides the better fit to the data? Why? Sunshine and Skin Cancer A medical statistician wanted to examine the relationship between the amount of sunshine (x) in hours, and incidence of skin cancer (y). As an experiment he found the number of skin cancer cases detected per 100,000 of population and the average daily sunshine in eight counties around the country. These data are shown below. Average Daily Sunshine Skin Cancer per 100, {Sunshine and Skin Cancer Narrative} Draw a scatter diagram of the data and plot the least squares regression line on it. 35. {Sunshine and Skin Cancer Narrative} Estimate the number of skin cancer cases per 100,000 people who live in a state that gets 6 hours of sunshine on average. 36. {Sunshine and Skin Cancer Narrative} Can we conclude at the 1% significance level that there is a linear relationship between sunshine and skin cancer? 5
6 Game Winnings & Education An ardent fan of television game shows has observed that, in general, the more educated the contestant, the less money he or she wins. To test her belief she gathers data about the last eight winners of her favorite game show. She records their winnings in dollars and the number of years of education. The results are as follows. Contestant Years of Education Winnings {Game Winnings & Education Narrative} Draw a scatter diagram of the data. Comment on whether it appears that a linear model might be appropriate. 38. {Game Winnings & Education Narrative} Interpret the value of the slope of the regression line. 39. {Game Winnings & Education Narrative} Conduct a test of the population coefficient of correlation to determine at the 5% significance level whether a negative linear relationship exists between years of education and TV game shows' winnings. 6
7 Oil Quality and Price Quality of oil is measured in API gravity degrees--the higher the degrees API, the higher the quality. The table shown below is produced by an expert in the field who believes that there is a relationship between quality and price per barrel. Oil degrees API Price per barrel (in $) A partial Minitab output follows: Descriptive Statistics Variable N Mean StDev SE Mean Degrees Price Covariances Degrees Price Degrees Price Regression Analysis Predictor Coef StDev T P Constant Degrees S = R-Sq = 92.46% R-Sq(adj) = 91.7% Analysis of Variance Source DF SS MS F P Regression Residual Error Total {Oil Quality and Price Narrative} Draw a scatter diagram of the data. Comment on whether it appears that a linear model might be appropriate to describe the relationship between the quality of oil and price per barrel. 7
8 41. The following 10 observations of variables x and y were collected. x y a. Calculate the standard error of estimate. b. Test to determine if there is enough evidence at the 5% significance level to indicate that x and y are negatively linearly related. c. Calculate the coefficient of correlation, and describe what this statistic tells you about the regression line. Sales and Experience The general manager of a chain of furniture stores believes that experience is the most important factor in determining the level of success of a salesperson. To examine this belief she records last month's sales (in $1,000s) and the years of experience of 10 randomly selected salespeople. These data are listed below. Salesperson Years of Experience Sales (Sales and Experience Narrative} Determine the coefficient of determination and discuss what its value tells you about the two variables. 43. {Sales and Experience Narrative} Estimate with 95% confidence the average monthly sales of all salespersons with 10 years of experience. 44. {Sales and Experience Narrative} Compute the standardized residuals. 8
9 Life Expectancy An actuary wanted to develop a model to predict how long individuals will live. After consulting a number of physicians, she collected the age at death (y), the average number of hours of exercise per week (x 1 ), the cholesterol level (x 2 ), and the number of points that the individual's blood pressure exceeded the recommended value (x 3 ). A random sample of 40 individuals was selected. The computer output of the multiple regression model is shown below. THE REGRESSION EQUATION IS y = x x x 3 Predictor Coef StDev T Constant x x x S = 9.47 R Sq = 22.5% ANALYSIS OF VARIANCE Source of Variation df SS MS F Regression Error Total {Life Expectancy Narrative} Is there sufficient evidence at the 5% significance level to infer that the number of points that the individual's blood pressure exceeded the recommended value and the age at death are negatively linearly related? 9
10 Regression Answer Section MULTIPLE CHOICE 1. ANS: B PTS: 1 REF: SECTION ANS: A PTS: 1 REF: SECTION ANS: D PTS: 1 REF: SECTION ANS: A PTS: 1 REF: SECTION ANS: C PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: C PTS: 1 REF: SECTION ANS: C PTS: 1 REF: SECTION ANS: C PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: A PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: D PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: A PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: C PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION ANS: A PTS: 1 REF: SECTION ANS: A PTS: 1 REF: SECTION ANS: B PTS: 1 REF: SECTION TRUE/FALSE 26. ANS: F PTS: 1 REF: SECTION ANS: T PTS: 1 REF: SECTION ANS: T PTS: 1 REF: SECTION ANS: F PTS: 1 REF: SECTION ANS: F PTS: 1 REF: SECTION ANS: T PTS: 1 REF: SECTION
11 SHORT ANSWER 32. ANS: When x = 70, y8 = mpg. PTS: 1 REF: SECTION ANS: SSE = 4.95 and for models 1 and 2, respectively. Therefore, model (1) fits the data better than model (2). PTS: 1 REF: SECTION ANS: PTS: 1 REF: SECTION ANS: When x = 6, y8 = PTS: 1 REF: SECTION ANS: H 0 : ρ = 0 vs. H 1 : ρ 0 Rejection region: t > t 0.005,6 = Test statistic: t = Conclusion: Reject the null hypothesis. We conclude at the 1% significance level that there is a linear relationship between sunshine and skin cancer, according to this data. The relationship is positive, indicating that more sunshine is associated with more skin cancer cases. PTS: 1 REF: SECTION
12 37. ANS: It appears that a linear model is appropriate; further analysis is needed. PTS: 1 REF: SECTION ANS: For each additional year of education a contestant has, his or her winnings on TV game shows decreases by an average of approximately $ PTS: 1 REF: SECTION ANS: H 0 : ρ = 0 vs. H 1 : ρ < 0 Rejection region: t < t 0.05,6 = Test statistic: t = Conclusion: Reject the null hypothesis. A negative linear relationship exists between years of education and TV game shows' winnings, according to this data. PTS: 1 REF: SECTION
13 40. ANS: A linear model might be appropriate to describe the relationship between the quality of oil and price per barrel. PTS: 1 REF: SECTION ANS: a. s ε = b. H 0 : β 1 = 0 vs. H 0 : β 1 < 0 Rejection region: t < t 0.05,8 = 1.86 Test statistic: t = Conclusion: Reject the null hypothesis. There is enough evidence at the 5% significance level to indicate that x and y have a negative linear relationship, according to this data. As speed increases, gas mileage decreases. c. r = This indicates a very strong negative linear relationship between the two variables. PTS: 1 REF: SECTION ANS: R 2 = , which means that 95.36% of the variation in sales is explained by the variation in years of experience of the salesperson. PTS: 1 REF: SECTION ANS: ± Thus LCL = (in $1000s), and UCL = (in $1000s). PTS: 1 REF: SECTION ANS: 1.100, 1.210, 0.373, 2.108, 0.483, 0.026, 1.086, 0.538, 0.178, and PTS: 1 REF: SECTION
14 45. ANS: H 0 : β 3 = 0 vs. H 1 : β 3 < 0 Rejection region: t < t 0.05, Test statistic: t = Conclusion: Don't reject the null hypothesis. No, sufficient evidence at the 5% significance level to infer that the number of points that the individual's blood pressure exceeded the recommended value and the age at death are negatively linearly related. PTS: 1 REF: SECTION
Regression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ
STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
17. SIMPLE LINEAR REGRESSION II
17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.
CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression
Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the
Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
Part 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
Multiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
STAT 350 Practice Final Exam Solution (Spring 2015)
PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects
Hypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
Module 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
August 2012 EXAMINATIONS Solution Part I
August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,
2. What is the general linear model to be used to model linear trend? (Write out the model) = + + + or
Simple and Multiple Regression Analysis Example: Explore the relationships among Month, Adv.$ and Sales $: 1. Prepare a scatter plot of these data. The scatter plots for Adv.$ versus Sales, and Month versus
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
Final Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
ch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
ch12 practice test 1) The null hypothesis that x and y are is H0: = 0. 1) 2) When a two-sided significance test about a population slope has a P-value below 0.05, the 95% confidence interval for A) does
TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics
UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science & Statistics BA (Mod) Enter Course Title Trinity Term 2013 Junior/Senior Sophister ST7002
Section 1: Simple Linear Regression
Section 1: Simple Linear Regression Carlos M. Carvalho The University of Texas McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
Univariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
Chapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
Example: Boats and Manatees
Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
Regression step-by-step using Microsoft Excel
Step 1: Regression step-by-step using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Chapter 23. Inferences for Regression
Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily
Solution Let us regress percentage of games versus total payroll.
Assignment 3, MATH 2560, Due November 16th Question 1: all graphs and calculations have to be done using the computer The following table gives the 1999 payroll (rounded to the nearest million dolars)
Premaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
International Statistical Institute, 56th Session, 2007: Phil Everson
Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: [email protected] 1. Introduction
SPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
Simple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480
1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500
Causal Forecasting Models
CTL.SC1x -Supply Chain & Logistics Fundamentals Causal Forecasting Models MIT Center for Transportation & Logistics Causal Models Used when demand is correlated with some known and measurable environmental
ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
1 Simple Linear Regression I Least Squares Estimation
Simple Linear Regression I Least Squares Estimation Textbook Sections: 8. 8.3 Previously, we have worked with a random variable x that comes from a population that is normally distributed with mean µ and
Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.
Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
1.1. Simple Regression in Excel (Excel 2010).
.. Simple Regression in Excel (Excel 200). To get the Data Analysis tool, first click on File > Options > Add-Ins > Go > Select Data Analysis Toolpack & Toolpack VBA. Data Analysis is now available under
5. Linear Regression
5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4
Review Jeopardy. Blue vs. Orange. Review Jeopardy
Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?
X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
Estimation of σ 2, the variance of ɛ
Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
10. Analysis of Longitudinal Studies Repeat-measures analysis
Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.
SIMON FRASER UNIVERSITY
SIMON FRASER UNIVERSITY BUEC 333: Statistics for Business and Economics. MIDTERM EXAM: PART I Instructor: Alex Jameson Appiah February. 27, 1996. Time: 50 mins. Name: ------------------------------------------------------
c 2015, Jeffrey S. Simonoff 1
Modeling Lowe s sales Forecasting sales is obviously of crucial importance to businesses. Revenue streams are random, of course, but in some industries general economic factors would be expected to have
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
4. Multiple Regression in Practice
30 Multiple Regression in Practice 4. Multiple Regression in Practice The preceding chapters have helped define the broad principles on which regression analysis is based. What features one should look
Elementary Statistics Sample Exam #3
Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected]
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected] Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
Interaction between quantitative predictors
Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Point Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.
General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n
SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.
SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation
Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5
Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression We are often interested in studying the relationship among variables to determine whether they are associated with one another. When we think that changes in a
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
General Regression Formulae ) (N-2) (1 - r 2 YX
General Regression Formulae Single Predictor Standardized Parameter Model: Z Yi = β Z Xi + ε i Single Predictor Standardized Statistical Model: Z Yi = β Z Xi Estimate of Beta (Beta-hat: β = r YX (1 Standard
DATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
Mind on Statistics. Chapter 13
Mind on Statistics Chapter 13 Sections 13.1-13.2 1. Which statement is not true about hypothesis tests? A. Hypothesis tests are only valid when the sample is representative of the population for the question
Section 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
Using R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
Simple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
Section 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
2 Sample t-test (unequal sample sizes and unequal variances)
Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing
Factors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
Hypothesis Testing --- One Mean
Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis
CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there
CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there is a relationship between variables, To find out the
Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management
KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To
Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb 2 2015
Stat 412/512 CASE INFLUENCE STATISTICS Feb 2 2015 Charlotte Wickham stat512.cwick.co.nz Regression in your field See website. You may complete this assignment in pairs. Find a journal article in your field
Confidence Intervals for Spearman s Rank Correlation
Chapter 808 Confidence Intervals for Spearman s Rank Correlation Introduction This routine calculates the sample size needed to obtain a specified width of Spearman s rank correlation coefficient confidence
Econometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
Simple Linear Regression, Scatterplots, and Bivariate Correlation
1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.
Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.
SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. [email protected] Course Objective This course is designed
CORRELATION ANALYSIS
CORRELATION ANALYSIS Learning Objectives Understand how correlation can be used to demonstrate a relationship between two factors. Know how to perform a correlation analysis and calculate the coefficient
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly
Chapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
Confidence Intervals for Cp
Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process
Moderation. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups
One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Ch. Correlation and Regression. Correlation Interpret Scatter Plots and Correlations MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
Week 5: Multiple Linear Regression
BUS41100 Applied Regression Analysis Week 5: Multiple Linear Regression Parameter estimation and inference, forecasting, diagnostics, dummy variables Robert B. Gramacy The University of Chicago Booth School
POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.
Polynomial Regression POLYNOMIAL AND MULTIPLE REGRESSION Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. It is a form of linear regression
Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )
Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples
MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal
MGT 267 PROJECT Forecasting the United States Retail Sales of the Pharmacies and Drug Stores Done by: Shunwei Wang & Mohammad Zainal Dec. 2002 The retail sale (Million) ABSTRACT The present study aims
Correlational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
