Stat 503. Solutions to Homework #11 (115 points) (11) (19)

Save this PDF as:

Size: px
Start display at page:

Transcription

1 Leucine (ng) Stat 503 Solutions to Homework #11 (115 points) Problem 1.3 (p. 536) In a study of protein synthesis in the oocyte (developing egg cell) of the frog Xenopus laevis, a biologist injected individual oocytes with radioactively labelled leucine. At various times after injection, he made radioactivity measurements and calculated how much of the leucine had been incorporated into protein. The results are given in the accompanying table; each leucine value is the content of labelled leucine in two oocytes. All oocytes were from the same female. (a) Use linear regression to estimate the rate of incorporation of the labelled leucine. The rate of incorporation is the slope of the regression line. b 1 = SP XY / SS X = 81.9/800 = (b) Plot the data and draw the regression line on your graph. b 0 = y b 1 x = 0.83 (0.093)(30) = Time (min) Leucine (ng) X Y mean SS SP XY 81.9 SS(resid) (11) (3) (6) (c) Calculate the residual standard deviation. Leucine Incorporation 1.5 y = 0.093x Time (min) s Y X = SS( resid ) n Problem 1.5 (p. 537) Twenty plots, each 10x4 metres, were randomly chosen in a large field of corn. For each plot, the plant density (number of plants in the plot) and the mean cob weight (gm of grain per cob) were observed. The results are given in the table. (19)

2 COB WEIGHT Y PLANT DENSITY COB WEIGHT PLANT DENSITY COB WEIGHT X Y X Y Preliminary calculations yield the following results: x = y = 4.1 SS X = 0,09.0 SS Y = 11,831.8 SP XY = -14,563.1 SS(resid) = (a) Calculate the linear regression of Y on X. b 1 = SP XY / SS X = /009.0 = b 0 = y - b 1 x = (-0.706)(18.05) = The equation of the linear regression line is Y = X Regression Plot y = x PLANT DENSITY X (b) Plot the data and draw the regression line on your graph. (c) Interpret the value of the slope of the regression line, b 1, in the context of this setting. On average, a one-cob increase in plant density will result in a reduction of 0.7 g in the average corn yield for each cob in that plot. (d) Calculate s Y and s Y X and specify the units of each. s Y = SS Y = 4.95 gm n 1 19 SS( resid ) s Y X = = 8.6 gm n 18 The units of both s Y and s Y X are the same as the units for Y, namely grams. (6)

3 Fungus Growth (mm) (e) Interpret the value of s Y X in the context of this setting. The standard deviation of the points from the line is 8.6. Approximately 68% of observations for Y will be within 8.6 of the value predicted by the line, and approximately 95% of observations for Y will be within 17.4 of the value predicted by the line. Problem 1.6 (p. 538) Laetisaric acid is a compound that holds promise for control of fungus diseases in crop plants. The accompanying data show the results of growing the fungus Pythium ultimum in various concentrations of laetisaric acid. Each growth value is the average of four radial measurements of a P. ultimum colony grown in a petri dish for 4 hours; there were two petri dishes at each concentration. Laetisaric Acid Fungus Growth Laetisaric Acid Fungus Growth Concentration Concentration X(g/ml) Y(mm) X(g/ml) Y(mm) mean SS X = 1,303 SS Y = SP XY = SS(resid) = (a) Calculate the linear regression of Y on X. b 1 = SP XY / SS X = / 1303 = 0.71 b 0 = y b 1 x = 3.64 ( 0.71)(11.5) = The linear regression is Y = X. (b) Plot the data and draw the regression line on your graph. (14) Problem y = -0.71x R = Laetisaric Acid Concentration (mcg/ml)

4 COB WEIGHT Y (c) Calculate s Y X. What are the units of s Y X? s Y X = SS( resid ) = 1.95 mm. n 1 (d) Draw a ruler line on your graph to show the magnitude of s Y X. (See Figure 1.7.) (see above graph) (3) Problem 1.11b) (p. 541) For the corn yield data of Exercise 1.5, prepare a plot like figure 1.8, showing the data, the fitted regression line, and two lines whose vertical distance above and below the regression line is s Y X. What percentage of the data points are within s Y X of the regression line? (7) [It's okay to put the lines on the same graph as that for 1.5] Regression Plot PLANT DENSITY X Of the 0 data points, 13 are within 8.6 of the straight line, and 7 are outside that range. Therefore 13/0 = 65% are within s Y X of the regression line.

5 PLANT COB is resid >8.6? DENSITY WEIGHT X Y ŷ residual 0 = no, 1 = yes Problem 1.14 (p. 547) Refer to the cob weight data of Exercise 1.5. Assume that the linear model holds. (a) Estimate the mean cob weight to be expected in a plot containing (i) 100 plants; (ii) 10 plants. Use the linear regression equation to find the predicted Y for those values of X. (i) With x = 100, ŷ = = 44 g (ii) With x = 10, ŷ = = 30 g (b) Assume that each plant produces one cob. How much grain would we expect to get from a plot containing (i) 100 plants? (ii) 10 plants? The total amount of corn from a plot is the mean weight per cob times the number of cobs. (i) 44 g 100 = 4,400 g = 4.4 kg of corn (ii) 30 g 10 = 7,600 g = 7.6 kg of corn (8) Problem 1.1 (p. 55) Refer to the cob weight data of Exercise 1.5. Construct a 95% confidence interval for 1. b 1 = syx 8.6 SE b1 = SS 009 = X df = 0 - = 18 ; t.05 = % CI for 1 is (.101)(0.0606) = = [-0.848, ] We are 95% confident that < 1 <

6 Problem 1. (p. 55) Refer to the fungus growth data of Exercise 1.6. (a) Calculate the standard error of the slope, b 1. SS X = 1303 and s Y X = 1.9 from Problem 1.5 above syx 1.95 SE b1 = SS 1303 = X (b) Consider the null hypothesis that laetisaric acid has no effect on growth of the fungus. Assuming that the linear model is applicable, formulate this as a hypothesis about the true regression line, and test the hypothesis against the alternative that laetisaric acid inhibits growth of the fungus. Let = Does laetisaric acid inhibit fungus growth? Let 1 be the slope of the true regression line. H 0 : 1 = 0 ; laetisaric acid has no effect on growth of the fungus H A : 1 < 0 ; laetisaric acid inhibits growth of the fungus Use a directional t-test. t s = b 1 / SE b1 has a t-distribution with 10 df under H 0. With = 0.05, critical value is Will reject H 0 if t s < b 1 = 0.71 (from 1.6 earlier) t s = 0.71 / = < 1.81 so reject H 0. This study provides evidence at the 0.05 significance level that laetisaric acid inhibits growth of the fungus. (16) (14) Problem 1.8 (p. 564) Proceed as in Exercise 1.7, but use the cob weight data of Exercise 1.5 (a) Calculate the correlation coefficient. SPXY r = = 0.94 SS SS X Y (b) Calculate s Y and s Y X ; specify the units for each. Verify the approximate relationship between s Y, s Y X, and r. s Y = 4.95 g (from last HW) s Y X = 8.6 g (from last HW) s Y X / s Y = 4.95/8.6 = ( ) r = The approximate relationship is verified since (c) Calculate the regression line of Y on X. The regression line is Y = X from last HW. (13) (d) Construct a scatterplot of the data and draw the regression line on your graph. Place ruler lines on the scatterplot to show the magnitudes of s Y and s Y X. (See Figure 1.16.)

7 Regression Plot y = x C 60 O 50 B 40 W 30 EI 0 G 10 HT 00 Y s Y s Y X PLANT DENSITY X Problem 1.30 (p. 564) In a study of,669 adult men, the correlation between age and systolic blood pressure was found to be r = The SD of systolic blood pressures among all,669 men was 19.5 mm Hg. Assuming that the linear model is applicable, estimate the SD of systolic blood pressures among men 50 years old. Find s Y X. Since s Y X / s Y 1 r, therefore s Y X = s Y 1 r = = The SD of systolic blood pressures among 50-year-old men is approximately 17.6 mm Hg. Problem 1.3 (p. 564) A veterinary anatomist measured the density of nerve cells at specified sites in the intestine of nine horses. Each density value is the average of counts of nerve cells in five equal sections of tissue. The results are given in the accompanying table for site I (midregion of jejunum) and site II (mesenteric region of jejunum). (These data were also given in Exercise 9.8). (a) Calculate the correlation coefficient between the densities at the two sites. SPXY r = SS SS = X Y (b) Construct a scatterplot of the data. Animal Site 1 Site mean SS SP XY (8) (3)

8 Site Nerve Cell Density Site 1 (c) Four potential sources of variation in these data are: (i) errors in counting the nerve cells, (ii) sampling error due to choosing certain slices for counting, (iii) variation from one horse to another, and (iv) variation from site to site within a horse. Which of these sources of variation would tend to produce positive correlation between the sites? [Hint: Ask yourself how each source contributes to the appearance of the scatterplot.] Source (iii) would tend to produce positive correlation between the sites. Variation in nerve cell density from one horse to another would tend to separate points for different horses but would presumably affect each horse in the same way at both points, that is, move X and Y together, creating positive correlation. Problem 1.33 Refer to Exercise 1.3. Test the hypothesis that the true (population) correlation coefficient is zero against the alternative that it is positive. Let =0.05. Are nerve cell densities between the two sites correlated? Let be the true population correlation coefficient. H 0 : = 0 ; nerve cell densities at the two sites are uncorrelated H A : > 0 ; nerve cell densities at the two sites are positively correlated n Use a directional t-test. t s = r has a t-distribution with 7 df under H 0. 1 r With = 0.05, the critical value is H 0 will be rejected if t s > r = 0.81 (from previous question) 9 t s = 0.81 = Since > 1.895, H 0 is rejected. This study provides evidence at the 0.05 significance level that nerve cell densities are correlated between the two sites. (3) (10)

HW 9 SOLUTIONS Regression and Correlation

HW 9 SOLUTIONS Regression and Correlation 1. 12.41. The three residual plots, (i), (ii), and (iii), were generated after fitting regression lines to the three scatterplots, (a), (b), and (c). Which residual

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

A correlation exists between two variables when one of them is related to the other in some way.

Lecture #10 Chapter 10 Correlation and Regression The main focus of this chapter is to form inferences based on sample data that come in pairs. Given such paired sample data, we want to determine whether

Simple Linear Regression

Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression Statistical model for linear regression Estimating

Chapter 11: Two Variable Regression Analysis

Department of Mathematics Izmir University of Economics Week 14-15 2014-2015 In this chapter, we will focus on linear models and extend our analysis to relationships between variables, the definitions

p1^ = 0.18 p2^ = 0.12 A) 0.150 B) 0.387 C) 0.300 D) 0.188 3) n 1 = 570 n 2 = 1992 x 1 = 143 x 2 = 550 A) 0.270 B) 0.541 C) 0.520 D) 0.

Practice for chapter 9 and 10 Disclaimer: the actual exam does not mirror this. This is meant for practicing questions only. The actual exam in not multiple choice. Find the number of successes x suggested

Chapter 9. Section Correlation

Chapter 9 Section 9.1 - Correlation Objectives: Introduce linear correlation, independent and dependent variables, and the types of correlation Find a correlation coefficient Test a population correlation

Using Minitab for Regression Analysis: An extended example

Using Minitab for Regression Analysis: An extended example The following example uses data from another text on fertilizer application and crop yield, and is intended to show how Minitab can be used to

Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

AP Statistics Semester Exam Review Chapters 1-3

AP Statistics Semester Exam Review Chapters 1-3 1. Here are the IQ test scores of 10 randomly chosen fifth-grade students: 145 139 126 122 125 130 96 110 118 118 To make a stemplot of these scores, you

Simple Linear Regression in SPSS STAT 314

Simple Linear Regression in SPSS STAT 314 1. Ten Corvettes between 1 and 6 years old were randomly selected from last year s sales records in Virginia Beach, Virginia. The following data were obtained,

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Practice for Chapter 9 and 10 The acutal exam differs. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Find the number of successes x suggested by the

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

Multiple Regression in SPSS STAT 314

Multiple Regression in SPSS STAT 314 I. The accompanying data is on y = profit margin of savings and loan companies in a given year, x 1 = net revenues in that year, and x 2 = number of savings and loan

ACTM State Exam-Statistics

ACTM State Exam-Statistics For the 25 multiple-choice questions, make your answer choice and record it on the answer sheet provided. Once you have completed that section of the test, proceed to the tie-breaker

E205 Final: Version B

Name: Class: Date: E205 Final: Version B Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The owner of a local nightclub has recently surveyed a random

SELF-TEST: SIMPLE REGRESSION

ECO 22000 McRAE SELF-TEST: SIMPLE REGRESSION Note: Those questions indicated with an (N) are unlikely to appear in this form on an in-class examination, but you should be able to describe the procedures

Math 62 Statistics Sample Exam Questions

Math 62 Statistics Sample Exam Questions 1. (10) Explain the difference between the distribution of a population and the sampling distribution of a statistic, such as the mean, of a sample randomly selected

DATA INTERPRETATION AND STATISTICS

PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

Lesson Lesson Outline Outline

Lesson 15 Linear Regression Lesson 15 Outline Review correlation analysis Dependent and Independent variables Least Squares Regression line Calculating l the slope Calculating the Intercept Residuals and

Module 5 Hypotheses Tests: Comparing Two Groups

Module 5 Hypotheses Tests: Comparing Two Groups Objective: In medical research, we often compare the outcomes between two groups of patients, namely exposed and unexposed groups. At the completion of this

e = random error, assumed to be normally distributed with mean 0 and standard deviation σ

1 Linear Regression 1.1 Simple Linear Regression Model The linear regression model is applied if we want to model a numeric response variable and its dependency on at least one numeric factor variable.

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

Simple Linear Regression Inference

Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

In Chapter 2, we used linear regression to describe linear relationships. The setting for this is a

Math 143 Inference on Regression 1 Review of Linear Regression In Chapter 2, we used linear regression to describe linear relationships. The setting for this is a bivariate data set (i.e., a list of cases/subjects

The scatterplot indicates a positive linear relationship between waist size and body fat percentage:

STAT E-150 Statistical Methods Multiple Regression Three percent of a man's body is essential fat, which is necessary for a healthy body. However, too much body fat can be dangerous. For men between the

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

where b is the slope of the line and a is the intercept i.e. where the line cuts the y axis.

Least Squares Introduction We have mentioned that one should not always conclude that because two variables are correlated that one variable is causing the other to behave a certain way. However, sometimes

Hypothesis testing - Steps

Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

SOCI Homework 6 Key

SOCI 252-002 Homework 6 Key Professor François Nielsen Chapter 27 2. (pg. 702 drug use) a) The percentage of 9th graders in these countries who have used other drugs is estimated to have increased 0.615%

Construct a scatterplot for the given data. 2) x Answer:

Review for Test 5 STA 2023 spr 2014 Name Given the linear correlation coefficient r and the sample size n, determine the critical values of r and use your finding to state whether or not the given r represents

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

Inferential Statistics

Inferential Statistics Sampling and the normal distribution Z-scores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are

Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

Statistics for Management II-STAT 362-Final Review

Statistics for Management II-STAT 362-Final Review Multiple Choice Identify the letter of the choice that best completes the statement or answers the question. 1. The ability of an interval estimate to

Using Excel for inferential statistics

FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

Simple Regression and Correlation

Simple Regression and Correlation Today, we are going to discuss a powerful statistical technique for examining whether or not two variables are related. Specifically, we are going to talk about the ideas

Paired Differences and Regression

Paired Differences and Regression Students sometimes have difficulty distinguishing between paired data and independent samples when comparing two means. One can return to this topic after covering simple

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

12/31/2016. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Understand linear regression with a single predictor Understand how we assess the fit of a regression model Total Sum of Squares

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

Regression. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Class: Date: Regression Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Given the least squares regression line y8 = 5 2x: a. the relationship between

Factors affecting online sales

Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

Residuals. Residuals = ª Department of ISM, University of Alabama, ST 260, M23 Residuals & Minitab. ^ e i = y i - y i

A continuation of regression analysis Lesson Objectives Continue to build on regression analysis. Learn how residual plots help identify problems with the analysis. M23-1 M23-2 Example 1: continued Case

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

UCLA STAT 13 Statistical Methods - Final Exam Review Solutions Chapter 7 Sampling Distributions of Estimates

UCLA STAT 13 Statistical Methods - Final Exam Review Solutions Chapter 7 Sampling Distributions of Estimates 1. (a) (i) µ µ (ii) σ σ n is exactly Normally distributed. (c) (i) is approximately Normally

2011 # AP Exam Solutions 2011 # # # #1 5/16/2011

2011 AP Exam Solutions 1. A professional sports team evaluates potential players for a certain position based on two main characteristics, speed and strength. (a) Speed is measured by the time required

Regression in SPSS. Workshop offered by the Mississippi Center for Supercomputing Research and the UM Office of Information Technology

Regression in SPSS Workshop offered by the Mississippi Center for Supercomputing Research and the UM Office of Information Technology John P. Bentley Department of Pharmacy Administration University of

SIMPLE REGRESSION ANALYSIS

SIMPLE REGRESSION ANALYSIS Introduction. Regression analysis is used when two or more variables are thought to be systematically connected by a linear relationship. In simple regression, we have only two

Regression and Correlation

Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis

Econ 371 Problem Set #3 Answer Sheet

Econ 371 Problem Set #3 Answer Sheet 4.1 In this question, you are told that a OLS regression analysis of third grade test scores as a function of class size yields the following estimated model. T estscore

College of the Canyons Math 140 Exam 1 Amy Morrow. Name:

Name: Answer the following questions NEATLY. Show all necessary work directly on the exam. Scratch paper will be discarded unread. One point each part unless otherwise marked. 1. Owners of an exercise

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

BIOSTATISTICS QUIZ ANSWERS 1. When you read scientific literature, do you know whether the statistical tests that were used were appropriate and why they were used? a. Always b. Mostly c. Rarely d. Never

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

Multiple Regression Analysis in Minitab 1

Multiple Regression Analysis in Minitab 1 Suppose we are interested in how the exercise and body mass index affect the blood pressure. A random sample of 10 males 50 years of age is selected and their

ch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

ch12 practice test 1) The null hypothesis that x and y are is H0: = 0. 1) 2) When a two-sided significance test about a population slope has a P-value below 0.05, the 95% confidence interval for A) does

AMS7: WEEK 8. CLASS 1. Correlation Monday May 18th, 2015

AMS7: WEEK 8. CLASS 1 Correlation Monday May 18th, 2015 Type of Data and objectives of the analysis Paired sample data (Bivariate data) Determine whether there is an association between two variables This

Sydney Roberts Predicting Age Group Swimmers 50 Freestyle Time 1. 1. Introduction p. 2. 2. Statistical Methods Used p. 5. 3. 10 and under Males p.

Sydney Roberts Predicting Age Group Swimmers 50 Freestyle Time 1 Table of Contents 1. Introduction p. 2 2. Statistical Methods Used p. 5 3. 10 and under Males p. 8 4. 11 and up Males p. 10 5. 10 and under

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship

Regression III: Dummy Variable Regression

Regression III: Dummy Variable Regression Tom Ilvento FREC 408 Linear Regression Assumptions about the error term Mean of Probability Distribution of the Error term is zero Probability Distribution of

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

Regression in ANOVA. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Regression in ANOVA James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 30 Regression in ANOVA 1 Introduction 2 Basic Linear

The. The test is a statistical test to compare observed results with theoretical expected results. The calculation generates a

The 2 Test Use this test when: The measurements relate to the number of individuals in particular categories; The observed number can be compared with an expected number which is calculated from a theory.

Prob & Stats. Chapter 9 Review

Chapter 9 Review Construct the indicated confidence interval for the difference between the two population means. Assume that the two samples are independent simple random samples selected from normally

Exercise 1.12 (Pg. 22-23)

Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

2. What is the general linear model to be used to model linear trend? (Write out the model) = + + + or

Simple and Multiple Regression Analysis Example: Explore the relationships among Month, Adv.\$ and Sales \$: 1. Prepare a scatter plot of these data. The scatter plots for Adv.\$ versus Sales, and Month versus

Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb 2 2015

Stat 412/512 CASE INFLUENCE STATISTICS Feb 2 2015 Charlotte Wickham stat512.cwick.co.nz Regression in your field See website. You may complete this assignment in pairs. Find a journal article in your field

Chapter 11: Linear Regression - Inference in Regression Analysis - Part 2

Chapter 11: Linear Regression - Inference in Regression Analysis - Part 2 Note: Whether we calculate confidence intervals or perform hypothesis tests we need the distribution of the statistic we will use.

The importance of graphing the data: Anscombe s regression examples

The importance of graphing the data: Anscombe s regression examples Bruce Weaver Northern Health Research Conference Nipissing University, North Bay May 30-31, 2008 B. Weaver, NHRC 2008 1 The Objective

Statistics 112 Regression Cheatsheet Section 1B - Ryan Rosario

Statistics 112 Regression Cheatsheet Section 1B - Ryan Rosario I have found that the best way to practice regression is by brute force That is, given nothing but a dataset and your mind, compute everything

Example: Boats and Manatees

Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

A. Karpinski

Chapter 3 Multiple Linear Regression Page 1. Overview of multiple regression 3-2 2. Considering relationships among variables 3-3 3. Extending the simple regression model to multiple predictors 3-4 4.

Pearson's Correlation Tests

Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation

Simple Linear Regression One Binary Categorical Independent Variable

Simple Linear Regression Does sex influence mean GCSE score? In order to answer the question posed above, we want to run a linear regression of sgcseptsnew against sgender, which is a binary categorical

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure? Harvey Motulsky hmotulsky@graphpad.com This is the first case in what I expect will be a series of case studies. While I mention

table to see that the probability is 0.8413. (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: 60 38 = 1.

Review Problems for Exam 3 Math 1040 1 1. Find the probability that a standard normal random variable is less than 2.37. Looking up 2.37 on the normal table, we see that the probability is 0.9911. 2. Find

, then the form of the model is given by: which comprises a deterministic component involving the three regression coefficients (

Multiple regression Introduction Multiple regression is a logical extension of the principles of simple linear regression to situations in which there are several predictor variables. For instance if we

Unit 26: Small Sample Inference for One Mean

Unit 26: Small Sample Inference for One Mean Prerequisites Students need the background on confidence intervals and significance tests covered in Units 24 and 25. Additional Topic Coverage Additional coverage

Bivariate Analysis. Correlation. Correlation. Pearson's Correlation Coefficient. Variable 1. Variable 2

Bivariate Analysis Variable 2 LEVELS >2 LEVELS COTIUOUS Correlation Used when you measure two continuous variables. Variable 2 2 LEVELS X 2 >2 LEVELS X 2 COTIUOUS t-test X 2 X 2 AOVA (F-test) t-test AOVA

Questions and Answers on Hypothesis Testing and Confidence Intervals

Questions and Answers on Hypothesis Testing and Confidence Intervals L. Magee Fall, 2008 1. Using 25 observations and 5 regressors, including the constant term, a researcher estimates a linear regression

2 Sample t-test (unequal sample sizes and unequal variances)

Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing

1 Confidence intervals

Math 143 Inference for Means 1 Statistical inference is inferring information about the distribution of a population from information about a sample. We re generally talking about one of two things: 1.

Paired 2 Sample t-test

Variations of the t-test: Paired 2 Sample 1 Paired 2 Sample t-test Suppose we are interested in the effect of different sampling strategies on the quality of data we recover from archaeological field surveys.

9.1 (a) The standard deviation of the four sample differences is given as.68. The standard error is SE (ȳ1 - ȳ 2 ) = SE d - = s d n d

CHAPTER 9 Comparison of Paired Samples 9.1 (a) The standard deviation of the four sample differences is given as.68. The standard error is SE (ȳ1 - ȳ 2 ) = SE d - = s d n d =.68 4 =.34. (b) H 0 : The mean

Homework 11. Part 1. Name: Score: / null

Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is

Hypothesis Testing. April 21, 2009

Hypothesis Testing April 21, 2009 Your Claim is Just a Hypothesis I ve never made a mistake. Once I thought I did, but I was wrong. Your Claim is Just a Hypothesis Confidence intervals quantify how sure

Statistiek II. John Nerbonne. March 24, 2010. Information Science, Groningen Slides improved a lot by Harmut Fitz, Groningen!

Information Science, Groningen j.nerbonne@rug.nl Slides improved a lot by Harmut Fitz, Groningen! March 24, 2010 Correlation and regression We often wish to compare two different variables Examples: compare

Multiple Linear Regression

Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

Calculate Confidence Intervals Using the TI Graphing Calculator

Calculate Confidence Intervals Using the TI Graphing Calculator Confidence Interval for Population Proportion p Confidence Interval for Population μ (σ is known 1 Select: STAT / TESTS / 1-PropZInt x: number

Sample Exam #1 Elementary Statistics

Sample Exam #1 Elementary Statistics Instructions. No books, notes, or calculators are allowed. 1. Some variables that were recorded while studying diets of sharks are given below. Which of the variables

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

Simple Linear Regression Chapter 11

Simple Linear Regression Chapter 11 Rationale Frequently decision-making situations require modeling of relationships among business variables. For instance, the amount of sale of a product may be related