STAT 155 Introductory Statistics. Lecture 10: Cautions about Regression and Correlation, Causation
- Marshall Green
1 The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 10: Cautions about Regression and Correlation, Causation 10/03/06 Lecture 10 1
2 Review
- Least-squares regression lines: equation and interpretation of the line; prediction using the line
- Correlation and regression
- Coefficient of determination
3 Regression Diagnostics
- Look at the residuals (errors). A residual is the difference between an observed value of the response variable and the value predicted by the regression line, i.e., residual = y - ŷ.
- The sum of the least-squares residuals is always zero. Why?
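The zero-sum property can be checked numerically. The sketch below fits a least-squares line with the usual formulas and sums the residuals; the data values and the helper name `fit_line` are made up for illustration and do not come from the lecture. The property follows from the intercept formula a = ȳ - b·x̄, which forces the fitted line through the point (x̄, ȳ).

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar  # line passes through (x_bar, y_bar)
    return intercept, slope

# Made-up data for illustration only (not from the lecture).
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

a, b = fit_line(xs, ys)
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
print(sum(residuals))  # zero up to floating-point rounding
```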
4 Residual Plots
- A residual plot is a scatterplot of the regression residuals against the explanatory variable.
- Residual plots help us assess the fit of a regression line.
5 Age vs. Height
6 Residual Plot
- If the regression line captures the overall pattern of the data, the residuals should show no pattern; they should look totally random.
7 Residual plot examples: nonlinear pattern; nonconstant variation
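A small sketch of the nonlinear case, assuming an invented curved data set (y = x squared) rather than any data from the slides: fitting a straight line leaves residuals that are positive at both ends and negative in the middle, which is the kind of systematic pattern a residual plot exposes. The helper `fit_line` is a hypothetical name.

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

# Invented curved relationship: a straight line is the wrong model here.
xs = [0, 1, 2, 3, 4, 5, 6]
ys = [x * x for x in xs]

a, b = fit_line(xs, ys)
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]

# Residuals listed in order of x: positive, then negative, then positive again.
for x, r in zip(xs, residuals):
    print(x, round(r, 2))
```

The residuals still sum to zero, so the sum alone says nothing about fit; the ordering by x is what reveals the curve.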
8 Diabetes Patients: FPG vs. HbA
- FPG: fasting plasma glucose.
- HbA: percent of red blood cells that have a glucose molecule attached.
- Both measure blood glucose, so we expect a positive association.
- 18 subjects, r = [value missing]. See the scatterplot on the next page.
9 Diabetes Patients: FPG vs. HbA
10 Outliers and Influential Observations
- An outlier is a point that lies outside the overall pattern of the other points.
- Outliers in the y direction have large residuals, but other outliers may not.
- An influential observation is a point whose removal would markedly change the regression line.
- Outliers in the x direction are often influential points, but not always.
11 Diabetes Patients: FPG vs. HbA
12 Outliers and Influential Observations
- Outliers in the y direction can be spotted from the residual plot.
- Influential points can be identified by fitting regression lines with and without those points. They are more serious and cannot be identified from the residual plot, although the scatterplot gives some hint.
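The with/without comparison can be sketched as follows. The five base points, the far-out point at x = 15, and the helper `fit_line` are all invented for illustration. The far-out point drags the slope toward itself, yet its own residual is smaller than those of some ordinary points, which is why a residual plot alone can miss it.

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

# Invented data: five points on a clear upward trend...
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [1.1, 1.9, 3.2, 3.8, 5.0]
# ...plus one hypothetical outlier in the x direction.
far_x, far_y = 15.0, 3.0

a0, b0 = fit_line(xs, ys)                       # without the point
a1, b1 = fit_line(xs + [far_x], ys + [far_y])   # with the point

residuals = [y - (a1 + b1 * x) for x, y in zip(xs + [far_x], ys + [far_y])]

print(f"slope without the point: {b0:.2f}")
print(f"slope with the point:    {b1:.2f}")
print("residuals with the point:", [round(r, 2) for r in residuals])
```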
13 Cautions about Correlation and Regression
- Linear only
- DO NOT extrapolate
- Not resistant
- Beware lurking variables
- Beware correlations based on averaged data
- The restricted-range problem
14 Lurking Variable
- A lurking (hidden) variable is a variable that has an important effect on the relationship among the variables in a study, but is not included among the variables being studied.
- Example: SAT scores and college grades. Lurking variable: IQ.
15 Lurking variables can create nonsense correlations.
- For the world's nations, let x be the number of TVs per person and y be the average life expectancy.
- There is a high positive correlation: nations with more TV sets have higher life expectancies.
- Could we lengthen the lives of people in Rwanda by shipping them more TVs?
- Lurking variable: wealth of the nation. Rich nations have more TV sets. Rich nations also have longer life expectancies because of better nutrition, clean water, and better health care.
- There is no cause-and-effect tie between TV sets and length of life. Association vs. causation.
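A simulated sketch of the TV example, with every number invented: a hypothetical `wealth` variable drives both `tvs` and `life`, which have no direct link to each other. The two are strongly correlated overall, but once the wealth component is subtracted out, what remains is essentially uncorrelated. The helper `pearson_r` is an illustrative name, not a library function.

```python
import random

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

random.seed(1)  # fixed seed so the run is repeatable

# Simulated nations: wealth is the lurking variable (arbitrary units).
wealth = [random.uniform(0, 10) for _ in range(500)]
tvs = [w + random.gauss(0, 1) for w in wealth]    # depends on wealth only
life = [w + random.gauss(0, 1) for w in wealth]   # depends on wealth only

r_raw = pearson_r(tvs, life)

# Remove the wealth component (known here because the data are simulated).
tvs_net = [t - w for t, w in zip(tvs, wealth)]
life_net = [l - w for l, w in zip(life, wealth)]
r_net = pearson_r(tvs_net, life_net)

print(f"r(tvs, life)              = {r_raw:.2f}")
print(f"r after removing wealth   = {r_net:.2f}")
```

With real data the lurking variable's effect is not known exactly, so it would have to be estimated or controlled by design rather than subtracted directly.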
16 Misleading correlation (two clusters)
17 Beware correlations based on averaged data
- A correlation based on averages over many individuals is usually higher than the correlation between the same variables based on data for individuals.
- Examples: age vs. height; (basketball) score % vs. practice time.
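The effect of averaging can be illustrated with simulated data; the group structure, slope, and noise level below are assumptions chosen only to make the point visible. Individuals scatter widely around the trend, but averaging within each group cancels most of that scatter, so the group means line up far more tightly than the individuals do.

```python
import random

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

random.seed(0)  # fixed seed so the run is repeatable

x_all, y_all = [], []        # one entry per individual
x_means, y_means = [], []    # one entry per group

# Ten hypothetical groups (e.g. ages 1..10), 50 individuals each.
for group_x in range(1, 11):
    ys = [5.0 * group_x + random.gauss(0, 15) for _ in range(50)]
    x_all.extend([group_x] * 50)
    y_all.extend(ys)
    x_means.append(group_x)
    y_means.append(sum(ys) / len(ys))

r_individuals = pearson_r(x_all, y_all)
r_averages = pearson_r(x_means, y_means)

print(f"r for individuals:  {r_individuals:.2f}")
print(f"r for group means:  {r_averages:.2f}")
```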
18 The restricted-range problem
- A restricted-range problem occurs when one does not get to observe the full range of the variables.
- When data suffer from restricted range, r and r² are lower than they would be if the full range could be observed.
- Example: SAT scores vs. college GPA, Princeton vs. Generic State College (Ex 2.22).
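A simulated sketch of the restricted-range effect, using invented numbers for a hypothetical admission score and a later outcome: the same underlying relationship yields a much lower r when only the top slice of scores is observed, because the spread in x shrinks while the noise in y does not.

```python
import random

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

random.seed(2)  # fixed seed so the run is repeatable

# Hypothetical scores over the full range, with a noisy linear outcome.
scores = [random.uniform(400, 1600) for _ in range(2000)]
outcomes = [0.002 * s + random.gauss(0, 0.5) for s in scores]

r_full = pearson_r(scores, outcomes)

# Keep only the high scorers, as a selective school would.
kept = [(s, o) for s, o in zip(scores, outcomes) if s > 1400]
r_restricted = pearson_r([s for s, _ in kept], [o for _, o in kept])

print(f"r over the full range:      {r_full:.2f}")
print(f"r for scores above 1400:    {r_restricted:.2f}")
```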
19 Causation vs. Association
- Some studies want to establish the existence of causation.
- Examples of causation: increased drinking of alcohol causes a decrease in coordination; smoking and lung cancer.
- Examples of association: the above two examples; SAT scores and freshman-year GPA.
20 Association does not imply causation.
- An association between two variables x and y can reflect many types of relationship among x, y, and one or more lurking variables.
- An association between a predictor x and a response y, even if it is very strong, is not by itself good evidence that changes in x actually cause changes in y.
21 Explaining Association
22 Explaining Association: Causation
- Cause-and-effect
- Examples: amount of fertilizer and yield of corn; weight of a car and its MPG; dosage of a drug and the survival rate of the mice.
23 Explaining Association: Common Response
- Lurking variables: both x and y change in response to changes in z, the lurking variable.
- There may be no direct causal link between x and y.
- Examples: SAT scores vs. college GPA (IQ, attitude); monthly flow of money into stock mutual funds vs. rate of return for the stock market (market condition, investor attitude).
24 Explaining Association: Confounding
- Two variables are confounded when their effects on a response variable are mixed together.
- One explanatory variable may be confounded with other explanatory variables or lurking variables.
- Examples: more education leads to higher income (confounded with family background); religious people live longer (confounded with lifestyle).
25 Establishing causation
- The only compelling method: a designed experiment (more in Chapter 3).
- Hot disputes: Does gun control reduce violent crime? Does meat consumption in your diet cause heart disease? Does smoking cause lung cancer?
26 Does smoking CAUSE lung cancer?
- Causation: smoking causes lung cancer.
- Common response: people who have a genetic predisposition to lung cancer also have a genetic predisposition to smoking.
- Confounding: people who drink too much, don't exercise, eat unhealthy foods, etc. are more likely to get lung cancer as a result of their lifestyle. Such people may be more likely to be smokers as well.
27 Some guidelines when a designed experiment is impossible:
- The association is strong.
- The association is consistent across various studies.
- Higher doses are associated with stronger responses.
- The cause precedes the effect in time.
- The cause is plausible.
28 Take Home Message
- Residual plots
- Outliers and influential observations
- Lurking variables
- Cautions about correlation and regression
- Explaining associations: causation, common response, confounding
- How to establish causation?