A Cohort Study of Traffic-related Air Pollution and Mortality in Toronto, Canada: Online Appendix

Size: px
Start display at page:

Download "A Cohort Study of Traffic-related Air Pollution and Mortality in Toronto, Canada: Online Appendix"

Transcription

1 A Cohort Study of Traffic-related Air Pollution and Mortality in Toronto, Canada: Online Appendix Michael Jerrett, 1 Murray M. Finkelstein, 2 Jeff R. Brook, 3 M. Altaf Arain, 4 Palvos Kanaroglou, 4 Dave M. Stieb, 5 Nicholas L. Gilbert, 5 Dave Verma, 6 Norm Finkelstein, 4 Kenneth R. Chapman, 7 Malcolm R. Sears 8 1 Division of Environmental Health Sciences, School of Public Health, University of California, Berkeley, Berkeley, California, United States 2 Department of Family and Community Medicine, Mount Sinai Hospital,University of Toronto, Toronto, Ontario, Canada 3 Meteorological Services, Environment Canada 4 School of Geography and Earth Science, McMaster University, Hamilton, Ontario, Canada 5 Air Health Effects Section, Health Canada 6 Occupational Medicine Program, McMaster University, Hamilton, Ontario, Canada 7 Division of Respirology, Department of Medicine, University of Toronto, Toronto, Ontario, Canada 8 Division of Respirology, Department of Medicine, McMaster University, Hamilton, Ontario, Canada Corresponding Author: Michael Jerrett University of California, Berkeley School of Public Health Division of Environmental Health Sciences 50 University Hall (Mailing Address) 7 University Hall (Office and GIS Lab) Berkeley, CA jerrett@berkeley.edu Tel: Fax:

2 Purpose This appendix gives additional details on the measurement and modeling used to estimate nitrogen dioxide exposures for the epidemiological analyses presented in the main paper. As noted in the main body of the paper, we have already published model specifications for the 02 and 04 land use regressions, and complete details can be found elsewhere (Jerrett et al. 07; Finkelstein and Jerrett 07). This online appendix summarizes both models to assist with interpreting the epidemiologic results in the main paper. Descriptive statistics and box plots for data collected in 02 and 04 are listed in Supplemental Materials Table 1 and Figure 1 below. The data shows that there was a little more than two times difference between the years. Supplemental Material Table 1: Descriptive statistics of all NO 2 measurements in parts per billion (02, 04) NO2 (02) NO2 (04) N Range Minimum Maximum Mean Std. Deviation In Supplemental Material, Figure 1, we see that with 46 co-located measurements there was strong correlation with R = 0.82 even with the large difference in the magnitude of NO 2 measurements. 2

3 N = NO2 (04) 6 4 Rsq = NO2 (02) NO2 (04) NO2 (02) Supplemental Material Figure 1: Boxplot and scatterplot of co-located monitoring values Supplemental Materials Table 2 and Figure 2 below show the model specifications and the observed on predicted value scatterplots for the Fall 02 model. Supplemental Material Table 2: Summary of the regression results for the logarithmic NO 2 Model (02). Number of obs = 94 Source SS df MS F(7,87) = 27.4 Regression Prob > F = 0 Residual R-square = 0.69 Total Adj. R-square = 0.67 Root MSE = Variable* Coefficient Std. Error t Prob > t VIF LN(NO 2 ) (Constant) 8.06E RD1_0 1.84E RD2_ E IND E DC E X -8.01E D_WIND E TRAF E *RD1_0 measure of expressway within 0m; RD2_50 measure of major roads within 50m; IND750 measure of industrial land use within 750m; DC00 density of dwellings within 00m (Kernel estimate); X UTM NAD83 x-coordinate; D_WIND15 Boolean identifier whether downwind and within 1500m of nearest expressway at PMpeak traffic; TRAF500 Density measure of 24 hour traffic counts within 500m. 3

4 Observed values Unstandardized Predicted Value Supplemental Material Figure 2: Logarithmic-observed mean NO 2 (02) on predicted value. We present similar results for the 04 LUR model in Table 3, with the observed on predicted scatterplot shown in Figure 3. Supplemental Material Table 3: Summary of the regression results for NO 2 Model (04). Number of obs = 1 Source SS df MS F(5,96) = 46.1 Regression Prob > F = Residual R-square = Total Adj. R-square = 0.69 Root MSE = Variable* Coefficient Std. Error t Prob > t VIF NO 2 [ppb] (Constant) 9.62E X_MEAN -2.68E EXCS E RD2_00 2.E RD2_50 1.E EA E * X_MEAN mean deviated UTM NAD83 x-coordinate; EXCS400 measure of area of road expressway encasements within 400m; RD2_00 measure of major roads within 00m; RD2_50 measure of major roads within 50m; EA2500 measure of population density using kernel density with a bandwidth of 2500m of population values at enumeration area centroids. 4

5 30 Observed Values 0 Rsq = Unstandardized Predicted Value Supplemental Material Figure 3: Observed NO 2 (04) on predicted values. Even though the models fit different variables, when comparing 02 to 04 they display similar spatial gradients (see Figure 4). We averaged the two surfaces after assignment to the study subjects in the cohort and used the average estimate for the epidemiological analyses. 5

6 Figure 4: Prediction Maps of 02 and 04 NO2 land use regression models 6

7 References Finkelstein MM, Jerrett M. 07. A study of the relationships between Parkinson's disease and markers of traffic-derived and environmental manganese air pollution in two Canadian cities. Environ Res 4(3): Jerrett M, Arain MA, Kanaroglou P, Beckerman B, Crouse D, Gilbert NL, et al. 07. Modelling the intraurban variability of ambient traffic pollution in Toronto, Canada. J Toxicol Environ Health A 70(3-4):

Establishing an air pollution monitoring network for intra-urban population exposure assessment: a location-allocation approach

Establishing an air pollution monitoring network for intra-urban population exposure assessment: a location-allocation approach Establishing an air pollution monitoring network for intra-urban population exposure assessment: a location-allocation approach P.S. Kanaroglou, M. Jerrett, J. Morrison, B. Beckerman, M.A. Arain, N.L.

More information

Establishing an air pollution monitoring network for intraurban population exposure assessment: A location-allocation approach

Establishing an air pollution monitoring network for intraurban population exposure assessment: A location-allocation approach Atmospheric Environment 39 (2005) 2399 2409 www.elsevier.com/locate/atmosenv Establishing an air pollution monitoring network for intraurban population exposure assessment: A location-allocation approach

More information

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Scatterplots Correlation Explanatory and response variables Simple linear regression General Principles of Data Analysis First plot the data, then add numerical summaries Look

More information

August 2012 EXAMINATIONS Solution Part I

August 2012 EXAMINATIONS Solution Part I August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

Linear Regression Models with Logarithmic Transformations

Linear Regression Models with Logarithmic Transformations Linear Regression Models with Logarithmic Transformations Kenneth Benoit Methodology Institute London School of Economics kbenoit@lse.ac.uk March 17, 2011 1 Logarithmic transformations of variables Considering

More information

MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING

MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING Interpreting Interaction Effects; Interaction Effects and Centering Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Models with interaction effects

More information

Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY

Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY ABSTRACT: This project attempted to determine the relationship

More information

1.1. Simple Regression in Excel (Excel 2010).

1.1. Simple Regression in Excel (Excel 2010). .. Simple Regression in Excel (Excel 200). To get the Data Analysis tool, first click on File > Options > Add-Ins > Go > Select Data Analysis Toolpack & Toolpack VBA. Data Analysis is now available under

More information

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week 10 + 0.0077 (0.052)

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week 10 + 0.0077 (0.052) Department of Economics Session 2012/2013 University of Essex Spring Term Dr Gordon Kemp EC352 Econometric Methods Solutions to Exercises from Week 10 1 Problem 13.7 This exercise refers back to Equation

More information

Interaction effects between continuous variables (Optional)

Interaction effects between continuous variables (Optional) Interaction effects between continuous variables (Optional) Richard Williams, University of Notre Dame, http://www.nd.edu/~rwilliam/ Last revised February 0, 05 This is a very brief overview of this somewhat

More information

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the

More information

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction

More information

The importance of graphing the data: Anscombe s regression examples

The importance of graphing the data: Anscombe s regression examples The importance of graphing the data: Anscombe s regression examples Bruce Weaver Northern Health Research Conference Nipissing University, North Bay May 30-31, 2008 B. Weaver, NHRC 2008 1 The Objective

More information

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 21, 2015

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 21, 2015 Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 21, 2015 References: Long 1997, Long and Freese 2003 & 2006 & 2014,

More information

The average hotel manager recognizes the criticality of forecasting. However, most

The average hotel manager recognizes the criticality of forecasting. However, most Introduction The average hotel manager recognizes the criticality of forecasting. However, most managers are either frustrated by complex models researchers constructed or appalled by the amount of time

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Chapter 4 and 5 solutions

Chapter 4 and 5 solutions Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015

Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015 Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015 Stata Example (See appendices for full example).. use http://www.nd.edu/~rwilliam/stats2/statafiles/multicoll.dta,

More information

Addressing Alternative. Multiple Regression. 17.871 Spring 2012

Addressing Alternative. Multiple Regression. 17.871 Spring 2012 Addressing Alternative Explanations: Multiple Regression 17.871 Spring 2012 1 Did Clinton hurt Gore example Did Clinton hurt Gore in the 2000 election? Treatment is not liking Bill Clinton 2 Bivariate

More information

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software STATA Tutorial Professor Erdinç Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software 1.Wald Test Wald Test is used

More information

California SCHIP Caregivers Perceptions of Dental Care

California SCHIP Caregivers Perceptions of Dental Care California SCHIP Caregivers Perceptions of Dental Care J.J. CRALL, C UCLA / MCHB National Oral Health Policy Center, LA, CA J. BROWN, RAND Survey Research Group, Santa Monica, CA L.U. BROWN, Managed Risk

More information

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material

More information

Solution Let us regress percentage of games versus total payroll.

Solution Let us regress percentage of games versus total payroll. Assignment 3, MATH 2560, Due November 16th Question 1: all graphs and calculations have to be done using the computer The following table gives the 1999 payroll (rounded to the nearest million dolars)

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

Lab 5 Linear Regression with Within-subject Correlation. Goals: Data: Use the pig data which is in wide format:

Lab 5 Linear Regression with Within-subject Correlation. Goals: Data: Use the pig data which is in wide format: Lab 5 Linear Regression with Within-subject Correlation Goals: Data: Fit linear regression models that account for within-subject correlation using Stata. Compare weighted least square, GEE, and random

More information

DETERMINANTS OF CAPITAL ADEQUACY RATIO IN SELECTED BOSNIAN BANKS

DETERMINANTS OF CAPITAL ADEQUACY RATIO IN SELECTED BOSNIAN BANKS DETERMINANTS OF CAPITAL ADEQUACY RATIO IN SELECTED BOSNIAN BANKS Nađa DRECA International University of Sarajevo nadja.dreca@students.ius.edu.ba Abstract The analysis of a data set of observation for 10

More information

Heavy Metal Pollution and Race as Factors in Hypertension and Heart Disease

Heavy Metal Pollution and Race as Factors in Hypertension and Heart Disease Heavy Metal Pollution and Race as Factors in Hypertension and Heart Disease Roger D. Masters Dartmouth College Jonathan Kahn s report on FDA approval of BiDil, a drug targeted to hypertension in blacks

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2 University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages

More information

Geostatistics Exploratory Analysis

Geostatistics Exploratory Analysis Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras cfelgueiras@isegi.unl.pt

More information

Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors.

Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors. Analyzing Complex Survey Data: Some key issues to be aware of Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 24, 2015 Rather than repeat material that is

More information

Lecture 15. Endogeneity & Instrumental Variable Estimation

Lecture 15. Endogeneity & Instrumental Variable Estimation Lecture 15. Endogeneity & Instrumental Variable Estimation Saw that measurement error (on right hand side) means that OLS will be biased (biased toward zero) Potential solution to endogeneity instrumental

More information

Workshop: Using Spatial Analysis and Maps to Understand Patterns of Health Services Utilization

Workshop: Using Spatial Analysis and Maps to Understand Patterns of Health Services Utilization Enhancing Information and Methods for Health System Planning and Research, Institute for Clinical Evaluative Sciences (ICES), January 19-20, 2004, Toronto, Canada Workshop: Using Spatial Analysis and Maps

More information

The Numbers Behind the MLB Anonymous Students: AD, CD, BM; (TF: Kevin Rader)

The Numbers Behind the MLB Anonymous Students: AD, CD, BM; (TF: Kevin Rader) The Numbers Behind the MLB Anonymous Students: AD, CD, BM; (TF: Kevin Rader) Abstract This project measures the effects of various baseball statistics on the win percentage of all the teams in MLB. Data

More information

Nonlinear relationships Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015

Nonlinear relationships Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Nonlinear relationships Richard Williams, University of Notre Dame, http://www.nd.edu/~rwilliam/ Last revised February, 5 Sources: Berry & Feldman s Multiple Regression in Practice 985; Pindyck and Rubinfeld

More information

Forecasting Analytics. Group members: - Arpita - Kapil - Kaushik - Ridhima - Ushhan

Forecasting Analytics. Group members: - Arpita - Kapil - Kaushik - Ridhima - Ushhan Forecasting Analytics Group members: - Arpita - Kapil - Kaushik - Ridhima - Ushhan Business Problem Forecast daily sales of dairy products (excluding milk) to make a good prediction of future demand, and

More information

Using Stata 9 & Higher for OLS Regression Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 8, 2015

Using Stata 9 & Higher for OLS Regression Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 8, 2015 Using Stata 9 & Higher for OLS Regression Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 8, 2015 Introduction. This handout shows you how Stata can be used

More information

Using Geostatistical Tools for Mapping Traffic- Related Air Pollution in Urban Areas

Using Geostatistical Tools for Mapping Traffic- Related Air Pollution in Urban Areas International Environmental Modelling and Software Society (iemss) 7th Intl. Congress on Env. Modelling and Software, San Diego, CA, USA, Daniel P. Ames, Nigel W.T. Quinn and Andrea E. Rizzoli (Eds.) http://www.iemss.org/society/index.php/iemss-2014-proceedings

More information

Quick Stata Guide by Liz Foster

Quick Stata Guide by Liz Foster by Liz Foster Table of Contents Part 1: 1 describe 1 generate 1 regress 3 scatter 4 sort 5 summarize 5 table 6 tabulate 8 test 10 ttest 11 Part 2: Prefixes and Notes 14 by var: 14 capture 14 use of the

More information

THE QUALITY GAP: A STUDY OF NONPROFIT AND COMMERCIAL CHILD CARE CENTRES IN CANADA December 2004

THE QUALITY GAP: A STUDY OF NONPROFIT AND COMMERCIAL CHILD CARE CENTRES IN CANADA December 2004 THE QUALITY GAP: A STUDY OF NONPROFIT AND COMMERCIAL CHILD CARE CENTRES IN CANADA December 2004 Gordon Cleveland and Michael Krashinsky, Division of Management, University of Toronto at Scarborough INTRODUCTION

More information

Getting Correct Results from PROC REG

Getting Correct Results from PROC REG Getting Correct Results from PROC REG Nathaniel Derby, Statis Pro Data Analytics, Seattle, WA ABSTRACT PROC REG, SAS s implementation of linear regression, is often used to fit a line without checking

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Quantile Treatment Effects 2. Control Functions

More information

Discussion Section 4 ECON 139/239 2010 Summer Term II

Discussion Section 4 ECON 139/239 2010 Summer Term II Discussion Section 4 ECON 139/239 2010 Summer Term II 1. Let s use the CollegeDistance.csv data again. (a) An education advocacy group argues that, on average, a person s educational attainment would increase

More information

Study Plan Master in Public Health ( Non-Thesis Track)

Study Plan Master in Public Health ( Non-Thesis Track) Study Plan Master in Public Health ( Non-Thesis Track) I. General Rules and conditions : 1. This plan conforms to the regulations of the general frame of the Graduate Studies. 2. Specialties allowed to

More information

Developing a Translog Cost Function for Pharmaceutical Distribution

Developing a Translog Cost Function for Pharmaceutical Distribution Developing a Translog Cost Function for Pharmaceutical Distribution Gaurav Jetly b Christian Rossetti a * Michael Kay b Donald Warsing a Robert Handfield a *contact author a Department of Business Management

More information

MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal

MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal MGT 267 PROJECT Forecasting the United States Retail Sales of the Pharmacies and Drug Stores Done by: Shunwei Wang & Mohammad Zainal Dec. 2002 The retail sale (Million) ABSTRACT The present study aims

More information

BIOL 933 Lab 6 Fall 2015. Data Transformation

BIOL 933 Lab 6 Fall 2015. Data Transformation BIOL 933 Lab 6 Fall 2015 Data Transformation Transformations in R General overview Log transformation Power transformation The pitfalls of interpreting interactions in transformed data Transformations

More information

Data Analysis Methodology 1

Data Analysis Methodology 1 Data Analysis Methodology 1 Suppose you inherited the database in Table 1.1 and needed to find out what could be learned from it fast. Say your boss entered your office and said, Here s some software project

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

Mobile monitoring of air pollution in cities: the case of Hamilton, Ontario, Canada

Mobile monitoring of air pollution in cities: the case of Hamilton, Ontario, Canada PAPER www.rsc.org/jem Journal of Environmental Monitoring monitoring of air pollution in cities: the case of Hamilton, Ontario, Canada Julie Wallace,* a Denis Corr, b Patrick Deluca, a Pavlos Kanaroglou

More information

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices: Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:

More information

Wingz Ergonomic Computer Keyboard An overview of research leading to a new keyboard design

Wingz Ergonomic Computer Keyboard An overview of research leading to a new keyboard design Wingz Ergonomic Computer Keyboard An overview of research leading to a new keyboard design Advanced Research Computers Inc. March 2011 Abstract The Wingz Smartkeyboard is an advance in computer keyboard

More information

R e s e a r c h R e p o r t

R e s e a r c h R e p o r t R e s e a r c h R e p o r t H E A L T H E F F E CTS IN STITUTE Number 140 May 2009 PRESS VERSION Extended Follow-Up and Spatial Analysis of the American Cancer Society Study Linking Particulate Air Pollution

More information

Didacticiel - Études de cas

Didacticiel - Études de cas 1 Topic Regression analysis with LazStats (OpenStat). LazStat 1 is a statistical software which is developed by Bill Miller, the father of OpenStat, a wellknow tool by statisticians since many years. These

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Hamilton Truck Route Study

Hamilton Truck Route Study Prepared for the City of Hamilton March 2012 Pavlos S. Kanaroglou, Ph.D. Vivek Korikanthimath, Ph.D. McMaster Institute of Transportation and Logistics McMaster University Hamilton, Ontario March 2012

More information

MODELING AUTO INSURANCE PREMIUMS

MODELING AUTO INSURANCE PREMIUMS MODELING AUTO INSURANCE PREMIUMS Brittany Parahus, Siena College INTRODUCTION The findings in this paper will provide the reader with a basic knowledge and understanding of how Auto Insurance Companies

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

25 Working with categorical data and factor variables

25 Working with categorical data and factor variables 25 Working with categorical data and factor variables Contents 25.1 Continuous, categorical, and indicator variables 25.1.1 Converting continuous variables to indicator variables 25.1.2 Converting continuous

More information

An Analysis of the Undergraduate Tuition Increases at the University of Minnesota Duluth

An Analysis of the Undergraduate Tuition Increases at the University of Minnesota Duluth Proceedings of the National Conference On Undergraduate Research (NCUR) 2012 Weber State University March 29-31, 2012 An Analysis of the Undergraduate Tuition Increases at the University of Minnesota Duluth

More information

Air Pollution and Mortality - Spatial Analysis

Air Pollution and Mortality - Spatial Analysis ORIGINAL ARTICLE Spatial Analysis of Air Pollution and Mortality in Los Angeles Michael Jerrett,* Richard T. Burnett, Renjun Ma, C. Arden Pope III, Daniel Krewski, K. Bruce Newbold, George Thurston,**

More information

Northern Colorado Retail Study: A shift-share analysis 2000 to 2010

Northern Colorado Retail Study: A shift-share analysis 2000 to 2010 Northern Colorado Retail Study: A shift-share analysis 2000 to 2010 Everitt Real Estate Center Steven P Laposa, PhD Christopher Hannum, PhD Economics Candidate Austin Carter, Senior (Real Estate Major)

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

International Statistical Institute, 56th Session, 2007: Phil Everson

International Statistical Institute, 56th Session, 2007: Phil Everson Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction

More information

ADVANCING THE USE OF MOBILE MONITORING DATA FOR AIR POLLUTION MODELLING

ADVANCING THE USE OF MOBILE MONITORING DATA FOR AIR POLLUTION MODELLING ADVANCING THE USE OF MOBILE MONITORING DATA FOR AIR POLLUTION MODELLING ADVANCING THE USE OF MOBILE MONITORING DATA FOR AIR POLLUTION MODELLING By Matthew D. Adams, HBESc, MES A Thesis Submitted to the

More information

Nonlinear Regression Functions. SW Ch 8 1/54/

Nonlinear Regression Functions. SW Ch 8 1/54/ Nonlinear Regression Functions SW Ch 8 1/54/ The TestScore STR relation looks linear (maybe) SW Ch 8 2/54/ But the TestScore Income relation looks nonlinear... SW Ch 8 3/54/ Nonlinear Regression General

More information

Stata Walkthrough 4: Regression, Prediction, and Forecasting

Stata Walkthrough 4: Regression, Prediction, and Forecasting Stata Walkthrough 4: Regression, Prediction, and Forecasting Over drinks the other evening, my neighbor told me about his 25-year-old nephew, who is dating a 35-year-old woman. God, I can t see them getting

More information

Modeling Carrier Truckload Freight Rates in Spot Markets

Modeling Carrier Truckload Freight Rates in Spot Markets Modeling Carrier Truckload Freight Rates in Spot Markets 5 th METRANS International Freight Conference Long Beach, CA October 8-10, 2013 C. Lindsey, H.S. Mahmassani, A. Frei, H. Alibabai, Y.W. Park, D.

More information

Handling missing data in Stata a whirlwind tour

Handling missing data in Stata a whirlwind tour Handling missing data in Stata a whirlwind tour 2012 Italian Stata Users Group Meeting Jonathan Bartlett www.missingdata.org.uk 20th September 2012 1/55 Outline The problem of missing data and a principled

More information

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables. SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation

More information

Moderator and Mediator Analysis

Moderator and Mediator Analysis Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test March 2014

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test March 2014 UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test March 2014 STAB22H3 Statistics I Duration: 1 hour and 45 minutes Last Name: First Name: Student number: Aids

More information

Ann-Renée Blais. Ph.D. in Psychology (Quantitative) The Ohio State University, Columbus, Ohio August 2001

Ann-Renée Blais. Ph.D. in Psychology (Quantitative) The Ohio State University, Columbus, Ohio August 2001 Ann-Renée Blais 1133 Sheppard Ave. W., P.O. Box 2000, Toronto, Ontario, CANADA M3M 3B9 tel: 416-635-2000 x3082 fax: 416-635-2013 e-mail: ann-renee.blais@drdc-rddc.gc.ca EDUCATION Ph.D. in Psychology (Quantitative)

More information

Use of Monte Carlo Simulation for a Peer Review Process Performance Model

Use of Monte Carlo Simulation for a Peer Review Process Performance Model Use of Monte Carlo Simulation for a Peer Review Process Performance Model Presenter: Emerald Russo, Systems Engineering US Combat Systems, BAE Systems Credit for photo references found at end of presentation.

More information

MATH 564 Project Report. Analysis of Desktop Virtualization Capacity with. Linear Regression Model

MATH 564 Project Report. Analysis of Desktop Virtualization Capacity with. Linear Regression Model MATH 564 Project Report Analsis of Desktop Virtualization Capacit with Linear Regression Model Hongwei Jin CWID:A20288745 Dec. 1 st, 2012 1. Problem Describe a) Background Information At the beginning,

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

Outliers Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 7, 2016

Outliers Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 7, 2016 Outliers Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 7, 2016 These notes draw heavily from several sources, including Fox s Regression Diagnostics; Pindyck

More information

Outline: Demand Forecasting

Outline: Demand Forecasting Outline: Demand Forecasting Given the limited background from the surveys and that Chapter 7 in the book is complex, we will cover less material. The role of forecasting in the chain Characteristics of

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

is paramount in advancing any economy. For developed countries such as

is paramount in advancing any economy. For developed countries such as Introduction The provision of appropriate incentives to attract workers to the health industry is paramount in advancing any economy. For developed countries such as Australia, the increasing demand for

More information

RELATIONSHIP BETWEEN WORKING CAPITAL MANAGEMENT AND PROFITABILITY IN TURKEY INDUSTRIAL LISTED COMPANIES

RELATIONSHIP BETWEEN WORKING CAPITAL MANAGEMENT AND PROFITABILITY IN TURKEY INDUSTRIAL LISTED COMPANIES RELATIONSHIP BETWEEN WORKING CAPITAL MANAGEMENT AND PROFITABILITY IN TURKEY INDUSTRIAL LISTED COMPANIES Prof.Dr. Necdet SAGLAM Lecturer Aziz KAGITCI Assistant Prof.Dr. Semih BUYUKIPEKCI Abstract The present

More information

Spatial approaches to epidemiology and public health: experiences from SAHSU, UK. Dr Linda Beale Research Associate, SAHSU. Imperial College London

Spatial approaches to epidemiology and public health: experiences from SAHSU, UK. Dr Linda Beale Research Associate, SAHSU. Imperial College London Imperial College London Spatial approaches to epidemiology and public health: experiences from SAHSU, UK Dr Linda Beale Research Associate, SAHSU Page 1 The Small Area Health Statistics Unit (SAHSU) Commenced

More information

Estimation of environmental exposure to ground-level ozone: an example of modelling for the Québec population

Estimation of environmental exposure to ground-level ozone: an example of modelling for the Québec population Estimation of environmental exposure to ground-level ozone: an example of modelling for the Québec population FOREWORD The Québec government's Plan d action 2006-2012 sur les changements climatiques (2006-2012

More information

c 2015, Jeffrey S. Simonoff 1

c 2015, Jeffrey S. Simonoff 1 Modeling Lowe s sales Forecasting sales is obviously of crucial importance to businesses. Revenue streams are random, of course, but in some industries general economic factors would be expected to have

More information

Title: Modeling for Prediction Linear Regression with Excel, Minitab, Fathom and the TI-83

Title: Modeling for Prediction Linear Regression with Excel, Minitab, Fathom and the TI-83 Title: Modeling for Prediction Linear Regression with Excel, Minitab, Fathom and the TI-83 Brief Overview: In this lesson section, the class is going to be exploring data through linear regression while

More information

ORTHOGONAL POLYNOMIAL CONTRASTS INDIVIDUAL DF COMPARISONS: EQUALLY SPACED TREATMENTS

ORTHOGONAL POLYNOMIAL CONTRASTS INDIVIDUAL DF COMPARISONS: EQUALLY SPACED TREATMENTS ORTHOGONAL POLYNOMIAL CONTRASTS INDIVIDUAL DF COMPARISONS: EQUALLY SPACED TREATMENTS Many treatments are equally spaced (incremented). This provides us with the opportunity to look at the response curve

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Determining Factors of a Quick Sale in Arlington's Condo Market. Team 2: Darik Gossa Roger Moncarz Jeff Robinson Chris Frohlich James Haas

Determining Factors of a Quick Sale in Arlington's Condo Market. Team 2: Darik Gossa Roger Moncarz Jeff Robinson Chris Frohlich James Haas Determining Factors of a Quick Sale in Arlington's Condo Market Team 2: Darik Gossa Roger Moncarz Jeff Robinson Chris Frohlich James Haas Executive Summary The real estate market for condominiums in Northern

More information

Curve Fitting. Before You Begin

Curve Fitting. Before You Begin Curve Fitting Chapter 16: Curve Fitting Before You Begin Selecting the Active Data Plot When performing linear or nonlinear fitting when the graph window is active, you must make the desired data plot

More information

ADVANCED FORECASTING MODELS USING SAS SOFTWARE

ADVANCED FORECASTING MODELS USING SAS SOFTWARE ADVANCED FORECASTING MODELS USING SAS SOFTWARE Girish Kumar Jha IARI, Pusa, New Delhi 110 012 gjha_eco@iari.res.in 1. Transfer Function Model Univariate ARIMA models are useful for analysis and forecasting

More information

Week 5: Multiple Linear Regression

Week 5: Multiple Linear Regression BUS41100 Applied Regression Analysis Week 5: Multiple Linear Regression Parameter estimation and inference, forecasting, diagnostics, dummy variables Robert B. Gramacy The University of Chicago Booth School

More information

AP Statistics. Chapter 4 Review

AP Statistics. Chapter 4 Review Name AP Statistics Chapter 4 Review 1. In a study of the link between high blood pressure and cardiovascular disease, a group of white males aged 35 to 64 was followed for 5 years. At the beginning of

More information

HOW TO USE MINITAB: DESIGN OF EXPERIMENTS. Noelle M. Richard 08/27/14

HOW TO USE MINITAB: DESIGN OF EXPERIMENTS. Noelle M. Richard 08/27/14 HOW TO USE MINITAB: DESIGN OF EXPERIMENTS 1 Noelle M. Richard 08/27/14 CONTENTS 1. Terminology 2. Factorial Designs When to Use? (preliminary experiments) Full Factorial Design General Full Factorial Design

More information