A Cohort Study of Traffic-related Air Pollution and Mortality in Toronto, Canada: Online Appendix



Similar documents
Establishing an air pollution monitoring network for intra-urban population exposure assessment: a location-allocation approach

Establishing an air pollution monitoring network for intraurban population exposure assessment: A location-allocation approach

MULTIPLE REGRESSION EXAMPLE

Correlation and Regression

August 2012 EXAMINATIONS Solution Part I

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Linear Regression Models with Logarithmic Transformations

MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING

Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY

1.1. Simple Regression in Excel (Excel 2010).

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week (0.052)

Interaction effects between continuous variables (Optional)

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

The importance of graphing the data: Anscombe s regression examples

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, Last revised February 21, 2015

The average hotel manager recognizes the criticality of forecasting. However, most

Regression Analysis: A Complete Example

Chapter 4 and 5 solutions

2. Simple Linear Regression

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

SPSS Guide: Regression Analysis

Multicollinearity Richard Williams, University of Notre Dame, Last revised January 13, 2015

Addressing Alternative. Multiple Regression Spring 2012

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

California SCHIP Caregivers Perceptions of Dental Care

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

Solution Let us regress percentage of games versus total payroll.

Univariate Regression

Lab 5 Linear Regression with Within-subject Correlation. Goals: Data: Use the pig data which is in wide format:

DETERMINANTS OF CAPITAL ADEQUACY RATIO IN SELECTED BOSNIAN BANKS

Heavy Metal Pollution and Race as Factors in Hypertension and Heart Disease

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

Geostatistics Exploratory Analysis

Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors.

Lecture 15. Endogeneity & Instrumental Variable Estimation

Workshop: Using Spatial Analysis and Maps to Understand Patterns of Health Services Utilization

The Numbers Behind the MLB Anonymous Students: AD, CD, BM; (TF: Kevin Rader)

Nonlinear relationships Richard Williams, University of Notre Dame, Last revised February 20, 2015

Forecasting Analytics. Group members: - Arpita - Kapil - Kaushik - Ridhima - Ushhan

Using Stata 9 & Higher for OLS Regression Richard Williams, University of Notre Dame, Last revised January 8, 2015

Using Geostatistical Tools for Mapping Traffic- Related Air Pollution in Urban Areas

Quick Stata Guide by Liz Foster

THE QUALITY GAP: A STUDY OF NONPROFIT AND COMMERCIAL CHILD CARE CENTRES IN CANADA December 2004

Getting Correct Results from PROC REG

ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Discussion Section 4 ECON 139/ Summer Term II

Study Plan Master in Public Health ( Non-Thesis Track)

Developing a Translog Cost Function for Pharmaceutical Distribution

proveeks_bilag.out The SAS System 22:27 Thursday, November 27, Source DF Squares Square F Value Pr > F

MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal

BIOL 933 Lab 6 Fall Data Transformation

Data Analysis Methodology 1

STAT 350 Practice Final Exam Solution (Spring 2015)

Mobile monitoring of air pollution in cities: the case of Hamilton, Ontario, Canada

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Wingz Ergonomic Computer Keyboard An overview of research leading to a new keyboard design

R e s e a r c h R e p o r t

Didacticiel - Études de cas

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Hamilton Truck Route Study

MODELING AUTO INSURANCE PREMIUMS

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

25 Working with categorical data and factor variables

An Analysis of the Undergraduate Tuition Increases at the University of Minnesota Duluth

Air Pollution and Mortality - Spatial Analysis

Northern Colorado Retail Study: A shift-share analysis 2000 to 2010

Simple linear regression

International Statistical Institute, 56th Session, 2007: Phil Everson

ADVANCING THE USE OF MOBILE MONITORING DATA FOR AIR POLLUTION MODELLING

Nonlinear Regression Functions. SW Ch 8 1/54/

Stata Walkthrough 4: Regression, Prediction, and Forecasting

Modeling Carrier Truckload Freight Rates in Spot Markets

Handling missing data in Stata a whirlwind tour

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Moderator and Mediator Analysis

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test March 2014

Ann-Renée Blais. Ph.D. in Psychology (Quantitative) The Ohio State University, Columbus, Ohio August 2001

Use of Monte Carlo Simulation for a Peer Review Process Performance Model

MATH 564 Project Report. Analysis of Desktop Virtualization Capacity with. Linear Regression Model

Chapter 23. Inferences for Regression

Outliers Richard Williams, University of Notre Dame, Last revised April 7, 2016

Outline: Demand Forecasting

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

is paramount in advancing any economy. For developed countries such as

RELATIONSHIP BETWEEN WORKING CAPITAL MANAGEMENT AND PROFITABILITY IN TURKEY INDUSTRIAL LISTED COMPANIES

Spatial approaches to epidemiology and public health: experiences from SAHSU, UK. Dr Linda Beale Research Associate, SAHSU. Imperial College London

Estimation of environmental exposure to ground-level ozone: an example of modelling for the Québec population

c 2015, Jeffrey S. Simonoff 1

Title: Modeling for Prediction Linear Regression with Excel, Minitab, Fathom and the TI-83

ORTHOGONAL POLYNOMIAL CONTRASTS INDIVIDUAL DF COMPARISONS: EQUALLY SPACED TREATMENTS

Chapter 7: Simple linear regression Learning Objectives

Determining Factors of a Quick Sale in Arlington's Condo Market. Team 2: Darik Gossa Roger Moncarz Jeff Robinson Chris Frohlich James Haas

Curve Fitting. Before You Begin

ADVANCED FORECASTING MODELS USING SAS SOFTWARE

Week 5: Multiple Linear Regression

AP Statistics. Chapter 4 Review

HOW TO USE MINITAB: DESIGN OF EXPERIMENTS. Noelle M. Richard 08/27/14

Transcription:

A Cohort Study of Traffic-related Air Pollution and Mortality in Toronto, Canada: Online Appendix Michael Jerrett, 1 Murray M. Finkelstein, 2 Jeff R. Brook, 3 M. Altaf Arain, 4 Palvos Kanaroglou, 4 Dave M. Stieb, 5 Nicholas L. Gilbert, 5 Dave Verma, 6 Norm Finkelstein, 4 Kenneth R. Chapman, 7 Malcolm R. Sears 8 1 Division of Environmental Health Sciences, School of Public Health, University of California, Berkeley, Berkeley, California, United States 2 Department of Family and Community Medicine, Mount Sinai Hospital,University of Toronto, Toronto, Ontario, Canada 3 Meteorological Services, Environment Canada 4 School of Geography and Earth Science, McMaster University, Hamilton, Ontario, Canada 5 Air Health Effects Section, Health Canada 6 Occupational Medicine Program, McMaster University, Hamilton, Ontario, Canada 7 Division of Respirology, Department of Medicine, University of Toronto, Toronto, Ontario, Canada 8 Division of Respirology, Department of Medicine, McMaster University, Hamilton, Ontario, Canada Corresponding Author: Michael Jerrett University of California, Berkeley School of Public Health Division of Environmental Health Sciences 50 University Hall (Mailing Address) 7 University Hall (Office and GIS Lab) Berkeley, CA 947-7360 Email: jerrett@berkeley.edu Tel: 5-725-9991 Fax: 5-642-5815 1

Purpose This appendix gives additional details on the measurement and modeling used to estimate nitrogen dioxide exposures for the epidemiological analyses presented in the main paper. As noted in the main body of the paper, we have already published model specifications for the 02 and 04 land use regressions, and complete details can be found elsewhere (Jerrett et al. 07; Finkelstein and Jerrett 07). This online appendix summarizes both models to assist with interpreting the epidemiologic results in the main paper. Descriptive statistics and box plots for data collected in 02 and 04 are listed in Supplemental Materials Table 1 and Figure 1 below. The data shows that there was a little more than two times difference between the years. Supplemental Material Table 1: Descriptive statistics of all NO 2 measurements in parts per billion (02, 04) NO2 (02) NO2 (04) N Range Minimum Maximum Mean Std. Deviation 94 43.67 17.41 61.08 32.2365 9.18813 0.73 7.03 27.76 14.5433 4.48639 In Supplemental Material, Figure 1, we see that with 46 co-located measurements there was strong correlation with R = 0.82 even with the large difference in the magnitude of NO 2 measurements. 2

60 50 41 18 16 40 14 30 12 8 0 N = 46 46 NO2 (04) 6 4 Rsq = 0.6792 30 40 50 60 NO2 (02) NO2 (04) NO2 (02) Supplemental Material Figure 1: Boxplot and scatterplot of co-located monitoring values Supplemental Materials Table 2 and Figure 2 below show the model specifications and the observed on predicted value scatterplots for the Fall 02 model. Supplemental Material Table 2: Summary of the regression results for the logarithmic NO 2 Model (02). Number of obs = 94 Source SS df MS F(7,87) = 27.4 Regression 5.09 7 0.727 Prob > F = 0 Residual 2.29 86 0.027 R-square = 0.69 Total 7.38 93 Adj. R-square = 0.67 Root MSE = 0.163 Variable* Coefficient Std. Error t Prob > t VIF LN(NO 2 ) (Constant) 8.06E+00 1.177 6.85 0.00 RD1_0 1.84E-01 0.0 9.04 0.00 1. RD2_50 5.56E-01 0.300 1.85 0.07 1.11 IND750 1.63E-03 0.001 3.04 0.00 1. DC00 8.28E-05 0.000 4.66 0.00 1.38 X -8.01E-06 0.000-4.31 0.00 1.06 D_WIND1500 1.32E-01 0.040 3.30 0.00 1. TRAF500 1.11E-03 0.001 1.96 0.05 1.32 *RD1_0 measure of expressway within 0m; RD2_50 measure of major roads within 50m; IND750 measure of industrial land use within 750m; DC00 density of dwellings within 00m (Kernel estimate); X UTM NAD83 x-coordinate; D_WIND15 Boolean identifier whether downwind and within 1500m of nearest expressway at PMpeak traffic; TRAF500 Density measure of 24 hour traffic counts within 500m. 3

4.2 4.0 3.8 3.6 3.4 Observed values 3.2 3.0 2.8 2.8 3.0 3.2 3.4 3.6 3.8 4.0 Unstandardized Predicted Value Supplemental Material Figure 2: Logarithmic-observed mean NO 2 (02) on predicted value. We present similar results for the 04 LUR model in Table 3, with the observed on predicted scatterplot shown in Figure 3. Supplemental Material Table 3: Summary of the regression results for NO 2 Model (04). Number of obs = 1 Source SS df MS F(5,96) = 46.1 Regression 1411.45 5 282.291 Prob > F = 0.000 Residual 581.86 95 6.125 R-square = 0.7080943 Total 1993.31 0 Adj. R-square = 0.69 Root MSE = 2.475 Variable* Coefficient Std. Error t Prob > t VIF NO 2 [ppb] (Constant) 9.62E+00 0.690 13.93 0.00 X_MEAN -2.68E-04 0.000-9.36 0.00 1.02 EXCS400 9.83E-01 0.123 8.00 0.00 1.12 RD2_00 2.E-01 0.087 2.52 0.01 1.62 RD2_50 1.E+01 5.6 2.15 0.03 1.28 EA2500 5.02E-04 0.000 3.60 0.00 1.52 * X_MEAN mean deviated UTM NAD83 x-coordinate; EXCS400 measure of area of road expressway encasements within 400m; RD2_00 measure of major roads within 00m; RD2_50 measure of major roads within 50m; EA2500 measure of population density using kernel density with a bandwidth of 2500m of population values at enumeration area centroids. 4

30 Observed Values 0 Rsq = 0.7081 0 30 Unstandardized Predicted Value Supplemental Material Figure 3: Observed NO 2 (04) on predicted values. Even though the models fit different variables, when comparing 02 to 04 they display similar spatial gradients (see Figure 4). We averaged the two surfaces after assignment to the study subjects in the cohort and used the average estimate for the epidemiological analyses. 5

Figure 4: Prediction Maps of 02 and 04 NO2 land use regression models 6

References Finkelstein MM, Jerrett M. 07. A study of the relationships between Parkinson's disease and markers of traffic-derived and environmental manganese air pollution in two Canadian cities. Environ Res 4(3):4-432. Jerrett M, Arain MA, Kanaroglou P, Beckerman B, Crouse D, Gilbert NL, et al. 07. Modelling the intraurban variability of ambient traffic pollution in Toronto, Canada. J Toxicol Environ Health A 70(3-4):0-212. 7