An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China

Size: px
Start display at page:

Download "An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China"

Transcription

1 An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China Ding Ding, San Diego State University, San Diego, CA ABSTRACT Finding predictors of health-related behaviors is an important step in public health research. According to the Behavioral Ecological Model (BEM), people behave in certain ways because physical and social contingencies have reinforced this behavior (Hovell, Wahlgren, Gehrman, 2002). So far the BEM has still not been applied to study the smoking epidemic in China. To fill the gap of previous research, a multiple logistic model, which contained a hierarchy of variables on different levels of matecontingencies, was constructed with the help of SAS out of a sample from Changqiao District, Shanghai, China. INTRODUCTION As the world s most populous developing country, with a population of 1.3 billion, and more than 350 million smokers, China is the world s largest producer and consumer of tobacco (Yong et al., 1999). From the 2002 National Smoking Survey, smoking prevalence among Chinese males was 66.0%, (Yang, Ma, Li, & Zhou, 2005). Smoking is already responsible for more than 12% of Chinese male adult deaths (Liu, Peto, Chen, Boreham, Wu, & Li, 1998), and the percentage is still growing. Minimal research has been conducted in China on smoking behavior, and includes only univariate and bivariate analyses, which couldn t include potential predictors on different levels of environment that reinforce smoking behavior. In this study, in order to test potential predictors simultaneously, examine interactions between variables, and adjust for confounders, multiple logistic regression analysis was performed to construct a final model, which predicted current smoking behavior in the study population. All statistical analysis was conducted using SAS DATA COLLECTION/ DATA ENTRY Data were collected by the Fudan University, School of Public Health in Shanghai, China. Potential participants were selected by random number dialing through a resident information network at the Changqiao Residents Committee. Questionnaire based interviews were done by trained interviewers and data were collected during the interviews. Data were entered in to an SPSS file (Version 14.0). Ten percent of the data were double entered for quality control. DATA INPUT In order to input data from SPSS file into an SAS file, the original dataset was saved as a portable file in SPSS and then inputted into SAS using the following statement:

2 libname final spss 'C:\Shanghai Project\Data\final.por'; data chq ; set final. chqdat; run; In this way, a data set named CHQ was created in SAS. In order to examine whether the data set was inputted right, the PROC CONTENTS statement was used and the output showed that all the variables were inputted correctly. DATA CLEANING The layout of the data file CHQ is presented in Table 1. Table 1 Description of the data file CHQ Variable Names Description Variable Type Valid Value ID ID Number for participants Numeric 1-3 digits AGE Age Numeric 18-80* CHILDREN Having children living at home Numeric 1 or 2 EDU Education level Numeric 1-8 EMPLOY Employment status Numeric 1-9 FDSMK Friend smoke Numeric 1-3 FDTOL Friend tolerate others smoking Numeric 1-3 HEALTH Health status Numeric 1-5 INCOME Family monthly income Numeric 1-7 KNOW Knowledge about adverse health Numeric 3-15 effects of smoking MAR Marital status Numeric 1-4 SEX Gender Numeric 1 or 2 SMK Smoking status** Numeric 1 or 2 SPT_M Social acceptability for male smoking Numeric 9-27 * The valid range of age is determined by the recruiting criteria. ** Smokers have been defined as smoking more than 100 cigarettes lifetime First, FROC FREQ was used for every variable of interest to check if there were any data out of the range of valid values. The next step was to determine where the invalid data occurred in the raw data set. The following program was used for age, the rest of the variables were similar. if age < 18 or age > 80 then put age= ID=; run; After the locations of invalid data were found, the raw data was examined in order to attain the true value of the variable.

3 DATA MANIPULATION Once the data were checked for accuracy, PROC FREQ was used to obtain the distribution of variables. The outcome showed that variables EDU, HEALTH, INCOME, KNOW followed highly skewed distributions. Variables EMPLOY SPT_M had too many categories and not enough scores in certain categories. Age was a continuous variable. In order to facilitate further analysis, variables EDU, HEALTH, INCOME, KNOW, EMPLOY, SPT_M were recoded into new variables with less categories; AGE was collapsed into a categorical variable. IF-THEN statements were used for recoding. Cut-off points were determined by the data distribution and the theoretical meaning of each category. Table 2 shows the variables after recoding. Table 2 Original and recoded variables in data file CHQ Original variable Recoded Variable Categories AGE AGECAT 1=19-42; 2=42-48; 3=48-60; 4=60-79 CHILDREN AT HOME 1= Yes; 0=No EDU EDUCAT 1=0-9 year education; 2=10-12 years; 3=13+ years EMPLOY EMCAT 1=retired; 2=current working; 3=students; 4=laid offs FDSMK 1=most friends; 2=some friends; 3= No friends FDTOL 1=most friends; 2=some friends; 3= No friends HEALTH HLTHCAT 1=good; 0= average INCOME ICCAT 1= RMB; 2= RMB; 3=3000+RMB KNOW KWCAT 1=high; 0=low MAR MARCAT 1=not married (including devoiced, widowed); 2= married SEX 1=male; 0=female SMK 1=yes; 0=no SPT_M SPTCAT 1=low; 2= average; 3=high DATA ANALYSIS 1. Descriptive Analysis Table 3 shows the frequencies of all of the independent variables. Table 3 Participant Characteristics (N=243) Categorical Variable Number % Age Gender Male Female

4 Education 0-9 years years years Marital Status Married Single/divorced/widowed Having children living in same household Yes No Current Employment Status Retired Working Full time student Laid off Family monthly income (after tax, Unit: RMB) Health Status Good Average Friends smoking Most friends Some friends Almost no friends Friends tolerate others smoking Most friends Some friends Almost no friends Social acceptability of smoking Low Average High Knowledge about negative health effects of smoking High Low Smoking status Yes No Bivariate Analysis Previous studies and statistics have stated a remarkable discrepancy between the smoking rate among Chinese males and females. To justify further analysis and the conceptual model, the relationship between gender and current smoking must be looked into first. Using the following program, a contingency table was created and a Chi- Square statistic was calculated. proc freq data=chq; table sex*smk/chisq; title 'smoking by gender'; run;

5 SAS output showed that no female participants were smokers and 75 out of 123 male participants were smokers. In this way, further analysis would only be limited to the subgroup of male participants. The WHERE statement was used to create this subgroup. In the male subgroup, bivariate analysis was preformed individually between each independent variable and the dependent variable (smoking status) to identify the association between potential predictors and smoking status. The results are shown in Table 4. Table 4 Bivariate Analysis of independent variables (N=123) Categorical Variable smokers ( n=75) nonsmokers (n=48) p-value Age P< (14.67%) 15 (31.25%) (40.00%) 5 (10.42%) (30.67%) 7 (14.58%) (14.67%) 21 (43.75%) Education P= years 23 (30.67%) 12 (25.00%) years 41 (54.67%) 20 (41.67%) 13+ years 11 (14.67%) 16 (33.33%) Marital Status P= Married 57 (76.00%) 34 (70.83%) Single/divorced/widowed 18 (24.00%) 14 (29.17%) Having children living in at home P= Yes 28 (37.33%) 12 (25.00%) No 47 (62.67%) 36 (75.00%) Current Employment Status P= Retired 19 (25.68%) 24 (51.06%) Working 36 (48.00%) 19 (39.58%) Full time student and housewives 1 (1.35%) 3 (6.38%) Laid off 19 (25.68%) 2 (4.26%) Family monthly income (RMB) P = (36.00%) 9 (18.75%) (41.33%) 25 (52.08%) (22.67%) 14 (29.17%) Health Status p= Good 36 (48.00%) 26 (54.17%) Not so good 39 (52.00%) 22 (45.83%) Friends smoking P< Most friends 44 (58.67%) 7 (14.58%) Some friends 25 (33.33%) 31(64.58%) Almost no friends 5 (6.67%) 10 (20.83%) Friends tolerate others smoking P= Most friends 37(49.33%) 13 (27.08%) Some friends 26(34.67%) 22 (45.83%) Almost no friends 11 (14.67%) 13 (27.08%) Social acceptability of smoking P= Low 6 (8.00%) 10 (20.83%) Average 59 (78.67%) 29 (60.42%) High 10 (13.33%) 9 (18.75%) Knowledge about negative health effects of smoking P= High 56 (74.67%) 40 (83.33%) Low 47 (62.67%) 8 (16.67%)

6 3. Multiple Logistic Regression Analysis Subsequent to bivariate analysis, variables with a significance level less than 0.05 were included in the logistic regression model. Even though some variables had a significance level higher than 0.05, they might still confound the association between the independent variables and the dependent variable. Considering this, variables with a significance level were also included into the logistic regression model. However, the decision of whether or not to keep these variables needed to be tested. A PROC LOGISTIC statement was used to test the model. proc logistic data=chq; class agecat(ref='1') educat(ref='1') emcat (ref='2')iccat(ref='1') children (ref='0') fdsmk (ref='3')fdtol(ref='3') sptcat (ref='1')/ param=reference; model smk(ref='0')= fdsmk agecat emcat educat fdtol sptcat iccat children / rl lackfit aggregate scale=none rsquare; run; The first model sequentially included friends smoke, age, employment status, educational level, friends tolerance of others smoking, social support for smoking, family income, children living at home. This model explained 35.46% of the variance of smoking status (F=2.94, P<0.0001). The next step was to test whether the exclusion of variables CHILDREN, ICCAT, SPTCAT, FDTOL from the first model would change the association between the independent variables and dependent variable by a significantly large amount. PROC LOGISTIC was used, removing one of the four potential confounders each time from the original model. By removing variables CHILDREN and INCOME from the model, none of the odds ratios of other variables had changed by more than 10%, thus these two variables were left out of the model. By excluding FDTOL and SPTCAT from the model, with a percent change more than 10%, FDTOL and SPTCAT were left inside the model as confounders because they distorted the relationship between variable FDSMK and SMK. The final model included friends smoke, age, employment status, educational level, friends tolerance of others smoking, perceived social acceptability of smoking. This model explained 35.16% (F=3.50, P<0.0001) of the variance in smoking status outcome. With a P-value =0.60, this logistic regression model showed a good fit to the sample data. Odds Ratios, confidence levels and p-values from the Logistic Regression model are summarized in Table 5.

7 Table 5 Multiple analysis of factors associated with smoking status using logistic regression (N=123) Variable OR 95% CI p-value Age * Education 0-9 years years years Current Employment Status Currently Working 1 Retired Full time student Laid off * Friends smoking Almost no friends 1 Some friends Most friends * Friends tolerate others smoking Almost no friends 1 Some friends Most friends Perceived social acceptability of smoking Low 1 Average * High * P-value <0.05 According to the final model, adjusting for educational level and friends tolerance of others smoking, males aged were 6.39 times more likely to smoke than males aged Males who were laid-off were times as likely to smoke as males who were currently smoking. Males whose friends were mostly smokers were times more likely to smoke than males with almost no friends smoking. Males who felt somewhat accepted by society to smoke had a 6.98 times higher risk to smoke compared to males who felt unaccepted by the society to smoke. CONCLUSION SAS has been widely used in qualitative research in the field of Public Health. This study is an example of the application of SAS in an epidemiological study on smoking behavior. The findings of this study have verified the Behavioral Ecological Model statistically regarding the smoking behavior in the Chinese male. This will help understand smoking behavior in the social environment and tailor intervention for future health promotion in China.

8 CONTACT INFORMATION Ding Ding Graduate School of Public Health, San Diego State University Center for Behavioral Epidemiology and Community Health, 9245 Sky Park Court, Suite 230 San Diego, CA,

How to set the main menu of STATA to default factory settings standards

How to set the main menu of STATA to default factory settings standards University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be

More information

Multiple logistic regression analysis of cigarette use among high school students

Multiple logistic regression analysis of cigarette use among high school students Multiple logistic regression analysis of cigarette use among high school students ABSTRACT Joseph Adwere-Boamah Alliant International University A binary logistic regression analysis was performed to predict

More information

Experiment on Web based recruitment of Cell Phone Only respondents

Experiment on Web based recruitment of Cell Phone Only respondents Experiment on Web based recruitment of Cell Phone Only respondents 2008 AAPOR Annual Conference, New Orleans By: Chintan Turakhia, Abt SRBI Inc. Mark A. Schulman, Abt SRBI Inc. Seth Brohinsky, Abt SRBI

More information

Can Annuity Purchase Intentions Be Influenced?

Can Annuity Purchase Intentions Be Influenced? Can Annuity Purchase Intentions Be Influenced? Jodi DiCenzo, CFA, CPA Behavioral Research Associates, LLC Suzanne Shu, Ph.D. UCLA Anderson School of Management Liat Hadar, Ph.D. The Arison School of Business,

More information

ABSTRACT INTRODUCTION STUDY DESCRIPTION

ABSTRACT INTRODUCTION STUDY DESCRIPTION ABSTRACT Paper 1675-2014 Validating Self-Reported Survey Measures Using SAS Sarah A. Lyons MS, Kimberly A. Kaphingst ScD, Melody S. Goodman PhD Washington University School of Medicine Researchers often

More information

Survey of Wisconsin Renters: Smoking Rules, Attitudes & Preferences

Survey of Wisconsin Renters: Smoking Rules, Attitudes & Preferences Report > May 2012 Joseph Cera Survey Center Manager Researcher Lilana Guillermo Undergraduate Research Assistant Peter Maier Acting Center Director Associate Scientist Survey of Wisconsin Renters: Smoking

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

In ^YOUR1 lifetime, ^HAVE ^YOU2 smoked a total of 100 or more cigarettes (about 4 packs)?

In ^YOUR1 lifetime, ^HAVE ^YOU2 smoked a total of 100 or more cigarettes (about 4 packs)? SMOKING (SMK) SMK_BEG SMK_C1 SMK_QINT SMK_Q201A SMKE_01A If (do SMK block = 1), go to SMK_QINT. Otherwise, go to SMK_END. The next questions are about smoking. INTERVIEWER: Press to continue. In

More information

Binary Logistic Regression

Binary Logistic Regression Binary Logistic Regression Main Effects Model Logistic regression will accept quantitative, binary or categorical predictors and will code the latter two in various ways. Here s a simple model including

More information

Organizing Your Approach to a Data Analysis

Organizing Your Approach to a Data Analysis Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize

More information

Complementary and alternative medicine use in Chinese women with breast cancer: A Taiwanese survey

Complementary and alternative medicine use in Chinese women with breast cancer: A Taiwanese survey Complementary and alternative medicine use in Chinese women with breast cancer: A Taiwanese survey Dr Fang-Ying (Sylvia) Chu Department of Nursing, Tzu Chi College of Technology, Hua Lien, Taiwan 1 BACKGROUND

More information

SUGI 29 Statistics and Data Analysis

SUGI 29 Statistics and Data Analysis Paper 194-29 Head of the CLASS: Impress your colleagues with a superior understanding of the CLASS statement in PROC LOGISTIC Michelle L. Pritchard and David J. Pasta Ovation Research Group, San Francisco,

More information

Where there is a will

Where there is a will Where there is a will Legal Documents Among The 50+ Population: Findings From an AARP Survey Prepared for AARP Program Development and Services by AARP Research Group Released April 2000 LEGAL DOCUMENTS

More information

NHS Diabetes Prevention Programme (NHS DPP) Non-diabetic hyperglycaemia. Produced by: National Cardiovascular Intelligence Network (NCVIN)

NHS Diabetes Prevention Programme (NHS DPP) Non-diabetic hyperglycaemia. Produced by: National Cardiovascular Intelligence Network (NCVIN) NHS Diabetes Prevention Programme (NHS DPP) Non-diabetic hyperglycaemia Produced by: National Cardiovascular Intelligence Network (NCVIN) Date: August 2015 About Public Health England Public Health England

More information

Basic Statistical and Modeling Procedures Using SAS

Basic Statistical and Modeling Procedures Using SAS Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@

Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Yanchun Xu, Andrius Kubilius Joint Commission on Accreditation of Healthcare Organizations,

More information

Finding Supporters. Political Predictive Analytics Using Logistic Regression. Multivariate Solutions

Finding Supporters. Political Predictive Analytics Using Logistic Regression. Multivariate Solutions Finding Supporters Political Predictive Analytics Using Logistic Regression Multivariate Solutions What is Logistic Regression? In a political application, logistic regression can describe the outcome

More information

The association between health risk status and health care costs among the membership of an Australian health plan

The association between health risk status and health care costs among the membership of an Australian health plan HEALTH PROMOTION INTERNATIONAL Vol. 18, No. 1 Oxford University Press 2003. All rights reserved Printed in Great Britain The association between health risk status and health care costs among the membership

More information

A PROSPECTIVE EVALUATION OF THE RELATIONSHIP BETWEEN REASONS FOR DRINKING AND DSM-IV ALCOHOL-USE DISORDERS

A PROSPECTIVE EVALUATION OF THE RELATIONSHIP BETWEEN REASONS FOR DRINKING AND DSM-IV ALCOHOL-USE DISORDERS Pergamon Addictive Behaviors, Vol. 23, No. 1, pp. 41 46, 1998 Copyright 1998 Elsevier Science Ltd Printed in the USA. All rights reserved 0306-4603/98 $19.00.00 PII S0306-4603(97)00015-4 A PROSPECTIVE

More information

Internet Gambling in Canada: Prevalence, Patterns and Land-Based Comparisons

Internet Gambling in Canada: Prevalence, Patterns and Land-Based Comparisons Internet Gambling in Canada: Prevalence, Patterns and Land-Based Comparisons Dr. Robert Wood & Dr. Robert Williams University of Lethbridge Current Study Funded by a Level IV grant from the Ontario Problem

More information

Diabetes Prevention in Latinos

Diabetes Prevention in Latinos Diabetes Prevention in Latinos Matthew O Brien, MD, MSc Assistant Professor of Medicine and Public Health Northwestern Feinberg School of Medicine Institute for Public Health and Medicine October 17, 2013

More information

Assignments Analysis of Longitudinal data: a multilevel approach

Assignments Analysis of Longitudinal data: a multilevel approach Assignments Analysis of Longitudinal data: a multilevel approach Frans E.S. Tan Department of Methodology and Statistics University of Maastricht The Netherlands Maastricht, Jan 2007 Correspondence: Frans

More information

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: What do the data look like? Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses

More information

Excel Charts & Graphs

Excel Charts & Graphs MAX 201 Spring 2008 Assignment #6: Charts & Graphs; Modifying Data Due at the beginning of class on March 18 th Introduction This assignment introduces the charting and graphing capabilities of SPSS and

More information

Individual and province inequalities in health among older people in China: evidence and policy implications

Individual and province inequalities in health among older people in China: evidence and policy implications Individual and province inequalities in health among older people in China: evidence and policy implications Prof Maria Evandrou, Prof Jane Falkingham, Dr Zhixin Frank Feng, Dr Athina Vlachantoni Centre

More information

IBM SPSS Direct Marketing 22

IBM SPSS Direct Marketing 22 IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release

More information

Utilization of Services in a Randomized Trial Testing Phone- and Web-based Behavioral Interventions for Smoking Cessation

Utilization of Services in a Randomized Trial Testing Phone- and Web-based Behavioral Interventions for Smoking Cessation Utilization of Services in a Randomized Trial Testing Phone- and Web-based Behavioral Interventions for Smoking Cessation Susan M. Zbikowski, Ph.D. Society of Behavioral Medicine April 22-25, 2009 Introduction

More information

The Demand for Financial Planning Services 1

The Demand for Financial Planning Services 1 The Demand for Financial Planning Services 1 Sherman D. Hanna, Ohio State University Professor, Consumer Sciences Department Ohio State University 1787 Neil Avenue Columbus, OH 43210-1290 Phone: 614-292-4584

More information

Caregiving Impact on Depressive Symptoms for Family Caregivers of Terminally Ill Cancer Patients in Taiwan

Caregiving Impact on Depressive Symptoms for Family Caregivers of Terminally Ill Cancer Patients in Taiwan Caregiving Impact on Depressive Symptoms for Family Caregivers of Terminally Ill Cancer Patients in Taiwan Siew Tzuh Tang, RN, DNSc Associate Professor, School of Nursing Chang Gung University, Taiwan

More information

Analyzing and interpreting data Evaluation resources from Wilder Research

Analyzing and interpreting data Evaluation resources from Wilder Research Wilder Research Analyzing and interpreting data Evaluation resources from Wilder Research Once data are collected, the next step is to analyze the data. A plan for analyzing your data should be developed

More information

Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank

Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing C. Olivia Rud, VP, Fleet Bank ABSTRACT Data Mining is a new term for the common practice of searching through

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

Modeling Lifetime Value in the Insurance Industry

Modeling Lifetime Value in the Insurance Industry Modeling Lifetime Value in the Insurance Industry C. Olivia Parr Rud, Executive Vice President, Data Square, LLC ABSTRACT Acquisition modeling for direct mail insurance has the unique challenge of targeting

More information

Logistic (RLOGIST) Example #7

Logistic (RLOGIST) Example #7 Logistic (RLOGIST) Example #7 SUDAAN Statements and Results Illustrated EFFECTS UNITS option EXP option SUBPOPX REFLEVEL Input Data Set(s): SAMADULTED.SAS7bdat Example Using 2006 NHIS data, determine for

More information

If You Think Investing is Gambling, You re Doing it Wrong!

If You Think Investing is Gambling, You re Doing it Wrong! If You Think Investing is Gambling, You re Doing it Wrong! Warren Buffet Jennifer Arthur, M.Sc. PhD Candidate, University of Adelaide Supervisor: Dr. Paul Delfabbro 10th European Conference on Gambling

More information

FACILITATOR/MENTOR GUIDE

FACILITATOR/MENTOR GUIDE FACILITATOR/MENTOR GUIDE Descriptive analysis variables table shells hypotheses Measures of association methods design justify analytic assess calculate analysis problem stratify confounding statistical

More information

When to Use a Particular Statistical Test

When to Use a Particular Statistical Test When to Use a Particular Statistical Test Central Tendency Univariate Descriptive Mode the most commonly occurring value 6 people with ages 21, 22, 21, 23, 19, 21 - mode = 21 Median the center value the

More information

IBM SPSS Direct Marketing 23

IBM SPSS Direct Marketing 23 IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release

More information

HSRA2011 The Impacts of Health Insurance on Health Care Utilization Among the Elderly in China

HSRA2011 The Impacts of Health Insurance on Health Care Utilization Among the Elderly in China The Impacts of Health Insurance on Health Care Utilization Among the Elderly in China Xin Li, PhD Shanghai Jiao Tong University, China Outline Background Literature Objectives Methods Results Conclusions

More information

With Depression Without Depression 8.0% 1.8% Alcohol Disorder Drug Disorder Alcohol or Drug Disorder

With Depression Without Depression 8.0% 1.8% Alcohol Disorder Drug Disorder Alcohol or Drug Disorder Minnesota Adults with Co-Occurring Substance Use and Mental Health Disorders By Eunkyung Park, Ph.D. Performance Measurement and Quality Improvement May 2006 In Brief Approximately 16% of Minnesota adults

More information

How to Get More Value from Your Survey Data

How to Get More Value from Your Survey Data Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2

More information

Substance Use, Treatment Need and Receipt of Treatment in Minnesota:

Substance Use, Treatment Need and Receipt of Treatment in Minnesota: Substance Use, Treatment Need and Receipt of Treatment in Minnesota: Results from Minnesota Student Survey, Minnesota Survey on Adult Substance Use, and Drug and Alcohol Abuse Normative Evaluation System

More information

Logistic (RLOGIST) Example #3

Logistic (RLOGIST) Example #3 Logistic (RLOGIST) Example #3 SUDAAN Statements and Results Illustrated PREDMARG (predicted marginal proportion) CONDMARG (conditional marginal proportion) PRED_EFF pairwise comparison COND_EFF pairwise

More information

Customer Profiling for Marketing Strategies in a Healthcare Environment MaryAnne DePesquo, Phoenix, Arizona

Customer Profiling for Marketing Strategies in a Healthcare Environment MaryAnne DePesquo, Phoenix, Arizona Paper 1285-2014 Customer Profiling for Marketing Strategies in a Healthcare Environment MaryAnne DePesquo, Phoenix, Arizona ABSTRACT In this new era of healthcare reform, health insurance companies have

More information

CELL PHONE USE WHILE DRIVING April, 2014

CELL PHONE USE WHILE DRIVING April, 2014 CELL PHONE USE WHILE DRIVING April, 2014 KEY POINTS: Cell phone use while driving decreased after legislation banning the activity was introduced and again after enactment of fines, but increased in 2012.

More information

Assessment of Factors Affecting Clinical Practice Competency of Undergraduate Health Science Students in Hawassa University, South, Ethiopia

Assessment of Factors Affecting Clinical Practice Competency of Undergraduate Health Science Students in Hawassa University, South, Ethiopia Assessment of Factors Affecting Clinical Practice Competency of Undergraduate Health Science Students in Hawassa University, South, Ethiopia Rekiku Fikre College of medicine and health sciences, Department

More information

Inferential Statistics. What are they? When would you use them?

Inferential Statistics. What are they? When would you use them? Inferential Statistics What are they? When would you use them? What are inferential statistics? Why learn about inferential statistics? Why use inferential statistics? When are inferential statistics utilized?

More information

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Logistic regression generalizes methods for 2-way tables Adds capability studying several predictors, but Limited to

More information

Living a happy, healthy and satisfying life Tineke de Jonge, Christianne Hupkens, Jan-Willem Bruggink, Statistics Netherlands, 15-09-2009

Living a happy, healthy and satisfying life Tineke de Jonge, Christianne Hupkens, Jan-Willem Bruggink, Statistics Netherlands, 15-09-2009 Living a happy, healthy and satisfying life Tineke de Jonge, Christianne Hupkens, Jan-Willem Bruggink, Statistics Netherlands, 15-09-2009 1 Introduction Subjective well-being (SWB) refers to how people

More information

Multivariate Logistic Regression

Multivariate Logistic Regression 1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation

More information

Logistic (RLOGIST) Example #1

Logistic (RLOGIST) Example #1 Logistic (RLOGIST) Example #1 SUDAAN Statements and Results Illustrated EFFECTS RFORMAT, RLABEL REFLEVEL EXP option on MODEL statement Hosmer-Lemeshow Test Input Data Set(s): BRFWGT.SAS7bdat Example Using

More information

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090 Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 trhodes@mib.com Abstract

More information

ln(p/(1-p)) = α +β*age35plus, where p is the probability or odds of drinking

ln(p/(1-p)) = α +β*age35plus, where p is the probability or odds of drinking Dummy Coding for Dummies Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health ABSTRACT There are a number of ways to incorporate categorical variables into

More information

GETTING YOUR DATA INTO SPSS

GETTING YOUR DATA INTO SPSS GETTING YOUR DATA INTO SPSS UNIVERSITY OF GUELPH LUCIA COSTANZO lcostanz@uoguelph.ca REVISED SEPTEMBER 2011 CONTENTS Getting your Data into SPSS... 0 SPSS availability... 3 Data for SPSS Sessions... 4

More information

Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models

Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models Prepared by Jim Gaetjens Presented to the Institute of Actuaries of Australia

More information

HIGH-RISK STOCK TRADING: INVESTMENT OR GAMBLING?

HIGH-RISK STOCK TRADING: INVESTMENT OR GAMBLING? HIGH-RISK STOCK TRADING: INVESTMENT OR GAMBLING? Jennifer Arthur, M.Sc. PhD Candidate, University of Adelaide Co-Authors: Dr. Paul Delfabbro & Dr. Robert Williams 14 th Annual Alberta Research Gambling

More information

Aileen Murphy, Department of Economics, UCC, Ireland. WORKING PAPER SERIES 07-10

Aileen Murphy, Department of Economics, UCC, Ireland. WORKING PAPER SERIES 07-10 AN ECONOMETRIC ANALYSIS OF SMOKING BEHAVIOUR IN IRELAND Aileen Murphy, Department of Economics, UCC, Ireland. DEPARTMENT OF ECONOMICS WORKING PAPER SERIES 07-10 1 AN ECONOMETRIC ANALYSIS OF SMOKING BEHAVIOUR

More information

Free Trial - BIRT Analytics - IAAs

Free Trial - BIRT Analytics - IAAs Free Trial - BIRT Analytics - IAAs 11. Predict Customer Gender Once we log in to BIRT Analytics Free Trial we would see that we have some predefined advanced analysis ready to be used. Those saved analysis

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

Categorical Data Analysis

Categorical Data Analysis Richard L. Scheaffer University of Florida The reference material and many examples for this section are based on Chapter 8, Analyzing Association Between Categorical Variables, from Statistical Methods

More information

Data Analysis, Research Study Design and the IRB

Data Analysis, Research Study Design and the IRB Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB

More information

Provision of Smoking Cessation by Ontario Dental Health Professionals

Provision of Smoking Cessation by Ontario Dental Health Professionals Provision of Smoking Cessation by Ontario Dental Health Professionals Alexey Babayan Jolene Dubray Farzana Haji Robert Schwartz May 2012 Ontario Tobacco Research Unit i Suggested Citation: Babayan A, Dubray

More information

T obacco use is rising among young adults (aged 18 24

T obacco use is rising among young adults (aged 18 24 ii20 Tobacco use by Massachusetts public college students: long term effect of the Massachusetts Tobacco Control Program N A Rigotti, S Regan, N E Majchrzak, J R Knight, H Wechsler... Tobacco Control 2002;11(Suppl

More information

Americans Current Views on Smoking 2013: An AARP Bulletin Survey

Americans Current Views on Smoking 2013: An AARP Bulletin Survey Americans Current Views on Smoking 2013: An AARP Bulletin Survey November 2013 Americans Current Views on Smoking 2013: An AARP Bulletin Survey Report Prepared by Al Hollenbeck, Ph.D. Copyright 2013 AARP

More information

WORKSHOP #5: DATA MANAGEMENT FOR DIETARY INTERVENTIONS

WORKSHOP #5: DATA MANAGEMENT FOR DIETARY INTERVENTIONS Global Nutrition and Epidemiologic Transition Initiative WORKSHOP #5: DATA MANAGEMENT FOR DIETARY INTERVENTIONS Biling Hong and Donna Spiegelman Harvard School of Public Health Software used in the data

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

THE CORRELATION BETWEEN PHYSICAL HEALTH AND MENTAL HEALTH

THE CORRELATION BETWEEN PHYSICAL HEALTH AND MENTAL HEALTH HENK SWINKELS (STATISTICS NETHERLANDS) BRUCE JONAS (US NATIONAL CENTER FOR HEALTH STATISTICS) JAAP VAN DEN BERG (STATISTICS NETHERLANDS) THE CORRELATION BETWEEN PHYSICAL HEALTH AND MENTAL HEALTH IN THE

More information

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form. General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary

More information

Washoe County Senior Services 2013 Survey Data: Service User

Washoe County Senior Services 2013 Survey Data: Service User Washoe County Senior Services 2013 Survey Data: Service User Profile Prepared by Zebbedia G. Gibb & Peter Reed UNR Sanford Center for Aging (Jan 2014) Overall Summary: Income was the single best predictor

More information

The Impact of Familial and Marital Status on the Performance of Life Insurance Agents The Case of Taiwan

The Impact of Familial and Marital Status on the Performance of Life Insurance Agents The Case of Taiwan International Journal of Business and Economics, 2013, Vol. 12, No. 1, 15-26 The Impact of Familial and Marital Status on the Performance of Life Insurance Agents The Case of Taiwan Chiehwei Hung Department

More information

Mind on Statistics. Chapter 4

Mind on Statistics. Chapter 4 Mind on Statistics Chapter 4 Sections 4.1 Questions 1 to 4: The table below shows the counts by gender and highest degree attained for 498 respondents in the General Social Survey. Highest Degree Gender

More information

2009 Mississippi Youth Tobacco Survey. Office of Health Data and Research Office of Tobacco Control Mississippi State Department of Health

2009 Mississippi Youth Tobacco Survey. Office of Health Data and Research Office of Tobacco Control Mississippi State Department of Health 9 Mississippi Youth Tobacco Survey Office of Health Data and Research Office of Tobacco Control Mississippi State Department of Health Acknowledgements... 1 Glossary... 2 Introduction... 3 Sample Design

More information

Logistic Regression. BUS 735: Business Decision Making and Research

Logistic Regression. BUS 735: Business Decision Making and Research Goals of this section 2/ 8 Specific goals: Learn how to conduct regression analysis with a dummy independent variable. Learning objectives: LO2: Be able to construct and use multiple regression models

More information

EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08

EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08 EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08 PURPOSE Matthew Wetstein, Alyssa Nguyen & Brianna Hays The purpose of the present study was to identify specific

More information

Role of Customer Response Models in Customer Solicitation Center s Direct Marketing Campaign

Role of Customer Response Models in Customer Solicitation Center s Direct Marketing Campaign Role of Customer Response Models in Customer Solicitation Center s Direct Marketing Campaign Arun K Mandapaka, Amit Singh Kushwah, Dr.Goutam Chakraborty Oklahoma State University, OK, USA ABSTRACT Direct

More information

Disparities Between Asthma Management and Insurance Type Among Children

Disparities Between Asthma Management and Insurance Type Among Children o r i g i n a l c o m m u n i c a t i o n Disparities Between Asthma Management and Insurance Type Among Children Crystal N. Piper, MPH, MHA, PhD; Keith Elder, PhD; Saundra Glover, PhD; Jong-Deuk Baek,

More information

The Effects of Demographics on Consumer Perceptions of Identity Theft in Rural and Urban Settings

The Effects of Demographics on Consumer Perceptions of Identity Theft in Rural and Urban Settings The Effects of Demographics on Consumer Perceptions of Identity Theft in Rural and Urban Settings Axton Betz, Ph.D. Student, Iowa State University 1 Identity theft is a serious crime in which an individual

More information

Seroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013

Seroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013 Seroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013 Muhammad Shakir Balogun COHORT 3 Nigeria-FELTP Supervisors: Dr. AT Olayinka, Dr. AI Mamman Outline Background Methodology

More information

How to Become a Research Associate in San Diego

How to Become a Research Associate in San Diego SAN DIEGO COMMUNITY COLLEGE DISTRICT EMPLOYMENT OPPORTUNITY Research Associate THE POSITION Applications are now being accepted to establish a current six-month eligibility list for the classification

More information

A Decision Guide on the Uses and Applications of EpiData Entry and EpiData Analysis Software

A Decision Guide on the Uses and Applications of EpiData Entry and EpiData Analysis Software A Decision Guide on the Uses and Applications of EpiData Entry and EpiData Analysis Software Created as part of a collaborative project 1 : Association of Public Health Epidemiologists in Ontario (APHEO),

More information

Same-sex Couples Consistency in Reports of Marital Status. Housing and Household Economic Statistics Division

Same-sex Couples Consistency in Reports of Marital Status. Housing and Household Economic Statistics Division Same-sex Couples Consistency in Reports of Marital Status Author: Affiliation: Daphne Lofquist U.S. Census Bureau Housing and Household Economic Statistics Division Phone: 301-763-2416 Fax: 301-457-3500

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

A CASE-CONTROL STUDY OF DOG BITE RISK FACTORS IN A DOMESTIC SETTING TO CHILDREN AGED 9 YEARS AND UNDER

A CASE-CONTROL STUDY OF DOG BITE RISK FACTORS IN A DOMESTIC SETTING TO CHILDREN AGED 9 YEARS AND UNDER A CASE-CONTROL STUDY OF DOG BITE RISK FACTORS IN A DOMESTIC SETTING TO CHILDREN AGED 9 YEARS AND UNDER L. Watson, K. Ashby, L. Day, S. Newstead, E. Cassell Background In Victoria, Australia, an average

More information

Laboratory 3 Type I, II Error, Sample Size, Statistical Power

Laboratory 3 Type I, II Error, Sample Size, Statistical Power Laboratory 3 Type I, II Error, Sample Size, Statistical Power Calculating the Probability of a Type I Error Get two samples (n1=10, and n2=10) from a normal distribution population, N (5,1), with population

More information

Society of Actuaries Middle Market Life Insurance Segmentation Program (Phase 1: Young Families)

Society of Actuaries Middle Market Life Insurance Segmentation Program (Phase 1: Young Families) Society of Actuaries Middle Market Life Insurance Segmentation Program (Phase 1: Young Families) September 2012 Sponsored By: SOA Marketing and Distribution Section SOA Product Development Section SOA

More information

A secondary analysis of primary care survey data to explore differences in response by ethnicity.

A secondary analysis of primary care survey data to explore differences in response by ethnicity. A secondary analysis of primary care survey data to explore differences in response by ethnicity. A report commissioned by the National Association for Patient Participation Autumn 2006 1.1 Introduction

More information

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification

More information

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way

More information

Constructing a Basefile for Simulating Kunming s Medical Insurance Scheme of Urban Employees

Constructing a Basefile for Simulating Kunming s Medical Insurance Scheme of Urban Employees INTERNATIONAL JOURNAL OF MICROSIMULATION (2011) 4(3) 3-16 Constructing a Basefile for Simulating Kunming s Medical Insurance Scheme of Urban Employees Xiong Linping Department of Health Services Management,

More information

Tobacco Use Among Students Attending Historically Black Colleges and Universities: Prevalence, Patterns and Norms

Tobacco Use Among Students Attending Historically Black Colleges and Universities: Prevalence, Patterns and Norms 12th Annual Summer Public Health Research Videoconference on Minority Health Tobacco Use Among Students Attending Historically Black Colleges and Universities: Prevalence, Patterns and Norms Michelle Laws,

More information

PharmaSUG2011 Paper HS03

PharmaSUG2011 Paper HS03 PharmaSUG2011 Paper HS03 Using SAS Predictive Modeling to Investigate the Asthma s Patient Future Hospitalization Risk Yehia H. Khalil, University of Louisville, Louisville, KY, US ABSTRACT The focus of

More information

Simple Random Sampling

Simple Random Sampling Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Row vs. Column Percents. tab PRAYER DEGREE, row col

Row vs. Column Percents. tab PRAYER DEGREE, row col Bivariate Analysis - Crosstabulation One of most basic research tools shows how x varies with respect to y Interpretation of table depends upon direction of percentaging example Row vs. Column Percents.

More information

New York Study of Booster Seat Effects on Injury Reduction Compared to Safety Belts in Children Aged 4-8 in Motor Vehicle Crashes

New York Study of Booster Seat Effects on Injury Reduction Compared to Safety Belts in Children Aged 4-8 in Motor Vehicle Crashes New York Study of Booster Seat Effects on Injury Reduction Compared to Safety Belts in Children Aged 4-8 in Motor Vehicle Crashes Kainan Sun, Ph.D., Michael Bauer, M.S. Sarah M. Sperry, M.S., Susan Hardman

More information

Beware that Low Urine Creatinine! by Vera F. Dolan MSPH FALU, Michael Fulks MD, Robert L. Stout PhD

Beware that Low Urine Creatinine! by Vera F. Dolan MSPH FALU, Michael Fulks MD, Robert L. Stout PhD 1 Beware that Low Urine Creatinine! by Vera F. Dolan MSPH FALU, Michael Fulks MD, Robert L. Stout PhD Executive Summary: The presence of low urine creatinine at insurance testing is associated with increased

More information

Medical expenditures of work-related injuries among immigrant workers in USA

Medical expenditures of work-related injuries among immigrant workers in USA Medical expenditures of work-related injuries among immigrant workers in USA Huiyun Xiang, MD, MPH, PhD Associate Professor Director for International Programs Center for Injury Research and Policy The

More information

Predictors of Physical Therapy Use in Patients with Rheumatoid Arthritis

Predictors of Physical Therapy Use in Patients with Rheumatoid Arthritis Predictors of Physical Therapy Use in Patients with Rheumatoid Arthritis Maura Iversen,, PT, DPT, SD, MPH 1,2,3 Ritu Chhabriya,, MSPT 4 Nancy Shadick, MD 2,3 1 Department of Physical Therapy, Northeastern

More information