Assignments Analysis of Longitudinal data: a multilevel approach
|
|
- Luke West
- 8 years ago
- Views:
Transcription
1 Assignments Analysis of Longitudinal data: a multilevel approach Frans E.S. Tan Department of Methodology and Statistics University of Maastricht The Netherlands Maastricht, Jan 2007 Correspondence: Frans E.S. Tan, Methodology and Statistics, University of Maastricht, P. O. box 616, 6200 MD Maastricht, The Netherlands. Tel: , frans.tan@stat.unimaas.nl 1
2 See guidelines for performing a multilevel/longitudinal data analysis at the back. 1. Growth data (SPSS system file: growthdata.sav). (Pothoff & Roy) Study design: Orthodontic growth measurements for 11 girls and 16 boys For each subject the distance (in mm) between the pituitary and the maximally fissure was recorded at ages 8,10,12,14. These two locations can be easily identified on x-ray. Variables: - Distance - Sex: 0 =boy; 1=girl - Age: Age in years Design: Distance Measurements all subjects X X X X age Analyse the data by comparing (using the linear regression method) growth and growth velocity between boys and girls. Consult the following sub-questions: a. Plot the distance versus age for boys and girls separately (in one plot by means of set markers by sex). Plot also the fitted lines through the scatter plots (in the chart option). b. Perform a standard linear regression analysis of the following model dis tan ce = β 0 1Age 2Sex 3 Age Sex + ε. Save the unstandardised predicted values. c. Plot the predicted values versus age for boys and girls separately in one plot. Which model describes the observed data best? Argue whether there is a difference between boys and girls w.r.t. growth-velocity of the head circumference? Show that the regression parameter β 3 can be interpreted as the difference between the regression slopes of both sexes. 2. Consider the study about the relationship between alcohol consumption and violent behaviour (SPSS system file: alc_violent.sav). Study design: Random sample of five subjects. Each subject was measured five times between 1950 and 1958 Not all subjects were measured in the same year (unbalanced design) Goals study: testing the hypothesis that alcohol is positively related to violent behaviour 2
3 Design: X= Violent/alcohol Measurements Subject 5 X X X X X Subject 4 X X X X X Subject 3 X X X X X Subject 2 X X X X X Subject 1 X X X X X time Analyse the relationship between alcohol consumption and violent behaviour. Is there a difference between the relationship at the group level and at the subject level? Consult the following sub-questions: a. Plot violent versus alcohol. Describe what you see. b. Perform a standard linear regression analysis of violent on alcohol and save the unstandardised predicted values. Plot the predicted values versus alcohol to visualized your findings and interpret your results. c. Plot violent versus alcohol for each subject (use set markers subjects). d. Perform a standard linear regression analysis of the relationship between alcohol vs. violent behaviour, and with subject as a discrete covariate (use dummy variables) in the regression model (also save your unstandardised predicted values). Compare your results with that of b (do not forget to Plot to visualized your findings. Set markers by: subject). Can you explain the difference? 3. Growth data (Pothoff & Roy) (SPSS system file: growthdata.sav). Consider the study about the orthodontic growth of boys and girls. Compare growth and growth velocity between boys and girls as in assignment 1 using the SPSS option mixed (see guidelines). Analyse with OLS and with random effects and compare the two methods. Consult the following sub-questions: a. Plot the subject specific profiles (a plot of individual changes over time). See guidelines b. Plot the mean profiles (a plot of mean changes over time, separately for boys and girls) Question: Compare the growth velocity of boys and girls at the group level and at the subject level. Describe what you see based on these plots. c. Perform an OLS regression analysis with Mixed Models of the following model dis tan ce = β 0 1Age 2Sex 3 Age Sex + ε
4 Questions: Compare with your findings from assignment 1. What is the interpretation ofβ0 1Age 2Sex 3Age Sex? Indicate this in the plots. d. Perform a random intercept model Questions: Is the interaction between age and sex significant? Compare the output with that of (c). Explain the discrepancy with respect to the s.e. s of b 3. What is the interpretation ofβ 0 i 1Age 2Sex 3 Age Sex for a specific subject i? Indicate this in the plots. What is the interpretation of the first-level variance (R cov -matrix)? What is the interpretation of the second-level variance (G cov -matrix)? What is the interpretation of the overall variances and covariances (correlations) (V cov matrix)? Indicate these in the plots. Determine the V cov matrix and V corr matrix and interpret the ICC. e. Perform a marginal model with the (homogeneous) Compound symmetry covariance structure. Questions: Compare the variances and covariances of the output with that of (d). f. Perform a random intercept model with an AR(1) serial correlation. Compare the results with that of (d). g. Perform a random slope (random slope for age) model Question: Compare all the output and argue which model you would prefer. Determine the V cov matrix of the models in f and g. 4. Aggregation of longitudinal data; Ecological fallacy Consider the study about the relationship between alcohol consumption and violent behaviour (SPSS system file: alc_violent.sav). Analyse the relationship about alcohol and violent as in assignment 2 and using SPSS option mixed. Analyse with OLS and with random effects and compare the two methods. Consult the following sub-questions: a. Plot violent vs. alcohol for each subject (use the Interactive scatter plot option). b. What can you say about the subject specific relationship between alcohol and violent behaviour? c. Perform an OLS regression analysis with Mixed Models of the relationship between alcohol and violent behaviour. d. Compare the output in (c) with what you expected considering the plot in (a). e. Perform a mixed model analysis with a random intercept model to study the subjectspecific relationship between alcohol vs. violent behaviour and compare this output with the previous ones (set estimation maximum scoring steps : 10). 4
5 f. Compare the results from (e) with the results from assignment (2.d). g. Discuss the overall results of the analysis that you have performed and relate the results to the "ecological fallacy" phenomenon. 5. Interpersonal proximity Description of the study (teacher.sav) Brekelmans and Creton (1993) made a study of the development over time of evaluations of teachers by their pupils. Starting from the first year of their teaching career, teachers were evaluated on their interpersonal behaviour in the classroom. This happened repeatedly, at intervals of about one year. Results are presented about the proximity dimension, representing the degree of cooperation or closeness between a teacher and his or her students. The higher the proximity score of a teacher, the more cooperation is perceived by his or her students. There are four measurement occasions: after 0, 1, 2, and 3 years of experience. Thus, the time variable assumes the values 0 through 3. A total of 51 teachers were studied. The number of observations for the 4 moments decreased from 46 at t=0 to 32 at t=3. Hence, we are dealing with an unbalanced design. Non-response at various moments may be considered to be random. Another variable in the dataset is gender. Gender (0=male; 1=female) could possibly be a predictor of the proximity score of the teacher. It is also possible that gender has an influence on the relationship between the measurement occasion and the proximity score. Note: In the data file there is also a variable occ_cat. This variable is identical to the variable occ. Design (there are missing observations): Proximity Measurements all teachers X X X X occasion a. Discuss the multilevel design b. Plot the teacher specific proximity score vs. occasion (use the variable occ ) for each Gender and a plot of the gender specific proximity score vs. occasion c. What can you say about the (Teacher specific/ gender specific) pattern of proximity score across occasions? d. Argue that the following model specification does make sense. Pr ox = β0 1occ0 2occ1 3occ2 4Sex 5occ0 Sex 6occ1 Sex 7occ2 random part + R The variable occ i denotes the dummy variable for occasion i, i = 1,2. Can you make an educated guess whether a random intercept or a random slope (with random intercept) would be most appropriate to describe the data? e. Perform an OLS regression of the model specified in (d), with Occasion as a categorical variable (factor) and interaction occasion and Gender. Sex + 5
6 f. Perform successively with the same fixed part as in (e) (set estimation maximum scoring steps : 10): 0. A random intercept model. 1. A random intercept/slope model (take the random slope of the quantitative variable occasion. Use as cov. Type for the random effects: Unstructured). 2. A random intercept/slope model with an AR (1) serial correlation and homogeneous variances. 3. A random intercept model with AR (1) and heterogeneous variances and 4. A model with AR (1) and heterogeneous variances. Compare with (e). g. Calculate the corresponding V cov -matrices following the calculations mentioned in the transparencies. h. Which model would you consider as most appropriate and why? i. Would you conclude that there is a difference in change of proximity score between male and female teachers (fit the model for male and female teachers separately)? 6. Growth data (Pothoff & Roy) (SPSS system file: growthdata.sav). Consider the study about the orthodontic growth of boys and girls. a. Run the OLS model dis tan ce = β 0 1Age 2Sex 3 Age Sex + ε and plot the residuals against age. Go to Graphs Interactive Line... Click the reset button. Drag the variable resid_1 to the box for the y-variable Drag the variable age to the box for the x-variable. Drag the variable Subj to the 'color' box. Right-click on the variable Subj and select categorical. Drag the variable sex to the 'style' box. Select convert. Click paste and run the syntax. Questions: What can you say about the variance over time? b. Plot a scatter-plot matrix of the residuals and determine the correlation matrix for the different time points First, transpose the data: The values of Age should be transposed into columns. respnr sex age growth
7 respnr sex dist_8 dist_10 dist_12 dist_ Transposing rows into columns: Rename the variable Distance to Dist and resid_1 to res_1 with the following syntax: Rename variables distance = dist. Rename variables resid_1=res. (Note: If you do not rename the variables, then the final variable names will be too long after transposing the data). Go to Data Restructure. Spss asks you if you want to save: Do not save (it is not necessary to save the new dist and res variable). Choose Restructure selected cases into variables. Click volgende. Subj is the Identifier variable, Age is the Index variable. Click Volgende 3 times. Choose paste the syntax generated by the wizard into a syntax window Click voltooien. Run the syntax. Save your datafile under a new name. Scatter-plot matrix of residuals: Go to Graphs Scatter Choose matrix. Select res.8, res.10, res.12, res.14 and put them in the Matrix Variables box. Click paste and run the syntax. correlationmatrix of the responses: Go to Analyze Correlate Bivariate... Select the variables dist.8, dist.10, dist.12 and dist.14. Click paste and run the syntax. Questions: Do the scatter-plots change with time? Which model will probably come out based on the exploratory analysis? 7
8 c. Consider some other (reasonable) alternative models with different covariance structures than the random intercept model and determine the most adequate model using the basic guidelines mentioned in the course. d. Check whether the proximity model in assignment 5 will also be obtained following the basic guidelines mentioned in the course. 7. Life event study (SPSS system file: lifesubset.sav). (Nieboer et al.) a. Reconstruct the analysis of the Life event data. Follow the basic guidelines mentioned in the course. Use both the gain score and the Ancova approach. For the gain score approach use the ' lifesubset.sav' file. For the ancova approach use the 'lifeancova.sav' file. b. Is the difference between male and female at time point 12 significantly different than at time point 3? c. What are your conclusions concerning the difference between male and female on the one hand and carers and widowers on the other hand over time for both the gain score analysis and the ancova analysis? d. Explain why an interaction term between Gender and Time is specified in the gainscore analysis and not in the ancova analysis? e. Which model would you prefer in this case? 8. Alzheimer Study (SPSS system file: Alzheimer.sav, Alzheimer_vert.sav, Alzheimer_hor.sav' ) a. Investigate the missingness pattern of the Alzheimer data (see transparencies). Assume that missingness is due to monotone dropout. Open the file Alzheimer.sav. Go to Analyze Descriptive Statistics Frequencies. Click week into the 'variable(s)' box. Deduce the missingness pattern from the frequency table and fill in the following table. Pattern counts
9 Frequency % b. Plot the proportion of patients in the study vs. time for: center gender treatment To plot the proportion of patients vs. time for each center we need: - the total number of patients per center - the number of patients in the study per center per time point Guidelines for a plot per center (general case when large data sets are involved): Number of patients per center Sort the data in ascending order. Start with an equal number of weeks per patients. Open the Alzheimer_vert.sav file. Go to Data Sort cases. Click center into the 'Sort by' box. Create data with the number of patients per center. Go to Data Aggregate. Click center into the 'Break variable' box. Select the box ' Save number of cases in break group variable'. Change the name of the variable (n_break) to nmeas. Click on the button 'File...' and change the directory to which the new file is written into... Change the name of the file into 'alz_aggr_center_npat.sav' Open the new aggregate file. The variable nmeas has to be divided by 8 to get the number of patients per center instead of the number of measurements per center. Go to Transform Compute. Click nmeas in the 'Numeric Expression' box. Add the expression ' /8'. Type npat in the 'Target variable' box. Save the file. 9
10 Number of patients in study per center per time point Sort data in ascending order of center and weeks. Open the Alzheimer_vert.sav file. Go to Data Sort cases. Click center into the 'Sort by' box. Click week into the 'Sort by' box. Go to Data Aggregate. Click center into the 'Break variable' box. Click week into the 'Break variable' box. Click alz into the 'Aggregate variable' box. Click on the button 'Name&Label' Change the name to patinstu. Click on the button 'Function'. Select the option unweighted. Click on the button 'File...' and change the directory to which the new file is written into... Change the name of the file into 'alz_aggr_center.sav'. Match the two aggregate files Open the file 'alz_aggr_center.sav'. Go to Data Merge files Add variables. Select the file ' 'alz_aggr_center_npat.sav' Select the box 'Match cases on key variables in sorted files Select 'external file is keyed table'. Click center into the 'Key variables' box. Calculate the proportion of patients in the study per week Go to Transform Compute. Type the expression 'patinstu/npat' in the 'Numeric Expression' box. The target variable is p_instud. Plot the proportion of patients in the study vs. time with separate lines for center Follow the guidelines mentioned under 'mean profiles'. Follow the same procedure for gender and treatment. 10
11 c. Perform a logistic regression to evaluate whether the occurrence of missing is predicted by treatment, gender, age, center and the 1st and 2nd measurement. Open the file 'Alzheimer_hor.sav'. The variable 'miss' indicates whether a patient has one or more missings values on the Alzheimer score. Go to Analyze Regression Binary Logistic. Click the variable miss into the 'Dependent:' box. Click the variables treatm, gender, center, age, alz.1 and alz.2 into the 'Covariates:' box. Click on the button Categorical: Click treatm and center into the 'Categorical covariates:' box. Select 'first' as the Reference category for both variables. Click on Change. Click Continue. Choose 'Backward LR as method for the analysis. Click ok. On which predictors does the missingness depend? d. Is the underlying missing value mechanism MCAR or MAR (assume not MNAR)? 9. Test Assignment Longitudinal data analysis Beating the blues (system file: data81.sav) A clinical trial was designed to assess the effectiveness of an interactive program using multimedia techniques for the delivery of cognitive behavioral therapy for depressed patients and known as Beating the Blues (BtB). In a randomized controlled trial of the program, patients with depression recruited in primary care were randomized to either the BtB program, or to Treatment as Usual (TAU). The variable Treat represents these two treatments (Treat = 1 if BtB, and Treat = 0 if TAU). The outcome measure (Depress) used in the trial was the Beck Depression Inventory II with higher values indicating more depression. Measurements of this variable were made on five occasions: - prior to treatment ( Bdipre) - follow up at 2, 3, 5, and 8 months after treatment (months) - There is a considerable number of missing values caused by patients dropping out of the study - There are repeated measurements of the outcome taken on each patient post treatment, along with a baseline pre-treatment measurement. The question of most interest about these data is whether the BtB program does better than TAU in treating depression. Perform a longitudinal analysis. Write a report of your findings. Include the following considerations in your analysis. 11
12 - Determine the pattern of missing observations and investigate whether the underlying missing-mechanism is MCAR or MAR. - Determine by the choice of the design and model selection which mixed effects model is most suitable. - Are there any interactions involved? - Compare the approach based on ancova with that based on change scores. Which approach is preferable? - Determine the corresponding V cov of the final model and compare with the observed covariance matrix - Interpret your results. What is your final conclusion regarding the research question? 12
13 General guidelines for performing a multilevel and longitudinal data analysis with the SPSS option Mixed Linear An independent variable should be specified as a covariate in SPSS if it is a quantitative variable or a qualitative variable with 2 categories. It should be specified as a factor if it is a qualitative variable with more than 2 categories. The Mixed Models option in SPSS will automatically compute dummy variables from the qualitative variable with the highest category as a reference. Plotting longitudinal data. Subject specific profiles. Open your dataset Go to Graphs Interactive Line... Click on reset. Drag the dependent variable to the box for the y-variable Drag the (time dependent) independent variable to the box for the x-variable. Drag the identification variable to the 'colour' box Drag the grouping variable to the 'style' box. Select convert. Right-click on the identification variable and select categorical. Mean profiles. Go to Graphs Interactive Line... Click on reset. Drag the dependent variable to the box for the y-variable Drag the (time dependent) independent variable to the box for the x-variable. Drag the grouping variable to the 'colour' box. Right-click on the grouping variable and select categorical. Go to the tab Dots and Lines and select Dots. Performing an OLS regression model. Click on the reset button. Click the dependent variable into the 'Dependent variable:' box Click the quantitative (or dichotomous) independent variables into the 'Covariate(s):' box Click the qualitative independent variables into the 'Factor(s):' box Click the fixed button. Select the independent variables. If the full model is required, then factorial should be selected, and the independent variables should be selected simultaneously. Click the Add button. Click the statistics button and select the following checkboxes: Parameter estimates and covariances of residuals. Performing a random effects model. Go to Analyze Mixed models Linear... Click on the reset button. Click the identification variable into the 'subjects:' box 13
14 Click the dependent variable into the 'Dependent variable:' box Click the quantitative independent variables into the 'Covariate(s):' box Click the qualitative independent variables into the 'Factor(s):' box Click the fixed button. Select the independent variables. If the full model is required, then factorial should be selected, and the independent variables should be selected simultaneously. Click the Add button. Click the random button. Select the 'Include Intercept' checkbox if a random intercept is required. If random slope is required, then select the relevant independent variable en put it in the Model box. Choose unstructured as Covariance type. Click the identification variable from the 'Subjects:' into the 'Combinations:' box. Click the statistics button and select the following checkboxes: Parameter estimates, covariances of random effects and Covariances of residuals. Specifying a serial correlation. Go to Analyze Mixed models Linear... Click on the reset button. Click the identification variable into the subjects: box and age into the 'Repeated:' box. Choose a covariance structure option as the Repeated Covariance type. Note: there are several covariance structure like AR(1) (which is homogeneous), AR(1) heterogeneous, unstructured, toeplitz, scaled identity etc. Click the dependent variable into the 'Dependent variable:' box, age and sex into the 'Covariate(s):' box Click the statistics button and select the following checkboxes: Parameter estimates, Covariance of random effects and Covariances of residuals. Click the fixed button. Select the required independent variables and click the Add button. Performing an LR test: calculation of corresponding p-value Go to Transform Compute... Click the function CDF.CHISQ into the 'Numeric expression:' box. Type '1- ' before the function. Replace the first question mark by the difference in -2 restricted LL between the 2 models. Replace the second question mark by the number of degrees of freedom. Fill in a name for the target variable in the ' target variable' box. Click ok. 14
Introduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
More informationDirections for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
More informationBiostatistics Short Course Introduction to Longitudinal Studies
Biostatistics Short Course Introduction to Longitudinal Studies Zhangsheng Yu Division of Biostatistics Department of Medicine Indiana University School of Medicine Zhangsheng Yu (Indiana University) Longitudinal
More informationAnalyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest
Analyzing Intervention Effects: Multilevel & Other Approaches Joop Hox Methodology & Statistics, Utrecht Simplest Intervention Design R X Y E Random assignment Experimental + Control group Analysis: t
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More informationDEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
More informationIntroduction Course in SPSS - Evening 1
ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/
More informationSimple Linear Regression, Scatterplots, and Bivariate Correlation
1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.
More informationWhen to use Excel. When NOT to use Excel 9/24/2014
Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual
More informationHLM software has been one of the leading statistical packages for hierarchical
Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush
More informationSAS Syntax and Output for Data Manipulation:
Psyc 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (in preparation) chapter 5. We will be examining
More informationModeration. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationBelow is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.
Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data
More informationIntroduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group
Introduction to Multilevel Modeling Using HLM 6 By ATS Statistical Consulting Group Multilevel data structure Students nested within schools Children nested within families Respondents nested within interviewers
More informationData exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
More informationIntroduction to Data Analysis in Hierarchical Linear Models
Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM
More informationInstructions for SPSS 21
1 Instructions for SPSS 21 1 Introduction... 2 1.1 Opening the SPSS program... 2 1.2 General... 2 2 Data inputting and processing... 2 2.1 Manual input and data processing... 2 2.2 Saving data... 3 2.3
More informationIBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
More informationScatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
More informationHow To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
More informationChapter 23. Inferences for Regression
Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
More informationBill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
More informationCOMPUTING AND VISUALIZING LOG-LINEAR ANALYSIS INTERACTIVELY
COMPUTING AND VISUALIZING LOG-LINEAR ANALYSIS INTERACTIVELY Pedro Valero-Mora Forrest W. Young INTRODUCTION Log-linear models provide a method for analyzing associations between two or more categorical
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationHow to set the main menu of STATA to default factory settings standards
University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be
More informationData analysis process
Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis
More informationLinear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure
Technical report Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure Table of contents Introduction................................................................ 1 Data preparation
More informationBowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application
More information11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
More informationExploring Relationships between Highest Level of Education and Income using Corel Quattro Pro
Exploring Relationships between Highest Level of Education and Income using Corel Quattro Pro Created by Michael Lieff (m@lieff.net) Faculty of Education, Queen s University While on practicum at Statistics
More informationIntroducing the Multilevel Model for Change
Department of Psychology and Human Development Vanderbilt University GCM, 2010 1 Multilevel Modeling - A Brief Introduction 2 3 4 5 Introduction In this lecture, we introduce the multilevel model for change.
More informationJanuary 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
More informationData Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
More informationStatistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.
Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red. 1. How to display English messages from IBM SPSS Statistics
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
More informationSPSS Introduction. Yi Li
SPSS Introduction Yi Li Note: The report is based on the websites below http://glimo.vub.ac.be/downloads/eng_spss_basic.pdf http://academic.udayton.edu/gregelvers/psy216/spss http://www.nursing.ucdenver.edu/pdf/factoranalysishowto.pdf
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationPlease follow these guidelines when preparing your answers:
PR- ASSIGNMNT 3000500 Quantitative mpirical Research The objective of the pre- assignment is to review the course prerequisites and get familiar with SPSS software. The assignment consists of three parts:
More informationSPSS (Statistical Package for the Social Sciences)
SPSS (Statistical Package for the Social Sciences) What is SPSS? SPSS stands for Statistical Package for the Social Sciences The SPSS home-page is: www.spss.com 2 What can you do with SPSS? Run Frequencies
More informationIBM SPSS Missing Values 22
IBM SPSS Missing Values 22 Note Before using this information and the product it supports, read the information in Notices on page 23. Product Information This edition applies to version 22, release 0,
More informationSPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,
More informationIBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
More informationR with Rcmdr: BASIC INSTRUCTIONS
R with Rcmdr: BASIC INSTRUCTIONS Contents 1 RUNNING & INSTALLATION R UNDER WINDOWS 2 1.1 Running R and Rcmdr from CD........................................ 2 1.2 Installing from CD...............................................
More informationSPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard
p. 1 SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard The following tutorial is a guide to some basic procedures in SPSS that will be useful as you complete your data assignments for PPA 722. The purpose
More informationMain Effects and Interactions
Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly
More informationIBM SPSS Direct Marketing 22
IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release
More informationASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS
DATABASE MARKETING Fall 2015, max 24 credits Dead line 15.10. ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS PART A Gains chart with excel Prepare a gains chart from the data in \\work\courses\e\27\e20100\ass4b.xls.
More informationBinary Logistic Regression
Binary Logistic Regression Main Effects Model Logistic regression will accept quantitative, binary or categorical predictors and will code the latter two in various ways. Here s a simple model including
More informationSAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
More information10. Analysis of Longitudinal Studies Repeat-measures analysis
Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.
More informationIBM SPSS Statistics for Beginners for Windows
ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning
More informationElectronic Thesis and Dissertations UCLA
Electronic Thesis and Dissertations UCLA Peer Reviewed Title: A Multilevel Longitudinal Analysis of Teaching Effectiveness Across Five Years Author: Wang, Kairong Acceptance Date: 2013 Series: UCLA Electronic
More information(More Practice With Trend Forecasts)
Stats for Strategy HOMEWORK 11 (Topic 11 Part 2) (revised Jan. 2016) DIRECTIONS/SUGGESTIONS You may conveniently write answers to Problems A and B within these directions. Some exercises include special
More informationSPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout
Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS
More informationProblem of Missing Data
VASA Mission of VA Statisticians Association (VASA) Promote & disseminate statistical methodological research relevant to VA studies; Facilitate communication & collaboration among VA-affiliated statisticians;
More informationChapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data.
Chapter 15 Mixed Models A flexible approach to correlated data. 15.1 Overview Correlated data arise frequently in statistical analyses. This may be due to grouping of subjects, e.g., students within classrooms,
More informationTable of Contents. Preface
Table of Contents Preface Chapter 1: Introduction 1-1 Opening an SPSS Data File... 2 1-2 Viewing the SPSS Screens... 3 o Data View o Variable View o Output View 1-3 Reading Non-SPSS Files... 6 o Convert
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationSPSS Tests for Versions 9 to 13
SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list
More informationMISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
More informationCONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19
PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations
More informationUsing SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
More informationCharting LibQUAL+(TM) Data. Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University
Charting LibQUAL+(TM) Data Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University Revised March 2004 The directions in this handout are written to be used with SPSS
More information5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.
5. Correlation Objectives Calculate correlations Calculate correlations for subgroups using split file Create scatterplots with lines of best fit for subgroups and multiple correlations Correlation The
More informationSPSS The Basics. Jennifer Thach RHS Assessment Office March 3 rd, 2014
SPSS The Basics Jennifer Thach RHS Assessment Office March 3 rd, 2014 Why use SPSS? - Used heavily in the Social Science & Business world - Ability to perform basic to high-level statistical analysis (i.e.
More informationRegression Clustering
Chapter 449 Introduction This algorithm provides for clustering in the multiple regression setting in which you have a dependent variable Y and one or more independent variables, the X s. The algorithm
More informationIntroduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
More informationSPSS and AM statistical software example.
A detailed example of statistical analysis using the NELS:88 data file and ECB, to perform a longitudinal analysis of 1988 8 th graders in the year 2000: SPSS and AM statistical software example. Overall
More informationInstructions for data-entry and data-analysis using Epi Info
Instructions for data-entry and data-analysis using Epi Info After collecting data using the tools for evaluation and feedback available in the Hand Hygiene Implementation Toolkit (available at http://www.who.int/gpsc/5may/tools
More informationFactors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
More informationSPSS Manual for Introductory Applied Statistics: A Variable Approach
SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All
More informationUsing Microsoft Excel to Plot and Analyze Kinetic Data
Entering and Formatting Data Using Microsoft Excel to Plot and Analyze Kinetic Data Open Excel. Set up the spreadsheet page (Sheet 1) so that anyone who reads it will understand the page (Figure 1). Type
More informationDiagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
More informationDealing with Data in Excel 2010
Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing
More informationDoing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
More informationAn SPSS companion book. Basic Practice of Statistics
An SPSS companion book to Basic Practice of Statistics SPSS is owned by IBM. 6 th Edition. Basic Practice of Statistics 6 th Edition by David S. Moore, William I. Notz, Michael A. Flinger. Published by
More informationAn introduction to IBM SPSS Statistics
An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive
More informationUCINET Visualization and Quantitative Analysis Tutorial
UCINET Visualization and Quantitative Analysis Tutorial Session 1 Network Visualization Session 2 Quantitative Techniques Page 2 An Overview of UCINET (6.437) Page 3 Transferring Data from Excel (From
More informationSPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg
SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way
More informationLinear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
More informationCurriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
More informationMTH 140 Statistics Videos
MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative
More informationOverview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)
Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and
More informationThe Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA
Paper 156-2010 The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA Abstract JMP has a rich set of visual displays that can help you see the information
More informationA Brief Introduction to SPSS Factor Analysis
A Brief Introduction to SPSS Factor Analysis SPSS has a procedure that conducts exploratory factor analysis. Before launching into a step by step example of how to use this procedure, it is recommended
More informationModule 4 - Multiple Logistic Regression
Module 4 - Multiple Logistic Regression Objectives Understand the principles and theory underlying logistic regression Understand proportions, probabilities, odds, odds ratios, logits and exponents Be
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationC:\Users\<your_user_name>\AppData\Roaming\IEA\IDBAnalyzerV3
Installing the IDB Analyzer (Version 3.1) Installing the IDB Analyzer (Version 3.1) A current version of the IDB Analyzer is available free of charge from the IEA website (http://www.iea.nl/data.html,
More informationWe are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?
Statistics: Correlation Richard Buxton. 2008. 1 Introduction We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Do
More informationTechnical report. in SPSS AN INTRODUCTION TO THE MIXED PROCEDURE
Linear mixedeffects modeling in SPSS AN INTRODUCTION TO THE MIXED PROCEDURE Table of contents Introduction................................................................3 Data preparation for MIXED...................................................3
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More informationChapter 4 Displaying and Describing Categorical Data
Chapter 4 Displaying and Describing Categorical Data Chapter Goals Learning Objectives This chapter presents three basic techniques for summarizing categorical data. After completing this chapter you should
More informationSAS VISUAL ANALYTICS AN OVERVIEW OF POWERFUL DISCOVERY, ANALYSIS AND REPORTING
SAS VISUAL ANALYTICS AN OVERVIEW OF POWERFUL DISCOVERY, ANALYSIS AND REPORTING WELCOME TO SAS VISUAL ANALYTICS SAS Visual Analytics is a high-performance, in-memory solution for exploring massive amounts
More informationMicrosoft Excel Tutorial
Microsoft Excel Tutorial Microsoft Excel spreadsheets are a powerful and easy to use tool to record, plot and analyze experimental data. Excel is commonly used by engineers to tackle sophisticated computations
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationSPSS Step-by-Step Tutorial: Part 2
SPSS Step-by-Step Tutorial: Part 2 For SPSS Version 11.5 DataStep Development 2004 1 Transformations and recoding revisited 5 Introduction 5 Value labels 5 SPSS Tutorial and Help 6 Using online help 6
More information