T Test, Anova & Regression Analysis of Node level Network Data

Similar documents
UCINET Visualization and Quantitative Analysis Tutorial

Multilevel Models for Social Network Analysis

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

UCINET Quick Start Guide

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout

How to do a Business Network Analysis

LAB 1 Intro to Ucinet & Netdraw

Linear Models in STATA and ANOVA

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Directions for using SPSS

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

Borgatti, Steven, Everett, Martin, Johnson, Jeffrey (2013) Analyzing Social Networks Sage

TO DISLIKE AND TO BE LIKED: SELF-MONITORING, AFFECT-INTENSIVE RELATIONS AND WORK PERFORMANCE. Zuzana Sasovova

SPSS Guide: Regression Analysis

Social Network Analysis

Matthew A. Jones, Ph.D.

Introduction to Data Analysis in Hierarchical Linear Models

Moderation. Moderation

The Dummy s Guide to Data Analysis Using SPSS

Additional sources Compilation of sources:

January 26, 2009 The Faculty Center for Teaching and Learning

Selznick, P. (1948). Foundations of the Theory of Organization.

Multiple Regression. Page 24

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

UNDERSTANDING THE TWO-WAY ANOVA

An introduction to IBM SPSS Statistics

Reliability Analysis

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

Examining graduate committee faculty compositions- A social network analysis example. Kathryn Shirley and Kelly D. Bradley. University of Kentucky

Introduction to Social Network Analysis Workshop

XPost: Excel Workbooks for the Post-estimation Interpretation of Regression Models for Categorical Dependent Variables

Trust, Job Satisfaction, Organizational Commitment, and the Volunteer s Psychological Contract

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

SPSS Explore procedure

Groups and Positions in Complete Networks

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

How to Analyze Company Using Social Network?

Binary Logistic Regression

Scatter Plot, Correlation, and Regression on the TI-83/84

Equivalence Concepts for Social Networks

TI-Inspire manual 1. Instructions. Ti-Inspire for statistics. General Introduction

A case study of social network analysis of the discussion area of a virtual learning platform

Multiple Regression Using SPSS

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

How To Analyze The Social Interaction Between Students Of Ou

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

Association Between Variables

Understanding and Quantifying EFFECT SIZES

SAS Analyst for Windows Tutorial

LMX as a Predictor of Performance Behaviour: Empirical Evidence from Life Insurance Sector of Pakistan

SPSS: Getting Started. For Windows

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Report on the Ontario Principals Council Leadership Study

Using Excel in Research. Hui Bian Office for Faculty Excellence

Forecasting in STATA: Tools and Tricks

Εισαγωγή στην πολυεπίπεδη μοντελοποίηση δεδομένων με το HLM. Βασίλης Παυλόπουλος Τμήμα Ψυχολογίας, Πανεπιστήμιο Αθηνών

Introduction to Quantitative Methods

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

PRELIMINARY ITEM STATISTICS USING POINT-BISERIAL CORRELATION AND P-VALUES

How To Run Statistical Tests in Excel

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Chapter 23. Inferences for Regression

Inside Social Network Analysis

Introducing the Multilevel Model for Change

Multiple logistic regression analysis of cigarette use among high school students

Data exploration with Microsoft Excel: analysing more than one variable

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

Multivariate Analysis of Variance (MANOVA)

KAIST Business School

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Table of Contents. Preface

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Study on the Working Capital Management Efficiency in Indian Leather Industry- An Empirical Analysis

TARYN L. STANKO. University of Oregon Phone: Eugene, OR Fax:

Imputation of missing network data: Some simple procedures

Chapter 5 Analysis of variance SPSS Analysis of variance

RECRUITERS PRIORITIES IN PLACING MBA FRESHER: AN EMPIRICAL ANALYSIS

Advertising value of mobile marketing through acceptance among youth in Karachi

Point Biserial Correlation Tests

Simple Predictive Analytics Curtis Seare

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

A Network Approach to Spatial Data Infrastructure Applying Social Network Analysis in SDI research

RMTD 404 Introduction to Linear Models

FACTOR ANALYSIS NASC

Creating, Solving, and Graphing Systems of Linear Equations and Linear Inequalities

Transcription:

T Test, Anova & Regression Analysis of Node level Network Data Definition: Node level T Test, Anova and regression analysis allow us to test whether the aggregated number of ties each individual has is associated with other variables. For example, does the hierarchical level or gender of an individual predict how many of their colleagues they seek for advice? It is also possible to see if the number of ties an individual has is associated with their level of performance. Note that this type of analysis uses total or aggregated ties as opposed to examining each tie individually (see the dyadlevel analysis handout for this option). Implications: Knowing the association between individual characteristics such as location or hierarchical level and advice seeking or other network behaviors helps managers develop the networks of their staff. For example, research suggests being higher in the hierarchy predicts being sought for advice but not necessarily advice seeking. Analysis: Standard T test, Anova and linear regression analysis are not appropriate for node level data because the aggregated measures for each node (i.e., people surveyed) are not independent of one another. This contravenes one of the basic assumptions of linear regression analysis. Therefore, we use a permutation test to generate the significance level, which accounts for the non independence of the cases. Node level analysis is different from tie level analysis. In node level analysis, we aggregate the number of ties each individual, rather than looking at each tie. For example, we can examine if being at a certain hierarchical level predicts the total number of ties an individual has. In the tie level analysis, using the quadratic assignment procedure (QAP), we are examining what predicts a tie between two nodes (individuals).

T Test Analysis Node level T Test analysis in UCINET In this analysis, we are going to examine whether men or women have more advice seeking (information) ties. Advice seeking are the outgoing ties from an individual. A T Test allows us to examine whether a binary variable (e.g., gender) is associated with a continuous variable (e.g., number of advice seeking ties). Please note that it is not possible to use a T Test to analyze a categorical variable of more than two categories. For that we need to conduct an Anova test. Creating the advice seeking and gender node level data files. Step 1. Load the information network into UCINET and call it info. Dichotomize your info network: Transform > dichotomize. Then select the info network and dichotomize at greater than or equal to 4. Step 2. Run the degree centrality routine: Network > Centrality and Power > Degree. Then in the input network box type info_ge_4 or click and select the info_ge_4 network. In the output file, which is called info_ge_4 deg, the advice seeking variable is column 1. Step 3. Load the attribute network into UCINET and call it attrib. The gender variable is column 2 in the attrib file. The variable is coded 0 for men and 1 for women. Running a T Test in UCINET Step 1. To run a T Test: Tools > Testing Hypotheses > Node level > T Test. Step 2. In the dependent variable box, select the info_ge_4 deg file. First, we will test for advice seeking ties, therefore select col 1. Step 3. In the independent variable box, select attrib, We are testing for gender, which is col 2. Then press ok. We could rerun this analysis using column 2 of our info_ge_4 deg file. This would allow us to see if there were a difference in being sought out for advice with regard to gender. The analysis could also be rerun with the other binary variable in the attribute file: Region.

T Test Analysis In our output file, Group 1 is men and Group 2 is women. The N of Obs indicates that there are 35 men and 11 women. The mean number of advice seeking ties for men is 4.143, and for women it is 2.091. The significance test at the bottom of the output file indicates that the difference in means is 2.052. Men on average seek advice from 2.052 more people than women. The standard statistical decision point for significance is a two tailed test of below 0.05. In our output, the two tailed significance is 0.0679, and therefore not below 0.5. This suggests that while men on average seek advice more than women, the difference is not statistically significant.

Anova Analysis Node level Anova in UCINET In this analysis, we are going to examine whether being in a certain practice is associated with having more advice seeking (information) ties. The practice variable is categorical and an individual can be in one of seven practices. An Anova test allows us to examine whether a categorical variable (e.g., practice) is associated with a continuous variable (e.g., number of advice seeking ties). Running an Anova in UCINET Step 1. To run an Anova test: Tools > Testing Hypotheses > Node level > Anova. Step 2. In the dependent variable box, select the info_ge_4 deg file. First, we will test for advice seeking ties; therefore, select col 1. Step 3. In the independent variable box. select attrib. We are testing for practice, which is col 3. Then press ok. The standard statistical decision point for significance is a significance test of below 0.05. In our output the significance is 0.0040, and therefore is below 0.5. This suggests that people in some practices seek advice more than people in other practices. However, we cannot tell from the output which practices are different from each other. The analysis could also be rerun with the other categorical variables in the attribute file: Location and Hierarchy.

Regression Analysis Node level regression analysis in UCINET In this analysis, we are going to examine whether gender, hierarchy, region and number of friendship ties are associated with having more advice seeking (information) ties. Adviceseeking ties are the outgoing ties from an individual. A regression analysis allows us to estimate whether various different variables are associated with advice seeking. This is different from the T Test and Anova analysis which only allowed us to examine if one variable was associated with advice seeking. Creating the friendship, gender, hierarchy and region node level file. To do the regression analysis we need two different files. First, we need a file that includes our outgoing friendship ties, gender, hierarchy and region data. Second, we need a file for advice seeking. We can use the info_ge_4 deg file that we created earlier for advice seeking (column 1). Step 1. Open the attrib file using the Excel matrix editor in UCINET and delete the columns for practice and location. Save the file using the save active sheet as UCINET dataset command and call it indep variables. Step 2. Load the friendship network into UCINET and call it friend. Dichotomize your friend network. Transform > dichotomize. Then select the friend network and dichotomize at greater than or equal to 4. Step 3. Run the degree centrality routine: Network > Centrality and Power > Degree. Then in the input network box type friend_ge_4 or click and select the friend_ge_4 network. In the output file, which is called friend_ge_4 deg, the outgoing friendship variable is column 1. Step 4. Open the friend_ge_4 deg file using the Excel matrix editor in UCINET. Copy the first column of data. Then open the indep_variables file in the Excel matrix editor. Paste the data into the file. Name the column header friend, then save the file. Running a regression analysis in UCINET Step 1. To run a node level regression: Tools > Testing Hypotheses > Node level > Regression. Step 2. In the dependent dataset box, select the info_ge_4 deg file. First, we will test for advice seeking ties; therefore, the dependent column # should be set to 1. Step 3. In the independent dataset box, select f. The independent column #s should be set to all. Then press ok. We could rerun this analysis using column 2 of our info_ge_4 deg file. This would allow us to see if there were a different result with regard to being sought out for advice.

Regression Analysis At the top of the output file is the correlation matrix detailing the correlation coefficients between each pair of variables. Notice that the correlation between outdegree (which is our advice seeking variable) and friend is relatively high, with a coefficient of 0.782. Under model fit, the first thing to notice is the Adjusted R square value, which is 0.648. This suggests that our model accounts for 64.8% of the variance in the advice seeking network. This is also a relatively high value. The next thing to examine is the regression coefficients table. The proportion as large column works in a similar way to the significance tests in the earlier analysis. Any value under 0.05 indicates that an independent variable is significant. In this output, only friend is significant, with 0.000. The unstandardized coefficient is positive (1.022502), indicating that the number of friendship ties a person has positively predicts the number of advice seeking ties they will have.

Bibliography Methodological papers and books: Borgatti, S. P., Everett, M. G., & Johnson, J. C. 2013. Analyzing social networks. Los Angeles, CA: Sage. Freeman, L. C. 1978. Centrality in social networks conceptual clarification. Social Networks, 1(3), 215 239. Hanneman, R. A., & Riddle, M. 2005. Introduction to social network methods. University of California, Riverside. Published in digital form at http://faculty.ucr.edu/~hanneman/nettext/. Scott, J. 2012. Social network analysis. London: Sage. Wasserman, S., & Faust, K. 1994. Social network analysis: Methods and applications. Cambridge, United Kingdom: Cambridge University Press. Empirical papers and review papers: Brass, D. J., Galaskiewicz, J., Greve, H. R., & Tsai, W. 2004. Taking stock of networks and organizations: A multilevel perspective. Academy of Management Journal, 47(6), 795 817. Cross, R., & Cummings, J. N. 2004. Tie and network correlates of individual performance in knowledge intensive work. Academy of Management Journal, 47(6), 928 937. Kilduff, M., & Brass, D. J. 2010. Organizational social network research: Core ideas and key debates. The Academy of Management Annals, 4(1), 317 357. Klein, K. J., Lim, B. C., Saltz, J. L., & Mayer, D. M. 2004. How do they get there? An examination of the antecedents of centrality in team networks. Academy of Management Journal, 47(6), 952 963. Mehra, A., Kilduff, M., & Brass, D. J. 2001. The social networks of high and low self monitors: Implications for workplace performance. Administrative Science Quarterly, 46(1), 121 146. Sparrowe, R. T., Liden, R. C., Wayne, S. J., & Kraimer, M. L. 2001. Social networks and the performance of individuals and groups. Academy of Management Journal, 44(2), 316 325. Andrew Parker, PhD, is a visiting professor at the University of Kentucky. He has conducted network analysis in over 100 multinational organizations and government agencies. He was a Senior Consultant at IBM s Institute for Knowledge Management and a research fellow at the Network Roundtable at the University of Virginia as well as an advisor to the Knowledge and Innovation Network at Warwick Business School. His research has appeared in Science, Organization Studies, Journal of Applied Psychology, Journal of Applied Behavioral Science, Social Networks, Management Communication Quarterly, Sloan Management Review, Organizational Dynamics and California Management Review. He is also the co author of The Hidden Power of Social Networks. He received his PhD from Stanford University.