Research Methods & Experimental Design



Similar documents
Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Descriptive Statistics

Study Guide for the Final Exam

Additional sources Compilation of sources:

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011

Post-hoc comparisons & two-way analysis of variance. Two-way ANOVA, II. Post-hoc testing for main effects. Post-hoc testing 9.

T-test & factor analysis

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SPSS Tests for Versions 9 to 13

Chapter 5 Analysis of variance SPSS Analysis of variance

UNDERSTANDING THE TWO-WAY ANOVA

Data Analysis, Research Study Design and the IRB

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

Projects Involving Statistics (& SPSS)

DATA COLLECTION AND ANALYSIS

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Nonparametric Statistics

Chapter Eight: Quantitative Methods

Fairfield Public Schools

Statistics Review PSY379

Rank-Based Non-Parametric Tests

The Dummy s Guide to Data Analysis Using SPSS

Introduction to Statistics and Quantitative Research Methods

UNIVERSITY OF NAIROBI

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

SPSS Explore procedure

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Data analysis process

WHAT IS A JOURNAL CLUB?

This chapter discusses some of the basic concepts in inferential statistics.

Independent t- Test (Comparing Two Means)

II. DISTRIBUTIONS distribution normal distribution. standard scores

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Parametric and non-parametric statistical methods for the life sciences - Session I

Nursing Journal Toolkit: Critiquing a Quantitative Research Article

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

Research Methodology: Tools

SPSS Guide: Regression Analysis

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

Sample Size and Power in Clinical Trials

Mixed 2 x 3 ANOVA. Notes

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

THE KRUSKAL WALLLIS TEST

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

1.5 Oneway Analysis of Variance

Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable

Correlational Research

Observing and describing the behavior of a subject without influencing it in any way.

Statistical tests for SPSS

Using Excel for inferential statistics

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Mathematics within the Psychology Curriculum

Comparing Means in Two Populations

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

2 Precision-based sample size calculations

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Experimental Design and Hypothesis Testing. Rick Balkin, Ph.D.

Introduction to Fixed Effects Methods

individualdifferences

Descriptive Analysis

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Part 2: Analysis of Relationship Between Two Variables

Analysing Questionnaires using Minitab (for SPSS queries contact -)

Introduction to Quantitative Methods

Biostatistics: Types of Data Analysis

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Introduction to Minitab and basic commands. Manipulating data in Minitab Describing data; calculating statistics; transformation.

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Statistics for Sports Medicine

Deciding which statistical test to use:

STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Lean Six Sigma Black Belt-EngineRoom

Ordinal Regression. Chapter

Statistics in Medicine Research Lecture Series CSMC Fall 2014

ABSORBENCY OF PAPER TOWELS

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic

Analysis of Variance ANOVA

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

Chi-square test Fisher s Exact test

Study Design and Statistical Analysis

SPSS Guide How-to, Tips, Tricks & Statistical Techniques

Simple Linear Regression Inference

Chapter 15. Mixed Models Overview. A flexible approach to correlated data.

Intro to Parametric & Nonparametric Statistics

Crosstabulation & Chi Square

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Transcription:

Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004

Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and variables is critical Information Data Information=data + analysis Planning in advance is a must To include how data will be analyzed

Qualitative Research Methods Social & cultural phenomenon Case studies Focus groups Observations Usability testing Can be quantitative Interviews Questionnaires

Quantitative Research Methods Natural phenomenon Mathematical modeling Experiments Optimization Game theory Surveys Bottom line statistics are a must

Project Assignment Design and conduct an experiment in which you explore some measure of human performance through testing, analyze the results, and discuss the broader implications. Design an actual display that uses automation for decision support While formal experimental testing is not required, a small group of users should be used to identify problems with the design to include functionality evaluation as well as recommendation for future improvements and systems integration.

The Experimental Design Process Research Question (Hypothesis) Design Experiment Collect Data Analyze Data Draw Conclusions

Experimental Design Design of Experiments (DOE) defined: A theory concerning the minimum number of experiments necessary to develop an empirical model of a research question and a methodology for setting up the necessary experiments. A parsimony model Human subject vs. object experimentation Other DOE Constraints Time Money

Experimental Design Basics Two kinds of data gathering methodologies Observation Can t prove cause & effect but can establish associations. Hawthorne effect, social facilitation Experimental Cause & effect Variables of interest factors vs. treatments Independent variable Treatment manipulations of variables of interest Treatment vs. control group Dependent variable is what you are measuring

More Basics Confounds Randomization Concerns Randomization prevents experimental bias Assignment by experimenter Counterbalancing Statistical assumptions A requirement for statistical tests of significance Why would you use the observation methodology instead of experiments?

DOE Terminology Replications Independent observations of a single treatment. Variance The measuring stick that compares different treatments. Internal validity The extent to which an experiment accomplishes its goal(s). Reproducibility Given the appropriate information, the ability of others to replicate the experiment.

DOE Terminology (cont.) External validity How representative of the target population is the sample? Can the results be generalized? Generalizations for field experiments are easier to justify than lab experiments because of artificialities. Medical Trials Placebo Double Blind If so, what is the population to which it can be generalized? Can the results be generalized to the real world?

ANOVA Within group variance is noise and between group variance is information we seek. ANOVA separates these out. Data Analysis Data Types Variables Categorical Numerical Scales of Measurement Nominal Ordinal Interval Computer Programs Excel, SAS, S+, SPSS

Basic Statistical Tests Assumptions for comparison of means Independent & random Normality Variances roughly equal t-tests One or two samples Chi-square tests NID(0,1) Categorical data, non-parametric Chi square important because any sum of squares in normal random variables divided by the variance is chi-square distributed

Null Hypothesis: H o Defined: The difference in two different populations parameters is 0. H o : µ 1 = u 2 H a : µ 1 u 2 H o : Always predicts absence of a relationship & assumed to be true. If the null hypothesis is NOT rejected, we CANNOT conclude that there is no difference, only that the method did not detect any difference. p <.05????

A Very Important Research Question Does drinking cappuccino one hour before a test improve results? What is the metric (dependent variable)? Experimental Design Treatment group vs. control group A single comparison Experimental efficiency Perhaps we want to look at who makes the cappuccino (Seattle s, Starbucks, Pete s) as well as the difference between coffee and cappuccino. 2X3 Factorial Interaction effects

Caffeine/Performance Experiment Capp Coffee GB SB ER We now know the general layout of the experiment but what is missing?

Caffeine/Performance Experiment How many subjects do we need? Sample Size Related to power the complement of a Type II error Decision Ho True Ho False Type I error Correct decision Reject Ho p = α p = 1 - β = POWER Fail to reject Ho Correct decision p = 1 - α Type II error p = β Ask what Ho is? Null hypothesis no significant difference exists between experimental groups.

Don t Panic

Caffeine/Performance Experiment So how do you determine sample size? http://members.aol.com/johnp71/javastat.html Sensitivity is an issue # of factors influences sample size Recruitment Issues Population selection How do we assign subjects to treatment categories? Confounds Experience Self-selection Control techniques

Other Subject Considerations What is the most efficient way to use human subjects? Between subjects Within subjects Repeated measures Increases power but Confounds practice & fatigue Counterbalance Mixed subjects Pre-test/post-test Tests over time

Pre/post Test Considerations Between Subjects Intervention A Intervention B Pre-Test Post-Test Within Subjects Ideally pre-test scores will be equivalent You want to see a difference between the experimental and control group.

Statistical Tests (cont.) Analysis of variances (ANOVA) Testing the differences between two or more independent means (or groups) on one dependent measure (either a single or multiple independent variables). One way vs. factorial F test ratio of variances MANOVA

Other DOE considerations: Full Factorial Blocking More homogenous grouping Coffee of the day v. another kind Starbuck s at the Marriott vs. Galleria Pairing Increases precision by eliminating the variation between experimental units Randomization still possible Many others Full factorial should be run twice Tennis shoe example try to find out which sole is better for shoes so each boy wears two different shoes. Randomization comes in assigning which shoe to which foot.

What test to use? Pearson Correlation No Includes a Categorical variable Yes Only One Independent Variable No Only Between Subjects Variable No Mixed Two-Way ANOVA Yes Yes One-Way Analysis of Variance No Only Two Levels Between-Subjects 2-Way ANOVA Yes Only Between Subjects Variable No Within-Subjects t-test Yes Between Subjects t-test Adapted from University of Maryland Psychology World.

Example Experiment Are web-based case studies better than print versions. How can we test this? This question was tested with 2 classes with 2 different professors. What are the independent & dependent variables? Was it within/between/mixed? What statistical test should we use?

Results Dependent Variable: GRADES Source Corrected Model Intercept PROF TYPE PROF * TYPE Error Total Corrected Total Tests of Between-Subjects Effects Type III Sum of Squares df Mean Square F Sig. 173.681 a 4 43.420.986.420 190832.489 1 190832.489 4333.757.000 157.697 1 157.697 3.581.062 26.217 2 13.109.298.743 11.840 1 11.840.269.605 3654.818 83 44.034 673001.300 88 3828.499 87 a. R Squared =.045 (Adjusted R Squared = -.001)

Estimated Marginal Means 89.5 89.0 88.5 88.0 87.5 87.0 86.5 Interactions Estimated Marginal Means of GRADES PROF TYPE 86.0 D 85.5 P B L Interaction effect: the response of one variable depends on effect of another variable No interaction parallel lines Significant interaction: Which professor would you rather have?

Non-Parametric Tests Use when you have no good information about an underlying distribution Parametric tests: Parametric form - parameters either assumed to be known or estimated from the data The mean and variance of a normal distribution Null hypothesis can be stated in terms of parameters and the test statistic follows a known distribution. Non-parametric tests are still hypothesis tests, but they look at the overall distribution instead of a single parameter Particularly useful for small samples

All data is not normal. Parametric Correlation & Association Pearson T-tests Independent & dependent ANOVA Factorial Repeated measures MANOVA Linear regression Non-parametric Association Spearman Chi-Square Contingency tables Kruskal-Wallis test Sign-test Friedman ANOVA Logistic regression