E10: Controlled Experiments

Similar documents
The aims: Chapter 14: Usability testing and field studies. Usability testing. Experimental study. Example. Example

Research Methods & Experimental Design

9.63 Laboratory in Visual Cognition. Single Factor design. Single design experiment. Experimental design. Textbook Chapters

Correlational Research

PSYC 381 Statistics Arlo Clark-Foos, Ph.D.

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Using Microsoft Excel to Analyze Data

CHAPTER 4 KEY PERFORMANCE INDICATORS

Fairfield Public Schools

Statistical tests for SPSS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

A Statistical Analysis of Popular Lottery Winning Strategies

Experimental Designs (revisited)

PhD Qualifying Examination: Human-Computer Interaction

Nursing Journal Toolkit: Critiquing a Quantitative Research Article

Lab 2.1 Tracking Down the Bugs

Roadmap to Data Analysis. Introduction to the Series, and I. Introduction to Statistical Thinking-A (Very) Short Introductory Course for Agencies

Introduction to Quantitative Methods

Exploratory Research Design. Primary vs. Secondary data. Advantages and uses of SD

Experimental methods. Elisabeth Ahlsén Linguistic Methods Course

SPSS Guide: Regression Analysis

Module 5: Statistical Analysis

Overview of Factor Analysis

Chapter 7. One-way ANOVA

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

UNIVERSITY OF NAIROBI

p-values and significance levels (false positive or false alarm rates)

Additional sources Compilation of sources:

Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data

Simple Linear Regression Inference

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

WHAT IS A JOURNAL CLUB?

Association Between Variables

Topic #6: Hypothesis. Usage

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Chapter 5 Analysis of variance SPSS Analysis of variance

Copyright. Network and Protocol Simulation. What is simulation? What is simulation? What is simulation? What is simulation?

2013 MBA Jump Start Program. Statistics Module Part 3

Obtaining Knowledge. Lecture 7 Methods of Scientific Observation and Analysis in Behavioral Psychology and Neuropsychology.

Skype for Business for Android

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

9.63 Laboratory in Cognitive Science. Interaction: memory experiment

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Step 6: Writing Your Hypotheses Written and Compiled by Amanda J. Rockinson-Szapkiw

RECRUITERS PRIORITIES IN PLACING MBA FRESHER: AN EMPIRICAL ANALYSIS

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Models of Dissertation Research in Design

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

CHAPTER 5 COMPARISON OF DIFFERENT TYPE OF ONLINE ADVERTSIEMENTS. Table: 8 Perceived Usefulness of Different Advertisement Types

Tennessee Mathematics Standards Implementation. Grade Six Mathematics. Standard 1 Mathematical Processes

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Modelling and Big Data. Leslie Smith ITNPBD4, October Updated 9 October 2015

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

How To Run Statistical Tests in Excel

MTH 140 Statistics Videos

HOW TO WRITE A LABORATORY REPORT

Pearson s Correlation

Selecting Research Participants

Guidelines for Using the Retrospective Think Aloud Protocol with Eye Tracking

Basic Concepts in Research and Data Analysis

Hypothesis testing - Steps

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Pearson's Correlation Tests

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

Testing Hypotheses About Proportions

Reading with Mobile Phone & Large Display

Big Data and Data Analytics

A Hands-On Exercise Improves Understanding of the Standard Error. of the Mean. Robert S. Ryan. Kutztown University

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

ACADEMIC POLICY AND PLANNING COMMITTEE REQUEST FOR AHC GENERAL EDUCATION CONSIDERATION

The Mozart effect Methods of Scientific Research

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Video Games and Academic Performance. Ronny Khadra. Cody Hackshaw. Leslie Mccollum. College of Coastal Georgia

Your Questions from Chapter 1. General Psychology PSYC 200. Your Questions from Chapter 1. Your Questions from Chapter 1. Science is a Method.

Name of Lesson: Properties of Equality A Review. Mathematical Topic: The Four Properties of Equality. Course: Algebra I

Introduction to Hypothesis Testing

Hypothesis Test for Mean Using Given Data (Standard Deviation Known-z-test)

Statistics 2014 Scoring Guidelines

User Interface Design

DOCTOR OF PHILOSOPHY DEGREE. Educational Leadership Doctor of Philosophy Degree Major Course Requirements. EDU721 (3.

Math 1050 Khan Academy Extra Credit Algebra Assignment

Overview. Triplett (1898) Social Influence - 1. PSYCHOLOGY 305 / 305G Social Psychology. Research in Social Psychology 2005

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

This chapter discusses some of the basic concepts in inferential statistics.

Week TSX Index

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Normality Testing in Excel

Home Internet Filter User Guide

Guidance for Peer Reviewers. The Journal of the American Osteopathic Association (JAOA)

Transcription:

E10: Controlled Experiments Quantitative, empirical method Used to identify the cause of a situation or set of events X is responsible for Y Directly manipulate and control variables Correlation does not imply causality Example: relationship between typing speed and time spent playing games Use a controlled experiment to verify an observation, a correlation, or a hunch Average typing speed (WPM) 90 80 70 60 50 40 30 20 0 1 2 3 4 5 6 7 8 9 10 Average gaming time per week (hours) DIS 1 Jan Borchers 1

Controlled Experiments: Steps 1. Formulate hypothesis Pie menu decreases seek time for menu items 2. Design experiment, pick variable and fixed parameters Type of menu target seek time 3. Run a pilot study to debug your experimental procedures Improving distribution of menu targets The examples are simplified from [Schneiderman et al., CHI 88] DIS 1 Jan Borchers 2

Controlled Experiments: Steps 4. Recruit subjects Undergraduate students with little or no mouse experience 5. Run experiment Each participant perform menu selection 10 times for each type of menu 6. Interpret results to accept or reject hypothesis Mean seek time: 2.26 (Pie), 2.64 (Linear) The difference is statistically significant (p =.0003) The examples are simplified from [Schneiderman et al., CHI 88] DIS 1 Jan Borchers 3

Controlled Experiments Subjects Similar to real users in profile (age, education, computer and domain expertise, system knowledge, ) Use at least 10 subjects Use more if you need finer details Statistical power analysis can tell you the exact number Variables Independent Variables (IVs): are varied under your control E.g., number of menu entries Each level of an independent variable is called a treatment Dependent Variables (DVs): are those you measure E.g., execution time, error rates, subjective preferences DIS 1 Jan Borchers 4

In-Class Exercise: Identifying Variables Identify independent variables and dependent variables from each of the following scenarios. Indicate levels of each independent variable: 1. A study investigating whether people who have attended a security training program generate and use more secure passwords than people who haven t received any security training 2. A research team examining the effectiveness of joysticks and trackballs for selecting static and moving targets 3. A research team examining whether virtual teams who use video instant messaging (IM) are more productive than teams who use text-only IM Examples from: Research Methods in HCI, Lazar et al. (2010) DIS 1 Jan Borchers 5

Hypothesis Predicts outcome of experiment Usually: claims that changing independent variables influences dependent variables Experiment goal: confirm research hypothesis (H1) No amount of experimentation can ever prove me right; a single experiment can prove me wrong. Albert Einstein Approach: Reject inverse null hypothesis (H0), i.e., no influence If we can determine that H0 is wrong, we can accept that H1 is true (naïve view) H0 is usually a precise statement we ll know the probability that H0 is incorrect E.g., Average WPM between gaming and non-gaming groups are equal The data should indicate that there is a very low probability that H0 is correct Being unable to reject H0 does not imply that you can accept H0 E.g., your number of participants may just have been too small DIS 1 Jan Borchers 6

In-Class Exercise: Identifying Hypotheses Identify a research hypothesis (H1) and null hypothesis (H0) for each of our scenarios: 1. A study investigating whether people who have attended a security training program generate and use more secure passwords than people who haven t received any security training 2. A research team examining the effectiveness of joysticks and trackballs for selecting static and moving targets 3. A research team examining whether virtual teams who use video instant messaging (IM) are more productive than teams who use text-only IM Examples from: Research Methods in HCI, Lazar et al. (2010) DIS 1 Jan Borchers 7

Basic Experimental Designs Between-groups design Each subject only does one variant of the experiment There are at least 2 groups to isolate effect of manipulation: Treatment group and control group + No learning effects across variants Good for tasks that are simple and involve limited cognitive processes, e.g., tapping, dragging, or visual search But: requires more users Within-groups design Each subject does all variants of the experiment + Fewer users required, individual differences canceled out Good for complex tasks, e.g., typing,reading, composition, problem solving But: learning effects may occur DIS 1 Jan Borchers 8

In-Class Exercise: Experimental Designs Which type of experimental design is appropriate for each scenario? 1. A study investigating whether people who have attended a security training program generate and use more secure passwords than people who haven t received any security training 2. A research team examining the effectiveness of joysticks and trackballs for selecting static and moving targets 3. A research team examining whether virtual teams who use video instant messaging (IM) are more productive than teams who use text-only IM Examples from: Research Methods in HCI, Lazar et al. (2010) DIS 1 Jan Borchers 9

Within-Group Design: Order Effect The order of presenting the treatments (IV levels) might affect the dependent variable Learning effect Fatigue effect Contrast effect: the effect of the first treatment carries over to influence the response to the second treatment Solutions Rest period between treatments Counterbalancing: all possible orders of treatments are included but: O(n!) Latin Square: A limited set of orders, O(n) DIS 1 Jan Borchers 10

Latin Square Each condition appears at each ordinal position Each condition precedes and follows each condition one time Example: six treatments: A, B, C, D, E, F 1 A B F C E D 2 B C A D F E 3 C D B E A F 4 D E C F B A 5 E F D A C B 6 F A E B D C DIS 1 Jan Borchers 11

Randomization Randomly assign treatment to participants Prevents systematic bias But: randomization counterbalancing With small numbers, randomization might not cover all combinations DIS 1 Jan Borchers 12