Experimental Design and Hypothesis Testing. Rick Balkin, Ph.D.



Similar documents
research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

In an experimental study there are two types of variables: Independent variable (I will abbreviate this as the IV)

12/30/2012. Research Design. Quantitative Research: Types (Campbell & Stanley, 1963; Crowl, 1993)

Chapter Eight: Quantitative Methods

Research Methods & Experimental Design

RESEARCH DESIGN PART 2. Experimental Research Design. Purpose

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011

Observing and describing the behavior of a subject without influencing it in any way.

Pre-experimental Designs for Description. Y520 Strategies for Educational Inquiry

Experimental Design. 1. Randomized assignment 2. Pre-test ( O1) 3. Independent Variable ( X ) 4. Post-Test ( O2 )


CRITICALLY APPRAISED PAPER (CAP)

Types of Group Comparison Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Causal-Comparative Research 1

Inclusion and Exclusion Criteria

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

Chapter 2 Quantitative, Qualitative, and Mixed Research

Data Definitions Adapted from the Glossary How to Design and Evaluate Research in Education by Jack R. Fraenkel and Norman E.

Example Research Scenarios

Nursing Journal Toolkit: Critiquing a Quantitative Research Article

Descriptive Statistics

Evaluation: Designs and Approaches

Technological Attitude and Academic Achievement of Physics Students in Secondary Schools (Pp )

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Exploratory Research Design. Primary vs. Secondary data. Advantages and uses of SD

Additional sources Compilation of sources:

interpretation and implication of Keogh, Barnes, Joiner, and Littleton s paper Gender,

WHAT IS A JOURNAL CLUB?

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

How to Make APA Format Tables Using Microsoft Word

A + dvancer College Readiness Online Improves. ACCUPLACER Scores of Miami Dade College Freshmen. May John Vassiliou, M. S.

Gender Stereotypes Associated with Altruistic Acts

Comparison of Research Designs Template

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

A DESIGN AND DEVELOPMENT OF E-LEARNING CONTENT FOR MULTIMEDIA TECHNOLOGY USING MULTIMEDIA GAME

Data analysis process

Module 4: Formulating M&E Questions and Indicators

Mind on Statistics. Chapter 13

Moore, D., & McCabe, D. (1993). Introduction to the practice of statistics. New York: Freeman.

Outcome of Drug Counseling of Outpatients in Chronic Obstructive Pulmonary Disease Clinic at Thawangpha Hospital

Basic Concepts in Research and Data Analysis

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Virtual Teaching in Higher Education: The New Intellectual Superhighway or Just Another Traffic Jam?

Financial Education in schools: the experience of the Bank of Italy and the Ministry of Education

Achievement in Physics Using Mastery Learning and Mind Mapping Approaches: Implication on Gender and Attitude

NORTHERN VIRGINIA COMMUNITY COLLEGE PSYCHOLOGY RESEARCH METHODOLOGY FOR THE BEHAVIORAL SCIENCES Dr. Rosalyn M.

Inferential Statistics. What are they? When would you use them?

Nonye Azih and B.O. Nwosu Department of Business Education, Ebonyi State University, Abakaliki, Nigeria

Statistics, Research, & SPSS: The Basics

Farhana Khurshid PhD scholar, King s College London

10. Analysis of Longitudinal Studies Repeat-measures analysis

Linear Models in STATA and ANOVA

A PRELIMINARY COMPARISON OF STUDENT LEARNING IN THE ONLINE VS THE TRADITIONAL INSTRUCTIONAL ENVIRONMENT

Study Guide for the Final Exam

Step 6: Writing Your Hypotheses Written and Compiled by Amanda J. Rockinson-Szapkiw

Sampling. COUN 695 Experimental Design

Application in Predictive Analytics. FirstName LastName. Northwestern University

Operational Definition for Evidence-Based Practices Addictions and Mental Health Division September 11, 2007

IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs

THE EFFECT OF PEER TUTORING AND EXPLICIT INSTRUCTIONAL STRATEGIES ON PRIMARY SCHOOL PUPILS LEARNING OUTCOMES IN MATHEMATICS

DATA COLLECTION AND ANALYSIS

Lesson 17: Margin of Error When Estimating a Population Proportion

Scientific Method. 2. Design Study. 1. Ask Question. Questionnaire. Descriptive Research Study. 6: Share Findings. 1: Ask Question.

RECRUITERS PRIORITIES IN PLACING MBA FRESHER: AN EMPIRICAL ANALYSIS

WWC Single Study Review A review of the design and summary of findings for an individual study

Effect of Computer-Assisted Packages on the Performance of Senior Secondary Students in Mathematics in Awka, Anambra State, Nigeria

ARTS FOR ACADEMIC ACHIEVEMENT Changing Student Attitudes Toward Math: Using Dance to Teach Math

Missouri School Counselor Performance Assessment (MoSCPA) Task Requirements

Syllabus for Ph.D. (Education) Entrance Test

Statistics Review PSY379

Cognitive Behavior Group Therapy in Mathematics Anxiety

When to Use a Particular Statistical Test

Running head: THE EFFECTS OF EXTRA-CURRICULAR ACTIVITIES

HOW TO WRITE A LABORATORY REPORT

Developmental Research Methods and Design. Types of Data. Research Methods in Aging. January, 2007

Other forms of ANOVA

Chapter 1: The Nature of Probability and Statistics

The Importance and Impact of Nursing Informatics Competencies for Baccalaureate Nursing Students and Registered Nurses

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

The Procedures of Monte Carlo Simulation (and Resampling)

Modeling Peer Review in Example Annotation

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

The Influence of Parent-Child Attachment on Romantic Relationships

THE EFFECT OF MATHMAGIC ON THE ALGEBRAIC KNOWLEDGE AND SKILLS OF LOW-PERFORMING HIGH SCHOOL STUDENTS

Effects of Leadership Training on the Manufacturing Productivity of Informal Leaders

Does Affirmative Action Create Educational Mismatches in Law School?

The influence of communication on administration of secondary schools in Delta State, Nigeria

Mixed 2 x 3 ANOVA. Notes

Chapter 5 Analysis of variance SPSS Analysis of variance

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Transcription:

Experimental Design and Hypothesis Testing Rick Balkin, Ph.D. 1

Let s s review hypothesis testing and experimental design 3 types of hypothesis testing in experimental research: z-test t-test F-test Balkin, R. (2008) 2

z-test Tests for statistically significant differences between a sample group and a population Balkin, R. (2008) 3

t-test Tests for statistically significant differences between two sample groups 2 types: Independent: tests for differences between groups and used in between group designs Dependent: tests for changes over time and used in within group designs Balkin, R. (2008) 4

F-test Tests for statistically significant differences among three or more groups Also known as an ANOVA Analysis Analysis of Variance When two or more dependent variables exist, it is known as MANOVA Multivariate Multivariate Analysis of Variance Balkin, R. (2008) 5

Categories of Experimental Design Pre-experimental: no random assignment of manipulation of IV Quasi-experimental: IV manipulation but no random assignment Explanatory non-experimental: IV is an assigned trait rather than a manipulated variable Sometimes referred to as expost facto True experimental: random assignment and IV manipulation Balkin, R. (2008) 6

Models for experimental design Models for experimental design may be as classified as two types: between groups,, in which outcomes are compared between two or more groups or within groups,, in which a single group is measured across time using two or more different treatments. Balkin, R. (2008) 7

Between groups design In a between groups design, the effect of the independent variable on the dependent variable is based upon the examination of group differences. Balkin, R. (2008) 8

Between groups design Four different types of between group designs Post-test only Pretest-posttest control group Solomon four group Factorial designs Balkin, R. (2008) 9

Consider the following scenario: A counselor wishes to know whether a peer mentoring program would be effective in assisting students who are at-risk for academic failure. The counselor utilizes the Youth Outcome Questionnaire (Y-OQ-SR-2.0) as a measure of program effectiveness. The Y-OQ- SR-2.0 is a youth survey designed to be repeatedly administered to adolescents to assess their ongoing progress in counseling. It has 30- items on a 5 point Likert scale. Internal consistency was assessed at.91, and is noted to have adequate validity (Wells, Burlingame, Rose, 1999). Balkin, R. (2008) 10

Post-test only Participants are randomly assigned to a treatment group and a comparison/control group. The treatment group receives some type of manipulation or intervention while a control group would receive none. A quantitative measure is then used to determine the effect of the intervention. In this case, the quantitative measure is the dependent variable and the presence or absence of the treatment is the independent variable. Balkin, R. (2008) 11

Post-test only--example Using our example for a post-test only design, the counselor would randomly assign students identified as at-risk for academic failure to receive peer mentoring over the next three months or to receive peer mentoring three months later after the initial group has received the treatment The effect of peer mentoring is being evaluated for change in well-being, as evidenced by the score on the Y-OQ-SR-2.0, the dependent variable. Balkin, R. (2008) 12

Post-test only--model The treatment group receives peer mentoring; the control group initially does not, and both groups are evaluated by observing their scores on the Y-OQ-SR-2.0 Posttest Only random assign. Group Treatment Control Peer Mentoring Yes No Posttest Yes Yes Balkin, R. (2008) 13

Testing for differences in a Posttest only Model In this model we have two groups an experimental group and control group. The DV is the score on the Y-OQ-SR-2.0 So, we can conduct an independent t-test to determine if statistically significant differences exist in the scores on the Y-OQ-SR-2.0 between the treatment and control groups Balkin, R. (2008) 14

Pretest-posttest control group Participants are randomly assigned to a treatment group and a comparison/control group. A quantitative measure is then used to determine the effect of the intervention. Both the treatment group and control group receive a pretest. The treatment group receives some type of manipulation or intervention while a control group would receive none. After the intervention, a posttest is administered. In this case, the groups are still compared based on posttest scores, but the researcher can be certain of the degree of equal groups at the onset of the study. Balkin, R. (2008) 15

Pretest-posttest control group-- Example Using our example for a pretest posttest control group design, the counselor would randomly assign students identified as at- risk for academic failure to receive peer mentoring over the next three months or to receive peer mentoring three months later after the initial group has received the treatment. Balkin, R. (2008) 16

Pretest-posttest control group-- Model Each group is administered the Y-OQ-SR-2.0 as a pretest in order to ensure equality of groups. The treatment group receives peer mentoring; the control group initially does not, and both groups are evaluated by observing their scores on the Y-OQ-SR-2.0 posttest Pretest Posttest Control Group random assign. Group Treatment Control Pretest Yes Yes Peer Mentoring Yes No Posttest Yes Yes Balkin, R. (2008) 17

Testing for differences in a Pretest- posttest control group--model This model may be tested in two ways: First, if the purpose of the pretest was to ensure group equality, then this model is tested the same way as the previous model. The DV is the score on the Y-OQ-SR-2.0, and we can conduct an independent t-test to determine if statistically significant differences exist in the scores on the Y-OQ-SR-2.0 between the treatment and control groups Balkin, R. (2008) 18

Testing for differences in a Pretest- posttest control group--model Second, the researcher has the option of also looking at changes over time because a pretest was administered. So two dependent t-tests could be conducted, one for the treatment group and one for the control group, to determine if the change in scores over time was statistically significant. Balkin, R. (2008) 19

Solomon four group Participants are randomly assigned to one of four groups: (a) a treatment group that receives both a pretest and a posttest (b) a treatment group that receives a posttest only (c) a control group that receives both a pretest and a posttest (d) a control group that receives a posttest only. Thus, only one treatment group and one control group are administered a pretest. Balkin, R. (2008) 20

Solomon four group Both treatment groups receive some type of manipulation or intervention while both control groups would receive none. After the intervention, a posttest is administered to all four groups. In this case, the groups are still compared based on posttest scores, but the researcher can be certain of the equivalence of the groups at the onset of the study and assess the impact of the pretest to ascertain whether or not a testing effect exists. Balkin, R. (2008) 21

Solomon four group--example Using our example for a Solomon four group, the counselor would randomly assign students identified as at-risk for academic failure to receive peer mentoring over the next three months or to receive peer mentoring three months later after the initial group has received the treatment. From the treatment group, those students are randomly assigned to be administered a pretest or no pretest. From the control group, those students are randomly assigned to be administered a pretest or no pretest. One treatment group and one control are administered the Y-OQ-SR-2.0 as a pretest in order to ensure equality of groups. Balkin, R. (2008) 22

Solomon four group--example The other treatment and control groups do not receive a pretest. The treatment groups receive peer mentoring; the control groups initially do not. The posttest scores from the two treatment groups and the posttest scores from the two control groups can be compared in order to ascertain whether the pretest contributed to differences in the scores. Balkin, R. (2008) 23

Solomon four group--model If the posttest scores for both treatment groups and the posttest scores for both control groups are similar, then the administration of the pretest had no effect. Assuming there is no testing effect, the posttest scores between the treatment groups and control groups can be compared to determine the effect of the intervention Solomon Four Group Random assign. Group Treatment Control Treatment Control Pretest Yes Yes Yes Balkin, R. (2008) 24 No No Peer Ment. Yes No Yes No Post- test Yes Yes Yes

Testing for differences in a Solomon four group--model This model may be tested in the following ways: First, remember that an advantage of this design is that any changes due to being administered a pretest can be ruled out. Therefore if there is no testing effect, then there should be no significant differences in the DV scores for the treatment groups, and there should no significant differences in the DV scores for the control groups To determine this, two independent t-tests are conducted: A test to compare Y-OQ-SR-2.0 scores for the treatment group that received a pretest and the treatment group that did not receive a pretest A test to compare Y-OQ-SR-2.0 scores for the control group that received a pretest and the control group that did not receive a pretest Balkin, R. (2008) 25

Testing for differences in a Solomon four group--model Second, differences between treatment and control groups need to be determined If there is no testing effect, as evidenced by results in the previous steps, then an independent t-test can be conducted between the treatment and control groups Balkin, R. (2008) 26

Testing for differences in a Solomon four group--model Third, changes over time could be assessed by looking at the treatment and control groups that received a pretest. Two dependent t-tests could be conducted to determine changes over time--one for the treatment group and one for the control group, to determine if the change in scores over time was statistically significant. Balkin, R. (2008) 27

What if I have more than two groups? In any of these designs, it is possible to have more than two groups. In our example, we have compared one group who receives a treatment, peer mentoring, and another group who receives no intervention. What if we added a third group, such as participants who participate in a psychoeducational group? Balkin, R. (2008) 28

What if I have more than two groups? Then we could conduct the analyses using a F-test. This statistic is design to identify statistically significant differences in three or more groups. So, we could simply replace the t-tests tests in the previous examples with F-tests. Balkin, R. (2008) 29

Factorial designs The purpose of a factorial design is to study change in the dependent variable across two or more independent variables. For example, instead of simply examining the effect of a peer mentoring program, the counselor wishes to know whether sex plays a role. Is the degree of change in well-being different across a peer mentoring intervention for males and females? Balkin, R. (2008) 30

Factorial designs In other words, two analyses will be conducted: (a) differences in Y-OQ-SR-2.0 scores across the treatment and control groups (b) differences in Y-OQ-SR-2.0 scores across males and females. Balkin, R. (2008) 31

Testing for differences in a factorial design This is generally done with a F-test A F-test can be used to analyze differences between males and females and between treatment and control groups in a single analysis! Designs like this are complex, but also widely used in social science research Balkin, R. (2008) 32

Within Group Design A within group design is utilized when a change in the dependent variable in a group is measured across time. In a within group design, the pretest also serves as a baseline in which to compare subsequent tests. Balkin, R. (2008) 33

Within Group Design--Example For example, the Y-OQ-SR-2.0 is administered at the onset of the peer mentoring study to get a baseline measure. Then the Y-OQ-SR-2.0 is administered four additional times on a monthly basis in order to compare progress to the initial administration. Within Group Design Observation 1 Observation 2 Observation 3 Observation 4 Balkin, R. (2008) 34

Testing for differences in a within- group design You have actually already seen this design at work in the previous examples when changes over time were tested. In the case in which changes over time are evaluated in a pretest and posttest, a dependent t-test is conducted When there are more than two repeated observations, then a F-test is conducted on each of the DV scores. Balkin, R. (2008) 35