Percent Change and Power Calculation. UCLA Advanced NeuroImaging Summer School, 2007





Outline
- Calculating %-change
  - How to do it
  - What featquery does
- Power analysis
  - Why you should use power calculations
  - How to carry out power calculations

Why %-change?
- As they stand, parameter estimates do not have a specific unit
- T-stats are okay (they are unitless)
- What if we want to tell other people how large our activation was?
- Convert to %-change

%-change
- How big is the signal magnitude relative to baseline?
- %-change = 100 x (signal magnitude / mean)

Block Design
- How to get the signal magnitude from parameter estimates
  - p.e. = 4, EV height = 1: signal magnitude = 1 x 4
  - p.e. = 2, EV height = 2: signal magnitude = 2 x 2
- Both parameterizations give the same signal magnitude (4)

Block Design
- To make life easier, set the min/max range of the EVs to 1!
- In FSL, the grand mean scaling sets the mean to roughly $100^2$ (10,000) in all voxels
- PE/100 = %-change! (roughly)
- To be completely accurate you should divide by the true mean
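The block-design arithmetic can be sketched in a few lines of Python. This is only an illustration: the function name `percent_change` and the baseline of 10,000 (the approximate FSL grand mean) are assumptions, and for an accurate value the true voxel mean should be used in place of the default.

```python
def percent_change(pe, ev_height, baseline_mean=10000.0):
    """Signal magnitude (PE times regressor height) as a percentage of baseline.

    baseline_mean=10000 is an assumption (FSL grand mean scaling, roughly);
    pass the true mean for an accurate value.
    """
    signal_magnitude = pe * ev_height
    return 100.0 * signal_magnitude / baseline_mean


# Two parameterizations of the same effect give the same %-change.
a = percent_change(pe=4, ev_height=1)
b = percent_change(pe=2, ev_height=2)
print(a, b)

# With EV height 1 and baseline ~10000, this reduces to PE/100.
print(abs(a - 4 / 100) < 1e-12)
```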

Event Related Design
- Tricky! The proximity and length of events affect the regressor height

Event Related Design
- How about using the min/max range? Is it interpretable?
- Suppose I tell you I found a 2% change, calculated using the min/max range. Can you interpret this with your own design?

ER design
- Assume mean = 100
- %-change = PE x (min/max range)
- Example: 2 = PE x 0.5, so PE = 4

ER design
- All designs have an isolated 2-second event, BUT the estimated signal is very different!

ER Design
- Min/max range doesn't work as well for event-related designs
- It doesn't translate well to other designs
- *Warning*: Featquery uses the min/max range
- Instead of the min/max range, choose something specific (an isolated 2-second event) and make sure to report what you used!

ER Design
- 1% change based on the height of an isolated 2-s event
- Now we can compare across designs!

A note about contrasts
- What does the contrast [1 1 -1 -1] mean?
- The betas are mean activations for 4 levels of subjects performing some task (beginners, some training, medium training, experts)
- The contrast tests (beginners + some training) - (medium training + experts)
- The resulting difference is twice the mean difference!

A note about contrasts
- Although [1 1 -1 -1] implies a difference that is twice the original scale, our test statistic is okay
- Since we're interested in preserving scale, use contrasts that give us means, e.g. [0.5 0.5 -0.5 -0.5]
  - Positive parts sum to 1
  - Negative parts sum to -1
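A small sketch of the contrast rule, assuming a contrast is given as a plain list of weights. The helper name `rescale_contrast` is hypothetical and is not part of FSL; it rescales the positive and negative parts separately so that each sums to 1 in absolute value.

```python
def rescale_contrast(contrast):
    """Rescale so positive weights sum to 1 and negative weights sum to -1."""
    pos_sum = sum(c for c in contrast if c > 0)
    neg_sum = -sum(c for c in contrast if c < 0)
    rescaled = []
    for c in contrast:
        if c > 0:
            rescaled.append(c / pos_sum)
        elif c < 0:
            rescaled.append(c / neg_sum)
        else:
            rescaled.append(0.0)
    return rescaled


print(rescale_contrast([1, 1, -1, -1]))  # [0.5, 0.5, -0.5, -0.5]
```

The rescaled contrast estimates a difference of means, so its estimate stays on the scale of a single parameter estimate. The test statistic is unchanged by this rescaling.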

Rules to get *almost* %-change in FSL
- Baseline is already ~$100^2$
- EV is constructed appropriately
  - Boxcar height = 1
  - ER design: the specified event has height = 1 (I like to ignore the post-stimulus undershoot)
- Construct contrasts that follow the rules
  - Positive parts sum to 1
  - Negative parts sum to -1
- Then PE/100 ~ %-change

What if you didn't follow the rules?
- Calculate the height of the regressor as it was
  - Height of the block
  - Height of the specified type of event
- Calculate the contrast fix
  - The number you'd have to divide the contrast by to fix it

Example
- Study with subjects who were beginners, some training, medium training, experts
- Level 1: an isolated 2-second event (using the gamma HRF) has height = 0.2917
- Level 2: I used the contrast [1 1 -1 -1]
- COPE = 500 (from the contrast estimate)
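One way to carry the example through, as a sketch rather than the author's exact calculation: divide the COPE by the contrast fix, multiply by the event height, and express the result relative to the baseline. The baseline of 10,000 (the approximate FSL grand mean) and the function name are assumptions.

```python
def percent_change_from_cope(cope, event_height, contrast_fix, baseline_mean=10000.0):
    """Percent change from a COPE whose contrast did not follow the rules.

    contrast_fix is the number the contrast must be divided by so that its
    positive parts sum to 1 and its negative parts sum to -1.
    baseline_mean=10000 is an assumed value (FSL grand mean, roughly).
    """
    return 100.0 * (cope / contrast_fix) * event_height / baseline_mean


contrast = [1, 1, -1, -1]
contrast_fix = sum(c for c in contrast if c > 0)  # positive parts sum to 2

pc = percent_change_from_cope(cope=500, event_height=0.2917, contrast_fix=contrast_fix)
print(f"{pc:.2f}%")  # 0.73%
```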

Featquery
- Featquery calculates %-change for ROIs
- Uses the min/max range of the effective regressor height
  - Roughly speaking, the effective regressor corrects for violations of the contrast rule and for correlation between regressors
- Probably works fine for block designs
- Very bad to run separately on first-level runs for ER studies!
  - Will use a different scale factor for each subject
  - Min/max range isn't best for ER designs

What to do instead of featquery?
- There's a writeup with some code that uses avwstats and avwmaths, located here: http://mumford.bol.ucla.edu/perchange_guide.pdf
- Some isolated event heights are in a table in this document
- The isolated event height should be calculated from a design with a *very* small TR for best resolution

Power Analysis: Why?
- To answer the questions:
  - How many subjects do I need for my study?
  - How many runs per subject should I collect?
- To create thorough grant applications that will make reviewers happy!
- Don't waste money on underpowered studies OR on collecting data from more subjects than you need

What is Power?
- Power: the probability of rejecting $H_0$ when $H_A$ is true
- Specify your null distribution: mean = 0, variance = $\sigma^2$
- Specify the effect size ($\Delta$), which leads to the alternative distribution
- Specify the false positive rate, $\alpha$
[Figure: null distribution and alternative distribution, with $\alpha$, power, and $\Delta/\sigma$ marked]
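For intuition, power can be computed directly in the simplest setting. The sketch below assumes a one-sided test whose statistic is normally distributed with known variance, which is a simplification of the fMRI case; the function name is illustrative.

```python
from statistics import NormalDist


def one_sided_power(effect_over_sigma, alpha=0.05):
    """Power of a one-sided z-test when the alternative mean is Delta/sigma."""
    std_normal = NormalDist()
    critical_value = std_normal.inv_cdf(1.0 - alpha)  # rejection threshold under H0
    # Probability that the statistic exceeds the threshold under HA.
    return 1.0 - std_normal.cdf(critical_value - effect_over_sigma)


for d in (0.0, 1.0, 2.0, 3.0):
    print(d, round(one_sided_power(d), 3))
```

With no effect (Delta/sigma = 0) the "power" equals alpha, and it rises toward 1 as the alternative distribution moves away from the null.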

Interpretation
- 1100 total voxels
- 100 voxels have $\beta = \Delta$
  - A test with 50% power will, on average, detect 50 of these voxels with true activation
- 1000 voxels have $\beta = 0$
  - $\alpha$ = 5% implies that, on average, 50 null voxels will be false positives

fMRI power: test result (observed) versus truth (unobserved)

Truth          Reject H0                  Accept H0            Total
H0 true        Type I error (alpha): 50   Correct: 950         1000
H0 false       Power: 50                  Type II error: 50    100
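The expected counts in this example follow from two multiplications. A minimal sketch (the function and key names are illustrative):

```python
def expected_counts(n_active, n_null, power, alpha):
    """Expected number of voxels in each cell of the truth/decision table."""
    true_positives = n_active * power
    false_positives = n_null * alpha
    return {
        "true_positives": true_positives,
        "false_negatives": n_active - true_positives,
        "false_positives": false_positives,
        "true_negatives": n_null - false_positives,
    }


counts = expected_counts(n_active=100, n_null=1000, power=0.5, alpha=0.05)
for cell, value in counts.items():
    print(cell, round(value))
```

Note that in this scenario the expected number of false positives equals the expected number of true detections, so about half of all rejected voxels are null voxels.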

Necessary information
- N: number of subjects
  - Adjusted to achieve sufficient power
- $\alpha$: the size of the test you'd like to use
  - Commonly set to 0.05 (5% false positive rate)
- $\Delta$: the size of the effect you're interested in detecting
  - Based on intuition or similar studies
- $\sigma^2$: the variance of the estimate of $\Delta$
  - Has a complicated structure with very little intuition
  - Depends on many things

Why is it so difficult for group fMRI?
- Within each subject (Subject 1, Subject 2, ..., Subject N), the time series has temporal autocorrelation: $\mathrm{Cov}(Y) = \sigma_W^2 V$
- Across subjects, there is between-subject variability, $\sigma_B^2$

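Level 1
$$Y_k = X_k \beta_k + \varepsilon_k$$
- $Y_k$: $T_k$-vector time series for subject $k$
- $X_k$: $T_k \times p$ design matrix
- $\beta_k$: $p$-vector of parameters
- $\varepsilon_k$: $T_k$-vector error term, $\mathrm{Cov}(\varepsilon_k) = \sigma_k^2 V_k$
[Figure: illustration of the first-level model with parameters $\beta_{k0}$, $\beta_{k1}$, $\beta_{k2}$, $\beta_{k3}$]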

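Level 2
$$\hat{\beta}_{cont} = X_g \beta_g + \varepsilon_g$$
- $\hat{\beta}_{cont}$: $N$-vector of first-level contrast estimates $c\hat{\beta}_k$
- $X_g$: $N \times p_g$ design matrix
- $\beta_g$: $p_g$-vector of parameters
- $\varepsilon_g$: $N$-vector error term
$$\mathrm{Cov}(\varepsilon_g) = V_g = \mathrm{diag}\{ c (X_k^T V_k^{-1} X_k)^{-1} \sigma_k^2 c^T \} + \sigma_B^2 I_N$$
- The first term is the within-subject variability; the second term is the between-subject variability
[Figure: illustration of the group model with parameters $\beta_{g1}$ and $\beta_{g2}$]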

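Alternative distribution
- For a specific $H_A: c_g \beta_g = \Delta$, the statistic $t$ is distributed $T_{N - p_g,\ \mathrm{ncp}}$ (noncentral t), with
$$\mathrm{ncp} = \frac{\Delta}{\sqrt{c_g (X_g^T V_g^{-1} X_g)^{-1} c_g^T}}$$
- What do we need?
  - $N$, $\alpha$, $\Delta$
  - $\sigma^2$, which is built from $c_g$, $X_g$, the within-subject covariance $\sigma_W^2 V$, and the between-subject variance $\sigma_B^2$
  - The within-subject part is in turn built from $c$, $X_k$, $\sigma_k^2$, $V_k$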
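To see how these ingredients combine, here is a sketch for the simplest group model only: a one-sample group mean (the group design matrix is a column of ones and the group contrast is 1), with every subject sharing the same within-subject variance for the first-level contrast estimate. Under those assumptions the denominator of the noncentrality parameter reduces to sqrt((within + between) / N). The standard library has no noncentral t distribution, so power is approximated with a normal distribution, which is optimistic for small N. All function names are hypothetical, and this is not how fmripower is implemented.

```python
from math import sqrt
from statistics import NormalDist


def group_ncp(delta, n_subjects, within_var, between_var):
    """Noncentrality parameter for a one-sample group mean (simplified model)."""
    return delta / sqrt((within_var + between_var) / n_subjects)


def approx_power(ncp, alpha=0.05):
    """One-sided power using a normal approximation to the noncentral t."""
    z = NormalDist()
    return 1.0 - z.cdf(z.inv_cdf(1.0 - alpha) - ncp)


def subjects_needed(delta, within_var, between_var, target=0.80, alpha=0.05, max_n=1000):
    """Smallest N whose approximate power reaches the target, or None."""
    for n in range(2, max_n + 1):
        if approx_power(group_ncp(delta, n, within_var, between_var), alpha) >= target:
            return n
    return None


# Illustrative inputs only, not estimates from any study.
print(subjects_needed(delta=1.0, within_var=2.0, between_var=2.0))
```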

Estimating Parameters
- How do you estimate parameters for a future study?
  - Look at other people's results for similar studies
  - Look at your own similar studies
- Average parameter estimates over ROIs of interest

Estimating $\sigma_k^2 V_k$
- Use the covariance estimate supplied by the software
  - FSL uses an unstructured covariance that can have 13 or more parameters
  - SPM2 uses a 2-parameter AR(1) approximation
- Fit an AR(1)+WN (white noise) model to the software estimate
  - Only 3 parameters
  - Summarize the parameters over ROIs
  - A simple parameterization is easy to report and bridges covariance estimates across software
- Assume the covariance is the same over all subjects (necessary to extrapolate to larger populations)
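A sketch of what a three-parameter AR(1)+WN covariance could look like. The parameterization below (an AR(1) variance, an AR(1) correlation `rho`, and a white-noise variance added on the diagonal) is an assumption chosen for illustration; the slides do not spell out the exact form used.

```python
def ar1_wn_covariance(n_timepoints, rho, ar_var, wn_var):
    """Covariance with entries ar_var * rho**|i-j|, plus wn_var on the diagonal."""
    return [
        [
            ar_var * rho ** abs(i - j) + (wn_var if i == j else 0.0)
            for j in range(n_timepoints)
        ]
        for i in range(n_timepoints)
    ]


# Illustrative parameter values only.
cov = ar1_wn_covariance(n_timepoints=4, rho=0.5, ar_var=2.0, wn_var=1.0)
for row in cov:
    print(row)
```

Whatever the run length, the whole matrix is described by the same three numbers, which is what makes it easy to report and to average over an ROI.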

Estimating $\sigma_B^2$
- SPM2 doesn't estimate it, but we still need to approximate it for the power calculation
  - You can see my paper for the details
- FSL estimates this parameter and saves it in an image file
- Average over the ROI

Model
- Block design: 15 s on, 15 s off
- TR = 3 s
- HRF: gamma, sd = 3
- Parameters estimated from a block study: FIAC single-subject data
  - Read "3 little pigs"
  - Same/different speaker, same/different sentence
  - Looked at blocks with same sentence, same speaker

Power as a function of run length and sample size

More importantly... cost!
- Cost to achieve 80% power
- Cost = $300 per subject + $10 per extra minute
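The cost rule can be written as a one-line function, which makes it easy to compare designs that reach the same power with different mixes of subjects and scan time. This sketch assumes the "extra minutes" are charged per subject; the design pairs below are made-up placeholders, not values from the power curves.

```python
def study_cost(n_subjects, extra_minutes_per_subject,
               per_subject=300.0, per_extra_minute=10.0):
    """Total cost, assuming extra scan minutes are charged for every subject."""
    return n_subjects * (per_subject + per_extra_minute * extra_minutes_per_subject)


# Hypothetical (subjects, extra minutes) pairs assumed to reach the same power.
candidate_designs = [(20, 0), (15, 5), (12, 15)]
for n, minutes in candidate_designs:
    print(n, minutes, study_cost(n, minutes))

cheapest = min(candidate_designs, key=lambda d: study_cost(*d))
print("cheapest:", cheapest)
```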

How Many Runs?
- Can also expand to a 3-level model and study the impact of adding runs
- Example: ER study
  - The study used 3 runs per subject
  - Estimate between-run variability
  - Assume within-subject variability is the same across subjects
  - Assume the study design is the same across subjects

How many runs?

How can you calculate power?
- Only 1 tool that I know of: Fmripower! (created by me)
  - Beta version at fmripower.org
  - ROI-based power analysis
  - Can be applied to old FSL analyses
  - Runs in Matlab
- The current version only allows the user to specify different numbers of subjects
  - Assumes the number of runs for the future study will be the same
  - Assumes between-subject variability is the same across subjects
  - Doesn't control for multiple comparisons

Can I use power post hoc?
- Please don't! It doesn't make sense
  - Post hoc (observed) power is determined by alpha and the observed p-value
  - If you did not reject your null, post hoc power is always less than about 50%
  - See "The Abuse of Power: The Pervasive Fallacy of Power Calculations for Data Analysis", Hoenig and Heisey, Amer. Stat. 2001
- Try a percent change threshold instead (see Tom Nichols' website)
  - Based on a confidence interval cutoff
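The reason post hoc power adds nothing can be shown numerically. The sketch below assumes a one-sided z-test and plugs the observed effect in as if it were the true effect, which is what "observed power" does; the function name is illustrative.

```python
from statistics import NormalDist


def observed_power(p_value, alpha=0.05):
    """Post hoc power of a one-sided z-test, computed from the p-value alone."""
    z = NormalDist()
    z_observed = z.inv_cdf(1.0 - p_value)  # test statistic implied by the p-value
    z_critical = z.inv_cdf(1.0 - alpha)
    return 1.0 - z.cdf(z_critical - z_observed)


for p in (0.50, 0.20, 0.05, 0.01):
    print(p, round(observed_power(p), 3))
```

Observed power is a one-to-one function of the p-value: it equals 50% exactly when p equals alpha, falls below 50% for any non-significant result, and rises above 50% for any significant one.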


fmripower

Conclusion
- Percent change should be calculated in a manner that is interpretable
  - Min/max range does not lead to interpretable %-change
  - Interpretable %-change is very handy for power calculations
- Power calculations can save you time and money
  - Scan enough subjects to see an effect; don't waste time on an underpowered study
  - Don't keep subjects in the scanner too long if you don't need to