Statistical basics for Biology: p s, alphas, and measurement scales.

Similar documents
II. DISTRIBUTIONS distribution normal distribution. standard scores

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

DATA COLLECTION AND ANALYSIS

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots


Study Guide for the Final Exam

Statistics Review PSY379

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Descriptive Statistics

Using Excel for inferential statistics

Measurement. How are variables measured?

Additional sources Compilation of sources:

UNIVERSITY OF NAIROBI

Statistics, Research, & SPSS: The Basics

Analysing Questionnaires using Minitab (for SPSS queries contact -)

Introduction to Statistics and Quantitative Research Methods

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

Statistics. Measurement. Scales of Measurement 7/18/2012

WHAT IS A JOURNAL CLUB?

Algebra Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Sample Size and Power in Clinical Trials

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

Rank-Based Non-Parametric Tests

Published entries to the three competitions on Tricky Stats in The Psychologist

Research Methods & Experimental Design

Basic Concepts in Research and Data Analysis

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Introduction to Quantitative Methods

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Descriptive Statistics and Measurement Scales

Chapter G08 Nonparametric Statistics

Chapter 1: The Nature of Probability and Statistics

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Introduction to Regression and Data Analysis

Analyzing Research Data Using Excel

Biostatistics: Types of Data Analysis

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement

SPSS Explore procedure

THE UNIVERSITY OF TEXAS AT TYLER COLLEGE OF NURSING COURSE SYLLABUS NURS 5317 STATISTICS FOR HEALTH PROVIDERS. Fall 2013

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Statistical tests for SPSS

Difference tests (2): nonparametric

Quantitative Methods for Finance

Elementary Statistics

NAG C Library Chapter Introduction. g08 Nonparametric Statistics

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Fat Content in Ground Meat: A statistical analysis

Pre-Algebra Academic Content Standards Grade Eight Ohio. Number, Number Sense and Operations Standard. Number and Number Systems

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Fairfield Public Schools

Survey Data Analysis. Qatar University. Dr. Kenneth M.Coleman - University of Michigan

Week TSX Index

Statistics in Medicine Research Lecture Series CSMC Fall 2014

APPLIED MATHEMATICS ADVANCED LEVEL

DATA INTERPRETATION AND STATISTICS

Normality Testing in Excel

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

Confidence Intervals for Spearman s Rank Correlation

Simple Predictive Analytics Curtis Seare

Part 2: Analysis of Relationship Between Two Variables

TRAINING PROGRAM INFORMATICS

Foundation of Quantitative Data Analysis

Glossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Battery Bias

Analysis of Questionnaires and Qualitative Data Non-parametric Tests

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Descriptive Analysis

Association Between Variables

Mathematics (MAT) MAT 061 Basic Euclidean Geometry 3 Hours. MAT 051 Pre-Algebra 4 Hours

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Point Biserial Correlation Tests

AP Physics 1 and 2 Lab Investigations

On the Practice of Dichotomization of Quantitative Variables

Statistics Graduate Courses

Permutation Tests for Comparing Two Populations

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Interpretation of Somers D under four simple models

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Tutorial 5: Hypothesis Testing

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

When to Use a Particular Statistical Test

Mathematics within the Psychology Curriculum

E3: PROBABILITY AND STATISTICS lecture notes

individualdifferences

Transcription:

334 Volume 25: Mini Workshops Statistical basics for Biology: p s, alphas, and measurement scales. Catherine Teare Ketter School of Marine Programs University of Georgia Athens Georgia 30602-3636 (706) 583-0862 cmscatk@uga.edu Catherine Teare Ketter received her B.S. and M.S. in Biology from the University of Alabama. She received her Ph.D. in research methodology and applied statistics in 1990 from the University of Alabama with an emphasis in the application of nonparametric multivariate techniques to biomedical and health behavioral data. Since 2000, Catherine has been responsible for developing the undergraduate marine sciences laboratory courses at the University of Georgia. During the summer Catherine teaches a nonscience major marine biology field course and a Marine Science for Teachers course. She holds a Master s level certificate in comprehensive science. Note: This is a summary of the original PowerPoint presentation made at the UNLV 2003 ABLE meeting. Reprinted From: Teare Ketter, C. 2004. Statistical basics for biology: p s, alphas, and measurement scales.. Pages 334-339, in Tested studies for laboratory teaching, Volume 25 (M. A. O Donnell, Editor). Proceedings of the 25 th Workshop/Conference of the Association for Biology Laboratory Education (ABLE), 414 pages. - Copyright policy: http://www.zoo.utoronto.ca/able/volumes/copyright.htm Although the laboratory exercises in ABLE proceedings volumes have been tested and due consideration has been given to safety, individuals performing these exercises must assume all responsibility for risk. The Association for Biology Laboratory Education (ABLE) disclaims any liability with regards to safety in connection with the use of the exercises in its proceedings volumes. 2004 Catherine Teare Ketter Statistical basics for Biology: p s, alphas, and measurement scales. Purpose: To address common misconceptions regarding and errors in the use of statistical procedures by basic science students. Overview of presentation 1. Scales of measurement 2. Nonparametric and parametric techniques 3. Descriptive versus inferential statistics 4. Population parameters versus sample statistics 5. Type I and Type II error 6. Tests of inference and modeling 7. Data transformation Scientists have long been concerned with the observation and quantification of relationships between objects and processes in biotic and abiotic systems. There has been controversy in the assignment of Association for Biology Laboratory Education (ABLE) ~ http://www.zoo.utoronto.ca/able

Volume 25: Mini Workshops 335 numbers to objects, behavioral and physical. In practice, this assignment of numbers to objects should be applied in a systematic and meaningful way. Scales of Measurement Data Types On the theory of scales and measurement. Stevens, S.S. 1946. Science. 103, 677-680. 1. Nominal Simplest level of measurement Discrete data which represent group membership to a category which does not have an underlying numerical value Data transformations which preserve group membership are acceptable Examples: ethnicity, color, pattern, soil type, media type, license plate numbers, football jersey numbers, etc. May also be dichotomous: present/absent, male/female, live/dead 2. Ordinal Next level of measurement From the word order Includes variables that can be ordered but for which there is no zero point and no exact numerical value Data transformations that preserve group membership and order are acceptable Often used for affective instruments which assess feelings or preferences Likert scale used in surveys strongly agree, agree, undecided, disagree, strongly disagree Distances between each ordered category are not necessarily the same Examples: shoe size, preference ranks (Thurstone rating scale) 3. Interval Includes group membership, order, and equal distance between points on the measurement scale Arbitrary zero point on measurement scale Data transformations which preserve group membership, rank-order, and interval equality are permissible Most common example of interval scale is temperature Celsius and Fahrenheit scales have arbitrary zero points, rank-order, an interval equality can convert from one scale to another Behavioral data semantic differential, composite subscale scores for intelligence, aptitude, creativity, etc. 4. Ratio Measurements in which group membership, rank-order, interval equality, and a true zero point are present Frequently encountered in the physical sciences Examples: acceleration (m/hr 2 or km/hr 2 ), velocity (m/hr or km/hr), density (mass/unit volume), and force (Newtons=kg m/s) Behavioral data does not produce ratio data o o Intelligence has no theoretical or observable zero point, for example. Only when a ratio of two known interval values on behavioral constructs is calculated are ratio level behavioral variables produced

336 Volume 25: Mini Workshops Permissible operations for ratio data include the multiplication by a positive constant Permissible transformations preserve group membership, rank-order, interval equality, and the existence of a zero point Summary of Measurement Scales (after S.S. Stevens, 1946) Scale Basic Empirical Operations Mathematical Group Structure Nominal Determination equality Permutation group x = f(x) f(x) means any one-to-one substitution Ordinal Interval Determination greater or lesser Determination of interval equality Isotonic group x = f(x) f(x) means any monotonic increasing function General linear group x = ax + b or differences Ratio Determination of equality Similarity of group of ratios x = ax Permissible Statistics Number of cases Mode Contingency correlation Median Percentile Ranks Rank-order Correlation Mean Standard Deviation Pearson product-moment Correlation Coefficient of Variation Table of Correlation Coefficients for Variables Measured with Different Scale Properties Adapted from Glass & Stanley (1970) Nominal Nominal with Underlying Normal Distribution Ordinal Nominal Nominal with Underlying Normal Distribution Ordinal Interval or Ratio phi * rank biserial point biserial * tetrachoric r rank biserial * * biserial Spearman s or Kendall s tau * Interval or Ratio point biserial biserial r * Pearson product moment correlation * No coefficient is specifically available.

Volume 25: Mini Workshops 337 Parametric versus nonparametric methods Nonparametric statistical methods Used with discrete, non-continuous data (nominal or ordinal variables) Used when the sample data are not normally distributed or when the underlying variable of interest is not normally distributed Parametric statistical methods Used with interval and ratio level data Used when the sample distribution is normally distributed Used when the variable of interest is normally distributed in the larger population from which the sample was taken Descriptive versus inferential statistics Descriptive statistics Operations performed on data which constitute a sample from some larger population Describe the properties of the sample population only Results are not generalizable to the larger population Limited sample size, restriction in range for the variable of interest in the sample, other factors limit the generalizability of study results Inferential statistics Operations performed sample data extracted from a larger parent population Sample statistics relate directly to the larger parent population from which the sample was taken Purpose of inferential statistical procedures is to evaluate hypotheses relating to a sample and generalize these relationships to the larger parent population Population parameters versus sample statistics scribed by a set of numerical measures called parameters Population mean, µ Population variance, σ Sample: A subset of measurements taken from a population The distribution of the parameter/variable in the sample should approximate the distribution of the parameter in the population problems: restriction of range Increasing sample size increases precision of the sample estimates of population parameters Sample mean, C (x-bar) Sample variance, s

338 Volume 25: Mini Workshops Type I and Type II error A type I error occurs when the null hypothesis is accepted when it is true. The probability of type I error is α. A type II error occurs when the null hypothesis is accepted when it is false. The probability of a type II error is β. For a given sample size, α and β are inversely related. As α increases, β decreases. A priori error rate that is acceptable for a test of hypothesis is α. α is usually set as 0.05 or 0.01 depending upon the purpose of the analyses. Decision Null Hypothesis False True Reject H 0 Correct Type I error Fail to reject H 0 Type II error Correct Distinction between p and α p is the probability associated with the acceptance of the null hypothesis, H 0, when it is true. Prior to a test of hypothesis, the researcher needs to set an a priori α, error rate. The probability associated with the calculated value of the test statistic is p. The level of statistical significance is determined by α, a test of hypothesis is significant if p α. Calculated values of the test statistic may have greater or lesser-associated probabilities, but it is not correct to say that one is more significant than another! Inferences and modeling The purpose of inferential statistical procedures is to evaluate hypotheses relating to a sample, and generalize these relationships to the larger parent population. Modeling generates an equation that can be used to predict future events using data collected over time. Modeling usually involves some type of regression analysis (can be linear, quadratic, polynomial, logistic, etc). Data transformation Data transformation is useful if: Data are not normally distributed Modeling Results of data analysis using transformed sample data are generalizable only to the population of transformed data. Transformation is NOT useful in the purpose of the analysis is to make inferences from the sample to the population from which the sample was derived.

Volume 25: Mini Workshops 339 Summary Four levels of measurement: nominal, ordinal, interval, and ratio Nonparametric statistical methods are used with nominal and ordinal data. Parametric statistical methods are used with interval and ratio data. Descriptive statistics produce results that describe the properties of the sample population only. Descriptive statistics cannot be used to make inferences about the larger population from which the sample was drawn. Descriptive statistics are appropriate when sampling is not random (restriction in range) or the underlying variable of interest is not normally distributed. The purpose of inferential statistical procedures is to evaluate hypotheses relating to a sample and to generalize these relationships to the larger parent population. A type I error occurs when the null hypothesis is accepted when it is true. The probability of type I error is α. A type II error occurs when the null hypothesis is accepted when it is false. The probability of a type II error is β. The level of statistical significance is determined by α, a test of hypothesis is significant if p α. Calculated values of the test statistic may have greater or lesser-associated probabilities, but it is not correct to say that one is more significant than another! Don t transform non-normally distributed data unless you are using a modeling technique if the purpose of the analyses is inferential, use the appropriate nonparametric technique! Take home message Take the time to talk with your class about measurement levels. The lowest level of measurement in your data set dictates the types of procedures you can employ in data analysis. Make a distinction between p and α. Statistics are a very useful tool but should not substitute for good experimental design. Design should dictate the analyses used. Have fun!