# Appendix 2 Statistical Hypothesis Testing 1

Save this PDF as:

Size: px
Start display at page:

## Transcription

1 BIL 151 Data Analysis, Statistics, and Probability By Dana Krempels, Ph.D. and Steven Green, Ph.D. Most biological measurements vary among members of a study population. These variations may occur for any number of reasons, from differences in genetic expression to the effects of environmental variables. In addition, there are variations in measurements that arise from the instruments used and from the way they re used, so the same event or object is unlikely to yield the same data twice, even from the same investigator let alone from different people and measuring tools. Hence, an investigator must measure as many individuals as possible to account for the variation in that population and must also employ analytic methods that account for errors, or variation, in the measurements themselves. When a particular value is being compared in two different populations, care must be taken to ensure that each population is represented as accurately and rigorously as possible by the samples and by measurements taken on them. This is one purpose of statistical analysis. I. Data, parameters and statistics In biological science, investigators often collect observations that can be tabulated as numerical facts, also known as data (singular = datum). Important measurements include counts (frequency) and those that describe characteristics (length, mass, etc.). Data from a sample are often used to calculate estimates of the average values of the population of interest (mean, mode, and median) and dispersion around those average values (variance, and standard deviation). There are three different types of data (Figure 1). Figure 1. A simple diagram showing the basic distinguishing features of qualitative ( either/or ) and quantitative data. Quantitative data can be either discrete (counted as integers) or continuous (measured along a continuum). 1. Qualitative/Attribute Data. These are descriptive, "either-or" measurements, and usually describe the presence or absence of a particular attribute. The presence or absence of a genetic trait ("freckles" or "no freckles") or type of genetic trait (A, B, AB or o blood types) are examples. Because such data have no specific sequence, they are considered unordered. 2. Quantitative/Numerical Data. These data correspond to numbers. Because they do have a specific sequence, they are considered ordered. Numerical data may be of two types: Appendix 2 Statistical Hypothesis Testing 1

2 a. Discrete Quantitative/Numerical Data. These correspond to biological observations counted as integers (whole numbers). The number of leaves on each member of a group of plants, the number of breaths per minute in a group of newborns or the number of beetles per square meter of forest floor are all examples of discrete numerical data. These data are ordered, but do not describe physical attributes of the things being counted. b. Continuous Quantitative/Numerical Data. These are data that fall along a numerical continuum. The limit of resolution of such data is the accuracy of the methods and instruments used to collect them. Examples are tail length, brain volume, percent body fat...anything that varies on a continuous scale. Rates (such as decomposition of hydrogen peroxide per minute or uptake of oxygen during respiration over the course of an hour) are also numerical continuous data. Exercise - Determining the Nature of Data For the next ten minutes, you and your teammates should make a list of five different classes EACH of (1) qualitative/attribute data, (2) discrete quantitative/numerical data, and (3) continuous quantitative/numerical data that you could MEASURE AND COLLECT FROM THE MEMBERS OF YOUR LABORATORY CLASS. (For example, you might list "hair color" as a type of attribute data, "number of hairs per square centimeter of scalp" as discrete numerical data, and "length of hair" as continuous numerical data.) List your examples in Table 1 below. When all groups are finished, your lab instructor will lead a brief discussion and ask each team to share some examples. Discuss whether each group's examples are correctly placed in each of the three categories, and why or why not. Table 1. Types of data collectible from Homo sapiens in BIL 151 laboratory: Attribute data Discrete numerical data Continuous numerical data Appendix 2 Statistical Hypothesis Testing 2

3 II. Mean, Mode and Median When an investigator collects numerical data from a group of subjects, s/he must determine how and with what frequency the data vary. For example, if one wished to study the distribution of shoe size in the human population, one might measure the shoe size of a subset that is a fair ( unbiased ) representation of the human population (say, a sample of 3,000 individuals) and graph the numbers with "shoe size" on the horizontal axis (also known as the abscissa) and "number of individuals" on the vertical axis (also known as the ordinate when it represents counts). The resulting figure would show the frequency distribution of the data: a representation of how often a particular value occurs in the sample. Measurements are never identical, so are distributed over a range of values. If they tend to cluster near the center of the range, then the sample can be fairly characterized by its arithmetic mean (the average value), its median (the value that places half the measurements above and half below) and its mode (the most common measurement in the range). These sample values can provide good estimates of the actual population values (parameters, defined below). To calculate a mean, sum the values measured for all individuals in a population of interest and divide it by the sample size. n _ Σ x i x = i=1 n III. Range, Variance and Standard Deviation When you are examining some measurable aspect of a population (such as a rate of reaction), the degree of variation around the mean should always be considered. In biological systems, there is almost always variation around the mean of any given value. In many biological studies, the estimation of degree of variation is as important, if not more important, than the mean. Measurements of variation include the range as well as dispersion around the mean, namely the variance and standard deviation. The simplest of these is the range, which is defined as the highest value minus the lowest value. Unfortunately, the greater the sample size, the greater the range, and because it employs essentially only the two extreme values, a great deal of information about variation between those extremes is lost. More useful are the variance and standard deviation, which are measures of deviations from the mean. The sample variance (s 2 ) is calculated as _ In which x is the sample mean, x is each individual value, and n is the sample size. Appendix 2 Statistical Hypothesis Testing 3

4 The standard deviation (s) is simply the square root of the variance: Parameters vs. Statistics If you were able to measure the second : fourth finger ratio of every adult Homo sapiens who ever existed, and then calculate a mean, median, mode, variance and standard deviation from those measurements, those values would be known as parameters. They represent the actual values as calculated from measuring every member of a population of interest. Obviously, it is very difficult to obtain data from every member of a population of interest, and impossible if that population is theoretically infinite in size. However, one can estimate parameters by randomly sampling members of the population. Such an estimate, calculated from measurements of a subset of the entire population, is known as a statistic. In general, parameters are written as Greek symbols equivalent to the Roman symbols used to represent statistics. For example, the standard deviation for a subset of an entire population is written as "s", whereas the true population parameter is written as σ. Statistics and statistical tests are used to test whether the results of an experiment are significantly different from what is expected. But what do we mean when we say "significant?" For that matter, what is meant by "expected" results? To answer these questions, we must consider the matter of probability. IV. Probability: What is meant by "expected results?" Probability calculations allow us to define the range of possible results of a series of trials, often in the form of a bell-shaped curve (also a frequency distribution) representing the likelihood of each possible result. In other words, these probability calculations allow us to determine likely or expected results from a particular series of events, be they a series of coin flips, rolls of the dice, or carefully controlled experimental trials. When an investigator samples a population, s/he is interested in determining whether some hypothesis about that population can or cannot be rejected based on the data gathered. How does an investigator know whether any "unusual" results are different enough from what is predicted or expected by a hypothesis so as to permit rejecting it, or whether that degree of difference might be simply due to chance? Let's use the example of an event that has several possible outcomes, with each of those outcomes having an equal probability of occurring. The probability (P) of any of those possible outcomes can be expressed as: a/n...where a = the # of occurrences of the event in question and n = the total number chances a particular result has to occur For example, because a die (that's the singular of dice) has six sides, there is always a 1/6 chance on any roll that any given number (let's say a "6") will come up. Appendix 2 Statistical Hypothesis Testing 4

5 When the probability of various events is known, scientific investigators can consider their combined probabilities: the likelihood of two or more events happening together. V. Hypotheses: one tail or two? The Scientific Process begins with three important steps: 1. Observation of some phenomenon that elicits a question/poses a problem. 2. Formulation of competing, testable hypotheses about that phenomenon 3. Prediction of all possible outcomes of experiments (or observations if experimentation is not possible) designed to test each hypothesis. For example, as you wander in a field of wildflowers, you notice that individuals of Mexican Poppies (Argemone mexicana) have two distinct morphologies, one with spines and one without (Figure 4). You also notice that the individuals with spines are far more numerous than the smooth individuals. You might wonder: Is there a reason for the difference in their numbers? Are spiny individuals better able to deter herbivores with their spines than the smooth individuals? That is your question. Figure 4. The Mexican Poppy, Argemone mexicana, has individuals with both spiny seed pods (shown) and smooth seed pods (not pictured). (http://commons.wikimedia.org) The question can then be stated as a testable hypothesis. In this case, you might state Individual Argemone mexicana with spiny seed pods suffer less seed predation than individuals with smooth seed pods. That is your overall, or experimental hypothesis. The next step is to formulate competing statistical hypotheses. Statistical hypotheses are stated in terms of two opposing statements, the null hypothesis (H o ) and the alternative hypothesis (H a ). The null hypothesis states that there is no significant difference between two populations being compared. The alternative hypothesis may be either directional (onetailed), stating the precise way in which the two populations will differ, or nondirectional (twotailed), not specifying the way in which two populations will differ. (Note from Dr. Green: almost nobody justifies the use of one-tailed tests at all, ever, under any circumstance and they imply that you deliberately blind yourself to an outcome in an unexpected direction. Such an outcome may be the most interesting results you ll ever have!) Appendix 2 Statistical Hypothesis Testing 5

7 The probability distribution of the Student s t-distribution is used to determine whether the observed difference between the means of two samples is unlikely to have arisen by chance if they were in fact drawn from the same population (or from populations with identical means). To make a very long and complex story short, you can use the mean, variance, and standard deviation of your reaction rates (treatment vs. control) to calculate a t-statistic. Every possible value of the t-statistic is linked to a certain probability that the observed difference in sample means is simply a matter of chance. B. The Student s t-test: A tool for determining whether there is a significant difference between two means The Student s t-test can be used to determine whether a difference between two means is significant. Note that significant in this sense is NOT the same as biologically meaningful. It refers only to whether the observed difference is unlikely to be due to chance ( statistically significant ). Means may be calculated from observations that are either paired (as when individuals in a single group are subjected to "before and after" measurements, and data points are paired for each tested individual) or independent (as when individuals in two similar sample populations are measured, but each individual in each sample population is measured only once). Slightly different calculations of the t-statistic must be used for paired means vs. independent means. Paired designs are always best because they eliminate the added influence on a difference in means arising from variation between samples due solely to their containing different individuals. Because your substrate (hydrogen peroxide) was permanently destroyed in your reaction, and because it is impossible to retrieve the same catalase molecules for repeated experiments, you used new reagents for each trial. Hence, you must analyze your results with the independent sample t test. 1. The Independent sample t-test The independent sample t-test is designed to show whether there is a significant difference between the means of two independent samples, those of your (1) treatment and (2) control groups. Use the following equation to calculate a t-statistic for your two independent mean (rates of reaction):...in which x 1 and and x 2 are the means of your two groups n 1 and n 2 are the numbers of trials under each condition s p 2 is the pooled variance, calculated as: Appendix 2 Statistical Hypothesis Testing 7

8 ...in which s 1 2 is the variance of group 1 s 2 2 is the variance of group 1 df 1 is the degrees of freedom for group 1 (df 1 = n 1-1) df 2 is the degrees of freedom for group 2 (df 2 = n 1 1) Recall that the variance (s 2 ) of a mean is calculated as In which _ x is the mean x is each individual value n is the sample size 2. Degrees of Freedom The last thing you must do is to determine the number of independent quantities in your system, or degrees of freedom. The degrees of freedom determine the significance level tied to every possible value of a statistic (such as the t-statistic). The degrees of freedom is the number of data points that are free to vary without changing the test statistic, and this changes depending on the type of statistic you are calculating. The degrees of freedom for the twosample t-test with independent means is calculated as the sum of the degrees of freedom of each test group: df = (n 1-1) + (n 2-1) C. Probability and significance The term "significant" is often used in every day conversation, yet few people know the statistical meaning of the word. In scientific endeavors, statistical significance has a very specific and important definition. The difference between an observed and expected result is said to be statistically significant if and only if: Under the assumption that there is no true difference, the probability that the observed difference would be at least as large as that actually seen simply as a matter of chance is less than a specific probability. For many kinds of biology that probability is usually set at 0.05 (or 5%) and is known as alpha or the probability of making a Type I error -- falsely rejecting the null hypothesis. Appendix 2 Statistical Hypothesis Testing 8

10 It is not enough to simply state that you have rejected or failed to reject a null or alternative hypothesis. You also must explain your results logically, and to the best of your ability on a molecular level. Check the rubric that your lab instructor will be using to evaluate your team s PowerPoint, and plan accordingly. It is linked to the syllabus on the BIL 151 home page. Table 3. Table of critical values for the two-sample t-test. The P levels (0.05) indicating rejection of the null hypothesis are shown in bold for both one-tailed and two-tailed hypotheses. (From Pearson and Hartley in Statistics in Medicine by T. Colton, Little, Brown and Co., Inc. publishers.) 2-tail --> tail --> df Appendix 2 Statistical Hypothesis Testing 10

### Biodiversity Data Analysis: Testing Statistical Hypotheses By Joanna Weremijewicz, Simeon Yurek, Steven Green, Ph. D. and Dana Krempels, Ph. D.

Biodiversity Data Analysis: Testing Statistical Hypotheses By Joanna Weremijewicz, Simeon Yurek, Steven Green, Ph. D. and Dana Krempels, Ph. D. In biological science, investigators often collect biological

### Inferential Statistics

Inferential Statistics Sampling and the normal distribution Z-scores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are

### Statistics Review PSY379

Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

### Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus

### Hypothesis Testing Level I Quantitative Methods. IFT Notes for the CFA exam

Hypothesis Testing 2014 Level I Quantitative Methods IFT Notes for the CFA exam Contents 1. Introduction... 3 2. Hypothesis Testing... 3 3. Hypothesis Tests Concerning the Mean... 10 4. Hypothesis Tests

### Descriptive Statistics

Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

### Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

### Unit 29 Chi-Square Goodness-of-Fit Test

Unit 29 Chi-Square Goodness-of-Fit Test Objectives: To perform the chi-square hypothesis test concerning proportions corresponding to more than two categories of a qualitative variable To perform the Bonferroni

### II. DISTRIBUTIONS distribution normal distribution. standard scores

Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

### LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

### A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

### Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................

### Inferential Statistics. Probability. From Samples to Populations. Katie Rommel-Esham Education 504

Inferential Statistics Katie Rommel-Esham Education 504 Probability Probability is the scientific way of stating the degree of confidence we have in predicting something Tossing coins and rolling dice

### Research Variables. Measurement. Scales of Measurement. Chapter 4: Data & the Nature of Measurement

Chapter 4: Data & the Nature of Graziano, Raulin. Research Methods, a Process of Inquiry Presented by Dustin Adams Research Variables Variable Any characteristic that can take more than one form or value.

### Mathematics. Probability and Statistics Curriculum Guide. Revised 2010

Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

### Chi Square Tests. Chapter 10. 10.1 Introduction

Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square

### Data Analysis. Lecture Empirical Model Building and Methods (Empirische Modellbildung und Methoden) SS Analysis of Experiments - Introduction

Data Analysis Lecture Empirical Model Building and Methods (Empirische Modellbildung und Methoden) Prof. Dr. Dr. h.c. Dieter Rombach Dr. Andreas Jedlitschka SS 2014 Analysis of Experiments - Introduction

### Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly

### Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam. Software Profiling Seminar, Statistics 101

Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam Software Profiling Seminar, 2013 Statistics 101 Descriptive Statistics Population Object Object Object Sample numerical description Object

### EBM Cheat Sheet- Measurements Card

EBM Cheat Sheet- Measurements Card Basic terms: Prevalence = Number of existing cases of disease at a point in time / Total population. Notes: Numerator includes old and new cases Prevalence is cross-sectional

### Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

### Probability Using Dice

Using Dice One Page Overview By Robert B. Brown, The Ohio State University Topics: Levels:, Statistics Grades 5 8 Problem: What are the probabilities of rolling various sums with two dice? How can you

### Fairfield Public Schools

Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

### Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

### Chapter 7. One-way ANOVA

Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

### HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

### Introduction to Regression and Data Analysis

Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

There are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month

### 2 GENETIC DATA ANALYSIS

2.1 Strategies for learning genetics 2 GENETIC DATA ANALYSIS We will begin this lecture by discussing some strategies for learning genetics. Genetics is different from most other biology courses you have

### AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

### In the past, the increase in the price of gasoline could be attributed to major national or global

Chapter 7 Testing Hypotheses Chapter Learning Objectives Understanding the assumptions of statistical hypothesis testing Defining and applying the components in hypothesis testing: the research and null

### Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

### Outline of Topics. Statistical Methods I. Types of Data. Descriptive Statistics

Statistical Methods I Tamekia L. Jones, Ph.D. (tjones@cog.ufl.edu) Research Assistant Professor Children s Oncology Group Statistics & Data Center Department of Biostatistics Colleges of Medicine and Public

### Hypothesis Testing. Bluman Chapter 8

CHAPTER 8 Learning Objectives C H A P T E R E I G H T Hypothesis Testing 1 Outline 8-1 Steps in Traditional Method 8-2 z Test for a Mean 8-3 t Test for a Mean 8-4 z Test for a Proportion 8-5 2 Test for

### Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

### LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

### Foundation of Quantitative Data Analysis

Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1

### Lecture Notes Module 1

Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

### Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

### t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

### Recall this chart that showed how most of our course would be organized:

Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

### MINITAB ASSISTANT WHITE PAPER

MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

### 6 Scalar, Stochastic, Discrete Dynamic Systems

47 6 Scalar, Stochastic, Discrete Dynamic Systems Consider modeling a population of sand-hill cranes in year n by the first-order, deterministic recurrence equation y(n + 1) = Ry(n) where R = 1 + r = 1

### NCSS Statistical Software

Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

### 4. Introduction to Statistics

Statistics for Engineers 4-1 4. Introduction to Statistics Descriptive Statistics Types of data A variate or random variable is a quantity or attribute whose value may vary from one unit of investigation

### Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

### Using Excel for inferential statistics

FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

### The F distribution and the basic principle behind ANOVAs. Situating ANOVAs in the world of statistical tests

Tutorial The F distribution and the basic principle behind ANOVAs Bodo Winter 1 Updates: September 21, 2011; January 23, 2014; April 24, 2014; March 2, 2015 This tutorial focuses on understanding rather

### Module 5 Hypotheses Tests: Comparing Two Groups

Module 5 Hypotheses Tests: Comparing Two Groups Objective: In medical research, we often compare the outcomes between two groups of patients, namely exposed and unexposed groups. At the completion of this

### Sampling and Hypothesis Testing

Population and sample Sampling and Hypothesis Testing Allin Cottrell Population : an entire set of objects or units of observation of one sort or another. Sample : subset of a population. Parameter versus

### Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay Lecture - 17 Shannon-Fano-Elias Coding and Introduction to Arithmetic Coding

### Study Guide for the Final Exam

Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

### Descriptive Statistics and Measurement Scales

Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

### DATA INTERPRETATION AND STATISTICS

PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

### Develop hypothesis and then research to find out if it is true. Derived from theory or primary question/research questions

Chapter 12 Hypothesis Testing Learning Objectives Examine the process of hypothesis testing Evaluate research and null hypothesis Determine one- or two-tailed tests Understand obtained values, significance,

### Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

### Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption Last time, we used the mean of one sample to test against the hypothesis that the true mean was a particular

### THE UNIVERSITY OF TEXAS AT TYLER COLLEGE OF NURSING COURSE SYLLABUS NURS 5317 STATISTICS FOR HEALTH PROVIDERS. Fall 2013

THE UNIVERSITY OF TEXAS AT TYLER COLLEGE OF NURSING 1 COURSE SYLLABUS NURS 5317 STATISTICS FOR HEALTH PROVIDERS Fall 2013 & Danice B. Greer, Ph.D., RN, BC dgreer@uttyler.edu Office BRB 1115 (903) 565-5766

### 6.4 Normal Distribution

Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

### Basic Concepts in Research and Data Analysis

Basic Concepts in Research and Data Analysis Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...3 The Research Question... 3 The Hypothesis... 4 Defining the

### Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

### Nonparametric Statistics

1 14.1 Using the Binomial Table Nonparametric Statistics In this chapter, we will survey several methods of inference from Nonparametric Statistics. These methods will introduce us to several new tables

### CHAPTER 13. Experimental Design and Analysis of Variance

CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection

### Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management

### TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2

About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing

### When to use Excel. When NOT to use Excel 9/24/2014

Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual

### NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

### Research Methods & Experimental Design

Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

### Measuring the Power of a Test

Textbook Reference: Chapter 9.5 Measuring the Power of a Test An economic problem motivates the statement of a null and alternative hypothesis. For a numeric data set, a decision rule can lead to the rejection

### statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals

Summary sheet from last time: Confidence intervals Confidence intervals take on the usual form: parameter = statistic ± t crit SE(statistic) parameter SE a s e sqrt(1/n + m x 2 /ss xx ) b s e /sqrt(ss

### Chapter Additional: Standard Deviation and Chi- Square

Chapter Additional: Standard Deviation and Chi- Square Chapter Outline: 6.4 Confidence Intervals for the Standard Deviation 7.5 Hypothesis testing for Standard Deviation Section 6.4 Objectives Interpret

### Random Uniform Clumped. 0 1 2 3 4 5 6 7 8 9 Number of Individuals per Sub-Quadrat. Number of Individuals per Sub-Quadrat

4-1 Population ecology Lab 4: Population dispersion patterns I. Introduction to population dispersion patterns The dispersion of individuals in a population describes their spacing relative to each other.

### CHI-SQUARE: TESTING FOR GOODNESS OF FIT

CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity

### HOW TO WRITE A LABORATORY REPORT

HOW TO WRITE A LABORATORY REPORT Pete Bibby Dept of Psychology 1 About Laboratory Reports The writing of laboratory reports is an essential part of the practical course One function of this course is to

### Point Biserial Correlation Tests

Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

### EMPIRICAL FREQUENCY DISTRIBUTION

INTRODUCTION TO MEDICAL STATISTICS: Mirjana Kujundžić Tiljak EMPIRICAL FREQUENCY DISTRIBUTION observed data DISTRIBUTION - described by mathematical models 2 1 when some empirical distribution approximates

### General Procedure for Hypothesis Test. Five types of statistical analysis. 1. Formulate H 1 and H 0. General Procedure for Hypothesis Test

Five types of statistical analysis General Procedure for Hypothesis Test Descriptive Inferential Differences Associative Predictive What are the characteristics of the respondents? What are the characteristics

### Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building

### Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

### Technology Step-by-Step Using StatCrunch

Technology Step-by-Step Using StatCrunch Section 1.3 Simple Random Sampling 1. Select Data, highlight Simulate Data, then highlight Discrete Uniform. 2. Fill in the following window with the appropriate

2. DATA AND EXERCISES (Geos2911 students please read page 8) 2.1 Data set The data set available to you is an Excel spreadsheet file called cyclones.xls. The file consists of 3 sheets. Only the third is

### 1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

### Appendix E: Graphing Data

You will often make scatter diagrams and line graphs to illustrate the data that you collect. Scatter diagrams are often used to show the relationship between two variables. For example, in an absorbance

### Analyzing and interpreting data Evaluation resources from Wilder Research

Wilder Research Analyzing and interpreting data Evaluation resources from Wilder Research Once data are collected, the next step is to analyze the data. A plan for analyzing your data should be developed

### Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

### Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

### Simple Linear Regression Inference

Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

### Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

### Chapter 4 Lecture Notes

Chapter 4 Lecture Notes Random Variables October 27, 2015 1 Section 4.1 Random Variables A random variable is typically a real-valued function defined on the sample space of some experiment. For instance,

### THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

### Factorial Analysis of Variance

Chapter 560 Factorial Analysis of Variance Introduction A common task in research is to compare the average response across levels of one or more factor variables. Examples of factor variables are income

### BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

### Statistical Functions in Excel

Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.

### 11. Analysis of Case-control Studies Logistic Regression

Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

### Experimental Designs (revisited)

Introduction to ANOVA Copyright 2000, 2011, J. Toby Mordkoff Probably, the best way to start thinking about ANOVA is in terms of factors with levels. (I say this because this is how they are described

### Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

### 5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

C H 8A P T E R Outline 8 1 Steps in Traditional Method 8 2 z Test for a Mean 8 3 t Test for a Mean 8 4 z Test for a Proportion 8 6 Confidence Intervals and Copyright 2013 The McGraw Hill Companies, Inc.