Appendix A The Chi-Square (32) Test in Genetics

Similar documents
Mendelian Genetics in Drosophila

2 GENETIC DATA ANALYSIS

Chapter 9 Patterns of Inheritance

The Genetics of Drosophila melanogaster

5 GENETIC LINKAGE AND MAPPING

CCR Biology - Chapter 7 Practice Test - Summer 2012

CHROMOSOMES AND INHERITANCE

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

PRACTICE PROBLEMS - PEDIGREES AND PROBABILITIES

Heredity. Sarah crosses a homozygous white flower and a homozygous purple flower. The cross results in all purple flowers.

I. Genes found on the same chromosome = linked genes

LAB 11 Drosophila Genetics

Genetics Lecture Notes Lectures 1 2

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Chromosomes, Mapping, and the Meiosis Inheritance Connection

Name: Class: Date: ID: A

Ringneck Doves. A Handbook of Care & Breeding

LAB : PAPER PET GENETICS. male (hat) female (hair bow) Skin color green or orange Eyes round or square Nose triangle or oval Teeth pointed or square

Hardy-Weinberg Equilibrium Problems

The Making of the Fittest: Natural Selection in Humans

Biology Final Exam Study Guide: Semester 2

17. A testcross A.is used to determine if an organism that is displaying a recessive trait is heterozygous or homozygous for that trait. B.

GENETIC CROSSES. Monohybrid Crosses

Mendelian and Non-Mendelian Heredity Grade Ten

The correct answer is c A. Answer a is incorrect. The white-eye gene must be recessive since heterozygous females have red eyes.

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Genetics for the Novice

Mendelian inheritance and the

Paternity Testing. Chapter 23

HLA data analysis in anthropology: basic theory and practice

5. The cells of a multicellular organism, other than gametes and the germ cells from which it develops, are known as

Heredity - Patterns of Inheritance

Testing Research and Statistical Hypotheses

Is it statistically significant? The chi-square test

7A The Origin of Modern Genetics

Human Blood Types: Codominance and Multiple Alleles. Codominance: both alleles in the heterozygous genotype express themselves fully

MCB41: Second Midterm Spring 2009

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = Adverse impact as defined by the 4/5ths rule was not found in the above data.

Appendix 2 Statistical Hypothesis Testing 1

Name: 4. A typical phenotypic ratio for a dihybrid cross is a) 9:1 b) 3:4 c) 9:3:3:1 d) 1:2:1:2:1 e) 6:3:3:6

Lesson Plan: GENOTYPE AND PHENOTYPE

Chapter 23. Two Categorical Variables: The Chi-Square Test

Chapter 4 Pedigree Analysis in Human Genetics. Chapter 4 Human Heredity by Michael Cummings 2006 Brooks/Cole-Thomson Learning

Basics of Marker Assisted Selection

P1 Gold X Black. 100% Black X. 99 Black and 77 Gold. Critical Values

CHAPTER 15 THE CHROMOSOMAL BASIS OF INHERITANCE. Section B: Sex Chromosomes

(1-p) 2. p(1-p) From the table, frequency of DpyUnc = ¼ (p^2) = #DpyUnc = p^2 = ¼(1-p)^2 + ½(1-p)p + ¼(p^2) #Dpy + #DpyUnc

Drosophila Genetics by Michael Socolich May, 2003

Bio EOC Topics for Cell Reproduction: Bio EOC Questions for Cell Reproduction:

Genetics and Evolution: An ios Application to Supplement Introductory Courses in. Transmission and Evolutionary Genetics

Two copies of each autosomal gene affect phenotype.

DRAGON GENETICS LAB -- Principles of Mendelian Genetics

A trait is a variation of a particular character (e.g. color, height). Traits are passed from parents to offspring through genes.

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Answer Key Problem Set 5

Chi Square Tests. Chapter Introduction

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Baby Lab. Class Copy. Introduction

Problems 1-6: In tomato fruit, red flesh color is dominant over yellow flesh color, Use R for the Red allele and r for the yellow allele.

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

Module 4 (Effect of Alcohol on Worms): Data Analysis

Basic Principles of Forensic Molecular Biology and Genetics. Population Genetics

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Descriptive Statistics

Title: Genetics and Hearing Loss: Clinical and Molecular Characteristics

Association Between Variables

EMPIRICAL FREQUENCY DISTRIBUTION

Genetics Module B, Anchor 3

Gene Mapping Techniques

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Influence of Sex on Genetics. Chapter Six

A and B are not absolutely linked. They could be far enough apart on the chromosome that they assort independently.

Biology 1406 Exam 4 Notes Cell Division and Genetics Ch. 8, 9

EXERCISE 11 MENDELIAN GENETICS PROBLEMS

BIOLOGY HIGHER LEVEL

Independent samples t-test. Dr. Tom Pierce Radford University

DNA Determines Your Appearance!

1. You are studying three autosomal recessive mutations in the fruit fly Drosophila

2 18. If a boy s father has haemophilia and his mother has one gene for haemophilia. What is the chance that the boy will inherit the disease? 1. 0% 2

Nonparametric Tests. Chi-Square Test for Independence

Difference of Means and ANOVA Problems

Got Lactase? The Co-evolution of Genes and Culture

Genetics with a Smile

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

BioBoot Camp Genetics

Biology Notes for exam 5 - Population genetics Ch 13, 14, 15

UNDERSTANDING THE TWO-WAY ANOVA

Using Excel for inferential statistics

Chapter 5 Analysis of variance SPSS Analysis of variance

3.4 Statistical inference for 2 populations based on two samples

Terms: The following terms are presented in this lesson (shown in bold italics and on PowerPoint Slides 2 and 3):

GAW 15 Problem 3: Simulated Rheumatoid Arthritis Data Full Model and Simulation Parameters

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

Bio 102 Practice Problems Mendelian Genetics and Extensions

B2 5 Inheritrance Genetic Crosses

The Chi-Square Test. STAT E-50 Introduction to Statistics

Transcription:

Appendix A The Chi-Square (32) Test in Genetics With infinitely large sample sizes, the ideal result of any particular genetic cross is exact conformation to the expected ratio. For example, a cross between two heterozygotes should produce an exact 3:1 ratio of dominant to recessive phenotypes. In any particular real- world experiment, with limited and sometimes very small sample sizes, results are expected to deviate somewhat from the exact theoretical ratio, due simply to chance. In order to evaluate a genetic hypothesis (for example, that a particular trait is due to a recessive allele segregating at a locus), we need a means to distinguish an experimental result that is consistent with the hypothesis within the bounds of simple chance deviations, apart from one that is intrinsically unlikely ( wrong ), given the data. Statistical tests are a means of quantifying the results of an experiment as evidence for or against a particular hypothesis. One of the simplest statistical tests is Chi-Square (32) Analysis, which compares the "goodness of fit" between observed and expected counts. An hypothesis is developed that predicts how a set of observations will fall into each of two or more categories (the expected result). These counts are compared with the experimental data (the observed result). Allowing for the sample size, the differences among the observed and expected results are reduced to a single number, the chi-square value. Because larger deviations from expectation are expected with more categories, the test also takes into account the degrees of freedom in the experiment. Comparison with a table of probability values shows the probability that the observed deviation could have been obtained by chance alone. [See the website for Bio2900 (Principles of Evolution) for further discussion of the concepts of null hypothesis, significance testing, and Type I & II error: http://www.mun.ca/biology/scarr/2900_hypothesis_testing.htm]. CHI-SQUARE CALCULATIONS The chi-square formula is: O = observed # of individuals with phenotype 3 2 = [(O - E) 2 / E] E = expected # of individuals with phenotype = sum of deviations for all phenotypes i) The chi-square test should be used only on the numerical data themselves, not on ratios or percentages derived from the data. ii) iii) iv) In experiments where the expected frequency in any phenotypic class is less than five, the true probability is usually slightly larger than the p given in the table. Expected values should be adjusted to the closest integer [you would not expect a fraction of an individual]; the sum totals of observed and expected values should not differ more than one individual. These calculations neglect a number of corrections, including those for small expected classes, multiple simultaneous tests, and the one-tail or two-tail nature of the test.

These will be discussed in your statistics course.

INTERPRETATION OF THE CHI-SQUARE TABLE The table of chi-square values (below) can be used to determine the probability that any particular result (observed deviation) could have been obtained by chance alone. i) Horizontal rows indicate the number of degrees of freedom (df). In general, df = (n - 1) where n is the number of observed classes or phenotypes. For each row, the p value at the top of the column is the probability that a particular chi-square value could have been obtained by chance alone. This is called the critical value for that level of probability. ii) iii) To use the table, enter it on the row corresponding to the number of degrees of freedom. Look for the column with the critical value closest to and less than the chi-square obtained in your calculations. The probability of your result is less than the p value for that column. For a biological experiment, we typically set the level of significance at p = 0.05. If the observed chi-square is greater than the critical value for p = 0.05, we conclude that the result could have been obtained by chance less than 5% of the time, and we reject the null hypothesis that chance alone is responsible for the result. We say that this result is statistically significant. Example #1: 3 2 = 15.0 and n = 8 phenotypes. The observed chi-square of 15.0 with df = 7 is greater than the critical value of 14.07 for p <.05. This means that the observed deviation represented by this chi-square value would be expected to occur by chance less than 5% of the time. The result is statistically significantly, and the hypothesis (that the data are consistent with the expected ratio) can be rejected. Example #2: 3 2 = 5.0 and n = 8 phenotypes. The observed chi-square of 5.0 with df = 7 lies between the values 2.83 and 6.35, which correspond to.90 > p >.50. A deviation as large as that observed would be expected to occur by chance more than 50% of the time. The difference between the observed and expected results is not statistically significant, and the null hypothesis (the ratio being tested) cannot be rejected. p = 0.9 0.50 0.20 0.05 0.01 0.001 df = 1 0.02 0.46 1.64 3.84 6.64 10.83 2 0.21 1.39 3.22 5.99 9.21 13.82 3 0.58 2.37 4.64 7.82 11.35 16.27 4 1.06 3.36 5.99 9.49 13.28 18.47 5 1.61 4.35 7.29 11.07 15.09 20.52 6 2.20 5.35 8.56 12.59 16.81 22.46 7 2.83 6.35 9.80 14.07 18.48 24.32 8 3.49 7.34 11.03 15.51 20.09 26.13 9 4.17 8.34 12.24 16.92 21.67 27.88 10 4.87 9.34 13.44 18.31 23.21 29.59 15 8.55 14.34 19.31 25.00 30.58 37.30 25 16.47 24.34 30.68 37.65 44.31 52.62 50 37.69 49.34 58.16 67.51 76.15 86.6

Appendix B Gene Map of Drosophila melanogaster From William S. Klug, Michael R. Cummings, Concepts of Genetics, Macmillan, 1994, 4 th ed., p. 132. This material has been copied under licence from CANCOPY. Resale or further copying of this material is strictly prohibited.

Appendix C Genetic Nomenclature & Notation for Drosophila Clear notation for any Drosophila genotype will indicate whether the locus involved is on an autosomal (II, III, or IV) or sex chromosome (I(X) or Y), and in the case of two (or more) loci, whether they are the same or different chromosomes (linked or unlinked, respectively). Dominant alleles at a locus are indicated by a capitalized symbol, recessive alleles by a lowercase symbol. Examples of such notation are as follows. 1) One autosomal locus: e.g. The genotype for ebony body on Chromosome III is ee, for wild-type body at that locus e + e +. For autosomal genes the genotype is the same for male and female, and can be homozygous or heterozygous. 2) One sex-linked locus: In Drosophila alleles may be present on the X chromosome but not on the Y chromosome, therefore the genotypes for male and female are different. The symbol ( ) indicates a male Y sex-chromosome and therefore the presence of only one allele. e.g. Bar eye on Chromosome I. Bar eye female has genotype BB, Bar eye male is B. Wildtype eye female is B + B +, wild-type eye male is B +. 3) Two unlinked autosomal loci e.g. vestigial wing (II) and ebony body (III) would have genotype vgvg ee and wild-type (wing and body at these loci) would have vg + vg + e + e +. 4) Two linked autosomal loci e.g. curled wing (III, 50.0) and ebony body (III, 70.7). The genotype is written to show the alleles on each homologue cu e/ cu e. Wild-type would be cu + e + / cu + e +. 5) Two sex-linked loci e.g. Bar eye (I 57.0) and forked bristle (I, 56.7). Female is Bf / Bf, male is Bf /. Wild-type female is B + f + / B + f +, wild-type male is B + f + /. 6) One sex-linked & one autosomal loci e.g. Bar eye (I) and vestigial wing (II). Female is BB vgvg, male is B vgvg wild-type female is B + B + vg + vg +, wild-type male is B + vg + vg +.