Chi-Square Distribution. is distributed according to the chi-square distribution. This is usually written

Similar documents
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Estimation and Confidence Intervals

Lecture 8. Confidence intervals and the central limit theorem

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Two-sample inference: Continuous data

Simple Linear Regression Inference

Section 13, Part 1 ANOVA. Analysis Of Variance

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Study Guide for the Final Exam

Chapter 7. One-way ANOVA

Probability Distribution

1.5 Oneway Analysis of Variance

Statistics Review PSY379

1. How different is the t distribution from the normal?

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Chapter 3 RANDOM VARIATE GENERATION

Description. Textbook. Grading. Objective

EMPIRICAL FREQUENCY DISTRIBUTION

Exact Confidence Intervals

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. level(#) , options2

Descriptive Analysis

Confidence Intervals for Cp

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Introduction to General and Generalized Linear Models

TImath.com. F Distributions. Statistics

Population Mean (Known Variance)

3.2 Measures of Spread

2 Sample t-test (unequal sample sizes and unequal variances)

Permutation Tests for Comparing Two Populations

Tests of Hypotheses Using Statistics

Name: Date: Use the following to answer questions 3-4:

Week 4: Standard Error and Confidence Intervals

How To Check For Differences In The One Way Anova

( ) = 1 x. ! 2x = 2. The region where that joint density is positive is indicated with dotted lines in the graph below. y = x

individualdifferences

PROBABILITY AND SAMPLING DISTRIBUTIONS

14.2 Do Two Distributions Have the Same Means or Variances?

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Fairfield Public Schools

Non Parametric Inference

Part 2: Analysis of Relationship Between Two Variables

Standard Deviation Estimator

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Descriptive Statistics

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Unit 26: Small Sample Inference for One Mean

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

2 Precision-based sample size calculations

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

STATISTICS FOR PSYCHOLOGISTS

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Multivariate normal distribution and testing for means (see MKB Ch 3)

Recall this chart that showed how most of our course would be organized:

Transformations and Expectations of random variables

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Introduction to Statistics and Quantitative Research Methods

One-Way Analysis of Variance (ANOVA) Example Problem

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

One-Way Analysis of Variance

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)

Non-Inferiority Tests for Two Means using Differences

Lecture Notes Module 1

Independent t- Test (Comparing Two Means)

ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

4. Continuous Random Variables, the Pareto and Normal Distributions

1 Sufficient statistics

Final Exam Practice Problem Answers

STATISTICS PROJECT: Hypothesis Testing

Statistics for Analysis of Experimental Data

3.4 Statistical inference for 2 populations based on two samples

Chapter 23 Inferences About Means

( ) is proportional to ( 10 + x)!2. Calculate the

2 ESTIMATION. Objectives. 2.0 Introduction

Additional sources Compilation of sources:

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

MATH. ALGEBRA I HONORS 9 th Grade ALGEBRA I HONORS

An Introduction to Statistics using Microsoft Excel. Dan Remenyi George Onofrei Joe English

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon


Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)

Analysing Questionnaires using Minitab (for SPSS queries contact -)

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Appendix 2 Statistical Hypothesis Testing 1

12: Analysis of Variance. Introduction

Lecture 19: Conditional Logistic Regression

MEASURES OF VARIATION

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Transcription:

Chi-Square Distribution If X i are k independent, normally distributed random variables with mean 0 and variance 1, then the random variable is distributed according to the chi-square distribution. This is usually written The chi-square distribution has one parameter: k - a positive integer that specifies the number of degrees of freedom (i.e. the number of X i ) A probability density function of the chi-square distribution is where Γ denotes the Gamma function, which takes particular values at the half-integers. In mathematics, the Gamma function (represented by the capitalized Greek letter Γ) is an extension of the factorial function to real and complex numbers. For a complex number z with positive real part it is defined by which can be extended to the rest of the complex plane, excepting the nonpositive integers. If z is a positive integer, then

chi-square

Degrees of Freedom Estimates of parameters can be based upon different amounts of information. The number of independent pieces of information that go into the estimate of a parameter is called the degrees of freedom (df). In general, the degrees of freedom of an estimate is equal to the number of independent scores that go into the estimate minus the number of parameters estimated as intermediate steps in the estimation of the parameter itself. For example, if the variance, σ², is to be estimated from a random sample of N independent scores, then the degrees of freedom is equal to the number of independent scores (N) minus the number of parameters estimated as intermediate steps (one, μ estimated by sample mean) and is therefore equal to N-1.

Student's t-distribution Student's t Parameters ν > 0 degrees of freedom (real) Support (pdf) Mean 0 for ν > 1, otherwise undefined Median 0 Mode 0 Variance, otherwise undefined Moment-generating function (mgf) (Not defined)

The Student's t-distribution (or also t-distribution). The derivation of the t-distribution was first published in 1908 by William Sealy Gosset, while he worked at a Guinness Brewery in Dublin. He was not allowed to publish under his own name, so the paper was written under the pseudonym Student. The t-test and the associated theory became well-known through the work of R.A. Fisher, who called the distribution "Student's distribution". Student's distribution arises when (as in nearly all practical statistical work) the population standard deviation is unknown and has to be estimated from the data. Textbook problems treating the standard deviation as if it were known are of two kinds: (1) those in which the sample size is so large that one may treat a data-based estimate of the variance as if it were certain, and (2) those that illustrate mathematical reasoning, in which the problem of estimating the standard deviation is temporarily ignored because that is not the point that the author or instructor is then explaining. Why use the Student's t-distribution Confidence intervals and hypothesis tests rely on Student's t-distribution to cope with uncertainty resulting from estimating the standard deviation from a sample, whereas if the population standard deviation were known, a normal distribution would be used. How Student's t-distribution comes about Suppose X 1,..., X n are independent random variables that are normally distributed with expected value μ and variance σ 2. Let be the sample mean, and be the sample variance. It is readily shown that the quantity is normally distributed with mean 0 and variance 1, since the sample mean distributed with mean μ and standard deviation. is normally

Gosset studied a related quantity, which differs from Z in that the exact standard deviation is replaced by the random variable. Technically, has a distribution by Cochran's theorem. Gosset's work showed that T has the probability density function with ν equal to n 1 and where Γ is the Gamma function. This may also be written as where B is the Beta function. The distribution of T is now called the t-distribution. The parameter ν is called the number of degrees of freedom. The distribution depends on ν, but not μ or σ; the lack of dependence on μ and σ is what makes the t-distribution important in both theory and practice. The moments of the t-distribution are It should be noted that the term for 0 < k < ν, k even, may be simplified using the properties of the Gamma function to

F-distribution Fisher-Snedecor Parameters deg. of freedom Support (pdf) Mean for d 2 > 2 Variance for d 2 > 4 Moment-generating function (mgf) see text for raw moments

In probability theory and statistics, the F-distribution is a continuous probability distribution. It is also known as Snedecor's F distribution or the Fisher-Snedecor distribution (after R.A. Fisher and George W. Snedecor). A random variate of the F-distribution arises as the ratio of two chi-squared variates: where U 1 and U 2 have chi-square distributions with d 1 and d 2 degrees of freedom respectively, and U 1 and U 2 are independent (see Cochran's theorem for an application). The F-distribution arises frequently as the null distribution of a test statistic, especially in likelihood-ratio tests, perhaps most notably in the analysis of variance; see F-test. The probability density function of an F(d 1, d 2 ) distributed random variable is given by for real x 0, where d 1 and d 2 are positive integers, and B is the beta function. One interesting property is that if