Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part II)

Size: px
Start display at page:

Download "Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part II)"

Transcription

1 Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part II) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December / 34

2 1 Introduction 2 Hypothesis testing Overview Different tests... Outcomes of hypothesis testing Level of significance General methodology 3 The Student test Framework Student test-statistic Decision rules 4 The Fisher test Framework Fisher test-statistic Decision rule 5 Summary 6 Appendix Florian Pelgrin (HEC) Constrained estimators September-December / 34

3 1. Introduction Introduction Aim: Testing parameters in the multiple linear regression model: Assume normality of the error terms exact distribution of estimators; Different tests... Testing hypothesis about individual parameters; Testing single hypothesis involving more than one parameter; Testing multiple restrictions....using the Student and Fisher test statistics Florian Pelgrin (HEC) Constrained estimators September-December / 34

4 Introduction The main assumption is: u X N ( ) 0, σ0 2 I n i.e. the population error u is independent of X and normally distributed with zero mean and variance σ 2 0 I n. This could be a strong requirement (especially, in finite samples). This assumption implies the zero conditional mean assumption (Assumption 3, Chapter 2) and the spherical error terms assumption (Assumption 4, Chapter 2) other assumptions are assumed to hold. Florian Pelgrin (HEC) Constrained estimators September-December / 34

5 2. Hypothesis testing 2.1. Overview Hypothesis testing Overview A hypothesis is a statement about a population parameter. A standard hypothesis testing problem involves two complementary hypothesis: 1 The null hypothesis: H 0 2 The alternative hypothesis: H 1 or H a. If θ 0 is the parameter (vector) of interest: 1 H 0 : θ 0 Θ 0 Θ 2 H 1 : θ 0 Θ 1 Θ, with Θ 1 = Θ Θ 0. Florian Pelgrin (HEC) Constrained estimators September-December / 34

6 Hypothesis testing Overview The null hypothesis is tested by finding an appropriate subset of outcomes W Θ called the rejection region or critical region. For example, a standard rejection region W is of the form: W = {z : T S (z) > c} where T S is a test statistic and c is a critical value. The complement of the rejection region is the acceptance region. A (hypothesis) test is thus a rule that specified: 1 For which sample values the decision is made to fail to reject H 0 as true; 2 For which sample values the decision is made to reject H 0. Florian Pelgrin (HEC) Constrained estimators September-December / 34

7 2.2. Different tests... Hypothesis testing Different tests... Definition A hypothesis of the form H 0 : θ 0 = θ is called a simple hypothesis. A hypothesis of the form H 0 : θ 0 θ or H 0 : θ 0 θ is called a composite hypothesis. Florian Pelgrin (HEC) Constrained estimators September-December / 34

8 Hypothesis testing Different tests... Definition A test of the form: { H0 : θ 0 = θ H a : θ 0 θ is called a two-sided test. Example: { H0 : β 0,j = 0 H a : β 0,j 0 The null hypothesis means x j has no effect on y once the other x s are controlled for ; β 0,j 0 is a natural alternative hypothesis when there is no prior knowledge about the sign... Florian Pelgrin (HEC) Constrained estimators September-December / 34

9 Hypothesis testing Different tests... Definition A test of the form: { H0 : θ 0 θ H a : θ 0 > θ or { H0 : θ 0 θ H a : θ 0 < θ is called a one-sided test. Florian Pelgrin (HEC) Constrained estimators September-December / 34

10 Hypothesis testing Outcomes of hypothesis testing 2.3. Outcomes of hypothesis testing Truth Decision Fail to reject H 0 Reject H 0 H 0 Correct decision Type I error H a Type II error Correct decision 1 The probability of Type I error: P(reject H 0 H 0 is true). 2 The probability of Type II error: P(fail to reject H 0 H 1 is true). Florian Pelgrin (HEC) Constrained estimators September-December / 34

11 Hypothesis testing Outcomes of hypothesis testing Definition The power function of a test is defined by: β(θ 0 ) = 1 P(fail to reject H 0 H 1 ) 1 η(θ 0 ) The size of a test is defined to be: α = sup θ 0 P(reject H 0 H 0 ) A test is said to have level α if its size is less than or equal to α. Remark: One seeks to determine the critical region such that the power is maximum and with minimal size distortion. Florian Pelgrin (HEC) Constrained estimators September-December / 34

12 Hypothesis testing 2.4. Level of significance Level of significance The decision Reject H 0 or fail to reject H 0 is not so informative! Indeed, there is some arbitrariness to the choice of α (level). Another strategy is to ask, for every α, whether the test rejects at that level. Another alternative is to use the so-called p-value the smallest level of significance at which H 0 would be rejected given the value of the test-statistic. Florian Pelgrin (HEC) Constrained estimators September-December / 34

13 Hypothesis testing Level of significance More formally... Definition Suppose that for every α (0, 1), one has a size α test with rejection region W α. Then, the p-value is defined to be: p-value = inf{α : T S (Y ) W α }. The p-value is the smallest level at which one can reject H 0. The p-value is a measure of evidence against H 0 : p-value evidence <.01 Very strong evidence against H Strong evidence against H Weak evidence against H 0 >.10 Little or no evidence against H 0 Florian Pelgrin (HEC) Constrained estimators September-December / 34

14 Hypothesis testing Level of significance Remarks: 1. A large p-value does not mean strong evidence in favor of H A large p-value can occur for two reasons: 1 H 0 is true; 2 H 0 is false but the test has low power. 3. The p-value is not the probability that the null hypothesis is true! Florian Pelgrin (HEC) Constrained estimators September-December / 34

15 Hypothesis testing 2.5. General methodology General methodology Hypothesis testing is defined by the following general procedure Step 1: State the relevant null and alternative hypotheses (mis-stating the hypotheses muddies the rest of the procedure!); Step 2: Consider the statistical assumptions being made about the sample in doing the test (independence, distributions, etc) incorrect assumptions mean that the test is invalid! Step 3: Choose the appropriate test (exact or asymptotic tests) and thus state the relevant test statistic (say, T ). Step 4: Derive the distribution of the test statistic under the null hypothesis (sometimes it is well-known, sometimes it is more tedious!) for example, the Student t-distribution or the Fisher distribution. Florian Pelgrin (HEC) Constrained estimators September-December / 34

16 Hypothesis testing General methodology 2.5. General methodology (continued) Step 5: Determine the critical region (and thus the acceptance region). Step 6: Compute (using the observations!) the observed value of the test statistic T, say t obs. Step 7: Decide to either fail to reject the null hypothesis or reject in favor of the alternative assumption the decision rule is to reject the null hypothesis H 0 if the observed value of the test statistic, t obs is in the critical region, and to fail to reject the null hypothesis otherwise. Florian Pelgrin (HEC) Constrained estimators September-December / 34

17 3. The Student test 3.1. Framework The Student test Framework Consider the (population) model Y = Xβ 0 + u that satisfies the assumptions H1 H6 (Chapter 2). The β j s are unknown features of the population, but: One can formulate a hypothesis about their value; One can construct a test statistic with a known distribution under the maintained hypothesis; One can take a decision meaning reject H 0 if the value of the test statistic is too unlikely. Florian Pelgrin (HEC) Constrained estimators September-December / 34

18 The Student test Framework Three tests of interest: { H0 : β 0,j = a j H a : β 0,j > a j or < a j { H0 : β 0,j = a j H a : β 0,j a j and { H0 : β 0,j = β l H a : not H 0 where a j = 0 or a j 0. Florian Pelgrin (HEC) Constrained estimators September-December / 34

19 The Student test Framework All of these tests are based on the following results: 1. The exact distribution of the ordinary least squares estimator of β: ( ˆβ OLS X N β 0, σ0 2 (X X) 1). 2. Any linear combination of the ˆβ j,ols s is also normally distributed: A ˆβ OLS X N ( Aβ 0, σ 2 0A (X X) 1 A ). 3. Any subset of ˆβ OLS has a joint normal distribution. Florian Pelgrin (HEC) Constrained estimators September-December / 34

20 The Student test 3.2. Student test-statistic Student test-statistic Definition The Student test-statistic is defined to be: T s = ˆβ j,ols a ( ) j Γ(n k) se ˆβ j,ols where n is the number of observations, k is the number of explanatory variables (including the constant term), Γ is the Student t-distribution, and: ( ) se ˆβ j,ols = ˆσ OLS mjj with m jj is the j th diagonal element of (X X) 1. Proof: See Appendix 1. Florian Pelgrin (HEC) Constrained estimators September-December / 34

21 The Student test Decision rules 3.3. Decision rules Testing H 0 : β 0,j = a j against H a : β 0,j > a j Definition The critical region is that H 0 is rejected in favor of β 0,j > a j at the α% (say, 5%) significance level if: t obs > c 1 α where c 1 α is the α% (say, 5%) critical value or the percentile of order 1 α of a Student t-distribution with n k degrees of freedom, i.e. the solution of: α = P (reject H 0 H 0 is true) = P (T s > c 1 α ). Florian Pelgrin (HEC) Constrained estimators September-December / 34

22 The Student test Decision rules Figure 1: 5% rejection rule for H a : β 0,j > 0 (df=28) Area = t Rejection region 5% rejection rule for (df=18) Florian Pelgrin (HEC) Constrained estimators September-December / 34

23 The Student test Decision rules Testing H 0 : β 0,j = a j against H a : β 0,j < a j Definition The critical region is that H 0 is rejected in favor of β 0,j < a j at the α% (say, 5%) significance level if: t obs < c α where c α is the α% (say, 5%) critical value or the percentile of order α of a Student t-distribution with n k degrees of freedom, i.e. the solution of: α = P (reject H 0 H 0 is true) = P (T s < c α ). Florian Pelgrin (HEC) Constrained estimators September-December / 34

24 The Student test Decision rules Figure 2: 5% rejection rule for H a : β 0,j < 0 (df=18) Area = Rejection region t Florian Pelgrin (HEC) Constrained estimators September-December / 34

25 The Student test Decision rules Testing H 0 : β 0,j = a j against H a : β 0,j a j Definition The critical region is that H 0 is rejected in favor of β 0,j a j at the α% (say, 5%) significance level if: t obs > c 1 α 2 where c 1 α is the α 2 2 % (say, 2.5%) critical value or the percentile of order 1 α 2 of a Student t-distribution with n k degrees of freedom, i.e. the solution of: α ( ) 2 = P (reject H 0 H 0 is true) = P T s > 1 c 1 α. 2 Remark: c 1 α is chosen such that the probability mass in each tail of 2 the t-distribution equals α 2. Florian Pelgrin (HEC) Constrained estimators September-December / 34

26 The Student test Decision rules Figure 3: 5% rejection rule for H a : β 0,j 0 (df=25) Area = Area = Rejection region t Rejection region Other hypotheses about Florian Pelgrin (HEC) Constrained estimators September-December / 34

27 The Student test Decision rules Testing a linear combination of the parameters Suppose the test is: { H0 : β 0,j = β 0,l H a : β 0,j β 0,l The Student test statistic is given by: T s = ˆβ j,ols ˆβ ( l,ols ) se ˆβ j,ols ˆβ l,ols where ( se ˆβj,OLS ˆβ ) l,ols = V ( ) ( ) ( ˆβj,OLS + V ˆβl,OLS 2Cov ˆβj,OLS, ˆβ ) l,ols. The procedure is then the same as a standard two-sided test. Florian Pelgrin (HEC) Constrained estimators September-December / 34

28 The Fisher test Framework 4. The Fisher test 4.1. Framework Consider the linear Gaussian regression model (with suitable regularity conditions) and suppose that one seeks to test: H 0 : Rβ 0 = q constrained model H a : Rβ 0 q unconstrained model. Florian Pelgrin (HEC) Constrained estimators September-December / 34

29 The Fisher test Fisher test-statistic 4.2. Fisher test-statistic Definition The Fisher test statistic is given by: where T F = SSR 0 SSR a SSR a dl a dl 0 dl a F(dl 0 dl a, dl a ). dl 0 is the number of degrees of freedom under the null hypothesis (i.e., dl 0 = n (k p)); dl a is the number of degrees of freedom under the alternative hypothesis (i.e., dl a = n k); SSR 0 is the sum of squared (estimated) residuals under the null hypothesis; SSR a is the sum of squared (estimated) residuals under the alternative hypothesis. Florian Pelgrin (HEC) Constrained estimators September-December / 34

30 The Fisher test Fisher test-statistic Definition Consider the Gaussian multiple linear regression model: Y = Xβ 0 + u. The Fisher test statistic corresponding to { H0 : Rβ 0 = q H a : Rβ 0 q is given by: T F = û COLS 2 û OLS 2 û OLS 2 n k p F(p, n k). Florian Pelgrin (HEC) Constrained estimators September-December / 34

31 The Fisher test Fisher test-statistic A second derivation of the Fisher test statistic Definition Under suitable regularity conditions, the Fisher test statistic can also be written as: T F = 1 p σ 2 OLS ( R β ) OLS q [R(X X) 1 R ] 1 ( R β ) OLS q. Remark: An equivalent test statistic is also: T F = 1 ( βols p σ OLS 2 β ) COLS (X X) ( βols β ) COLS. Exercise: Show these two results. Florian Pelgrin (HEC) Constrained estimators September-December / 34

32 4.3. Decision rule The Fisher test Decision rule 1 Determine SSR 0 (constrained model) and SSR a (unconstrained model); 2 Determine the realization of the Fisher test statistic: F obs = (SSR 0 SSR a ) /p SSR a / (n k) where p is the number of linearly independent constraints. 3 Determine the critical region: W = {T F > F 1 α } and especially the critical value which corresponds to the (1 α) percentile of a Fisher distributed random variable F(p, n k). 4 Reject the null assumption H 0 (at the level α) if F obs > F 1 α. Florian Pelgrin (HEC) Constrained estimators September-December / 34

33 5. Summary Summary What is a simple hypothesis? A composite hypothesis? What is a one-sided test? A two-sided test? What is Type-I error? Type-II error? What is the p-value? What is the critical value? Decision rule? What is a Student test? In which cases can we use it? Derive the Student test statistic in standard cases. What is a Fisher test? In which sense(s) is it different from the Student test? Derive the different Fisher test statistics. Interpretation of results! Florian Pelgrin (HEC) Constrained estimators September-December / 34

34 Appendix Appendix 1: Student test statistic The result is based on the following theorem: Theorem Let U and V denote two independent random variables: Then: Using U = ˆβ j,ols a j σ jj U N (0, 1) Z χ 2 (n 1 ). U Z n 1 Γ(n 1 ). (with σ jj = σ 0 mjj ), Z = (n k) ˆσ2 OLS, n σ0 2 1 = n k, and the result that U are V are independent in probability (see Chapter 2), one gets the result. Florian Pelgrin (HEC) Constrained estimators September-December / 34

Chapter 4: Statistical Hypothesis Testing

Chapter 4: Statistical Hypothesis Testing Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935) Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis

More information

These axioms must hold for all vectors ū, v, and w in V and all scalars c and d.

These axioms must hold for all vectors ū, v, and w in V and all scalars c and d. DEFINITION: A vector space is a nonempty set V of objects, called vectors, on which are defined two operations, called addition and multiplication by scalars (real numbers), subject to the following axioms

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Chapter 3: The Multiple Linear Regression Model

Chapter 3: The Multiple Linear Regression Model Chapter 3: The Multiple Linear Regression Model Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans November 23, 2013 Christophe Hurlin (University of Orléans) Advanced Econometrics

More information

Penalized regression: Introduction

Penalized regression: Introduction Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Orthogonal Diagonalization of Symmetric Matrices

Orthogonal Diagonalization of Symmetric Matrices MATH10212 Linear Algebra Brief lecture notes 57 Gram Schmidt Process enables us to find an orthogonal basis of a subspace. Let u 1,..., u k be a basis of a subspace V of R n. We begin the process of finding

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

August 2012 EXAMINATIONS Solution Part I

August 2012 EXAMINATIONS Solution Part I August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,

More information

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

Non Parametric Inference

Non Parametric Inference Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

6.2 Permutations continued

6.2 Permutations continued 6.2 Permutations continued Theorem A permutation on a finite set A is either a cycle or can be expressed as a product (composition of disjoint cycles. Proof is by (strong induction on the number, r, of

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

Multiple Linear Regression in Data Mining

Multiple Linear Regression in Data Mining Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple

More information

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium

More information

1 Another method of estimation: least squares

1 Another method of estimation: least squares 1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i

More information

Testing a claim about a population mean

Testing a claim about a population mean Introductory Statistics Lectures Testing a claim about a population mean One sample hypothesis test of the mean Department of Mathematics Pima Community College Redistribution of this material is prohibited

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

Multivariate normal distribution and testing for means (see MKB Ch 3)

Multivariate normal distribution and testing for means (see MKB Ch 3) Multivariate normal distribution and testing for means (see MKB Ch 3) Where are we going? 2 One-sample t-test (univariate).................................................. 3 Two-sample t-test (univariate).................................................

More information

Network quality control

Network quality control Network quality control Network quality control P.J.G. Teunissen Delft Institute of Earth Observation and Space systems (DEOS) Delft University of Technology VSSD iv Series on Mathematical Geodesy and

More information

6: Introduction to Hypothesis Testing

6: Introduction to Hypothesis Testing 6: Introduction to Hypothesis Testing Significance testing is used to help make a judgment about a claim by addressing the question, Can the observed difference be attributed to chance? We break up significance

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., CHAPTER 13 Nonparametric and Distribution-Free Statistics Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., 2 tests for goodness of fit and independence).

More information

Solución del Examen Tipo: 1

Solución del Examen Tipo: 1 Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

Testing theory an introduction

Testing theory an introduction Testing theory an introduction P.J.G. Teunissen Series on Mathematical Geodesy and Positioning Testing theory an introduction Testing theory an introduction P.J.G. Teunissen Delft University of Technology

More information

3. Mathematical Induction

3. Mathematical Induction 3. MATHEMATICAL INDUCTION 83 3. Mathematical Induction 3.1. First Principle of Mathematical Induction. Let P (n) be a predicate with domain of discourse (over) the natural numbers N = {0, 1,,...}. If (1)

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Arkansas Tech University MATH 4033: Elementary Modern Algebra Dr. Marcel B. Finan

Arkansas Tech University MATH 4033: Elementary Modern Algebra Dr. Marcel B. Finan Arkansas Tech University MATH 4033: Elementary Modern Algebra Dr. Marcel B. Finan 3 Binary Operations We are used to addition and multiplication of real numbers. These operations combine two real numbers

More information

Redwood Building, Room T204, Stanford University School of Medicine, Stanford, CA 94305-5405.

Redwood Building, Room T204, Stanford University School of Medicine, Stanford, CA 94305-5405. W hittemoretxt050806.tex A Bayesian False Discovery Rate for Multiple Testing Alice S. Whittemore Department of Health Research and Policy Stanford University School of Medicine Correspondence Address:

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

SECTION 10-2 Mathematical Induction

SECTION 10-2 Mathematical Induction 73 0 Sequences and Series 6. Approximate e 0. using the first five terms of the series. Compare this approximation with your calculator evaluation of e 0.. 6. Approximate e 0.5 using the first five terms

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Linear Models in R Regression Regression analysis is the appropriate

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

Pearson's Correlation Tests

Pearson's Correlation Tests Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation

More information

Similarity and Diagonalization. Similar Matrices

Similarity and Diagonalization. Similar Matrices MATH022 Linear Algebra Brief lecture notes 48 Similarity and Diagonalization Similar Matrices Let A and B be n n matrices. We say that A is similar to B if there is an invertible n n matrix P such that

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Power and sample size in multilevel modeling

Power and sample size in multilevel modeling Snijders, Tom A.B. Power and Sample Size in Multilevel Linear Models. In: B.S. Everitt and D.C. Howell (eds.), Encyclopedia of Statistics in Behavioral Science. Volume 3, 1570 1573. Chicester (etc.): Wiley,

More information

How to calculate an ANOVA table

How to calculate an ANOVA table How to calculate an ANOVA table Calculations by Hand We look at the following example: Let us say we measure the height of some plants under the effect of different fertilizers. Treatment Measures Mean

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Degrees of Freedom and Model Search

Degrees of Freedom and Model Search Degrees of Freedom and Model Search Ryan J. Tibshirani Abstract Degrees of freedom is a fundamental concept in statistical modeling, as it provides a quantitative description of the amount of fitting performed

More information

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

Lecture 8. Confidence intervals and the central limit theorem

Lecture 8. Confidence intervals and the central limit theorem Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of

More information

Chapter 6: Multivariate Cointegration Analysis

Chapter 6: Multivariate Cointegration Analysis Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration

More information

17. SIMPLE LINEAR REGRESSION II

17. SIMPLE LINEAR REGRESSION II 17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

Hypothesis Testing --- One Mean

Hypothesis Testing --- One Mean Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis

More information

Master s Theory Exam Spring 2006

Master s Theory Exam Spring 2006 Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem

More information

Quadratic forms Cochran s theorem, degrees of freedom, and all that

Quadratic forms Cochran s theorem, degrees of freedom, and all that Quadratic forms Cochran s theorem, degrees of freedom, and all that Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 1, Slide 1 Why We Care Cochran s theorem tells us

More information

Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes

Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes Achim Zeileis http://eeecon.uibk.ac.at/~zeileis/ Overview Motivation Exchange rate regimes Exchange rate regression What is the

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

The Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated

The Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated The Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated Philipp Sibbertsen 1 Walter Krämer 2 Diskussionspapier 318 ISNN 0949-9962 Abstract: We show that the power of the

More information

NOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane

NOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane Mathematical Sciences Paper II Time Allowed : 75 Minutes] [Maximum Marks : 100 Note : This Paper contains Fifty (50) multiple choice questions. Each question carries Two () marks. Attempt All questions.

More information

Randomized Block Analysis of Variance

Randomized Block Analysis of Variance Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of

More information

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Exact Nonparametric Tests for Comparing Means - A Personal Summary Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

22. HYPOTHESIS TESTING

22. HYPOTHESIS TESTING 22. HYPOTHESIS TESTING Often, we need to make decisions based on incomplete information. Do the data support some belief ( hypothesis ) about the value of a population parameter? Is OJ Simpson guilty?

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

Non-Inferiority Tests for Two Means using Differences

Non-Inferiority Tests for Two Means using Differences Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis

MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis Journal of tourism [No. 8] MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM Assistant Ph.D. Erika KULCSÁR Babeş Bolyai University of Cluj Napoca, Romania Abstract This paper analysis

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

Solving Mass Balances using Matrix Algebra

Solving Mass Balances using Matrix Algebra Page: 1 Alex Doll, P.Eng, Alex G Doll Consulting Ltd. http://www.agdconsulting.ca Abstract Matrix Algebra, also known as linear algebra, is well suited to solving material balance problems encountered

More information

Non-Inferiority Tests for Two Proportions

Non-Inferiority Tests for Two Proportions Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Statistical Functions in Excel

Statistical Functions in Excel Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.

More information

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination February 14 th, 2014.

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination February 14 th, 2014. University of Ljubljana Doctoral Programme in Statistics ethodology of Statistical Research Written examination February 14 th, 2014 Name and surname: ID number: Instructions Read carefully the wording

More information