Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part II)
|
|
- Rodney Patterson
- 7 years ago
- Views:
Transcription
1 Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part II) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December / 34
2 1 Introduction 2 Hypothesis testing Overview Different tests... Outcomes of hypothesis testing Level of significance General methodology 3 The Student test Framework Student test-statistic Decision rules 4 The Fisher test Framework Fisher test-statistic Decision rule 5 Summary 6 Appendix Florian Pelgrin (HEC) Constrained estimators September-December / 34
3 1. Introduction Introduction Aim: Testing parameters in the multiple linear regression model: Assume normality of the error terms exact distribution of estimators; Different tests... Testing hypothesis about individual parameters; Testing single hypothesis involving more than one parameter; Testing multiple restrictions....using the Student and Fisher test statistics Florian Pelgrin (HEC) Constrained estimators September-December / 34
4 Introduction The main assumption is: u X N ( ) 0, σ0 2 I n i.e. the population error u is independent of X and normally distributed with zero mean and variance σ 2 0 I n. This could be a strong requirement (especially, in finite samples). This assumption implies the zero conditional mean assumption (Assumption 3, Chapter 2) and the spherical error terms assumption (Assumption 4, Chapter 2) other assumptions are assumed to hold. Florian Pelgrin (HEC) Constrained estimators September-December / 34
5 2. Hypothesis testing 2.1. Overview Hypothesis testing Overview A hypothesis is a statement about a population parameter. A standard hypothesis testing problem involves two complementary hypothesis: 1 The null hypothesis: H 0 2 The alternative hypothesis: H 1 or H a. If θ 0 is the parameter (vector) of interest: 1 H 0 : θ 0 Θ 0 Θ 2 H 1 : θ 0 Θ 1 Θ, with Θ 1 = Θ Θ 0. Florian Pelgrin (HEC) Constrained estimators September-December / 34
6 Hypothesis testing Overview The null hypothesis is tested by finding an appropriate subset of outcomes W Θ called the rejection region or critical region. For example, a standard rejection region W is of the form: W = {z : T S (z) > c} where T S is a test statistic and c is a critical value. The complement of the rejection region is the acceptance region. A (hypothesis) test is thus a rule that specified: 1 For which sample values the decision is made to fail to reject H 0 as true; 2 For which sample values the decision is made to reject H 0. Florian Pelgrin (HEC) Constrained estimators September-December / 34
7 2.2. Different tests... Hypothesis testing Different tests... Definition A hypothesis of the form H 0 : θ 0 = θ is called a simple hypothesis. A hypothesis of the form H 0 : θ 0 θ or H 0 : θ 0 θ is called a composite hypothesis. Florian Pelgrin (HEC) Constrained estimators September-December / 34
8 Hypothesis testing Different tests... Definition A test of the form: { H0 : θ 0 = θ H a : θ 0 θ is called a two-sided test. Example: { H0 : β 0,j = 0 H a : β 0,j 0 The null hypothesis means x j has no effect on y once the other x s are controlled for ; β 0,j 0 is a natural alternative hypothesis when there is no prior knowledge about the sign... Florian Pelgrin (HEC) Constrained estimators September-December / 34
9 Hypothesis testing Different tests... Definition A test of the form: { H0 : θ 0 θ H a : θ 0 > θ or { H0 : θ 0 θ H a : θ 0 < θ is called a one-sided test. Florian Pelgrin (HEC) Constrained estimators September-December / 34
10 Hypothesis testing Outcomes of hypothesis testing 2.3. Outcomes of hypothesis testing Truth Decision Fail to reject H 0 Reject H 0 H 0 Correct decision Type I error H a Type II error Correct decision 1 The probability of Type I error: P(reject H 0 H 0 is true). 2 The probability of Type II error: P(fail to reject H 0 H 1 is true). Florian Pelgrin (HEC) Constrained estimators September-December / 34
11 Hypothesis testing Outcomes of hypothesis testing Definition The power function of a test is defined by: β(θ 0 ) = 1 P(fail to reject H 0 H 1 ) 1 η(θ 0 ) The size of a test is defined to be: α = sup θ 0 P(reject H 0 H 0 ) A test is said to have level α if its size is less than or equal to α. Remark: One seeks to determine the critical region such that the power is maximum and with minimal size distortion. Florian Pelgrin (HEC) Constrained estimators September-December / 34
12 Hypothesis testing 2.4. Level of significance Level of significance The decision Reject H 0 or fail to reject H 0 is not so informative! Indeed, there is some arbitrariness to the choice of α (level). Another strategy is to ask, for every α, whether the test rejects at that level. Another alternative is to use the so-called p-value the smallest level of significance at which H 0 would be rejected given the value of the test-statistic. Florian Pelgrin (HEC) Constrained estimators September-December / 34
13 Hypothesis testing Level of significance More formally... Definition Suppose that for every α (0, 1), one has a size α test with rejection region W α. Then, the p-value is defined to be: p-value = inf{α : T S (Y ) W α }. The p-value is the smallest level at which one can reject H 0. The p-value is a measure of evidence against H 0 : p-value evidence <.01 Very strong evidence against H Strong evidence against H Weak evidence against H 0 >.10 Little or no evidence against H 0 Florian Pelgrin (HEC) Constrained estimators September-December / 34
14 Hypothesis testing Level of significance Remarks: 1. A large p-value does not mean strong evidence in favor of H A large p-value can occur for two reasons: 1 H 0 is true; 2 H 0 is false but the test has low power. 3. The p-value is not the probability that the null hypothesis is true! Florian Pelgrin (HEC) Constrained estimators September-December / 34
15 Hypothesis testing 2.5. General methodology General methodology Hypothesis testing is defined by the following general procedure Step 1: State the relevant null and alternative hypotheses (mis-stating the hypotheses muddies the rest of the procedure!); Step 2: Consider the statistical assumptions being made about the sample in doing the test (independence, distributions, etc) incorrect assumptions mean that the test is invalid! Step 3: Choose the appropriate test (exact or asymptotic tests) and thus state the relevant test statistic (say, T ). Step 4: Derive the distribution of the test statistic under the null hypothesis (sometimes it is well-known, sometimes it is more tedious!) for example, the Student t-distribution or the Fisher distribution. Florian Pelgrin (HEC) Constrained estimators September-December / 34
16 Hypothesis testing General methodology 2.5. General methodology (continued) Step 5: Determine the critical region (and thus the acceptance region). Step 6: Compute (using the observations!) the observed value of the test statistic T, say t obs. Step 7: Decide to either fail to reject the null hypothesis or reject in favor of the alternative assumption the decision rule is to reject the null hypothesis H 0 if the observed value of the test statistic, t obs is in the critical region, and to fail to reject the null hypothesis otherwise. Florian Pelgrin (HEC) Constrained estimators September-December / 34
17 3. The Student test 3.1. Framework The Student test Framework Consider the (population) model Y = Xβ 0 + u that satisfies the assumptions H1 H6 (Chapter 2). The β j s are unknown features of the population, but: One can formulate a hypothesis about their value; One can construct a test statistic with a known distribution under the maintained hypothesis; One can take a decision meaning reject H 0 if the value of the test statistic is too unlikely. Florian Pelgrin (HEC) Constrained estimators September-December / 34
18 The Student test Framework Three tests of interest: { H0 : β 0,j = a j H a : β 0,j > a j or < a j { H0 : β 0,j = a j H a : β 0,j a j and { H0 : β 0,j = β l H a : not H 0 where a j = 0 or a j 0. Florian Pelgrin (HEC) Constrained estimators September-December / 34
19 The Student test Framework All of these tests are based on the following results: 1. The exact distribution of the ordinary least squares estimator of β: ( ˆβ OLS X N β 0, σ0 2 (X X) 1). 2. Any linear combination of the ˆβ j,ols s is also normally distributed: A ˆβ OLS X N ( Aβ 0, σ 2 0A (X X) 1 A ). 3. Any subset of ˆβ OLS has a joint normal distribution. Florian Pelgrin (HEC) Constrained estimators September-December / 34
20 The Student test 3.2. Student test-statistic Student test-statistic Definition The Student test-statistic is defined to be: T s = ˆβ j,ols a ( ) j Γ(n k) se ˆβ j,ols where n is the number of observations, k is the number of explanatory variables (including the constant term), Γ is the Student t-distribution, and: ( ) se ˆβ j,ols = ˆσ OLS mjj with m jj is the j th diagonal element of (X X) 1. Proof: See Appendix 1. Florian Pelgrin (HEC) Constrained estimators September-December / 34
21 The Student test Decision rules 3.3. Decision rules Testing H 0 : β 0,j = a j against H a : β 0,j > a j Definition The critical region is that H 0 is rejected in favor of β 0,j > a j at the α% (say, 5%) significance level if: t obs > c 1 α where c 1 α is the α% (say, 5%) critical value or the percentile of order 1 α of a Student t-distribution with n k degrees of freedom, i.e. the solution of: α = P (reject H 0 H 0 is true) = P (T s > c 1 α ). Florian Pelgrin (HEC) Constrained estimators September-December / 34
22 The Student test Decision rules Figure 1: 5% rejection rule for H a : β 0,j > 0 (df=28) Area = t Rejection region 5% rejection rule for (df=18) Florian Pelgrin (HEC) Constrained estimators September-December / 34
23 The Student test Decision rules Testing H 0 : β 0,j = a j against H a : β 0,j < a j Definition The critical region is that H 0 is rejected in favor of β 0,j < a j at the α% (say, 5%) significance level if: t obs < c α where c α is the α% (say, 5%) critical value or the percentile of order α of a Student t-distribution with n k degrees of freedom, i.e. the solution of: α = P (reject H 0 H 0 is true) = P (T s < c α ). Florian Pelgrin (HEC) Constrained estimators September-December / 34
24 The Student test Decision rules Figure 2: 5% rejection rule for H a : β 0,j < 0 (df=18) Area = Rejection region t Florian Pelgrin (HEC) Constrained estimators September-December / 34
25 The Student test Decision rules Testing H 0 : β 0,j = a j against H a : β 0,j a j Definition The critical region is that H 0 is rejected in favor of β 0,j a j at the α% (say, 5%) significance level if: t obs > c 1 α 2 where c 1 α is the α 2 2 % (say, 2.5%) critical value or the percentile of order 1 α 2 of a Student t-distribution with n k degrees of freedom, i.e. the solution of: α ( ) 2 = P (reject H 0 H 0 is true) = P T s > 1 c 1 α. 2 Remark: c 1 α is chosen such that the probability mass in each tail of 2 the t-distribution equals α 2. Florian Pelgrin (HEC) Constrained estimators September-December / 34
26 The Student test Decision rules Figure 3: 5% rejection rule for H a : β 0,j 0 (df=25) Area = Area = Rejection region t Rejection region Other hypotheses about Florian Pelgrin (HEC) Constrained estimators September-December / 34
27 The Student test Decision rules Testing a linear combination of the parameters Suppose the test is: { H0 : β 0,j = β 0,l H a : β 0,j β 0,l The Student test statistic is given by: T s = ˆβ j,ols ˆβ ( l,ols ) se ˆβ j,ols ˆβ l,ols where ( se ˆβj,OLS ˆβ ) l,ols = V ( ) ( ) ( ˆβj,OLS + V ˆβl,OLS 2Cov ˆβj,OLS, ˆβ ) l,ols. The procedure is then the same as a standard two-sided test. Florian Pelgrin (HEC) Constrained estimators September-December / 34
28 The Fisher test Framework 4. The Fisher test 4.1. Framework Consider the linear Gaussian regression model (with suitable regularity conditions) and suppose that one seeks to test: H 0 : Rβ 0 = q constrained model H a : Rβ 0 q unconstrained model. Florian Pelgrin (HEC) Constrained estimators September-December / 34
29 The Fisher test Fisher test-statistic 4.2. Fisher test-statistic Definition The Fisher test statistic is given by: where T F = SSR 0 SSR a SSR a dl a dl 0 dl a F(dl 0 dl a, dl a ). dl 0 is the number of degrees of freedom under the null hypothesis (i.e., dl 0 = n (k p)); dl a is the number of degrees of freedom under the alternative hypothesis (i.e., dl a = n k); SSR 0 is the sum of squared (estimated) residuals under the null hypothesis; SSR a is the sum of squared (estimated) residuals under the alternative hypothesis. Florian Pelgrin (HEC) Constrained estimators September-December / 34
30 The Fisher test Fisher test-statistic Definition Consider the Gaussian multiple linear regression model: Y = Xβ 0 + u. The Fisher test statistic corresponding to { H0 : Rβ 0 = q H a : Rβ 0 q is given by: T F = û COLS 2 û OLS 2 û OLS 2 n k p F(p, n k). Florian Pelgrin (HEC) Constrained estimators September-December / 34
31 The Fisher test Fisher test-statistic A second derivation of the Fisher test statistic Definition Under suitable regularity conditions, the Fisher test statistic can also be written as: T F = 1 p σ 2 OLS ( R β ) OLS q [R(X X) 1 R ] 1 ( R β ) OLS q. Remark: An equivalent test statistic is also: T F = 1 ( βols p σ OLS 2 β ) COLS (X X) ( βols β ) COLS. Exercise: Show these two results. Florian Pelgrin (HEC) Constrained estimators September-December / 34
32 4.3. Decision rule The Fisher test Decision rule 1 Determine SSR 0 (constrained model) and SSR a (unconstrained model); 2 Determine the realization of the Fisher test statistic: F obs = (SSR 0 SSR a ) /p SSR a / (n k) where p is the number of linearly independent constraints. 3 Determine the critical region: W = {T F > F 1 α } and especially the critical value which corresponds to the (1 α) percentile of a Fisher distributed random variable F(p, n k). 4 Reject the null assumption H 0 (at the level α) if F obs > F 1 α. Florian Pelgrin (HEC) Constrained estimators September-December / 34
33 5. Summary Summary What is a simple hypothesis? A composite hypothesis? What is a one-sided test? A two-sided test? What is Type-I error? Type-II error? What is the p-value? What is the critical value? Decision rule? What is a Student test? In which cases can we use it? Derive the Student test statistic in standard cases. What is a Fisher test? In which sense(s) is it different from the Student test? Derive the different Fisher test statistics. Interpretation of results! Florian Pelgrin (HEC) Constrained estimators September-December / 34
34 Appendix Appendix 1: Student test statistic The result is based on the following theorem: Theorem Let U and V denote two independent random variables: Then: Using U = ˆβ j,ols a j σ jj U N (0, 1) Z χ 2 (n 1 ). U Z n 1 Γ(n 1 ). (with σ jj = σ 0 mjj ), Z = (n k) ˆσ2 OLS, n σ0 2 1 = n k, and the result that U are V are independent in probability (see Chapter 2), one gets the result. Florian Pelgrin (HEC) Constrained estimators September-December / 34
Chapter 4: Statistical Hypothesis Testing
Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationClass 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
More informationSection 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)
Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis
More informationThese axioms must hold for all vectors ū, v, and w in V and all scalars c and d.
DEFINITION: A vector space is a nonempty set V of objects, called vectors, on which are defined two operations, called addition and multiplication by scalars (real numbers), subject to the following axioms
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationHYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationChapter 3: The Multiple Linear Regression Model
Chapter 3: The Multiple Linear Regression Model Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans November 23, 2013 Christophe Hurlin (University of Orléans) Advanced Econometrics
More informationPenalized regression: Introduction
Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood
More informationTwo-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
More informationOrthogonal Diagonalization of Symmetric Matrices
MATH10212 Linear Algebra Brief lecture notes 57 Gram Schmidt Process enables us to find an orthogonal basis of a subspace. Let u 1,..., u k be a basis of a subspace V of R n. We begin the process of finding
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
More informationStat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015
Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation
More informationAugust 2012 EXAMINATIONS Solution Part I
August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,
More informationIntroduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses
Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the
More informationPermutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
More informationHypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationNon Parametric Inference
Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable
More informationTHE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
More information6.2 Permutations continued
6.2 Permutations continued Theorem A permutation on a finite set A is either a cycle or can be expressed as a product (composition of disjoint cycles. Proof is by (strong induction on the number, r, of
More information1.5 Oneway Analysis of Variance
Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments
More informationPoint Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
More information3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
More informationName: Date: Use the following to answer questions 3-4:
Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin
More informationMultiple Linear Regression in Data Mining
Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple
More informationWhat s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
More informationTwo-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More informationThe VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.
Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium
More information1 Another method of estimation: least squares
1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i
More informationTesting a claim about a population mean
Introductory Statistics Lectures Testing a claim about a population mean One sample hypothesis test of the mean Department of Mathematics Pima Community College Redistribution of this material is prohibited
More informationA POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment
More informationMultiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
More information12: Analysis of Variance. Introduction
1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider
More informationMultivariate normal distribution and testing for means (see MKB Ch 3)
Multivariate normal distribution and testing for means (see MKB Ch 3) Where are we going? 2 One-sample t-test (univariate).................................................. 3 Two-sample t-test (univariate).................................................
More informationNetwork quality control
Network quality control Network quality control P.J.G. Teunissen Delft Institute of Earth Observation and Space systems (DEOS) Delft University of Technology VSSD iv Series on Mathematical Geodesy and
More information6: Introduction to Hypothesis Testing
6: Introduction to Hypothesis Testing Significance testing is used to help make a judgment about a claim by addressing the question, Can the observed difference be attributed to chance? We break up significance
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationMATHEMATICAL METHODS OF STATISTICS
MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS
More informationNonparametric tests these test hypotheses that are not statements about population parameters (e.g.,
CHAPTER 13 Nonparametric and Distribution-Free Statistics Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., 2 tests for goodness of fit and independence).
More informationSolución del Examen Tipo: 1
Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical
More informationUnit 26 Estimation with Confidence Intervals
Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference
More informationNon-Inferiority Tests for One Mean
Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random
More informationSAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
More informationQUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
More informationTesting theory an introduction
Testing theory an introduction P.J.G. Teunissen Series on Mathematical Geodesy and Positioning Testing theory an introduction Testing theory an introduction P.J.G. Teunissen Delft University of Technology
More information3. Mathematical Induction
3. MATHEMATICAL INDUCTION 83 3. Mathematical Induction 3.1. First Principle of Mathematical Induction. Let P (n) be a predicate with domain of discourse (over) the natural numbers N = {0, 1,,...}. If (1)
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationArkansas Tech University MATH 4033: Elementary Modern Algebra Dr. Marcel B. Finan
Arkansas Tech University MATH 4033: Elementary Modern Algebra Dr. Marcel B. Finan 3 Binary Operations We are used to addition and multiplication of real numbers. These operations combine two real numbers
More informationRedwood Building, Room T204, Stanford University School of Medicine, Stanford, CA 94305-5405.
W hittemoretxt050806.tex A Bayesian False Discovery Rate for Multiple Testing Alice S. Whittemore Department of Health Research and Policy Stanford University School of Medicine Correspondence Address:
More informationChapter 2. Hypothesis testing in one population
Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance
More informationHYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
More informationSECTION 10-2 Mathematical Induction
73 0 Sequences and Series 6. Approximate e 0. using the first five terms of the series. Compare this approximation with your calculator evaluation of e 0.. 6. Approximate e 0.5 using the first five terms
More informationStatistical Models in R
Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Linear Models in R Regression Regression analysis is the appropriate
More informationTwo-sample inference: Continuous data
Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As
More informationPoisson Models for Count Data
Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the
More informationPearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
More informationSimilarity and Diagonalization. Similar Matrices
MATH022 Linear Algebra Brief lecture notes 48 Similarity and Diagonalization Similar Matrices Let A and B be n n matrices. We say that A is similar to B if there is an invertible n n matrix P such that
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationPower and sample size in multilevel modeling
Snijders, Tom A.B. Power and Sample Size in Multilevel Linear Models. In: B.S. Everitt and D.C. Howell (eds.), Encyclopedia of Statistics in Behavioral Science. Volume 3, 1570 1573. Chicester (etc.): Wiley,
More informationHow to calculate an ANOVA table
How to calculate an ANOVA table Calculations by Hand We look at the following example: Let us say we measure the height of some plants under the effect of different fertilizers. Treatment Measures Mean
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationDegrees of Freedom and Model Search
Degrees of Freedom and Model Search Ryan J. Tibshirani Abstract Degrees of freedom is a fundamental concept in statistical modeling, as it provides a quantitative description of the amount of fitting performed
More informationOne-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups
One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More informationFinal Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
More informationStat 5102 Notes: Nonparametric Tests and. confidence interval
Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions
More informationLecture 8. Confidence intervals and the central limit theorem
Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of
More informationChapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
More information17. SIMPLE LINEAR REGRESSION II
17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationOutline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
More informationHypothesis Testing --- One Mean
Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis
More informationMaster s Theory Exam Spring 2006
Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem
More informationQuadratic forms Cochran s theorem, degrees of freedom, and all that
Quadratic forms Cochran s theorem, degrees of freedom, and all that Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 1, Slide 1 Why We Care Cochran s theorem tells us
More informationTesting, Monitoring, and Dating Structural Changes in Exchange Rate Regimes
Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes Achim Zeileis http://eeecon.uibk.ac.at/~zeileis/ Overview Motivation Exchange rate regimes Exchange rate regression What is the
More informationComparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples
Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The
More informationAn analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression
Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship
More informationCHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression
Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the
More informationThe Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated
The Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated Philipp Sibbertsen 1 Walter Krämer 2 Diskussionspapier 318 ISNN 0949-9962 Abstract: We show that the power of the
More informationNOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane
Mathematical Sciences Paper II Time Allowed : 75 Minutes] [Maximum Marks : 100 Note : This Paper contains Fifty (50) multiple choice questions. Each question carries Two () marks. Attempt All questions.
More informationRandomized Block Analysis of Variance
Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of
More informationExact Nonparametric Tests for Comparing Means - A Personal Summary
Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
More information22. HYPOTHESIS TESTING
22. HYPOTHESIS TESTING Often, we need to make decisions based on incomplete information. Do the data support some belief ( hypothesis ) about the value of a population parameter? Is OJ Simpson guilty?
More informationExample: Boats and Manatees
Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant
More informationNon-Inferiority Tests for Two Means using Differences
Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous
More informationCONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE
1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,
More informationMULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis
Journal of tourism [No. 8] MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM Assistant Ph.D. Erika KULCSÁR Babeş Bolyai University of Cluj Napoca, Romania Abstract This paper analysis
More informationChicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
More informationSolving Mass Balances using Matrix Algebra
Page: 1 Alex Doll, P.Eng, Alex G Doll Consulting Ltd. http://www.agdconsulting.ca Abstract Matrix Algebra, also known as linear algebra, is well suited to solving material balance problems encountered
More informationNon-Inferiority Tests for Two Proportions
Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which
More informationHow To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
More informationStatistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
More informationUniversity of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination February 14 th, 2014.
University of Ljubljana Doctoral Programme in Statistics ethodology of Statistical Research Written examination February 14 th, 2014 Name and surname: ID number: Instructions Read carefully the wording
More information