y i1 = x i1 + f i + u i1 y i2 = x i2 + f i + u i2

Size: px
Start display at page:

Download "y i1 = x i1 + f i + u i1 y i2 = x i2 + f i + u i2"

Transcription

1 1. Economics 245A: Cluster Sampling & Matching (This document was created using the AMS Proceedings Article shell document.) Cluster sampling arises in a number of contexts. For example, consider a study of retirement saving. It is likely the case that retirement saving for employees within a rm will be correlated, because of common features of the rm (such as type of retirement plan) or because of common (often unobserved) characteristics of employees within a rm. Each rm represents a group, or cluster, and we may sample several workers from each of a large number of rms. Other examples might be a study of teenage peer e ects, in which we have a few teenagers from each of a large number of neighborhoods (the neighborhoods are the cluster) or high schools, or a study of siblings in a large sample of families (families are the cluster). The key is that we sample a large number of clusters and each cluster consists of a relatively small number of observations compared with the overall sample size. We allow the units within the cluster to be correlated, but we assume independence across clusters. 2. Matched Pairs Let us begin with a study of siblings in a large sample of families. The idea is to use siblings to control for unobserved family backgrounds. Our thought experiment is to have two identical individuals, for whom we vary one exogenous e ect. We attempt to capture our two identical individuals by studying siblings. For each family i there are two siblings y i1 = x i1 + f i + u i1 y i2 = x i2 + f i + u i2 where the equations are for siblings 1 and 2 and f i is an unobserved family e ect. The strict exogeneity assumption now implies that the error u is in each sibling s equation is uncorrelated with the explanatory variables in both equations. For example, let y be log(wage) and let x contain years of schooling. Then we must assume that sibling s schooling has no e ect on wages once we control for own schooling, the family e ect and other observed covariates. If f i is assumed to be uncorrelated with x i1 and x i2, then random e ects analysis can be used. 1

2 2 More commonly, f i is assumed to be correlated with x i1 and x i2, in which case di erencing across siblings to remove f i is the appropriate strategy. Under this strategy, x cannot contain common observable family background variables, as these are indistinguisable from f i. Standard IV estimators can be applied directly to the di erence equation y i1 y i2 = (x i1 x i2 ) + (u i1 u i2 ): 3. General Cluster Samples Matched pairs are a special case of a cluster sample. As noted above, observations within a cluster are thought to be correlated due to an unobserved cluster e ect. Suppose we model the retirement saving of individual m in cluster ( rm) g y gm = + x g + z gm + v gm ; where x g are explanatory variables that vary only at the rm level (i.e. rm characteristics), z gm are explanatory variables that vary within (and across) rms (that is, they vary at the employee level), there are G clusters and M g observations within each cluster (so there are di erent numbers of employees sampled from each rm) Cluster Intercept. A simple starting point, that is surprisingly exible, is to let x g consist only of a constant term, so that each rm has it s own mean level of saving y gm = c g + z gm + v gm : A (standard) strict exogeneity assumption requires that the error v gm be uncorrelated with the explanatory variables z gm for all individuals from cluster g. That is, the error for one employee in a rm must be uncorrelated with z for all other employees within the same rm. The cluster e ect c g usually renders this assumption plausible. If we assume that c g is uncorrelated with z gm (that is the di erences in average retirement saving across rms are not related to the characteristics of the employees within rms), then pooled OLS is consistent. If we allow for correlation between c g and z gm, then we demean within clusters to remove the cluster e ect and then use pooled OLS (or IV) on the demeaned data General Cluster: Large Group Asymptotics. Logic: from a population of clusters, we randomly draw G clusters, where each cluster has M g observations. It should be the case that G is su ciently large relative to M g that we can allow for unrestricted correlation within cluster.

3 3 We rst assume E (v gm jx g ; z gm ) = 0 m = 1; : : : ; M g and g = 1; : : : ; G: Note, we could replace this assumption with a weaker assumption, requiring only that the variables be uncorrelated. Note also that this is a weaker assumption than made above in that we only require the error v gm to be uncorrelated with z gm, hence the error for one employee may be correlated with the explanatory variables for other employees. Under this assumption the pooled OLS estimator is consistent if the number of groups grows (G! 1) and the group size remains constant (M g is xed). The estimator is p G asymptotically normal. To construct a robust variance estimtaor, note that v gm is likely correlated across individuals within a cluster and the variance may also vary across individuals (conditional heteroskeasticity), so we write the model at the group level and y g = y g1 ; : : : ; y gmg 0 y g = W g + v g where W g is the M g (1 + K + L) matrix of all regressors. The robust standard errors are obtained from! 1 WgW 0 g! Wg^v 0 g^v gw 0 g! 1 WgW 0 g ; where ^v g is the M g 1 vector of residuals from pooled OLS regression GLS. The pooled OLS estimator ignores the within cluster correlation of v gm. To take advantage, we must strenghten the exogeneity assumption to E (v gm jx g ; Z g ) = 0 where Z g is the M g L matrix of individual covariates for cluser g. Thus we return to the assumption under which the error for an individual is exogenous to the covariates for all other individuals. With this assumption, we rewrite the error as v gm = c g + u gm :

4 4 (In statistics, this equation in combination with the original linear model spec cation is termed a hierarchical linear model). The resulting covarince matrix for the error vector v g is the M g M g matrix 2 2 c + 2 u 2 3 c 6 V ar (v g ) = c 5 : c + 2 u While we typically assume that V ar (v g ) = V ar (v g jx g ; Z g ), so that we have conditional homoskedasticity, we can still gain e ciency by using GLS. We then estimate the model via GLS, using a consistent estimator of the covariance matrix Large Group Size Asymptotics. Logic: We stratify the population into G groups and then sample randomly M g times from each group. For example, Card and Kruger have G = 2 states (NJ and Pa), Bound has G = 34 and all states would have G = 50. To understand the pitfalls of applying standard analysis with small G, consider the case in which x g is scalar and z gm is not present y gm = + x g + c g + u gm ; where c g and u gm are independent of x g and fu gm g is iid with mean zero for each g. If c g is absent from the model, then pooled OLS is consistent and inference is straightforward. If V ar (u gm ) is constant across g, then standard OLS t-statistics are correct. If we allow for heteroskedasticity, then we simply use the Eicker-White correction (or feasible GLS, as we have multiple observations on each cluster). With cluster e ects, the analysis is quite di erent. Let c g N (0; 2 c), which we assume to be independent of fu gm g. The pooled OLS estimator ^ is identical to regression of y g on 1; x g for g = 1; : : : ; G: (This is sometimes referred to as the between-groups estimator). Conditional on x g, ^ inherits its distribution from fv g g, the within-group averages of the composite errors v gm = c g +u gm. Because c g is present, new observations do not add information about, beyond how they a ect the group average, y g. If we add strong assumptions, we can solve the inference problem. Speci cally, if we assume u gm N (0; 2 u) and M g = M for all g, then v g N 0; 2 c + 2 u. Hence M y g = + x g + v g

5 5 satis es the classic linear assumptions and we use inference on the t G 2 distritubion (note that M M G 2 is not the correct number of degrees-of-freedom). If the common group size, M, is large, then we can use large sample approximation to treat u gm as approximately normal. Further, even if group size di ers, if M g is large for all groups, then V ar (v g ) = 2 c + 2 u M g will be dominated by the rst term, and the approximation should work well (also if 2 u is small). In essence, we are ignoring estimation error in y g and analyzing the simple regression g = + x g + c g where we use y g in place of y g. This is very close to a standard check: estimate the model both with individual data and with cluster averages. With the cluster averages we lose e ciency but we do not need to make standard errors robust to within-cluster correlation. The main point is that above regression allows for conservative inference, as long as cluster sizes are large and cluster e ects are normal. For small G and large M g inference will be very conservative if cluster e ects are not present. While this may be desirable, it rules out some widely used tools for policy analysis. Return to our comparison of mean levels across two groups (perhaps the treated and control). Under random sampling and normality, the di erence in means between the two groups usually has M 1 +M 2 2 degrees-of-freedom. With even moderate group sizes we can relax normality and allow for di erent group variances and still conduct accurate inference. But in the above setup, we cannot conduct di erence in means analysis because G = 2. Such analysis was used to criticize Card and Kruger, because they failed to account for the state e ect c g in the composite error term v gm. But this is close to the common issue with di erence-in-di erence estimators, namely how to know if the observed e ect is all due to the policy change. Perhaps c g is part of the e ect to be estimated. Consider the following example. Over the summer a school district with two high schools, A and B, decides to provide computers to students at school B who have just nished their rst year. The announcement is made just prior to the start of the school year, so students cannot switch high schools. The response is the change in a standardized test score given to these students. If the students are randomly sampled, then a comparison of means should be accurate. Of course there may be other confounding factors, say the average increase in test scores at school B would have been higher anyway.

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and

More information

Chapter 2. Dynamic panel data models

Chapter 2. Dynamic panel data models Chapter 2. Dynamic panel data models Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans Université d Orléans April 2010 Introduction De nition We now consider

More information

1. THE LINEAR MODEL WITH CLUSTER EFFECTS

1. THE LINEAR MODEL WITH CLUSTER EFFECTS What s New in Econometrics? NBER, Summer 2007 Lecture 8, Tuesday, July 31st, 2.00-3.00 pm Cluster and Stratified Sampling These notes consider estimation and inference with cluster samples and samples

More information

Panel Data Econometrics

Panel Data Econometrics Panel Data Econometrics Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans University of Orléans January 2010 De nition A longitudinal, or panel, data set is

More information

CAPM, Arbitrage, and Linear Factor Models

CAPM, Arbitrage, and Linear Factor Models CAPM, Arbitrage, and Linear Factor Models CAPM, Arbitrage, Linear Factor Models 1/ 41 Introduction We now assume all investors actually choose mean-variance e cient portfolios. By equating these investors

More information

Chapter 3: The Multiple Linear Regression Model

Chapter 3: The Multiple Linear Regression Model Chapter 3: The Multiple Linear Regression Model Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans November 23, 2013 Christophe Hurlin (University of Orléans) Advanced Econometrics

More information

SYSTEMS OF REGRESSION EQUATIONS

SYSTEMS OF REGRESSION EQUATIONS SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations

More information

Clustering in the Linear Model

Clustering in the Linear Model Short Guides to Microeconometrics Fall 2014 Kurt Schmidheiny Universität Basel Clustering in the Linear Model 2 1 Introduction Clustering in the Linear Model This handout extends the handout on The Multiple

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models: Chapter 10: Basic Linear Unobserved Effects Panel Data Models: Microeconomic Econometrics I Spring 2010 10.1 Motivation: The Omitted Variables Problem We are interested in the partial effects of the observable

More information

ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS

ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS Jeffrey M. Wooldridge THE INSTITUTE FOR FISCAL STUDIES DEPARTMENT OF ECONOMICS, UCL cemmap working

More information

IDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS. Steven T. Berry and Philip A. Haile. March 2011 Revised April 2011

IDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS. Steven T. Berry and Philip A. Haile. March 2011 Revised April 2011 IDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS By Steven T. Berry and Philip A. Haile March 2011 Revised April 2011 COWLES FOUNDATION DISCUSSION PAPER NO. 1787R COWLES FOUNDATION

More information

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2 University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages

More information

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem Chapter Vector autoregressions We begin by taking a look at the data of macroeconomics. A way to summarize the dynamics of macroeconomic data is to make use of vector autoregressions. VAR models have become

More information

Lecture 3: Differences-in-Differences

Lecture 3: Differences-in-Differences Lecture 3: Differences-in-Differences Fabian Waldinger Waldinger () 1 / 55 Topics Covered in Lecture 1 Review of fixed effects regression models. 2 Differences-in-Differences Basics: Card & Krueger (1994).

More information

1 Another method of estimation: least squares

1 Another method of estimation: least squares 1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i

More information

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038

More information

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Exact Nonparametric Tests for Comparing Means - A Personal Summary Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola

More information

Correlated Random Effects Panel Data Models

Correlated Random Effects Panel Data Models INTRODUCTION AND LINEAR MODELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. The Linear

More information

Empirical Methods in Applied Economics

Empirical Methods in Applied Economics Empirical Methods in Applied Economics Jörn-Ste en Pischke LSE October 2005 1 Observational Studies and Regression 1.1 Conditional Randomization Again When we discussed experiments, we discussed already

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions What will happen if we violate the assumption that the errors are not serially

More information

Normalization and Mixed Degrees of Integration in Cointegrated Time Series Systems

Normalization and Mixed Degrees of Integration in Cointegrated Time Series Systems Normalization and Mixed Degrees of Integration in Cointegrated Time Series Systems Robert J. Rossana Department of Economics, 04 F/AB, Wayne State University, Detroit MI 480 E-Mail: r.j.rossana@wayne.edu

More information

Common sense, and the model that we have used, suggest that an increase in p means a decrease in demand, but this is not the only possibility.

Common sense, and the model that we have used, suggest that an increase in p means a decrease in demand, but this is not the only possibility. Lecture 6: Income and Substitution E ects c 2009 Je rey A. Miron Outline 1. Introduction 2. The Substitution E ect 3. The Income E ect 4. The Sign of the Substitution E ect 5. The Total Change in Demand

More information

1 Present and Future Value

1 Present and Future Value Lecture 8: Asset Markets c 2009 Je rey A. Miron Outline:. Present and Future Value 2. Bonds 3. Taxes 4. Applications Present and Future Value In the discussion of the two-period model with borrowing and

More information

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Review Jeopardy. Blue vs. Orange. Review Jeopardy Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?

More information

Solución del Examen Tipo: 1

Solución del Examen Tipo: 1 Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical

More information

Optimal insurance contracts with adverse selection and comonotonic background risk

Optimal insurance contracts with adverse selection and comonotonic background risk Optimal insurance contracts with adverse selection and comonotonic background risk Alary D. Bien F. TSE (LERNA) University Paris Dauphine Abstract In this note, we consider an adverse selection problem

More information

Conditional Investment-Cash Flow Sensitivities and Financing Constraints

Conditional Investment-Cash Flow Sensitivities and Financing Constraints WORING PAPERS IN ECONOMICS No 448 Conditional Investment-Cash Flow Sensitivities and Financing Constraints Stephen R. Bond and Måns Söderbom May 2010 ISSN 1403-2473 (print) ISSN 1403-2465 (online) Department

More information

LOGIT AND PROBIT ANALYSIS

LOGIT AND PROBIT ANALYSIS LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Herding, Contrarianism and Delay in Financial Market Trading

Herding, Contrarianism and Delay in Financial Market Trading Herding, Contrarianism and Delay in Financial Market Trading A Lab Experiment Andreas Park & Daniel Sgroi University of Toronto & University of Warwick Two Restaurants E cient Prices Classic Herding Example:

More information

Out-of-Sample Forecast Tests Robust to the Choice of Window Size

Out-of-Sample Forecast Tests Robust to the Choice of Window Size Out-of-Sample Forecast Tests Robust to the Choice of Window Size Barbara Rossi and Atsushi Inoue (ICREA,UPF,CREI,BGSE,Duke) (NC State) April 1, 2012 Abstract This paper proposes new methodologies for evaluating

More information

Quality differentiation and entry choice between online and offline markets

Quality differentiation and entry choice between online and offline markets Quality differentiation and entry choice between online and offline markets Yijuan Chen Australian National University Xiangting u Renmin University of China Sanxi Li Renmin University of China ANU Working

More information

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

More information

Chapter 4: Vector Autoregressive Models

Chapter 4: Vector Autoregressive Models Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution James H. Steiger November 10, 00 1 Topics for this Module 1. The Binomial Process. The Binomial Random Variable. The Binomial Distribution (a) Computing the Binomial pdf (b) Computing

More information

Employment E ects of Service O shoring: Evidence from Matched Firms

Employment E ects of Service O shoring: Evidence from Matched Firms Employment E ects of Service O shoring: Evidence from Matched Firms Rosario Crinò Institut d Anàlisi Econòmica, CSIC December, 2009 Abstract This paper studies the e ects of service o shoring on the level

More information

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software STATA Tutorial Professor Erdinç Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software 1.Wald Test Wald Test is used

More information

Financial Risk Management Exam Sample Questions/Answers

Financial Risk Management Exam Sample Questions/Answers Financial Risk Management Exam Sample Questions/Answers Prepared by Daniel HERLEMONT 1 2 3 4 5 6 Chapter 3 Fundamentals of Statistics FRM-99, Question 4 Random walk assumes that returns from one time period

More information

Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors

Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors Arthur Lewbel, Yingying Dong, and Thomas Tao Yang Boston College, University of California Irvine, and Boston

More information

Lecture 9: Keynesian Models

Lecture 9: Keynesian Models Lecture 9: Keynesian Models Professor Eric Sims University of Notre Dame Fall 2009 Sims (Notre Dame) Keynesian Fall 2009 1 / 23 Keynesian Models The de ning features of RBC models are: Markets clear Money

More information

UNIVERSITY OF WAIKATO. Hamilton New Zealand

UNIVERSITY OF WAIKATO. Hamilton New Zealand UNIVERSITY OF WAIKATO Hamilton New Zealand Can We Trust Cluster-Corrected Standard Errors? An Application of Spatial Autocorrelation with Exact Locations Known John Gibson University of Waikato Bonggeun

More information

Our development of economic theory has two main parts, consumers and producers. We will start with the consumers.

Our development of economic theory has two main parts, consumers and producers. We will start with the consumers. Lecture 1: Budget Constraints c 2008 Je rey A. Miron Outline 1. Introduction 2. Two Goods are Often Enough 3. Properties of the Budget Set 4. How the Budget Line Changes 5. The Numeraire 6. Taxes, Subsidies,

More information

14.451 Lecture Notes 10

14.451 Lecture Notes 10 14.451 Lecture Notes 1 Guido Lorenzoni Fall 29 1 Continuous time: nite horizon Time goes from to T. Instantaneous payo : f (t; x (t) ; y (t)) ; (the time dependence includes discounting), where x (t) 2

More information

Marketing Mix Modelling and Big Data P. M Cain

Marketing Mix Modelling and Big Data P. M Cain 1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored

More information

From the help desk: Bootstrapped standard errors

From the help desk: Bootstrapped standard errors The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution

More information

Employer Learning, Productivity and the Earnings Distribution: Evidence from Performance Measures Preliminary and Incomplete

Employer Learning, Productivity and the Earnings Distribution: Evidence from Performance Measures Preliminary and Incomplete Employer Learning, Productivity and the Earnings Distribution: Evidence from Performance Measures Preliminary and Incomplete Lisa B. Kahn and Fabian Lange y Yale University January 8, 2009 Abstract Two

More information

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week 10 + 0.0077 (0.052)

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week 10 + 0.0077 (0.052) Department of Economics Session 2012/2013 University of Essex Spring Term Dr Gordon Kemp EC352 Econometric Methods Solutions to Exercises from Week 10 1 Problem 13.7 This exercise refers back to Equation

More information

Midterm March 2015. (a) Consumer i s budget constraint is. c i 0 12 + b i c i H 12 (1 + r)b i c i L 12 (1 + r)b i ;

Midterm March 2015. (a) Consumer i s budget constraint is. c i 0 12 + b i c i H 12 (1 + r)b i c i L 12 (1 + r)b i ; Masters in Economics-UC3M Microeconomics II Midterm March 015 Exercise 1. In an economy that extends over two periods, today and tomorrow, there are two consumers, A and B; and a single perishable good,

More information

An introduction to Value-at-Risk Learning Curve September 2003

An introduction to Value-at-Risk Learning Curve September 2003 An introduction to Value-at-Risk Learning Curve September 2003 Value-at-Risk The introduction of Value-at-Risk (VaR) as an accepted methodology for quantifying market risk is part of the evolution of risk

More information

The Dynamics of UK and US In ation Expectations

The Dynamics of UK and US In ation Expectations The Dynamics of UK and US In ation Expectations Deborah Gefang Department of Economics University of Lancaster email: d.gefang@lancaster.ac.uk Simon M. Potter Gary Koop Department of Economics University

More information

Voluntary Voting: Costs and Bene ts

Voluntary Voting: Costs and Bene ts Voluntary Voting: Costs and Bene ts Vijay Krishna y and John Morgan z November 7, 2008 Abstract We study strategic voting in a Condorcet type model in which voters have identical preferences but di erential

More information

Multiple Linear Regression in Data Mining

Multiple Linear Regression in Data Mining Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple

More information

Introducing the Multilevel Model for Change

Introducing the Multilevel Model for Change Department of Psychology and Human Development Vanderbilt University GCM, 2010 1 Multilevel Modeling - A Brief Introduction 2 3 4 5 Introduction In this lecture, we introduce the multilevel model for change.

More information

Trade Liberalization and the Economy:

Trade Liberalization and the Economy: Trade Liberalization and the Economy: Stock Market Evidence from Singapore Rasyad A. Parinduri Shandre M. Thangavelu January 11, 2009 Abstract We examine the e ect of the United States Singapore Free Trade

More information

The E ect of Trading Commissions on Analysts Forecast Bias

The E ect of Trading Commissions on Analysts Forecast Bias The E ect of Trading Commissions on Analysts Forecast Bias Anne Beyer and Ilan Guttman Stanford University September 2007 Abstract The paper models the interaction between a sell-side analyst and a risk-averse

More information

Clustered Standard Errors

Clustered Standard Errors Clustered Standard Errors 1. The Attraction of Differences in Differences 2. Grouped Errors Across Individuals 3. Serially Correlated Errors 1. The Attraction of Differences in Differences Estimates Typically

More information

Sharp and Diffuse Incentives in Contracting

Sharp and Diffuse Incentives in Contracting Risk & Sustainable Management Group Risk and Uncertainty Working Paper: R07#6 Research supported by an Australian Research Council Federation Fellowship http://www.arc.gov.au/grant_programs/discovery_federation.htm

More information

WORKING PAPER NO. 11-31 OUT-OF-SAMPLE FORECAST TESTS ROBUST TO THE CHOICE OF WINDOW SIZE

WORKING PAPER NO. 11-31 OUT-OF-SAMPLE FORECAST TESTS ROBUST TO THE CHOICE OF WINDOW SIZE WORKING PAPER NO. 11-31 OUT-OF-SAMPLE FORECAST TESTS ROBUST TO THE CHOICE OF WINDOW SIZE Barbara Rossi Duke University and Visiting Scholar, Federal Reserve Bank of Philadelphia Atsushi Inoue North Carolina

More information

1 Short Introduction to Time Series

1 Short Introduction to Time Series ECONOMICS 7344, Spring 202 Bent E. Sørensen January 24, 202 Short Introduction to Time Series A time series is a collection of stochastic variables x,.., x t,.., x T indexed by an integer value t. The

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

Identifying Moral Hazard in Car Insurance Contracts 1

Identifying Moral Hazard in Car Insurance Contracts 1 Identifying Moral Hazard in Car Insurance Contracts 1 Sarit Weisburd The Hebrew University April 6, 2010 1 I would like to thank Yehuda Shavit for his thoughtful advice on the nature of insurance data

More information

Binary Outcome Models: Endogeneity and Panel Data

Binary Outcome Models: Endogeneity and Panel Data Binary Outcome Models: Endogeneity and Panel Data ECMT 676 (Econometric II) Lecture Notes TAMU April 14, 2014 ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, 2014 1 / 40 Topics Issues

More information

Centre for Central Banking Studies

Centre for Central Banking Studies Centre for Central Banking Studies Technical Handbook No. 4 Applied Bayesian econometrics for central bankers Andrew Blake and Haroon Mumtaz CCBS Technical Handbook No. 4 Applied Bayesian econometrics

More information

Chapter 4: Statistical Hypothesis Testing

Chapter 4: Statistical Hypothesis Testing Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin

More information

Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13

Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional

More information

Understanding Order Flow

Understanding Order Flow Understanding Order Flow October 2005 Martin D. D. Evans 1 Richard K. Lyons Georgetown University and NBER U.C. Berkeley and NBER Department of Economics Haas School of Business Washington DC 20057 Berkeley,

More information

Using instrumental variables techniques in economics and finance

Using instrumental variables techniques in economics and finance Using instrumental variables techniques in economics and finance Christopher F Baum 1 Boston College and DIW Berlin German Stata Users Group Meeting, Berlin, June 2008 1 Thanks to Mark Schaffer for a number

More information

On Marginal Effects in Semiparametric Censored Regression Models

On Marginal Effects in Semiparametric Censored Regression Models On Marginal Effects in Semiparametric Censored Regression Models Bo E. Honoré September 3, 2008 Introduction It is often argued that estimation of semiparametric censored regression models such as the

More information

Panel Data: Linear Models

Panel Data: Linear Models Panel Data: Linear Models Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Panel Data: Linear Models 1 / 45 Introduction Outline What

More information

Multivariate Analysis of Variance (MANOVA)

Multivariate Analysis of Variance (MANOVA) Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction

More information

Mathematics. Rosella Castellano. Rome, University of Tor Vergata

Mathematics. Rosella Castellano. Rome, University of Tor Vergata and Loans Mathematics Rome, University of Tor Vergata and Loans Future Value for Simple Interest Present Value for Simple Interest You deposit E. 1,000, called the principal or present value, into a savings

More information

Adverse Selection. Chapter 3

Adverse Selection. Chapter 3 Chapter 3 Adverse Selection Adverse selection, sometimes known as The Winner s Curse or Buyer s Remorse, is based on the observation that it can be bad news when an o er is accepted. Suppose that a buyer

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Je rey M. Wooldridge The MIT Press Cambridge, Massachusetts London, England ( 2002 Massachusetts Institute of Technology All rights reserved. No part

More information

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA

More information

The Wage E ects of Not-for-Pro t and For-Pro t. Certi cations: Better Data, Somewhat Di erent Results

The Wage E ects of Not-for-Pro t and For-Pro t. Certi cations: Better Data, Somewhat Di erent Results The Wage E ects of Not-for-Pro t and For-Pro t Certi cations: Better Data, Somewhat Di erent Results Kevin Lang and Russell Weinstein y 7th June 2013 Abstract Using the Beginning Postsecondary Student

More information

A Basic Introduction to Missing Data

A Basic Introduction to Missing Data John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item

More information

The E ect of U.S. Agricultural Subsidies on Farm Expenses and the Agricultural Labor Market

The E ect of U.S. Agricultural Subsidies on Farm Expenses and the Agricultural Labor Market The E ect of U.S. Agricultural Subsidies on Farm Expenses and the Agricultural Labor Market Belinda Acuña y Department of Economics University of California, Santa Barbara Draft: October 2008 Abstract

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Investment and Financial Constraints: Empirical Evidence for Firms in Brazil and China

Investment and Financial Constraints: Empirical Evidence for Firms in Brazil and China Investment and Financial Constraints: Empirical Evidence for Firms in Brazil and China Stephen R. Bond Nu eld College and Department of Economics, University of Oxford and Institute for Fiscal Studies

More information

Random Effects Models for Longitudinal Survey Data

Random Effects Models for Longitudinal Survey Data Analysis of Survey Data. Edited by R. L. Chambers and C. J. Skinner Copyright 2003 John Wiley & Sons, Ltd. ISBN: 0-471-89987-9 CHAPTER 14 Random Effects Models for Longitudinal Survey Data C. J. Skinner

More information

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the

More information

Tiered and Value-based Health Care Networks

Tiered and Value-based Health Care Networks Tiered and Value-based Health Care Networks Ching-to Albert Ma Henry Y. Mak Department of Economics Department of Economics Boston Univeristy Indiana University Purdue University Indianapolis 270 Bay State

More information

Paid Placement: Advertising and Search on the Internet

Paid Placement: Advertising and Search on the Internet Paid Placement: Advertising and Search on the Internet Yongmin Chen y Chuan He z August 2006 Abstract Paid placement, where advertisers bid payments to a search engine to have their products appear next

More information

Investigating the Relationship between Gold and Silver Prices

Investigating the Relationship between Gold and Silver Prices Investigating the Relationship between Gold and Silver Prices ALVARO ESCRIBANO 1 AND CLIVE W. J. GRANGER 2 * 1 Universidad Carlos III de Madrid, Spain 2 University of California, San Diego, USA ABSTRACT

More information

Topic 5: Stochastic Growth and Real Business Cycles

Topic 5: Stochastic Growth and Real Business Cycles Topic 5: Stochastic Growth and Real Business Cycles Yulei Luo SEF of HKU October 1, 2015 Luo, Y. (SEF of HKU) Macro Theory October 1, 2015 1 / 45 Lag Operators The lag operator (L) is de ned as Similar

More information

A Subset-Continuous-Updating Transformation on GMM Estimators for Dynamic Panel Data Models

A Subset-Continuous-Updating Transformation on GMM Estimators for Dynamic Panel Data Models Article A Subset-Continuous-Updating Transformation on GMM Estimators for Dynamic Panel Data Models Richard A. Ashley 1, and Xiaojin Sun 2,, 1 Department of Economics, Virginia Tech, Blacksburg, VA 24060;

More information

Information and Human Capital Management

Information and Human Capital Management Information and Human Capital Management Heski Bar-Isaac y Ian Jewitt z Clare Leaver x December 2008 Abstract Employees di er both in terms of general human capital and rm-speci c human capital (or match

More information

Chapter 1. Linear Panel Models and Heterogeneity

Chapter 1. Linear Panel Models and Heterogeneity Chapter 1. Linear Panel Models and Heterogeneity Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans Université d Orléans January 2010 Introduction Speci cation

More information

Review of Bivariate Regression

Review of Bivariate Regression Review of Bivariate Regression A.Colin Cameron Department of Economics University of California - Davis accameron@ucdavis.edu October 27, 2006 Abstract This provides a review of material covered in an

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

The Microstructure of Currency Markets

The Microstructure of Currency Markets The Microstructure of Currency Markets Martin D. D. Evans Department of Economics Georgetown University and NBER July 2010 Abstract This article summarizes exchange-rate research using microstructure models.

More information

Introduction to Longitudinal Data Analysis

Introduction to Longitudinal Data Analysis Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction

More information

Testosterone levels as modi ers of psychometric g

Testosterone levels as modi ers of psychometric g Personality and Individual Di erences 28 (2000) 601±607 www.elsevier.com/locate/paid Testosterone levels as modi ers of psychometric g Helmuth Nyborg a, *, Arthur R. Jensen b a Institute of Psychology,

More information

Decision-Based Forecast Evaluation of UK Interest Rate Predictability*

Decision-Based Forecast Evaluation of UK Interest Rate Predictability* DEPARTMENT OF ECONOMICS Decision-Based Forecast Evaluation of UK Interest Rate Predictability* Stephen Hall, University of Leicester, UK Kevin Lee, University of Leicester, UK Kavita Sirichand, University

More information

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College. The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables Kathleen M. Lang* Boston College and Peter Gottschalk Boston College Abstract We derive the efficiency loss

More information

Chapter 3: Section 3-3 Solutions of Linear Programming Problems

Chapter 3: Section 3-3 Solutions of Linear Programming Problems Chapter 3: Section 3-3 Solutions of Linear Programming Problems D. S. Malik Creighton University, Omaha, NE D. S. Malik Creighton University, Omaha, NE Chapter () 3: Section 3-3 Solutions of Linear Programming

More information

Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses

Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses G. Gordon Brown, Celia R. Eicheldinger, and James R. Chromy RTI International, Research Triangle Park, NC 27709 Abstract

More information

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved 4 Data Issues 4.1 Truncated Regression population model y i = x i β + ε i, ε i N(0, σ 2 ) given a random sample, {y i, x i } N i=1, then OLS is consistent and efficient problem arises when only a non-random

More information

Representation of functions as power series

Representation of functions as power series Representation of functions as power series Dr. Philippe B. Laval Kennesaw State University November 9, 008 Abstract This document is a summary of the theory and techniques used to represent functions

More information