MEASURING INFORMATION TECHNOLOGY SPILLOVERS

Similar documents

JOB HOPPING, INFORMATION TECHNOLOGY SPILLOVERS, AND PRODUCTIVITY GROWTH

Network of Practice, IT Knowledge Spillovers, and Productivity: Evidence from Enterprise Software

IT Productivity and Aggregation using Income Accounting

BIG DATA INVESTMENT, SKILLS, AND FIRM VALUE

Do R&D or Capital Expenditures Impact Wage Inequality? Evidence from the IT Industry in Taiwan ROC

The Relationship Between Computer Networks and Plants Labor Productivity

Market Value of R&D, Patents, and Organizational Capital: Finnish Evidence

Abstract. Keywords: Information technology capability, Firm performance, Enterprise systems, ERP systems

Marketing Mix Modelling and Big Data P. M Cain

Three-Way Complementarities: Performance Pay, Human Resource Analytics, and Information Technology

Crowdsourced Digital Goods and Firm Productivity

IT Knowledge Spillovers and Productivity: Evidence from Enterprise Software *

Multiple Linear Regression in Data Mining

Prasanna Tambe Stern School of Business, New York University New York, NY

Introduction. Current methodology

The primary goal of this thesis was to understand how the spatial dependence of

Markups and Firm-Level Export Status: Appendix

The Decline of the U.S. Labor Share. by Michael Elsby (University of Edinburgh), Bart Hobijn (FRB SF), and Aysegul Sahin (FRB NY)

How Much Equity Does the Government Hold?

Strength in Numbers: How Does Data-Driven Decisionmaking Affect Firm Performance?

Financial Risk Management Exam Sample Questions/Answers

Yao Zheng University of New Orleans. Eric Osmer University of New Orleans

THE MARKET VALUATION OF INNOVATION: THE CASE OF INDIAN MANUFACTURING

11. Internationalisation and firm productivity: firm and regional level effects. Authors Stefan Groot (PBL) Anet Weterings (PBL)

FOREIGN TAXES AND THE GROWING SHARE OF U.S. MULTINATIONAL COMPANY INCOME ABROAD: PROFITS, NOT SALES, ARE BEING GLOBALIZED.

Social Security Eligibility and the Labor Supply of Elderly Immigrants. George J. Borjas Harvard University and National Bureau of Economic Research

Combining Information Technology and Decentralized Work Organization: SMEs versus Larger Firms

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Intangible Assets: How the Interaction of Computers and Organizational Structure Affects Stock Market Valuations

Stock market booms and real economic activity: Is this time different?

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

4. Simple regression. QBUS6840 Predictive Analytics.

Information Technology and Training in Emergency Call Centers

Chapter 11 Introduction to Survey Sampling and Analysis Procedures

The Life-Cycle Motive and Money Demand: Further Evidence. Abstract

Conditional guidance as a response to supply uncertainty

The Determinants and the Value of Cash Holdings: Evidence. from French firms

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

Computing Productivity: Firm-Level Evidence

This article presents a simple framework

Discussion of Momentum and Autocorrelation in Stock Returns

EFFECT OF INVENTORY MANAGEMENT EFFICIENCY ON PROFITABILITY: CURRENT EVIDENCE FROM THE U.S. MANUFACTURING INDUSTRY

Premium Copayments and the Trade-off between Wages and Employer-Provided Health Insurance. February 2011

Economy-Wide and Industry-Level Impact of Information Technology

Lecture Note: Digital Innovation Value Maximization

HAS THE PRODUCTIVITY BOOM FINALLY ARRIVED?

FDI as a source of finance in imperfect capital markets Firm-Level Evidence from Argentina

Assessing Three-Way Complementarities: Abstract

The Wage Return to Education: What Hides Behind the Least Squares Bias?

Response to Critiques of Mortgage Discrimination and FHA Loan Performance

Center for Research on Information Technology and Organizations UC Irvine

This study extends existing information technology (IT) productivity research by evaluating the contributions

IT Infrastructure Governance and IT Investment Performance: An Empirical Analysis

Combining Technology and Work Organization: An Analysis of Complementarity between IT and Decentralization across Firms of Different Size

The Impact of Information Technology on the Performance of Diversified Firms

How To Calculate Export Earnings

This content downloaded on Tue, 19 Feb :28:43 PM All use subject to JSTOR Terms and Conditions

Why Do High-Tech Firms Offer Perks at Work?

11.2 Monetary Policy and the Term Structure of Interest Rates

Inflation. Chapter Money Supply and Demand

Online Appendices to the Corporate Propensity to Save

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Neural Networks for Sentiment Detection in Financial Text

Is Economics a Good Major for Future Lawyers? Evidence from Earnings Data

SYSTEMS OF REGRESSION EQUATIONS

The Effect of R&D Expenditures on Stock Returns, Price and Volatility

Integrated Resource Plan

Why High-Order Polynomials Should Not be Used in Regression Discontinuity Designs

The Business Value of CRM Systems: Productivity, Profitability, and Time Lag

Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?

Who Benefits More from Social Media: Evidence from Large-Sample Firm Value Analysis

INVESTING IN U.S. COMPETITIVENESS:

HOW IMPORTANT IS BUSINESS R&D FOR ECONOMIC GROWTH AND SHOULD THE GOVERNMENT SUBSIDISE IT?

Panel Data Econometrics

Information Technology, Workplace Organization, and the Demand for Skilled Labor: Firm-level Evidence *

How local are IT services markets: Proximity and IT outsourcing

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

Bootstrapping Big Data

Impact of Knowledge Repository Use on the Performance of Technical Support Tasks (In Progress)

The Elasticity of Taxable Income: A Non-Technical Summary

Investment in Enterprise Resource Planning: Business Impact and Productivity Measures

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

Government Spending Multipliers in Developing Countries: Evidence from Lending by Official Creditors

2. Linear regression with multiple regressors

The Impact of ERP on Business Performance and Productivity

VI. Real Business Cycles Models

Do Currency Unions Affect Foreign Direct Investment? Evidence from US FDI Flows into the European Union

V. S. Kuznetsov WHAT IS THE IMPACT OF INFORMATION TECHNOLOGY ON REVENUE GROWTH, RISK AND DECISION MAKING?

An Econometric Assessment of Electricity Demand in the United States Using Panel Data and the Impact of Retail Competition on Prices

Re-examining the Cost-Effectiveness of the Research and Experimentation Tax Credit

Did Medicare Induce Pharmaceutical Innovation?

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

A Panel Data Analysis of Corporate Attributes and Stock Prices for Indian Manufacturing Sector

R&D and Productivity Growth: A Review of the Literature

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:

This PDF is a selection from an out-of-print volume from the National Bureau of Economic Research

Export Pricing and Credit Constraints: Theory and Evidence from Greek Firms. Online Data Appendix (not intended for publication) Elias Dinopoulos

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Transcription:

MEASURING INFORMATION TECHNOLOGY SPILLOVERS Prasanna Tambe Stern School of Business, New York University New York, NY 10012 ptambe@stern.nyu.edu Lorin M. Hitt The Wharton School, University of Pennsylvania Philadelphia, PA 19104 lhitt@wharton.upenn.edu June 2011 Abstract The measurement of the impact of IT spillovers on productivity is an important emerging area of research. Studies of IT spillovers often adopt a production function approach commonly used for measuring R&D spillovers, in which an external pool of IT investment is modeled using weighted measures of the IT investments of other firms, industries, or countries. We show that when using this approach, measurement error in a firm s own IT inputs can create a significant upward bias on the estimated social returns to IT investment. This problem is particularly severe for estimating IT spillovers due to the high levels of measurement error in most available IT data. This bias can be addressed by using instrumental variable techniques to correct the measurement error in a firm s own IT inputs. Using panel data on IT investment, we show that measurement error corrected estimates of IT spillovers are 40 to 90% lower than uncorrected estimates. This bias term is increasing in the correlation between the IT pool and firms own IT investment. Therefore, when instruments are not available, the use of fine-grained data on transmission paths can be an effective solution because IT spillover channels are more likely than R&D spillover channels to cut across industry boundaries, minimizing the correlation between a firm s own IT investment and the constructed external IT pool. Implications for researchers, policy makers, and managers are discussed. Keywords: IT spillovers, IT productivity, measurement error Electronic copy available at: http://ssrn.com/abstract=1284373

1. Introduction Understanding the relationship between IT and productivity has been an important area of economic research for several decades. Although most prior literature has focused on the productivity of a firm s own IT investments, many researchers are now shifting attention towards the estimation of social returns (i.e. spillovers ) from IT investment, both in the US and abroad (Dedrick et al 2003; Van Reenen et al 2010). To assess the contribution of IT spillovers to productivity and growth, a common empirical strategy is to embed the aggregate pool of external IT investment from which a firm derives productivity spillovers into a "production function," along with conventional inputs and a firm's own IT investment levels. This approach is derived from a large and influential literature focused on estimating the impact of R&D spillovers on productivity (e.g., see Griliches 1992). A survey of the IT spillovers literature produced over thirty recent papers of those that attempted to measure IT spillovers, almost 80% used a variant of this approach. 1 This paper demonstrates that the application of this empirical framework to the IT context can produce estimates that significantly overstate the size of IT spillovers. It focuses on a form of the errorsin-variables problem in which measurement error is present in a firm s own investment, and the external pool is modeled by aggregating the IT investments of other firms. It is well known that when there is measurement error in the IT input, estimates of the productivity of a firm s own IT investments will be biased towards zero. Because of a) the problems with measuring and valuing IT and b) the importance of accurately measuring the contribution of IT to productivity, this measurement problem has occupied a central role in the empirical literature on the productivity of IT investments (Brynjolfsson 1993; Barua, Kriebel, and Mukhopadhyay 1995; Hitt and Brynjolfsson 1996; Greenan and Mariesse 1996; Barua and Lee 1997; Brynjolfsson and Hitt 2003). However, because the external IT pool measure is often highly 1 Recent papers discussing IT-related spillovers include Quah, 2001, Mun and Nadiri 2002, Stiroh 2002, Dedrick et al 2003, Van Leeuwen and van der Wiel 2003, Gelb, Getz, and Oberman 2003, Ganley, Kraemer, Wong 2003; Atrostic and Nguyen 2004, Rincon and Vecchi 2004, Lee and Guo 2004, Daveri and Silva 2004, Dutta and Otsuka 2004, Becchetti and Adriani 2005, O Mahoney and Vecchi 2005, Tanuwidjaja 2006, Cheng and Nault 2007, Park, Shin, and Shin 2007, Park, Shin, and Sanders 2007, Baker, Fuss and Waverman 2008, Chou, Kauffman, and Shao 2009, Lopez-Pueyo, Sanau, and Barcellina 2009, Venturini 2009, Zhang and Lee 2009, Acharya and Basu 2010, Chang and Gurbaxani 2010, Han, Chang, and Hahn 2010, Severgnini, 2010; Van Reenen et al 2010, Cheng and Nault 2011, Kooshki and Ismail 2011, Tambe and Hitt 2011. 2 Electronic copy available at: http://ssrn.com/abstract=1284373

correlated with firms' IT investments, measurement error can also result in a significant upward bias being transmitted to the IT spillover coefficient. The bias transmitted to the spillover coefficient will be especially large if the IT pool is constructed from the investments of other firms within an industry, an approach widely used when fine-grained data on spillover mechanisms 2 is unavailable. Because firms within an industry share common opportunity and factor costs, they often make similar investments, so the correlation between firms own investments and a spillover pool measure constructed in this way is often particularly high. This problem is particularly serious because of the measurement error in most available IT data, which even in the most widely used data sets has been estimated to be as high as 30-40% of the total measure variance (Brynjolfsson & Hitt, 2003). Relative to data on R&D expenditures, which captures a larger fraction of R&D spending, most types of IT data represent a small fraction of total IT expenditures, which in addition to computers can include organizational restructuring, software, IT wages, and training. Moreover, because IT researchers generally do not benefit from observable linkages such as patent citations that describe the direction of spillovers, they are more likely to use IT spillover models with undefined pathways that we show are more vulnerable to the transmission of bias. Under these conditions, the bias term transmitted to the spillover coefficient can be even larger than the true estimate. These biases can be worse when using panel methods, which increase the variance of error terms relative to the variation in productive inputs. We recommend two solutions. Although poorly measured IT inputs make it difficult to interpret IT spillover estimates, the use of instrumental variables can reduce or eliminate the bias. The classic approach to addressing the errors-in-variables problem involves the use of instrumental variables, in which measurement error in the instrument is uncorrelated with measurement error in the primary measure (Johnston & Dinardo, 1997). In this study, we use alternative measures of information 2 This paper uses the terms spillover mechanisms and transmission paths interchangeably to describe channels through which firms capture productivity benefits from the investments of other firms. Some examples are channels that promote the transfer of know-how related to IT best practices, such as labor mobility, shared membership in technical consortia, and software packages that encode business routines. 3

technology capital to correct for biases in our IT-spillover estimates. After correcting the measurement error in the IT input, the magnitudes of our IT spillover estimates fall by 40 to 90%. A similar approach applied to measuring R&D spillovers suggests much smaller biases, which is consistent with the hypothesis that larger measurement error in IT inputs makes this a particularly severe issue for the measurement of IT spillovers. Good instruments, however, are not always available, and weak instruments, especially within the errors-in-variables context, can cause problems for inference and estimation (Hausman, 2001). Therefore, we also discuss how the use of fine-grained data on spillover mechanisms can affect the size of the bias term. The bias term is increasing in the covariance between a firm s own IT investment and the IT pool, and decreasing in the variance of the IT pool. When productivity spillovers are transmitted between firms that face different opportunity and factor costs (e.g. those in different industries), the use of transmission path data can reduce co-variation between the IT pool and measures of own IT investment, reducing the size of the bias term. This is a useful distinction because unlike R&D spillovers, which tend to flow between technologically similar firms, IT-related process innovations frequently are equally likely to spill over across industry boundaries due to the general-purpose technology nature of IT. We demonstrate that the use of fine-grained data on transmission paths lowers the effects of the bias term for IT spillovers, but does not significantly impact the size of the bias term for R&D spillovers. These issues are important for several reasons. The measurement of IT spillovers has been identified as a promising area for future research because accurate estimates are important for developing a complete understanding of how IT affects growth (Dedrick et al 2003; Van Reenen et al 2010). As an economic externality, they are an issue of special interest to economists and policy makers. For example, in sectors where IT spillovers are economically important, policy makers may consider applying public funds to stimulate IT-enabled growth. Accurate estimates of the sizes of spillovers from IT investments are a prerequisite for effectively allocating these subsidies, and subsidies based on estimates of IT spillovers that overstate their true values will not have the desired effect. Establishing the size of IT 4

spillovers also has implications for understanding variation in IT returns and the allocation of IT value among firms, both of which are of interest to IT researchers and managers. The paper, therefore, contributes to a large and established literature on the measurement of IT productivity. Over the past two decades, the IT productivity literature has benefited greatly from studies that examine how modeling and measurement decisions affect estimates of IT value (some notable examples are Barua, Kriebel, and Mukhopadhyay 1995; Mooney, Gurbaxani, and Kraemer 1996; Barua and Lee 1997; Hitt and Brynjolfsson 1996; Chan 2000; Santhanam and Hartono 2003; Zhu and Kraemer 2003; Devaraj and Kohli 2003; Brynjolfsson and Hitt 2003; Burton-Jones and Straub 2006). This paper is in the same spirit, but is among the first to be focused on issues related to the measurement of social, rather than private returns to IT investments. Given the broad difficulties that researchers have faced measuring R&D spillovers, this is likely to be a fertile topic in fact, a number of recent and influential papers have focused on the challenges associated with using production functions to accurately measure R&D spillovers (Breschi and Lissoni, 2001; Knott, 2008; Knott, Wu, and Posen, 2009; Bloom, Schankermann, and Van Reenen, 2011). Section 2 highlights the relevance of this paper to the existing and emerging literature on IT productivity measurement. Section 3 analytically demonstrates the bias that is induced by measurement errors in the IT input. In Section 4, we use IT data to demonstrate that correcting the measurement error using instrumental variables significantly changes the estimated coefficients -- increasing the own IT coefficient and decreasing the estimates of the returns to the IT pool. We use representative data sets for information technology spillovers, both in cross-sectional and panel settings, and we consider how using different spillover transmission paths can affect the size of these biases. We also show that these distortions are considerably higher for IT spillovers than for R&D spillovers. Section 5 concludes the study. 2. Background Literature 5

One of the primary challenges faced by IT value researchers has been the lack of robust, consistently available measures of IT investment. Unlike data on R&D investments, regularly collected samples of firm-level IT data have been difficult to obtain. Instead, researchers have based IT productivity studies on data sets that are incomplete or available only over short time periods. For example, researchers have used IT capital stock data collected by marketing firms (Dewan and Min 1997; Brynjolfsson and Hitt 2003), IT employment data collected through surveys (Lichtenberg, 1995; Brynjolfsson and Hitt, 1996), IT asset allocations collected from managerial surveys (Bharadwaj 2000; Aral and Weill 2007; Saunders 2010; Mithas et al 2011), and IT labor data from archival sources (Tambe and Hitt 2011). Most of these data sources for example, estimates of IT capital stock from managerial surveys contain significant error because of the difficulties associated with estimating and valuing IT capital stock (Brynjolfsson 1993; Dedrick et al 2003). IT assets are difficult to measure for several reasons. First, because of the widespread use of information technologies within the firm, a large fraction of IT hardware purchases may be transacted without the knowledge of IT personnel, making it difficult for an IS manager to assess the total value of computer capital stock in the firm. Second, rapid changes in the quality-adjusted pricing of computer hardware have made it difficult to accurately estimate the value of information technologies within the firm. This problem is further aggravated because IT assets can differ along many intangible dimensions, and assigning values to all of these can be difficult (Brynjolfsson and Hitt, 2000). Finally, a considerable portion of the value created by IT, such as the creation of software, IT-enabled business processes or databases, is not recorded by conventional measurements of IT capital stock, but may represent a large fraction of a firm's IT investment. To the extent that the ratio of hardware to these types of IT assets varies across firms, further measurement error is introduced. Brynjolfsson and Hitt estimate that in the CITDB IT capital stock data set, probably the largest and most widely used data set for IT productivity research, the error variance may be as large as 30-40% of the total measure variance (Brynjolfsson and Hitt 2003). 6

This type of IT measurement error exerts a downward bias on estimates of the productivity of IT investments, which is a key reason that it proved difficult for many years to provide evidence of positive returns to IT investment (Brynjolfsson 1993). Because of the important role played by IT measurement error in the IT productivity paradox, contributions related to modeling and measurement error have played a key role in advancing the IT productivity literature (Barua, Kriebel, and Mukhopadhyay 1995; Greenan and Mairesse 1996; Barua and Lee 1997; Hitt and Brynjolfsson 1996; Brynjolfsson and Hitt 2003). To address the effects of measurement error on IT estimates, researchers have used several different approaches, including using long differences to remove the effects of random measurement error (Brynjolfsson and Hitt 2003), using instrumental variables to correct measurement error (Tambe and Hitt 2011), restricting the sample to a more homogenous set of firms (Barua and Lee 1997), and using statistical models of measurement error to correct biased estimates (Greenan and Mairesse 1996). In general, these studies find that correcting or reducing measurement error significantly raises the estimated returns to IT investment, as predicted by the classic errors-in-variables framework. As the IT productivity paradox has largely been resolved through the use of improved data and methods, some of the attention in the IT value literature has turned towards estimating social returns to IT investment, which has been identified as an important area for future research and is necessary for developing a full understanding of how IT investments affect growth (Dedrick et al 2003). Firms may derive benefits from the IT investments of other firms through mechanisms that facilitate the transfer of know-how related to new technologies, standards, and practices. Given the uncertainty and costs associated with matching new work practices to new technologies, it may be easier for firms to imitate other firms when adopting new IT-enabled production methods, rather than having to discover the right combinations of practices on their own through trial-and-error (Brynjolfsson and Hitt 2000). Other potential sources of IT externalities include benefits derived from networked applications that enable superior coordination with other firms including suppliers and customers (Cheng and Nault 2007, 2011). The productivity benefits derived from rising levels of external IT investment, therefore, may be substantial, and may have important implications for explaining a firm s own productivity as well other 7

economic phenomena such as regional variation in IT adoption and IT returns (Forman, Goldfarb, and Greenstein 2005). The literature providing econometric evidence of IT spillovers, however, is in its relative infancy. It has primarily relied on methods derived from the literature on R&D spillovers in which researchers embed a measure of the public R&D pool into a production function along with other factors of production to estimate how much the public pool of R&D knowledge contributes to productivity. However, even for estimating R&D spillovers, where investment data are more systematically and comprehensively collected, the circumstantial nature of the approach presents a number of econometric challenges. Surveys of the empirical R&D spillovers literature demonstrate significant variation in R&D spillover estimates (Griliches, 1992; Nadiri, 1993), and these surveys emphasize the importance of better data and techniques for creating more accurate estimates of R&D spillovers. Concerns about the interpretation of R&D spillover estimates have also motivated a recent literature focused on the limitations of the production function approach for assessing the contributions of R&D spillovers to productivity and growth (Breschi & Lissoni, 2001; Knott, 2008; Knott, Posen, & Wu, 2009; Bloom, Schankerman, & Van Reenen, 2011). These studies have focused attention on how differences in how R&D spillover pools are measured and modeled can lead to large differences in estimates, as well as on the importance of using data on the micro-foundations of knowledge diffusion to improve the accuracy with which spillovers can be studied. Like R&D spillovers, IT spillovers also have important implications for growth policy and strategic decision-making, and therefore merit the same attention to factors that affect the accuracy of estimates. Although the measurement of IT spillovers shares many econometric challenges in common with the measurement of R&D spillovers, the issues with measuring and valuing IT have been a longstanding problem in the IT literature and introduce unique difficulties for the measurement of IT spillovers. In the next section, we analytically demonstrate why IT measurement error which led to understated IT estimates in the earlier IT productivity literature may lead to inflated estimates of IT spillovers. 8

3. Analysis 3.1 Framework We begin by describing the production function approach commonly used for estimating the relationship between productivity and external investments in IT, motivated by the literature on the productivity of R&D spillovers. The basic framework, pioneered by Griliches (1979), relates output to conventional inputs (X it ), its own investments in the knowledge-generating asset (K it ), the aggregate investment in the industry (K at ), and total factor productivity (B). (1) Y it = BX 1 γ it K γ it K µ at Thus the productivity of a firm depends not only on its own investment in the knowledge-generating asset, but also on external investments in the knowledge-generating asset. Implicit in this model are transmission paths through which firms capture productivity benefits from the investments of other firms. For example, know-how from other countries may spill-in through trade or foreign direct investment, and firm-level productivity spillovers may occur through the transfer of knowledge from patent disclosures, supplier innovations, competitor analyses, inter-firm employee movements, customer collaborations, product market interaction, or other types of network linkages. Proxies for transmission path strength, such as geographic proximity, may also be informative about the direction and strength of knowledge flow between institutions. Variants of this model have been widely used with a variety of different dependent variables. For example, a firm s own IT investments and the aggregated investment of other firms in the industry have been tested against observable measures of innovation, such as patent counts, market value, firm sales, R&D investment, Tobin's q, or combinations of these measures as dependent variables. 3 In this paper, we focus on estimating the contribution of spillovers at the firm level and we emphasize results based on this production framework. Our results, however, are also applicable to studies at more aggregate levels, especially in contexts in which input measurement is subject to 3 See, e.g., Griliches (1992) for a survey of this literature. 9

significant error. In this paper, following the most common approach, we analyze a Cobb-Douglas specification in which output (Y) at time t of firm i affected by industry j is related to conventional inputs [capital C and labor L], knowledge capital (K), and the aggregated knowledge capital of other firms (K a ). A log-levels version of this model can be written as: (2) yit = a( i, j, t) + " ccit + " llit + " kkit + " k k a at +! it where lowercase variables denote logs and the coefficients are output elasticities. This model forms the basis of much of the empirical work that appears in the literature and in the rest of this paper. 3.2 Errors-In-Variables Bias The biases discussed in this paper arise from measurement error in inputs. It is well known that measurement error in an independent variable can bias the estimate of the coefficient on that variable towards zero, and that this bias can be transmitted to other variables, with the direction of the bias term determined by the variance-covariance matrix of the observations (Levi, 1993). Because firm investment patterns tend to mimic that of other firms in the same industry, measures of own IT investment will tend to co-vary with the IT pool in specifications such as the one shown in (2). For example, because firms in a given industry experience similar factor costs and technological opportunity, firms in industries with high IT intensity tend also to invest heavily in IT. Some of the bias from mismeasured IT, therefore, will be transmitted to the coefficient on the IT spillover term, biasing that coefficient upwards because own IT investment will in general be positively correlated with aggregate industry investment. The size of the bias term is increasing in the covariance between these inputs, and because of the small sizes of spillover estimates relative to other production inputs, the bias term can be of the same order as the estimates themselves. More formally, consider a model in which, for simplicity, we assume only two inputs to production: own investment (x k ), and the aggregate investment in the economy that is available to a firm (x s ), and suppose that the relation 10

(3) y! x +! x + µ = k k s s is known to hold between the true values of y, x k, and x s. We also make standard homoskedasticity assumptions, as well as assuming that E ( µ ) = 0, that all variables are measured as deviations from means, and that µ is independent of both x k and x s. Furthermore, suppose that the first independent variable, x k, is measured with error * (4) x k = x k +! k and that x s is constructed as the weighted average of the mismeasured input of other firms. j" I ( i) j" I ( i) * * (5) x s =! w j x jk =! w j ( x jk + $ jk ) = xs + $ s j# i j# i The OLS estimator can be written (6)! ˆ $1 = (" + #) "! where! is the covariance matrix of the observations x, and! is the covariance matrix of the error terms on the independent variables. In our two-variable case, this can be expanded to (7) &( ˆ # k 1 $! = %( ˆ 2 2 2 " () *) * ') s x x xk x ) s k s 2 2 2 &) ) * ') x x x k s k $ 2 $ ) % xk x ) s * k xs ) 2 xs ) ) xk xs 2 * xk ) 2 * s 2 xk xs ') #&( k #! $!!" %( s " The coefficient estimates on each term are a linear combination of their true values, attenuated by measurement error, and a bias term that is transmitted from the other input. In general, however, we expect " << because of the convexity of the weighting terms in the variance computation. 2 2! " s! k Furthermore, results from existing studies suggest that! k >! s, so we expect that the bias on the spillover coefficient will, in most cases, be considerably larger than the bias on the coefficient for the firm's own capital. The multiplier on the bias in the spillover coefficient is increasing in both the covariance between the two inputs, and in the error in the mismeasured input. When there is no measurement error, we recover the unity matrix, so that the OLS estimates converge to their true values. If the two inputs are uncorrelated, the estimates are attenuated versions of their true values, but there is no 11

transmission of bias from the other input. In practice, however, if the spillover term is constructed as in (5), then we expect! x k x to be non-zero because firms co-located in industries with high IT intensity, such s as finance, are all likely to invest heavily in IT. The size of the bias on the spillover coefficient is also decreasing in the variance of the spillover pool. All other things being equal, therefore, the largest bias terms will result from spillover pools that co-vary closely with the measure of own investment, and that exhibit little variation. Although the above analysis applies to cross-sectional models, many studies use panel methods to account for individual effects. With fixed-effects estimators, estimates of the contribution of IT capital stock come from the variation in investment across time from a single firm, which will, in general, be smaller than the variation in IT investment across firms. This decreases the "signal-to-noise" ratio between capital measurement and its error, further attenuating the contribution of own capital, and increasing the size of the bias term relative to the contribution of own capital (Griliches and Hausman, 1986; Mairesse, 1992). Therefore, the use of fixed-effects estimators in this context can lead to biases that are larger than in related cross-sectional regressions. In our empirical analysis below, we use a panel data set to demonstrate these problems in an empirical context. 4. Measurement of IT Spillovers 4.1. Overall Approach In this section, we first follow conventional methods to estimate the magnitudes of IT spillovers. We estimate a Cobb-Douglas production function in log-levels using data on value-added, capital, labor, IT, and measures of aggregate IT investment as described in equation (2) above. We then show that using instrumental variables to correct for measurement error in IT inputs significantly reduces the size of the spillover estimates. We also show that using data on spillover pathways that reduces the covariance between a firm s own IT investments and the external IT pool significantly reduces the bias term. In both 12

cases, we make comparisons with how similar adjustments affect R&D spillovers to demonstrate that these issues are particularly relevant for the measurement of IT spillovers. 4.2. Measurement Error and IT Spillovers Our IT data set is constructed by combining computer stock data from Computer Intelligence InfoCorp (CII) with financial information from Compustat. Rental prices and capital factors are provided by the Bureau of Labor Statistics (BLS), and price deflators are obtained from government sources. CII collects data for the 1000 largest firms in the United States (Fortune 1000). Our panel consists of observations from 529 firms over the eight-year period ranging from 1987-1994, resulting in 3954 total firm-year observations after omitting firms with incomplete data, and those which had missing data other than at the beginning or end of the measurement period. The firms in the sample are large, averaging $1 billion in value-added. Within the sample, 57% of the firms are from the manufacturing industry, 41% from service, and 2% from mining, construction and agriculture. Some service industries -- banking, insurance -- are largely excluded because many of the firms in these industries do not report ordinary capital stock on Compustat. Because these industries are particularly computer-intensive, the firms in our sample are somewhat less computer-intensive than the economy as a whole. Otherwise, our sample appears to be broadly representative of large firms in the U.S. economy, and firms in the sample account for about 15% of total U.S. economic output over our sample period. To test the hypothesis that measurement error produces an upward bias in IT spillover estimates, we correct for measurement error using an alternative measure of IT capital stock from the marketing research firm International Data Group (IDG) as an instrumental variable. To be an effective instrument, measurement error in the IDG-provided instrumental variable should be uncorrelated with the measurement error in our primary information technology capital stock measure as provided by CII. The independence of the error terms of these two different measures derives from the different methods through which these two market research firms value IT capital. 13

CII conducts surveys to track specific pieces of computer equipment at the site level and interviews information systems managers, at intervals ranging from monthly to annually, to obtain detailed information on a site s IT hardware assets. The interview process includes checking on hardware reported in previous interviews to make accurate comparisons. CII assesses the market value of each piece of hardware, and aggregates the numbers to form a measure of total hardware use at the firm. As mentioned above, these valuation data omit software, stored data, information system staff, and telecommunications equipment. Market valuation is performed by a proprietary algorithm developed by CII that takes into account market-based rental prices and machine configurations in determining an estimate. IDG, by comparison, uses very different methods to collect its data. IDG surveys a single officer in the firm. The officer is asked to report the market value of central processors and the PCs and terminals in that firm. The number of PCs and terminals is multiplied by an estimated value, determined by the average nominal PC price over 1989-1991 in Berndt and Griliches' (1990) study of hedonic prices for computers. Thus, both of these approaches include potential sources of error. However, because the CII data is collected at the site level from a number of managers, and the IDG data is collected from a single officer through survey questions, there is little reason to believe that errors between the two measures are correlated. Finally, because of higher accuracy from CII s more rigorous methods for counting, tracking, and valuing the assets, we use their data as our primary variable. First, we consider the impact of correcting measurement error using an instrumental variable. Note that the use of this instrument removes measurement error, but will not in general correct other types of biases such as those related to omitted variables. Therefore, the estimates should be viewed not as correct spillover estimates, but spillover estimates with biases related to measurement error removed. Of our 3,954 firm-year observations available from CII, only 1,324 have corresponding values for the IDG-provided instrumental variable. Rather than restrict our sample, we use the full data set and include a dummy variable to indicate if the instrument is not available. We use the full sample because the larger sample size allows us to increase the precision of our estimates, which is helpful for identifying 14

information technology related spillover effects. 4 Our strategy is to produce estimates of spillovers using the specification described in (2) and then to show that correcting for measurement error reduces the sizes of these estimates by reducing estimation bias. Our spillover pools are computed as the average IT investment of other firms in the same two-digit Standard Industry Classification (SIC). Table 1 shows the results of estimating (2) with our firm-level data. Columns 1 and 2 show estimates of the contribution of IT capital with and without the measurement error correction. Application of the instrument, in Column 2, increases the size of the coefficient on IT capital and reduces the contribution of all other inputs. This increase is consistent with the removal of bias caused by measurement errors in the dependent variable, and Hausman test results confirm that the change to the coefficient on IT capital is meaningful (t = 7.47, p < 0.01). These estimates are also similar in size to those reported by other studies using these data (e.g., Brynjolfsson and Hitt, 2003). Columns 3 and 4 show the results of OLS estimation with the IT pool included in the specification. The coefficient for IT spillover's contribution to output is about half that of the firm's IT capital, and like the other inputs, is reduced by application of the instrument (see Column 4). The corrected estimates of spillover are not significantly different from zero, and the point estimates are much smaller than the equivalent values in Column 3. Like in the first set of regressions, a Hausman test confirms the significance of the change to the IT capital coefficient (t = 6.86, p < 0.01). In Columns 2 and 4, we also report the results of our firststage regressions and Anderson-Rubin statistics, which provide evidence of the relevance and validity of our instrument in the IV analysis (p < 0.00 for both tests). In Columns (5) and (6), we show the results of similar regressions run on a subset of the data that contained manufacturing industries -- a sector in which studies often show evidence of much larger externalities. Column 5 shows that the estimate of the coefficient on spillover in manufacturing is more than twice as large as the spillover estimate in the full sample, and is considerably larger than the estimate on the contribution of a firm's internal IT capital 4 Although our spillover coefficient estimates are not significant with the 1,324-observation restricted sample, our results are not substantially changed. In both the restricted and unrestricted sample, application of the instrumental variable affects the coefficient estimate on internal IT capital in similar ways. 15

stock. Correcting for measurement error lowers the spillover estimate to less than half its earlier value, and it is no longer significantly different than zero. In Columns (1)-(5) of Table 2, we compute panel estimates on the full sample using several alternative approaches. Many spillover studies take a random-effects or a fixed-effects approach with longitudinal data to account for unobservable individual effects. We explore both fixed effects and random effects specifications. Columns 1 and 2 shows output elasticities for a fixed-effects specification before and after including a spillover term which was constructed from the average of IT capital stock from firms within the same industry as defined by two-digit Standard Industry Classification (SIC) codes. In contrast to our earlier models, the spillover term estimates are negative, consistent with a product market rivalry explanation (Jaffe, 1986; Bloom, Schankermann, and Van Reenen, 2011). Column 2 shows the measurement error correction results after using the IDG data as an instrument. Consistent with the removal of bias, this correction increases the estimate of the contribution of private IT capital stocks, and further reduces the estimate on the externalities term. Columns 4 and 5 show similar results with random-effects specifications, with the decline in the spillover term being of the same order as in the fixed effects models. Overall, the observations on the other coefficients in these models are largely consistent with theoretical values (in the non-instrumented regressions) and exhibit comparable behavior to that reported in previous work. One somewhat surprising result is that the direct coefficient on information technology capital rises significantly in the instrumental variables regression to a number that is well above its factor share. At least part of this increase is due to the correction for downward bias in IT capital contribution due to measurement error. However, the unusually high coefficient estimate may also be due to accentuating the endogeneity of computer investment that is, at least some of the correlation between our two measures of computer capital are due to short-run shocks driving up firm-level investment that may also be correlated with output. Whereas this highlights the need for better instruments for IT, it is notable that the decline in the spillover term occurs despite these biases. Furthermore, if we set IT capital to its theoretical value, we see a similar reduction in the spillover coefficient, which suggests that changes 16

in the direct IT coefficient by itself in the IV estimates are not responsible for the reduction in the spillover term estimates. The changes in the coefficient estimates when capital and labor set at factor share imply an error variance for IT capital measurement that is slightly less than 35% of the total variance in IT capital. The implied error variance for the spillover measurement is about one-third of this number. Thus, the relative sizes of the measurement errors required to produce the biases that we observe are in line with earlier studies that have found potential for significant error in IT capital measurement (Brynjolfsson and Hitt, 2003). Moreover, these calculations coincide with the intuition that when the spillover term is highly correlated with own IT investment, it is essentially representing a lower variance estimate of direct IT returns and thus picking up some of the direct IT effect. Overall, these results in both pooled OLS and panel specifications demonstrate that: (1) measurement error in firm-level inputs can create spurious increases in the estimated effects of externalities; (2) that these problems also occur for panel methods; and (3) that instrumental variables can be used to fix the measurement error and remove the bias. These biases can potentially account for a substantial amount of variation in IT spillover estimates. In Table 3, we show that under reasonable assumptions, using estimated values from the empirical analysis above, the ratio of the elasticities of the spillover pool to the elasticities of a firm s own IT investment can exhibit wide variation when we allow measurement error in the firm s own investments to bias the spillover term. To create the values in Table 3, we use the relationships derived in (7) to compute how measurement error in the IT input, along with changes to the covariance between the spillover pool and own IT investment, may affect the ratio of their estimated output elasticities. For reasonable values of the true ratio, the swing in values is large enough to accommodate a large amount of variation, and demonstrates that spillover estimates are sensitive to measurement error in the input data. 4.3. Measurement Error and R&D Spillovers To illustrate the importance of these issues for measuring IT spillovers, we show that these distortions are smaller for the R&D spillover case, which is the empirical context from which this 17

framework was derived. Therefore, the application of this existing framework to IT data presents estimation problems that are unique to IT productivity measurement. To demonstrate how measurement error in R&D affects R&D spillover estimates, we use patent counts as an instrumental variable to correct for measurement error in R&D capital, in which the R&D spillover variable is constructed as the average R&D investment of other firms in the same two-digit SIC classification. Later in this section, we take into account the technological proximity of firms according to patenting domains rather than the SIC code. Our R&D data comes from Compustat, and consists of firm observations from 1987-1999. Patent data comes from the NBER-Case Western patent database. 5 The full sample of firms for which both R&D and patent data are available consists of 1,562 firm-year observations. The connection between patents and R&D expenditures has been the subject of a large literature that has demonstrated that patent counts are highly correlated with R&D expenditures in a number of industries (Pakes and Griliches, 1980). Furthermore, for our purposes, patents are an effective instrument because although they are a noisy measure of knowledge production, there is little reason to expect correlation among measurement errors in patents and R&D investments. It is worth reemphasizing that the criterion for an appropriate instrument in this context is that the measurement error in the instrumental variable is uncorrelated with the measurement error in the underlying variable the measures themselves can and will be highly correlated as they are measures of the same underlying construct. Pakes and Griliches also, however, show that although patents co-vary closely with R&D expenditures between firms, they do a poor job of reflecting differences in R&D expenditures in the within-firm dimension, partially because of the lags between R&D and patenting activity and because of uncertainty surrounding depreciation rates of knowledge capital. Therefore, we limit our analysis in this section to pooled, crosssectional regressions, and we use two alternate specifications: 1) using (the logarithm of) patents in each year as our instrument, and 2) the logarithm of the sum of the patent counts over all years for each firm. 5 Hall, Jaffe, and Trajtenberg (2001) 18

We also limit the analysis in this section to manufacturing firms in order to focus on a sector in which patents are a robust measure of R&D activity. Table 4 shows the results of four regressions in which we estimate the productivity of R&D spillovers and the effects of using an instrumental variable to remove measurement errors. Columns 1 and 2 show results before including spillover measures with Column 2 showing the effects of applying the logged patent counts instrument. As expected, applying the instrument leads to higher estimates on private R&D stocks and lower estimates on the other factor inputs. Columns 3 and 4 show the estimates of the contributions of each input after the spillover term is included, with Column 4 showing the effects of applying the instrument. In Column 3, the coefficient estimate on R&D spillovers is of a similar size to estimates from existing R&D spillovers studies. Column 4 shows the estimates refined by using logarithms of patent counts as an instrument to remove the effects of measurement error in R&D capital. The correction lowers the estimate of R&D spillover returns by a smaller amount relative to the IT examples above, indicating that the large shifts in coefficients observed earlier are unique to IT spillover studies. 4.4. How Data on Transmission Paths Affects the Impact of Measurement Error In the preceding section, we used instrumental variables to demonstrate that measurement error induced biases can significantly affect the size of IT spillover estimates. In this section, we focus on how construction of the spillover pool measure affects the severity of the bias term. As mentioned above, larger biases result from spillover measures that 1) co-vary with a firm s own investment levels and 2) have smaller variances. Therefore, even in the absence of instruments, more accurate spillover estimates are obtained from models that incorporate fine-grained data on spillover pathways that decrease the covariance between the measures of own IT investment and IT spillover pools and that result in larger variance for the spillover pool. On the other hand, the bias will be relatively more severe when the relevant mechanisms increase the covariance between a firm s own IT investment and the IT spillover pool or reduce the variance of the spillover pool. The measurement benefits of using fine-grained data to 19

model transmission paths should be greater for IT spillovers because as a general-purpose technology, IT spillovers are less likely than R&D spillovers to be constrained by industry boundaries. We use a variety of data sources to show that when using data on spillover pathways decreases the covariance between private investments and the spillover pool, measurement error has a much smaller effect on spillover estimates. The first data set we use comes from the same sources described above, Computer Intelligence Infocorp (CII) data describing information technology investments at the intra-firm establishment level, and other financial measures at the firm level that come from Compustat databases. We exploit elements of the data at the establishment level to model proximity. Specifically, we take advantage of the fact that the establishment-level data assigns SIC codes to different establishments within each firm. Firms in the sample have an average of almost 69 establishments per firm. In addition, and most important for this study, each firm, through its establishments, occupies a variety of industry positions. On average, firms in this sample occupy more than four different two-digit SIC categories. Thus, firms that look similar at the aggregate firm level, in which they are generally assigned to a single SIC category, are quite different if compared at the establishment level. If technological proximity plays an important role in knowledge spillovers, then each firm may be best described as a collection of establishments, which in turn, operate in respective different technological areas and with respective access to different spillover pools. The firm's access to IT spillovers, then, is best modeled by accounting for diversity in its constituent establishments. Whereas firm-level analyses generally model the knowledge capital external to a firm as the industry average weighted sum of the investments of other firms, we use these additional measures of proximity in order to examine the role of covariance in bias. Our first comparison measure uses establishment-level data to model spillover pools available to a firm through its constituent establishments. Thus, the spillover pool available to a firm is the weighted sum of the spillover pools available to each of its establishments, and lowercase and uppercase indices represent establishments and firms, respectively. 20

s I =! i" I ( c & ' c i I % # s $ i The weights are determined by the ratio of IT capital at the establishment level to total IT capital at the firm level. Thus, conditional on the size of the spillover pool, corporate sites that are larger and more invested in information technologies will transfer more know-how from these spillover pools into the larger organization. The spillover pool of each establishment i is computed as the average of the information technology investments of all other establishments that occupy the same SIC industry I(i), not including other establishments that are in the same Firm F(i). s = 1 j" I ( i)! i c j N j# F ( i) Therefore, the spillover pool of each establishment within a firm is driven by the SIC code in which the establishment operates, independent of the technological position of the parent firm. Our spillover measure, therefore, differs from traditional firm-level measures because use of establishment-level data provides a richer description of spillover channels, and because the variation produced by these richer models allows some separation between spillover pools and a firm's technological position. The second modeling structure we test in this section uses data on IT labor flows, which has been advanced as one specific transmission path for IT spillovers (Dedrick et al, 2003). The IT labor flow data we use are based on a partnership with a leading online jobs board and are described in detail in other published work (citations blinded for review, available upon request). Our approach is to compute the IT pool as the IT intensity of all other firms from which a firm hires at least 5% of its new IT workers, which implies that a firm receives larger productivity spillovers from organizations from which they hire many employees. For the purposes of this paper, the most notable fact about these IT labor flow data is that for the sample under investigation, IT workers appear commonly move across industries so the spillover pool accessible through labor flows is not highly correlated with a firms own IT investments, suggesting that the impact of measurement error on the spillover coefficient estimate will be minimal. 21

Table 5 shows how the covariance between IT capital stock measures and the IT spillover pool measure impacts the bias term in the different models. The use of establishment level data lowers the multiplier somewhat, but the use of IT labor flows that tend to cut across industry dramatically lowers the multiplier on the measurement error term, suggesting that the use of establishment data will slightly lower the size of the bias term transmitted to the spillover coefficient, and that the use of IT labor flow data will lower it even further, potentially reducing the bias term by an order of magnitude. Table 6 shows the results of regressions comparing the performance of the different spillover measures both with and without instrumental variable correction of measurement errors. All regressions in both tables are pooled in levels, with controls for industry and year. The uncorrected elasticities of IT capital are similar across all three models. None of the coefficient estimates on the spillover terms are significantly different than zero in our sample. As expected, however, correcting measurement error in (2) and (4) substantially decreases the magnitude of the spillover estimate. The corrected estimates in (2) and (4) indicate that that the use of establishment data somewhat decreases the size of the bias term transmitted to the spillover coefficient, although the size of the bias term when using establishment data is still substantial. The corrected estimate in (6), however, is similar to the uncorrected estimate, indicating that comparatively little of the bias term from IT measurement error is transmitted to the spillover coefficient due to the low covariance between the two measures indicated in Table 5. The estimates from Table 6 indicate that the size of the bias term transmitted to the spillover coefficient can vary considerably depending on the transmission path being tested, and in some contexts, may not significantly alter the spillover coefficient at all. Relative to IT spillovers, R&D spillovers are more likely to flow among firms occupying a similar technological position therefore, the use of detailed transmission paths generally will not be as effective in reducing the covariance between own investment and the spillover pool. Our approach, drawing on one widely replicated model, uses the patenting behavior of firms to model spillovers between them (Griliches, 1979; Jaffe, 1986). We show that correcting the measurement error does not remove 22