FEAR: A Software Package for Frontier Efficiency Analysis with R
|
|
|
- Joan Davidson
- 10 years ago
- Views:
Transcription
1 FEAR: A Software Package for Frontier Efficiency Analysis with R Paul W. Wilson December 2006 Abstract This paper describes a software package for computing nonparametric efficiency estimates, making inference, and testing hypotheses in frontier models. Commands are provided for bootstrapping as well as computation of some new, robust estimators of efficiency, etc. JEL Classification nos. C14, C15, C44 Keywords: nonparametric, DEA, efficiency, software, benchmarking The John E. Walker Department of Economics, 222 Sirrine Hall, Clemson University, Clemson, South Carolina USA; [email protected].
2 1 Introduction An extensive literature on measurement of efficiency in production has evolved from the pioneering work of Debreu (1951) and Farrell (1957). A large part of this literature uses linear-programming based measures of efficiency along the lines of Charnes et al. (1978) and Färe et al. (1985); these methods have been termed Data Envelopment Analysis (DEA). Gattoufi et al. (2004) cite more than 1,800 articles on DEA published in more than 490 refereed journals. DEA and similar non-parametric estimators offer numerous advantages, the most obvious being that one need not specify a (potentially erroneous) functional relationship between production inputs and outputs. Until very recently the nonparametric efficiency literature has ignored statistical issues such as inference, hypothesis testing, etc. However, statistical inference and hypothesis testing are now possible with DEA and other nonparametric efficiency estimators due to results surveyed by Simar and Wilson (2000b, 2007b). Standard software packages (e.g., LIMDEP, STATA, TSP) used by econometricians do not include procedures for DEA or other nonparametric efficiency estimators. Several specialized, commercial software packages are available, as well as a small number of noncommercial, free-ware programs; these have been reviewed by Hollingsworth (1997, 1999, 2004) and Barr (2004). Academic versions of the commercial packages range in price from about US$280 to US$1500 at current exchange rates. To varying degrees, the existing packages perform well the tasks they were designed for. Each includes facilities for reading data into the program, in some cases in a variety of formats, and procedures for estimating models that the authors have programmed into their software. A common complaint heard among practitioners, however, runs along the lines of package XYZ will not let me estimate the model I want! The existing packages are designed for ease of use (again, with varying degrees of success), but at a cost inflexibility, limiting the user to models and procedures the authors have explicitly made available. Moreover, none of the existing packages include procedures for statistical inference. Although the asymptotic distribution of DEA estimators is now known (see Kneip et al., 2003, for details) for the general case with p inputs and q outputs, bootstrap methods remain the only useful approach for inference. Yet, none of the existing packages include procedures for bootstrapping in 1
3 frontier models. Similarly, none of the existing packages provide procedures for the newlydeveloped robust alternatives to DEA surveyed in Simar and Wilson (2007b). FEAR consists of a software library that can be linked to the general-purpose statistical package R. The routines included in FEAR allow the user to compute DEA estimates of technical, allocative, and overall efficiency while assuming either variable, non-increasing, or constant returns to scale. The routines are highly flexible, allowing measurement of efficiency of one group of observations relative to a technology defined by a second, reference group of observations. Consequently, the routines can be used to compute estimates of Malmquist indices and their components in any of the decompositions that have been proposed, scale efficiency measures, super-efficiency scores along the lines of Andersen and Petersen (1993), and other measures that might be of interest. Commands are also included to facilitate implementation of the bootstrap methods described by Simar and Wilson (1998, 2000a); in particular, FEAR s boot.sw98 command can be used to impliment the homogenous bootstrap algorithm described by Simar and Wilson; some programming using commands in FEAR and R are is required when using the current version of FEAR to implement the heterogeneous bootstrap algorithm described by Simar and Wilson. Commands in FEAR can also be combined to implement statistical tests for irrelevant inputs and outputs or aggregation possibilities as described in Simar and Wilson (2001b), statistical tests of constant returns to scale versus non-increasing or varying returns to scale as described in Simar and Wilson (2001a), or the iterated bootstrap procedure described in Simar and Wilson (2004). 1 A routine for maximum likelihood estimation of a truncated regression model is included for regressing DEA efficiency estimates on environmental variables as described in Simar and Wilson (2007a). In addition, FEAR includes commands to estimate Malmquist indices are various decompostions of Malmquist indices, with an implementation of the bootstrap algorithm for described by Simar and Wilson (1999); to compute free-disposal hull (FDH) efficiency estimates (Deprins et al., 1984); 1 The statistical tests described by Simar and Wilson (2001b) and Simar and Wilson (2001a) do not depend on dubious assumptions of distributions of efficiency scores. Banker (1996) and Banker and Natarajan (2004) proposed statistics for testing various hypotheses in DEA applications, with critical values chosen either from F-distributions, but the results of Kneip et al. (2003) make it clear that the statistics proposed by Banker cannot have F distributions. Banker (1996) and Banker and Natarajan (2004) also proposed Kolmogorov Smirnov tests based on DEA efficiency estimates, but without taking into account the dependence among these estimates discussed by Simar and Wilson (2007a). 2
4 to perform outlier analysis using the methods of Wilson (1993); to compute the new, robust, root-n consistent order-m efficiency estimators described by Cazals et al. (2002); and to compute DEA estimates of cost, revenue, and profit efficiency. A number of these features are unavailable in existing software packages. 2 Where to Get FEAR and R R is a language and environment for statistical computing graphics. It is an implementation of the S language developed at Bell Laboratories, but unlike the commercial version of S marketed as S-Plus by Lucent Technologies, R is freely available under the Free Software Foundation s GNU General Public License. According to the R project s web pages ( R provides a wide variety of statistical... and graphical techniques, and is highly extensible... One of R s strengths is the ease with which well-designed publication-quality plots can be produced, including mathematical symbols and formulae where needed... R is an integrated suite of software facilities for data manipulation, calculation and graphical display. It includes (i) an effective data handling and storage facility; (ii) a suite of operators for calculations on arrays, in particular matrices; (iii) a large, coherent, integrated collection of intermediate tools for data analysis; (iv) graphical facilities for data analysis and display either on-screen or on hardcopy; and (v) a well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities. R includes an online help facility and extensive documentation in the form of manuals included with the package. In addition, several books describing uses of R are available (eg., Dalgaard, 2002; Venables and Ripley, 2002; and Verzani, 2004); see also the recent review by Racine and Hyndman (2002). The current version of R can be downloaded from Pre-compiled binary versions are available for a variety of platforms; at present, however, FEAR is being made available only for the Microsoft Windows (NT, 95 and later) operating systems running on Intel (and clone) machines. After downloading and installing R, FEAR can be downloaded from 3
5 A Command Reference and User s Guide can also be downloaded from this site; the User s Guide gives instructions for installing FEAR into R, as well as a number of examples illustrating some of the capabilities of FEAR. The web site also contains licensing information; while use of FEAR is free for academic purposes, potential government, commercial, or other users outside an educational institution should contact the author before using the FEAR software. The current version of FEAR (at the time of this writing) is 1.0; as features are added and modifications are made, new versions will be released (with higher version numbers). Every attempt will be made to maintain backward-compatibility with earlier versions of FEAR, and versions prior to the current release will remain available for some time. 3 Using FEAR and R Once R and FEAR have been downloaded and installed, R s graphical user interface (GUI) can be started by clicking on the desktop R icon; commands are typed at a prompt in the RConsole window contained within the RGui window. After installing FEAR, R s online help facility can be used to find documentation on the commands implemented in FEAR, as shown in Figure 1. Clicking on a command name displays a page giving details on use of the command, including arguments that must be passed, optional arguments and any defaults, etc., as well as a detailed description of what is returned by the command. FEAR includes data that can be used to illustrate the library s capabilities. First, the command library(fear) must be typed in the RConsole window to load the FEAR library, as shown in Figure 2. One might then type help(ccr) to learn that the Charnes et al. (1981) data contain 5 inputs used to produce 3 outputs among 70 schools. The following commands load these data and then organize the input vectors in a (5 70) matrix and the output vectors in a (3 70) matrix: data(ccr) x=t(matrix(c(ccr$x1,ccr$x2,ccr$x3,ccr$x4,ccr$x5), nrow=70,ncol=5)) y=t(matrix(c(ccr$y1,ccr$y2,ccr$y3),nrow=70,ncol=5)) The following commands reproduce the outlier analysis in Wilson (1993): tmp=ap(x,y,ndel=12) 4
6 windows() ap.plot(ratio=tmp$ratio) The resulting plot is shown in Figure 4. Next, Shephard (1970) output-distance function estimates can be computed: d.out=dea(x,y,orientation=2) R s summary command summary(d.out) gives summary statistics on the estimates. Alternatively, the summary command can be combined with the ifelse statement to obtain summary statistics on the distance function estimates that are different from unity: summary(ifelse(d.out==1,na,d.out)) as shown in Figure 3. FEAR s show.dens command can be used to produce plots of kernel density estimates for the distance function estimates contained in d.in and d.out; the necessary bandwidth parameters can be obtained with FEAR s eff.bw command: h=eff.bw(d.out) show.dens(dist=d.out,bw=h, XLAB="output distance function estimates") The resulting plots are shown in Figure 5; here, they have been drawn on the user s screen, but alternatively, R s postscript command can be used to write postscript code for the plots to a file for inclusion in a manuscript, etc. The show.dens command uses a reflection method to avoid inconsistency problems for the kernel density estimator and the lower (upper) boundary 1 for the input (output) efficiency estimates. The command boot.sw98 implements the bootstrap algorithm described by Simar and Wilson (1998); this command can be used to estimate 95-percent confidence intervals for the input distance functions corresponding to each observation in the Charnes et al. (1981) data by typing result=boot.sw98(xobs=x,yobs=y,dhat=d.out,orientation=2) 5
7 The following command will produce L A TEX code for a table displaying results from the last command; the first column contains the observation number, the second column contains the input distance function estimates, while the third and fourth columns contain the lower and upper bounds of the estimated confidence intervals: tmp=paste(c(1:70),"&",d.in,"&", result$conf.int[,1],"&", result$conf.int[,2],"\\",sep=" ") 4 Concluding Remarks FEAR is a very flexible, extensible package unlike any currently available for estimation of productivity and efficiency. The cost of this flexibility is that the user must type commands at a command-line prompt; for some, this may be unsatisfactory. For others, however, FEAR will be a useful tool, allowing one to estimate quantities that have not been explicitly programmed into other packages, and that have perhaps not been anticipated by this or other authors. In addition to the commands illustrated above, the current version FEAR includes commands to compute DEA estimates of cost, revenue, and profit efficiency, Malmquist indices and various decompositions, and order-m efficiency estimates. 6
8 References Andersen, P., and Petersen, N.C. (1993), A procedure for ranking efficient units in data envelopment analysis, Management Science, 39, Banker, R.D. (1996), Hypothesis tests using data envelopment analysis, Journal of Productivity Analysis, 7, Banker, R.D., and Natarajan, R. (2004), Statistical tests based on dea efficiency scores, In Handbook on Data Envelopment Analysis, ed. W.W. Cooper, L.M. Seiford, and J. Zhu, New York: Kluwer Academic Publishers, Inc. chapter 11, pp Barr, R.S. (2004), DEA software tools and technology, In Handbook on Data Envelopment Analysis, ed. W.W. Cooper, L.M. Seiford, and J. Zhu, Boston: Kluwer Academic Publishers pp Cazals, C., Florens, J.P., and Simar, L. (2002), Nonparametric frontier estimation: A robust approach, Journal of Econometrics, 106, Charnes, A., Cooper, W.W., and Rhodes, E. (1978), Measuring the inefficiency of decision making units, European Journal of Operational Research, 2, (1981), Evaluating program and managerial efficiency: An application of data envelopment analysis to program follow through, Management Science, 27, Dalgaard, P. (2002), Introductory Statistics with R, New York: Springer-Verlag, Inc. Debreu, G. (1951), The coefficient of resource utilization, Econometrica, 19, Deprins, D., Simar, L., and Tulkens, H. (1984), Measuring labor inefficiency in post offices, In The Performance of Public Enterprises: Concepts and Measurements, ed. M. Marchand P. Pestieau and H. Tulkens, Amsterdam: North-Holland pp Färe, R., Grosskopf, S., and Lovell, C.A.K. (1985), The Measurement of Efficiency of Production, Boston: Kluwer-Nijhoff Publishing. Farrell, M.J. (1957), The measurement of productive efficiency, Journal of the Royal Statistical Society A, 120, Gattoufi, S., Oral, M., and Reisman, A. (2004), Data envelopment analysis literature: A bibliography update ( ), Socio-Economic Planning Sciences, 38, Hollingsworth, B. (1997), A review of data envelopment analysis software, Economic Journal, 107, (1999), Data envelopment analysis and productivity analysis: A review of the options, Economic Journal, 109, F458 F462. (2004), Non parametric efficiency measurement, Economic Journal, 114, F307 F311. Kneip, A., Simar, L., and Wilson, P.W. (2003), Asymptotics for DEA estimators in nonparametric frontier models, Discussion paper #0317, Institut de Statistique, Université Catholique de Louvain, Louvain-la-Neuve, Belgium. 7
9 Racine, J., and Hyndman, R. (2002), Using R to teach econometrics, Journal of Applied Econometrics, 17, Shephard, R.W. (1970), Theory of Cost and Production Functions, Princeton: Princeton University Press. Simar, L., and Wilson, P.W. (1998), Sensitivity analysis of efficiency scores: How to bootstrap in nonparametric frontier models, Management Science, 44, (1999), Estimating and bootstrapping Malmquist indices, European Journal of Operational Research, 115, (2000a), A general methodology for bootstrapping in non-parametric frontier models, Journal of Applied Statistics, 27, (2000b), Statistical inference in nonparametric frontier models: The state of the art, Journal of Productivity Analysis, 13, (2001a), Nonparametric tests of returns to scale, European Journal of Operational Research, 139, (2001b), Testing restrictions in nonparametric efficiency models, Communications in Statistics, 30, (2004), Performance of the bootstrap for DEA estimators and iterating the principle, In Handbook on Data Envelopment Analysis, ed. W.W. Cooper, L.M. Seiford, and J. Zhu, New York: Kluwer Academic Publishers, Inc. chapter 10, pp (2007a), Estimation and inference in two-stage, semi-parametric models of productive efficiency, Journal of Econometrics, 136, (2007b), Statistical inference in nonparametric frontier models: Recent developments and perspectives, In The Measurement of Productive Efficiency, ed. H. Fried, C.A.K. Lovell, and S. Schmidt, 2nd ed., Oxford: Oxford University Press chapter 4. Forthcoming. Venables, W.N., and Ripley, B.D. (2002), Modern Applied Statistics with S, New York: Springer-Verlag, Inc. Verzani, J. (2004), Using R for Introductory Statistics, London: Chapman and Hall. Wilson, P.W. (1993), Detecting outliers in deterministic nonparametric frontier models with multiple outputs, Journal of Business and Economic Statistics, 11,
10 Figure 1: List of commands in the FEAR package 9
11 Figure 2: The R graphical user interface 10
12 Figure 3: Output from R s summary command 11
13 Figure 4: Plots produced by the ap.plot command 12
14 Figure 5: Kernel density plots produced by the show.dens command 13
FEAR: A Software Package for Frontier Efficiency Analysis with R
FEAR: A Software Package for Frontier Efficiency Analysis with R Paul W. Wilson The John E. Walker Department of Economics, 222 Sirrine Hall, Clemson University, Clemson, South Carolina 29634 USA Abstract
Comparing the Technical Efficiency of Hospitals in Italy and Germany: Non-parametric Conditional Approach
Comparing the Technical Efficiency of Hospitals in Italy and Germany: Non-parametric Conditional Approach TRACKING REGIONAL VARIATION IN HEALTH CARE Berlin, 4th June 2015 Yauheniya Varabyova, University
AN ANALYSIS OF EFFICIENCY PATTERNS FOR A SAMPLE OF NORWEGIAN BUS COMPANIES
AN ANALYSIS OF EFFICIENCY PATTERNS FOR A SAMPLE OF NORWEGIAN BUS COMPANIES Torben Holvad Transport Studies Unit, University of Oxford INTRODUCTION In recent years significant progress has been made concerning
TESTING RESTRICTIONS IN NONPARAMETRIC EFFICIENCY MODELS
COMMUN. STATIST. SIMULA., 30(1), 159 184 (2001) TESTING RESTRICTIONS IN NONPARAMETRIC EFFICIENCY MODELS Léopold Simar 1, and Paul W. Wilson 2, 1 Institut de Statistique, Université Catholique de Louvain,
Gautam Appa and H. Paul Williams A formula for the solution of DEA models
Gautam Appa and H. Paul Williams A formula for the solution of DEA models Working paper Original citation: Appa, Gautam and Williams, H. Paul (2002) A formula for the solution of DEA models. Operational
Efficiency in Software Development Projects
Efficiency in Software Development Projects Aneesh Chinubhai Dharmsinh Desai University [email protected] Abstract A number of different factors are thought to influence the efficiency of the software
Measuring the Relative Efficiency of European MBA Programs:A Comparative analysis of DEA, SBM, and FDH Model
Measuring the Relative Efficiency of European MBA Programs:A Comparative analysis of DEA, SBM, and FDH Model Wei-Kang Wang a1, Hao-Chen Huang b2 a College of Management, Yuan-Ze University, [email protected]
A PRIMAL-DUAL APPROACH TO NONPARAMETRIC PRODUCTIVITY ANALYSIS: THE CASE OF U.S. AGRICULTURE. Jean-Paul Chavas and Thomas L. Cox *
Copyright 1994 by Jean-Paul Chavas and homas L. Cox. All rights reserved. Readers may make verbatim copies of this document for noncommercial purposes by any means, provided that this copyright notice
Regression Modeling Strategies
Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions
Software Review: ITSM 2000 Professional Version 6.0.
Lee, J. & Strazicich, M.C. (2002). Software Review: ITSM 2000 Professional Version 6.0. International Journal of Forecasting, 18(3): 455-459 (June 2002). Published by Elsevier (ISSN: 0169-2070). http://0-
DEA implementation and clustering analysis using the K-Means algorithm
Data Mining VI 321 DEA implementation and clustering analysis using the K-Means algorithm C. A. A. Lemos, M. P. E. Lins & N. F. F. Ebecken COPPE/Universidade Federal do Rio de Janeiro, Brazil Abstract
Centre for Efficiency and Productivity Analysis
Centre for Efficiency and Productivity Analysis Working Paper Series No. WP08/2010 Efficiency and Technological Change in Health Care Services in Ontario H. Chowdhury, V. Zelenyuk, W. Wodchis, & A. Laporte
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
The efficiency of fleets in Serbian distribution centres
The efficiency of fleets in Serbian distribution centres Milan Andrejic, Milorad Kilibarda 2 Faculty of Transport and Traffic Engineering, Logistics Department, University of Belgrade, Belgrade, Serbia
Forecasting Bank Failure: A Non-Parametric Frontier Estimation Approach
Forecasting Bank Failure: A Non-Parametric Frontier Estimation Approach Richard S. Barr Dept. Computer Science and Engineering Southern Methodist University Dallas, TX 75275 USA Lawrence M. Seiford Industrial
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.
Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are
Measuring Information Technology s Indirect Impact on Firm Performance
Information Technology and Management 5, 9 22, 2004 2004 Kluwer Academic Publishers. Manufactured in The Netherlands. Measuring Information Technology s Indirect Impact on Firm Performance YAO CHEN [email protected]
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Structural Health Monitoring Tools (SHMTools)
Structural Health Monitoring Tools (SHMTools) Getting Started LANL/UCSD Engineering Institute LA-CC-14-046 c Copyright 2014, Los Alamos National Security, LLC All rights reserved. May 30, 2014 Contents
Data Envelopment Analysis
Data Envelopment Analysis Theory and Techniques for Economics and Operations Research SUBHASH C. RAY University of Connecticut PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building,
NEW ONLINE COMPUTATIONAL FINANCE CERTIFICATE
NEW ONLINE COMPUTATIONAL FINANCE CERTIFICATE Fall, Winter, and Spring Quarters 2010-11 This new online certificate program is offered by the Department of Applied Mathematics in partnership with the departments
COMPUTATIONS IN DEA. Abstract
ISSN 0101-7438 COMPUTATIONS IN DEA José H. Dulá School of Business Administration The University of Mississippi University MS 38677 E-mail: [email protected] Received November 2001; accepted October 2002
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
NON-PARAMETRIC APPROACHES TO EDUCATION AND HEALTH EFFICIENCY IN OECD COUNTRIES ANTÓNIO AFONSO * MIGUEL ST. AUBYN
Journal of Applied Economics. Vol VIII, No. 2 (Nov 2005), 227-246 NON-PARAMETRIC APPROACHES TO EDUCATION AND HEALTH EFFICIENCY 227 NON-PARAMETRIC APPROACHES TO EDUCATION AND HEALTH EFFICIENCY IN OECD COUNTRIES
http://dx.doi.org/10.1090/psapm/023 PROCEEDINGS OF SYMPOSIA IN APPLIED MATHEMATICS Volume XXIII MODERN STATISTICS: METHODS AND APPLICATIONS
http://dx.doi.org/10.1090/psapm/023 PROCEEDINGS OF SYMPOSIA IN APPLIED MATHEMATICS Volume XXIII MODERN STATISTICS: METHODS AND APPLICATIONS AMERICAN MATHEMATICAL SOCIETY PROVIDENCE, RHODE ISLAND 1980 LECTURE
Introduction to R Statistical Software
Introduction to R Statistical Software Anthony (Tony) R. Olsen USEPA ORD NHEERL Western Ecology Division Corvallis, OR 97333 (541) 754-4790 [email protected] What is R? A language and environment for
EMS: Efficiency Measurement System User s Manual
EMS: Efficiency Measurement System User s Manual Holger Scheel Version 1.3 2000-08-15 Contents 1 Introduction 2 2 Preparing the input output data 2 2.1 Using MS Excel files..............................
DEA IN MUTUAL FUND EVALUATION
DEA IN MUTUAL FUND EVALUATION E-mail: [email protected] Dipartimento di Matematica Applicata Università Ca Foscari di Venezia ABSTRACT - In this contribution we illustrate the recent use of Data Envelopment
A Guide to DEAP Version 2.1: A Data Envelopment Analysis (Computer) Program
A Guide to DEAP Version 2.1: A Data Envelopment Analysis (Computer) Program by Tim Coelli Centre for Efficiency and Productivity Analysis Department of Econometrics University of New England Armidale,
Direct Democracy and Local Government Efficiency
Discussion Paper No. 14-017 Direct Democracy and Local Government Efficiency Zareh Asatryan and Kristof De Witte Discussion Paper No. 14-017 Direct Democracy and Local Government Efficiency Zareh Asatryan
WESTMORELAND COUNTY PUBLIC SCHOOLS 2011 2012 Integrated Instructional Pacing Guide and Checklist Computer Math
Textbook Correlation WESTMORELAND COUNTY PUBLIC SCHOOLS 2011 2012 Integrated Instructional Pacing Guide and Checklist Computer Math Following Directions Unit FIRST QUARTER AND SECOND QUARTER Logic Unit
Programming Languages & Tools
4 Programming Languages & Tools Almost any programming language one is familiar with can be used for computational work (despite the fact that some people believe strongly that their own favorite programming
Why High-Order Polynomials Should Not be Used in Regression Discontinuity Designs
Why High-Order Polynomials Should Not be Used in Regression Discontinuity Designs Andrew Gelman Guido Imbens 2 Aug 2014 Abstract It is common in regression discontinuity analysis to control for high order
Package Benchmarking
Type Package Package Benchmarking July 8, 2015 Title Benchmark and Frontier Analysis Using DEA and SFA Version 0.26 Date 2015-7-8 ($Date: 2015-07-08 12:32:10 +0200 (on, 08 jul 2015) $) Author Peter Bogetoft
Axiomatic design of software systems
Axiomatic design of software systems N.P. Suh (1), S.H. Do Abstract Software is playing an increasingly important role in manufacturing. Many manufacturing firms have problems with software development.
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
Technical Efficiency Accounting for Environmental Influence in the Japanese Gas Market
Technical Efficiency Accounting for Environmental Influence in the Japanese Gas Market Sumiko Asai Otsuma Women s University 2-7-1, Karakida, Tama City, Tokyo, 26-854, Japan [email protected] Abstract:
MATHEMATICAL METHODS OF STATISTICS
MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS
for an appointment, e-mail [email protected]
M.Sc. in Economics Department of Economics, University College London Econometric Theory and Methods (G023) 1 Autumn term 2007/2008: weeks 2-8 Jérôme Adda for an appointment, e-mail [email protected] Introduction
ANALYTIC HIERARCHY PROCESS AS A RANKING TOOL FOR DECISION MAKING UNITS
ISAHP Article: Jablonsy/Analytic Hierarchy as a Raning Tool for Decision Maing Units. 204, Washington D.C., U.S.A. ANALYTIC HIERARCHY PROCESS AS A RANKING TOOL FOR DECISION MAKING UNITS Josef Jablonsy
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
Clustering-Based Method for Data Envelopment Analysis. Hassan Najadat, Kendall E. Nygard, Doug Schesvold North Dakota State University Fargo, ND 58105
Clustering-Based Method for Data Envelopment Analysis Hassan Najadat, Kendall E. Nygard, Doug Schesvold North Dakota State University Fargo, ND 58105 Abstract. Data Envelopment Analysis (DEA) is a powerful
Confidence Intervals for Cpk
Chapter 297 Confidence Intervals for Cpk Introduction This routine calculates the sample size needed to obtain a specified width of a Cpk confidence interval at a stated confidence level. Cpk is a process
Comparing Improved Versions of K-Means and Subtractive Clustering in a Tracking Application
Comparing Improved Versions of K-Means and Subtractive Clustering in a Tracking Application Marta Marrón Romera, Miguel Angel Sotelo Vázquez, and Juan Carlos García García Electronics Department, University
DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7
DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 UNDER THE GUIDANCE Dr. N.P. DHAVALE, DGM, INFINET Department SUBMITTED TO INSTITUTE FOR DEVELOPMENT AND RESEARCH IN BANKING TECHNOLOGY
9.2 User s Guide SAS/STAT. Introduction. (Book Excerpt) SAS Documentation
SAS/STAT Introduction (Book Excerpt) 9.2 User s Guide SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation for the complete manual
Software Solutions - 375 - Appendix B. B.1 The R Software
Appendix B Software Solutions This appendix provides a brief discussion of the software solutions available to researchers for computing inter-rater reliability coefficients. The list of software packages
Applications of R Software in Bayesian Data Analysis
Article International Journal of Information Science and System, 2012, 1(1): 7-23 International Journal of Information Science and System Journal homepage: www.modernscientificpress.com/journals/ijinfosci.aspx
MEASURING PERFORMANCE AND EFFICIENCY GROWTH OF THE SELECTED INDIAN LIFE INSURANCE COMPANIES: A TOTAL FACTOR PRODUCTIVITY APPROACH
MEASURING PERFORMANCE AND EFFICIENCY GROWTH OF THE SELECTED INDIAN LIFE INSURANCE COMPANIES: A TOTAL FACTOR PRODUCTIVITY APPROACH ABSTRACT JOY CHAKRABORTY*; DR. PARTHA PRATIM SENGUPTA** *Assistant Professor
7 Time series analysis
7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
Statistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
Assessing Container Terminal Safety and Security Using Data Envelopment Analysis
Assessing Container Terminal Safety and Security Using Data Envelopment Analysis ELISABETH GUNDERSEN, EVANGELOS I. KAISAR, PANAGIOTIS D. SCARLATOS Department of Civil Engineering Florida Atlantic University
Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models.
Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models. Dr. Jon Starkweather, Research and Statistical Support consultant This month
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
From the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
CowLog: Open-source software for coding behaviors from digital video
Behavior Research Methods 2009, 41 (2), 472-476 doi:10.3758/brm.41.2.472 CowLog: Open-source software for coding behaviors from digital video Laura Hänninen and Matti Pastell University of Helsinki, Helsinki,
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Curriculum Vitae. Han Hong
Curriculum Vitae Han Hong Address Landau Economics Building Fax: (650) 725-5702 579 Serra Mall Web: www.stanford.edu/ doubleh Stanford, CA 94305-6072 Academic positions Stanford University, Department
How To Understand The Theory Of Probability
Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL
Two classes of ternary codes and their weight distributions
Two classes of ternary codes and their weight distributions Cunsheng Ding, Torleiv Kløve, and Francesco Sica Abstract In this paper we describe two classes of ternary codes, determine their minimum weight
Using Mixtures-of-Distributions models to inform farm size selection decisions in representative farm modelling. Philip Kostov and Seamus McErlean
Using Mixtures-of-Distributions models to inform farm size selection decisions in representative farm modelling. by Philip Kostov and Seamus McErlean Working Paper, Agricultural and Food Economics, Queen
Stress Test Modeling in Cards Portfolios
Stress Test Modeling in Cards Portfolios José J. Canals-Cerdá The views expressed here are my own and not necessarily those of the Federal Reserve or its staff. Credit Card Performance: Charge-off Rate
Abstract. Keywords: Data Envelopment Analysis (DEA), decision making unit (DMU), efficiency, Korea Securities Dealers Automated Quotation (KOSDAQ)
, pp. 205-218 http://dx.doi.org/10.14257/ijseia.2015.9.5.20 The Efficiency Comparative Evaluation of IT Service Companies using the Data Envelopment Analysis Approach Focus on KOSDAQ(KOrea Securities Dealers
From the help desk: Swamy s random-coefficients model
The Stata Journal (2003) 3, Number 3, pp. 302 308 From the help desk: Swamy s random-coefficients model Brian P. Poi Stata Corporation Abstract. This article discusses the Swamy (1970) random-coefficients
Imputing Values to Missing Data
Imputing Values to Missing Data In federated data, between 30%-70% of the data points will have at least one missing attribute - data wastage if we ignore all records with a missing value Remaining data
EMMANUEL THANASSOULIS CURRICULUM VITAE EXTRACT - DATED 04/03/2013
EMMANUEL THANASSOULIS CURRICULUM VITAE EXTRACT - DATED 04/03/2013 PERSONAL DETAILS Address: Office: Operations and Information Management, Aston Business School, University of Aston, Birmingham B4 7ET.
A Stochastic Frontier Model on Investigating Efficiency of Life Insurance Companies in India
A Stochastic Frontier Model on Investigating Efficiency of Life Insurance Companies in India R. Chandrasekaran 1a, R. Madhanagopal 2b and K. Karthick 3c 1 Associate Professor and Head (Retired), Department
UNIVERSITY OF OSLO HEALTH ECONOMICS RESEARCH PROGRAMME
UNIVERSITY OF OSLO HEALTH ECONOMICS RESEARCH PROGRAMME Errors in Survey Based Quality Evaluation Variables in Efficiency Models of Primary Care Physicians Sverre A.C. Kittelsen Guri Galtung Kjæserud The
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
AN EVALUATION OF FACTORY PERFORMANCE UTILIZED KPI/KAI WITH DATA ENVELOPMENT ANALYSIS
Journal of the Operations Research Society of Japan 2009, Vol. 52, No. 2, 204-220 AN EVALUATION OF FACTORY PERFORMANCE UTILIZED KPI/KAI WITH DATA ENVELOPMENT ANALYSIS Koichi Murata Hiroshi Katayama Waseda
How To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
Confidence Intervals for Spearman s Rank Correlation
Chapter 808 Confidence Intervals for Spearman s Rank Correlation Introduction This routine calculates the sample size needed to obtain a specified width of Spearman s rank correlation coefficient confidence
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
5 Correlation and Data Exploration
5 Correlation and Data Exploration Correlation In Unit 3, we did some correlation analyses of data from studies related to the acquisition order and acquisition difficulty of English morphemes by both
Development of benchmarking models for distribution system operators in Belgium
P R O J E CT N EREUS Development of benchmarking models for distribution system operators in Belgium FINAL REPORT 2011-11-30 Disclaimer This is an open report on the development of efficiency benchmarking
about is Ox, developed by J.A. Doornik at Nuffield College, Oxford, United Kingdom.
Economy Informatics, no. 1/2001 35 A software package for time series analysis Adrian HOSPODAR Training Division, RomTelecom, [email protected] Abstract: This paper introduces a software package dedicated
