A HYBRID GENETIC ALGORITHM FOR THE MAXIMUM LIKELIHOOD ESTIMATION OF MODELS WITH MULTIPLE EQUILIBRIA: A FIRST REPORT
|
|
|
- Oscar Sherman
- 10 years ago
- Views:
Transcription
1 New Mathematics and Natural Computation Vol. 1, No. 2 (2005) c World Scientific Publishing Company A HYBRID GENETIC ALGORITHM FOR THE MAXIMUM LIKELIHOOD ESTIMATION OF MODELS WITH MULTIPLE EQUILIBRIA: A FIRST REPORT VICTOR AGUIRREGABIRIA Department of Economics, Boston University, 270 Bay State Road Boston, MA 02215, USA [email protected] PEDRO MIRA CEMFI, Casado del Alisal, 5 Madrid, Madrid 28004, Spain [email protected] This paper presents a hybrid genetic algorithm to obtain maximum likelihood estimates of parameters in structural econometric models with multiple equilibria. The algorithm combines a pseudo maximum likelihood (PML) procedure with a genetic algorithm (GA). The GA searches globally over the large space of possible combinations of multiple equilibria in the data. The PML procedure avoids the computation of all the equilibria associated with every trial value of the structural parameters. Keywords: Genetic algorithms; maximum likelihood estimation; multiple equilibria. 1. Introduction This paper deals with the computation of maximum likelihood estimates of structural parameters in econometric models with multiple equilibria. Multiplicity of equilibria appears frequently in structural econometric models like empirical discrete games, 1 coordination games 2 or models with social interactions, 3 among others. Models with multiple equilibria do not provide unique predictions on the probability distribution of the endogenous variables conditional on the exogenous variables. This indeterminacy poses practical problems in maximum likelihood estimation. To obtain maximum likelihood estimates of the structural parameters one should maximize the likelihood function not only with respect to the structural parameters but also with respect to the multiple equilibrium types that can generate the observations in the data. There are two main reasons why optimization with respect to equilibrium types can be a very complicated task. First, computing all the equilibria associated with each trial value of the parameters can be computationally very demanding. And second, the number of possible combinations of equilibria in the 295
2 296 V. Aguirregabiria & P. Mira data increases exponentially with the sample size and it becomes a huge number even for the simplest problems. These problems have motivated the use of estimation methods which are computationally simpler than maximum likelihood. For instance, in the context of empirical discrete games, Tamer, 4 Pesendorfer and Smichdt-Dengler, 5 Aguirregabiria, 6 and Aguirregabiria and Mira 7 have proposed and applied pseudo maximum likelihood (PML) estimators. These PML methods proceed in two steps. The first step identifies nonparametrically the equilibrium (or equilibria) played in the data. The second step estimates the structural parameters of the model by maximizing a pseudo likelihood function based on best response functions evaluated at the equilibrium estimated in the first step. The main advantage of PML estimation is its computational simplicity. However, the method is not statistically efficient, and it can perform poorly in small samples because the nonparametric estimates in the first step can be very imprecise. To deal with these limitations, Aguirregabiria and Mira 7 propose a recursive extension of the pseudo likelihood method, the so-called nested pseudo likelihood (NPL) algorithm. The NPL provides estimates which are statistically more efficient than the two-step PML, both asymptotically and in finite samples. Furthermore, the NPL algorithm can be used to search for the maximum likelihood estimator. In particular, the maximum likelihood estimator is a fixed point of the NPL procedure. However, the NPL algorithm can have other fixed points which are not the maximum likelihood estimator. Therefore, to compute the maximum likelihood estimator, we should search globally over the set of NPL fixed points. In this paper, we propose a relatively simple but computationally effective algorithm to search for the maximum likelihood estimator. The algorithm combines the NPL procedure with a Genetic Algorithm (GA). The GA searches efficiently over the huge space of possible combinations of multiple equilibria in the data. The NPL procedure avoids the repeated computation of equilibria for each trial value of the structural parameters. A Genetic Algorithm (GA) is a class of stochastic optimization method that exploits the concepts of natural selection and evolution to solve global optimization problems. GAs were first proposed by Holland. 8 Among their many applications, GA have been successfully used to search for global optima of discrete and step functions with very large search spaces. Mitchell 9 provides an excellent survey of the theory and applications of these algorithms. See also Mitchell, Holland and Forrest. 10 Although GAs have been extensively used in experimental and evolutionary economics, the application of GAs in econometrics has been more rare. Important exceptions are Dorsey and Mayer 11 and Beenstock and Szpirob. 12 The rest of this paper is organized as follows. Section 2 presents the class of structural models that this paper is concerned with and provides a particular example of this class of models. Section 3 describes the estimation of structural parameters by maximum likelihood and by pseudo maximum likelihood. Section 4 describes our hybrid genetic algorithm. We summarize and conclude in Sec. 5.
3 A Hybrid GA for the Maximum Likelihood Estimation of Models with Multiple Equilibria Model 2.1. General framework Let y Y and x X be two vectors of random variables where Y is a discrete and finite set. And let p 0 (y x) be the probability distribution of y conditional to x in the population under study. Given that y is a discrete variable with a finite support, we can also represent the conditional probability function p 0 ( x) using the vector P 0 (x) {p 0 (y x) :y Y }. A model can be described as a family of probability distribution functions p(y x, θ), where θ Θ is a vector of K parameters, and Θ R K is a compact set. We can also represent a probability function p( x, θ) using the vector P (x, θ) {p(y x, θ) :y Y }. An important feature of the class of models that we consider in this paper is that the model does not provide a closed form analytical expression for the probability functions p( x, θ). Instead, these functions are only implicitly defined as fixed points of a mapping. More specifically, the vector P (x, θ) is a fixed point of a mapping Ψ(x, θ, P )={ψ(y x, θ, P ):y Y }, whereψ( x, θ, P ) is a probability function for y conditional on x. Therefore, P (x, θ) =Ψ(x, θ, P (x, θ)) or, what is equivalent, for any y Y : p(y x, θ) =ψ(y x, θ, P (x, θ)). (1) The function ψ is twice continuously differentiable in P and θ,and for any(y, x, θ, P) the probability ψ(y x, θ, P ) is strictly greater than zero. For some values of (x, θ) the mapping Ψ(x, θ, ) can have more than one fixed point. That is, the model can have multiple equilibria for some values of the structural parameters. Let {P τ (x, θ) :τ =1, 2,...} be the set of equilibria associated with (x, θ). The equilibria are indexed by the variable τ {1, 2,...} that is called the equilibrium type Example: Discrete game with incomplete information There are N players which are indexed by i I = {1, 2,...,N}. Each player should choose an action from a set of choice alternatives Y = {0, 1,...,J}. Werepresent the decision of player i by the variable y i Y. The utility function of player i is: U i = u i (y i,y i,x)+ε i (y i ) (2) where y i is the vector with the decisions of players other than i; x is a vector of players exogenous characteristics which are common knowledge; and ε i (ε i (0),ε i (1),...,ε i (J)) represents characteristics that are private information of player i. Assumption 1. For any i I the vector ε i R J+1 is: (1) independent of common knowledge variables x; and (2) independently distributed across players with distribution function G i ( ) that is absolutely continuous with respect to the Lebesgue measure.
4 298 V. Aguirregabiria & P. Mira Let σ = {σ i (x, ε i ) : i I} be a set of strategy functions where σ i : X R J+1 Y. Associated with a set of strategy functions we can define a set of choice probabilities P σ (x) ={Pi σ(y i x) :(y i,i) Y I} such that: Pi σ (y i x) I {σ i (x, ε i )=y i } dg i (ε i ) (3) where I{ } is the indicator function. These probabilities represent the expected behavior of player i from the point of view of the other players (who do not know ε i ) when he follows his strategy in σ. Letu σ i (y i,x)beplayeri s expected utility if he chooses alternative y i and the other players behave according to their respective strategies in σ. By the independence of private information in Assumption 1, u σ i (y i,x)= Pj σ (y j x) u(y i,y i,x). (4) y i j i A Bayesian Nash equilibrium (BNE) is a vector of strategy functions σ such that for any player i and for any (x, ε i ) X R J+1, σi { (x, ε i ) = arg max u σ i (y i,x)+ε i (y i ) }. (5) y i Y We can represent this BNE in probability space. Let σ be a set of BNE strategies, and let P be the choice probabilities associated with these strategies. By definition, Pi (y i x) = I{y i = σi (x, ε i)} dg i (ε i ). Solving the equilibrium condition (5) in this expression we get that for any (y i,i) Y I: ( Pi (y i x) = I y i =argmax y Y {u i (y, x)+ε i(y)} ) dg i (ε i ). (6) Notice that the function u σ i depends on players strategies only through the choice probabilities P σ associated with σ. To emphasize this point, we modify the notation and use the symbol u P i, instead of uσ i, to denote these expected utility functions. Therefore, the right-hand side in Eq. (6) is a function that we define as ψ i (y i x, P ). We call the functions ψ i best response probability functions. The vector of equilibrium probabilities P (x) {Pi (y i x) :(y i,i) Y I} is a fixed point of the best response mapping Ψ(x, P ) {ψ i (y i x, P ):(y i,i) Y I}. Given Assumption 1, best response probability functions are continuous in the compact set of players choice probabilities. By Brower s theorem, there exists at least one equilibrium. In general, the equilibrium is not unique. The primitives of the model {u i,g i : i I} can be described in terms of a vector of parameters θ Θ R K. Primitives are continuously differentiable in θ. WeuseΨ(x, θ, P ) to denote the equilibrium mapping associated with (x, θ). And P (x, θ) ={P i (y x, θ) :(y, i) Y I} represents an equilibrium associated with (x, θ). For the sake of exposition we will concentrate on this empirical discrete game for the rest of the paper. However, all our results can be applied to the general model in Sec. 2.1.
5 A Hybrid GA for the Maximum Likelihood Estimation of Models with Multiple Equilibria Estimation 3.1. Data generating process Suppose that the game has been played at different moments in time or at different locations or markets. We have a random sample of T realizations of the game where we observe players actions and common knowledge state variables {y t,x t : t =1, 2,...,T} with y t =(y 1t,y 2t,...,y Nt ). Let θ 0 Θ be the true value of θ in the population under study. We are interested in the estimation of θ 0. Let τ t be the equilibrium type of observation t. These equilibrium types are unobservable to the researcher. And let Pt 0 (x t ) be the distribution of y t conditional on x t in the population that generates observation t. Sincey t comes from an equilibrium of the game, we have that Pt 0 (x t )=P τt (x t,θ 0 ). The following assumption establishes some conditions on the data generating process that guarantee the identification of θ 0. Assumption 2. (A) For every observation t the equilibrium type τ t is determined by a function τ 0 ( ) of the common knowledge state variables, i.e. τ t = τ 0 (x t ). And (B) there is a unique pair (θ 0,τ 0 ) such that P 0 (x) =P τ 0 (x) (x, θ 0 ) for every x X. Under Assumption 2(A), two sample points with the same values of the exogenous variables x should have the same equilibrium probabilities. The function τ 0 is called the equilibrium selection mechanism. Under this assumption, we can identify nonparametrically the equilibrium probabilities in the population under study. Assumption 2(B) establishes the joint identification of the structural parameters and the equilibrium selection mechanism Maximum likelihood estimation The maximum likelihood estimator (MLE) of (θ 0,τ 0 ) maximizes the likelihood with respect to θ and with respect to the equilibrium types in the sample. Thus, the MLE is: { } T N ˆθ =argmax log P τt i (y it x t ; θ) (7) θ Θ sup τ 1,...,τ T t=1 i=1 subject to the restriction that τ t = τ t when x t = x t. This estimator is consistent and asymptotically efficient. However, its implementation can be computationally very costly. The problem is in the maximization with respect to the equilibrium types. First, we need to know all the equilibrium types that the model has for every trial value of θ and for every value x t in the sample. This is impractical in many applications. And second, the number of possible values that the vector (τ 1,...,τ T ) can take is huge even for very simple models. For instance, if the number of equilibrium types is 3 and the number of observations is 200, we have possible values for the vector (τ 1,...,τ T ). It is clear that optimization with respect to the equilibrium types can be extremely costly. This problem has
6 300 V. Aguirregabiria & P. Mira motivated the development of alternative methods, like pseudo maximum likelihood (PML) estimation, that avoid the search for the MLE of the equilibrium selection mechanism τ Pseudo maximum likelihood estimation Define the pseudo likelihood function Q (θ, P 1,...,P T ) T t=1 i=1 N log Ψ i (y it x t,θ,p t ). (8) This function is defined for arbitrary choice probabilities, not necessarily in equilibrium. The two-step PML estimator is the value of θ that maximizes the pseudo likelihood function Q(θ, ˆP 1,..., ˆP T ), where { ˆP 1,..., ˆP T } are nonparametric estimates of players choice probabilities conditional on x. Notice that the nonparametric estimates of choice probabilities can be interpreted as estimates of the equilibrium selection mechanism τ 0. Therefore, this estimator avoids the search for the MLE of τ 0 by estimating it nonparametrically in a first step. The nested pseudo likelihood (NPL) estimator is a recursive extension of the two-step PML. Given the initial nonparametric estimates of choice probabilities, the NPL generates a sequence of estimators {ˆθ K : K 1} where the K-step estimator is defined as: ˆθ K =argmax Q( θ, ˆP 1 K,..., ˆP ) T K (9) θ Θ and the probabilities { ˆP 1 K,..., ˆP T K : K 2} are obtained recursively as: K+1 ˆP t = Ψ(x t, ˆθ K, ˆP t K ). The algorithm iterates until convergence. For any given sample, Brower s fixed-point theorem guarantees the existence of at least one NPL fixedpoint. Aguirregabiria and Mira 7 show that NPL is consistent and more efficient than the two-step PML. Monte Carlo experiments show that NPL has much better finite sample properties than the two-step estimator. Furthermore, the NPL algorithm can be used to search for the MLE. The reason is that the MLE is a fixed point of the NPL algorithm (though not every fixed point of the NPL procedure is the MLE). To see this, notice that the MLE in (7) can be also described as: sup Q (θ, P 1,...,P T ) P 1,...,P T ˆθ =argmax θ Θ subject to: P t =Ψ(x t,θ,p t ) for every observation t. (10) P t = P t for x t = x t Therefore, if the NPL has a unique fixed point, this should be the MLE. Aguirregabiria and Mira 13 show that for a class of discrete choice dynamic programming models the NPL has a unique fixed point. Unfortunately, that is not the case in other models like empirical discrete games. For these models, the NPL can still be used to search for the MLE, but we need a method to search for the MLE within the
7 A Hybrid GA for the Maximum Likelihood Estimation of Models with Multiple Equilibria 301 set of NPL fixed points. It is in this context where the use of a Genetic Algorithm is particularly useful. 4. Hybrid Genetic Algorithm Consider the following procedure to search for the MLE. We obtain M fixed points of the NPL procedure by applying this method to M different initial values of the choice probabilities. Given these M fixed-points of the NPL, the estimator of θ 0 will be the fixed point with the highest value of the pseudo likelihood Q( ). A limitation of this approach is that for some applications we will need a very large number of fixed points, M, to guarantee that this estimator is the MLE. To deal with this problem we combine the NPL procedure with a GA. At every step of the NPL we perform three operations on the M vectors of probabilities: crossover, mutation and selection. These operations, which are characteristic of GAs, permit a more global search over the space of (θ, P 1,...,P T ). Here we describe this algorithm. (0) Initial population. The initial population of probability vectors is Π 1 = { ˆP mt 1 : t = 1,...,T; m = 1,...,M}. This initial population may be arbitrarily chosen, or it may come from nonparametric estimates. For instance, the M probabilities can be obtained as M bootstrap nonparametric estimates of players choice probabilities. Our hybrid algorithm generates a sequence of probability vectors that we denote by {Π K : K 1}, wherek is the index for the step or iteration of the algorithm. Associated with this sequence of probabilities the GA also generates a sequence of vectors of parameter estimates {ˆθ m K : m =1,...,M; K 1}. An iteration of the algorithm consists in the creation of a new generation with the offsprings of the existing generation. An iteration can be described in terms of four processes or steps that are followed sequentially: (1) mating or selection of parents; (2) crossover and mutation; (3) NPL iteration; (4) selection of offsprings. (1) Selection of parents. We draw, with replacement, O>Mpairs of probability vectors from the population Π K. The probability that a vector is chosen depends on its relative fitness. Fitness is a term from evolution theory. In our problem, the fitness of a probability vector is the Lagrangian function: l( ˆP 1,..., ˆP T )=Q (ˆθ, ˆP1,..., ˆP ) T T λ ˆP t Ψ(x t, ˆθ, ˆP t ) (11) where ˆθ is the PML estimate associated with ( ˆP 1,..., ˆP T ), and λ is a small and positive constant. Given the measures of fitness of the M elements of Π K,the probability that the mth element is selected is: Sm K = exp{ σ 1 l ( K ˆP m1,..., ˆP )} mt K M j=1 exp{ σ 1 l ( ˆP K j1,..., ˆP )} (12) jt K where σ>0 is a parameter that measures the strength of the dependence of selection on fitness. If σ = 0, only the fittest individual is selected in the O random draws. If σ =, every individual has the same probability of being selected. t=1
8 302 V. Aguirregabiria & P. Mira (2) Crossover and mutation. Each couple generates one offspring. An offspring inherits chromosomes from its parents, but there maybe mutation as well. We represent this with two sets of binary variables: {z t,d t : t =1, 2,...,T}. z t is the indicator of a mutation for chromosome t and it is i.i.d. over t with Pr(z t =1)=γ, i.e. the mutation probability. d t is the indicator for the identity of the parent who transmits the tth chromosome and it is i.i.d. over t with Pr(d t =1)=1/2. Let { ˆP m1,..., ˆP mt } and { ˆP m 1,..., ˆP m T } be a couple. Then, the offspring from this couple is { ˆP 1,..., ˆP T } where for any t: ˆP t = d t{ ˆPmt + z t δ( ˆP mt U t ) } + ( ){ 1 d t ˆPm t + z t δ( ˆP m t U t ) } (13) where U t is a vector of N independent random draws from a U(0, 1); and δ is a parameter that represents the magnitude of the mutation. (3) NPL iteration. For each offspring we obtain its associated PML estimator of θ 0, i.e. the value of θ that maximizes the pseudo likelihood Q ( θ, ˆP 1,..., ˆP T ). Then, for each offspring { ˆP 1,..., ˆP T } and its PML estimator ˆθ, we obtain a new offspring { ˆP 1,..., } such that ˆP t =Ψ(x t, ˆθ, ˆP t). ˆP T (4) Selection of offsprings. We calculate the fitness of each new O offsprings and select the M ones with highest fitness. This is the new population Π K+1. The algorithm iterates until convergence of the sequence of populations {Π K }. 5. Conclusions This paper proposes a new algorithm to obtain maximum likelihood estimates of structural parameters in econometric models with multiple equilibria. The algorithm combines the nested pseudo likelihood method proposed by Aguirregabiria and Mira 7,13 with a genetic algorithm. This method can be applied to the estimation of a broad class of econometric models with multiple equilibria. For instance, in the context of empirical Industrial Organization, the algorithm can be applied to obtain maximum likelihood estimates of parameters in empirical discrete games, e.g. models of market entry, spatial competition, adoption of new technologies, or investment in R&D, among many others. Acknowledgments This material is based upon the work supported by the National Science Foundation under Grant No References 1. T. Bresnahan and P. Reiss, Empirical models of discrete games, J. Econometrics 48 (1991) A. Moro, The effect of statistical discrimination on black-white wage inequality: Estimating a model with multiple equilibria, Int. Economic Review 44 (2003)
9 A Hybrid GA for the Maximum Likelihood Estimation of Models with Multiple Equilibria W. Brock and S. Durlauf, Discrete choice with social interactions, Review of Economics Studies 68 (2001) E. Tamer, Incomplete simultaneous discrete response model with multiple equilibria, Review of Economic Studies 70 (2003) M. Pesendorfer and P. Schmidt-Dengler, Identification and estimation of dynamic games, NBER Working Paper No (2003). 6. V. Aguirregabiria, Pseudo maximum likelihood estimation of structural models involving fixed-point problems, Economics Letters 84(3) (2004) V. Aguirregabiria and P. Mira, Sequential estimation of dynamic discrete games, CEMFI Working Paper No (2004). 8. J. H. Holland, Adaptation in Natural and Artificial Systems (University of Michigan Press, Ann Arbor, 1975). 9. M. Mitchell, An Introduction to Genetic Algorithms (MIT Press, Cambridge, 1996). 10. M. Mitchell, J. H. Holland and S. Forrest, When will a genetic algorithm outperform hill climbing?, in Advances in Neural Information Processing Systems 6, eds. J. D. Dowan, G. Tesauro and J. Alspector (Morgan Kauffman, 1994). 11. R. E. Dorsey and W. J. Mayer, Genetic algorithms for estimation problems with multiple optima, nondifferentiability, and other irregular features, J. Business and Economic Statistics 13 (1995) M. Beenstock and G. Szpirob, Specification search in nonlinear time-series models using the genetic algorithm, J. Economic Dynamics and Control 26 (2002) V. Aguirregabiria and P. Mira, Swapping the nested fixed point algorithm: A class of estimators for discrete Markov decision models, Econometrica 70 (2002)
Lecture notes: single-agent dynamics 1
Lecture notes: single-agent dynamics 1 Single-agent dynamic optimization models In these lecture notes we consider specification and estimation of dynamic optimization models. Focus on single-agent models.
The Dynamics of a Genetic Algorithm on a Model Hard Optimization Problem
The Dynamics of a Genetic Algorithm on a Model Hard Optimization Problem Alex Rogers Adam Prügel-Bennett Image, Speech, and Intelligent Systems Research Group, Department of Electronics and Computer Science,
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
Statistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
Maximum Likelihood Estimation
Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for
A Non-Linear Schema Theorem for Genetic Algorithms
A Non-Linear Schema Theorem for Genetic Algorithms William A Greene Computer Science Department University of New Orleans New Orleans, LA 70148 bill@csunoedu 504-280-6755 Abstract We generalize Holland
A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails
12th International Congress on Insurance: Mathematics and Economics July 16-18, 2008 A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails XUEMIAO HAO (Based on a joint
Monte Carlo Simulation
1 Monte Carlo Simulation Stefan Weber Leibniz Universität Hannover email: [email protected] web: www.stochastik.uni-hannover.de/ sweber Monte Carlo Simulation 2 Quantifying and Hedging
CHAPTER 2 Estimating Probabilities
CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2016. Tom M. Mitchell. All rights reserved. *DRAFT OF January 24, 2016* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is a
ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE
ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE YUAN TIAN This synopsis is designed merely for keep a record of the materials covered in lectures. Please refer to your own lecture notes for all proofs.
Genetic algorithms for changing environments
Genetic algorithms for changing environments John J. Grefenstette Navy Center for Applied Research in Artificial Intelligence, Naval Research Laboratory, Washington, DC 375, USA [email protected] Abstract
Using Markov Decision Processes to Solve a Portfolio Allocation Problem
Using Markov Decision Processes to Solve a Portfolio Allocation Problem Daniel Bookstaber April 26, 2005 Contents 1 Introduction 3 2 Defining the Model 4 2.1 The Stochastic Model for a Single Asset.........................
Bayesian logistic betting strategy against probability forecasting. Akimichi Takemura, Univ. Tokyo. November 12, 2012
Bayesian logistic betting strategy against probability forecasting Akimichi Takemura, Univ. Tokyo (joint with Masayuki Kumon, Jing Li and Kei Takeuchi) November 12, 2012 arxiv:1204.3496. To appear in Stochastic
Adaptive Online Gradient Descent
Adaptive Online Gradient Descent Peter L Bartlett Division of Computer Science Department of Statistics UC Berkeley Berkeley, CA 94709 bartlett@csberkeleyedu Elad Hazan IBM Almaden Research Center 650
Notes from Week 1: Algorithms for sequential prediction
CS 683 Learning, Games, and Electronic Markets Spring 2007 Notes from Week 1: Algorithms for sequential prediction Instructor: Robert Kleinberg 22-26 Jan 2007 1 Introduction In this course we will be looking
Message-passing sequential detection of multiple change points in networks
Message-passing sequential detection of multiple change points in networks Long Nguyen, Arash Amini Ram Rajagopal University of Michigan Stanford University ISIT, Boston, July 2012 Nguyen/Amini/Rajagopal
Markov Chain Monte Carlo Simulation Made Simple
Markov Chain Monte Carlo Simulation Made Simple Alastair Smith Department of Politics New York University April2,2003 1 Markov Chain Monte Carlo (MCMC) simualtion is a powerful technique to perform numerical
Analysis of a Production/Inventory System with Multiple Retailers
Analysis of a Production/Inventory System with Multiple Retailers Ann M. Noblesse 1, Robert N. Boute 1,2, Marc R. Lambrecht 1, Benny Van Houdt 3 1 Research Center for Operations Management, University
Lecture notes: single-agent dynamics 1
Lecture notes: single-agent dynamics 1 Single-agent dynamic optimization models In these lecture notes we consider specification and estimation of dynamic optimization models. Focus on single-agent models.
International Journal of Software and Web Sciences (IJSWS) www.iasir.net
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) ISSN (Print): 2279-0063 ISSN (Online): 2279-0071 International
ALMOST COMMON PRIORS 1. INTRODUCTION
ALMOST COMMON PRIORS ZIV HELLMAN ABSTRACT. What happens when priors are not common? We introduce a measure for how far a type space is from having a common prior, which we term prior distance. If a type
Genetic Algorithm Evolution of Cellular Automata Rules for Complex Binary Sequence Prediction
Brill Academic Publishers P.O. Box 9000, 2300 PA Leiden, The Netherlands Lecture Series on Computer and Computational Sciences Volume 1, 2005, pp. 1-6 Genetic Algorithm Evolution of Cellular Automata Rules
LECTURE 15: AMERICAN OPTIONS
LECTURE 15: AMERICAN OPTIONS 1. Introduction All of the options that we have considered thus far have been of the European variety: exercise is permitted only at the termination of the contract. These
Reject Inference in Credit Scoring. Jie-Men Mok
Reject Inference in Credit Scoring Jie-Men Mok BMI paper January 2009 ii Preface In the Master programme of Business Mathematics and Informatics (BMI), it is required to perform research on a business
Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh
Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh Peter Richtárik Week 3 Randomized Coordinate Descent With Arbitrary Sampling January 27, 2016 1 / 30 The Problem
Estimation of the COCOMO Model Parameters Using Genetic Algorithms for NASA Software Projects
Journal of Computer Science 2 (2): 118-123, 2006 ISSN 1549-3636 2006 Science Publications Estimation of the COCOMO Model Parameters Using Genetic Algorithms for NASA Software Projects Alaa F. Sheta Computers
Linear Classification. Volker Tresp Summer 2015
Linear Classification Volker Tresp Summer 2015 1 Classification Classification is the central task of pattern recognition Sensors supply information about an object: to which class do the object belong
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data
On Marginal Effects in Semiparametric Censored Regression Models
On Marginal Effects in Semiparametric Censored Regression Models Bo E. Honoré September 3, 2008 Introduction It is often argued that estimation of semiparametric censored regression models such as the
6.207/14.15: Networks Lecture 15: Repeated Games and Cooperation
6.207/14.15: Networks Lecture 15: Repeated Games and Cooperation Daron Acemoglu and Asu Ozdaglar MIT November 2, 2009 1 Introduction Outline The problem of cooperation Finitely-repeated prisoner s dilemma
Basics of Statistical Machine Learning
CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu [email protected] Modern machine learning is rooted in statistics. You will find many familiar
HETEROGENEOUS AGENTS AND AGGREGATE UNCERTAINTY. Daniel Harenberg [email protected]. University of Mannheim. Econ 714, 28.11.
COMPUTING EQUILIBRIUM WITH HETEROGENEOUS AGENTS AND AGGREGATE UNCERTAINTY (BASED ON KRUEGER AND KUBLER, 2004) Daniel Harenberg [email protected] University of Mannheim Econ 714, 28.11.06 Daniel Harenberg
A Robust Method for Solving Transcendental Equations
www.ijcsi.org 413 A Robust Method for Solving Transcendental Equations Md. Golam Moazzam, Amita Chakraborty and Md. Al-Amin Bhuiyan Department of Computer Science and Engineering, Jahangirnagar University,
Numerical methods for American options
Lecture 9 Numerical methods for American options Lecture Notes by Andrzej Palczewski Computational Finance p. 1 American options The holder of an American option has the right to exercise it at any moment
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)
Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091) Magnus Wiktorsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February
Nonlinear Optimization: Algorithms 3: Interior-point methods
Nonlinear Optimization: Algorithms 3: Interior-point methods INSEAD, Spring 2006 Jean-Philippe Vert Ecole des Mines de Paris [email protected] Nonlinear optimization c 2006 Jean-Philippe Vert,
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Week 1: Introduction to Online Learning
Week 1: Introduction to Online Learning 1 Introduction This is written based on Prediction, Learning, and Games (ISBN: 2184189 / -21-8418-9 Cesa-Bianchi, Nicolo; Lugosi, Gabor 1.1 A Gentle Start Consider
How To Understand The Theory Of Probability
Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL
Copyright. Network and Protocol Simulation. What is simulation? What is simulation? What is simulation? What is simulation?
Copyright Network and Protocol Simulation Michela Meo Maurizio M. Munafò [email protected] [email protected] Quest opera è protetta dalla licenza Creative Commons NoDerivs-NonCommercial. Per
A Simple Model of Price Dispersion *
Federal Reserve Bank of Dallas Globalization and Monetary Policy Institute Working Paper No. 112 http://www.dallasfed.org/assets/documents/institute/wpapers/2012/0112.pdf A Simple Model of Price Dispersion
Memory Allocation Technique for Segregated Free List Based on Genetic Algorithm
Journal of Al-Nahrain University Vol.15 (2), June, 2012, pp.161-168 Science Memory Allocation Technique for Segregated Free List Based on Genetic Algorithm Manal F. Younis Computer Department, College
Hybrid Evolution of Heterogeneous Neural Networks
Hybrid Evolution of Heterogeneous Neural Networks 01001110 01100101 01110101 01110010 01101111 01101110 01101111 01110110 01100001 00100000 01110011 01101011 01110101 01110000 01101001 01101110 01100001
Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski [email protected]
Introduction to Machine Learning and Data Mining Prof. Dr. Igor Trakovski [email protected] Neural Networks 2 Neural Networks Analogy to biological neural systems, the most robust learning systems
Fuzzy Probability Distributions in Bayesian Analysis
Fuzzy Probability Distributions in Bayesian Analysis Reinhard Viertl and Owat Sunanta Department of Statistics and Probability Theory Vienna University of Technology, Vienna, Austria Corresponding author:
Alpha Cut based Novel Selection for Genetic Algorithm
Alpha Cut based Novel for Genetic Algorithm Rakesh Kumar Professor Girdhar Gopal Research Scholar Rajesh Kumar Assistant Professor ABSTRACT Genetic algorithm (GA) has several genetic operators that can
CPC/CPA Hybrid Bidding in a Second Price Auction
CPC/CPA Hybrid Bidding in a Second Price Auction Benjamin Edelman Hoan Soo Lee Working Paper 09-074 Copyright 2008 by Benjamin Edelman and Hoan Soo Lee Working papers are in draft form. This working paper
Removing the Genetics from the Standard Genetic Algorithm
Removing the Genetics from the Standard Genetic Algorithm Shumeet Baluja & Rich Caruana May 22, 1995 CMU-CS-95-141 School of Computer Science Carnegie Mellon University Pittsburgh, Pennsylvania 15213 This
Statistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
Time Series Analysis
Time Series Analysis [email protected] Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby 1 Outline of the lecture Identification of univariate time series models, cont.:
INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS
INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS STEVEN P. LALLEY AND ANDREW NOBEL Abstract. It is shown that there are no consistent decision rules for the hypothesis testing problem
Corrected Diffusion Approximations for the Maximum of Heavy-Tailed Random Walk
Corrected Diffusion Approximations for the Maximum of Heavy-Tailed Random Walk Jose Blanchet and Peter Glynn December, 2003. Let (X n : n 1) be a sequence of independent and identically distributed random
Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010
Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte
Genetic algorithms for solving portfolio allocation models based on relative-entropy, mean and variance
Journal of Scientific Research and Development 2 (12): 7-12, 2015 Available online at www.jsrad.org ISSN 1115-7569 2015 JSRAD Genetic algorithms for solving portfolio allocation models based on relative-entropy,
Parametric fractional imputation for missing data analysis
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 Biometrika (????,??,?, pp. 1 14 C???? Biometrika Trust Printed in
The equivalence of logistic regression and maximum entropy models
The equivalence of logistic regression and maximum entropy models John Mount September 23, 20 Abstract As our colleague so aptly demonstrated ( http://www.win-vector.com/blog/20/09/the-simplerderivation-of-logistic-regression/
Real Business Cycle Theory. Marco Di Pietro Advanced () Monetary Economics and Policy 1 / 35
Real Business Cycle Theory Marco Di Pietro Advanced () Monetary Economics and Policy 1 / 35 Introduction to DSGE models Dynamic Stochastic General Equilibrium (DSGE) models have become the main tool for
Macroeconomic Effects of Financial Shocks Online Appendix
Macroeconomic Effects of Financial Shocks Online Appendix By Urban Jermann and Vincenzo Quadrini Data sources Financial data is from the Flow of Funds Accounts of the Federal Reserve Board. We report the
A Brief Study of the Nurse Scheduling Problem (NSP)
A Brief Study of the Nurse Scheduling Problem (NSP) Lizzy Augustine, Morgan Faer, Andreas Kavountzis, Reema Patel Submitted Tuesday December 15, 2009 0. Introduction and Background Our interest in the
Sharing Online Advertising Revenue with Consumers
Sharing Online Advertising Revenue with Consumers Yiling Chen 2,, Arpita Ghosh 1, Preston McAfee 1, and David Pennock 1 1 Yahoo! Research. Email: arpita, mcafee, [email protected] 2 Harvard University.
LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as
LOGISTIC REGRESSION Nitin R Patel Logistic regression extends the ideas of multiple linear regression to the situation where the dependent variable, y, is binary (for convenience we often code these values
Evolutionary Detection of Rules for Text Categorization. Application to Spam Filtering
Advances in Intelligent Systems and Technologies Proceedings ECIT2004 - Third European Conference on Intelligent Systems and Technologies Iasi, Romania, July 21-23, 2004 Evolutionary Detection of Rules
The Multiplicative Weights Update method
Chapter 2 The Multiplicative Weights Update method The Multiplicative Weights method is a simple idea which has been repeatedly discovered in fields as diverse as Machine Learning, Optimization, and Game
The Applications of Genetic Algorithms in Stock Market Data Mining Optimisation
The Applications of Genetic Algorithms in Stock Market Data Mining Optimisation Li Lin, Longbing Cao, Jiaqi Wang, Chengqi Zhang Faculty of Information Technology, University of Technology, Sydney, NSW
Estimating the random coefficients logit model of demand using aggregate data
Estimating the random coefficients logit model of demand using aggregate data David Vincent Deloitte Economic Consulting London, UK [email protected] September 14, 2012 Introduction Estimation
Monte Carlo-based statistical methods (MASM11/FMS091)
Monte Carlo-based statistical methods (MASM11/FMS091) Jimmy Olsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February 5, 2013 J. Olsson Monte Carlo-based
6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games
6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games Asu Ozdaglar MIT February 4, 2009 1 Introduction Outline Decisions, utility maximization Strategic form games Best responses
PHASE ESTIMATION ALGORITHM FOR FREQUENCY HOPPED BINARY PSK AND DPSK WAVEFORMS WITH SMALL NUMBER OF REFERENCE SYMBOLS
PHASE ESTIMATION ALGORITHM FOR FREQUENCY HOPPED BINARY PSK AND DPSK WAVEFORMS WITH SMALL NUM OF REFERENCE SYMBOLS Benjamin R. Wiederholt The MITRE Corporation Bedford, MA and Mario A. Blanco The MITRE
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038
Numerical Research on Distributed Genetic Algorithm with Redundant
Numerical Research on Distributed Genetic Algorithm with Redundant Binary Number 1 Sayori Seto, 2 Akinori Kanasugi 1,2 Graduate School of Engineering, Tokyo Denki University, Japan [email protected],
1 Teaching notes on GMM 1.
Bent E. Sørensen January 23, 2007 1 Teaching notes on GMM 1. Generalized Method of Moment (GMM) estimation is one of two developments in econometrics in the 80ies that revolutionized empirical work in
Dynamics and Equilibria
Dynamics and Equilibria Sergiu Hart Presidential Address, GAMES 2008 (July 2008) Revised and Expanded (November 2009) Revised (2010, 2011, 2012, 2013) SERGIU HART c 2008 p. 1 DYNAMICS AND EQUILIBRIA Sergiu
Equilibrium computation: Part 1
Equilibrium computation: Part 1 Nicola Gatti 1 Troels Bjerre Sorensen 2 1 Politecnico di Milano, Italy 2 Duke University, USA Nicola Gatti and Troels Bjerre Sørensen ( Politecnico di Milano, Italy, Equilibrium
Persuasion by Cheap Talk - Online Appendix
Persuasion by Cheap Talk - Online Appendix By ARCHISHMAN CHAKRABORTY AND RICK HARBAUGH Online appendix to Persuasion by Cheap Talk, American Economic Review Our results in the main text concern the case
PS 271B: Quantitative Methods II. Lecture Notes
PS 271B: Quantitative Methods II Lecture Notes Langche Zeng [email protected] The Empirical Research Process; Fundamental Methodological Issues 2 Theory; Data; Models/model selection; Estimation; Inference.
Bayesian Statistics in One Hour. Patrick Lam
Bayesian Statistics in One Hour Patrick Lam Outline Introduction Bayesian Models Applications Missing Data Hierarchical Models Outline Introduction Bayesian Models Applications Missing Data Hierarchical
Original Article Efficient Genetic Algorithm on Linear Programming Problem for Fittest Chromosomes
International Archive of Applied Sciences and Technology Volume 3 [2] June 2012: 47-57 ISSN: 0976-4828 Society of Education, India Website: www.soeagra.com/iaast/iaast.htm Original Article Efficient Genetic
The CUSUM algorithm a small review. Pierre Granjon
The CUSUM algorithm a small review Pierre Granjon June, 1 Contents 1 The CUSUM algorithm 1.1 Algorithm............................... 1.1.1 The problem......................... 1.1. The different steps......................
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES Vijayalakshmi Mahanra Rao 1, Yashwant Prasad Singh 2 Multimedia University, Cyberjaya, MALAYSIA 1 [email protected]
1 Prior Probability and Posterior Probability
Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which
Model-based Synthesis. Tony O Hagan
Model-based Synthesis Tony O Hagan Stochastic models Synthesising evidence through a statistical model 2 Evidence Synthesis (Session 3), Helsinki, 28/10/11 Graphical modelling The kinds of models that
From the help desk: hurdle models
The Stata Journal (2003) 3, Number 2, pp. 178 184 From the help desk: hurdle models Allen McDowell Stata Corporation Abstract. This article demonstrates that, although there is no command in Stata for
A Sarsa based Autonomous Stock Trading Agent
A Sarsa based Autonomous Stock Trading Agent Achal Augustine The University of Texas at Austin Department of Computer Science Austin, TX 78712 USA [email protected] Abstract This paper describes an autonomous
The Advantages and Disadvantages of Online Linear Optimization
LINEAR PROGRAMMING WITH ONLINE LEARNING TATSIANA LEVINA, YURI LEVIN, JEFF MCGILL, AND MIKHAIL NEDIAK SCHOOL OF BUSINESS, QUEEN S UNIVERSITY, 143 UNION ST., KINGSTON, ON, K7L 3N6, CANADA E-MAIL:{TLEVIN,YLEVIN,JMCGILL,MNEDIAK}@BUSINESS.QUEENSU.CA
Entry and Inefficiency in the Real Estate Brokerage Industry: Empirical Evidence and Policy Implications
Entry and Inefficiency in the Real Estate Brokerage Industry: Empirical Evidence and Policy Implications Lu Han University of Toronto [email protected] Seung-Hyun Hong University of Illinois [email protected]
1 Introduction. 2 Prediction with Expert Advice. Online Learning 9.520 Lecture 09
1 Introduction Most of the course is concerned with the batch learning problem. In this lecture, however, we look at a different model, called online. Let us first compare and contrast the two. In batch
Government Debt Management: the Long and the Short of it
Government Debt Management: the Long and the Short of it E. Faraglia (U. of Cambridge and CEPR) A. Marcet (IAE, UAB, ICREA, BGSE, MOVE and CEPR), R. Oikonomou (U.C. Louvain) A. Scott (LBS and CEPR) ()
