# Online Appendix to Social Network Formation and Strategic Interaction in Large Networks


## Transcription

Euncheol Shin

Recent Version: October 3, 2015

### Abstract

In this online appendix, I present a continuous time approach to my model with the parametric form $\Phi(d) = d^{\alpha}$. The resulting degree distribution is the Weibull distribution (Weibull, 1951). I fit my model to four empirical degree distributions by maximum likelihood estimation. I compare the goodness-of-fit of my model with that of three other prominent degree distributions, generated by the models of Erdős and Rényi (1959), Barabási et al. (1999), and Jackson and Rogers (2007). I use the Kolmogorov-Smirnov statistic to find the best-fitting distribution.

### 1 Models and Distributions

#### 1.1 Continuous Time Approach

To obtain a tractable form of the resulting degree distribution, I apply a continuous time approach to my model.¹ In this approach, a node's degree changes deterministically at a rate proportional to its expected change. Specifically, the degree of node $i$ changes according to the following ordinary differential equation:

$$\frac{d\,d_i(t)}{dt} = \frac{d_i(t)}{\mu t} \cdot \frac{1}{d_i(t)^{\alpha}}, \qquad (1.1)$$

where $d_i(t)$ is the degree of node $i$ at time $t$, $\mu > 0$ is the parameter that represents the rate of new nodes entering the network, and $\alpha > 0$ is the parameter that captures the cost of forming new links. The first factor shows that higher degree nodes have a greater chance of being identified by newborn nodes, and the second factor shows that higher degree nodes are more selective with additional links. When the parameter $\alpha$ is one, the probability that node $i$ forms additional links is independent of its degree. However, when $\alpha$ is larger than one, node $i$ is less likely to form additional links as its degree increases.

¹ See Chapter 5 in Jackson (2010) and references therein for examples and discussions of this approach.

I solve the differential equation (1.1) and find that

$$d_i(t)^{\alpha} - d_i(i)^{\alpha} = \frac{\alpha}{\mu} \log\left(\frac{t}{i}\right).$$

To find an expression for the complementary cumulative degree distribution, I identify the proportion of nodes with degrees that exceed a given degree $d$ at time $t$. For this, it suffices to find the node that has exactly degree $d$ at time $t$, because all nodes born before that node have larger degrees. Let $i_t(d)$ be the node that has degree $d$ at time $t$: $d_{i_t(d)}(t) = d$. In addition, let $d_0$ be node $i_t(d)$'s initial degree: $d_{i_t(d)}(i_t(d)) = d_0$. Then, I have

$$\frac{i_t(d)}{t} = e^{-\left((\lambda d)^{\alpha} - (\lambda d_0)^{\alpha}\right)}, \quad \text{where } \lambda = \left(\frac{\mu}{\alpha}\right)^{1/\alpha}.$$

Because the nodes born before $i_t(d)$ are exactly those with degrees above $d$, this ratio is the complementary cumulative degree distribution. Finally, by setting $d_0 = 0$, the complementary cumulative degree distribution takes the form of a Weibull distribution:

$$1 - F(d; \lambda, \alpha) = e^{-(\lambda d)^{\alpha}}.$$

The hazard rate function of the Weibull distribution is increasing if and only if the parameter $\alpha$ is larger than one. The following proposition summarizes the results.

Proposition 1. The resulting complementary cumulative degree distribution has the form $1 - F(d; \lambda, \alpha) = e^{-(\lambda d)^{\alpha}}$, where $\lambda = (\mu/\alpha)^{1/\alpha}$. The hazard rate function is strictly increasing if and only if $\alpha > 1$.

#### 1.2 Other Models and Distributions

Poisson model. The uniform random network formation model by Erdős and Rényi (1959) generates the Poisson distribution as the resulting degree distribution. In their model, a link between two nodes is formed with a fixed probability, independently of other pairs of nodes. The resulting degree distribution is approximated by the Poisson distribution when the network size is sufficiently large. Specifically, the Poisson distribution with parameter $\rho > 0$ has the form

$$f(d; \rho) = \frac{e^{-\rho} \rho^{d}}{d!}.$$

The hazard rate of the Poisson distribution is strictly increasing for any value of $\rho$.

Preferential attachment model. The scale-free distribution is another prominent degree distribution, generated by the preferential attachment (PA) model (Barabási et al., 1999). In the PA model, a newborn node forms links with existing nodes probabilistically, and the probability of selecting a particular existing node is proportional to its degree. The resulting degree distribution is the scale-free distribution, which has the form

$$f(d; \gamma) = c\, d^{-\gamma},$$

where $\gamma > 1$ is the scale parameter, and $c > 0$ is the normalization factor, calculated as $c = \left(\sum_{d = d_{\min}}^{\infty} d^{-\gamma}\right)^{-1}$ when $d_{\min}$ is the minimum degree of a network.² Assuming that the minimum degree is one, the complementary cumulative degree distribution has the form

$$1 - F(d; \gamma) = \frac{\zeta(\gamma, d)}{\zeta(\gamma)},$$

where $\zeta(\gamma) = \sum_{k=1}^{\infty} k^{-\gamma}$ is the Riemann zeta function, and $\zeta(\gamma, d) = \sum_{k=0}^{\infty} (k + d)^{-\gamma}$ for $d \geq 1$. Note that the Riemann zeta function is well-defined if and only if $\gamma > 1$.

² Its name originates from the fact that for any two degrees of a fixed ratio, their probability ratios are independent of the scale of degrees: $f(d)/f(d') = f(sd)/f(sd')$ for all $d, d' \in \mathbb{N}$ and $s \in \mathbb{R}_{+}$. A scale-free distribution can be defined either over continuous real numbers or over discrete positive integers. Although a continuous distribution intuitively explains its various properties, estimation based on it generates systematic errors (Clauset et al., 2009).

Network-based search model. The network-based search model by Jackson and Rogers (2007) (henceforth, the JR model) produces another prominent degree distribution. In the JR model, nodes form links in two ways: (i) uniformly at random, and (ii) by searching locally through the given structure of the network. There are two parameters, $m > 0$ and $r > 0$: $m$ is the expected number of links that a new node forms, and $r$ is the ratio of the number of links formed uniformly at random to the number formed through network-based meetings.
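As a numerical aside on Section 1.1, the following Python sketch checks the closed-form solution of equation (1.1) against a forward-Euler integration and confirms that the birth-time ratio $i_t(d)/t$ equals the Weibull complementary CDF when $d_0 = 0$. The function names, the parameter values used for checking, and the choice of forward Euler are illustrative assumptions, not part of the appendix.

```python
import math


def weibull_ccdf(d, lam, alpha):
    """Complementary CDF 1 - F(d) = exp(-(lam * d) ** alpha) from Proposition 1."""
    return math.exp(-((lam * d) ** alpha))


def weibull_hazard(d, lam, alpha):
    """Weibull hazard rate f(d) / (1 - F(d)) = alpha * lam**alpha * d**(alpha - 1)."""
    return alpha * lam ** alpha * d ** (alpha - 1.0)


def degree_closed_form(t, birth, d0, mu, alpha):
    """Closed-form solution of (1.1): d(t)**alpha - d0**alpha = (alpha/mu) * log(t/birth)."""
    return (d0 ** alpha + (alpha / mu) * math.log(t / birth)) ** (1.0 / alpha)


def degree_euler(t, birth, d0, mu, alpha, steps=100_000):
    """Forward-Euler integration of dd/dt = d**(1 - alpha) / (mu * t).

    Requires d0 > 0 when alpha > 1, since the right-hand side is
    unbounded at d = 0 in that case.
    """
    d, s = d0, birth
    h = (t - birth) / steps
    for _ in range(steps):
        d += h * d ** (1.0 - alpha) / (mu * s)
        s += h
    return d


if __name__ == "__main__":
    mu, alpha = 2.0, 1.5  # hypothetical parameter values
    lam = (mu / alpha) ** (1.0 / alpha)
    # Closed form versus numerical integration for a node born at time 1.
    print(degree_closed_form(10.0, 1.0, 1.0, mu, alpha),
          degree_euler(10.0, 1.0, 1.0, mu, alpha))
    # A node born at time 3 with d0 = 0 has, at time 10, a degree d
    # such that the Weibull CCDF at d equals the birth-time ratio 3/10.
    d = degree_closed_form(10.0, 3.0, 0.0, mu, alpha)
    print(weibull_ccdf(d, lam, alpha))
```

The hazard function makes the role of $\alpha$ visible directly: the exponent $\alpha - 1$ is positive exactly when $\alpha > 1$, which is the condition in Proposition 1.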

Notice that $r$ can be infinity, which means that network formation is purely random. When $r < \infty$, the resulting complementary cumulative degree distribution has the form

$$1 - F(d; r, m) = \left(\frac{d_0 + rm}{d + rm}\right)^{1 + r},$$

where $d_0$ is the parameter that gives the number of links that each node forms upon its birth. Following Jackson and Rogers (2007), I set $d_0 = 0$ for the later parameter estimation.

### 2 Fitting Models to Data Sets

#### 2.1 Data Sets

I consider the following four real network datasets: (a) the 75 social networks of rural Indian villages, (b) a collaboration network of jazz musicians, (c) an online friendship network of Facebook users, and (d) the network among webpages at the University of Notre Dame.³ Figure 1 plots their empirical hazard rate functions. I present the hazard rate function of village #58 as an illustrative example. The empirical hazard rates exhibit increasing patterns for datasets (a) and (b), but decreasing patterns for datasets (c) and (d).

Figure 1: Different patterns of the hazard rate function across datasets. Panels: (a) Indian village #58, (b) Jazz musicians, (c) Facebook users, (d) Webpages. The vertical axis of each panel is the hazard rate.

³ I obtain dataset (a) from Banerjee et al. (2013), dataset (b) from Gleiser and Danon (2003), dataset (c) from McAuley and Leskovec (2012), and dataset (d) from Jackson and Rogers (2007). All datasets are publicly available from the corresponding authors' webpages.

#### 2.2 Estimation Strategy and Comparison

Estimation strategies. I fit the four theoretical degree distributions generated by network formation models to the empirical degree distributions of the above four network datasets. I explain how to estimate the model parameters.

First, I estimate the parameters of the Weibull distribution by maximum likelihood estimation, which is well documented in the statistics literature (e.g., Rinne, 2008). Specifically, for a given dataset $\{d_1, \ldots, d_n\}$, where $d_i$ is the degree of node $i \in \{1, \ldots, n\}$, the maximum likelihood estimator of $\alpha$ is identified as a solution of

$$\frac{1}{\alpha} = \frac{\sum_{i=1}^{n} d_i^{\alpha} \log d_i}{\sum_{i=1}^{n} d_i^{\alpha}} - \frac{1}{n} \sum_{i=1}^{n} \log d_i.$$

One can find a solution of the above equation by using a standard iterative procedure (e.g., the Newton-Raphson method). The solution is unique because the left-hand side is strictly decreasing in $\alpha$, but the right-hand side is strictly increasing in $\alpha$.

Second, I estimate the parameter $\rho > 0$ of the Poisson distribution by maximum likelihood estimation. In particular, for a given dataset $\{d_1, \ldots, d_n\}$, the maximum likelihood estimator $\hat{\rho}$ is the average degree in the dataset:

$$\hat{\rho} = \frac{1}{n} \sum_{i=1}^{n} d_i.$$

Third, I estimate the scale parameter $\gamma$ by maximum likelihood estimation (Clauset et al., 2009), which solves the following equation:

$$-\frac{\zeta'(\gamma)}{\zeta(\gamma)} = \frac{1}{n} \sum_{i=1}^{n} \log d_i.$$

A solution of the above equation always exists and is unique because the left-hand side is strictly decreasing in $\gamma$, but the right-hand side is constant.

Finally, for the JR model, I estimate the two parameters $m$ and $r$ according to the iteration method suggested by Jackson and Rogers (2007). To begin with, estimate $m$ as the average degree in the dataset. To estimate $r$, note first that, with $d_0 = 0$,

$$\log\left(1 - F(d; r, m)\right) = (1 + r) \log(rm) - (1 + r) \log(d + rm).$$

I use a method of iterative least squares as follows. Start with an initial value of $r$, say $r_0$, and plug in this value to obtain $\log(d + r_0 m)$. Estimate $(1 + r)$ by least squares to obtain $r_1$. Repeat the same procedure until a fixed point $r^{*}$ is achieved. If the procedure converges, the fixed point is unique regardless of the initial value $r_0$; otherwise, set $r = \infty$. When $r = \infty$, use the exponential distribution as the limit degree distribution.

Goodness-of-fit. To compare the models in terms of data fitting performance, I use the Kolmogorov-Smirnov (KS) statistic (Smirnov, 1944).⁴ The KS statistic is calculated as follows. Let $\{d_1, \ldots, d_n\}$ be a given dataset. Then, I find the empirical cumulative degree distribution $F_n(\cdot)$ as

$$F_n(d) = \frac{1}{n} \sum_{i=1}^{n} \mathbf{1}\{d_{\min} \leq d_i \leq d\}.$$

For each model, let $F(\cdot; \hat{\theta})$ be the estimated cumulative degree distribution obtained from the estimated parameter $\hat{\theta}$. The KS statistic is defined as

$$\sup_{d} \left| F_n(d) - F(d; \hat{\theta}) \right|.$$

By the Glivenko-Cantelli theorem, the KS statistic converges almost surely to zero as the dataset size $n$ becomes large if the dataset is generated according to the given model (Mood et al., 1974). Thus, a smaller value of the KS statistic means better data fitting performance of the model. Notice that this convergence does not depend on whether the true underlying degree distribution has an increasing hazard rate.

⁴ See Mood et al. (1974) for details of this test.

#### 2.3 Results

I first introduce the key parameters of the models. In my model, $\alpha > 0$ is the key parameter that distinguishes my model from other models; the other parameter $\lambda > 0$ provides no information beyond the average degree of the network, which is also available from the other three models. Recall that $\alpha$ indicates the behavior of the hazard rate function. In the Poisson model, there is only one parameter $\rho > 0$. There is also a single parameter $\gamma > 1$ in the PA model. In the JR model, the key parameter is $r \in (0, \infty]$, which governs the ratio of random to network-based meetings; the other parameter $m$ gives the average degree of the network. When the estimate for $r$ is infinity, I report the KS statistic by using an exponential distribution as the limit distribution.⁵

In Table 1, I report the estimates for the above four parameters and the KS statistics across datasets. The first and second lowest KS statistics are marked in the table.⁶ Since there are 75 networks in the Indian villages dataset, I present the minimum and the maximum values of the estimates and the KS statistics. Figure 2 illustrates the goodness-of-fit of the models across the datasets.

| Data set | | My model ($\alpha > 0$) | Poisson model ($\rho > 0$) | PA model ($\gamma > 1$) | JR model ($r > 0$) |
|---|---|---|---|---|---|
| Indian villages | Estimate | [.4, 2.39] | [3.8, 7.4] | [.49, 2.39] | [2.38, ∞] |
| Indian villages | KS statistic | [.5, .59] | [.35, .5] | [.3, .443] | [.98, .456] |
| Jazz musicians | | | | | |
| Facebook users | | | | | |
| Websites | | | | | |

Table 1: Parameter estimates and the goodness-of-fit of the models across datasets.

⁵ The exponential distribution is a special case of the Weibull distribution with $\lambda = m/2$ and $\alpha = 1$.

⁶ The estimate for the network of websites using the PA model is different from the values reported by Barabási and Albert (1999). This difference arises because they use ordinary least squares estimates assuming a continuous scale-free distribution, whereas I use maximum likelihood estimation assuming a discrete scale-free distribution.

Figure 2: An illustration of the goodness-of-fit across datasets and models. Panels: (a) an Indian village, (b) Jazz musicians, (c) Facebook users, (d) Webpages, each with one plot per model (My Model, Poisson Model, JR Model, PA Model). In each plot, the horizontal axis represents degrees, and the vertical axis represents the empirical (red circles) and estimated (black line) cumulative degree distributions. For each model, the KS statistic measures the maximum difference between the two distributions.

First of all, I find that all the estimates for the parameter $\alpha$ are consistent with a simple visual inspection of Figure 1. Specifically, the estimates for the networks of Indian villages and jazz musicians are all strictly greater than one, which confirms that the hazard rates are increasing. Since the KS statistics of my model are very low for these networks, the inference of increasing hazard rates is reliable. In contrast, for the networks of Facebook users and websites, the estimates are all strictly smaller than one.

For the 75 social networks of Indian villages, my model and the Poisson model fit the datasets better than the other two models. Although the lowest KS statistic of the JR model is .98, the second lowest KS statistic is .7, which is strictly greater than the largest KS statistic of my model. Moreover, for 71 of the social networks, the estimates for $r$ are infinity.⁷ This implies that the JR model loses its explanatory power, because $r = \infty$ corresponds to the random attachment model.

⁷ Only villages #3, #9, #3, and #36 return finite estimates for $r$.

The difference between my model and the Poisson model becomes clearer when I compare their KS statistics for the networks of jazz musicians and Facebook users. My model still fits both datasets well (the KS statistics are about .5), but the Poisson model returns very large KS statistics that are greater than .3. This difference is also visible in the shape of the fitted cumulative distributions in Figure 2-(b) and -(c). The fitted distributions from the Poisson model are S-shaped, but the empirical distributions show different patterns. In contrast, the shape of the fitted distributions from my model is flexible and performs well for the networks of jazz musicians and Facebook users.

The main force driving the flexibility of my model is that it contains two parameters, $\lambda$ and $\alpha$, that determine the size and shape of the distribution. For example, the jazz musician network is large in the sense that its average degree is about 27. In my model, the scale parameter $\lambda$ explains the average degree, and the shape parameter $\alpha$ explains the shape of the empirical distribution. However, there is only one parameter $\rho$ in the Poisson model, which indicates only the scale of the distribution. Thus, when the network is large, my model outperforms the Poisson distribution even when the empirical hazard rate function exhibits increasing patterns.

Finally, for the network of webpages, the PA model performs substantially better than any other model. The KS statistic is very small (.6), which is the lowest value among all the KS statistics across datasets and models. In comparison to my model, this dominant performance of the PA model is not surprising: my model tries to explain networks with bilateral and costly link formation, but the network of webpages is based on unilateral and costless link formation. Therefore, I conclude that my model fits empirical degree distributions very well whenever link formation is bilateral and costly.
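The Weibull estimation step and the KS comparison from Section 2.2 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the author's code: it solves the likelihood equation by bisection rather than Newton-Raphson (the text only asks for a standard iterative procedure), it assumes all degrees are strictly positive, it uses the standard closed-form scale estimate $\hat{\lambda} = (n / \sum_i d_i^{\hat{\alpha}})^{1/\hat{\alpha}}$, which the appendix does not state, and it evaluates the KS supremum at the sample points as for a continuous distribution. The function names and the bracketing interval are hypothetical.

```python
import math
import random


def weibull_mle(degrees, lo=1e-3, hi=50.0, tol=1e-10):
    """Estimate (alpha, lam) for 1 - F(d) = exp(-(lam * d) ** alpha).

    Assumes every degree is strictly positive and that the root of the
    likelihood equation lies in [lo, hi] (illustrative defaults).
    """
    logs = [math.log(d) for d in degrees]
    n = len(logs)
    mean_log = sum(logs) / n
    max_log = max(logs)

    def gap(a):
        # 1/a minus the right-hand side of the likelihood equation.
        # The weights d**a are rescaled by max(d)**a to avoid overflow;
        # the ratio of the two sums is unchanged by this rescaling.
        w = [math.exp(a * (l - max_log)) for l in logs]
        rhs = sum(wi * li for wi, li in zip(w, logs)) / sum(w) - mean_log
        return 1.0 / a - rhs

    # gap is strictly decreasing in a, so bisection finds its unique root.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    alpha = 0.5 * (lo + hi)
    # Closed-form scale estimate given alpha (assumption: standard result).
    lam = (n / sum(d ** alpha for d in degrees)) ** (1.0 / alpha)
    return alpha, lam


def ks_statistic(degrees, cdf):
    """sup_d |F_n(d) - F(d)|, checked on both sides of each jump of F_n."""
    xs = sorted(degrees)
    n = len(xs)
    stat = 0.0
    for k, x in enumerate(xs):
        f = cdf(x)
        stat = max(stat, abs((k + 1) / n - f), abs(k / n - f))
    return stat


if __name__ == "__main__":
    random.seed(0)
    # Synthetic data: scale 5 (lam = 0.2) and shape alpha = 1.5.
    sample = [random.weibullvariate(5.0, 1.5) for _ in range(5000)]
    alpha_hat, lam_hat = weibull_mle(sample)
    fitted = lambda d: 1.0 - math.exp(-((lam_hat * d) ** alpha_hat))
    print(alpha_hat, lam_hat, ks_statistic(sample, fitted))
```

Passing a different `cdf` callable to `ks_statistic` (for example, a fitted Poisson or exponential CDF) gives the kind of model-by-model comparison reported in Table 1.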

### References

Banerjee, A., Chandrasekhar, A. G., Duflo, E., and Jackson, M. O. (2013). The diffusion of microfinance. Science, 341(6144).

Barabási, A.-L. and Albert, R. (1999). Emergence of scaling in random networks. Science, 286.

Barabási, A.-L., Albert, R., and Jeong, H. (1999). Mean-field theory for scale-free random networks. Physica A, 272.

Clauset, A., Shalizi, C. R., and Newman, M. E. J. (2009). Power-law distributions in empirical data. SIAM Review, 51(4).

Erdős, P. and Rényi, A. (1959). On random graphs I. Publicationes Mathematicae Debrecen, 6.

Gleiser, P. M. and Danon, L. (2003). Community structure in jazz. Advances in Complex Systems, 6:565.

Jackson, M. O. (2010). Social and economic networks. Princeton University Press.

Jackson, M. O. and Rogers, B. W. (2007). Meeting strangers and friends of friends: How random are social networks? American Economic Review, 97(3).

McAuley, J. and Leskovec, J. (2012). Learning to discover social circles in ego networks. Advances in Neural Information Processing Systems 25.

Mood, A. M., Graybill, F. A., and Boes, D. C. (1974). Introduction to the theory of statistics. McGraw-Hill series in probability and statistics. McGraw-Hill College.

Rinne, H. (2008). The Weibull distribution: A handbook. Chapman and Hall/CRC, 1st edition.

Smirnov, N. V. (1944). Approximate laws of distributions of random variables from empirical data. Uspekhi Mat. Nauk.

Weibull, W. (1951). A statistical distribution function of wide applicability. Journal of Applied Mechanics, 18(3).


### 6 Scalar, Stochastic, Discrete Dynamic Systems

47 6 Scalar, Stochastic, Discrete Dynamic Systems Consider modeling a population of sand-hill cranes in year n by the first-order, deterministic recurrence equation y(n + 1) = Ry(n) where R = 1 + r = 1

### LECTURE 15: AMERICAN OPTIONS

LECTURE 15: AMERICAN OPTIONS 1. Introduction All of the options that we have considered thus far have been of the European variety: exercise is permitted only at the termination of the contract. These

### CMSC 858T: Randomized Algorithms Spring 2003 Handout 8: The Local Lemma

CMSC 858T: Randomized Algorithms Spring 2003 Handout 8: The Local Lemma Please Note: The references at the end are given for extra reading if you are interested in exploring these ideas further. You are

### Mathematical Finance

Mathematical Finance Option Pricing under the Risk-Neutral Measure Cory Barnes Department of Mathematics University of Washington June 11, 2013 Outline 1 Probability Background 2 Black Scholes for European

### Gains from Trade: The Role of Composition

Gains from Trade: The Role of Composition Wyatt Brooks University of Notre Dame Pau Pujolas McMaster University February, 2015 Abstract In this paper we use production and trade data to measure gains from

### Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks

Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks Imre Varga Abstract In this paper I propose a novel method to model real online social networks where the growing

### Continued Fractions. Darren C. Collins

Continued Fractions Darren C Collins Abstract In this paper, we discuss continued fractions First, we discuss the definition and notation Second, we discuss the development of the subject throughout history

### Evaluation of a New Method for Measuring the Internet Degree Distribution: Simulation Results

Evaluation of a New Method for Measuring the Internet Distribution: Simulation Results Christophe Crespelle and Fabien Tarissan LIP6 CNRS and Université Pierre et Marie Curie Paris 6 4 avenue du président

### Visualization and Modeling of Structural Features of a Large Organizational Email Network

Visualization and Modeling of Structural Features of a Large Organizational Email Network Benjamin H. Sims Statistical Sciences (CCS-6) Email: bsims@lanl.gov Nikolai Sinitsyn Physics of Condensed Matter

### \$2 4 40 + ( \$1) = 40

THE EXPECTED VALUE FOR THE SUM OF THE DRAWS In the game of Keno there are 80 balls, numbered 1 through 80. On each play, the casino chooses 20 balls at random without replacement. Suppose you bet on the

### 3.3. Solving Polynomial Equations. Introduction. Prerequisites. Learning Outcomes

Solving Polynomial Equations 3.3 Introduction Linear and quadratic equations, dealt within Sections 3.1 and 3.2, are members of a class of equations, called polynomial equations. These have the general

### Multiplicity. Chapter 6

Chapter 6 Multiplicity The fundamental theorem of algebra says that any polynomial of degree n 0 has exactly n roots in the complex numbers if we count with multiplicity. The zeros of a polynomial are

### DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS

International Scientific Conference & International Workshop Present Day Trends of Innovations 2012 28 th 29 th May 2012 Łomża, Poland DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS Lubos Takac 1 Michal Zabovsky

### Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

### Differentiating under an integral sign

CALIFORNIA INSTITUTE OF TECHNOLOGY Ma 2b KC Border Introduction to Probability and Statistics February 213 Differentiating under an integral sign In the derivation of Maximum Likelihood Estimators, or

### An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities Caihong Sun and Xiaoping Yang School of Information, Renmin University of China, Beijing 100872, P.R. China {chsun.vang> @ruc.edu.cn

### Review of Random Variables

Chapter 1 Review of Random Variables Updated: January 16, 2015 This chapter reviews basic probability concepts that are necessary for the modeling and statistical analysis of financial data. 1.1 Random

### princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming Lecturer: Sanjeev Arora

princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming Lecturer: Sanjeev Arora Scribe: One of the running themes in this course is the notion of

### Spectral Measure of Large Random Toeplitz Matrices

Spectral Measure of Large Random Toeplitz Matrices Yongwhan Lim June 5, 2012 Definition (Toepliz Matrix) The symmetric Toeplitz matrix is defined to be [X i j ] where 1 i, j n; that is, X 0 X 1 X 2 X n

### A scalable multilevel algorithm for graph clustering and community structure detection

A scalable multilevel algorithm for graph clustering and community structure detection Hristo N. Djidjev 1 Los Alamos National Laboratory, Los Alamos, NM 87545 Abstract. One of the most useful measures

### Generating Random Samples from the Generalized Pareto Mixture Model

Generating Random Samples from the Generalized Pareto Mixture Model MUSTAFA ÇAVUŞ AHMET SEZER BERNA YAZICI Department of Statistics Anadolu University Eskişehir 26470 TURKEY mustafacavus@anadolu.edu.tr

### Higher Education Math Placement

Higher Education Math Placement Placement Assessment Problem Types 1. Whole Numbers, Fractions, and Decimals 1.1 Operations with Whole Numbers Addition with carry Subtraction with borrowing Multiplication

### 9. Sampling Distributions

9. Sampling Distributions Prerequisites none A. Introduction B. Sampling Distribution of the Mean C. Sampling Distribution of Difference Between Means D. Sampling Distribution of Pearson's r E. Sampling

### **BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

**BEGINNING OF EXAMINATION** 1. You are given: (i) The annual number of claims for an insured has probability function: 3 p x q q x x ( ) = ( 1 ) 3 x, x = 0,1,, 3 (ii) The prior density is π ( q) = q,

### Chapter 2 Portfolio Management and the Capital Asset Pricing Model

Chapter 2 Portfolio Management and the Capital Asset Pricing Model In this chapter, we explore the issue of risk management in a portfolio of assets. The main issue is how to balance a portfolio, that

### STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct

### Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios By: Michael Banasiak & By: Daniel Tantum, Ph.D. What Are Statistical Based Behavior Scoring Models And How Are

### ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE

ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE YUAN TIAN This synopsis is designed merely for keep a record of the materials covered in lectures. Please refer to your own lecture notes for all proofs.

### Probability distributions

Probability distributions (Notes are heavily adapted from Harnett, Ch. 3; Hayes, sections 2.14-2.19; see also Hayes, Appendix B.) I. Random variables (in general) A. So far we have focused on single events,

### Gambling Systems and Multiplication-Invariant Measures

Gambling Systems and Multiplication-Invariant Measures by Jeffrey S. Rosenthal* and Peter O. Schwartz** (May 28, 997.. Introduction. This short paper describes a surprising connection between two previously

### Probability and Statistics

CHAPTER 2: RANDOM VARIABLES AND ASSOCIATED FUNCTIONS 2b - 0 Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be

### PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical

### SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

### Introduction to Logistic Regression

OpenStax-CNX module: m42090 1 Introduction to Logistic Regression Dan Calderon This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Abstract Gives introduction

### Complex Networks Analysis: Clustering Methods

Complex Networks Analysis: Clustering Methods Nikolai Nefedov Spring 2013 ISI ETH Zurich nefedov@isi.ee.ethz.ch 1 Outline Purpose to give an overview of modern graph-clustering methods and their applications