A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball
|
|
|
- Randolf McDaniel
- 10 years ago
- Views:
Transcription
1 Journal of Data Science 2(2004), A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball Tae Young Yang 1 and Tim Swartz 2 1 Myongji University and 2 Simon Fraser University Abstract: The probability of winning a game in major league baseball depends on various factors relating to team strength including the past performance of the two teams, the batting ability of the two teams and the starting pitchers. These three factors change over time. We combine these factors by adopting contribution parameters, and include a home field advantage variable in forming a two-stage Bayesian model. A Markov chain Monte Carlo algorithm is used to carry out Bayesian inference and to simulate outcomes of future games. We apply the approach to data obtained from the 2001 regular season in major league baseball. Key words: Major league baseball, Markov chain Monte Carlo, predictive distributions. 1. Introduction The probability of winning a game in major league baseball (MLB) depends on various factors relating to team strength including the past performance of the two teams, the batting ability of the two teams and the starting pitchers. We define the relative strength of a team over a competing team at a given point in time by combining these three factors into a single measurement. Although other factors relating to team strength may influence the probability of winning, we assume that their effect is minor, and note that the three main factors mentioned above are those that are traditionally considered in the setting of betting odds by bookmakers (McCune 1989). Bookmakers are also aware that the home field advantage significantly influences the probability of winning, and likewise, we include it in our calculations. In this paper, we propose a two-stage Bayesian model based on the relative strength variable and the home field advantage variable to predict the outcomes of games in MLB. MLB in the United States is divided into two leagues and six divisions. The American League (AL) has three divisions and the National League (NL) has three divisions. Each team plays 162 games in the regular season (April through 61
2 62 Tae Young Yang and Tim Swartz Table 1. Summary statistics for the 2001 MLB regular season. Division Team Overall Home Away Team Team Win % Win% Win % Batting Avg ERA AL East NY Yankees Boston Toronto Baltimore Tampa Bay AL Central Cleveland Minnesota White Sox Detroit Kansas City AL West Seattle Oakland Anaheim Texas NL East Atlanta Philadelphia NY Mets Florida Montreal NL Central Houston St. Louis Chicago Cubs Milwaukee Cincinnati Pittsburgh NL West Arizona San Francisco Los Angeles San Diego Colorado early October) which does not include the pre-season and the post-season playoffs. Summary statistics for the 2001 MLB regular season appear in Table 1. Overall win percentages are followed by the home and away win percentages, the overall team batting average and the overall team earned run average (ERA).
3 A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball 63 We observe that most teams win more often at home than on the road, and this suggests that the home field advantage variable may be useful. The 2001 regular season is noteworthy in that Seattle finished with an outstanding won-loss record, tying the 1906 Chicago Cubs MLB record for wins in a season. Seattle also established the AL record for road wins in a season (59) and set a major league record with 29 consecutive road series won or tied. More detailed statistical information on MLB is available from numerous internet websites including and The relative strength of a team over a competing team at a given point in time is based on three ratios: (1) the ratio of the winning percentages between the two teams, (2) the ratio of the overall team batting averages and (3) the ratio of the ERAs between the two starting pitchers. We note that on the day of a MLB game, all three ratios are typically available. Later we demonstrate that ratio (3) is more important in determining the probability of a win than ratio (1) and ratio (2). The importance of ratio (3) is consistent with our intuition where we believe, for example, that in the 2001 season, the Arizona Diamondbacks have a higher probability of winning when either Randy Johnson or Curt Schilling is pitching. Clearly, it is unreasonable to expect that all three ratios affect the probability of winning in the same way. For this reason, we adopt three unknown contribution parameters that assign relative importance to each of the three ratios. The contribution parameters are assumed constant across all teams and arise from independent prior densities. The home field advantage parameter is assumed to be apriori independent of the contribution parameters. Therefore our model essentially relies on four primary parameters; the three contribution parameters and the home field advantage parameter. This parametrization is much simpler than many other approaches. For example, the Bradley and Terry (1952) model requires at least n 1 parameters where a team parameter is defined for each of the n teams and the team parameters sum to a constant. In addition, our relative strength variable is flexible as it readily accommodates other factors, and can be easily modified for various sports such as basketball, football, soccer and hockey. A two-stage Bayesian model based on the relative strength variable and the home field advantage variable is proposed to predict the outcome of games in MLB. In the first stage, we assume that the probability that a given team wins is a random sample from a beta distribution with parameters based on the relative strength variable and the home field advantage variable. In the second stage, the outcome (win or loss) is a random sample from a Bernoulli distribution with this winning probability. We contrast the two-stage model with a one-stage model where the outcome (win or loss) is a random sample from a Bernoulli(p) distribution whose probability p is a direct function of the relative strength variable and the home field advantage variable. Thus, the two-stage model offers an extra level of variation to account for the difficulty that investigators have encountered when modelling outcomes of MLB baseball games (Kaigh 1995 and McCune 1989). In addition, the two-stage Bayesian structure permits convenient Bayesian inference. We observe that the posterior distribution arising from the two-stage Bayesian model is of a form that is not readily interpretable. In this context, Markov chain
4 64 Tae Young Yang and Tim Swartz Monte Carlo (MCMC) algorithms sample variates according to a Markov chain whose stationary distribution is the desired posterior distribution. The goal is to average the sampled variates to estimate posterior quantities of interest. We consider a Metropolis within Gibbs MCMC algorithm (see Gilks, Richardson and Spiegelhalter 1996) which is useful when the full conditional densities of the Gibbs sampling algorithm (Geman and Geman 1984) are non-standard. The Gibbs sampling algorithm is perhaps the simplest of the MCMC algorithms. Additional reading on Gibbs sampling in related contexts can be found in Gelfand and Smith (1990), Barry and Hartigan (1993), and Kuo and Yang (1995, 1996, 2000). MCMC sampling is used to simulate the outcomes of future games and then predict the eventual division winners. Kaigh (1995) considers a simple method of prediction for major league baseball using only the home and away records of the competing teams. Predictions are compared against results from the MLB regular seasons. It is not evident that the simple predictive model yields a profitable betting strategy. Barry and Hartigan (1993) consider a more complex choice model for prediction in MLB. Like our model, the Barry and Hartigan model is Bayesian, allows for changing team strengths over time and relies on a MCMC implementation. Some distinguishing features of their model include the assumption that team strengths are discrete, the exclusion of team batting averages as covariates and the huge number of parameters required. Perhaps the greatest practical difference, however, is that their model does not consider the effect of starting pitchers. Since starting pitchers may critically affect the outcome of a game, the Barry and Hartigan (1993) model cannot be used for accurate prediction of individual games. Their model is appropriate only for predicting a large number of games (e.g. the remainder of a season) where the effect of individual pitchers is averaged over the large number of games. Additional reading on baseball and prediction can be found in Bennett (1998), James, Albert and Stern (1997), and Barry and Hartigan (1994). Section 2 provides a complete specification of the full two-stage Bayesian model and indicates sub-models that may be appropriate. In Section 3, we adopt a Markov chain Monte Carlo approach for Bayesian inference. The full conditional distributions are derived and sampling is carried out using the Metropolis within Gibbs algorithm. Section 4 looks at the problem of prediction with specific emphasis on the prediction of results for the remainder of the season. In Section 5, simulation results from the models are studied in comparison with the actual final results from the 2001 MLB regular season. Some concluding remarks are then provided in Section The Model Suppose that the T games in a MLB regular season are indexed in time according to s =1,...,T. We calculate covariates (α s,β s,γ s ) immediately prior to time s where α s denotes the ratio of the winning percentages between the two teams
5 A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball 65 β s denotes the ratio of the overall team batting averages γ s denotes the ratio of the starting pitchers ERAs Without loss of generality, we let the numerator in the first two ratios correspond to the home team, and we let the denominator in the third ratio correspond to the home team. As pointed out by a referee, there is variability in the covariates and the variability is decreasing in time s. In particular, early in the season, some pitchers may have extraordinarily high or low ERAs. For this reason, we truncate γ s according to 1/γ 0 γ s γ 0 for prescribed γ 0. For the MLB data considered in this paper, we let γ 0 = 5. From a practical point of view, we consider only games corresponding to s t 0 for some moderate value t 0, to allow the covariates to stabilize. Typically, with MLB data, one would not expect extreme variation in the covariates α s and β s for s t 0. It is unreasonable to expect that all three ratios affect the probability of winning in the same way. We therefore adopt unknown contribution parameters r =(r 1,r 2,r 3 ) where we define the relative strength of the home team over the visiting team at time s as λ s = α r 1 s βr 2 s γr 3 s. We assume that the contribution parameters are constant across all MLB teams and arise from independent prior distributions with r i uniform(0,a i )wherea i is prescribed, i =1, 2, 3. The value r i close to 0 implies that the corresponding ratio has little effect on the relative strength. For the MLB data considered in this paper, we let a i =2fori =1, 2, 3. We view these as subjective priors where we use our knowledge of MLB to impose effects based on the current winning percentage, batting and pitching. Note that in the context of the models described below, a i = 2 is a realistic upper bound for the value of r i, i =1, 2, 3. The relative strength of the visiting team over the home team at time s is therefore given by the reciprocal 1/λ s =(1/α s ) r 1 (1/β s ) r 2 (1/γ s ) r 3. When a team s relative strength is larger (smaller) than 1, this implies that it is the stronger (weaker) team at time s. We note that the relative strength variable λ s is flexible as it readily accommodates other factors, and can be easily modified for various sports. We also consider a home field advantage variable δ which is assumed constant across all MLB teams and is assumed independent of the contribution parameters. The visiting field effect is then defined as 1/δ where we interpret δ>1asahome field advantage. We assign a uniform(δ 0,δ 1 ) prior distribution for δ where δ 0 and δ 1 are prescribed. For the MLB data considered in this paper, we let δ 0 =0and δ 1 = 2. The apriori belief E(δ) = 1 therefore implies that the home field has no effect. Having defined the relative strength λ s and the home field advantage δ, we now propose three predictive models in increasing levels of complexity. Let the random variable X s equal 1 (0) if the home team wins (loses) in the sth game. We
6 66 Tae Young Yang and Tim Swartz are interested in the probability distribution of X s. For convenience, we define p s =Prob(X s =1). Model 1: Prob(X s )=p Xs s (1 p s) 1 Xs and p s = λ sδ 1+λ s δ Model 1 is a simple one-stage model where X s is a Bernoulli random variable with expected value p s. One of the features of the( parametrization ) ( ) is that the probability of the visiting team winning is given by 1 λ sδ / 1+ 1 λ sδ = 1 1+λ = sδ 1 p s. Model 2: Prob(X s )=p Xs s (1 p s) 1 Xs and p s beta(λ s δ, 1) Model 2 is a two-stage model where the first stage uses the same Bernoulli distribution as in Model 1 and the second stage imposes a beta prior distribution. Note that conditional on p s, the expected value of X s is again p s.model2offers an additional level of variation beyond Model 1 to account for the difficulty that investigators have encountered when modelling the outcomes of MLB games. Model 3: Prob(X s )=p Xs s (1 p s ) 1 Xs and p s beta(mλ s δ, m) Model 3 generalizes Model 2 by introducing the parameter m>0. When m is equal to 1, then Model 3 reduces to Model 2. When m,thenmodel3 reduces to Model 1. Thus Model 1 and Model 2 can be viewed as sub-models, and we can assess their suitability according to the posterior distribution of m in Model 3. Note that Model 3 implies E(p s m) =λ s δ/(1 + λ s δ)asinmodel1and inmodel2,andvar(p s m) =λ s δ/(λ s δ +1) 2 (mλ s δ + m + 1). Therefore we can maintain the same mean structure, yet the probability distribution can be made more or less diffuse according to the value of m. We choose an exponential prior distribution with mean m 0 for m. In this paper, we let m 0 = Bayesian Inference Consider inference for the primary parameters r, δ, m and the p s in the full two-stage Bayesian model (Model 3). We refer to the fixed parameters a 1, a 2, a 3, δ 0, δ 1 and m 0 as hyperparameters of the model. In the 2001 MLB regular season, each of the 30 teams is scheduled to play 162 games leading to a total of T = 15(162) = 2430 games. Again, the games are indexed in time according to s =1,...,t,...,T where t is the current time. We denote the covariate triple D s =(α s,β s,γ s ) for s = t 0,...,t 1 where the components of D s are the three ratios defined in Section 2. Considering games from only time t 0 onwards, the posterior density of p t0,...,p t 1, m, δ and r is therefore proportional to the likelihood times prior and is given by π( ) = π(p ( t0,...,p t 1,m,δ,r x t0,...,x t 1,D t0,...,d t 1 ) t 1 ) s=t 0 B(mλ s δ, m) ps xs+mλsδ 1 (1 p s ) m xs e m/m (3.1) 0
7 A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball 67 where B(a, b) =Γ(a + b)/(γ(a)γ(b)) is the norming constant of the beta(a, b) distribution, m>0, δ 0 δ δ 1, a i r i b i for i =1, 2, 3and0<p s < 1for s = t 0,...,t 1. For inference we use the empirical measure of the samples generated by the Gibbs sampler. From (1), the Gibbs sampler requires iterative variate generation from the following four full conditional densities: [p s ] beta(x s + mλ s δ, m x s +1) s = t 0,...,t 1 [m ] e m/m 0 t 1 s=t 0 B(mλ s δ, m) p mλsδ s (1 p s ) m [δ ] t 1 [r i ] t 1 Γ(mλ sδ+m) s=t 0 Γ(mλ sδ) Γ(mλ sδ+m) s=t 0 Γ(mλ sδ) p mλsδ s p mλsδ s i =1, 2, 3 The generation of the p s variates is straightforward as almost every statistical package has a built-in beta generator. The generation of m, δ, r 1, r 2 and r 3 from their respective full conditional densities is less obvious as their densities have non-standard forms. For these variates, we use a Metropolis within Gibbs step (Gilks, Richardson and Spiegelhalter 1996) where in each case we use the corresponding prior distribution as the proposal distribution. Note that this is a practical advantage in avoiding improper priors. We now estimate the probability that the home team wins at time t. Denote the data at time t as data = (x t0,...,x t 1,D t0,...,d t 1 ). Then the predictive density at time t is given by P (x t data) p xt t (1 p t) 1 xt B(mλ t δ, m) pt mλtδ 1 (1 p t ) m 1 π(m, δ, r data) dp t dm dδ dr (3.2) where π(m, δ, r data) is the marginal posterior density obtained from (3.1). It is not practical to integrate according to (3.2). Instead, the integration is carried out using the MCMC algorithm for Model 3. Retaining the values (m, δ, r) obtained in an iteration of the MCMC algorithm, generate p t beta(mλ t δ, m) andthen generate X t from a Bernoulli(p t ) distribution. The X t are now a sample from the predictive density in (3.2) and can be averaged to estimate the predictive probability that the home team wins. Noting that E(X t data) = E(P t data), a simpler and more efficient estimate of the predictive probability that the home team wins can be obtained by averaging the λ t δ/(λ t δ + 1) values. This technique is sometimes referred to as Rao-Blackwellization. 4. Prediction Besides making inferences on the unknown parameters, we are also interested in predicting the outcome of games for the remainder of the season. We remark
8 68 Tae Young Yang and Tim Swartz that the MLB schedule is determined well in advance of the season, and therefore we always know who plays whom at a given point in time. Recall that t is the current time in the season. The predictive density of X t,...,x T can be obtained via P (X t,...,x T X t0,...,x t 1,D t0,...,d t 1 ) = T s=t P (X s X t0,...,x s 1,D t0,...,d s 1 ) where the terms within the product are given in (3.2). Our Markov chain algorithm can also be used for estimating results for the remainder of the season. After the chain has stabilized, we retain the values (m, δ, r) from a given iteration. We generate p t beta(mλ t δ, m) and then generate X t Bernoulli(p t ). Next we generate p t+1 beta(mλ t+1 δ, m) followed by X t+1 Bernoulli(p t+1 ). We continue the process obtaining a single sequence X t,...,x T. For each team, we combine the number of wins at the current time in the season with the number of wins during the simulated season t,...,t to obtain the number of wins over the full season. The entire process is repeated over multiple simulated seasons to derive estimates for the total number of wins. A practical difficulty concerns the updating of D s =(α s,β s,γ s )usedintheconstruction of λ s during the simulated part of the season s = t,...,t. The win ratios α s change for a team as they win and lose games in the simulated part of the season. Although α s is readily changed according to X t0,...,x s 1,theratios for team batting β s and starting pitchers ERAs γ s are unavailable for games well off into the future. As a compromise, we fix β s = β t and set γ s equal to the team average at time t, s = t,...,t. 5. Analysis of 2001 MLB Data Collecting relevant information on the entire 2001 MLB regular season is a formidable task. As such, we considered only games played on each of twelve dates, beginning April 15 and spread nearly evenly across the season. In these games, we observe 106 home team wins out of a total of 179 games giving a home field winning proportion of This sample proportion is a little larger than historical values for MLB (Berry 2001). Table 2 lists the sample means and standard errors for α, β, γ and the home field winning variable X. We observe that the sample means of α, β and γ are what we expect, e.g. nearly centered about 1. Note that β does not vary greatly about 1.0; this is an early suggestion that the batting variable may not have a significant role in prediction. We first consider practical convergence of the Markov chain using the CODA software (Best, Cowles and Vines 1995). CODA is a set of S-Plus functions that gives graphical summaries and diagnostics of convergence from MCMC output. The results from CODA suggest that practical convergence is achieved after 1,000 iterations. The following results are based on 1000 iterations and 100 replications after 1000 burn-in iterations. Table 3 provides posterior means and posterior standard deviations of the primary parameters r, δ and m. The posterior distributions of the r variables are
9 A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball 69 Table 2: Sample means and standard errors of α, β, γ and X. α β γ X Mean Standard Error somewhat flat confirming the difficulty in determining the exact contribution of the winning percentage, batting and pitching to the prediction problem. It also points to the need for larger data sets in future analyses. Table 3: Priors, posterior means and posterior standard deviations of the primary parameters. Parameter Prior Posterior Mean Posterior S.D. r 1 uniform(0, 2) r 2 uniform(0, 2) r 3 uniform(0, 2) δ uniform(0, 2) m exponential(10) To interpret the magnitude of the home field advantage, consider two teams with equal relative strength λ s = 1. In this case, the posterior mean of δ given by 1.58 implies that the home team wins with expected probability 0.61, which is close to the sample mean of X given by The posterior mean m =5.23 is less than the prior mean 10.0 and is in the direction of Model 2 where the value of m is 1. This provides some support for Model 2 although we rely on a model choice criterion based on prediction. We now turn to the prediction problem where the remainder of the 2001 MLB season is simulated from different points in time. We note that there are very few MLB games that are rained out and not rescheduled. Therefore almost all of the 30 teams completed the full 162 games. We compare the predictive ability of the three models given in Section 2. The criterion used is the sum of squared differences between the expected predictive win probabilities and the final season winning percentages over all 30 MLB teams. The chosen current time points are May 30, June 30, July 30 and August 30 in the regular season. The results are shown in Table 4 where the last column indicates the squared difference criterion between the current winning percentages and the final season winning percentages. The last column therefore corresponds to simple extrapolation of a
10 70 Tae Young Yang and Tim Swartz team s current performance to the end of the season. Naturally, we expect final season predictions to improve as the season progresses and this is the case in Table 4. We observe that the predictive ability of all three models is superior to simple extrapolation. Given the aforementioned difficulty of prediction in MLB, we stress that this is a promising result and is indicative of model adequacy. Amongst the three models, it seems that Model 3 may be overfitting although all three models are comparable in their predictive performance. We prefer Model 2 as it is slightly better than Model 3 in prediction and allows more variation than Model 1 without requiring additional parameters. The results from Table 3 also suggest that we might prefer Model 2 over Model 1. Table 4: Squared error difference criterion for comparing the three models of Section 2 and the simple extrapolation model. Predictive Date Model 1 Model 2 Model 3 Extrapolation May June July August In Table 5, we provide the prediction results for each of the 30 MLB teams from two points in time (May 30, July 30) based on Model 2. We note that 22 of the 30 predictions from July 30 are closer to the final winning percentages than the current winning percentages recorded on July 30. We also note that the model tends to exhibit a regression towards the mean effect. For example, using results to three decimal places, all six division leaders on July 30 have a predicted winning percentage that is less than their actual winning proportion on July 30. It is also interesting to compare the predictive impact of covariates α s, β s and γ s. The seven types of covariate combinations under Model 2 are given by S 1 : λ s = α r 1 s, S 2 : λ s = β r 2 s, S 3 : λ s = γ r 3 s, S 4 : λ s = α r 1 s β r 2 s, S 5 : λ s = α r 1 s γr 3 s, S 6 : λ s = β r 2 s γr 3 s, S 7 : λ s = α r 1 s βr 2 s γr 3 s. (5.1) The relative strengths of S 1, S 2 and S 3 consist of only one effect. The relative strengths of S 4, S 5 and S 6 consist of two effects and the relative strength of S 7 consists of all three effects. Table 6 provides the sum of squared differences between the expected predictive win probabilities and the final season winning percentages over all 30 MLB teams based on S 1 to S 7. For comparison purposes, the criterion is also reported for the simple extrapolation model. The results are not definitive but are in accord with prevailing wisdom that suggests that starting pitching is a key determinant in prediction.
11 A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball 71 Table 5. Predicted probabilities from different points in time using Model 2. Team Final Win May 30 July 30 Percentage (Actual, Predicted) (Actual, Predicted) NY Yankees , , 0.60 Boston , , 0.58 Toronto , , 0.47 Baltimore , , 0.42 Tampa Bay , , 0.32 Cleveland , , 0.55 Minnesota , , 0.58 White Sox , , 0.51 Detroit , , 0.43 Kansas City , , 0.41 Seattle , , 0.70 Oakland , , 0.55 Anaheim , , 0.51 Texas , , 0.42 Atlanta , , 0.57 Philadelphia , , 0.54 NY Mets , , 0.47 Florida , , 0.50 Montreal , , 0.43 Houston , , 0.53 St. Louis , , 0.52 Chicago Cubs , , 0.59 Milwaukee , , 0.45 Cincinnati , , 0.40 Pittsburgh , , 0.39 Arizona , , 0.57 San Francisco , , 0.53 Los Angeles , , 0.56 San Diego , , 0.49 Colorado , , Concluding Remarks We have seen that the proposed two-stage Bayesian model is effective in predicting division winners in MLB given results partway through the season. The
12 72 Tae Young Yang and Tim Swartz model is simple, easily incorporates other factors, can be extended to other sports and is amenable to a MCMC implementation. What we have not done in this paper is compare the predictive probabilities of winning with those obtained from the odds posted by bookmakers. Bookmakers use statistical information and intuition in determining their odds. When our predictive probabilities differ from those of the bookmaker by a prescribed margin, this triggers a wagering situation. The natural question then is whether this betting strategy yields income sufficient to overcome the bookmaker s vigorish. This is a topic of future research. More information on aspects of sports gambling can be found in Insley, Mok and Swartz (2003). Table 6: Squared error difference criterion for assessing covariate combinations as described in (5.1). The simple extrapolation model is also included. Predictive S 1 S 2 S 3 S 4 S 5 S 6 S 7 Extra- Date polation May June July August Acknowledgement Yang s work was partially supported by a grant from the Korea Science and Engineering Foundation. Swartz s work was partially supported by a grant from the Natural Sciences and Engineering Research Council of Canada. References Barry, D. and Hartigan, J. A. (1993). Choice models for predicting divisional winners in major league baseball. Journal of the American Statistical Association, 88, Barry, D. and Hartigan, J. A. (1994). Change points in 0-1 sequences, with an application to predicting divisional winners in major league baseball. Journal of Applied Statistical Science 1, Bennett, J.M. (1998). Baseball. In Statistics in Sport (Edited by J.M. Bennett). Arnold Applications of Statistics Series, Arnold Publishing, Berry, S. (2001). A Statistician reads the sports pages. Chance 14(1),
13 A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball 73 Best, N. G., Cowles, M. K. and Vines, S. K. (1995). CODA: Convergence Diagnosis and Output Analysis Software for Gibbs Sampling Output, Version 0.3. MRC Biostatistics Unit, Cambridge. Bradley, R. A. and Terry, M. E. (1952). Risk analysis of incomplete block designs I: The method of paired comparisons. Biometrika 39, Gelfand, A. and Smith, A. F. M. (1990). Sampling based approaches to calculating marginal densities. Journal of the American Statistical Association 85, Gelman, A. E. and Rubin, D. (1992). Inference from iterative simulation using multiple Ssquences. Statistical Science 7, Geman, S. and Geman, D. (1984). Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Transaction on Pattern Analysis and Machine Intelligence 6, Gilks, W. R., Richardson, S. and Spiegelhalter, D. J. (editors) (1996). Markov Chain Monte Carlo in Practice. Chapman and Hall. Insley, R., Mok, L. and Swartz, T. B. (2003). Some practical results related to sports gambling. To appear in The Australian and New Zeland Jpurnal of Statistics. James, B., Albert, J. and Stern, H.S. (1993). Answering questions about baseball using statistics. Chance, 6(2) ,30. Kaigh, W.D. (1995). Forecasting baseball games. Chance 8(2) Kuo, L. and Yang, T. (1995). Bayesian computation for software reliability. Journal of Computational and Graphical Statistics 4, Kuo, L. and Yang, T. (1996). Bayesian computation for nonhomogeneous Poisson processes in software reliability. Journal of the American Statistical Association 91, Kuo, L. and Yang, T. (2000). Bayesian reliability modeling for masked system lifetime data. Statistics and Probability Letters 47, McCune, B. (1989). Education of a Sports Bettor. McCune Sports Investments. Received November 14, 2002; accepted March 11, 2003 Tae Young Yang Department of Mathematics Myongji University Kyunggi, Korea [email protected] Tim Swartz Department of Statistics and Actuarial Science Simon Fraser University Burnaby, British Columbia Canada V5A1S6 [email protected]
The Effects of Atmospheric Conditions on Pitchers
The Effects of Atmospheric Conditions on Rodney Paul Syracuse University Matt Filippi Syracuse University Greg Ackerman Syracuse University Zack Albright Syracuse University Andrew Weinbach Coastal Carolina
2015 NFL Annual Selection Meeting R P O CLUB PLAYER POS COLLEGE ROUND 2
ROUND 2 2 1 33 TENNESSEE 2 2 34 TAMPA BAY 2 3 35 OAKLAND 2 4 36 JACKSONVILLE 2 5 37 NEW YORK JETS 2 6 38 WASHINGTON 2 7 39 CHICAGO 2 8 40 NEW YORK GIANTS 2 9 41 ST. LOUIS 2 10 42 ATLANTA 2 11 43 CLEVELAND
Comparing & Contrasting. - mathematically. Ways of comparing mathematical and scientific quantities...
Comparing & Contrasting - mathematically Ways of comparing mathematical and scientific quantities... Comparison by Division: using RATIOS, PERCENTAGES, ODDS, and PROPORTIONS Definition of Probability:
Organizing Topic: Data Analysis
Organizing Topic: Data Analysis Mathematical Goals: Students will analyze and interpret univariate data using measures of central tendency and dispersion. Students will calculate the z-scores for data.
Statistics in Baseball. Ali Bachani Chris Gomsak Jeff Nitz
Statistics in Baseball Ali Bachani Chris Gomsak Jeff Nitz EIS Mini-Paper Professor Adner October 7, 2013 2 Statistics in Baseball EIS Statistics in Baseball The Baseball Ecosystem... 3 An Unfair Game...
Using Baseball Data as a Gentle Introduction to Teaching Linear Regression
Creative Education, 2015, 6, 1477-1483 Published Online August 2015 in SciRes. http://www.scirp.org/journal/ce http://dx.doi.org/10.4236/ce.2015.614148 Using Baseball Data as a Gentle Introduction to Teaching
ISSUES RELATED TO SPORTS GAMBLING
ISSUES RELATED TO SPORTS GAMBLING Robin Insley, Lucia Mok and Tim Swartz Summary This paper looks at various issues that are of interest to the sports gambler. First, an expression is obtained for the
Improving paired comparison models for NFL point spreads by data transformation. Gregory J. Matthews
Improving paired comparison models for NFL point spreads by data transformation by Gregory J. Matthews A Project Report Submitted to the Faculty of WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment
The Performance of Betting Lines for Predicting the Outcome of NFL Games
The Performance of Betting Lines for Predicting the Outcome of NFL Games Greg Szalkowski and Michael L. Nelson Old Dominion University, Department of Computer Science Norfolk, VA 23529 arxiv:1211.4000v1
Predicting the World Cup. Dr Christopher Watts Centre for Research in Social Simulation University of Surrey
Predicting the World Cup Dr Christopher Watts Centre for Research in Social Simulation University of Surrey Possible Techniques Tactics / Formation (4-4-2, 3-5-1 etc.) Space, movement and constraints Data
Solution Let us regress percentage of games versus total payroll.
Assignment 3, MATH 2560, Due November 16th Question 1: all graphs and calculations have to be done using the computer The following table gives the 1999 payroll (rounded to the nearest million dolars)
Bayesian Statistics: Indian Buffet Process
Bayesian Statistics: Indian Buffet Process Ilker Yildirim Department of Brain and Cognitive Sciences University of Rochester Rochester, NY 14627 August 2012 Reference: Most of the material in this note
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Refik Soyer * Department of Management Science The George Washington University M. Murat Tarimcilar Department of Management Science
Model Calibration with Open Source Software: R and Friends. Dr. Heiko Frings Mathematical Risk Consulting
Model with Open Source Software: and Friends Dr. Heiko Frings Mathematical isk Consulting Bern, 01.09.2011 Agenda in a Friends Model with & Friends o o o Overview First instance: An Extreme Value Example
Bayesian Statistics in One Hour. Patrick Lam
Bayesian Statistics in One Hour Patrick Lam Outline Introduction Bayesian Models Applications Missing Data Hierarchical Models Outline Introduction Bayesian Models Applications Missing Data Hierarchical
-- Special Ebook -- Bookie Buster: Secret Systems Used by Pro Sports Gamblers Finally REVEALED!
Special Ebook from Golden-Systems.com: Bookie Buster -- Special Ebook -- Bookie Buster: Secret Systems Used by Pro Sports Gamblers Finally REVEALED! By Frank Belanger http://www.golden-systems.com/ 2004
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
The econometrics of baseball: A statistical investigation
The econometrics of baseball: A statistical investigation Mary Hilston Keener The University of Tampa The purpose of this paper is to use various baseball statistics available at the beginning of each
MAJOR LEAGUE BASEBALL 2011 ATTENDANCE ANALYSIS. Compiled and Written by David P. Kronheim. [email protected]
MAJOR LEAGUE BASEBALL 2011 ATTENDANCE ANALYSIS Compiled and Written by David P. Kronheim [email protected] 2012 MAJOR LEAGUE BASEBALL 2011 ATTENDANCE ANALYSIS TABLE OF CONTENTS PAGES Attendance Reporting
SACKS-CHAMPIONSHIPS-GAMES WON ALL-TIME RECORDS TEAM RECORDS
SACKS-CHAMPIONSHIPS-GAMES WON TEAM RECORDS CHAMPIONSHIPS Most Seasons League Champion 13 Green Bay, 1929-1931, 1936, 1939, 1944, 1961-62, 1965-67, 1996, 2010 9 Chi. Bears, 1921, 1932-33, 1940-41, 1943,
International Statistical Institute, 56th Session, 2007: Phil Everson
Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: [email protected] 1. Introduction
How To Bet On An Nfl Football Game With A Machine Learning Program
Beating the NFL Football Point Spread Kevin Gimpel [email protected] 1 Introduction Sports betting features a unique market structure that, while rather different from financial markets, still boasts
Predicting sports events from past results
Predicting sports events from past results Towards effective betting on football Douwe Buursma University of Twente P.O. Box 217, 7500AE Enschede The Netherlands [email protected] ABSTRACT
Rating Systems for Fixed Odds Football Match Prediction
Football-Data 2003 1 Rating Systems for Fixed Odds Football Match Prediction What is a Rating System? A rating system provides a quantitative measure of the superiority of one football team over their
ISyE 2028 Basic Statistical Methods - Fall 2015 Analysis of NFL Team Performance Against the Spread Betting Line
ISyE 2028 Basic Statistical Methods - Fall 2015 Analysis of NFL Team Performance Against the Spread Betting Line Introduction Author: Jonathan Pang Sports betting, although controversial, is an extremely
Sports Betting Systems
Sports Betting Systems Sports Betting Systems Fundamentals! Published By Brendan OHara & NicheMarketer.com 2008 You Have FULL Master Resell Rights To This Product. Table of Contents Part 1 - Fundamentals
The NCAA Basketball Betting Market: Tests of the Balanced Book and Levitt Hypotheses
The NCAA Basketball Betting Market: Tests of the Balanced Book and Levitt Hypotheses Rodney J. Paul, St. Bonaventure University Andrew P. Weinbach, Coastal Carolina University Kristin K. Paul, St. Bonaventure
THE DETERMINANTS OF SCORING IN NFL GAMES AND BEATING THE SPREAD
THE DETERMINANTS OF SCORING IN NFL GAMES AND BEATING THE SPREAD C. Barry Pfitzner, Department of Economics/Business, Randolph-Macon College, Ashland, VA 23005, [email protected], 804-752-7307 Steven D.
PROBATE LAW CASE SUMMARY
Every month I summarize the most important probate cases in Michigan. Now I publish my summaries as a service to colleagues and friends. I hope you find these summaries useful and I am always interested
PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE
PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE BY P.D. ENGLAND AND R.J. VERRALL ABSTRACT This paper extends the methods introduced in England & Verrall (00), and shows how predictive
Beating the Book: Are There Patterns in NFL Betting Lines?
Beating the Book: Are There Patterns in NFL Betting Lines? Michael R. Summers Abstract Las Vegas sports books provide two even-money bets (not counting commission, or "vigorish") regarding National Football
A Contrarian Approach to the Sports Betting Marketplace
A Contrarian Approach to the Sports Betting Marketplace Dan Fabrizio, James Cee Sports Insights, Inc. Beverly, MA 01915 USA Email: [email protected] Abstract Actual sports betting data is collected
Numerical Algorithms for Predicting Sports Results
Numerical Algorithms for Predicting Sports Results by Jack David Blundell, 1 School of Computing, Faculty of Engineering ABSTRACT Numerical models can help predict the outcome of sporting events. The features
Combining player statistics to predict outcomes of tennis matches
IMA Journal of Management Mathematics (2005) 16, 113 120 doi:10.1093/imaman/dpi001 Combining player statistics to predict outcomes of tennis matches TRISTAN BARNETT AND STEPHEN R. CLARKE School of Mathematical
The Determinants of Scoring in NFL Games and Beating the Over/Under Line. C. Barry Pfitzner*, Steven D. Lang*, and Tracy D.
FALL 2009 The Determinants of Scoring in NFL Games and Beating the Over/Under Line C. Barry Pfitzner*, Steven D. Lang*, and Tracy D. Rishel** Abstract In this paper we attempt to predict the total points
Tutorial on Markov Chain Monte Carlo
Tutorial on Markov Chain Monte Carlo Kenneth M. Hanson Los Alamos National Laboratory Presented at the 29 th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Technology,
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT
Validation of Software for Bayesian Models using Posterior Quantiles Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT Abstract We present a simulation-based method designed to establish that software
The Fibonacci Strategy Revisited: Can You Really Make Money by Betting on Soccer Draws?
MPRA Munich Personal RePEc Archive The Fibonacci Strategy Revisited: Can You Really Make Money by Betting on Soccer Draws? Jiri Lahvicka 17. June 2013 Online at http://mpra.ub.uni-muenchen.de/47649/ MPRA
A Bayesian Antidote Against Strategy Sprawl
A Bayesian Antidote Against Strategy Sprawl Benjamin Scheibehenne ([email protected]) University of Basel, Missionsstrasse 62a 4055 Basel, Switzerland & Jörg Rieskamp ([email protected])
Estimating the Value of Major League Baseball Players
Estimating the Value of Major League Baseball Players Brian Fields * East Carolina University Department of Economics Masters Paper July 26, 2001 Abstract This paper examines whether Major League Baseball
Statistical techniques for betting on football markets.
Statistical techniques for betting on football markets. Stuart Coles Padova, 11 June, 2015 Smartodds Formed in 2003. Originally a betting company, now a company that provides predictions and support tools
An econometric analysis of the 2013 major league baseball season
An econometric analysis of the 2013 major league baseball season ABSTRACT Steven L. Fullerton New Mexico State University Thomas M. Fullerton, Jr. University of Texas at El Paso Adam G. Walke University
REGULATING INSIDER TRADING IN BETTING MARKETS
# Blackwell Publishers Ltd and the Board of Trustees of the Bulletin of Economic Research 1999. Published by Blackwell Publishers, 108 Cowley Road, Oxford OX4 1JF, UK and 350 Main Street, Malden, MA 02148,
Beating the MLB Moneyline
Beating the MLB Moneyline Leland Chen [email protected] Andrew He [email protected] 1 Abstract Sports forecasting is a challenging task that has similarities to stock market prediction, requiring time-series
Using SAS PROC MCMC to Estimate and Evaluate Item Response Theory Models
Using SAS PROC MCMC to Estimate and Evaluate Item Response Theory Models Clement A Stone Abstract Interest in estimating item response theory (IRT) models using Bayesian methods has grown tremendously
Time series Forecasting using Holt-Winters Exponential Smoothing
Time series Forecasting using Holt-Winters Exponential Smoothing Prajakta S. Kalekar(04329008) Kanwal Rekhi School of Information Technology Under the guidance of Prof. Bernard December 6, 2004 Abstract
11. Time series and dynamic linear models
11. Time series and dynamic linear models Objective To introduce the Bayesian approach to the modeling and forecasting of time series. Recommended reading West, M. and Harrison, J. (1997). models, (2 nd
During the course of our research on NBA basketball, we found out a couple of interesting principles.
After mining all the available NBA data for the last 15 years, we found the keys to a successful basketball betting system: If you follow this system exactly, you can expect to hit 90% of your NBA bets.
A STUDY OF HOME RUNS IN THE MAJOR LEAGUES
A STUDY OF HOME RUNS IN THE MAJOR LEAGUES KEITH KNIGHT ALEX SCHUSTER Department of Statistics University of Toronto May, 1992 Abstract It is well-known that the rate of home runs varies significantly among
Validation of Software for Bayesian Models Using Posterior Quantiles
Validation of Software for Bayesian Models Using Posterior Quantiles Samantha R. COOK, Andrew GELMAN, and Donald B. RUBIN This article presents a simulation-based method designed to establish the computational
arxiv:1112.0829v1 [math.pr] 5 Dec 2011
How Not to Win a Million Dollars: A Counterexample to a Conjecture of L. Breiman Thomas P. Hayes arxiv:1112.0829v1 [math.pr] 5 Dec 2011 Abstract Consider a gambling game in which we are allowed to repeatedly
Bayesian Methods for the Social and Behavioral Sciences
Bayesian Methods for the Social and Behavioral Sciences Jeff Gill Harvard University 2007 ICPSR First Session: June 25-July 20, 9-11 AM. Email: [email protected] TA: Yu-Sung Su ([email protected]).
Prices, Point Spreads and Profits: Evidence from the National Football League
Prices, Point Spreads and Profits: Evidence from the National Football League Brad R. Humphreys University of Alberta Department of Economics This Draft: February 2010 Abstract Previous research on point
The Variability of P-Values. Summary
The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 [email protected] August 15, 2009 NC State Statistics Departement Tech Report
Applications of R Software in Bayesian Data Analysis
Article International Journal of Information Science and System, 2012, 1(1): 7-23 International Journal of Information Science and System Journal homepage: www.modernscientificpress.com/journals/ijinfosci.aspx
THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE
THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE Batsirai Winmore Mazviona 1 Tafadzwa Chiduza 2 ABSTRACT In general insurance, companies need to use data on claims gathered from
The Math. P (x) = 5! = 1 2 3 4 5 = 120.
The Math Suppose there are n experiments, and the probability that someone gets the right answer on any given experiment is p. So in the first example above, n = 5 and p = 0.2. Let X be the number of correct
Inference on Phase-type Models via MCMC
Inference on Phase-type Models via MCMC with application to networks of repairable redundant systems Louis JM Aslett and Simon P Wilson Trinity College Dublin 28 th June 202 Toy Example : Redundant Repairable
Using Past Performance to Predict NFL Outcomes: A Chartist Approach
Using Past Performance to Predict NFL Outcomes: A Chartist Approach March 1997 This Revision: April 1997 David N. DeJong Department of Economics University of Pittsburgh Pittsburgh, PA 15260 [email protected]
The Basics of Graphical Models
The Basics of Graphical Models David M. Blei Columbia University October 3, 2015 Introduction These notes follow Chapter 2 of An Introduction to Probabilistic Graphical Models by Michael Jordan. Many figures
Volume 30, Issue 4. Market Efficiency and the NHL totals betting market: Is there an under bias?
Volume 30, Issue 4 Market Efficiency and the NHL totals betting market: Is there an under bias? Bill M Woodland Economics Department, Eastern Michigan University Linda M Woodland College of Business, Eastern
Markov Chain Monte Carlo Simulation Made Simple
Markov Chain Monte Carlo Simulation Made Simple Alastair Smith Department of Politics New York University April2,2003 1 Markov Chain Monte Carlo (MCMC) simualtion is a powerful technique to perform numerical
Prediction Markets, Fair Games and Martingales
Chapter 3 Prediction Markets, Fair Games and Martingales Prediction markets...... are speculative markets created for the purpose of making predictions. The current market prices can then be interpreted
Basic Bayesian Methods
6 Basic Bayesian Methods Mark E. Glickman and David A. van Dyk Summary In this chapter, we introduce the basics of Bayesian data analysis. The key ingredients to a Bayesian analysis are the likelihood
Joint modelling of goals and bookings in association football matches
Joint modelling of goals and bookings in association football matches Debbie Costain Gareth Ridall Kris Gregory September 4, 2012 Overview Data Counting process model for goals and bookings Modelling effects
DURATION ANALYSIS OF FLEET DYNAMICS
DURATION ANALYSIS OF FLEET DYNAMICS Garth Holloway, University of Reading, [email protected] David Tomberlin, NOAA Fisheries, [email protected] ABSTRACT Though long a standard technique
BayesX - Software for Bayesian Inference in Structured Additive Regression
BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich
The Kelly criterion for spread bets
IMA Journal of Applied Mathematics 2007 72,43 51 doi:10.1093/imamat/hxl027 Advance Access publication on December 5, 2006 The Kelly criterion for spread bets S. J. CHAPMAN Oxford Centre for Industrial
How To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
Live Betting Extra Rules
Live Betting Extra Rules Rules for General Main Live Lines The following rules apply for Live Betting: Markets do not include overtime unless otherwise stated. Lines are offered at the provider s discretion.
POINT SPREAD SHADING AND BEHAVIORAL BIASES IN NBA BETTING MARKETS. by Brad R. Humphreys *
RIVISTA DI ISSN 1825-6678 DIRITTO ED ECONOMIA DELLO SPORT Vol. VI, Fasc. 1, 2010 POINT SPREAD SHADING AND BEHAVIORAL BIASES IN NBA BETTING MARKETS by Brad R. Humphreys * SUMMARY: Introduction 1. A Simple
SPORTSBOOK TERMS & CONDITIONS
SPORTSBOOK TERMS & CONDITIONS DATE OF ISSUE: 01.04.2014 1. General Sports Rules 1.1 Every user himself defines the bets, except the limitation that is set by the earnings limits according to the calculation
Modelling the Scores of Premier League Football Matches
Modelling the Scores of Premier League Football Matches by: Daan van Gemert The aim of this thesis is to develop a model for estimating the probabilities of premier league football outcomes, with the potential
5.2 Percent: Converting Between Fractions, Decimals, and Percents
5.2 Percent: Converting Between Fractions, Decimals, and Percents The concept of percent permeates most common uses of mathematics in everyday life. We pay taes based on percents, many people earn income
Nonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: [email protected]
Journal of Quantitative Analysis in Sports
Journal of Quantitative Analysis in Sports Volume 4, Issue 2 2008 Article 7 Racial Bias in the NBA: Implications in Betting Markets Tim Larsen, Brigham Young University - Utah Joe Price, Brigham Young
2011 AMERICAN CONFERENCE. East Division W L T Pct. Pts. OP New England# 13 3 0.813 513 342. Washington 5 11 0.313 288 367 North Division
2011 New England# 13 3 0.813 513 342 New York Giants 9 7 0.563 394 400 New York Jets 8 8 0.500 377 363 Philadelphia 8 8 0.500 396 328 Miami 6 10 0.375 329 313 Dallas 8 8 0.500 369 347 Buffalo 6 10 0.375
BAYESIAN ANALYSIS OF AN AGGREGATE CLAIM MODEL USING VARIOUS LOSS DISTRIBUTIONS. by Claire Dudley
BAYESIAN ANALYSIS OF AN AGGREGATE CLAIM MODEL USING VARIOUS LOSS DISTRIBUTIONS by Claire Dudley A dissertation submitted for the award of the degree of Master of Science in Actuarial Management School
Another Look at Sensitivity of Bayesian Networks to Imprecise Probabilities
Another Look at Sensitivity of Bayesian Networks to Imprecise Probabilities Oscar Kipersztok Mathematics and Computing Technology Phantom Works, The Boeing Company P.O.Box 3707, MC: 7L-44 Seattle, WA 98124
Estimation and comparison of multiple change-point models
Journal of Econometrics 86 (1998) 221 241 Estimation and comparison of multiple change-point models Siddhartha Chib* John M. Olin School of Business, Washington University, 1 Brookings Drive, Campus Box
RULES AND REGULATIONS OF FIXED ODDS BETTING GAMES
RULES AND REGULATIONS OF FIXED ODDS BETTING GAMES Project: Royalhighgate Public Company Ltd. 04.04.2014 Table of contents SECTION I: GENERAL RULES... 6 ARTICLE 1 GENERAL REGULATIONS...6 ARTICLE 2 THE HOLDING
Does NFL Spread Betting Obey the E cient Market Hypothesis?
Does NFL Spread Betting Obey the E cient Market Hypothesis? Johannes Harkins May 2013 Abstract In this paper I examine the possibility that NFL spread betting does not obey the strong form of the E cient
Fair Bets and Profitability in College Football Gambling
236 s and Profitability in College Football Gambling Rodney J. Paul, Andrew P. Weinbach, and Chris J. Weinbach * Abstract Efficient markets in college football are tested over a 25- year period, 1976-2000.
VALIDATING A DIVISION I-A COLLEGE FOOTBALL SEASON SIMULATION SYSTEM. Rick L. Wilson
Proceedings of the 2005 Winter Simulation Conference M. E. Kuhl, N. M. Steiger, F. B. Armstrong, and J. A. Joines, eds. VALIDATING A DIVISION I-A COLLEGE FOOTBALL SEASON SIMULATION SYSTEM Rick L. Wilson
Picking Winners is For Losers: A Strategy for Optimizing Investment Outcomes
Picking Winners is For Losers: A Strategy for Optimizing Investment Outcomes Clay graham DePaul University Risk Conference Las Vegas - November 11, 2011 REMEMBER Picking a winner is not at all the same
More details on the inputs, functionality, and output can be found below.
Overview: The SMEEACT (Software for More Efficient, Ethical, and Affordable Clinical Trials) web interface (http://research.mdacc.tmc.edu/smeeactweb) implements a single analysis of a two-armed trial comparing
Smoking Policies at Major League Baseball Stadiums April 2, 2015
Defending your right to breathe smokefree air since 1976 Smoking Policies at Major League Baseball Stadiums April 2, 2015 Arizona Diamondbacks Chase Field Chase Field is designated as a non-smoking facility.
Foundations of Statistics Frequentist and Bayesian
Mary Parker, http://www.austincc.edu/mparker/stat/nov04/ page 1 of 13 Foundations of Statistics Frequentist and Bayesian Statistics is the science of information gathering, especially when the information
ALIANTE RACE AND SPORTS BOOK HOUSE RULES
ALIANTE RACE AND SPORTS BOOK HOUSE RULES GENERAL RACE BOOK RULES 1. Aliante Race and Sports Book reserves the right to refuse any wager, prior to its acceptance. 2. Aliante Race and Sports Book is not
Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University [email protected]
Bayesian Machine Learning (ML): Modeling And Inference in Big Data Zhuhua Cai Google Rice University [email protected] 1 Syllabus Bayesian ML Concepts (Today) Bayesian ML on MapReduce (Next morning) Bayesian
Imputing Missing Data using SAS
ABSTRACT Paper 3295-2015 Imputing Missing Data using SAS Christopher Yim, California Polytechnic State University, San Luis Obispo Missing data is an unfortunate reality of statistics. However, there are
Exhibition & Event Industry Labor Rates Survey
Exhibition & Event Industry Labor Rates Survey Event Labor and Material Handling Cost Averages in Over 40 North American Cities Special Report Produced by RESEARCH & CONSULTING Font: Ocean Sans MM 648
Forecasting in Financial and Sports Gambling -Markets
Forecasting in Financial and Sports Gambling -Markets Adaptive Drift Modeling William S. Mallios Craig School of Busines California State University, Fresno Fresno, California WILEY A JOHN WILEY & SONS,
Fixed odds bookmaking with stochastic betting demands
Fixed odds bookmaking with stochastic betting demands Stewart Hodges Hao Lin January 4, 2009 Abstract This paper provides a model of bookmaking in the market for bets in a British horse race. The bookmaker
Statistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
Chapter 4 Lecture Notes
Chapter 4 Lecture Notes Random Variables October 27, 2015 1 Section 4.1 Random Variables A random variable is typically a real-valued function defined on the sample space of some experiment. For instance,
