A Bayesian Model to Enhance Domestic Energy Consumption Forecast



Similar documents
11. Time series and dynamic linear models

Gaussian Processes to Speed up Hamiltonian Monte Carlo

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Using SAS PROC MCMC to Estimate and Evaluate Item Response Theory Models

Model-based Synthesis. Tony O Hagan

Centre for Central Banking Studies

Bayesian Statistics: Indian Buffet Process

Web-based Supplementary Materials for Bayesian Effect Estimation. Accounting for Adjustment Uncertainty by Chi Wang, Giovanni

PS 271B: Quantitative Methods II. Lecture Notes

Markov Chain Monte Carlo Simulation Made Simple

Analyzing Clinical Trial Data via the Bayesian Multiple Logistic Random Effects Model

Applications of R Software in Bayesian Data Analysis

Tutorial on Markov Chain Monte Carlo

Bayesian Statistics in One Hour. Patrick Lam

Handling attrition and non-response in longitudinal data

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT

Basics of Statistical Machine Learning

More details on the inputs, functionality, and output can be found below.

BayesX - Software for Bayesian Inference in Structured Additive Regression

Chenfeng Xiong (corresponding), University of Maryland, College Park

Estimating Industry Multiples

The HB. How Bayesian methods have changed the face of marketing research. Summer 2004

STA 4273H: Statistical Machine Learning

Simple Methods and Procedures Used in Forecasting

How To Understand The Theory Of Probability

A Comparative Study of the Pickup Method and its Variations Using a Simulated Hotel Reservation Data

Model Calibration with Open Source Software: R and Friends. Dr. Heiko Frings Mathematical Risk Consulting

Basic Bayesian Methods

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

Sampling for Bayesian computation with large datasets

Objections to Bayesian statistics

Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network

Learning outcomes. Knowledge and understanding. Competence and skills

1 Prior Probability and Posterior Probability

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Imputing Missing Data using SAS

itesla Project Innovative Tools for Electrical System Security within Large Areas

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Validation of Software for Bayesian Models Using Posterior Quantiles

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network

STAT3016 Introduction to Bayesian Data Analysis

SAS Certificate Applied Statistics and SAS Programming

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)

Foundations of Statistics Frequentist and Bayesian

An Introduction to Using WinBUGS for Cost-Effectiveness Analyses in Health Economics

Nonparametric adaptive age replacement with a one-cycle criterion

Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

A Wavelet Based Prediction Method for Time Series

Modeling the Distribution of Environmental Radon Levels in Iowa: Combining Multiple Sources of Spatially Misaligned Data

Analysis of Bayesian Dynamic Linear Models

A Study on the Comparison of Electricity Forecasting Models: Korea and China

A Bayesian Antidote Against Strategy Sprawl

Statistics Graduate Courses

Bayesian Methods for the Social and Behavioral Sciences

Variations of Statistical Models

Probabilistic Models for Big Data. Alex Davies and Roger Frigola University of Cambridge 13th February 2014

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University

Monte Carlo Methods and Models in Finance and Insurance

Least Squares Estimation

Deterministic Sampling-based Switching Kalman Filtering for Vehicle Tracking

Analysis of Financial Time Series

Reliability estimators for the components of series and parallel systems: The Weibull model

A New Quantitative Behavioral Model for Financial Prediction

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

Constrained Bayes and Empirical Bayes Estimator Applications in Insurance Pricing

MSCA Introduction to Statistical Concepts

Stock Investing Using HUGIN Software

Monte Carlo-based statistical methods (MASM11/FMS091)

One-year reserve risk including a tail factor : closed formula and bootstrap approaches

A Fuel Cost Comparison of Electric and Gas-Powered Vehicles

CHAPTER 11 FORECASTING AND DEMAND PLANNING

Sampling via Moment Sharing: A New Framework for Distributed Bayesian Inference for Big Data

Markov Chain Monte Carlo and Applied Bayesian Statistics: a short course Chris Holmes Professor of Biostatistics Oxford Centre for Gene Function

INTELLIGENT ENERGY MANAGEMENT OF ELECTRICAL POWER SYSTEMS WITH DISTRIBUTED FEEDING ON THE BASIS OF FORECASTS OF DEMAND AND GENERATION Chr.

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Market Risk Analysis. Quantitative Methods in Finance. Volume I. The Wiley Finance Series

Fairfield Public Schools

A Multi-level Artificial Neural Network for Residential and Commercial Energy Demand Forecast: Iran Case Study

Introduction to Bayesian Analysis Using SAS R Software

Non Linear Dependence Structures: a Copula Opinion Approach in Portfolio Optimization

Prediction of Stock Performance Using Analytical Techniques

A State Space Model for Wind Forecast Correction

REAL-TIME PRICE FORECAST WITH BIG DATA

Comparing Artificial Intelligence Systems for Stock Portfolio Selection

Models for Product Demand Forecasting with the Use of Judgmental Adjustments to Statistical Forecasts

Linear regression methods for large n and streaming data

Bayesian Factorization Machines

A Robustness Simulation Method of Project Schedule based on the Monte Carlo Method

Program description for the Master s Degree Program in Mathematics and Finance

Supervised Learning (Big Data Analytics)

Transcription:

Proceedings of the 2012 International Conference on Industrial Engineering and Operations Management Istanbul, Turkey, July 3 6, 2012 A Bayesian Model to Enhance Domestic Energy Consumption Forecast Mohammad Anwar Rahman University of Southern Mississippi Hattiesburg, MS 39406, USA Mohammad Y. El-Sharkh University of South Alabama Mobile, Alabama 36688-002, USA Asheka Rahman University of Southern Mississippi Hattiesburg, MS 39406, USA Abstract Precise forecast of energy consumption demand is difficult because of its rapid growth, however, it is considered vital for energy planning, development mechanism and economic success. In this study, a Bayesian statistical technique is applied to a dynamic linear model to predict a complex energy consumption demand problem. The time series considered is a real case of nonlinear, non-stationary household energy demand data. The forecast is generated from the marginal posterior distribution of all the predictants of the proposed model. The forecast result is tested and compared with actual data and two alternate models. Bayesian dynamic leaner model has proven to be efficient, and possess the capacity to estimate values of uncertain parameters of large scale combinatorial problems. Test results indicate that Bayesian model can be preferred over the classical artificial neural networks and the statistical model due to better capacity to solve multi-dimensional forecasting problems. Keywords: Energy consumption, demand, Bayesian dynamic linear model. 1. Introduction The ability to learn energy consumption demand is of vital importance to many companies in developing countries. Energy prediction leads to a cost effective and efficient energy planning such as energy planning, production mechanism and distribution. Central to any energy planning and control system is the statistical forecast. There are a large variety of forecasting techniques available, ranging from a simple moving average to more sophisticated techniques such a Box Jenkins and classical artificial neural network models. Statistical technique such as regression models are often used to predict random demand for products, but the prediction process works well if the Bayesian statistical technique is applied to a statistical model. This article presents a technique based on application of Bayesian techniques on Statistical Model with particular emphasis to forecast a time series of energy consumption demand where data pattern changes annually and show nonlinear trends. The model developed also illustrated the influence of the energy input variables in the prediction. This is a fundamental progress with respect to classical statistical method. The effectiveness of the Bayesian model is tested and compared with the actual data and the forecast results produced by a classical artificial neural network (ANN) and an ordinary multiple regression model. There are many approaches considered for forecasting electricity demand, including multiple regression models [1-2], time-varying splines [3]), artificial neural networks [4]. Bayesian method is increasingly applied to many complex forecasting processes, researches and applications. Although it is a widely known statistical modeling approach, the Bayesian method has been used much less frequently for forecasting electricity demand [5]. The 1891

Bayesian model has the advantages over existing (statistical) forecasting techniques due to the ability to react due to any changes, adjustments and new occurrences outside of the routine model. For example, if there is a change in the (electricity consumption) demand due to various reasons, such as acquisition of new customers, or government policy for further urbanization, or natural disasters, the subjective view can be incorporated in the model by the experts judgment about the situation. Thereby, override can be made to make the adjustment of the intervention. When time series shows significance changes (alterations) in their (general) pattern, other models can have difficulty in picking up the change, thereby, cause higher level of forecast errors and variances. Since the Bayesian model captures these changes better than other models, it significantly reduces the variances, and the forecast errors. In most cases, Bayesian inference requires numerical approximation of analytically intractable integrals [6]. Bayesian inference provides the estimation of the values of the unknown model parameters using some prior beliefs of the parameters in which the prior beliefs are often expressed with the probability density functions. In order to forecast applying Bayesian techniques, the investigators require specifying a number of subjective choices some of which are determined through the iterative processes. The Bayesian model requires a prior belief about the test parameters before data are collected. The choices of prior density models for the parameters of the Bayesian models have limited effect in determining the values of the posterior parameters. An equally likely prior density function, known as flat prior, can be used for all parameters; however, alternative priors are often used for producing different posterior distributions for the test parameters. A computational advantage of the Bayesian approach is the ability to take into account of the uncertainty of parameters in the model [7]. The preference of Bayesian modeling can be perceived over classical ANN methods by the capacity and efficiency to solve multi-dimensional problems. One can find more subjective choices are essential in classical ANN model than Bayesian models. This paper is organized as follows. Section 2 discusses background of Bayesian methods. Section 3 discusses the development of Bayesian model using a real time series data. Section 4 compares the resulting forecast of Bayesian models with that of classical ANN model and identifies the effectiveness of the models. Section 5 presents the concluding remarks and discusses. Appendix presents pseudo code for the Bayesian model. 2. Bayesian Forecasting Method Bayesian statistical inferences can be used for estimation of unknown model parameters using a combination of prior knowledge and observed data to establish a probability distribution, known as the posterior, to describe the updated parameters in the model. In the application of the Bayesian methods, the values of unknown model parameters, θ: (θ 1,..., θ p ) are expressed with probability density functions, f(θ), based on (experts ) prior beliefs before collecting any new data. The dependence of observations X= (x 1,..., x t ) on model parameters θ can be expressed as the probability density function as L(X θ), which is often referred to as the likelihood function. The prior probability density functions are modified by the likelihood function as the new data becomes available and yield to the posterior distribution function. Using Bayes theorem, updating is performed as: f(θ)l(x θ) f(θ X) = θ f(θ)l(x θ)dθ where π(θ X) is called the posterior distribution and expresses the probability of the parameters after observing new data. Once the posterior distribution is available, any features of θ, such as the marginal distributions or means and variances of the individual parameter θ i, as well as the predictive distribution of future observations, require integrating over the posterior distribution. For example, the marginal posterior distribution of an individual θ i can be calculated as f(θ X) = f(θ X) dθ, where θ θ -1 represents all θ s except θ i [13]. Therefore, the forecast results in 1 the Bayesian models is expressed by the expectation of a function of interest, f(θ X), evaluated over the posterior distribution as E(f(θ X) = f(θ X)f(θ)dθ (2) θ 1 where E denotes the expectation operator. In scientific inference, the limitation of Bayesian methods is the analytical solutions for the required integrations over the likelihood and the prior density functions. For Bayesian inference, limited combinations of closed form integrals for demand model and the conjugate prior probability functions were available. For most nonlinear and high-dimensional models, the inability to solve such integrals made the implementation of Bayes theorem increasingly difficult. However, the advent of technology, software and the development of several methods made the complex integrals performed to make the posterior Bayesian inference. Bayesian parameter estimation method (1) 1892

now can be applied when the closed-form solution to a function is not available [6]. It is required to use the sampling based approaches, particularly the Markov Chain Monte Carlo (MCMC) method and Gibbs Sampling, to solve this type of problem. Recent improvement in sampling-based MCMC methods played a significant role in demand forecasting through Bayesian statistical application. There are many discussions of Bayesian statistical models available in literature. Gelman et al. [13] provided that if the observed data are sufficiently informative, the posterior distribution will be sharp enough to yield reasonably precise predictions. The learning follow the Initialization the dataset Selection of a dynamic linear model, includes all input variables Random selection of the prior following a probability (normal) distribution Stimuli iteration through the MCMC process Parameter update using the Equations (1) and (2) Collection of the 5000 trajectory from the posterior distribution 3. The Case Study A real time series data of household electricity demand presented by Azadeh and Faiz [14] is used to develop the Bayesian model, and compare. Generally, electricity demands are lumpy and greatly varying over the seasons. Traditional forecasting methods are not appropriate to predict such demand that has rapid growth and large fluctuations. The application here is to observe the performance of Bayesian model, comparing the forecast of classical ANN, and ordinary dynamic linear methods. The household electricity demand is integrated with five input variables, (i) electricity prices, (2) TV price index, (3) refrigerator price index, (4) urban household size, and (5) urban household income. The household electricity demand is a function of input variables, f(x 1, x 2, x 5 ), where x 1, x 2, x 5 is assigned to each input variable. The time series dataset from 1974 to 1995 are used for developing the forecasting models and dataset from 1996 to 2003 are used to test the efficiency of the models and compare. The flexibility of Bayesian methods are the capacity to use non-static parameters and accessibility to insert new data (or information) into the model as it appears enable the estimation process to improve continuously. Bayesian approaches combine the observed data and the uncertainty of the unknown parameters to provide posterior information. The focus here is to get the posterior or marginal probability f(θ X) based on the likelihood L(X θ)f(θ) and the observed data. The model relationship with the input variables are with c i for i = 1,, 5 as constant. The training model for period (t; from 1973 to 1995) is expressed as follows: f(x 1, x 2, x 3, x 4, x 5 )= Normal (μ[t], τ) where μ[t]= α + c 1 x 1(t) + c 2 x 2(t) + c 3 x 3(t) + c 4 x 4(t) + c 5 x 5(t), and τ is the precision, reciprocal of variance. In Bayesian methods when prior information are not available, a common approach is to apply highly diffused probability distributions as the non-informative prior, such as Normal distribution with mean 0 and inverse of variance 10 4 10 8. Initially little knowledge is available about the coefficient of the input variables (p 1, p 2,..., p 5 ). The Normal (0, 10 4 ) is assumed as the prior distributions for each of these coefficients [Appendix A]. The parameter τ follows a chi squared distribution with one degree of freedom. The coefficients of the input variables are estimated based on the likelihood function using dataset from 1973 to 1995 and the prior distributions. After deriving the values for each of the coefficients (b 1 to b 5 ), instead of using the static values, the model uses the dynamic form of the parameters to forecast. The forecast for the period (t; from 1996 to 2003) is expressed as follows: f(x 1, x 2, x 3, x 4, x 5) = Normal (μ.new [t], τ); where μ.new (t) = α + c 1.new x 1(t) + c 2.new x 2(t) +c 3.new x 3(t)+ c 4.new x 4(t)+ c 5.new x 5(t) and c i.new = Normal (c i [t], τ 1), for i ε (1, 5). The choice of prior distribution is followed by [2, 8]. In the Bayesian model, given the time series data, a Monte Carlo Markov chain (MCMC) is constructed using the Gibbs sampler algorithm for conducting posterior inference of Bayesian model through simulation. The summary statistics such as mean, median, standard deviation and Bayesian confidence intervals for each parameter are estimated using the MCMC samples from the posterior distributions in Bayesian procedure. The sample mean, standard deviation (SD) along with 95% confidence level are reported in Table 1. 1893

Table 1: Estimate of the Parameters Mean SD 2.50% Median 97.50% α 4.325 31.72-56.98 4.232 66.72 x 1 143.9 128-97.66 143.1 398.2 x 2-0.6189 122.6-302 11.43 214.1 x 3 78.03 106.6-130.6 77.51 283.6 x 4 229.5 151-40.06 220.9 552.3 x 5 108.5 109-102.2 107.6 319.2 4. Bayesian Results and Model Comparison Bayesian approaches are the preferred methods capable to model nonlinear and non-stationary time series demand. Using the parameters values presented in Table 1 and the corresponding density functions, the forecast obtained by posterior density function. In Figure 1, the Bayesian model is tested with actual data and compared with classical ANN forecasts. A result of classical ANN is borrowed from Azadeh and Faiz [9]. The resulting forecast obtained from Bayesian model is on last eight years for the period 1996 to 2003 and compared with the classical ANN. The estimates obtained by the Bayesian SLM is the differences between the mean of the sampled values (the estimate of the posterior mean for each parameter) and the true posterior mean [15]. Forecast by ordinary multiple regression method is not comparable with Bayesian model and Classical ANN. A public domain software package WinBUGS [17] is used to generates feasible posterior samples via MCMC simulations in minimum computational time. The convergence of the MCMC algorithm is checked by the ANOVA based-method integrated in Win BUGS [15]. The expected trajectory of annual averaged of the posterior sample has been used as the Bayesian forecast. A posterior sample of 5000 predicted trajectories has been collected and compared to actual data and classical ANN prediction. 38000 36000 Bayesian Actual C-ANN 34000 32000 30000 28000 26000 24000 22000 20000 1996 1997 1998 1999 2000 2001 2002 2003 Figure 1: Household Electricity Demand Forecast Using Bayesian Model The results of the both models were impressive and closely emerged. However, Bayesian model appears to be flexible, comprehensive, and accessible in consolidating experts judgment about future demand and observed data 1894

to predict the future. The results derived by Bayesian model has marginally edged over ANN model, suitable for enterprise forecast practices. This study also validated the forecasting results obtained by Bayesian model, classical ANN, and ordinary RM with respect to actual data shown in Table 2. Numerical values of the forecasts show that the Mean Square Error (MSE), Mean Absolute Percentage error (MAPE), and Mean absolute error (MAE) of Bayesian model is less than that of classical ANN model, and both models are significantly superior to ordinary multiple regression (OMR) model. Table 2: Predictive Performance Comparison MSE MAPE MAE Bayesian 0.0208 0.0335 0.0089 Classical ANN 0.022 0.0351 0.0108 OMR 0.1968 0.1339 0.0411 Electricity consumption prediction involves a process that is capable to fit a rapidly increasing time series with minimal data points. The precision in predicting electricity demand are of keen interests in every business. Accurate prediction is indispensable for energy sector operations management in order to harmonize business functions. The model presented in this paper uses existing energy consumption and other energy input variables such as electricity price, household electric appliances, and urban conditions through a stochastic simulation process. Energy consumption prediction is providing a way to help commercial and residential energy planning, development mechanism and economic success at the lowest cost in the shortest time. 5. Conclusion Managing demand and planning for products such as electricity demand, gasoline, and industrial energy usages are of special interests. Forecasts of such rapidly grown demand have become increasingly complex. Corporate management integrates various resource portfolios such as planning, production, purchasing, and bidding strategies with products demand forecast to maximize profit and minimize risks. It is widely recognized that the judgment-based forecasting procedures are not adequate to draw a fair forecast conclusion. In contrast, simply replacing the management judgment about future occurrences with advanced data extrapolation methods with no judgmental input is also unrealistic. Appropriate forecast software, its userfriendliness, and the capacity to precise prediction had been issues to forecast practitioners. In many cases, practitioners do not follow the development of concurrent software and technological advancement. In light of current practices, both Bayesian approaches methods have appeared to be suitable for the enterprise forecasters. There is a handful of forecasting methods capable to predict rapidly grown and large fluctuating demand. This paper presented a frame work for application of Bayesian statistical technique on Dynamic Linear Model with an illustrative problem of predicting household electricity demand in a developing region. Software package for the application of Bayesian statistical techniques are mostly open sourced. It has been perceived these software are user friendly, easier to implement in complex model applications and capable to combine experts judgment with observed sale data in the modeling process to find the best forecast solutions. References [1] Ramanathan R., Engle R., Granger C.W.J., Vahid-Araghi F., Brace C., 1997 Short-run forecasts of electricity loads and peaks, International Journal of Forecasting, 13, 161 174. [2] Taylor, J.W. and Buizza, R., 2003, Using weather ensemble predictions in electricity demand forecasting, International Journal of Forecasting, 19, 57 70. [3] Harvey, A.C., and Koopman, S.J., 1993, Forecasting hourly electricity demand using time-varying splines, Journal of the American Statistical Association, 88, 1228 1236. 1895

[4] Connor, J.T., 1996, A robust neural network filter for electricity demand prediction, Journal of Forecasting, 15, 436 458. [5] Cottet, R. and Smith, M., 2003, Bayesian modeling and forecasting of intraday electricity load, Journal of the American Statistical Association, 98(464), 839 849. [6] Gelfand A.E. and Smith, A.F.M., 1990, Sampling-based approaches to calculating marginal densities, Journal of the American Statistical Association, 85, 398 409. [7] P. Congdon, 3001, Bayesian Statistical Modeling. Chichester (UK): Wiley. [8] Yalcinoz, T. and Eminoglu, U., 2005, Short term and medium term power distribution load forecasting by Neural Networks, Energy Conversion and Management, 46, 1393 1405. [9] Peng, T.M., Hubele, F.N. and Karady, G.F., 1992, Advancement in the application of neural networks for short-term load forecasting, IEEE Transactions on Power Systems, 7, 250 256. [10] Mamun, M. Al. and Nagasaka, K., 2004, Artificial neural network applied to long term electricity demand forecasting, in: Proc. of the 4th Int. Conference on Hybrid Intelligent System, Kitakyushu, Japan, 204 209. [11] Hamzacebi, C., Akay, D. and Kutay, F., 2009, Comparison of direct and iterative artificial neural network forecast approaches in multi-periodic time series forecasting, Expert Systems with Applications, 36, 3839 3844. [12] Qian, S.S. Stow, C.A. and Borsuk, M.E., 2003, On Monte Carlo methods for Bayesian inference, Ecological Modeling, 159, 269-277. [13] A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin, Bayesian Data Analysis (2 nd ed.). Boca Raton, FL: Chapman and Hall/CRC, 2003. [14] Azadeha, A. and Faiza, Z.S. A meta-heuristic framework for forecasting household electricity consumption, Applied Soft Computing, 11(1), 614-620. [15] Brooks S.P., Gelman A. 1998, Alternative methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics, 434 455. [16] WinBUGS Tutorial available at http://mathstat.helsinki.fi/openbugs (accessed on November, 2011). [17] The BUGS project Bayesian inference Using Gibbs Sampling, version 1.4.3, available at: http://www.mrcbsu.cam.ac.uk/bugs (accessed on November, 2011). 1896