Dynamic Pricing for Smart Grid with Reinforcement Learning

Byung-Gook Kim, Yu Zhang, Mihaela van der Schaar, and Jang-Won Lee
Samsung Electronics, Suwon, Korea
Department of Electrical Engineering, UCLA, Los Angeles, USA
Department of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea

Abstract

In the smart grid system, dynamic pricing can be an efficient tool for the service provider which enables efficient and automated management of the grid. However, in practice, the lack of information about the customers' time-varying load demand and energy consumption patterns and the volatility of electricity price in the wholesale market make the implementation of dynamic pricing highly challenging. In this paper, we study a dynamic pricing problem in the smart grid system where the service provider decides the electricity price in the retail market. In order to overcome the challenges in implementing dynamic pricing, we develop a reinforcement learning algorithm. To resolve the drawbacks of the conventional reinforcement learning algorithm such as high computational complexity and low convergence speed, we propose an approximate state definition and adopt virtual experience. Numerical results show that the proposed reinforcement learning algorithm can effectively work without a priori information of the system dynamics.

I. INTRODUCTION

In the smart grid system, thanks to the real-time information exchange through communication networks, customers can schedule the operation of their appliances according to the change of electricity price via the automated energy management system equipped in households, which we refer to as demand response [1]. From the customers' perspective, previous works on the load scheduling focus on directly controlling the energy consumption of the residential appliances. For example, in our previous work [2], we proposed two different load scheduling algorithms for a collaborative and a non-collaborative smart grid system by taking into account the customers' bidirectional energy trading capability via electric vehicles.
Most of these works aim at maximizing the social welfare of the smart grid system assuming that the pricing policies are predetermined in the electricity market by the service provider. Consequently, the service provider is regarded as a passive and indifferent entity whose role in the smart grid system is significantly limited.

On the contrary, from the service provider's perspective, dynamic pricing is an attractive tool that enables efficient grid operation in terms of both efficient energy consumption and automated management. Our paper is related to this second strand of literature. Specifically, we focus on a scenario where the service provider can adaptively decide the retail electricity price based on the customers' load demand level and the wholesale price such that it minimizes either the customers' disutility (in the case of a benevolent service provider) or its own cost (in the case of a profit-making service provider). Although dynamic pricing does not directly control each customer's load scheduling, appropriate pricing can give considerable benefits to the smart grid system by encouraging the customers to consume energy in a more efficient way.

(This work was supported in part by Mid-career Researcher Program through NRF grant funded by the MSIP, Korea (213R1A2A2A6953), and by USA National Science Foundation (NSF) grant CCF.)

Recently, there have been several works on dynamic pricing for smart grid [3][4][5][6][7][8]. In [3] and [4], dynamic pricing problems were studied aiming at maximizing the social welfare. Considering a smart grid system with multiple residences and a single service provider, optimal dynamic pricing schemes were proposed based on dual decomposition approaches. The authors in [5] focused on the smart grid system with non-cooperative customers, where the conventional optimization approach used in [3] cannot be applied. To overcome the lack of cooperation of customers, a simulated annealing-based dynamic pricing algorithm was developed.
In a similar context, the authors in [6] modeled a dynamic pricing problem as a Stackelberg game where the service provider decides the retail price and each selfish customer decides the schedule for its appliances according to the price. In [7], the authors developed an incentive-based dynamic pricing scheme which allows the service provider to decide the incentive for the customers who shift their appliance usage from peak hours to off-peak hours. In [8], the authors introduced a two-timescale dynamic pricing scheme to incorporate both the customers with day-ahead scheduling and the customers with real-time scheduling. To take into account the uncertainties of energy supply and demand, the authors formulated a Markov decision process (MDP) problem and developed an online algorithm.

Despite those previous efforts, there still exist several critical challenges in implementing dynamic pricing for demand response. First, in the practical smart grid system, it is not easy for the service provider to obtain customer-side information such as the customers' current load demand levels, the transition probability of the demand levels, and the customer-specific utility models, including the willingness to purchase electric energy given their load demand level and the retail price. Second, even if the service provider can obtain that information, it will suffer from various system dynamics and uncertainties. The service provider, which lies between the utility company and the customers, may not obtain perfect information about those system dynamics a priori. Finally, the service provider is required to have the ability to estimate the impact of its current pricing decision on the customers' future behavior. In fact, the current price influences not only the customers' current energy consumption but also their energy consumption for the next several hours or the next day. However, it is not easy for the service provider to calculate the optimal price considering the future influence of the current price without detailed customer-side information. Thus, most existing works on dynamic pricing for smart grid have taken myopic approaches, where the algorithms for dynamic pricing and demand-side load scheduling are conducted within a given time period without considering the long-term performance of the smart grid system.

In order to overcome the aforementioned challenges of dynamic pricing, in this paper, we use reinforcement learning to allow the service provider to learn the behaviors of customers and the change of wholesale price to make an optimal pricing decision. We consider various stochastic dynamics of the smart grid system including the customers' dynamic demand generation and energy consumption, and wholesale price changes. Based on the considered system model, we formulate an MDP problem where the service provider decides the retail electricity price based on the observed system state transition to minimize its expected total cost or the customers' disutility. Contrary to the previous works [3]-[8] with simplified models, in this paper, we consider a more realistic system model to incorporate the customers' demand generation and load demand change as well as the wholesale market dynamics, where the wholesale electricity price can be changed by the utility company at each time-slot. To solve the MDP problem without a priori information about the change of the customers' load demand level, we adopt the Q-learning algorithm, and to resolve the existing drawbacks of the conventional Q-learning algorithm, we propose the following two improvements. First, to reduce the complexity of the Q-learning algorithm, which mainly comes from the large number of customers, we propose an alternative state definition based on the observed total energy consumption. Second, to improve the learning speed, we adopt virtual experience in Q-learning updates.

The rest of this paper is organized as follows. In Section II, the system model is presented. In Section III, we define the dynamic pricing problem and develop the reinforcement learning-based dynamic pricing algorithm. We provide numerical results in Section IV and finally conclude in Section V.

II. SYSTEM MODEL

We consider a smart grid system which consists of one service provider and a set of customers $\mathcal{I}$ as in Fig. 1. The smart grid system operates in a time-slotted fashion, where each time-slot has an equal duration. At each time-slot $t$, the service provider buys electric energy from the utility company through a wholesale electricity market and provides it to the customers through a retail electricity market. In the retail electricity market, at each time-slot $t$, the service provider determines the retail pricing function $a^t : \mathbb{R}_+ \to \mathbb{R}_+$ and charges each customer $i$ an electricity bill $a^t(e_i^t)$, where $e_i^t$ denotes customer $i$'s energy consumption at time-slot $t$. We define the set of retail pricing functions as $\mathcal{A}$ and assume that the number of retail pricing functions, $|\mathcal{A}|$, is finite. At each time-slot, each customer generates its electricity load demand and decides the amount of energy consumption based on its current load demand level and the retail pricing function.

[Fig. 1. Smart grid system. Labels: Utility Company, wholesale pricing function $c^t(\cdot)$; Service Provider, retail pricing function $a^t(\cdot)$; Customers, energy consumption $\hat{e}_i^t(d_i^t)$; flow of information; flow of electric energy.]

We assume that the customers' average demand generation rate and the wholesale pricing function at a time-slot can vary depending on its actual time in a day. Moreover, this time-dependency of customers' energy consumption may influence the utility company's decision on the wholesale pricing function. To model this time-dependency of the demand generation rate and the wholesale pricing function, we introduce a set of periods $\mathcal{H} = \{0, 1, \ldots, H-1\}$, each of which represents an actual time in a day. We map each time-slot $t$ to one period $h \in \mathcal{H}$, denoting the period at time-slot $t$ by $h^t$. We assume that the sequence of periods $h^t$, $t = 0, 1, 2, \ldots$, is predetermined and repeated every day. For example, if one day consists of $H = 24$ periods (i.e., 24 hours), each time-slot $t$ is mapped to one period in $\mathcal{H} = \{0, 1, \ldots, 23\}$ and the mapping between time-slots and periods can be represented as
$$h^t = \mathrm{mod}(t, H), \quad \forall t. \tag{1}$$

A. Model of Customer's Response

In each time-slot, each customer $i$ has an accumulated load demand, which is defined as the total amount of energy that it wants to consume for its appliances in that time-slot. (For convenience, "accumulated load demand" and "load demand" are used interchangeably in the rest of this paper.) We denote the amount of the accumulated load demand of customer $i$ at time-slot $t$ by $d_i^t \in \mathcal{D}_i$, where $\mathcal{D}_i$ is the set of customer $i$'s accumulated load demand levels. Once customer $i$ consumes energy $e_i^t$ at time-slot $t$, the corresponding amount of customer $i$'s load demand is satisfied, and the rest of the accumulated load demand, $d_i^t - e_i^t$, is not satisfied; we call it the remaining load demand. When the remaining load demand of a customer is greater than 0, it causes some degree of dissatisfaction to the customer at that time-slot. To capture this dissatisfaction from the remaining load demand, we introduce a disutility function $u_i : \mathbb{R}_+ \to \mathbb{R}_+$ for each customer $i$. We assume that $u_i(\cdot)$ is an increasing function of the remaining load demand $d_i^t - e_i^t$. Based on the disutility $u_i(d_i^t - e_i^t)$ and the electricity bill $a^t(e_i^t)$ that customer $i$ has to pay to the service provider, we define customer $i$'s cost at each time-slot $t$ as
$$\phi_i^t(d_i^t, e_i^t) = u_i(d_i^t - e_i^t) + a^t(e_i^t). \tag{2}$$

We assume that each customer tries to minimize its cost at each time-slot by deciding the amount of its energy consumption, and let $\hat{e}_i^t(d_i^t)$ denote customer $i$'s energy consumption decision that minimizes its cost. (Customer $i$'s energy consumption decision is also a function of the retail pricing function $a^t$. However, for simplicity of expression, we represent it as $\hat{e}_i^t(d_i^t)$.) That is,
$$\hat{e}_i^t(d_i^t) = \arg\min_{0 \le e_i^t \le \min(e_i^{\max},\, d_i^t)} \phi_i^t(d_i^t, e_i^t), \tag{3}$$
where $e_i^{\max}$ is the maximum amount of energy that customer $i$ can consume at each time-slot due to physical limitations of the grid.

We assume that a portion of each customer's remaining load demand at a time-slot is carried forward to the next time-slot. We call the corresponding load demand the demand backlog and represent it as $\lambda_i (d_i^t - \hat{e}_i^t(d_i^t))$, where $0 \le \lambda_i \le 1$ is the backlog rate of load demand that determines the amount of demand backlog of customer $i$. At each time-slot $t$, each customer $i$ randomly generates its new load demand, $D_i^t(h^t)$, whose distribution is assumed to depend on the current period $h^t$. At the beginning of each time-slot $t+1$, according to the demand backlog at the previous time-slot $t$, $\lambda_i (d_i^t - \hat{e}_i^t(d_i^t))$, and the newly generated load demand, $D_i^{t+1}(h^{t+1})$, customer $i$'s accumulated load demand $d_i^{t+1}$ is updated as
$$d_i^{t+1} = \lambda_i \big(d_i^t - \hat{e}_i^t(d_i^t)\big) + D_i^{t+1}(h^{t+1}). \tag{4}$$
Here, if $\lambda_i = 0$, no remaining demand at a time-slot is carried forward, generating no demand backlog, whereas if $\lambda_i = 1$, all the remaining demand at a time-slot is carried forward to the next time-slot. It is worth noting that the transition probability of the load demand from $d_i^t$ to $d_i^{t+1}$ depends only on the accumulated load demand $d_i^t$, the period $h^t$, and the retail pricing function $a^t$ at time-slot $t$, and we represent it as $p_d(d_i^{t+1} \mid d_i^t, h^t, a^t)$.

B. Wholesale Electricity Market

At each time-slot $t$, the service provider buys electric energy, which corresponds to the total amount of energy consumption of customers, $\sum_{i \in \mathcal{I}} \hat{e}_i^t(d_i^t)$, from the utility company in the wholesale electricity market as illustrated in Fig. 1. The utility company charges the service provider a wholesale electricity cost based on a wholesale pricing function $c^t : \mathbb{R}_+ \to \mathbb{R}_+$, where $c^t$ is a function of the total amount of energy consumption $\sum_{i \in \mathcal{I}} \hat{e}_i^t(d_i^t)$. We assume that $c^t$ is selected among a finite number of wholesale pricing functions in set $\mathcal{C}$ and that its transition probability from $c^t$ to $c^{t+1}$ depends on the current wholesale pricing function, $c^t$, and the current period, $h^t$, and thus it can be represented as $p_c(c^{t+1} \mid c^t, h^t)$.
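The customer model of Section II-A can be sketched in a few lines of Python. This is only an illustrative sketch: it assumes the quadratic disutility $u_i(x) = \kappa x^2$ and the linear retail price $a^t(e) = \chi e$ that the paper adopts later in Section IV, it replaces the exact minimization in (3) by a search over an evenly spaced grid, and the function names (`customer_cost`, `best_response`, `next_demand`) and all numbers are hypothetical.

```python
def customer_cost(d, e, kappa, chi):
    """Cost (2): disutility of the remaining demand plus the electricity bill.

    Assumes u(x) = kappa * x**2 and a(e) = chi * e (illustrative forms).
    """
    return kappa * (d - e) ** 2 + chi * e


def best_response(d, kappa, chi, e_max, steps=1000):
    """Decision (3): search an even grid over 0 <= e <= min(e_max, d)."""
    upper = min(e_max, d)
    grid = [upper * k / steps for k in range(steps + 1)]
    return min(grid, key=lambda e: customer_cost(d, e, kappa, chi))


def next_demand(d, e_hat, lam, new_demand):
    """Update (4): backlog of the unmet demand plus newly generated demand."""
    return lam * (d - e_hat) + new_demand


# Hypothetical numbers: demand 5, kappa = 0.1, chi = 0.4, cap e_max = 10.
e_hat = best_response(5.0, 0.1, 0.4, 10.0)
d_next = next_demand(5.0, e_hat, 0.5, 2.0)
```

Under these assumed forms the cost is convex in $e$, and setting its derivative $-2\kappa(d - e) + \chi$ to zero gives $e = d - \chi/(2\kappa)$, clipped to the interval $[0, \min(e^{\max}, d)]$. The grid search above should agree with this closed form up to the grid resolution.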
We define the service provider's cost function at each time-slot $t$ as a function of the customers' load demand vector $\mathbf{d}^t = [d_i^t]_{i \in \mathcal{I}}$, the wholesale electricity pricing function $c^t$, and the retail pricing function $a^t$, i.e.,
$$\psi^t(\mathbf{d}^t, c^t, a^t) = c^t\Big(\sum_{i \in \mathcal{I}} \hat{e}_i^t(d_i^t)\Big) - \sum_{i \in \mathcal{I}} a^t\big(\hat{e}_i^t(d_i^t)\big), \tag{5}$$
where the first term denotes the total wholesale electricity cost that the service provider has to pay to the utility company and the second term denotes the service provider's revenue from selling energy to the customers.

III. REINFORCEMENT LEARNING ALGORITHM

In this section, based on the smart grid system introduced in the previous section, we first formulate a dynamic pricing problem in the framework of MDP. Then, by using reinforcement learning, we develop an efficient and fast dynamic pricing algorithm which does not require the information about the system dynamics and uncertainties.

A. Problem Formulation

We formulate the dynamic pricing problem in the smart grid system as an MDP problem, which is defined by a set of decision maker's actions, a set of system states and their transition probabilities, and a system cost function for the decision maker. In our MDP problem, the decision maker is the service provider whose action is choosing a retail pricing function $a^t \in \mathcal{A}$ at each time-slot $t$. We define the state of our smart grid system at time-slot $t$ as the combination of the accumulated load demand vector, $\mathbf{d}^t$, the current period, $h^t$, and the wholesale pricing function, $c^t$, i.e.,
$$s^t = (\mathbf{d}^t, h^t, c^t) \in \mathcal{S}, \tag{6}$$
where $\mathcal{S} = \prod_{i \in \mathcal{I}} \mathcal{D}_i \times \mathcal{H} \times \mathcal{C}$. Since the transition of each customer's load demand (from $d_i^t$ to $d_i^{t+1}$), that of the period (from $h^t$ to $h^{t+1}$), and that of the wholesale pricing function (from $c^t$ to $c^{t+1}$) depend only on the state $s^t$ and action $a^t$ at time-slot $t$, the sequence of states $\{s^t, t = 0, 1, 2, \ldots\}$ follows a Markov decision process with action $a^t$.
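To make the cost in (5) and the size of the state space in (6) concrete, the following sketch evaluates the service provider's cost for given consumption decisions and counts the states. It is a sketch under assumptions: the pricing functions are passed in as plain callables, the quadratic wholesale and linear retail forms are placeholders, and every name and number here is hypothetical.

```python
from math import prod


def provider_cost(consumptions, wholesale, retail):
    """Cost (5): wholesale cost of the total load minus the retail revenue."""
    total = sum(consumptions)
    revenue = sum(retail(e) for e in consumptions)
    return wholesale(total) - revenue


def num_states(demand_levels_per_customer, num_periods, num_wholesale):
    """|S| for S = prod_i D_i x H x C as in (6)."""
    return prod(demand_levels_per_customer) * num_periods * num_wholesale


# Hypothetical example with three customers and assumed pricing forms.
wholesale = lambda x: 0.02 * x ** 2 + 0.1 * x
retail = lambda e: 0.4 * e
cost = provider_cost([3.0, 2.0, 1.0], wholesale, retail)

# Hypothetical sizes: 20 customers with 10 demand levels each,
# 24 periods and 7 wholesale pricing functions.
states = num_states([10] * 20, 24, 7)
```

With these made-up sizes the count is already $1.68 \times 10^{22}$ states, which illustrates why the original state definition becomes impractical as the number of customers grows.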
The transition probability from state $s^t = (\mathbf{d}^t, h^t, c^t)$ to state $s^{t+1} = (\mathbf{d}^{t+1}, h^{t+1}, c^{t+1})$ with given action $a^t$ can be represented as
$$p_s(s^{t+1} \mid s^t, a^t) = p_h(h^{t+1} \mid h^t)\, p_c(c^{t+1} \mid c^t, h^t) \prod_{i \in \mathcal{I}} p_d(d_i^{t+1} \mid d_i^t, h^t, a^t), \tag{7}$$
where $p_h(h^{t+1} \mid h^t)$ denotes the transition probability of the period from $h^t$ to $h^{t+1}$. We define the system cost for the service provider at each time-slot $t$ as the weighted sum of the service provider's cost and the customers' cost at the time-slot:
$$r^t(s^t, a^t) = (1 - \rho)\, \psi^t(\mathbf{d}^t, c^t, a^t) + \rho \sum_{i \in \mathcal{I}} \phi_i^t(d_i^t, \hat{e}_i^t), \tag{8}$$
where $\rho$ denotes the weighting factor that determines the relative importance between the service provider's cost and the customers' cost. We denote the stationary policy that maps states to actions (retail pricing functions) by $\pi : \mathcal{S} \to \mathcal{A}$, i.e., $a^t = \pi(s^t)$. The objective of our dynamic pricing problem is to find an optimal policy $\pi^*$ for each state $s \in \mathcal{S}$ that minimizes the expected discounted system cost of the service provider as in the following MDP problem (P):
$$\text{(P)}: \quad \min_{\pi : \mathcal{S} \to \mathcal{A}} \mathbb{E}\Big[\sum_{t=0}^{\infty} \gamma^t\, r^t\big(s^t, \pi(s^t)\big)\Big], \tag{9}$$
where $0 \le \gamma < 1$ is the discount factor which represents the relative importance of the future system cost compared with the present system cost. The optimal stationary policy $\pi^*$ can be well defined by using the optimal action-value function $Q^* : \mathcal{S} \times \mathcal{A} \to \mathbb{R}$ which satisfies the following Bellman optimality equation:
$$Q^*(s, a) = r(s, a) + \gamma \sum_{s' \in \mathcal{S}} p(s' \mid s, a)\, V^*(s'), \tag{10}$$
where $V^*(s')$ is the optimal state-value function [9], which is defined as
$$V^*(s') = \min_{a \in \mathcal{A}} Q^*(s', a), \quad \forall s' \in \mathcal{S}. \tag{11}$$
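When the transition probabilities are known, the Bellman optimality equations (10) and (11) can be solved by value iteration. The sketch below does this for a tiny, made-up cost-minimizing MDP. The two-state example, its costs, and the function name `value_iteration` are illustrative assumptions and are not part of the paper's model, which assumes the transition probabilities are not available.

```python
def value_iteration(P, r, gamma, tol=1e-10, max_iter=10000):
    """Solve (10)-(11) for a cost-minimizing MDP.

    P[s][a] is a dict {next_state: probability}; r[s][a] is the cost.
    Returns the action values Q, the state values V and a greedy policy.
    """
    n_states = len(P)
    V = [0.0] * n_states
    Q = [[0.0] * len(P[s]) for s in range(n_states)]
    for _ in range(max_iter):
        # Bellman backup (10) using the current estimate of V.
        Q = [[r[s][a] + gamma * sum(p * V[s2] for s2, p in P[s][a].items())
              for a in range(len(P[s]))] for s in range(n_states)]
        # State value (11): minimum over actions, since r is a cost.
        V_new = [min(q) for q in Q]
        converged = max(abs(x - y) for x, y in zip(V, V_new)) < tol
        V = V_new
        if converged:
            break
    policy = [min(range(len(Q[s])), key=lambda a: Q[s][a])
              for s in range(n_states)]
    return Q, V, policy


# Hypothetical two-state, two-action example with deterministic transitions.
P = [[{0: 1.0}, {1: 1.0}],
     [{0: 1.0}, {1: 1.0}]]
r = [[1.0, 0.0],
     [2.0, 3.0]]
Q, V, policy = value_iteration(P, r, gamma=0.5)
```

For this toy example the fixed point can be checked by hand: choosing action 1 in state 0 and action 0 in state 1 gives $V(0) = 0.5\,V(1)$ and $V(1) = 2 + 0.5\,V(0)$, so $V(0) = 4/3$ and $V(1) = 8/3$.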

Since $Q^*(s, a)$ is the expected discounted system cost with action $a$ in state $s$, we can obtain the optimal stationary policy as
$$\pi^*(s) = \arg\min_{a \in \mathcal{A}} Q^*(s, a). \tag{12}$$

In this paper, we use the well-known Q-learning algorithm to solve our MDP problem (P) without information of state transition probabilities. We refer the readers to [10] for more detail on the Q-learning algorithm. In the following subsections, in order to resolve the critical issues which make it difficult to apply the conventional Q-learning algorithm to our smart grid system, we propose an alternative state definition based on the observed total energy consumption as well as adopt virtual experience in Q-learning updates.

B. Energy Consumption-Based Approximate State (EAS)

When the size of the state space is large, the Q-learning algorithm requires not only a large memory space to store the state-action function $Q(s, a)$, but also a long time to converge. Moreover, in the practical smart grid system, it is difficult for the service provider to acquire or use the information about the customers' current load demands due to privacy. In order to resolve these difficulties, in this section, we propose an alternative definition of the system state, which is based on the observed total energy consumption, $\sum_{i \in \mathcal{I}} \hat{e}_i^{t-1}$, and the previously chosen action $a^{t-1}$. For notational convenience, we will omit $d_i^t$ in $\hat{e}_i^t(d_i^t)$ in the rest of this section.

The main idea of this alternative state definition comes from the fact that, since each customer's disutility function $u_i$ is a decreasing function of $\hat{e}_i^t$, from the load demand update process in (4), the retail pricing function and each customer's energy consumption at time-slot $t-1$ in a tuple $(\hat{e}_i^{t-1}, a^{t-1})$ characterizes the past accumulated load demand $d_i^{t-1}$. Hence, if the service provider knows the new load demand $D_i^t(h^t)$ at time-slot $t$, it can regard a different tuple $(\hat{e}_i^{t-1}, a^{t-1}, D_i^t(h^t))$ as a different actual accumulated load demand of customer $i$ at time-slot $t$.
Similarly, once a tuple $\big(\sum_{i \in \mathcal{I}} \hat{e}_i^{t-1}, a^{t-1}, \sum_{i \in \mathcal{I}} D_i^t(h^t)\big)$ is observed by the service provider, it approximately reflects the customers' overall load demands at time-slot $t$. It is worth noting that since $D_i^t(h^t)$ is an independent random variable for each customer, by the law of large numbers, the average of the sum of new load demands, $\sum_{i \in \mathcal{I}} D_i^t(h^t) / |\mathcal{I}|$, goes to its expected value as the number of customers gets larger. This implies that in the practical smart grid system with a large number of customers, a tuple $\big(\sum_{i \in \mathcal{I}} \hat{e}_i^{t-1}, a^{t-1}\big)$ provides enough information for the service provider to infer the customers' overall load demand level at time-slot $t$. Hence, instead of the original state $s^t$ in (6), we can use a new state definition based on the observed energy consumption, by which the service provider does not need to know either each customer's load demand or its disutility function.

To reduce the number of system states, we discretize the observed energy consumption $\sum_{i \in \mathcal{I}} \hat{e}_i^{t-1}$ into a finite number of energy levels in $\mathcal{E}$, the set of observed energy consumption levels, by using a quantization operation $q_E(\cdot)$. (In our system model, the method of discretization of the observed energy consumption is not limited to a specific method. A simple example is to use equally divided levels between the maximum and minimum amounts of the energy consumption.) Then, we refer to the tuple $\big(q_E(\sum_{i \in \mathcal{I}} \hat{e}_i^{t-1}), a^{t-1}\big)$ as the approximate demand at time-slot $t$ and represent it as
$$d_{app}^t = \Big(q_E\Big(\sum_{i \in \mathcal{I}} \hat{e}_i^{t-1}\Big), a^{t-1}\Big). \tag{13}$$
Based on the approximate demand, we now define the energy consumption-based approximate state (EAS) of the smart grid system as
$$x^t = (d_{app}^t, h^t, c^t) \in \mathcal{X}, \tag{14}$$
where $\mathcal{X} = \mathcal{E} \times \mathcal{A} \times \mathcal{H} \times \mathcal{C}$ denotes the set of the EASs. Note that EAS drastically reduces the number of states from $|\mathcal{S}| = \prod_{i \in \mathcal{I}} |\mathcal{D}_i| \times |\mathcal{H}| \times |\mathcal{C}|$ to $|\mathcal{X}| = |\mathcal{E}| \times |\mathcal{A}| \times |\mathcal{H}| \times |\mathcal{C}|$, while allowing the service provider to easily infer the customers' current load demand level without using direct signaling from the customers. Now, we can simply substitute the original state definition $s^t$ by the EAS $x^t$ in the Q-learning algorithm.

Algorithm 1 Q-Learning Algorithm with Virtual Experience
1: Initialize $Q$ arbitrarily, $t = 0$
2: for each time-slot $t$
3:   Choose $a^t$ according to policy $\pi(x^t)$
4:   Take action $a^t$, observe system cost $r(x^t, a^t)$ and next state $x^{t+1}$
5:   Obtain experience tuple $\sigma^{t+1} = (x^t, a^t, r^t, x^{t+1})$
6:   Generate set of virtual experience tuples $\theta(\sigma^{t+1})$
7:   for each virtual experience tuple $\tilde{\sigma}^{t+1} = (\tilde{x}^t, \tilde{a}^t, \tilde{r}^t, \tilde{x}^{t+1}) \in \theta(\sigma^{t+1})$
8:     $v = \tilde{r}^t + \gamma \min_{a' \in \mathcal{A}} Q(\tilde{x}^{t+1}, a') - Q(\tilde{x}^t, \tilde{a}^t)$
9:     $Q(\tilde{x}^t, \tilde{a}^t) \leftarrow Q(\tilde{x}^t, \tilde{a}^t) + \alpha^t v$
10:   end
11: end

C. Accelerated Learning using Virtual Experience

Although the EAS $x^t$ significantly reduces the state space, the learning speed of the Q-learning algorithm might be seriously limited by its inherent structure in which only one state-action pair is updated at each time-slot. In this subsection, in order to improve the speed of the Q-learning algorithm, we adopt virtual experience, which was introduced in [11]. The Q-learning algorithm with virtual experience enables the service provider to update multiple state-action pairs at each time-slot by exploiting a priori known partial information of the state transition probability $p_x(x^{t+1} \mid x^t, a^t)$. In this subsection, we consider the case where the service provider knows the transition probability of the wholesale pricing function $p_c(c^{t+1} \mid c^t, h^t)$ a priori. Note that this is not too restrictive an assumption because the service provider can gather sufficient data for the transition probability of the wholesale pricing function while participating in the wholesale electricity market. We first define the experience tuple (ET) observed by the service provider at time-slot $t+1$ as $\sigma^{t+1} = (x^t, a^t, r^t, x^{t+1})$, where $r^t$ is the observed system cost.
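A minimal sketch of how an EAS could be built and then used in one conventional Q-learning step on a single experience tuple is given below. It assumes equally spaced energy levels (the simple discretization example mentioned above), a dictionary-based Q-table, and a cost-minimizing update with a minimum over actions. The function names, the learning rate, and all numbers are hypothetical.

```python
from collections import defaultdict


def quantize_energy(total_energy, e_min, e_max, n_levels):
    """q_E: map the observed total consumption to one of n_levels equal bins."""
    if total_energy <= e_min:
        return 0
    if total_energy >= e_max:
        return n_levels - 1
    width = (e_max - e_min) / n_levels
    return min(int((total_energy - e_min) / width), n_levels - 1)


def make_eas(total_energy_prev, action_prev, period, wholesale_idx,
             e_min=0.0, e_max=100.0, n_levels=10):
    """EAS (14): x = (d_app, h, c), with d_app = (q_E(sum e), a_prev) as in (13)."""
    level = quantize_energy(total_energy_prev, e_min, e_max, n_levels)
    return ((level, action_prev), period, wholesale_idx)


def q_update(Q, x, a, cost, x_next, actions, alpha=0.1, gamma=0.95):
    """One Q-learning step on the experience tuple (x, a, cost, x_next)."""
    target = cost + gamma * min(Q[(x_next, b)] for b in actions)
    Q[(x, a)] += alpha * (target - Q[(x, a)])
    return Q[(x, a)]


# Hypothetical usage: actions and wholesale functions are integer indices.
Q = defaultdict(float)
x = make_eas(37.0, action_prev=2, period=5, wholesale_idx=1)
x_next = make_eas(52.0, action_prev=1, period=6, wholesale_idx=0)
new_value = q_update(Q, x, a=1, cost=4.0, x_next=x_next, actions=[0, 1, 2])
```

Because the state key only contains the quantized total consumption, the previous action, the period, and the wholesale index, the table size does not depend on the number of customers.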
Then, given the actual ET $\sigma^{t+1}$, we define a set of virtual experience tuples (virtual ETs) $\theta(\sigma^{t+1})$, which are statistically equivalent to the actual ET $\sigma^{t+1}$, i.e.,
$$\theta(\sigma^{t+1}) = \Big\{ \tilde{\sigma}^{t+1} \;\Big|\; \tilde{d}_{app}^t = d_{app}^t,\ \tilde{h}^t = h^t,\ \tilde{a}^t = a^t,\ \tilde{r}^t = r(\tilde{c}^t),\ p_c(\tilde{c}^{t+1} \mid \tilde{c}^t, \tilde{h}^t) = p_c(c^{t+1} \mid c^t, h^t) \Big\}, \tag{15}$$
where $r(c)$ represents the system cost that is virtually calculated by using an arbitrary wholesale pricing function $c$. Here, an ET $\tilde{\sigma}^{t+1} = (\tilde{x}^t, \tilde{a}^t, \tilde{r}^t, \tilde{x}^{t+1})$ is said to be statistically equivalent to the ET $\sigma^{t+1} = (x^t, a^t, r^t, x^{t+1})$ if $p_x(\tilde{x}^{t+1} \mid \tilde{x}^t, \tilde{a}^t) = p_x(x^{t+1} \mid x^t, a^t)$ and the system cost $\tilde{r}^t$ can be calculated by using $\sigma^{t+1}$.

In our smart grid system, if $d_{app}^t$ and $h^t$ are fixed, the system cost $r^t$ can be easily calculated for an arbitrary wholesale pricing function $c^t \in \mathcal{C}$ by applying the same energy consumption $\sum_{i \in \mathcal{I}} \hat{e}_i^t$ to (8). Moreover, since the transition probability of the wholesale pricing function $p_c(c^{t+1} \mid c^t, h^t)$ is independent of the approximate demand $d_{app}^t$ and the retail electricity pricing function $a^t$, we can easily generate the set of virtual ETs $\theta(\sigma^{t+1})$ in (15) from the observed actual ET $\sigma^{t+1}$. While the observed ET $\sigma^{t+1}$ is used to update only one state-action function $Q(x^t, a^t)$ in the conventional Q-learning, by using the virtual ETs, the Q-learning algorithm can update multiple state-action pairs at each time-slot.

The Q-learning algorithm with virtual experience is outlined in Algorithm 1. Lines 3-5 describe the operation of the conventional Q-learning where the service provider obtains the experience tuple $\sigma^{t+1}$. Then, in line 6, based on $\sigma^{t+1}$, a set of its virtual experiences is generated. In lines 7-10, the action-value function is updated for all virtual experiences in $\theta(\sigma^{t+1})$. The complexities of the proposed reinforcement learning algorithms are summarized in Table I. Although the Q-learning algorithm with virtual experience has a higher update complexity than the Q-learning algorithm without virtual experience, as we will show in Section IV, it significantly reduces the number of time-slots needed to converge, which is regarded as a more important aspect than the computational complexity at each time-slot in reinforcement learning algorithms.

TABLE I
COMPLEXITY COMPARISON OF THREE DIFFERENT ALGORITHMS.
Algorithm ; Learning update complexity per iteration ; Memory complexity
Q-learning with original state ; $O(|\mathcal{A}|)$ ; $O\big(\prod_{i \in \mathcal{I}} |\mathcal{D}_i| \, |\mathcal{H}| \, |\mathcal{C}| \, |\mathcal{A}|\big)$
Q-learning with EAS ; $O(|\mathcal{A}|)$ ; $O\big(|\mathcal{E}| \, |\mathcal{H}| \, |\mathcal{C}| \, |\mathcal{A}|^2\big)$
Q-learning with EAS and virtual experience ; $O\big(|\theta(\hat{\sigma})| \, |\mathcal{A}|\big)$ ; $O\big(|\mathcal{E}| \, |\mathcal{H}| \, |\mathcal{C}| \, |\mathcal{A}|^2\big)$

[Fig. 2. Load demand profile. Axes: kWh versus time slot.]

IV. NUMERICAL RESULTS

In this section, we provide numerical results to evaluate the performance of our dynamic pricing algorithm.
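Before the simulation setup, the virtual-experience update in lines 6-10 of Algorithm 1 (Section III-C) can be sketched as follows. The sketch rests on several assumptions that are not stated in the paper: the EAS is stored as a tuple `(d_app, h, c)`, wholesale pricing functions are integer indices, `p_c` is the known transition matrix for the current period, and a hypothetical helper `system_cost(c)` recomputes the system cost for wholesale function `c` from the same observed consumption. Virtual tuples are generated for every pair of wholesale indices whose transition probability equals that of the observed transition, which is one literal reading of (15).

```python
from collections import defaultdict


def virtual_tuples(sigma, p_c, system_cost, tol=1e-12):
    """Build theta(sigma) for sigma = (x, a, r, x_next), following (15)."""
    (d_app, h, c), a, _, (d_next, h_next, c_next) = sigma
    p_actual = p_c[c][c_next]
    tuples = []
    for c_v, row in enumerate(p_c):
        for c_v_next, p in enumerate(row):
            # Keep wholesale transitions that are as likely as the observed one.
            if abs(p - p_actual) <= tol:
                tuples.append(((d_app, h, c_v), a, system_cost(c_v),
                               (d_next, h_next, c_v_next)))
    return tuples


def update_with_virtual_experience(Q, sigma, p_c, system_cost, actions,
                                   alpha=0.1, gamma=0.95):
    """Lines 6-10 of Algorithm 1, using a min over actions (cost minimization)."""
    for x, a, r, x_next in virtual_tuples(sigma, p_c, system_cost):
        v = r + gamma * min(Q[(x_next, b)] for b in actions) - Q[(x, a)]
        Q[(x, a)] += alpha * v
    return Q


# Hypothetical usage with two wholesale pricing functions.
p_c = [[0.5, 0.5],
       [0.5, 0.5]]
system_cost = lambda c: 2.0 + c          # made-up cost for wholesale index c
sigma = (((3, 1), 5, 0), 1, 2.0, ((4, 1), 6, 1))
Q = update_with_virtual_experience(defaultdict(float), sigma, p_c,
                                   system_cost, actions=[0, 1])
```

In this example a single observed transition updates the Q-values of both wholesale indices, because each of the four virtual transitions has the same probability as the observed one. The observed tuple itself is among the generated tuples, so no separate conventional update is needed.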
One day consists of 24 time-slots, each of which lasts for one hour. Hence, the set of periods is given as $\mathcal{H} = \{0, 1, \ldots, 23\}$ and the mapping between time-slots and periods is given as $h^t = \mathrm{mod}(t, 24)$. We consider a smart grid system with 2 customers. The newly generated load demand of customer $i$, $D_i^t(h^t)$, follows a Poisson distribution with expected value $\omega_{i,h^t}$, which is proportional to the hourly average load shapes of residential electricity services in California [12] as shown in Fig. 2. All customers have the same backlog rate, i.e., $\lambda_i = \lambda$, $\forall i \in \mathcal{I}$. Each customer's disutility function $u_i(d_i^t - e_i^t)$ is given as
$$u_i(d_i^t - e_i^t) = \kappa_i (d_i^t - e_i^t)^2, \tag{16}$$
where $\kappa_i$ is a constant that represents customer $i$'s disutility sensitivity to its remaining demand. Here, we let $\kappa_i = \kappa = 0.1$, $\forall i \in \mathcal{I}$. We model the wholesale pricing function $c^t$ as a quadratic function of the total energy consumption $\sum_{i \in \mathcal{I}} \hat{e}_i^t$ as in [3], [4], and [6]:
$$c^t\Big(\sum_{i \in \mathcal{I}} \hat{e}_i^t\Big) = \mu^t \Big(\sum_{i \in \mathcal{I}} \hat{e}_i^t\Big)^2 + \nu_{h^t}^t \Big(\sum_{i \in \mathcal{I}} \hat{e}_i^t\Big). \tag{17}$$
We set $\mu^t = 0.2$, $\forall t$, and $\nu_{h^t}^t$ to be a random variable whose expected value, $v_{h^t}$, changes according to the corresponding period $h^t$ based on the hourly average load shape in Fig. 2. With a given period $h^t$, $\nu_{h^t}^t$ is chosen uniformly at random among the values in $\{0.25 v_{h^t}, 0.5 v_{h^t}, \ldots, 1.75 v_{h^t}\}$. The discount factor $\gamma$ in problem (P) is fixed to 0.95. The retail pricing function $a^t$ is a linear function of the energy consumption $\hat{e}_i^t$, i.e.,
$$a^t(\hat{e}_i^t) = \chi^t \hat{e}_i^t, \tag{18}$$
where the coefficient $\chi^t$ can be chosen among the set $\{0.2, 0.4, \ldots, 1.0\}$, each element of which is directly mapped to one retail pricing function in $\mathcal{A}$.

[Fig. 3. Performance comparison of our reinforcement learning algorithm and the myopic optimization algorithm varying $\lambda$. Axes: average system cost versus $\lambda$.]

We first evaluate the performance of the dynamic pricing algorithm by comparing it with that of the myopic optimization algorithm. In the myopic optimization algorithm, the service provider chooses an action with the lowest expected instantaneous system cost, by updating the state-action function $Q(s, a)$ similarly to the Q-learning update with a discount factor $\gamma = 0$.
This implies that the myopic optimization algorithm focuses only on the immediate system cost without considering the impact of the current action on the future system cost. In Fig. 3, we show the average system costs of those two dynamic pricing algorithms by changing the backlog rate, $\lambda$, from 0 to 1. We set $\rho = 0.5$, which corresponds to the case where the service provider aims at minimizing the sum of the total disutility and the wholesale cost. We can observe that the average system costs increase as $\lambda$ increases in both dynamic pricing algorithms because with a higher backlog rate, the accumulated load demand causes a higher disutility. We also observe that the performance gap between the two algorithms increases as $\lambda$ increases. With a low backlog rate ($\lambda$ close to 0), the remaining demand is hardly carried forward to the next time-slot. In this case, the solution of our reinforcement learning algorithm is nearly the same as that of the myopic optimization algorithm. On the contrary, in the case with a higher backlog rate, the remaining demand is carried forward to the next time-slot. Hence, the service provider's pricing decision at a time-slot influences the accumulated load demand in the future, and thus its future system cost. Due to this difference, especially when $\lambda$ is large, our algorithm achieves better performance than the myopic optimization algorithm, which considers only the current system cost.

[Fig. 4. Impact of the weighting factor $\rho$ on the performances of customers and service provider. Panels: average cost of customers and average cost of service provider versus $\rho$; average retail price ($/kWh) versus $\rho$.]

To study the impact of the weighting factor $\rho$, in Fig. 4, we show the cost of customers, that of the service provider, and the average retail price with varying $\rho$ from 0 to 1. We set $\lambda = 0.5$. We can observe that as $\rho$ increases, the service provider reduces the average retail price, the cost of customers decreases, and the cost of the service provider increases. For example, in the case with $\rho = 0$, the service provider aims at minimizing its own cost. Hence, the service provider does not consider the customers' disutility and chooses relatively high prices to reduce the wholesale cost, which contributes to most of its own cost. In the case with $\rho = 1$, the service provider aims at minimizing the customers' cost. Hence, the service provider chooses relatively low prices to provide electric energy to the customers at as low a retail price as possible.

[Fig. 5. Impact of virtual experience on the learning speed of the Q-learning algorithm. Axes: average system cost versus time slot; curves: with virtual experience, without virtual experience.]

In Fig. 5, we compare the learning speed of our reinforcement learning algorithm with virtual experience to the conventional Q-learning algorithm without virtual experience. We set $\lambda = 0.5$ and $\rho = 0.5$. We can observe that our algorithm with virtual experience achieves the near optimal average system cost after about 3, time-slots, while the conventional Q-learning algorithm shows a worse learning speed. This means that even if the stochastic characteristics of the system dynamics vary in time, the proposed reinforcement learning algorithm can quickly adapt to the time-varying environment by exploiting the virtual experience update.

V. CONCLUSION

In this paper, we studied a dynamic pricing problem for the smart grid system where the service provider can adaptively decide the electricity price according to the customers' load demand levels and the wholesale price. We developed a reinforcement learning-based dynamic pricing algorithm that enables efficient dynamic pricing without requiring perfect information about the system dynamics a priori. To resolve the existing drawbacks of the conventional reinforcement learning algorithm, we proposed two improvements: an energy consumption-based approximate state definition and the adoption of the virtual experience update in the conventional Q-learning algorithm. Numerical results show that the reinforcement learning-based dynamic pricing achieves a higher long-term performance compared to the myopic optimization approach, especially in the system where the customers have a high demand backlog rate. The results also show that our algorithm results in an improved learning speed due to the alternative state definition and virtual experience, implying that our dynamic pricing algorithm can be applied to the practical smart grid system.

REFERENCES

[1] M. H. Albadi and E. El-Saadany, "Demand response in electricity markets: An overview," in IEEE Power Engineering Society General Meeting, 2007.
[2] B.-G. Kim, S. Ren, M. van der Schaar, and J.-W. Lee, "Bidirectional energy trading and residential load scheduling with electric vehicles in the smart grid," IEEE Journal on Selected Areas in Communications, vol. 31, no. 7, 2013.
[3] P. Samadi, A.-H. Mohsenian-Rad, R. Schober, V. Wong, and J. Jatskevich, "Optimal real-time pricing algorithm based on utility maximization for smart grid," in IEEE SmartGridComm, 2010.
[4] P. Tarasak, "Optimal real-time pricing under load uncertainty based on utility maximization for smart grid," in IEEE SmartGridComm, 2011.
[5] L. P. Qian, Y. Zhang, J. Huang, and Y. Wu, "Demand response management via real-time electricity price control in smart grids," IEEE Journal on Selected Areas in Communications, vol. 31, no. 7, 2013.
[6] C. Chen, S. Kishore, and L. Snyder, "An innovative RTP-based residential power scheduling scheme for smart grids," in IEEE ICASSP, 2011.
[7] C. Joe-Wong, S. Sen, S. Ha, and M. Chiang, "Optimized day-ahead pricing for smart grids with device-specific scheduling flexibility," IEEE Journal on Selected Areas in Communications, vol. 30, no. 6, 2012.
[8] M. He, S. Murugesan, and J. Zhang, "Multiple timescale dispatch and scheduling for stochastic reliability in smart grids with wind generation integration," in IEEE INFOCOM, 2011.
[9] L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, May.
[10] C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Cambridge University.
[11] N. Mastronarde and M. van der Schaar, "Joint physical-layer and system-level power management for delay-sensitive wireless communications," IEEE Transactions on Mobile Computing, vol. 12, no. 4, 2013.
[12] Dynamic Load Profiles in California. Pacific Gas & Electric. [Online]. Available: use prces.shtml


More information

Realistic Image Synthesis

Realistic Image Synthesis Realstc Image Synthess - Combned Samplng and Path Tracng - Phlpp Slusallek Karol Myszkowsk Vncent Pegoraro Overvew: Today Combned Samplng (Multple Importance Samplng) Renderng and Measurng Equaton Random

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

AN APPROACH TO WIRELESS SCHEDULING CONSIDERING REVENUE AND USERS SATISFACTION

AN APPROACH TO WIRELESS SCHEDULING CONSIDERING REVENUE AND USERS SATISFACTION The Medterranean Journal of Computers and Networks, Vol. 2, No. 1, 2006 57 AN APPROACH TO WIRELESS SCHEDULING CONSIDERING REVENUE AND USERS SATISFACTION L. Bada 1,*, M. Zorz 2 1 Department of Engneerng,

More information

LIFETIME INCOME OPTIONS

LIFETIME INCOME OPTIONS LIFETIME INCOME OPTIONS May 2011 by: Marca S. Wagner, Esq. The Wagner Law Group A Professonal Corporaton 99 Summer Street, 13 th Floor Boston, MA 02110 Tel: (617) 357-5200 Fax: (617) 357-5250 www.ersa-lawyers.com

More information

Optimal Customized Pricing in Competitive Settings

Optimal Customized Pricing in Competitive Settings Optmal Customzed Prcng n Compettve Settngs Vshal Agrawal Industral & Systems Engneerng, Georga Insttute of Technology, Atlanta, Georga 30332 vshalagrawal@gatech.edu Mark Ferguson College of Management,

More information

Research of Network System Reconfigurable Model Based on the Finite State Automation

Research of Network System Reconfigurable Model Based on the Finite State Automation JOURNAL OF NETWORKS, VOL., NO. 5, MAY 24 237 Research of Network System Reconfgurable Model Based on the Fnte State Automaton Shenghan Zhou and Wenbng Chang School of Relablty and System Engneerng, Behang

More information

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 Proceedngs of the Annual Meetng of the Amercan Statstcal Assocaton, August 5-9, 2001 LIST-ASSISTED SAMPLING: THE EFFECT OF TELEPHONE SYSTEM CHANGES ON DESIGN 1 Clyde Tucker, Bureau of Labor Statstcs James

More information

Optimal Scheduling in the Hybrid-Cloud

Optimal Scheduling in the Hybrid-Cloud Optmal Schedulng n the Hybrd-Cloud Mark Shfrn Faculty of Electrcal Engneerng Technon, Israel Emal: shfrn@tx.technon.ac.l Ram Atar Faculty of Electrcal Engneerng Technon, Israel Emal: atar@ee.technon.ac.l

More information

Cost Minimization using Renewable Cooling and Thermal Energy Storage in CDNs

Cost Minimization using Renewable Cooling and Thermal Energy Storage in CDNs Cost Mnmzaton usng Renewable Coolng and Thermal Energy Storage n CDNs Stephen Lee College of Informaton and Computer Scences UMass, Amherst stephenlee@cs.umass.edu Rahul Urgaonkar IBM Research rurgaon@us.bm.com

More information

HowHow to Find the Best Online Stock Broker

HowHow to Find the Best Online Stock Broker A GENERAL APPROACH FOR SECURITY MONITORING AND PREVENTIVE CONTROL OF NETWORKS WITH LARGE WIND POWER PRODUCTION Helena Vasconcelos INESC Porto hvasconcelos@nescportopt J N Fdalgo INESC Porto and FEUP jfdalgo@nescportopt

More information

Dynamic Online-Advertising Auctions as Stochastic Scheduling

Dynamic Online-Advertising Auctions as Stochastic Scheduling Dynamc Onlne-Advertsng Auctons as Stochastc Schedulng Isha Menache and Asuman Ozdaglar Massachusetts Insttute of Technology {sha,asuman}@mt.edu R. Srkant Unversty of Illnos at Urbana-Champagn rsrkant@llnos.edu

More information

An Empirical Study of Search Engine Advertising Effectiveness

An Empirical Study of Search Engine Advertising Effectiveness An Emprcal Study of Search Engne Advertsng Effectveness Sanjog Msra, Smon School of Busness Unversty of Rochester Edeal Pnker, Smon School of Busness Unversty of Rochester Alan Rmm-Kaufman, Rmm-Kaufman

More information