Solving Factored MDPs with Continuous and Discrete Variables

Size: px
Start display at page:

Download "Solving Factored MDPs with Continuous and Discrete Variables"

Transcription

1 Solvng Factored MPs wth Contnuous and screte Varables Carlos Guestrn Berkeley Research Center Intel Corporaton Mlos Hauskrecht epartment of Computer Scence Unversty of Pttsburgh Branslav Kveton Intellgent Systems Program Unversty of Pttsburgh Abstract Although many real-world stochastc plannng problems are more naturally formulated by hybrd models wth both dscrete and contnuous varables, current state-of-the-art methods cannot adequately address these problems. We present the frst framework that can explot problem structure for modelng and solvng hybrd problems effcently. We formulate these problems as hybrd Markov decson processes (MPs wth contnuous and dscrete state and acton varables), whch we assume can be represented n a factored way usng a hybrd dynamc Bayesan network (hybrd BN). We present a new lnear program approxmaton method that explots the structure of the hybrd MP and lets us compute approxmate value functons more effcently. In partcular, we descrbe a new factored dscretzaton of contnuous varables that avods the exponental blow-up of tradtonal approaches. We provde theoretcal bounds on the qualty of such an approxmaton and on ts scaleup potental. We support our theoretcal arguments wth experments on a set of control problems wth up to 28-dmensonal contnuous state space and 22-dmensonal acton space. 1 Introducton Markov decson processes (MPs) (Bellman 1957; Bertsekas & Tstskls 1996) offer an elegant mathematcal framework for representng sequental decson problems n the presence of uncertanty. Whle standard soluton technques, such as value or polcy teraton, scale-up well n terms of the total number of states and actons, these technques are less successful n real-world MPs. In purely dscrete settngs, the runnng tme of these algorthms grows exponentally n the number varables, the so called curse of dmensonalty. Furthermore, many real-world problems nclude a combnaton of contnuous and dscrete state and acton varables. The contnuous components are usually dscretzed, whch leads to an exponental blow up n the number of varables. We present the frst framework that explots problem structure and solves large hybrd MPs effcently. The MPs are modelled by hybrd factored MPs, where the stochastc dynamcs s represented compactly by a probablstc graphcal model, a hybrd dynamc Bayesan network (BN) (ean & Kanazawa 1989). The soluton of the MP s approxmated by a lnear combnaton of bass functons (Bellman, Kalaba, & Kotkn 1963; Bertsekas & Tstskls 1996). Specfcally, we use a factored (lnear) value functon (Koller & Parr 1999), where each bass functon depends on a small number of state varables. We show that the weghts of ths approxmaton can be optmzed usng a convex formulaton that we call hybrd approxmate lnear programmng (HALP). The HALP reduces to the approxmate lnear programmng (ALP) formulaton (Schwetzer & Sedmann 1985) n purely dscrete settngs and to the formulaton recently proposed by (Hauskrecht & Kveton 23) for the contnuous-state settngs. Copyrght c 24, Amercan Assocaton for Artfcal Intellgence ( All rghts reserved. We present a theoretcal analyss of the HALP, provdng bounds wth respect to the best approxmaton n the space of the bass functons. Unfortunately, the HALP formulaton of the problem may not be solved drectly snce t may use nfnte number of constrants. To address ths problem, we formulate a relaxed verson of the HALP, an ɛ-halp, that uses a fnte subset of constrants nduced by the ε-grd dsretzaton of contnuous components. We provde a bound on the loss n the qualty of the ε-halp soluton wth respect to the complete HALP formulaton. The man advantage of the ε-halp s that t can be solved effcently by exstng factored ALP methods (Guestrn, Koller, & Parr 21a; Schuurmans & Patrascu 22). Therefore, the complexty of our soluton does not grow exponentally wth the number of varables, and depends only on the structure of the problem and the choce of bass functons. We llustrate the feasblty of our formulaton and ts soluton algorthm on a sequence of control optmzaton problems wth 28-dmensonal contnuous state space and 22-dmensonal acton space. These nontrval dynamc optmzaton problems are far out of reach of classc soluton technques. 2 Multagent hybrd factored MPs Factored MPs (Boutler, earden, & Goldszmdt 1995) allow one to explot problem structure to represent exponentally large MPs compactly. We extend ths formalsm to a multagent hybrd factored MP that s defned by a 4-tuple (X, A, P, R) consstng of a state space X represented by a set of state varables X = {X 1,... X n }, an acton space A defned by a set of acton varables A = {A 1,... A m }, a stochastc transton model P modelng the dynamcs of a state condtoned on the prevous state and acton choce, and a reward model R that quantfes the mmedate payoffs assocated wth a state-acton confguraton. State varables: Each state varable s ether dscrete or contnuous. We assume that every contnuous varable s bounded to a [, 1] subspace, and each dscrete varable takes on values n some fnte doman. A state s defned by a vector x of value assgnments to each state varable, whch splts nto dscrete and contnuous components denoted by x = (x, x C ). Actons: Acton space s dstrbuted such that every acton corresponds to one agent. As wth state varables, the global acton a s defned by a vector of ndvdual acton choces that can be dvded nto dscrete a and contnuous a C components. Factored transton: State transton model s defned by a dynamc Bayesan network (BN) (ean & Kanazawa 1989). Let X denote a varable at the current tme and let X denote the same varable at the successve step. The transton graph of a BN s a two-layer drected acyclc graph whose nodes are {X 1,..., X n, A 1,..., A m, X 1,..., X n}. The parents of X n the graph are denoted by Par(X ). For smplc- ) {X, A},.e., ty of exposton, we assume that Par(X

2 all arcs n the BN are between varables n consecutve tme slces. Each node X s assocated wth a condtonal probablty functon (CPF) p(x Par(X )). The transton probablty p(x x, a) s then defned to be p(x u ), where u s the value n {x, a} of the varables n Par(X Parameterzaton of CPFs: The transton model ). for each varable s local, as each CPF depends only on a small subset of state varables and ndvdual actons. Compact parametrc representaton of the transtons s acheved by usng beta or mxture of beta denstes (Hauskrecht & Kveton 23; Kveton & Hauskrecht 24) for contnuous varables, and by general dscrmnant functons for dscrete varables. Rewards: Reward functon R decomposes as a sum of partal reward functons R j defned on the subsets of state and acton varables. Polcy: The objectve s to fnd a control polcy π : X A that maxmzes the nfnte-horzon, dscounted reward crteron: E[ = γ r ], where γ [, 1) s a dscount factor, and r s a reward obtaned n step. Value functon: The value of the optmal polcy satsfes the Bellman fxed pont equaton (Bellman 1957; Bertsekas & Tstskls 1996): V (x) = sup p(x x, a)v (x ), (1) a R(x, a) + γ x where V s the value of the optmal polcy. Gven the value functon V, the optmal polcy π (x) s defned by the composte acton a optmzng Equaton 1. 3 Approxmate lnear programmng solutons for hybrd MPs A standard way of solvng complex MPs s to assume a surrogate value functon form wth a small set of tunable parameters. Increasngly popular n recent years are the approxmatons based on lnear representatons of value functons, where the value functon V (x) s expressed as a lnear combnaton of k bass functons f (x) (Bellman, Kalaba, & Kotkn 1963; Roy 1998): k V (x) = w f (x). =1 Bass functons are often restrcted to small subsets of state varables (Bellman, Kalaba, & Kotkn 1963; Roy 1998), and the goal of the optmzaton s to ft the set of weghts w = (w 1,..., w k ). 3.1 Formulaton We generalze approxmate lnear programmng (ALP) for dscrete MPs (Schwetzer & Sedmann 1985) nto hybrd settngs. Weghts w are optmzed by solvng a convex optmzaton problem that we call hybrd approxmate lnear program (HALP): mnmze w w α subject to: x C w F (x, a) R(x, a) x, a; (2) where α denotes the bass functon relevance weght gven by: α = x x C ψ(x)f (x)dx C, (3) where ψ(x) > s a state relevance densty functon such that x x C ψ(x)dx C = 1, allowng us to weght the qualty of our approxmaton dfferently for dfferent parts of the state space; and F (x, a) denotes: F (x, a) = f (x) γ p(x x, a)f (x )dx C. (4) x x C Ths formulaton reduces to the standard dscrete-case ALP (Schwetzer & Sedmann 1985; Guestrn, Koller, & Parr 21b; de Faras & Van Roy 23; Schuurmans & Patrascu 22) f the state space x s dscrete, or to the contnuous ALP (Hauskrecht & Kveton 23) f the state space s contnuous. A number of concerns arse n context of the HALP approxmaton. Frst, the formulaton of the HALP appears to be arbtrary, and t s not mmedately clear how t relates to the orgnal hybrd MP problem. Second, the HALP aproxmaton for the hybrd MP nvolves complex ntegrals that must be evaluated. Thrd, the number of constrants defnng the LP s exponental f the state and acton spaces are dscrete and nfnte f any of the spaces nvolves contnuous components. In the followng text, we address and provde solutons for each of these ssues. 3.2 Theoretcal analyss Theoretcal analyss of the qualty of the soluton obtaned by the HALP follows the deas of de Faras and Van Roy 23 for the dscrete case. They note that the approxmate formulaton cannot guarantee an unformly good approxmaton of the optmal value functon over the whole state space. To address ths ssue, they defne a Lyapunov functon that weghs states approprately: a Lyapunov functon L(x) = wl f (x) wth contracton factor κ (, 1) for the transton model P π s a strctly postve functon such that: κl(x) γ P π (x x)l(x )dx C. (5) x x C Ths defnton allows to clam: Proposton 1 Let w be an optmal soluton to the HALP n Equaton 2, then, for any Lyapunov functon L(x), we have that: V Hw 1,ψ 2ψ L 1 κ mn V Hw w,1/l, where Hw represents the functon w f ( ), the L 1 norm weghted by ψ gven by 1,ψ, and,1/l s the maxnorm weghted by 1/L. Proof: The proof of ths result for the hybrd settng follows the outlne of the proof of de Faras and Van Roy s Theorem 4.2 (de Faras & Van Roy 23) for the dscrete case. 4 Factored HALP Factored MP models offer, n addton to structured parameterzatons of the process, an opportunty to solve the problem more effcently. The opportunty stems from the structure of constrant defntons that decompose over state and acton subspaces. Ths s a drect consequence of: (1) factorzatons, (2) presence of local transtons, and (3) bass functons defned over small state subspaces. Ths secton descrbes how these propertes allow us to compute the factors n the HALP effcently.

3 4.1 Factored hybrd bass functon representaton Koller and Parr 1999 show that bass functons wth lmted scope provde the bass for effcent approxmatons n the context of dscrete factored MPs. An mportant ssue n hybrd settngs s that the problem formulaton ncorporates ntegrals, whch may not be computable. Hauskrecht and Kveton 23 propose conjugate transton model and bass functon classes that lead to closed-form solutons of all ntegrals n strctly contnuous cases. In our hybrd settng, each bass functon f (x ) s defned over dscrete components x and contnuous components x C, and decomposes as a product of two factors: f (x ) = f (x )f C (x C ), (6) where f C (x C ) takes the form of polynomals over the varables n X C, and f (x ) s an arbtrary functon over the dscrete varables X. Ths bass functon representaton gves us hgh flexblty and ablty to effcently solve hybrd plannng problem. 4.2 Hybrd backprojectons Computaton of F (x, a), the dfference between the bass functon f (x) and ts dscounted backprojecton, gven by: g (x, a) = p(x x, a)f (x )dx C x x C requres us to compute a sum over the exponental number of dscrete states x, and ntegrals over the contnuous states x C Based on the results of Koller and Parr 1999 for dscrete. varables, and Hauskrecht and Kveton 23 for contnuous varables, we can rewrte the backprojecton for hybrd bass: g (x, a) = g (x, a)g C (x, a), ( ) = p(x x x, a)f (x ) ( ) (7) p(x x C x, a)f C (x C )dx C C and compute t effcently. Note that g (x, a) s the backprojecton of a dscrete bass functon and g C (x, a) s the backprojecton of a contnuous bass functon. 4.3 Hybrd relevance weghts Computaton of bass functon relevance weghts α n Equaton 3 requres us to solve exponentally-large sums and complex ntegrals. Guestrn et al. 21b; 23 showed that f the state relevance densty ψ(x) s represented n a factorzed fashon, these weghts can be computed effcently. Ths result extends to hybrd settngs, and thus we can decompose the computaton of α : α = α α C, ( ) = ψ(x x )f (x ) ( ) (8) ψ(x xc C )f C (x C )dx C, where ψ(x ) s the margnal of the densty ψ(x) to the dscrete varables X, and ψ(x C ) s the margnal to the contnuous varables X C. 5 Factored ε-halp formulaton espte the decompostons and closed-form solutons, factored HALPs reman hard to solve. Unfortunately, the formulaton ncludes constrants for each jont state x and acton a, whch leads to exponentally-many constrants for dscrete components, and uncountably nfnte constrant set for contnuous. To address these ssues, we propose to transform the factored HALP nto ε-halp, an approxmaton of the factored HALP wth a fnte number of constrants. The ε-halp reles on the ε coverage of the constrant space. In the ε-coverage each contnuous (state or acton) varable s dscretzed nto 1 2ε + 1 equally spaced values. The dscretzaton nduces a multdmensonal grd G, such that any pont n [, 1] d s at most ε far from a pont n G under the max-norm. If we drectly enumerate each state and acton confguraton of the ε-halp we obtan an LP wth exponentallymany constrants. However, not all these constrants defne the soluton and need to be enumerated. Ths s the same settng as the factored LP decomposton of Guestrn et al. 21a. We can use the same technque to decompose our ε-halp nto an equvalent LP wth exponentally-fewer constrants. The complexty of ths new problem wll only be exponentally n the tree-wdth of a cost network formed by the restrcted scope functons n our LP, rather than n the complete set of varables (Guestrn, Koller, & Parr 21a; Guestrn et al. 23). Alternatvely we can also apply the approach by Schuurmans and Patrascu 22 that ncrementally bulds the set of constrants usng a constrant generaton heurstc and often performs well n practce. The ε-halp offers an effcent approxmaton of a hybrd factored MP; however, t s unclear how the dscretzaton affects the qualty of the approxmaton. Most dscretzaton approaches requre an exponental number of ponts for a fxed approxmaton level. In the remander of ths secton, we provde a proof that explots factorzaton structure to show that our ε-halp provdes a polynomal approxmaton of the contnuous HALP formulaton. 5.1 Bound on the qualty of ε-halp A soluton to the ε-halp wll usually volate some of the constrants n the orgnal HALP formulaton. We show that f these constrants are volated by a small amount, then the ε-halp soluton s nearly optmal. Let us frst defne the degree to whch a relaxed HALP, that s, a HALP defned over a fnte subset constrants, volates the complete set of constrants. efnton 1 A set of weghts w s δ-nfeasble f: w F (x, a) R(x, a) δ, x, a. Now we are ready to show that, f the soluton to the relaxed HALP s δ-nfeasble, then the qualty of the approxmaton obtaned from the relaxed HALP s close to the one n the complete HALP. Proposton 2 Let w be any optmal soluton to the complete HALP n Equaton 2, and ŵ be any optmal soluton to a relaxed HALP, such that ŵ s δ-nfeasble, then: V Hŵ 1,ψ V Hw δ 1,ψ γ. Proof: Frst, by monotoncty of the Bellman operator, any feasble soluton w n the complete HALP satsfes: w f (x) V (x). (9) Usng ths fact, we have that: Hw V 1,ψ = ψ Hw V, = ψ (Hw V ), = ψ Hw ψ V. (1)

4 Next, note that the constrants n the relaxed HALP are a subset of those n the complete HALP. Thus, w s feasble for the relaxed HALP, and we have that: ψ Hw ψ Hŵ. (11) Now, note that f ŵ s δ-nfeasble n the complete HALP, then f we add δ 1 γ to Hŵ we obtan a feasble soluton to the complete HALP, yeldng: Hŵ + δ 1 γ V = ψ Hŵ + δ 1,ψ 1 γ ψ V, ψ Hw + δ 1 γ ψ V, = Hw V 1,ψ + δ 1 γ. (12) The proof s concluded by substtutng Equaton 12 nto the trangle nequalty bound: Hŵ V 1,ψ Hŵ + δ 1 γ V + 1,ψ δ 1 γ. The above result can be combned wth the result n Secton 3 to obtan the bound on the qualty of the ε-halp. Theorem 1 Let ŵ be any optmal soluton to the relaxed ε- HALP satsfyng the δ nfeasblty condton. Then, for any Lyapunov functon L(x), we have: V δ Hŵ 1,ψ 2 1 γ + 2ψ L 1 κ mn V Hw w,1/l. Proof: rect combnaton of Propostons 1, Resoluton of the ε grd Our bound for relaxed versons on the HALP formulaton, presented n the prevous secton, reles on addng enough constrants to guarantee at most δ-nfeasblty. The ε-halp approxmates the constrants n HALP by restrctng values of ts contnuous varables to the ε grd. In ths secton, we analyze the relatonshp between the choce of ε and the volaton level δ, allowng us to choose the approprate dscretzaton level for a desred approxmaton error n Theorem 1. Our condton n efnton 1 can be satsfed by a set constrants C that ensures a δ max-norm dscretzaton of ŵf (x, a) R(x, a). In the ε-halp ths condton s met wth the ε-grd dscretzaton that assures that for any stateacton par x, a there exsts a par x G, a G n the ε grd such that: ŵf(x, a) R(x, a) ŵf(xg, ag) + R(xG, ag) δ. Usually, such bounds are acheved by consderng the Lpschtz modulus of the dscretzed functon: Let h(u) be an arbtrary functon defned over the contnuous subspace U [, 1] d wth a Lpschtz modulus K and let G be an ε-grd dscretzaton of U. Then the δ max-norm dscretzaton of h(u) can be acheved wth a ε grd wth the resoluton ε δ K. Usually, the Lpschtz modulus of a functon rapdly ncreases wth dmenson d, thus requrng addtonal ponts for a desred dscretzaton level. Each constrant n the ε-halp s defned n terms of a sum of functons: ŵf (x, a) j R(x, a), where each functon depends only on a small number of varables (and thus has a small dmenson). Therefore, nstead of usng a global Lpschtz constant K for the complete expresson we can express the relaton n between the factor δ and ε n terms of the Non outgong channels Outgong channels x (a) (b) Fgure 1: a. The topology of an rrgaton system. Irrgaton channels are represented by lnks x and water regulaton devces are marked by rectangles a. Input and output regulaton devces are shown n lght and dark gray colors. b. Reward functons for the amount of water x n the th rrgaton channel. Lpschtz constants of ndvdual functons, explotng the factorzaton structure. In partcular, let K max be the worst-case Lpschtz constant over both the reward functons R j (x, a) and w F (x, a). To guarantee that K max s bounded, we must bound the magntude of ŵ. Typcally, f the bass functons have unt magntude, the ŵ wll be bounded R max /(1 γ). Here, we can defne K max to be the maxmum of the Lpschtz constants of the reward functons and of R max /(1 γ) tmes the constant for each F (x, a). By choosng an ε dscretzaton of only: δ ε, MK max where M s the number of functons, we guarantee the condton of Theorem 1 for a volaton of δ. 6 Experments Ths secton presents an emprcal evaluaton of our approach, demonstratng the qualty of the approxmaton and the scaleup potental. 6.1 Irrgaton network example An rrgaton system conssts of a network of rrgaton channels that are connected by regulaton devces (Fgure 1a). Regulaton devces are used to regulate the amount of water n the channels, whch s acheved by pumpng the water from one of the channels to another one. The goal of the operator of the rrgaton system s to keep the amount of water n all channels on an optmal level (determned by the type of planted crops, etc.), by manpulaton of regulaton devces. Fgure 1a llustrates the topology of channels and regulaton devces for one of the rrgaton systems used n the experments. To keep problem formulaton smple, we adopt several smplfyng assumptons: all channels are of the same sze, water flows are orented, and the control structures operate n dscrete modes. The rrgaton system can be formalzed as a hybrd MP, and the optmal behavor of the operator can be found as the optmal control polcy for the MP. The amount of water n the th channel s naturally represented by a contnuous state factor x [, 1]. Each regulaton devce can operate n multple modes: the water can be pumped n between any par

5 x x x x Fgure 2: Feature functons for the amount of water x n the th rrgaton channel. of ncomng and outgong channel. These optons are represented by dscrete acton varables a, one varable per regulaton devce. The nput and output regulaton devces (devces wth no ncomng or no outgong channels) are specal and contnuously pump the water n or out of the rrgaton system. Transton functons are defned as beta denstes that represent water flows dependng on the operatng modes of the regulaton devces. Reward functon reflects our preference for the amount of water n the channels (Fgure 1b). The reward functon s factorzed along channels, defned by a lnear reward functon for the outgong channels, and a mxture of Gaussans for all other channels. The dscount factor s γ =.95. To approxmate the optmal value functon, a combnaton of lnear and pecewse lnear feature functons s used at every channel (Fgure 2). 6.2 Expermental results The objectve of the frst set of experments was to compare the qualty of solutons obtaned by the ε-halp for varyng grd resolutons ε aganst other technques for polcy generaton and to llustrate tme (n seconds) needed to solve the ε- HALP problem. All experments are performed on the rrgaton network from Fgure 1a wth 17 dmensonal state space and 15 dmensonal acton space. The results are presented n Fgure 3. The qualty of polces s measured n terms of the average reward that s obtaned va Monte Carlo smulatons of the polcy on 1 state-acton trajectores, each of 1 steps. To assure the farness of the comparson, the set of ntal states s kept fxed across experments. Three alternatve solutons are used n the comparson: random polcy, local heurstc, and global heurstc. The random polcy operates regulaton devces randomly and serves as a baselne soluton. The local heurstc optmzes the one-step expected reward for every regulaton devce locally, whle gnorng all other devces. Fnally, the global heurstc attempts to optmze one-step expected reward for all regulatory devces together. The parameter of the global heurstc s the number of trals used to estmate the global one-step reward. All heurstc solutons were appled n the on-lne mode; thus, ther soluton tmes are not ncluded n Fgure 3. The results show that the ε-halp s able to solve a very complex optmzaton problem relatvely quckly and outperform strawman heurstc methods n terms of the qualty of ther solutons. 6.3 Scale-up study The second set of experments focuses on the scale-up potental of ε-halp method wth respect to the complexty of the model. The experments are performed for n-rng and n-rngof-rngs topologes (Fgure 4a). The results, summarzed n Fgure 4b, show several mportant trends: (1) the qualty of the polcy for the ε-halp mproves wth hgher grd resoluton ε, (2) the runnng tme of the method grows polynomally wth ε-halp Alternatve soluton ε µ σ Tme[s] Method µ σ Random / Local / Global / Global / Global Fgure 3: Results of the experments for the rrgaton system n Fgure 1a. The qualty of found polces s measured by the average reward µ for 1 state-acton trajectores, where σ denotes the standard devaton of the rewards. the grd resoluton, and (3) the ncrease n the runnng tme of the method for topologes of ncreased complexty s mld and far from exponental n the number of varables n. Graphcal examples of each of these trends are gven n Fgures 4c, 4d, and 4e. In addton to the runnng tme curve, Fgure 4e shows a quadratc polynomal ftted to the values for dfferent n. Ths supports our theoretcal fndngs that the runnng tme complexty of the ε-halp method for an approprate choce of bass functons does not grow exponentally n the number of varables. 7 Conclusons We present the frst framework that can explot problem structure for modelng and approxmately solvng hybrd problems effcently. We provde bounds on the qualty of the solutons obtaned by our HALP formulaton wth respect to the best approxmaton n our bass functon class. Ths HALP formulaton can be closely approxmated by the (relaxed) ε- HALP, f the resultng soluton s near feasble n the orgnal HALP formulaton. Although we would typcally requre an exponentally-large dscretzaton to guarantee ths near feasblty, we provde an algorthm that can effcently generate an equvalent guarantee wth an exponentally-smaller dscretzaton. When combned, these theoretcal results lead to a practcal algorthm that we have successfully demonstrated on a set of control problems wth up to 28-dmensonal contnuous state space and 22-dmensonal acton space. The technques presented n ths paper drectly generalze to collaboratve multagent settngs, where each agent s responsble for one of the acton varables, and they must coordnate to maxmze the total reward. The off-lne plannng stage of our algorthm remans unchanged. However, n the on-lne acton selecton phase, at every tme step, the agents must coordnate to choose the acton that jontly maxmzes the expected value for the current state. We can acheve ths by extendng the coordnaton graph algorthm of Guestrn et al. 21b to our hybrd settng wth our factored dscretzaton scheme. The result wll be an effcent dstrbute coordnaton algorthm that can cope wth both contnuous and dscrete actons. Many real-world problems nvolve contnuous and dscrete elements. We beleve that our algorthms and theoretcal results wll sgnfcantly further the applcablty of automated plannng algorthms to these settngs. Acknowledgments Mlos Hauskrecht was supported n part by the Natonal Scence Foundaton under grant ITR and grant Branslav Kveton acknowledges the fellowshp support from the School of Arts and Scences, Unversty of Ptts-

6 (a) Expected reward n-rng ε n = 6 n = 9 n = 12 n = 15 n = 18 µ Tme[s] µ Tme[s] µ Tme[s] µ Tme[s] µ Tme[s] / / / / n-rng-of-rngs ε n = 6 n = 9 n = 12 n = 15 n = 18 µ Tme[s] µ Tme[s] µ Tme[s] µ Tme[s] µ Tme[s] / / / / rng of rngs / ε Tme 3 x 12 rng of rngs Tme / ε (b) n rng of rngs, 1 / ε = n (c) (d) (e) Fgure 4: a. Two rrgaton network topologes used n the scale-up experments: n-rng-of-rngs (shown for n = 6) and n-rng (shown for n = 6). b. Average rewards and polcy computaton tmes for dfferent ε and varous networks archtectures. c. Average reward as a functon of grd resoluton ε. d. Tme complexty as a functon of grd resoluton ε. e. Tme complexty (sold lne) as a functon of dfferent network szes n. Quadratc approxmaton of the tme complexty s plotted as dashed lne. burgh. References Bellman, R.; Kalaba, R.; and Kotkn, B Polynomal approxmaton a new computatonal technque n dynamc programmng. Math. Comp. 17(8): Bellman, R. E ynamc programmng. Prnceton Press. Bertsekas,. P., and Tstskls, J. N Neuro-dynamc Programmng. Athena. Boutler, C.; earden, R.; and Goldszmdt, M Explotng structure n polcy constructon. In IJCAI. de Faras,. P., and Roy, B. V. 21. On constrant samplng for the lnear programmng approach to approxmate dynamc programmng. Mathematcs of Operatons Research submtted. de Faras,., and Van Roy, B. 23. The lnear programmng approach to approxmate dynamc programmng. Operatons Research 51(6). ean, T., and Kanazawa, K A model for reasonng about persstence and causaton. Computatonal Intellgence 5: Guestrn, C. E.; Koller,.; Parr, R.; and Venkataraman, S. 23. Effcent soluton algorthms for factored MPs. JAIR 19: Guestrn, C. E.; Koller,.; and Parr, R. 21a. Max-norm projectons for factored MPs. In IJCAI-1. Guestrn, C. E.; Koller,.; and Parr, R. 21b. Multagent plannng wth factored MPs. In NIPS-14. Hauskrecht, M., and Kveton, B. 23. Lnear program approxmatons to factored contnuous-state Markov decson processes. In NIPS-17. Koller,., and Parr, R Computng factored value functons for polces n structured MPs. In IJCAI-99. Kveton, B., and Hauskrecht, M. 24. Heurstc refnements of approxmate lnear programmng for factored contnuousstate Markov decson processes. In ICAPS-14. Roy, B. V Learnng and value functon approxmaton n complex decson problems. Ph.. ssertaton, MIT. Schuurmans,., and Patrascu, R. 22. rect valueapproxmaton for factored mdps. In NIPS-14. Schwetzer, P., and Sedmann, A Generalzed polynomal approxmatons n Markovan decson processes. Journal of Math. Analyss and Apps. 11:

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Efficient Reinforcement Learning in Factored MDPs

Efficient Reinforcement Learning in Factored MDPs Effcent Renforcement Learnng n Factored MDPs Mchael Kearns AT&T Labs mkearns@research.att.com Daphne Koller Stanford Unversty koller@cs.stanford.edu Abstract We present a provably effcent and near-optmal

More information

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and POLYSA: A Polynomal Algorthm for Non-bnary Constrant Satsfacton Problems wth and Mguel A. Saldo, Federco Barber Dpto. Sstemas Informátcos y Computacón Unversdad Poltécnca de Valenca, Camno de Vera s/n

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

Ring structure of splines on triangulations

Ring structure of splines on triangulations www.oeaw.ac.at Rng structure of splnes on trangulatons N. Vllamzar RICAM-Report 2014-48 www.rcam.oeaw.ac.at RING STRUCTURE OF SPLINES ON TRIANGULATIONS NELLY VILLAMIZAR Introducton For a trangulated regon

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

An MILP model for planning of batch plants operating in a campaign-mode

An MILP model for planning of batch plants operating in a campaign-mode An MILP model for plannng of batch plants operatng n a campagn-mode Yanna Fumero Insttuto de Desarrollo y Dseño CONICET UTN yfumero@santafe-concet.gov.ar Gabrela Corsano Insttuto de Desarrollo y Dseño

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

Credit Limit Optimization (CLO) for Credit Cards

Credit Limit Optimization (CLO) for Credit Cards Credt Lmt Optmzaton (CLO) for Credt Cards Vay S. Desa CSCC IX, Ednburgh September 8, 2005 Copyrght 2003, SAS Insttute Inc. All rghts reserved. SAS Propretary Agenda Background Tradtonal approaches to credt

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

Article received on July 15, 2008; accepted on April 03, 2009

Article received on July 15, 2008; accepted on April 03, 2009 AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton AsstO: Un Sstema de Recomendacones basado en MDPs Cualtatvos para la Operacón de Plantas Generadoras Alberto Reyes 1, L. Enrque

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

Research Article Enhanced Two-Step Method via Relaxed Order of α-satisfactory Degrees for Fuzzy Multiobjective Optimization

Research Article Enhanced Two-Step Method via Relaxed Order of α-satisfactory Degrees for Fuzzy Multiobjective Optimization Hndaw Publshng Corporaton Mathematcal Problems n Engneerng Artcle ID 867836 pages http://dxdoorg/055/204/867836 Research Artcle Enhanced Two-Step Method va Relaxed Order of α-satsfactory Degrees for Fuzzy

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts Power-of-wo Polces for Sngle- Warehouse Mult-Retaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)

More information

Dynamic Pricing for Smart Grid with Reinforcement Learning

Dynamic Pricing for Smart Grid with Reinforcement Learning Dynamc Prcng for Smart Grd wth Renforcement Learnng Byung-Gook Km, Yu Zhang, Mhaela van der Schaar, and Jang-Won Lee Samsung Electroncs, Suwon, Korea Department of Electrcal Engneerng, UCLA, Los Angeles,

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School Robust Desgn of Publc Storage Warehouses Yemng (Yale) Gong EMLYON Busness School Rene de Koster Rotterdam school of management, Erasmus Unversty Abstract We apply robust optmzaton and revenue management

More information

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

IMPACT ANALYSIS OF A CELLULAR PHONE

IMPACT ANALYSIS OF A CELLULAR PHONE 4 th ASA & μeta Internatonal Conference IMPACT AALYSIS OF A CELLULAR PHOE We Lu, 2 Hongy L Bejng FEAonlne Engneerng Co.,Ltd. Bejng, Chna ABSTRACT Drop test smulaton plays an mportant role n nvestgatng

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

Extending Probabilistic Dynamic Epistemic Logic

Extending Probabilistic Dynamic Epistemic Logic Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set

More information

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent

More information

HowHow to Find the Best Online Stock Broker

HowHow to Find the Best Online Stock Broker A GENERAL APPROACH FOR SECURITY MONITORING AND PREVENTIVE CONTROL OF NETWORKS WITH LARGE WIND POWER PRODUCTION Helena Vasconcelos INESC Porto hvasconcelos@nescportopt J N Fdalgo INESC Porto and FEUP jfdalgo@nescportopt

More information

Generalizing the degree sequence problem

Generalizing the degree sequence problem Mddlebury College March 2009 Arzona State Unversty Dscrete Mathematcs Semnar The degree sequence problem Problem: Gven an nteger sequence d = (d 1,...,d n ) determne f there exsts a graph G wth d as ts

More information

Period and Deadline Selection for Schedulability in Real-Time Systems

Period and Deadline Selection for Schedulability in Real-Time Systems Perod and Deadlne Selecton for Schedulablty n Real-Tme Systems Thdapat Chantem, Xaofeng Wang, M.D. Lemmon, and X. Sharon Hu Department of Computer Scence and Engneerng, Department of Electrcal Engneerng

More information

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems STAN-CS-73-355 I SU-SE-73-013 An Analyss of Central Processor Schedulng n Multprogrammed Computer Systems (Dgest Edton) by Thomas G. Prce October 1972 Techncal Report No. 57 Reproducton n whole or n part

More information

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008 Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

Revenue Management for a Multiclass Single-Server Queue via a Fluid Model Analysis

Revenue Management for a Multiclass Single-Server Queue via a Fluid Model Analysis OPERATIONS RESEARCH Vol. 54, No. 5, September October 6, pp. 94 93 ssn 3-364X essn 56-5463 6 545 94 nforms do.87/opre.6.35 6 INFORMS Revenue Management for a Multclass Sngle-Server Queue va a Flud Model

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

Realistic Image Synthesis

Realistic Image Synthesis Realstc Image Synthess - Combned Samplng and Path Tracng - Phlpp Slusallek Karol Myszkowsk Vncent Pegoraro Overvew: Today Combned Samplng (Multple Importance Samplng) Renderng and Measurng Equaton Random

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University Characterzaton of Assembly Varaton Analyss Methods A Thess Presented to the Department of Mechancal Engneerng Brgham Young Unversty In Partal Fulfllment of the Requrements for the Degree Master of Scence

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending Proceedngs of 2012 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 25 (2012) (2012) IACSIT Press, Sngapore Bayesan Network Based Causal Relatonshp Identfcaton and Fundng Success

More information

Application of Quasi Monte Carlo methods and Global Sensitivity Analysis in finance

Application of Quasi Monte Carlo methods and Global Sensitivity Analysis in finance Applcaton of Quas Monte Carlo methods and Global Senstvty Analyss n fnance Serge Kucherenko, Nlay Shah Imperal College London, UK skucherenko@mperalacuk Daro Czraky Barclays Captal DaroCzraky@barclayscaptalcom

More information

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance Calbraton Method Instances of the Cell class (one nstance for each FMS cell) contan ADC raw data and methods assocated wth each partcular FMS cell. The calbraton method ncludes event selecton (Class Cell

More information

v a 1 b 1 i, a 2 b 2 i,..., a n b n i.

v a 1 b 1 i, a 2 b 2 i,..., a n b n i. SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 455 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces we have studed thus far n the text are real vector spaces snce the scalars are

More information

CALL ADMISSION CONTROL IN WIRELESS MULTIMEDIA NETWORKS

CALL ADMISSION CONTROL IN WIRELESS MULTIMEDIA NETWORKS CALL ADMISSION CONTROL IN WIRELESS MULTIMEDIA NETWORKS Novella Bartoln 1, Imrch Chlamtac 2 1 Dpartmento d Informatca, Unverstà d Roma La Sapenza, Roma, Italy novella@ds.unroma1.t 2 Center for Advanced

More information

Loop Parallelization

Loop Parallelization - - Loop Parallelzaton C-52 Complaton steps: nested loops operatng on arrays, sequentell executon of teraton space DECLARE B[..,..+] FOR I :=.. FOR J :=.. I B[I,J] := B[I-,J]+B[I-,J-] ED FOR ED FOR analyze

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

Product-Form Stationary Distributions for Deficiency Zero Chemical Reaction Networks

Product-Form Stationary Distributions for Deficiency Zero Chemical Reaction Networks Bulletn of Mathematcal Bology (21 DOI 1.17/s11538-1-9517-4 ORIGINAL ARTICLE Product-Form Statonary Dstrbutons for Defcency Zero Chemcal Reacton Networks Davd F. Anderson, Gheorghe Cracun, Thomas G. Kurtz

More information

When Network Effect Meets Congestion Effect: Leveraging Social Services for Wireless Services

When Network Effect Meets Congestion Effect: Leveraging Social Services for Wireless Services When Network Effect Meets Congeston Effect: Leveragng Socal Servces for Wreless Servces aowen Gong School of Electrcal, Computer and Energy Engeerng Arzona State Unversty Tempe, AZ 8587, USA xgong9@asuedu

More information

2008/8. An integrated model for warehouse and inventory planning. Géraldine Strack and Yves Pochet

2008/8. An integrated model for warehouse and inventory planning. Géraldine Strack and Yves Pochet 2008/8 An ntegrated model for warehouse and nventory plannng Géraldne Strack and Yves Pochet CORE Voe du Roman Pays 34 B-1348 Louvan-la-Neuve, Belgum. Tel (32 10) 47 43 04 Fax (32 10) 47 43 01 E-mal: corestat-lbrary@uclouvan.be

More information

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

Ants Can Schedule Software Projects

Ants Can Schedule Software Projects Ants Can Schedule Software Proects Broderck Crawford 1,2, Rcardo Soto 1,3, Frankln Johnson 4, and Erc Monfroy 5 1 Pontfca Unversdad Católca de Valparaíso, Chle FrstName.Name@ucv.cl 2 Unversdad Fns Terrae,

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

Fault tolerance in cloud technologies presented as a service

Fault tolerance in cloud technologies presented as a service Internatonal Scentfc Conference Computer Scence 2015 Pavel Dzhunev, PhD student Fault tolerance n cloud technologes presented as a servce INTRODUCTION Improvements n technques for vrtualzaton and performance

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining Rsk Model of Long-Term Producton Schedulng n Open Pt Gold Mnng R Halatchev 1 and P Lever 2 ABSTRACT Open pt gold mnng s an mportant sector of the Australan mnng ndustry. It uses large amounts of nvestments,

More information

A Lyapunov Optimization Approach to Repeated Stochastic Games

A Lyapunov Optimization Approach to Repeated Stochastic Games PROC. ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, OCT. 2013 1 A Lyapunov Optmzaton Approach to Repeated Stochastc Games Mchael J. Neely Unversty of Southern Calforna http://www-bcf.usc.edu/

More information

Chapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT

Chapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT Chapter 4 ECOOMIC DISATCH AD UIT COMMITMET ITRODUCTIO A power system has several power plants. Each power plant has several generatng unts. At any pont of tme, the total load n the system s met by the

More information

Traffic State Estimation in the Traffic Management Center of Berlin

Traffic State Estimation in the Traffic Management Center of Berlin Traffc State Estmaton n the Traffc Management Center of Berln Authors: Peter Vortsch, PTV AG, Stumpfstrasse, D-763 Karlsruhe, Germany phone ++49/72/965/35, emal peter.vortsch@ptv.de Peter Möhl, PTV AG,

More information

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model

More information

How To Solve An Onlne Control Polcy On A Vrtualzed Data Center

How To Solve An Onlne Control Polcy On A Vrtualzed Data Center Dynamc Resource Allocaton and Power Management n Vrtualzed Data Centers Rahul Urgaonkar, Ulas C. Kozat, Ken Igarash, Mchael J. Neely urgaonka@usc.edu, {kozat, garash}@docomolabs-usa.com, mjneely@usc.edu

More information

Efficient Striping Techniques for Variable Bit Rate Continuous Media File Servers æ

Efficient Striping Techniques for Variable Bit Rate Continuous Media File Servers æ Effcent Strpng Technques for Varable Bt Rate Contnuous Meda Fle Servers æ Prashant J. Shenoy Harrck M. Vn Department of Computer Scence, Department of Computer Scences, Unversty of Massachusetts at Amherst

More information

Heuristic Static Load-Balancing Algorithm Applied to CESM

Heuristic Static Load-Balancing Algorithm Applied to CESM Heurstc Statc Load-Balancng Algorthm Appled to CESM 1 Yur Alexeev, 1 Sher Mckelson, 1 Sven Leyffer, 1 Robert Jacob, 2 Anthony Crag 1 Argonne Natonal Laboratory, 9700 S. Cass Avenue, Argonne, IL 60439,

More information

Cost Minimization using Renewable Cooling and Thermal Energy Storage in CDNs

Cost Minimization using Renewable Cooling and Thermal Energy Storage in CDNs Cost Mnmzaton usng Renewable Coolng and Thermal Energy Storage n CDNs Stephen Lee College of Informaton and Computer Scences UMass, Amherst stephenlee@cs.umass.edu Rahul Urgaonkar IBM Research rurgaon@us.bm.com

More information

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME, ISSUE, FEBRUARY ISSN 77-866 Logcal Development Of Vogel s Approxmaton Method (LD- An Approach To Fnd Basc Feasble Soluton Of Transportaton

More information

Preventive Maintenance and Replacement Scheduling: Models and Algorithms

Preventive Maintenance and Replacement Scheduling: Models and Algorithms Preventve Mantenance and Replacement Schedulng: Models and Algorthms By Kamran S. Moghaddam B.S. Unversty of Tehran 200 M.S. Tehran Polytechnc 2003 A Dssertaton Proposal Submtted to the Faculty of the

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

EVERY year, seasonal hurricanes threaten coastal areas.

EVERY year, seasonal hurricanes threaten coastal areas. 1 Strategc Stockplng of Power System Supples for Dsaster Recovery Carleton Coffrn, Pascal Van Hentenryck, and Russell Bent Abstract Ths paper studes the Power System Stochastc Storage Problem (PSSSP),

More information

Survey on Virtual Machine Placement Techniques in Cloud Computing Environment

Survey on Virtual Machine Placement Techniques in Cloud Computing Environment Survey on Vrtual Machne Placement Technques n Cloud Computng Envronment Rajeev Kumar Gupta and R. K. Paterya Department of Computer Scence & Engneerng, MANIT, Bhopal, Inda ABSTRACT In tradtonal data center

More information

Ant Colony Optimization for Economic Generator Scheduling and Load Dispatch

Ant Colony Optimization for Economic Generator Scheduling and Load Dispatch Proceedngs of the th WSEAS Int. Conf. on EVOLUTIONARY COMPUTING, Lsbon, Portugal, June 1-18, 5 (pp17-175) Ant Colony Optmzaton for Economc Generator Schedulng and Load Dspatch K. S. Swarup Abstract Feasblty

More information

8 Algorithm for Binary Searching in Trees

8 Algorithm for Binary Searching in Trees 8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the

More information

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS Chrs Deeley* Last revsed: September 22, 200 * Chrs Deeley s a Senor Lecturer n the School of Accountng, Charles Sturt Unversty,

More information

Optimal resource capacity management for stochastic networks

Optimal resource capacity management for stochastic networks Submtted for publcaton. Optmal resource capacty management for stochastc networks A.B. Deker H. Mlton Stewart School of ISyE, Georga Insttute of Technology, Atlanta, GA 30332, ton.deker@sye.gatech.edu

More information

Availability-Based Path Selection and Network Vulnerability Assessment

Availability-Based Path Selection and Network Vulnerability Assessment Avalablty-Based Path Selecton and Network Vulnerablty Assessment Song Yang, Stojan Trajanovsk and Fernando A. Kupers Delft Unversty of Technology, The Netherlands {S.Yang, S.Trajanovsk, F.A.Kupers}@tudelft.nl

More information

Enabling P2P One-view Multi-party Video Conferencing

Enabling P2P One-view Multi-party Video Conferencing Enablng P2P One-vew Mult-party Vdeo Conferencng Yongxang Zhao, Yong Lu, Changja Chen, and JanYn Zhang Abstract Mult-Party Vdeo Conferencng (MPVC) facltates realtme group nteracton between users. Whle P2P

More information

Distributed Optimal Contention Window Control for Elastic Traffic in Wireless LANs

Distributed Optimal Contention Window Control for Elastic Traffic in Wireless LANs Dstrbuted Optmal Contenton Wndow Control for Elastc Traffc n Wreless LANs Yalng Yang, Jun Wang and Robn Kravets Unversty of Illnos at Urbana-Champagn { yyang8, junwang3, rhk@cs.uuc.edu} Abstract Ths paper

More information

NONLINEAR OPTIMIZATION FOR PROJECT SCHEDULING AND RESOURCE ALLOCATION UNDER UNCERTAINTY

NONLINEAR OPTIMIZATION FOR PROJECT SCHEDULING AND RESOURCE ALLOCATION UNDER UNCERTAINTY NONLINEAR OPTIMIZATION FOR PROJECT SCHEDULING AND RESOURCE ALLOCATION UNDER UNCERTAINTY A Dssertaton Presented to the Faculty of the Graduate School of Cornell Unversty In Partal Fulfllment of the Requrements

More information

Abstract. Clustering ensembles have emerged as a powerful method for improving both the

Abstract. Clustering ensembles have emerged as a powerful method for improving both the Clusterng Ensembles: {topchyal, Models jan, of punch}@cse.msu.edu Consensus and Weak Parttons * Alexander Topchy, Anl K. Jan, and Wllam Punch Department of Computer Scence and Engneerng, Mchgan State Unversty

More information

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy 4.02 Quz Solutons Fall 2004 Multple-Choce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multple-choce questons. For each queston, only one of the answers s correct.

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

OPTIMAL INVESTMENT POLICIES FOR THE HORSE RACE MODEL. Thomas S. Ferguson and C. Zachary Gilstein UCLA and Bell Communications May 1985, revised 2004

OPTIMAL INVESTMENT POLICIES FOR THE HORSE RACE MODEL. Thomas S. Ferguson and C. Zachary Gilstein UCLA and Bell Communications May 1985, revised 2004 OPTIMAL INVESTMENT POLICIES FOR THE HORSE RACE MODEL Thomas S. Ferguson and C. Zachary Glsten UCLA and Bell Communcatons May 985, revsed 2004 Abstract. Optmal nvestment polces for maxmzng the expected

More information

Towards a Formal Framework for Multi-Objective Multi-Agent Planning

Towards a Formal Framework for Multi-Objective Multi-Agent Planning Towards a Formal Framework for Mult-Objectve Mult-Agent Plannng Abdel-Illah Mouaddb, Matheu Boussard, Maroua Bouzd Maréchal Jun, Campus II BP 5186 Computer Scence Department 14032 Caen Cedex, France (mouaddb,mboussar,bouzd@nfo.uncaen.fr

More information

Complex Service Provisioning in Collaborative Cloud Markets

Complex Service Provisioning in Collaborative Cloud Markets Melane Sebenhaar, Ulrch Lampe, Tm Lehrg, Sebastan Zöller, Stefan Schulte, Ralf Stenmetz: Complex Servce Provsonng n Collaboratve Cloud Markets. In: W. Abramowcz et al. (Eds.): Proceedngs of the 4th European

More information

Dynamic Constrained Economic/Emission Dispatch Scheduling Using Neural Network

Dynamic Constrained Economic/Emission Dispatch Scheduling Using Neural Network Dynamc Constraned Economc/Emsson Dspatch Schedulng Usng Neural Network Fard BENHAMIDA 1, Rachd BELHACHEM 1 1 Department of Electrcal Engneerng, IRECOM Laboratory, Unversty of Djllal Labes, 220 00, Sd Bel

More information

行 政 院 國 家 科 學 委 員 會 補 助 專 題 研 究 計 畫 成 果 報 告 期 中 進 度 報 告

行 政 院 國 家 科 學 委 員 會 補 助 專 題 研 究 計 畫 成 果 報 告 期 中 進 度 報 告 行 政 院 國 家 科 學 委 員 會 補 助 專 題 研 究 計 畫 成 果 報 告 期 中 進 度 報 告 畫 類 別 : 個 別 型 計 畫 半 導 體 產 業 大 型 廠 房 之 設 施 規 劃 計 畫 編 號 :NSC 96-2628-E-009-026-MY3 執 行 期 間 : 2007 年 8 月 1 日 至 2010 年 7 月 31 日 計 畫 主 持 人 : 巫 木 誠 共 同

More information

ECE544NA Final Project: Robust Machine Learning Hardware via Classifier Ensemble

ECE544NA Final Project: Robust Machine Learning Hardware via Classifier Ensemble 1 ECE544NA Fnal Project: Robust Machne Learnng Hardware va Classfer Ensemble Sa Zhang, szhang12@llnos.edu Dept. of Electr. & Comput. Eng., Unv. of Illnos at Urbana-Champagn, Urbana, IL, USA Abstract In

More information

Durham Research Online

Durham Research Online Durham Research Onlne Deposted n DRO: 9 March 21 Verson of attached le: Accepted Verson Peer-revew status of attached le: Peer-revewed Ctaton for publshed tem: Matthews, P. C. and Coates, G. (27) 'Stochastc

More information