Article received on July 15, 2008; accepted on April 03, 2009
|
|
|
- Jemimah Davis
- 10 years ago
- Views:
Transcription
1 AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton AsstO: Un Sstema de Recomendacones basado en MDPs Cualtatvos para la Operacón de Plantas Generadoras Alberto Reyes 1, L. Enrque Sucar 2 and Eduardo F. Morales 2 1 Insttuto de Investgacones Eléctrcas; Av. Reforma 113, Palmra, Cuernavaca, Morelos, 62490, Méxco; [email protected] 2 INAOE; Lus Enrque Erro 1, Sta. Ma. Tonantzntla, Puebla 72840, Méxco; {esucar, [email protected]} Artcle receved on July 15, 2008; accepted on Aprl 03, 2009 Abstract Ths paper proposes a novel and practcal model-based learnng approach wth teratve refnement for solvng contnuous (and hybrd) Markov decson processes. Intally, an approxmate model s learned usng conventonal samplng methods and solved to obtan a polcy. Iteratvely, the approxmate model s refned usng varance n the utlty values as partton crteron. In the learnng phase, ntal reward and transton functons are obtaned by samplng the state acton space. The samples are used to nduce a decson tree predctng reward values from whch an ntal partton of the state space s bult. The samples are also used to nduce a factored MDP. The state abstracton s then refned by splttng states only where the splt s locally mportant. The man contrbutons of ths paper are the use of samplng to construct an abstracton, and a local refnement process of the state abstracton based on utlty varance. The proposed technque was tested n AsstO, an ntellgent recommender system for power plant operaton, where we solved two versons of a complex hybrd contnuous-dscrete problem. We show how our technque approxmates a soluton even n cases where standard methods explode computatonally. Keywords: Recommender systems, power plants, Markov decson processes, abstractons. Resumen Este artículo propone una técnca novedosa y práctca de aprendzaje basada en modelos con refnamento teratvo para resolver procesos de decsón de Markov (MDPs) contnuos. Incalmente, se aprende un modelo aproxmado usando métodos de muestreo convenconales, el cual se resuelve para obtener una polítca. Iteratvamente, el modelo aproxmado se refna con base en la varanza de los valores de la utldad esperada. En la fase de aprendzaje, se obtenen las funcones de recompensa nmedata y de transcón medante muestras del tpo estado-accón. Éstas prmero se usan para nducr un árbol de decsón que predce los valores de recompensa y a partr del cual se construye una partcón ncal del espaco de estados. Posterormente, las muestras tambén se usan para nducr un MDP factorzado. Fnalmente, la abstraccón de espaco de estados resultante se refna dvdendo aquellos estados donde pueda haber cambos en la polítca. Las contrbucones prncpales de este trabajo son el uso de datos para construr una abstraccón ncal, y el proceso de refnamento local basado en la varanza de la utldad. La técnca propuesta fue probada en AsstO, un sstema ntelgente de recomendacones para la operacón de plantas generadoras de electrcdad, donde resolvmos dos versones de un problema complejo con varables híbrdas contnuas y dscretas. Aquí mostramos como nuestra técnca aproxma una solucón aun en casos donde los métodos estándar explotan computaconalmente. Palabras clave: Sstemas de recomendacones, plantas generadoras, procesos de decsón de Markov, abstraccones. 1 Introducton Markov Decson Processes (MDPs) [18] have developed as a standard method for decson-theoretc plannng. Tradtonal MDP soluton technques have the drawback that they requre an explct state representaton, lmtng ther applcablty to real-world problems. Factored representatons [6] help to address ths drawback va compactly specfyng state-spaces n factored form by usng dynamc Bayesan networks or decson dagrams. Gven that
2 6 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales algorthms for plannng usng MDPs stll run n tme polynomal n the sze of the state space, they do not guarantee that a factored model for hgh dmensonal domans wll be solved effcently. Abstracton and aggregaton methods gve us the tools to deal wth these dffcultes so that plannng n real world problems can become tractable. However, these technques generally apply only to problems wth dscrete state and acton spaces. The problem wth contnuous MDPs (CMDPs) s that f the contnuous space s dscretzed to fnd a soluton, the dscretzaton causes yet another level of exponental blow up. Ths curse of dmensonalty has lmted the use of the MDP framework, and overcomng t has become a relevant topc of research. Two recent methods to solve CMDPs are grd-based MDP dscretzatons and parametrc approxmatons. The dea behnd the grd-based MDPs dscretzatons technque s to dscretze the state-space n a set of grd ponts and approxmate value functons over such ponts. Unfortunately, classc grd algorthms scale up exponentally wth the number of state varables [5]. An alternatve way to solve a contnuous-state MDP s to approxmate the optmal value functon V() s wth an approprate parametrc functon model [4]. The parameters of the model are ftted teratvely by applyng one step Bellman backups to a fnte set of state ponts arranged on a fxed grd or obtaned through Monte Carlo samplng. A least squares crteron s used to ft the parameters of the model. In addton to parallel updates and optmzatons, on-lne update schemes based on gradent decent [4] can be used to optmze the parameters. The dsadvantages of these methods are ther nstablty and possble dvergence [3]. Several authors, e.g., [17], use the notons of abstracton and aggregaton to group states that are smlar wth respect to certan problem characterstcs to further reduce the complexty of the representaton or the soluton. Feng [11] proposes a state aggregaton approach for explotng the structure of MDPs wth contnuous varables. The state space s dynamcally parttoned nto regons where the value functon s the same throughout each regon. L et al. [15] address hybrd state spaces usng a dscretzaton-free approach called lazy approxmaton and present a comparson wth the Feng s work fndng that ther method produced reasonable and consstent results n a more complex verson of the planet rover doman (also used by Feng). Hauskrech [13] shows that approxmate lnear programmng s able to solve factored contnuous MDPs. Smlarly, Guestrn [12] presents a framework to model and solve factored MDPs for both dscrete and contnuous problems n collaboratve settngs. Our approach s related to ths work; however t dffers on several aspects. Frst, t s based on qualtatve models, whch are partcularly useful for domans wth contnuous state varables. It also dffers n the way n whch the abstracton s bult. We use tranng data to learn a decson tree for the reward functon, from whch we deduce an abstracton called qualtatve states. There has been other work on varable-resoluton grds [16,7], however, most of them start from a unform grd. The dea of refnng an ntal abstracton for dscrete state spaces has been also suggested n [1], however we ntroduce a dfferent refnement crtera. The ntal abstracton s refned and mproved va a local teratve process. States wth hgh varance n ther value wth respect to neghborng states are parttoned, and the MDP s solved locally to mprove the polcy. At each stage n the refnement process, only one state s parttoned, and the process fnshes when any potental partton does not change the polcy. In our approach, the reward functon and transton model are learned from a random exploraton of the envronment, and can work wth both, pure contnuous spaces; or hybrd, wth contnuous and dscrete varables. Algorthms such as lke Dyna-Q or prortzed sweepng (e.g., see [21]) from the renforcement learnng communty, have been used to learn a transton model whle explorng the envronment. In contrast to these and other prevous approaches, our method learns automatcally both an abstracton and a model by just samplng the envronment. Ths abstracton s teratvely refned based on local nformaton, makng the refnement very effcent. Thus, our method s, on one hand, smpler than other abstracton and refnement approaches; and on the other hand, t automatcally bulds the model and abstracton. The man contrbutons are the use of samplng to construct an abstracton, and a local refnement of the ntal abstracton based on utlty varance. We have tested our method n a hgh-dmensonal problem n the power plant doman, n whch the state space can be ether contnuous or hybrd contnuous-dscrete. We show how our technque approxmates a soluton even n cases where standard methods explode computatonally. The rest of the paper s organzed as follows. The next secton descrbes our doman of nterest and the assocated plannng problem. Secton 3 gves a bref ntroducton to MDPs and ther factored representaton. Secton 4 develops the abstracton process and a procedure to learn such abstracton from data. Secton 5 explans the
3 AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton 7 refnement stage. Secton 6 presents AsstO, a recommender system for power plant operaton, whch mplements the noton of qualtatve MDPs n ts plannng subsystem; and the emprcal evaluaton s descrbed. We conclude wth a summary and drectons for future work. 2 Applcaton Doman Our doman of nterest les on the steam generaton system of a combned-cycle power plant. Ths system, whch s amed to provde superheated steam to a steam turbne, s bascally composed by a recovery steam generator, a recrculaton pump, control valves and nterconnecton ppes. A heat recovery steam generator (HRSG) s a process machnery capable of recoverng resdual energy from a gas turbne exhaust gases to generate hgh pressure (Pd) steam n a specal tank (steam drum). The recrculaton pump s a devce that extracts resdual water from the steam drum to keep a water supply n the HRSG (Ffw). The result of ths process s a hgh-pressure steam flow (Fms) that keeps runnng a steam turbne to produce electrc energy (g) n a power generator. The man control elements assocated are the feed-water valve (fwv) and the man steam valve (msv). The complete process control doman s shown n fgure 1. Durng normal operaton, a three-element feed water control system (ecs) commands the feed-water control valve (fwv) to regulate the level (dl) and pressure (pd) n the drum. However, ths tradtonal controller does not consder the possblty of falures n the control loop (valves, nstrumentaton, or any other process devces). Furthermore, t gnores whether the outcomes of executng a decson wll help n the future to ncrease the steam drum lfetme, securty, and productvty. So, the problem s to obtan a functon that maps plant states to recommendatons that consders all these aspects. Under the MDP framework, the potental falures are consdered mplctly n a transton functon, and the securty and productvty goals are ncluded n the reward. Thus, MDPs provde an adequate model for ths problem; however, standard solutons explode computatonally and can not deal wth contnuous varables. Next we gve a bref revew of MDPs, and then we present our method for solvng contnuous and complex MDPs, requred for the power plan doman. Fg. 1. A smplfed dagram of steam generaton process. Amed to provde superheated steam to a turbne, the steam generaton system s bascally composed of a recovery steam generator, a recrculaton pump, control valves and nterconnecton ppes
4 8 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales 3 Factored Markov Decson Processes A Markov decson process (MDP) [18] models a sequental decson problem, n whch a system evolves n tme and s controlled by an agent. The system dynamcs s governed by a probablstc transton functon Φ that maps states S and actons A to new states S. At each tme, an agent receves a reward R that depends on the current state s and the appled acton a. Thus, they solve the problem of fndng a recommendaton strategy or polcy that maxmzes the expected reward over tme and also deals wth the uncertanty on the effects of an acton. Formally, an MDP s a tuple M =< S, A,Φ, R >, where s a fnte set of states {s,, s }. A s a fnte S 1 n set of actons for all states. Φ : A S S s the state transton functon specfed as a probablty dstrbuton. The probablty of reachng state s by performng acton a n state s s wrtten as Φ ( ass,, ). R: S A R s the reward functon. R( sa, ) s the reward that the agent receves f t takes acton a n state s. For the dscrete dscounted nfnte-horzon case wth any gven dscount factorγ, there s a polcy π that s optmal regardless of the startng state and that satsfes the Bellman equaton [2]: π π V () s = max {R( s, a) + γ Φ ( a, s, s ) V ( s )} a s S In Contnuous Markov Decson Processes (CMDPs) the optmal value functon satsfes the Bellman fxed pont equaton: V ( s ) = max [ R ( s, a ) + γ Φ ( a, s, s ) V ( s ) ds ] a (2) s Two methods for solvng these equatons and fndng an optmal polcy for an MDP are: (a) dynamc programmng [18] and (b) lnear programmng. In a factored MDP, the set of states s descrbed va a set of random varables X = {X 1,, X n }, where each X takes on values n some fnte doman Dom( X ). A state s defnes a value x Dom( X ) for each varable X. The transton model can be exponentally large f t s explctly represented as matrces, however, the frameworks of dynamc Bayesan networks (DBN) [10] and decson trees [19] gve us the tools to descrbe the transton model and the reward functon concsely. (1) Fg. 2. A smple DBN wth 5 state varables for one acton (left). Influence Dagram denotng a reward functon (center). Structured condtonal reward (CR) represented as a bnary decson tree (rght)
5 AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton 9 Let X denote a varable at the current tme and X ' the varable at the next step. The transton graph of a DBN s a two layer drected acyclc graph G whose nodes are{ X,, X, X ',, X ' n }, see fgure 2 (left). Each node ' T 1 n 1 X s assocated wth a condtonal probablty dstrbuton (CPD) P ( X ' Parents( X ' )), whch s usually represented by a matrx (condtonal probablty table) or more compactly by a decson tree. The transton probablty Φ ( as,, s ) s then defned to be ΠPΦ ( x' u ) where u represents the values of the varables n Parents( X ' ). The next value X', often depends on a small subset of varables (Parents(X')) smplfyng the transton functon. The reward assocated wth a state often depends only on the values of certan features of the state. The relatonshp between rewards and state varables can be represented wth value nodes n nfluence dagrams, as shown n fgure 2 (center). The condtonal reward tables (CRT) for such a node s a table that assocates a reward wth every combnaton of values for ts parents n the graph. Ths table s locally exponental n the number of relevant varables. Although n the worst case the CRT wll take exponental space to store the reward functon, n many cases the reward functon exhbts structure allowng t to be represented compactly usng decson trees or graphs, as shown n fgure 2 (rght). 4 Qualtatve MDPs Although factored MDPs provde mportant reductons n the representaton of transton and reward functons, n cases of problems wth hgh dmensonalty there can stll be a large number of states nvolved. On the other hand, defnng a sutable partton of the state space by a human expert s not an easy task. In ths paper, we propose a novel approach to automatcally defne abstract states, and a procedure to approxmate a decson model from data. In the proposed method, we gather nformaton about the rewards and the dynamcs of the system by explorng the envronment. Ths nformaton s used to buld a decson tree [20] representng a small set of abstract states (called the qualtatve partton) wth equvalent rewards, and then s used to learn a probablstc transton functon usng a Bayesan network learnng algorthm [9]. The resultng approxmate MDP model can be solved usng tradtonal dynamc programmng algorthms Qualtatve states A qualtatve state 1 (or q state), q, s a set of states (or a partton of the state space n the contnuous case) that share smlar mmedate rewards. A qualtatve state space, Q, s a set of q states: q1, q2,.. qn, also called the qualtatve partton. Smlarly to the reward functon n a factored MDP, the qualtatve constrans that dstngush regons of the state space wth dfferent reward values, can be represented by a decson tree called Reward Decson Tree (RDT). Snce a qualtatve state maps drectly a reward value, a qualtatve partton Q can also be represented by a bnary decson tree (Q tree). In order to obtan a Q tree, a reward decson tree (RDT) s frst nduced from smulated data and then transformed by smply renamng the reward values to q-state labels. Each leave n the Q tree s labeled wth a new qualtatve state. Even for leaves wth the same reward value, we assgn a dfferent qualtatve state value. Ths produces more states but at the same tme creates more gudance that helps to produce more adequate polces. Fgure 3 llustrates ths tree transformaton for a smple two dmensonal case that represents a Temperature-Volume dagram for an deal gas. Φ 1 Although other authors have used the term qualtatve n a temporal sense, ths work refers to qualtatve n a relatonal spatal sense.
6 10 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales Fg. 3. Transformaton of the reward decson tree (left) nto a Q-tree (rght). Internal nodes n both trees represent contnuous varables and edges evaluate whether ths varable s less or greater than a partcular bound. Leaf nodes n the RDT represent rewards, and n the Q-tree are q-states Each branch n the Q tree denotes a set of constrants for each q state, q, that bounds a contnuous regon. For example, a qualtatve state could be a regon n a Temperature Volume dagram bounded by the constrants: Temp > 306 and Vol > 48. Fgure 4 llustrates the constrants assocated to the example presented above, and ts representaton n a 2-dmensonal space. It s evdent that a qualtatve state can cover a large number of states (f we consder a fne dscretzaton) wth smlar propertes. Fg. 4. In a Q-tree (left), branches are constrants and leaves are qualtatve states. A graphcal representaton of the tree s also shown (rght). Note that when an upper or lower varable bound s nfnte, t must be understood as the upper or lower varable bound n the doman 4.2. Qualtatve MDP Model Specfcaton We can defne a qualtatve MDP as an MDP wth a qualtatve state space. A hybrd (or qualtatve dscrete) MDP s a factored MDP wth a set of qualtatve and dscrete factors. In ths case, we have a set of dscrete varables, and the qualtatve state space Q, whch s an addtonal factor that concentrates all the contnuous varables. Intally, only the contnuous varables nvolved n the reward functon are consdered n the learnng algorthm. Other contnues varables are dscretzed arbtrarly; however, ths ntal dscretzaton s mproved n the refnement stage,
7 AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton 11 as descrbed n Secton 5. Thus, a hybrd qualtatve-dscrete state s descrbed n a factored form as s h = { X 1,, Xn, Q}, where X 1, X, n are the dscrete factors, and Q s a factor that represents the relevant contnuous dmensons n the reward functon Learnng Qualtatve MDPs The Qualtatve MDP model s learned from data based on a random exploraton of the envronment that allows recordng state transtons, actons taken, and the assocated reward values. To better understand how a tranng data set s recorded, consder the b-dmensonal doman descrbed above, but now assumng that the system state can be modfed by changng the temperature and volume values. The possble actons are ncrease/decrease the temperature, ncrease/decrease the volume, and do nothng (the null acton). Fgure 5 shows graphcally a possble data trace produced by the random applcaton of dfferent actons on the system. Each dot n the fgure represents a partcular state (volume and temperature) that results after the applcaton of a partcular acton. Each state s assocated also to a reward value, whch corresponds to the dfferent regons n fgure 5. Thus, after explorng the envronment we obtan a data set that records for each acton, sequentally from t = 1 to N, the acton, resultng state and reward. So for the gas example, each data record wll contan: Data =(Temperature, Volume, Acton, Reward). From ths data set, a decson model s obtaned, and then solved usng the value teraton algorthm. Formally, ths dea can be descrbed as follows. Gven a set of state transtons represented as a set of random j varables, O = { Xt, A, X t+ 1 }, for j = 12,,..., N, for each state and acton A executed by an agent, and a j reward (or cost) R assocated to each transton, we learn a qualtatve factored MDP model: 1. From a set of smulated transtons { OR, } nduce a reward decson tree, RDT, that predcts the reward functon R n terms of contnuous and dscrete state varables, X 1, X, k, Q. For the gas example, ths tree corresponds to the one shown n Fgure 3, left. 2. Obtan from the decson tree ( RDT ) the set of constrants for the contnuous varables relevant to determne the qualtatve states (q states) n the form of a Q-tree. In terms of the doman varables, we obtan a new varable Q representng the reward-based qualtatve state space whose values are the q states. Ths transformaton s llustrated n Fgure 3 for the deal gas example, wth the resultng Q-tree (rght). Ths Q-Tree s shown agan n Fgure 4 (left), whch also shows the qualtatve partton obtaned (rght), where the state space s dvded nto 5 qualtatve states, q0, q 1, q4. 3. Qualfy data from the orgnal sample n such a way that the new set of attrbutes s the Q varable, the remanng dscrete and contnuous varables not ncluded n the decson tree, and the acton A. The contnuous varables not consdered n the RDT tree are dscretzed n a coarse way wth equal sze ntervals (ths ntal dscretzaton s mproved n the refnement stage). Ths transformed data set s called the qualfed data set. For the example, the state n each record n the data set wll be represented by the correspondng qualtatve state, q q 0 4, nstead of the numerc values of the orgnal state varables, Vol. and Temp. These q states are determned n terms of the partton of the state space, as shown n Fgure Format the qualfed data set n such a way that the attrbutes follow a temporal causal orderng. For example varable Qt must be set before Q t + 1, X 1 t before X 1 t+ 1, and so on. The whole set of attrbutes should be the varable Q n tme t, the remanng system varables, X 1, X, k, n tme t, the varable Q n tme t + 1, the remanng system varables n tme t + 1, and the acton A. Thus, for the gas example, each record n the qualfed data set wll be: ( q, a, r) t, where q s the q state, a s the acton, r s the reward, and t s tme, from t = 0 to t = N ( N s the number of steps n the exploraton).
8 12 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales 5. Prepare data for the nducton of a 2-stage dynamc Bayesan net. Accordng to the acton space dmenson, splt the qualfed data set nto A sets of samples, one for each acton. In the gas case there wll be 5 sets, one for each possble acton: ncrease/decrease the temperature, ncrease/decrease the volume, and do nothng. 6. Induce the transton model for each acton, A j, usng a Bayesan network learnng algorthm [9]. So for our runnng example, we wll nduce a DBN to represent the transton model for each of the 5 actons, all n terms of the q state varables. Fg. 5. Exploraton trace for the deal gas doman. Each dot n the fgure represents a data pont n the exploraton, wth ts correspondng state (Vol. and Temp.), reward (determned by the regon), and acton appled to reach ths state. Thus, by applyng random actons on the system, t s possble to capture the effects of these actons (new states) and the mmedate reward receved per state At the end of ths process we have learned a qualtatve MDP model of the problem based on a random exploraton of the envronment, and the qualtatve partton obtaned from the reward decson tree. In ths model, the transton functon s represented as a set of 2 stage DBNs, one per acton, and the reward by a decson tree; both n terms of the q state varables. As mentoned before, f there are addtonal varables that are not part of the reward functon, these are just ncorporated nto the model. Ths ntal model represents a hgh-level abstracton of the contnuous state space and can be solved effcently usng a standard technque, such as value teraton, to obtan the optmal polcy. For nstance, n the deal gas example, the resultng polcy wll gve the optmal acton for each q-state, q q 0 4. Ths approach has been successfully appled n several domans; however, n some cases the ntal abstracton can mss some relevant detals of the doman and consequently produce sub-optmal polces. We mprove ths ntal partton through a refnement stage descrbed n the next secton. 5 Qualtatve State Refnement We have desgned a value-based algorthm that recursvely selects and parttons abstract states wth hgh utlty varance. If there are contnuous dmensons that were not ncluded n the ntal Q-tree (because they do not affect the reward), these are ncorporated at ths stage. For ths, we smply extend the Q-tree wth the addtonal dmensons wth an ntal, coarse dscretzaton. Before we see n detal the refnement algorthm, we need to defne some relevant concepts.
9 The border of state, AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton 13 s, s defned as the set of states, S { s s } j =,,, such that s S s a neghbor of ; that s, they are adjacent n at least one dmenson. A regon s defned as r = s S, that s, a state and ts border states. For nstance, n the deal gas example, and are the border states of, and 1 q0 q4 3 fgure 4. The utlty varance of a regon, r, that corresponds to state s, s defned as: S 1 n 2 2 r = ( ) Vqk rn n k = 1 n j k j q r { q q q } s =,,, see V (3) where n s the number of border states for s, V s the value of each state, s, n the regon, and s the qk average value of the states n the regon. The value for each state s obtaned when we solve the qualtatve MDP, as descrbed n the prevous secton. The utlty gradent gves the dfference n utlty between one state, s, and one of ts border states, sk, and t s defned as follows: k V r n δ = V V (4) k of ts The hyper-volume of a state, d dmensons: s, corresponds to the space occuped by the state and ts obtaned by the product hv d = x (5) l= 1 l where x l s the value for each dmenson l. The refnement algorthm has as nput the ntal qualtatve partton obtaned n the learnng stage and an ntal soluton for ths qualtatve MDP. It also requres a mnmum hyper-volume for a state defned by the user, as ths depends on the applcaton. It proceeds as follows: 1. Intalze all the states as unmarked. 2. Whle there s an unmarked qualtatve state greater than the mnmum hyper-volume: (a) Save a copy of the prevous MDP (before the partton) and ts soluton. (b) Obtan the utlty varance for each state n ts correspondng regon. (c) Select a qualtatve state wth the hghest varance n ts utlty value wth respect to ts neghbors, name t q. (d) For the qualtatve state q select a contnuous dmenson to splt t, from ( x0, x 1,, xn ), such that t has the hghest utlty gradent wth respect to ts border states along ths dmenson. (e) Bsect the q-state q over the selected dmenson (dvde the state n two). (f) Solve the new MDP, whch ncludes the new partton, usng value teraton. (g) If the new MDP has the same polcy as before, mark the orgnal state q before the partton, and return to the prevous MDP, otherwse, accept the refnement and contnue. 3. Return the fnal partton and ts soluton.
10 14 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales The refnement process s now descrbed for the deal gas example. Fgure 6 llustrates 3 steps n the abstracton process for the example n fgure 4. The ntal partton s shown at the top left. Let us assume that the state has the hghest varance n utlty wth respect to ts neghbors, q1, q2, q3, q4; and that Vol. s the dmenson wth the hghest dfference n utlty. A bsecton s then nserted to splt state n the new states and q (Step 1, top q0 q0 1 rght). The remanng states are relabeled to preserve a progressve numberng. After solvng the new MDP and verfyng that the polcy has changed, the bsecton s accepted and the algorthm proceeds to Step 2 (bottom-left). In ths case q1 s the state wth the hghest varance and t s splt on the Temp. dmenson whch s the dmenson wth the hghest dfference n utlty. However, after solvng the new MDP, the polcy does not change, so the dvson s canceled and t returns to the prevous partton, as depcted n the bottom-rght of fgure 6. Thus, ths state wll be marked and not consdered for subsequent parttons. q 0 Fg. 6. An example of the qualtatve refnement process for a two-dmenson state space. Intal partton: the ntal soluton obtaned before, for each q state ts value and optmal acton are shown. Step 1: the state wth hghest varance s bsected along the dmenson wth hghest varance, Vol. Note that the q states have been q 0 q 1 renamed. Step 2: now s parttoned along the Temp. dmenson. Step 3: as there s no change n polcy for the partton n Step 2, t returns to the partton n Step 1 Next we descrbe how the qualtatve MDP approach was appled n the power plant doman.
11 6 AsstO: A Recommender System for Power Plants AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton 15 AsstO s an ntellgent assstant that provdes useful recommendatons for tranng and on-lne assstance n the power plant doman. AsstO was bult specally to demonstrate the potental of the qualtatve MDP approach to solve plannng problems n complex domans. The recommender system s coupled to a power plant smulator capable to partally reproduce the operaton of a combned cycle power plant (CCPP), n partcular, the steam generaton process (HRSG), descrbed n secton 2. The smulator (fgure 7) s provded wth controls for settng up the power condtons n the gas and steam turbnes (nomnal load, medum load, mnmum load, hot standby condton, low speed, and start-up). It ncludes an operaton panel to confgure load demands, unt trps, shutdowns, and other hgh level operatons n dfferent plant subsystems. It also ncludes a vsualzaton tool for trackng the behavor n tme of a set of varables selected by the user, and a functon for recordng hstorcal data. Fg. 7. A screen shot of human computer nterface of the steam generaton smulator. The smulator provdes controls, an operaton panel, and data vsualzaton tools 6.1. General Archtecture The AsstO recommender system s composed by a decson model base, a smulaton data base, and the followng subsystems: ) data management, ) model management, ) plannng subsystem, and v) user nterface. Fgure 8 shows AsstO s general archtecture. The smulaton data base allocates the process sgnals generated by the smulator (outputs), and the control sgnals (nputs) sent by an nstructor to set up a specfc electrc load or falure condton n the process. On the other hand, the decson model base stores the qualtatve MDP model of the process and ts soluton n form of a polcy. That s, t has the optmal acton that wll be recommended to the operator for every state of the plant subprocess consdered. The polcy s based on a factored representaton of the plant q-states (see secton 4.2), and represented n the form of algebrac decson dagrams (ADDs) [14].
12 16 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales Fg. 8. AsstO s general archtecture. Gven a state of the plant obtaned from the smulaton data base, the plannng subsystem queres a recommendaton to the decson model base. Ths recommendaton s presented to the operator va the user nterface The data management subsystem s composed by a set of tools for data admnstraton and analyss. The model management subsystem manpulates the transton and reward models, and the utlty and polcy functons stored n the decson model base. The transton model management system was mplemented n Elvra [8] (whch also was adapted to compute Dynamc Bayesan Networks), and the reward model management system usng Weka [22]. The management of the polcy and utlty models s carred out usng SPUDD [14], whch ncludes model query and prntng capabltes. The plannng subsystem n AsstO s also based on SPUDD [14], whch mplements a very effcent verson of the value teraton algorthm for MDPs as nference method. The plannng subsystem frst approxmates the decson models usng the data allocated n the smulaton data base. Transton and reward models are respectvely learned usng the K2 [9] algorthm avalable n Elvra, and the C4.5 algorthm avalable n Weka (J4.8) [20]. Then t uses these models and ts nference algorthms to obtan an optmal polcy, from whch the recommendatons that wll be gven to the operator are obtaned. The resultng transton and reward functons, and polcy and utlty functons are then stored n the decson model base. The plannng subsystem transforms the contnuous plant state nto the qualtatve representaton descrbed n sectons 4 and 5 for problem specfcaton and polcy query purposes. The user nterface provdes the communcaton wth the envronment. In ths case, the power plant smulator s the envronment, and the operator s the actor that executes the recommendatons that modfy the envronment. The user nterface provdes controls for command executon, load selecton, falure smulaton, and recommendaton dsplay. Ths module, whch can also be used as a supervson console, ncludes the controls for random exploraton and system samplng for the learnng purposes descrbed n secton 4.3. It also provdes a graphcal nterface to observe how fast the correct executon of recommendatons mpact on the plant operaton. The man screen of the user nterface s shown n Fgure 9. Currently AsstO s used for operator tranng. In a tranng sesson, the plannng subsystem obtans the plant q- state from the smulaton data base. Then t queres the polcy functon for the current q-state n the model base to obtan a recommendaton. Both, current q-state and recommendaton are shown graphcally to the operator through the user nterface, who fnally decdes whether or not to execute the recommended command. The sequental executon of these recommendatons wll help the operator to get the plant to an optmal operatng condton.
13 AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton 17 Fg. 9. User Interface. It s the graphcal lnk between the recommender system and the operator. It ncludes supervson features, problem specfcaton utltes, dsplay console, and manual control capabltes 6.2. Expermental Results We used AsstO to run two sets of experments wth dfferent complextes. In the frst set of experments, we specfed a 5-acton hybrd problem wth 5 varables ( Fms, Ffw, Pd, g, d ). We also defned a smple bnary reward functon based on the safety parameters of the drum ( and ). The relatonshp between ther values and the reward receved can be seen n fgure 10 (left). Central black squares denote safe states (desred operaton regons), and whte zones represent non-rewarded zones (ndfferent regons). To learn the model and the ntal abstracton, samples of the system dynamcs were gathered usng smulaton. Black dots n fgure 10 (rght) represent sampled states wth postve reward, red (gray) dots have no reward, and whte zones were smply not explored. Fgure 10 (left) shows the state partton and polcy found (arrows) by the learnng system. For ths smple example, although the resultng polcy s not very detaled ( qstates are qute large), t drects the plant to the optmal operatng condton (black regon n the mddle). When analyzed by an expert operator, ths control strategy s nearoptmal n most of the abstract states. We solved the same problem but addng two extra varables, the poston for valves msv and fwv, and usng 9 actons (all the combnatons of open-close valves msv and fwv ). We also redefned the reward functon to maxmze power generaton, g, under safe condtons n the drum. Although the problem ncreased sgnfcantly n complexty, the polcy obtaned s smoother than the 5-acton smple verson presented above. To gve an dea about the computatonal savng, for a fne dscretzaton (15,200 dscrete states) ths problem was solved n seconds, whle our abstract representaton (40 q-states) took only seconds. In both cases, the solutons were found usng the SPUDD system [14]. In summary, the frst experment shows that the proposed approach obtans approxmately optmal polces; whle the second experment demonstrates a sgnfcant reducton n the soluton tme n comparson to a fne dscretzaton of the state space. Pd Fms
14 18 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales Fg. 10. Process control problem. Left: qualtatve state partton n terms of the Steam Flow and Drum Pressure. For each q state t shows the optmal acton (arrows). The black regon represents the desred operatng state (hgh reward). Rght: an mage of the exploraton trace, where black dots represent sampled states wth postve reward, red dots (gray) are sampled states wth no reward, and whte regons are unexplored zones 7 Conclusons and Future Work In ths paper, we presented a novel and practcal model-based learnng approach wth teratve refnement for solvng contnuous and hybrd Markov decson processes. In the frst phase we use an exploraton strategy of the envronment and a machne learnng approach to nduce an ntal state abstracton. We then follow a refnement process to mprove the ntal abstracton by performng local tests on the varance of utlty values. Our approach creates sgnfcant reductons n space and tme allowng to solve effcently contnuous and hybrd problems. We tested our method n a power plant doman usng AsstO, showng that ths approach can be appled to complex domans where a smple dcretzaton approach s not feasble or computatonally too expensve. Snce AsstO s amed ether for operaton assstance and operator tranng, we are currently developng an extra module that explans the recommended commands generated by the plannng subsystem and, provdes, after a bad decson, the reason why a recommendaton should have been followed. We plan to extend the plannng subsystem to support partally observable MDPs, and use the AsstO archtecture n other power plant applcatons. As future research work we wll lke to mprove our refnement strategy to select a better segmentaton of the abstract states and consder alternatve search strateges. We also plan to test our approach n other domans. Acknowledgments Ths work was supported jontly by the Insttuto de Investgacones Eléctrcas, Mexco and CONACYT Project No References 1. J. Baum and A. E. Ncholson. Dynamc non-unform abstractons for approxmate plannng n large structured stochastc domans. In PRICAI 98 Proceedngs of the 5th Pacfc Rm Internatonal Conference on Artfcal Intellgence, pages , Sngapore, R.E. Bellman. Dynamc Programmng. Prnceton U. Press, Prnceton, N.J., D. P. Bertsekas. A counter-example to temporal dfference learnng. Neural Computaton, 1994.
15 AsstO: A Qualtatve MDP-based Recommender System for Power Plant Operaton D. P. Bertsekas and J.N. Tstskls. Neuro-dynamc programmng. Athena Scences, B. Bonet and J. Pearl. Qualtatve MDPs and POMDPs: An order-of-magntude approach. In Proceedngs of the 18th Conf. on Uncertanty n AI, UAI-02, pages 61 68, Edmonton, Canada, C. Boutler, T. Dean, and S. Hanks. Decson-theoretc plannng: structural assumptons and computatonal leverage. Journal of AI Research, 11:1 94, C. Boutler, M. Goldszmdt, and B. Sabata. Contnuous value functon approxmaton for sequental bddng polces. In Kathryn Laskey and Henr Prade, edtors, Proceedngs of the 15th Conference on Uncertanty n Artfcal Intellgence (UAI-99), pages Morgan Kaufmann Publshers, San Francsco, Calforna, USA, Elvra Consortum. Elvra: an envronment for creatng and usng probablstc graphcal models. Techncal report, U. de Granada, Span, G. F. Cooper and E. Herskovts. A bayesan method for the nducton of probablstc networks from data. Machne Learnng, T. Dean and K. Kanazawa. A model for reasonng about persstence and causaton. Computatonal Intellgence, 5: , Z. Feng, R. Dearden, N. Meuleau, and R. Washngton. Dynamc programmng for structured contnuous Markov decson problems. In Proc. of the 20th Conf. on Uncertanty n AI (UAI-2004). Banff, Canada, C. Guestrn, M. Hauskrecht, and B. Kveton. Solvng factored MDPs wth contnuous and dscrete varables. In Twenteth Conference on Uncertanty n Artfcal Intellgence (UAI 2004), Banff, Canada, M. Hauskrecht and B. Kveton. Lnear program approxmaton for factored contnuous-state Markov decson processes. In In Advances n Neural Informaton Processng Systems NIPS(03), pages , J. Hoey, R. St-Aubn, A. Hu, and C. Boutler. SPUDD: Stochastc plannng usng decson dagrams. In Proc. of the 15th Conf. on Uncertanty n AI, UAI-99, pages , L. L and M. L. Lttman. Lazy approxmaton for solvng contnuous fnte-horzon MDPs. In AAAI-05, pages , Pttsburgh, PA, R. Munos and A. Moore. Varable resoluton dscretzaton for hgh-accuracy solutons of optmal control problems. In Thomas Dean, edtor, Proceedngs of the 16th Internatonal Jont Conference on Artfcal Intellgence (IJCAI-99), pages Morgan Kaufmann Publshers, San Francsco, Calforna, USA, August J. Pneau, G. Gordon, and S. Thrun. Polcy-contngent abstracton for robust control. In Proc. of the 19th Conf. on Uncertanty n AI, UAI-03, pages , M. L. Puterman. Markov Decson Processes. Wley, New York, J.R. Qunlan. Inducton of decson trees. Machne Learnng, 1(1):81 106, J.R. Qunlan. C4.5: Programs for machne learnng. Morgan Kaufmann, San Francsco, Calf., USA., R. S. Sutton and A.G. Barto. Renforcement Learnng: An Introducton. MIT Press, I.H. Wtten. Data Mnng: Practcal Machne Learnng Tools and Technques wth Java Implementatons, 2nd Ed. Morgan Kaufmann, USA, Alberto Reyes s a researcher at the Electrcal Research Insttute n Méxco (IIE) and part-tme professor at Insttuto Tecnológco y de Estudos Superores de Monterrey (ITESM) campus Méxco Cty. Hs research nterests nclude decson-theoretc plannng, machne learnng, and ther applcatons n robotcs and ntellgent assstants for ndustry. He receved a PhD n Computer Scence from ITESM campus Cuernavaca.
16 20 Alberto Reyes, L. Enrque Sucar and Eduardo F. Morales L. Enrque Sucar s a Senor Researcher at the Natonal Insttute for Astrophyscs, Optcs and Electroncs (INAOE) n Puebla, Mexco. Hs research nterests nclude reasonng under uncertanty n artfcal ntellgence, moble robotcs and computer vson. He receved a B.S. n Electroncs and Communcatons Engneerng from the Monterrey Insttute of Technology, n Monterrey, Mexco, a M.Sc. n Electrcal Engneerng from Stanford Unversty, and a Ph.D. degree n Computer Scence from Imperal College, London. He has been presdent of the Mexcan AI Socety, member of the Advsory Commttee for IJCAI, and member of the Natonal Research System and the Mexcan Academy of Scence. Eduardo F. Morales s a Senor Researcher at the Natonal Insttute for Astrophyscs, Optcs and Electroncs (INAOE) n Puebla, Mexco. Hs research nterests nclude machne learnng and moble robotcs. He receved a B.Sc. degree n Physcs Engneerng from Unverdad Autonoma Metropoltana, n Mexco Cty, an M.Sc. n Artfcal Intellgence from Ednburgh Unversty, Scotland, and a Ph.D. degree n Computer Scence from The Turng Insttute- Strathclyde Unversty, n Glasgow. He s a member of the Natonal Research System n Mexco.
Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION
Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble
The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis
The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna [email protected] Abstract.
Solving Factored MDPs with Continuous and Discrete Variables
Solvng Factored MPs wth Contnuous and screte Varables Carlos Guestrn Berkeley Research Center Intel Corporaton Mlos Hauskrecht epartment of Computer Scence Unversty of Pttsburgh Branslav Kveton Intellgent
Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College
Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure
A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm
Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel
On the Optimal Control of a Cascade of Hydro-Electric Power Stations
On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;
benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).
REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or
POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and
POLYSA: A Polynomal Algorthm for Non-bnary Constrant Satsfacton Problems wth and Mguel A. Saldo, Federco Barber Dpto. Sstemas Informátcos y Computacón Unversdad Poltécnca de Valenca, Camno de Vera s/n
CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements
Lecture 3 Densty estmaton Mlos Hauskrecht [email protected] 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there
Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network
700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School
Efficient Reinforcement Learning in Factored MDPs
Effcent Renforcement Learnng n Factored MDPs Mchael Kearns AT&T Labs [email protected] Daphne Koller Stanford Unversty [email protected] Abstract We present a provably effcent and near-optmal
An Interest-Oriented Network Evolution Mechanism for Online Communities
An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne
IMPACT ANALYSIS OF A CELLULAR PHONE
4 th ASA & μeta Internatonal Conference IMPACT AALYSIS OF A CELLULAR PHOE We Lu, 2 Hongy L Bejng FEAonlne Engneerng Co.,Ltd. Bejng, Chna ABSTRACT Drop test smulaton plays an mportant role n nvestgatng
Recurrence. 1 Definitions and main statements
Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.
A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña
Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION
Credit Limit Optimization (CLO) for Credit Cards
Credt Lmt Optmzaton (CLO) for Credt Cards Vay S. Desa CSCC IX, Ednburgh September 8, 2005 Copyrght 2003, SAS Insttute Inc. All rghts reserved. SAS Propretary Agenda Background Tradtonal approaches to credt
Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts
Power-of-wo Polces for Sngle- Warehouse Mult-Retaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)
Luby s Alg. for Maximal Independent Sets using Pairwise Independence
Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent
Dynamic Pricing for Smart Grid with Reinforcement Learning
Dynamc Prcng for Smart Grd wth Renforcement Learnng Byung-Gook Km, Yu Zhang, Mhaela van der Schaar, and Jang-Won Lee Samsung Electroncs, Suwon, Korea Department of Electrcal Engneerng, UCLA, Los Angeles,
Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending
Proceedngs of 2012 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 25 (2012) (2012) IACSIT Press, Sngapore Bayesan Network Based Causal Relatonshp Identfcaton and Fundng Success
Project Networks With Mixed-Time Constraints
Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa
What is Candidate Sampling
What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble
) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance
Calbraton Method Instances of the Cell class (one nstance for each FMS cell) contan ADC raw data and methods assocated wth each partcular FMS cell. The calbraton method ncludes event selecton (Class Cell
An Alternative Way to Measure Private Equity Performance
An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate
Improved SVM in Cloud Computing Information Mining
Internatonal Journal of Grd Dstrbuton Computng Vol.8, No.1 (015), pp.33-40 http://dx.do.org/10.1457/jgdc.015.8.1.04 Improved n Cloud Computng Informaton Mnng Lvshuhong (ZhengDe polytechnc college JangSu
"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *
Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC
The OC Curve of Attribute Acceptance Plans
The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4
ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING
ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,
An Enhanced Super-Resolution System with Improved Image Registration, Automatic Image Selection, and Image Enhancement
An Enhanced Super-Resoluton System wth Improved Image Regstraton, Automatc Image Selecton, and Image Enhancement Yu-Chuan Kuo ( ), Chen-Yu Chen ( ), and Chou-Shann Fuh ( ) Department of Computer Scence
Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..
Forecasting the Direction and Strength of Stock Market Movement
Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye [email protected] [email protected] [email protected] Abstract - Stock market s one of the most complcated systems
Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications
CMSC828G Prncples of Data Mnng Lecture #9 Today s Readng: HMS, chapter 9 Today s Lecture: Descrptve Modelng Clusterng Algorthms Descrptve Models model presents the man features of the data, a global summary
The Greedy Method. Introduction. 0/1 Knapsack Problem
The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton
Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy
Fnancal Tme Seres Analyss Patrck McSharry [email protected] www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton
Support Vector Machines
Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada [email protected] Abstract Ths s a note to explan support vector machnes.
Can Auto Liability Insurance Purchases Signal Risk Attitude?
Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang
LIFETIME INCOME OPTIONS
LIFETIME INCOME OPTIONS May 2011 by: Marca S. Wagner, Esq. The Wagner Law Group A Professonal Corporaton 99 Summer Street, 13 th Floor Boston, MA 02110 Tel: (617) 357-5200 Fax: (617) 357-5250 www.ersa-lawyers.com
RELIABILITY, RISK AND AVAILABILITY ANLYSIS OF A CONTAINER GANTRY CRANE ABSTRACT
Kolowrock Krzysztof Joanna oszynska MODELLING ENVIRONMENT AND INFRATRUCTURE INFLUENCE ON RELIABILITY AND OPERATION RT&A # () (Vol.) March RELIABILITY RIK AND AVAILABILITY ANLYI OF A CONTAINER GANTRY CRANE
A DATA MINING APPLICATION IN A STUDENT DATABASE
JOURNAL OF AERONAUTICS AND SPACE TECHNOLOGIES JULY 005 VOLUME NUMBER (53-57) A DATA MINING APPLICATION IN A STUDENT DATABASE Şenol Zafer ERDOĞAN Maltepe Ünversty Faculty of Engneerng Büyükbakkalköy-Istanbul
Activity Scheduling for Cost-Time Investment Optimization in Project Management
PROJECT MANAGEMENT 4 th Internatonal Conference on Industral Engneerng and Industral Management XIV Congreso de Ingenería de Organzacón Donosta- San Sebastán, September 8 th -10 th 010 Actvty Schedulng
PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12
14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed
A Secure Password-Authenticated Key Agreement Using Smart Cards
A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,
8 Algorithm for Binary Searching in Trees
8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the
1 Example 1: Axis-aligned rectangles
COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton
1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)
6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes
Extending Probabilistic Dynamic Epistemic Logic
Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set
A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression
Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,
Automated information technology for ionosphere monitoring of low-orbit navigation satellite signals
Automated nformaton technology for onosphere montorng of low-orbt navgaton satellte sgnals Alexander Romanov, Sergey Trusov and Alexey Romanov Federal State Untary Enterprse Russan Insttute of Space Devce
Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School
Robust Desgn of Publc Storage Warehouses Yemng (Yale) Gong EMLYON Busness School Rene de Koster Rotterdam school of management, Erasmus Unversty Abstract We apply robust optmzaton and revenue management
8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by
6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng
Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,
Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification
Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson
How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence
1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh
Damage detection in composite laminates using coin-tap method
Damage detecton n composte lamnates usng con-tap method S.J. Km Korea Aerospace Research Insttute, 45 Eoeun-Dong, Youseong-Gu, 35-333 Daejeon, Republc of Korea [email protected] 45 The con-tap test has the
Gender Classification for Real-Time Audience Analysis System
Gender Classfcaton for Real-Tme Audence Analyss System Vladmr Khryashchev, Lev Shmaglt, Andrey Shemyakov, Anton Lebedev Yaroslavl State Unversty Yaroslavl, Russa [email protected], [email protected], [email protected],
SCHEDULING OF CONSTRUCTION PROJECTS BY MEANS OF EVOLUTIONARY ALGORITHMS
SCHEDULING OF CONSTRUCTION PROJECTS BY MEANS OF EVOLUTIONARY ALGORITHMS Magdalena Rogalska 1, Wocech Bożeko 2,Zdzsław Heduck 3, 1 Lubln Unversty of Technology, 2- Lubln, Nadbystrzycka 4., Poland. E-mal:[email protected]
Single and multiple stage classifiers implementing logistic discrimination
Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,
Chapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT
Chapter 4 ECOOMIC DISATCH AD UIT COMMITMET ITRODUCTIO A power system has several power plants. Each power plant has several generatng unts. At any pont of tme, the total load n the system s met by the
Research Article QoS and Energy Aware Cooperative Routing Protocol for Wildfire Monitoring Wireless Sensor Networks
The Scentfc World Journal Volume 3, Artcle ID 43796, pages http://dx.do.org/.55/3/43796 Research Artcle QoS and Energy Aware Cooperatve Routng Protocol for Wldfre Montorng Wreless Sensor Networks Mohamed
Inter-Ing 2007. INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007.
Inter-Ing 2007 INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007. UNCERTAINTY REGION SIMULATION FOR A SERIAL ROBOT STRUCTURE MARIUS SEBASTIAN
Frequency Selective IQ Phase and IQ Amplitude Imbalance Adjustments for OFDM Direct Conversion Transmitters
Frequency Selectve IQ Phase and IQ Ampltude Imbalance Adjustments for OFDM Drect Converson ransmtters Edmund Coersmeer, Ernst Zelnsk Noka, Meesmannstrasse 103, 44807 Bochum, Germany [email protected],
INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS
21 22 September 2007, BULGARIA 119 Proceedngs of the Internatonal Conference on Informaton Technologes (InfoTech-2007) 21 st 22 nd September 2007, Bulgara vol. 2 INVESTIGATION OF VEHICULAR USERS FAIRNESS
Ant Colony Optimization for Economic Generator Scheduling and Load Dispatch
Proceedngs of the th WSEAS Int. Conf. on EVOLUTIONARY COMPUTING, Lsbon, Portugal, June 1-18, 5 (pp17-175) Ant Colony Optmzaton for Economc Generator Schedulng and Load Dspatch K. S. Swarup Abstract Feasblty
An MILP model for planning of batch plants operating in a campaign-mode
An MILP model for plannng of batch plants operatng n a campagn-mode Yanna Fumero Insttuto de Desarrollo y Dseño CONICET UTN [email protected] Gabrela Corsano Insttuto de Desarrollo y Dseño
Distributed Multi-Target Tracking In A Self-Configuring Camera Network
Dstrbuted Mult-Target Trackng In A Self-Confgurng Camera Network Crstan Soto, B Song, Amt K. Roy-Chowdhury Department of Electrcal Engneerng Unversty of Calforna, Rversde {cwlder,bsong,amtrc}@ee.ucr.edu
ECE544NA Final Project: Robust Machine Learning Hardware via Classifier Ensemble
1 ECE544NA Fnal Project: Robust Machne Learnng Hardware va Classfer Ensemble Sa Zhang, [email protected] Dept. of Electr. & Comput. Eng., Unv. of Illnos at Urbana-Champagn, Urbana, IL, USA Abstract In
J. Parallel Distrib. Comput.
J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n
APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT
APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT Toshhko Oda (1), Kochro Iwaoka (2) (1), (2) Infrastructure Systems Busness Unt, Panasonc System Networks Co., Ltd. Saedo-cho
2008/8. An integrated model for warehouse and inventory planning. Géraldine Strack and Yves Pochet
2008/8 An ntegrated model for warehouse and nventory plannng Géraldne Strack and Yves Pochet CORE Voe du Roman Pays 34 B-1348 Louvan-la-Neuve, Belgum. Tel (32 10) 47 43 04 Fax (32 10) 47 43 01 E-mal: [email protected]
Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem
INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME, ISSUE, FEBRUARY ISSN 77-866 Logcal Development Of Vogel s Approxmaton Method (LD- An Approach To Fnd Basc Feasble Soluton Of Transportaton
RequIn, a tool for fast web traffic inference
RequIn, a tool for fast web traffc nference Olver aul, Jean Etenne Kba GET/INT, LOR Department 9 rue Charles Fourer 90 Evry, France [email protected], [email protected] Abstract As networked
Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic
Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange
Software project management with GAs
Informaton Scences 177 (27) 238 241 www.elsever.com/locate/ns Software project management wth GAs Enrque Alba *, J. Francsco Chcano Unversty of Málaga, Grupo GISUM, Departamento de Lenguajes y Cencas de
Fuzzy Set Approach To Asymmetrical Load Balancing In Distribution Networks
Fuzzy Set Approach To Asymmetrcal Load Balancng n Dstrbuton Networks Goran Majstrovc Energy nsttute Hrvoje Por Zagreb, Croata [email protected] Slavko Krajcar Faculty of electrcal engneerng and computng
Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits
Lnear Crcuts Analyss. Superposton, Theenn /Norton Equalent crcuts So far we hae explored tmendependent (resste) elements that are also lnear. A tmendependent elements s one for whch we can plot an / cure.
Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006
Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model
Research Article Enhanced Two-Step Method via Relaxed Order of α-satisfactory Degrees for Fuzzy Multiobjective Optimization
Hndaw Publshng Corporaton Mathematcal Problems n Engneerng Artcle ID 867836 pages http://dxdoorg/055/204/867836 Research Artcle Enhanced Two-Step Method va Relaxed Order of α-satsfactory Degrees for Fuzzy
Adaptive Fractal Image Coding in the Frequency Domain
PROCEEDINGS OF INTERNATIONAL WORKSHOP ON IMAGE PROCESSING: THEORY, METHODOLOGY, SYSTEMS AND APPLICATIONS 2-22 JUNE,1994 BUDAPEST,HUNGARY Adaptve Fractal Image Codng n the Frequency Doman K AI UWE BARTHEL
CALL ADMISSION CONTROL IN WIRELESS MULTIMEDIA NETWORKS
CALL ADMISSION CONTROL IN WIRELESS MULTIMEDIA NETWORKS Novella Bartoln 1, Imrch Chlamtac 2 1 Dpartmento d Informatca, Unverstà d Roma La Sapenza, Roma, Italy [email protected] 2 Center for Advanced
DEFINING %COMPLETE IN MICROSOFT PROJECT
CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,
Calculating the high frequency transmission line parameters of power cables
< ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,
Preventive Maintenance and Replacement Scheduling: Models and Algorithms
Preventve Mantenance and Replacement Schedulng: Models and Algorthms By Kamran S. Moghaddam B.S. Unversty of Tehran 200 M.S. Tehran Polytechnc 2003 A Dssertaton Proposal Submtted to the Faculty of the
Enabling P2P One-view Multi-party Video Conferencing
Enablng P2P One-vew Mult-party Vdeo Conferencng Yongxang Zhao, Yong Lu, Changja Chen, and JanYn Zhang Abstract Mult-Party Vdeo Conferencng (MPVC) facltates realtme group nteracton between users. Whle P2P
v a 1 b 1 i, a 2 b 2 i,..., a n b n i.
SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 455 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces we have studed thus far n the text are real vector spaces snce the scalars are
A Programming Model for the Cloud Platform
Internatonal Journal of Advanced Scence and Technology A Programmng Model for the Cloud Platform Xaodong Lu School of Computer Engneerng and Scence Shangha Unversty, Shangha 200072, Chna [email protected]
Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems
Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent
Learning from Multiple Outlooks
Learnng from Multple Outlooks Maayan Harel Department of Electrcal Engneerng, Technon, Hafa, Israel She Mannor Department of Electrcal Engneerng, Technon, Hafa, Israel [email protected] [email protected]
Methodology to Determine Relationships between Performance Factors in Hadoop Cloud Computing Applications
Methodology to Determne Relatonshps between Performance Factors n Hadoop Cloud Computng Applcatons Lus Eduardo Bautsta Vllalpando 1,2, Alan Aprl 1 and Alan Abran 1 1 Department of Software Engneerng and
Article received on April 23, 2007; accepted on October 18, 2007
A Renforcement Learnng Soluton for Allocatng Replcated Fragments n a Dstrbuted Database Una solucón de Aprendzae Reforzado para ubcar fragmentos replcados en Bases de Datos Dstrbudas Abel Rodríguez Morff
A Simple Approach to Clustering in Excel
A Smple Approach to Clusterng n Excel Aravnd H Center for Computatonal Engneerng and Networng Amrta Vshwa Vdyapeetham, Combatore, Inda C Rajgopal Center for Computatonal Engneerng and Networng Amrta Vshwa
Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)
Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton
Mining Multiple Large Data Sources
The Internatonal Arab Journal of Informaton Technology, Vol. 7, No. 3, July 2 24 Mnng Multple Large Data Sources Anmesh Adhkar, Pralhad Ramachandrarao 2, Bhanu Prasad 3, and Jhml Adhkar 4 Department of
Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008
Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn
COMPUTER SUPPORT OF SEMANTIC TEXT ANALYSIS OF A TECHNICAL SPECIFICATION ON DESIGNING SOFTWARE. Alla Zaboleeva-Zotova, Yulia Orlova
Internatonal Book Seres "Informaton Scence and Computng" 29 COMPUTE SUPPOT O SEMANTIC TEXT ANALYSIS O A TECHNICAL SPECIICATION ON DESIGNING SOTWAE Alla Zaboleeva-Zotova, Yula Orlova Abstract: The gven
