Stochastic Bandits with Side Observations on Networks

Size: px
Start display at page:

Download "Stochastic Bandits with Side Observations on Networks"

Transcription

1 Stochastc Bandts wth Sde Observatons on Networks Swapna Buccapatnam, Atlla Erylmaz Department of ECE The Oho State Unversty Columbus, OH buccapat@eceosuedu, erylmaz@osuedu Ness B Shroff Departments of ECE and CSE The Oho State Unversty Columbus, OH shroff@osuedu ABSTRACT We study the stochastc mult-armed bandt (MAB problem n the presence of sde-observatons across actons In our model, choosng an acton provdes addtonal sde observatons for a subset of the remanng actons One example of ths model occurs n the problem of targetng users n onlne socal networks where users respond to ther frends s actvty, thus provdng nformaton about each other s preferences Our contrbutons are as follows: We derve an asymptotc (wth respect to tme lower bound (as a functon of the network structure on the regret (loss of any unformly good polcy that acheves the maxmum long term average reward We propose two polces - a randomzed polcy and a polcy based on the well-known upper confdence bound (UCB polces, both of whch explore each acton at a rate that s a functon of ts network poston We show that these polces acheve the asymptotc lower bound on the regret up to a multplcatve factor ndependent of network structure The upper bound guarantees on the regret of these polces are better than those of exstng polces Fnally, we use numercal examples on a real-world socal network to demonstrate the sgnfcant benefts obtaned by our polces aganst other exstng polces Categores and Subject Descrptors H Models and Prncples: Mscellaneous; I Artfcal Intellgence: Mscellaneous Keywords Multarmed bandts; Sde observatons; Socal networks INTRODUCTION Mult-armed bandt (MAB problems have receved renewed nterest over the past decade because of the emergence of content recommendaton, onlne advertsng, and socal networks In the classcal MAB settng, at each tme, a polcy must choose an acton from a set of K actons wth Permsson to make dgtal or hard copes of all or part of ths work for personal or classroom use s granted wthout fee provded that copes are not made or dstrbuted for proft or commercal advantage and that copes bear ths notce and the full ctaton on the frst page Copyrghts for components of ths work owned by others than ACM must be honored Abstractng wth credt s permtted To copy otherwse, or republsh, to post on servers or to redstrbute to lsts, requres pror specfc permsson and/or a fee Request permssons from Permssons@acmorg SIGMETRICS 4, June 6 0, 04, Austn, Texas, USA Copyrght 04 ACM /4/06 $500 unknown probablty dstrbutons Choosng an acton at tme t gves a random reward X (t drawn from the dstrbuton of acton The regret of any polcy s defned as the dfference between the total reward obtaned from the acton wth the hghest average reward and the gven polcy s total reward The goal s to fnd polces that mnmze the expected regret over a gven tme horzon In our work, we consder an mportant MAB settng, smlar to that of and 6, where choosng an acton not only generates a reward from acton, but also reveals observatons for a subset of the remanng actons An example of such a scenaro s as follows: a decson maker must choose one user at each tme n an onlne socal network (FaceBook, etc to offer a promoton 6 Each tme the decson maker offers a promoton to a user, he also has an opportunty to survey the user s neghbors n the network regardng ther potental nterest n a smlar offer (see Fgure Users are found to be more responsve to such surveys usng socal network nformaton compared to generc surveys 3 and ths effect can be leveraged to construct sde-observatons In ths example, choosng an acton n the mult-armed bandt problem corresponds to choosng a user n the network and sde-observatons across actons are captured by the lnks n the socal network Another example s when the actons n the MAB problem are advertsements - the decson maker constructs a graph of dfferent vacaton places (Hawa, Carbbean, Pars, etc, where lnks capture smlartes between dfferent places When a customer shows nterest n one of the places, he s also asked to provde hs opnon about the neghborng places n the graph For the settng of sde-observatons, the authors n consder adversaral bandts, whle 6 consders stochastc bandts as n our current work In 6, the authors propose modfed upper-confdence bound (UCB based polces and show that the regret of these polces s at most O( χ(glog(t, where χ(g s the clque partton number (see Defnton of the sde-observaton network G(K However, t s possble to acheve a lower regret For example, n the star network wth one central acton lnked to all other actons, explorng the central acton yelds suffcent exploraton for the rest of the network In ths case, the optmal regret s at most O(log(t, whle χ(g scalesaso(k for the star network Ths s possble when the onlne network has an addtonal survey feature that generates sde observatons For example, when user s offered a promoton, her neghbors may be quered as follows: User was recently offered a promoton Would you also be nterested n the offer?

2 Motvated by ths observaton, n our work, we am to characterze the asymptotc lower bound on the regret for a general stochastc mult-armed bandt problem n the presence of sde-observatons and nvestgate polces that acheve ths lower bound by takng the network structure nto account Our man contrbutons are as follows: We model the MAB problem n the presence of sdeobservatons and derve an asymptotc (wth respect to tme lower bound (as a functon of the network structure on the regret of any unformly good polcy whch acheves the maxmum long term average reward Ths lower bound s presented n terms of the optmal value of a lnear program (LP, namely, P Motvated by LP P, we propose and nvestgate the performance of a randomzed polcy, we call ɛ t-greedy- LP polcy, as well as an upper confdence bound based polcy, we call UCB-LP polcy Both of these polces explore each acton at a rate that s a functon of ts network poston We show that these polces are optmal n the sense that they acheve the asymptotc lower bound on the regret up to a multplcatve constant ndependent of the network structure under mld assumptons We also show that the upper bound on the regret of our polces scales as O(γ(Glog(t, where γ(g s the sze of the mnmum domnatng set of the network of actons We show that the regret performance of our polces can be strctly better than those proposed n 6 for some mportant network structures Fnally, we use numercal results on a socal network dataset obtaned from Flxster to emprcally compare the performance of our polces aganst those n 6 The model consdered n our work can be vewed as a frst step n the drecton of more general models of nterdependence across actons For ths model, we show that as the number of actons becomes large, sgnfcant benefts can be obtaned from polces that explctly take network structure nto account Whle ɛ t-greedy-lp polcy explores actons at a rate proportonal to ther network poston, ts exploraton s oblvous to the average rewards of the suboptmal actons On the other hand, UCB-LP polcy takes nto account both the upper confdence bounds on the mean rewards as well as network poston of dfferent actons at each tme The rest of the paper s organzed as follows In Secton, we brefly dscuss the related exstng lterature We ntroduce the model n Secton 3 and we present our man results n Sectons 4, 5, 6 Fnally, we present some numercal results n Secton 7 and the proofs of our theoretcal results are gven n Secton 8 We conclude our work n Secton 9 RELATED WORK The semnal work of 0 shows that the asymptotc lower bound on the regret of any unformly good polcy scales logarthmcally wth tme wth a multplcatve constant whch Flxster s a move recommendaton network wth a socal graph Datasets from ths network are made publcly avalable by authors of 9 at datasets/ s a functon of the dstrbutons of actons Further, the authors of 0 provde constructve polces called Upper Confdence Bound (UCB polces based on the concept of optmsm n the face of uncertanty that asymptotcally acheve the lower bound More recently, the authors n consder the case of bounded rewards and propose smpler samplemean based UCB polces and decreasng-ɛ t-greedy polcy that acheve logarthmc regret unformly over tme, rather than only asymptotcally as n the prevous works The UCB ndex of any acton at tme t ntroduced n s gven below: log(t x (t+ s, ( (t where x (t s the average reward and s (t s the total number of observatons avalable for acton at tme t The tradtonal mult-armed bandt polces ncur a regret that s lnear n the number of suboptmal arms Ths makes them unsutable n settngs such as content recommendaton, advertsng, etc, where the acton space s typcally very large To overcome ths dffculty, rcher models specfyng addtonal nformaton across reward dstrbutons of dfferent actons have been studed, such as dependent bandts 3, X -armed bandts 5, lnear bandts 4, contextual sde nformaton n bandt problems, etc More recently, and 6 propose to handle the large number of actons by assumng that choosng an acton reveals observatons for a larger set of actons The polces proposed n acheve the best possble regret n the adversaral settng (see 4 for a survey of adversaral MABs wth sde-observatons and the regret bounds of these polces are n terms of the ndependence number of the network For the settng of stochastc bandts wth sde-observatons, t can be easly shown that, except n the trval case where all actons have a neghborng acton whch s optmal, the regret due to any unformly good polcy s lower bounded by Ω(log(t A formal proof s gven n 6 Further, the authors n 6 propose two modfed UCB polces, namely, UCB-N and UCB-MaxN, and show that the regret of these polces s at most O( χ(glog(t, where χ(g s the clque partton number (see Defnton In our work, we consder the stochastc MAB problem wth sde-observatons smlar to 6 and characterze the asymptotc lower bound on the regret as a functon of the network structure for the stochastc MAB problem n the presence of sde-observatons Further, we propose two polces: ɛ t-greedy-lp polcy, UCB-LP polcy, whch acheve the asymptotc regret lower bound up to a multplcatve constant ndependent of the network structure Further, the regret of our polces s at most O(γ(Glog(t, where γ(g s the sze of the mnmum domnatng set of the network of actons Snce, γ(g χ(g for any network G, the regret obtaned by our polces can be better than that of UCB-N and UCB-MaxN proposed n 6 3 MODEL In ths secton, we formally defne the K-armed bandt problem n the presence of sde observatons across actons Let K {,,K} denote the set of actons A decson maker must choose an acton K at each tme t Let X (t denote the reward obtaned by the decson maker on choosng acton at tme t The random varable X (t has an unknown probablty dstrbuton F Let μ be the mean

3 of the random varable X (t We assume that {X (t,t 0} are d for each and {X (t, K}are ndependent for each tme t We further assume that the dstrbuton, F has a bounded support n 0,bforeach We let b for smplcty of exposton n our work Sde-observaton model : The actons K form nodes n anetworkg(k, represented by the adjacency matrx G g(, j,j K, where g(, j {0, } and g(, for all Let K be the set of neghbors of acton (ncludng, e, g(j,, j K Whle all the results n our work can be easly extended to drected networks, we assume that G(K s undrected for smplcty of notaton We assume that when the decson maker chooses an acton at tme t, he receves a reward X (t andalsoreceves observatons X j(t, for all j K such that EX j(t μ j In general, not all neghborng actons are equally responsve n provdng sde-observatons Ths scenaro can be modeled by assumng that each acton has a known probablty p of provdng sde-observatons when any of ts neghbors are actually chosen We let p forall for the sake of clarty Our results can be easly extended to the settng of mperfect sde-observatons wth p We wll dscuss ths extenson n Remark 5 n Secton 6 Fgure : At tme t, suppose the decson maker chooses user to offer a promoton He then receves a response X (t fromuser Usng the socal nterconnectons, he also observes responses X j(t andx k (t of s neghbors j and k Fgure llustrates the sde-observaton model for the example of targetng users n an onlne socal network Such sde observatons are made possble n settngs of onlne socal networks lke Facebook by surveyng or trackng a user s neghbors reactons (lkes, dslkes, no opnon, etc to the user s actvty Ths s possble when the onlne socal network has a survey or a lke/dslke ndcator that generates sde observatons For example, when user s offered a promoton, her neghbors may be quered as follows: User was recently offered a promoton Would you also be nterested n the offer? 3 Objectve: An allocaton strategy or polcy φ chooses the acton to be played at each tme Formally, φ s a sequence of random varables {φ(t,t 0}, where φ(t Ks the acton chosen by polcy φ at tme t Let X φ (t be the reward and sde-observatons obtaned by the polcy φ at tme t Then, the event {φ (t } belongs to the σ-feld generated by {φ(m, X φ (m,m t } 3 Snce, the neghbors do not have any nformaton on whether the user accepted the promoton, they act ndependently accordng to ther own preferences n answerng ths survey The network tself provdes a better way for surveyng and obtanng sde observatons Let T φ (t be the total number of tmes acton s chosenuptotmet by polcy φ For each acton, rewards are only obtaned when the acton s chosen by the polcy (sdeobservatons do not contrbute to the total reward Then, the regret of polcy φ at tme t for a fxed μ (μ,,μ K s defned by R φ μ(t μ t K K μ ET φ (t Δ ET φ (t, where Δ μ μ and μ max μ Henceforth, we drop K the superscrpt φ unless t s requred The objectve s to fnd polces that mnmze the rate at whch the regret grows as a functon of tme for every fxed network G(K We focus our nvestgaton on the class of unformly good polces 0 defned below: Unformly good polces: An allocaton rule φ s sad to be unformly good f for every fxed μ, the followng condton s satsfed as t : R μ(t o(t b, for every b>0 The above condton mples that unformly good polces acheve the optmal long term average reward of μ Next, we defne two structures that wll be useful later to bound the performance of allocaton strateges n terms of the network structure G(K Defnton A clque coverng C of a network G(K s a partton of all ts nodes nto sets C Csuch that the sub-network formed by each C s a clque Let χ(g be the smallest number of clques nto whch the nodes of the network G(K can be parttoned, also called the clque partton number Defnton A domnatng set D of a network G(K s such that every node n the network s ether n D or has at least one neghbor n D Let γ(g denote the sze of the mnmum domnatng set of network G(K, also called the domnaton number Note that γ(g χ(g for any network G(K In the next secton, we obtan an asymptotc lower bound on the regret of unformly good polces for the settng of MABs wth sde-observatons Ths lower bound s expressed as the optmal value of a lnear program (LP, where the constrants of the LP capture the connectvty of each acton n the network 4 REGRET LOWER BOUND In order to derve a lower bound on the regret, we need some mld regularty assumptons (Assumptons,, and 3 on the dstrbutons F that are smlar to the ones n 0 Let the probablty dstrbuton F have a unvarate densty functon f(x; θ wth unknown parameters θ Let D(θ σ denote the Kullback Lebler (KL dstance between dstrbutons wth densty functons f(x; θ andf(x; σ andwth means μ(θ and μ(σ respectvely Assumpton (Fnteness We assume that f( ; s such that 0 <D(θ σ < whenever μ(σ >μ(θ Assumpton (Contnuty For any ɛ>0 and θ, σ such that μ(σ >μ(θ, there exsts η>0 for whch D(θ σ D(θ ρ <ɛwhenever μ(σ <μ(ρ <μ(σ +η

4 Assumpton 3 (Denseness For each K, θ Θ where the set Θ satsfes: for all θ Θ and for all η>0, there exsts θ Θ such that μ(θ <μ(θ <μ(θ+η The followng proposton s obtaned usng Theorem n 0 It provdes an asymptotc lower bound on the regret of any unformly good polcy under the model descrbed n Secton 3: Proposton Suppose Assumptons,, and 3 hold Let U { : μ <μ } be the set of suboptmal actons Also, let Δ μ μ Recall that K s the set of neghbors of, ncludng, n the network G(K Then, under any unformly good polcy φ, the expected regret s asymptotcally bounded below as follows: lm nf t R μ(t cμ, ( log(t where c μ s the optmal value of the followng lnear program (LP P : P : mn U Δ w, subject to: w j, U, D(θ θ w 0, K Proof (Sketch Let S (t be the total number of observatons correspondng to acton avalable at tme t Then, by modfyng the proof of Theorem of 0, we have that, for U, lm nf t ES (t log(t D(θ θ An observaton s receved for acton whenever any acton n K s chosen Hence, S (t T j(t These two facts gve us the constrants n LP P See Secton 8 for the full proof The lnear program gven n P contans the graphcal nformaton that governs the lower bound However, t requres the knowledge of θ and θ, whch are unknown Ths motvates the constructon of the followng lnear program, LP P, whch preserves the graphcal structure whle elmnatng the dstrbutonal dependence on θ and θ P :mn K z subject to: z j, K, and z 0, K Let z (z K be the optmal soluton of LP P In Sectons 5 and 6, we use the above LP P to modfy the ɛ- greedy polcy n and UCB polcy n for the settng of sde-observatons We provde regret guarantees of these modfed polces n terms of the optmal value K z of LP P We note that the lnear program P s, n fact, the LP relaxaton of the mnmum domnatng set problem on network G(K Snce, any domnatng set of network G(K s a feasble soluton to the LP P, we have that the optmal value of the LP K z γ(g χ(g As we wll see n Remark, for a rch network structure, t s possble to have γ(g << χ(g In the next proposton, we provde a lower bound on c μ n Equaton ( usng the optmal soluton z (z K of LP P Proposton Let U { : μ < μ } be the set of suboptmal actons Let O { : μ μ } be the set of optmal actons Then, max U D(θ θ c μ + O z mn U D(θ θ c μ mn U Δ max U Δ K (3 Proof (Sketch Let I { U: K O } be the set of suboptmal actons wth neghbors n O Usng the optmal soluton of LP P, we construct a feasble soluton satsfyng constrants n LP P for actons n U\I In order to satsfy the constrants for actons n I O, we use z for all n O The feasble soluton constructed n ths way gves an upper bound on the optmal value of LP P n terms of the optmal value of LP P For the lower bound, any feasble soluton of P, n partcular z, can be used to construct a feasble soluton of P See Secton 8 for the full proof K z Θ(c μ completely captures the tme dependence of regret on network structure under the followng assumpton: Assumpton 4 The quanttes O, mn Δ, and U mn U D(θ θ are constants that are ndependent of network sze K Note that the constants n the above assumpton are unknown to the decson maker In the next secton, we propose the ɛ t-greedy-lp polcy whch acheves the regret lower bound of c μ log(t up to a multplcatve constant factor that s ndependent of the network structure and tme 5 EPSILON-GREEDY-LP POLICY Motvated by the LPs P and P, we propose a networkaware randomzed polcy called the ɛ t-greedy-lp polcy We provde an upper bound on the regret of ths polcy and show that t acheves the asymptotc lower bound up to a constant multpler, ndependent of network structure Let x (t be the emprcal average of observatons (rewards and sde-observatons combned avalable for acton up to tme t The ɛ t-greedy-lp polcy s descrbed n Algorthm The polcy conssts of two phases - explotaton and exploraton, where the exploraton probablty decreases as /t, smlar to the ɛ t-greedy polcy proposed n However, n our polcy, we choose the exploraton probablty for acton to be proportonal to z /t, where z s the optmal soluton of LP P, whle n the orgnal polcy n, the exploraton probablty s unform over all actons The followng proposton provdes performance guarantees on the ɛ t-greedy-lp polcy: Proposton 3 For 0 <d<mn U Δ, any c>0, and α>, the probablty wth whch a suboptmal acton s selected by the ɛ t-greedy-lp polcy, descrbed n Algorthm,

5 Algorthm : ɛ t-greedy-lp Input: c>0, 0 <d<, optmal soluton z of LP P for each tme t do ( Let ɛ(t mn, c K z and a argmax d t x(t K Wth probablty ɛ(t, pck acton φ(t a Wth probablty ɛ(t, pck acton φ(t wth probablty z for all K K z end for Update average rewards x v(t +, v K φ(t for all t>t c K z d ( c d t z + 4 d ( et t s at most c/α + δc αd ( et t cr/αd ( e t log, t (4 where r 3(α, and δ s the maxmum degree n the 8α network Proof (Sketch Snce z satsfes the constrants n LP P, there s suffcent exploraton wthn each suboptmal acton s neghborhood The proof s then a combnaton of ths fact and the proof of Theorem 3 n See Secton 8 for the full proof In the above proposton, for large enough c, we see that second and thrd terms are O(/t +ɛ forsomeɛ>0 Usng ths fact, the followng corollary bounds the expected regret of the ɛ t-greedy-lp polcy: Corollary Choose parameters c and d such that, 0 <d<mn Δ, and U c>max(αd /r, α, for any α> Then, the expected regret at tme t of the ɛ t-greedy-lp polcy descrbed n Algorthm s at most ( c Δ z d log(t+o(k, (5 U where the O(K term captures constants ndependent of tme but dependent on the network structure Remark Under Assumpton 4, we can see from Proposton and Corollary that, ɛ t-greedy-lp( algorthm s order optmal achevng the lower bound O z log(t O (c μ log(t as the network and tme scale Whle ɛ t-greedy-lp polcy s network aware, ts exploraton s oblvous to the average rewards of the sub-optmal actons Further, ts performance guarantees depend on the knowledge of mn U Δ, whch s the dfference between the best and the second best optmal actons On the other hand, the UCB-LP polcy proposed n the next secton s network-aware takng nto account the average rewards of suboptmal actons Ths could lead to better performance compared to ɛ t-greedy-lp polcy n certan stuatons, for example, when the hghly connected acton s also hghly suboptmal U 6 UCB-LP POLICY In ths secton we propose the UCB-LP polcy defned n Algorthm and obtan upper bounds on ts regret The UCB-LP polcy s based on the mproved UCB polcy proposed n, whch can be summarzed as follows: the polcy estmates the values of Δ n each round by a value Δ m whch s ntalzed to and halved n each round m By each round m, the polcy draws n(m observatons for each acton n the set of actons not elmnated by round m, where n(m sdetermnedby Δ m Then, t elmnates those actons whose UCB ndces perform poorly Our polcy dffers from the one n by accountng for the presence of sde-observatons - ths s acheved by choosng each acton accordng to the optmal soluton of LP P, whle ensurng that n(m observatons are avalable for each acton not elmnated by round m Algorthm : UCB-LP polcy Input: Set of actons K, tme horzon T, and optmal soluton z of LP P Intalzaton: Let Δ 0 :, S 0 : K, and B 0 : K for round m 0,,,, log T do e log(t Δ Acton Selecton: Let n(m : m Δ m If B m : choose the sngle acton n B m untl tme T Else If z B m Δ m : For each acton n S m, S m choose t z n(m n(m tmes Else For each acton n B m, choose t n(m n(m tmes Update the average rewards of all actons n B m Acton Elmnaton: To get B m+, delete all actons n B or whch log(t x Δ m (m+ max s xa(m log(t Δ m (m a B m s a(m, where x (m s the emprcal average reward of acton, and s (m s the total number of observatons for acton up to round m Reset: The set S m+ of actons wth neghbors n B m+ s gven as S m+ { : K B m+ } Note that B m+ S m+ Let Δ m+ Δ m end for The followng proposton provdes performance guarantees on the expected regret due to UCB-LP polcy: Proposton 4 For acton, defne round m as follows: { } m : mn m : Δ m < Δ

6 { Defne m mn m : z > K set B { U: m > m} :m >m } m+ and the Then, the expected regret due to the UCB-LP polcy descrbed n Algorthm s at most U\B Δ z 3 log(t ˆΔ ˆΔ + B 3 log(t Δ Δ + O(Kδ, (6 where ˆΔ max{ m+, mn K {Δ j}} and (z s the soluton of LP P δs the maxmum degree n the network The O(Kδ term captures constants ndependent of tme Further, under Assumpton 4, the regret s also at most ( O z log(t + O(Kδ, (7 K where (z entrely captures the tme dependence on network structure Proof (Sketch The log(t term n the regret follows from the fact that, wth hgh probablty, each suboptmal acton s elmnated (from the set B monorbeforethe frst round m such that Δ m < Δ / See Secton 8 for the full proof Next, we brefly descrbe the polces UCB-N and UCB- MaxN proposed n 6 In UCB-N polcy, at each tme, the acton wth the hghest UCB ndex (see ( s chosen smlar to UCB polcy n In UCB-MaxN polcy, at each tme t, the acton wth the hghest UCB ndex ( s dentfed and ts neghborng acton j K wth the hghest emprcal average reward at tme t s chosen Remark The regret upper bound of UCB-N polcy s nf C C C 8max C Δ mn C Δ log(t +O(K, where C s a clque coverng of the sub-network of suboptmal actons The regret upper bound for UCB-MaxN s the same as that for UCB-N wth an O( C term nstead of the tmenvarant O(K term We show a better regret performance for UCB-LP polcy and ɛ t-greedy-lp polces wth respect to the log(t term because K z γ(g χ(g However, the tme-nvarant term n our polces s O(K ando(kδ, whch can be worse than the tme-nvarant term O( C n UCB-MaxN To see the possble gap between γ(g and χ(g, consder an Erdos-Reny random graph G(K, p As noted n, as K, for any fxed p>0, γ(g satmosto(log(k whle χ(g satleasto(k/ log(k On the other hand, t has been shown 7 that for power law graphs, both γ(g and χ(g scale lnearly wth N, although γ(g hasalower slope We note that the socal network of nterest may or may not dsplay a power law behavor We fnd that the subgraphs of the Flxster network have a degree dstrbuton that s a straght lne on a log-log plot ndcatng a power law dstrbuton dsplay whle the authors n 5 show that the degree dstrbuton of the global Facebook network s not a straght lne on log-log plot Our numercal results n Secton 7 show that our polces outperform exstng polces even for the Flxster network Remark 3 All unformly good polces that gnore sdeobservatons ncur a regret that s at least Ω( U log(t 0, where U s the number of suboptmal actons Ths could be sgnfcantly hgher than the guarantees on the regret of both ɛ t-greedy-lp polcy and UCB-LP polcy for a rch network structure as dscussed n Remark Remark 4 Whle ɛ t-greedy-lp does not requre knowledge of the tme horzon T, UCB-LP polcy requres the knowledge of T UCB-LP polcy can be extended to the case of an unknown tme horzon smlar to the suggeston n Start wth T 0 and at end of each T l, set T l+ Tl The regret bound for ths case s expected to be smlar to the one n Proposton 4 Remark 5 In our work, we assumed that the sde observatons are always avalable However, n realty, sde observatons may only be obtaned sporadcally Suppose that when acton s chosen, sde-observatons are obtaned for each neghborng acton j wth a known probablty p j In ths case, Proposton holds wth the replacement of LP P wth LP P as follows: P : mn U Δ w, subject to: w + p \{} w j, U, D(θ θ w 0, K Both of our polces work for ths settng by changng the LP P to P as follows: P : mn K z subject to: z + p \{} z j, K, and z 0, K The regret bounds of our polces wll now depend on the optmal soluton of LP P 7 NUMERICAL RESULTS We consder the Flxster network dataset for the numercal evaluaton of our algorthms The authors n 9 collected ths socal network data, whch contans about mllon users and 4 mllon lnks We use graph clusterng 8 to dentfy two strongly clustered sub-networks of szes 000 and 000 nodes Both these sub-networks have a degree dstrbuton that s a straght lne on a log-log plot ndcatng a power law dstrbuton commonly observed n socal networks Our emprcal setup s as follows Each user n the network s offered a promoton at each tme, and accepts the promoton wth probablty μ 03, 09 The decson maker receves a random reward of f a user accepts the promoton or 0 reward otherwse μ s chosen unformly at random from 03, 08 and there are 50 randomly chosen users wth optmal μ 09 Fgures and 3 show the regret performance as a functon of tme for the two sub-networks of szes 000 and 000 respectvely For the ɛ t-greedy-lp polcy, we let c 5andd 0 For both networks, we see that our polces outperform the UCB-N and UCB-MaxN polces

7 We also observe that the mprovement obtaned by UCB-N polcy over the baselne UCB polcy s margnal Expected Regret 4 x UCB wth no sde observatons UCB N UCB MaxN UCB LP ε t greedy LP Tme Fgure : Regret of all the polces for a network of sze 000 Expected Regret x 05 0 UCB wth no sde observatons UCB N UCB MaxN UCB LP ε t greedy LP Tme Fgure 3: Regret of all the polces for a network of sze PROOFS In what follows, we gve the proofs of all propostons stated n the earler sectons These proofs make use of Lemmas,, and 3, and Proposton 5 gven n the Appendx Proof of Proposton Let U { : μ <μ } be the set of suboptmal actons Also, let Δ μ μ Recall that K s the set of neghbors of, ncludng, n the network G(K Also, T (t s the total number of tmes acton s chosen up to tme t by polcy φ Let S (t be the total number of observatons correspondng to acton avalable at tme t From Proposton 5 gven n the Appendx, we have, lm nf t ES (t log(t, U (8 D(θ θ An observaton s receved for acton whenever any acton n K s chosen Hence, S (t T j(t (9 Now, from Equatons (8 and (9, for each U, j K lm nf ET j(t t log(t D(θ θ (0 Usng (0, we get the constrants of LP P Further, we have from defnton of regret that, lm nf t R μ(t log(t lm nf t U Δ ET (t log(t The above equaton along wth the constrants of the LP P obtaned from (0 gves us the requred lower bound on regret Proof of Proposton Let I { U: K O } be the set of suboptmal actons wth neghbors n O Let (z K be the optmal soluton of LP P We wll frst prove the upper bound n Equaton 3 Usng the optmal soluton (w K of LP P, we construct a feasble soluton satsfyng constrants ( n LP P n the followng way: For actons U, let z max U D(θ θ w Then (z U satsfy constrants for all actons U\Ibecause w satsfy constrants of LP P In order to satsfy the constrants for actons n I O, we use z forall n O The feasble soluton constructed n ths way gves an upper bound on the optmal value of LP P Hence, z + O K z U U ( max U D(θ θ w + O max U D(θ θ mn U Δ Δ w + O max U D(θ θ c μ + O mn U Δ For the lower bound, any feasble soluton of P, n partcular z, can be used to construct a feasble soluton of P For z actons K, let w Then mn U D(θ θ (w K satsfes the constrants of LP P and hence gves an upper bound on ts optmal value Therefore, we have c μ U Δ w, U Δ z mn U D(θ K θ max U Δ z mn U D(θ K θ whch gves us the requred lower bound Proof of Proposton 3 Snce z satsfes the constrants n LP P, there s suffcent exploraton wthn each suboptmal acton s neghborhood The proof s then a combnaton of ths fact and the proof of Theorem 3 n Let X (t be the random varable denotng the sample mean of all observatons avalable for

8 acton at tme t Let X (t be the random varable denotng the sample mean of all observatons avalable for an optmal acton at tme t Let X,m denote the sample mean of m random varables drawn from the dstrbuton F Fx asuboptmalacton For some α>, defne m as follows, m t ɛ(m α j K z Let φ(t be the acton chosen by ɛ t-greedy-lp polcy at tme t Then, Pφ(t ɛ(tz +( ɛ(tp X (t X (t K z We also have that, P X (t X (t P X (t μ + Δ + P X (t μ Δ The analyss of both the terms n the rght hand sde of the above expresson s smlar Let S (R (t be the total number of observatons avalable for acton from the exploraton phase of the polcy up to tme t Let S (t be the total number of observatons avalable for acton up to tme t Hence, we have, t P X (t μ + Δ P S (t m; X (t μ + Δ t t m P S (t m X,m μ + Δ P S (t m X,m μ + Δ P e Δ X,m μ + Δ m (follows from Chernoff-Hoeffdng bound n Lemma P S (t m X,m μ + Δ + e Δ m Δ ( snce e ku k e km m m P P m+ S (R (t m X,m μ + Δ S (R (t m + d e d m + Δ e Δ m In the above, the last equaton follows from the fact that S (R (t, whch s the total number of observatons for acton avalable from the exploraton phase of the polcy up to tme t, s ndependent of the sample means of all actons Now, E S (R (t t ɛ(m z j j K z j K z t ɛ(m αm var S (R (t t ɛ(m j K z ɛ(m t ɛ(m j K z j K z E S (R (t Now, usng Bernsten s nequalty gven n Lemma, we have P S (R (t m P S (R (t E S (R (t (α m exp ( rm, where r 3(α Now, we wll obtan upper and lower 8α bounds on m For the upper bound, for any t > t c K z d, m α α α z j j K z t ɛ(m t + j K z α c K z j K z d ( e t δc αd log t, z j j K z + δc αd t mt + t mt + c j K z j d t where δ s the maxmum degree n the network In the above, K z δ because z, whch s due to the fact that (z K s the optmal soluton of LP P Next, for the lower bound, we use the fact that z for all because (z K satsfes the constrants of LP P Thus m α j K z α j K z c αd t mt + t ɛ(m t mt + t c j K z j d t c αd log t ( t et Hence, combnng the nequaltes above, P X (t μ + Δ m P S (R (t m + d e d m m exp ( rm + d e d m δc ( et cr/αd ( e t log + d ( et c/α αd t t t Now, smlarly for the optmal acton, we have, for all t>t P X (t μ Δ δc αd ( et cr/αd ( e t log t t + ( et c/α d t

9 Combnng everythng, we have for any suboptmal acton, for all t>t Pφ(t ( c d t z + 4 d ( et + δc αd Proof of Proposton 4 t ( et t c/α cr/αd ( e t log The proof technque n smlar to that n We wll analyze the regret by condtonng on two dsjont events The frst event s that each suboptmal acton a s elmnated by an optmal acton on or before the frst round m such that Δ m < Δ a/ Ths happens wth hgh probablty and leads to logarthmc regret The complment of the frst event yelds lnear regret n tme but occurs wth probablty proportonal to /T The man dfference from the proof n s that on the frst event, the number of tmes we choose each acton s proportonal to z log(t n the exploraton phase of the polcy Ths gves us the requred upper bound n terms of optmal soluton z of LP P Let denote any optmal acton Let m denote the round n whch the last optmal acton s elmnated For each suboptmal acton, defne round m : mn{m : Δ m < Δ } For an optmal acton, m by conventon Then, by the defnton of m, for all rounds m<m, Δ Δ m, and < m 4 < m+ ( Δ Δ m Δ Δ m + From Lemma 3 n the Appendx, the probablty that acton s not elmnated n round m by s at most t T Δ m Let U K U be the set of suboptmal neghbors of acton Let I(t be the acton chosen at tme t by the UCB-LP polcy Let E m be the event that all suboptmal actons wth m m are elmnated by on or before ther respectve m Then, the complement of E m, denoted as Em c, s the event that there exsts some suboptmal acton wth m m, whch s not elmnated n round m Let E c be the event that acton s not elmnated by round m by Let log T and I(t denote the acton chosen at tme t by e the polcy Recall that regret s denoted by R μ(t Let Pm m bedenotedbyp m Hence, p m E R μ(t + t j U E R μ(t {m m} Pm m T Δ jp I(t j {m m} p m T Δ jp {I(t j} E m {m m} p m t j U T Δ jp {I(t j} Em c {m m} p m t j U (+( Next we wll show that term ( leads to logarthmc regret whle term ( leads to a constant regret wth tme Frst, consder the term ( of the regret expresson Recall that U j K j U s the set of suboptmal neghbors of j For each j U, we have, t m f T P {I(t j} Em c {m m} Pm m t m f T t t T P {I(t j} ( U:m m Ec {m m} p m P {I(t j} ( Uj :m m Ec {m m} p m (because {I((t j} depends only on neghbors of j T ( P {I(t j} ( Uj :m m Ec, {m m} T P Uj E c {m } P Uj :m m Ec {m m} p m p m T U j T Δ, m ( usng Lemma 3, P E c {m } T Δ m 3, Δ U j where the last nequalty follows from Equaton ( Hence, the term ( ofregrets T Δ jp {I(t j} Em c {m m} p m t j U j U 3 Δ j Δ U j where δ s the maxmum degree n the network O(Kδ, ( Next, we consder the term ( Recall that, n ths term, we consder the case that all suboptmal actons wth m m are elmnated by on or before m ( T Δ jp {I(t j} E m {m m} p m t j U E R μ(t {m m},e m PE m {m m}p m ( E Regret from { : m m } {m m},e m + E Regret from { : m >m } {m m},e m p m (E Regret from { : m } {m },E mf + E Regret from { : m >m } {m m},e m p m

10 E R μ(t {m },E mf + (a+(b p m E Regret from { : m >m } {m m},e m p m Once agan, we wll consder the above two terms separately For the term (a, under the event E mf, each suboptmal acton s elmnated by by round m Defne round m and the set B as follows: m mn{m : z > m+ }, K :m >m B { U: m > m} After round m, Algorthm chooses only those actons wth m > m Also, by the defnton of the Reset phase of Algorthm, we have that any suboptmal acton / B s chosen (e appears n the set S m at round m onlyuntl all actons n ts neghborhood are elmnated or untl m, whchever happens frst Defne n mn{ m, max K {m j}} for each suboptmal acton Then any suboptmal acton / B s chosen for at most n rounds (a E R μ(t {m },E mf U\B U\B Δ z Δ z log(t Δ n Δ n 3 log(t ˆΔ ˆΔ + B + B Δ log(t Δ m Δ m 3 log(t Δ Δ, (3 Δ where ˆΔ max{ m+, mn K {Δ j}} and (z sthesoluton of LP P Fnally, we consder the term (b An optmal acton s not elmnated n round m f (6 holds for m m Hence, usng (7 and (8, the probablty p m that s elmnated by a suboptmal acton n any round m s at most Hence, term (b sgvenas: T Δ m E Regret from { : m >m } {m m},e m p m U:m m max U Δj U U m T Δ m T max U Δj Δ U:m m m Δ m m + U 3 Δ O(K (4 Now we get the result (6 by combnng the bounds n (, (3, and (4 Further, the defnton of set B ensures that we have Δ z B K Also, usng the Assumpton 4, 3Δ log(t ˆΔ, 3 log(t Δ ˆΔ are Δ bounded by C log(t, where C s a constant ndependent of network structure Hence, (3 can be bounded as: U\B Δ z 3 log(t ˆΔ ˆΔ + B Δ 3 log(t Δ Δ Δ C log(t z C log(t + U\B B m+ C log(t z C log(t + U\B B z C log(t (5 K Hence, we get (7 from (5, (, and (4 9 CONCLUSION In ths work, we studed the stochastc mult-armed bandt problem n the presence of sde-observatons across actons that are embedded n a network We obtaned an asymptotc (wth respect to tme lower bound as a functon of the network structure on the regret of any unformly good polcy Further, we proposed two polces: the ɛ t-greedy- LP polcy, and the UCB-LP polcy, both of whch are optmal n the sense that they acheve the asymptotc lower bound on the regret, up to a multplcatve constant that s ndependent of the network structure These polces can have a better regret performance than exstng polces for some mportant network structures The ɛ t-greedy-lp polcy s a network-aware any-tme polcy, but ts exploraton s oblvous to the average rewards of the suboptmal actons On the other hand, UCB-LP consders both the network structure and the average rewards of actons Fnally, usng numercal examples on the Flxster network dataset, we confrmed the sgnfcant benefts obtaned by our polces aganst other exstng polces Acknowledgments Ths work s supported by the NSF grants: CAREER-CNS , CCF , CNS-0700, CNS-06536, and by grants from the Army Research Offce: W9NF and W9NF Also, the work of A Erylmaz was supported n part by the QNRF grant number NPRP REFERENCES P Auer, N Cesa-Banch, and P Fscher Fnte-tme analyss of the multarmed bandt problem Machne Learnng, 47(-3:35 56, May 00 P Auer and R Ortner UCB revsted: Improved regret bounds for the stochastc mult-armed bandt problem Perodca Mathematca Hungarca, 6(-:55 65, 00 3 R M Bond, C J Farss, J J Jones, A D I Kramer, CMarlow,JESettle,andJHFowlerA 6-mllon-person experment n socal nfluence and poltcal moblzaton Nature, 489:95 98, 0 4 S Bubeck and N Cesa-Banch Regret analyss of stochastc and nonstochastc mult-armed bandt problems Foundatons and Trends n Machne Learnng, 5(:, 0

11 5 S Bubeck, R Munos, G Stoltz, and C Szepesvár X-armed bandts Journal of Machne Learnng Research, : , 0 6 S Caron, B Kveton, M Lelarge, and S Bhagat Leveragng sde observatons n stochastc bandts In UAI, pages 4 5 AUAI Press, 0 7 C Cooper, R Klasng, and M Zto Lower bounds and algorthms for domnatng sets n web graphs Internet Mathematcs, :75 300, I S Dhllon, Y Guan, and B Kuls Weghted graph cuts wthout egenvectors: A multlevel approach IEEE Transactons on Pattern Analyss and Machne Intellgence, 9(: , M Jamal and M Ester A matrx factorzaton technque wth trust propagaton for recommendaton n socal networks In Proceedngs of the fourth ACM conference on Recommender systems, RecSys 0, pages 35 4 ACM, 00 0 T L La and H Robbns Asymptotcally effcent adaptve allocaton rules Advances n Appled Mathematcs, 6(:4, 985 L L, W Chu, J Langford, and R E Schapre A contextual-bandt approach to personalzed news artcle recommendaton In Proceedngs of the 9th Internatonal Conference on World Wde Web, WWW 0, pages ACM, 00 S Mannor and O Shamr From bandts to experts: On the value of sde-observatons In NIPS, pages , 0 3 S Pandey, D Chakrabart, and D Agarwal Mult-armed bandt problems wth dependent arms In Proceedngs of the 4th Internatonal Conference on Machne Learnng, ICML 07, pages 7 78, New York, NY, USA, 007 ACM 4 P Rusmevchentong and J N Tstskls Lnearly parameterzed bandts Math Oper Res, 35(:395 4, 00 5 J Ugander, B Karrer, L Backstrom, and C Marlow The anatomy of the facebook socal graph CoRR, abs/4503, 0 APPENDIX Notaton: S n n n j Xj s called the sample mean of the random varables X,,X n The frst two lemmas below state the Chernoff-Hoeffdng nequalty and Bernsten s nequalty Lemma Let X,,X n be a sequence of random varables wth support 0, and EX tμ for all t n Let S n n n j Xj Then, for all ɛ>0, we have, PS n μ + ɛ e nɛ PS n μ ɛ e nɛ Lemma Let X,,X n be a sequence of random varables wth support 0, and t k varx k X,,X k σ for all t n Let S n n j Xj Then, for all ɛ>0, we have, { } ɛ PS n ES n+ɛ exp σ + ɛ 3 { PS n ES n ɛ exp ɛ σ + 3 ɛ } The next lemma s used n the proof of Proposton 4 Lemma 3 The probablty that acton s not elmnated n round m by s at most T Δ m Proof Let X (m be the sample mean of all observatons for acton avalable n round m Let X (m bethe sample mean of the optmal acton The constrants of LP P ensure that at the end of each round m, for all actons n log(t Δ B m, we have n(m : m observatons Now, for Δ m m m, f we have, log(t X (m μ Δ m + and X (m μ log(t Δ m, n(m n(m (6 then, acton s elmnated by n round m In fact, n round m, we have log(t Δ m n(m Δ m < Δ 4 Hence, n the elmnaton phase of the UCB-LP polcy, f (6 holds for acton n round m, we have, log(t X Δ m log(t (m + μ + Δ m n(m n(m <μ +Δ μ log(t Δ m n(m X (m log(t Δ m n(m log(t Δ m, n(m and acton s elmnated Hence, the probablty that acton s not elmnated n round m s the probablty that ether one of the nequaltes n (6 do not hold Usng Chernoff- Hoeffdng bound (Lemma, we can bound ths as follows, P log(t X(m >μ + Δ m n(m T Δ (7 m P X (m <μ log(t Δ m n(m T Δ (8 m Summng the above two nequaltes for m m gves us that the probablty that acton s not elmnated n round m by s at most T Δ m The next proposton s a modfed verson of Theorem n 0 We use t to obtan the regret lower bound n Proposton Proposton 5 Suppose Assumptons,, and 3 hold Then, under any unformly good polcy φ, we have that, for each acton wth μ <μ, lm nf t ES (t log(t D(θ θ (9

12 Proof Ths proof follows from the proof of Theorem n 0 To fx deas, suppose s a suboptmal acton and suppose acton s optmal Let the parameters of the reward dstrbutons be θ (θ,,θ K and the assocated means be μ (μ,,μ K Then, for any 0 <δ<, due to Assumptons,, 3, we have that there exsts a parameter λ and mean μ λ assocated wth the densty functon f(,λ such that μ λ >μ and D(θ θ D(θ λ δd(θ θ (0 Now, consder the new sets of parameters η (λ,,θ K, where the mean rewards are changed to (μ λ,μ,,μ K For ths set of parameters, acton s the unque optmal Then, for any unformly good polcy, for 0 <b<δ, E ηt T (t o(t b and therefore, P η T (t < ( δlog(t/d(θ λ o(t b, smlar to the asymptotc lower bound proof n 0 Now, usng the fact that S (t T (t, we have P η S (t < ( δlog(t/d(θ λ o(t b Now the rest of the proof of Theorem n 0 apples drectly to S (t We wll repeat t below for completeness Let (Y (k k be the observatons drawn from dstrbuton F and defne m ( f(y(k; θ L m log f(y (k; λ k Now, we have that P ηc to(t b wherec t {S (t < ( δlog(t/d(θ λ andl S (t ( blog(t} Now, P ηt (t t,,t K(t t K,L s ( blog(t E η P η T (t t,,t K(t t K, L s ( blog(t (Y (k { K,k,,t s f(y (k; λ E θ f(y k (k; θ P θ T (t t,,t K(t t K, L s ( blog(t (Y (k { K,k,,s exp( ( blog(tp θ T (t t,, T K(t t K,L s ( blog(t, ( In the above, we used the defnton s t j j K Also, C t s a dsjont unon of events of the form {T (t t,,t K(t t K,L s ( blog(t} wth t ++t M t and s ( δlog(t/d(θ λ Hence usng (, P θ C t t ( b P ηc t 0 ( By strong law of large numbers L m/m D(θ λ asm and max km L k /m D(θ λ almostsurely Now,snce b> δ, t follows that as t, P θ L k ( blog(t forsome k<( δlog(t/d(θ λ 0 (3 Hence, from ( and (3, P θ S (t < ( δlog(t/d(θ λ 0 Now usng the above equaton wth (0 gves us the asymptotc lower bound n (9

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

8 Algorithm for Binary Searching in Trees

8 Algorithm for Binary Searching in Trees 8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Learning the Best K-th Channel for QoS Provisioning in Cognitive Networks

Learning the Best K-th Channel for QoS Provisioning in Cognitive Networks 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Dynamic Cost-Per-Action Mechanisms and Applications to Online Advertising

Dynamic Cost-Per-Action Mechanisms and Applications to Online Advertising Dynamc Cost-Per-Acton Mechansms and Applcatons to Onlne Advertsng Hamd Nazerzadeh Amn Saber Rakesh Vohra Abstract We examne the problem of allocatng a resource repeatedly over tme amongst a set of agents.

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

A Lyapunov Optimization Approach to Repeated Stochastic Games

A Lyapunov Optimization Approach to Repeated Stochastic Games PROC. ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, OCT. 2013 1 A Lyapunov Optmzaton Approach to Repeated Stochastc Games Mchael J. Neely Unversty of Southern Calforna http://www-bcf.usc.edu/

More information

Dynamic Cost-Per-Action Mechanisms and Applications to Online Advertising

Dynamic Cost-Per-Action Mechanisms and Applications to Online Advertising Dynamc Cost-Per-Acton Mechansms and Applcatons to Onlne Advertsng Hamd Nazerzadeh Amn Saber Rakesh Vohra September 9, 2008 Abstract We examne the problem of allocatng a resource repeatedly over tme amongst

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

General Auction Mechanism for Search Advertising

General Auction Mechanism for Search Advertising General Aucton Mechansm for Search Advertsng Gagan Aggarwal S. Muthukrshnan Dávd Pál Martn Pál Keywords game theory, onlne auctons, stable matchngs ABSTRACT Internet search advertsng s often sold by an

More information

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy 4.02 Quz Solutons Fall 2004 Multple-Choce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multple-choce questons. For each queston, only one of the answers s correct.

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

Efficient Project Portfolio as a tool for Enterprise Risk Management

Efficient Project Portfolio as a tool for Enterprise Risk Management Effcent Proect Portfolo as a tool for Enterprse Rsk Management Valentn O. Nkonov Ural State Techncal Unversty Growth Traectory Consultng Company January 5, 27 Effcent Proect Portfolo as a tool for Enterprse

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

Enabling P2P One-view Multi-party Video Conferencing

Enabling P2P One-view Multi-party Video Conferencing Enablng P2P One-vew Mult-party Vdeo Conferencng Yongxang Zhao, Yong Lu, Changja Chen, and JanYn Zhang Abstract Mult-Party Vdeo Conferencng (MPVC) facltates realtme group nteracton between users. Whle P2P

More information

v a 1 b 1 i, a 2 b 2 i,..., a n b n i.

v a 1 b 1 i, a 2 b 2 i,..., a n b n i. SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 455 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces we have studed thus far n the text are real vector spaces snce the scalars are

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts Power-of-wo Polces for Sngle- Warehouse Mult-Retaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)

More information

Extending Probabilistic Dynamic Epistemic Logic

Extending Probabilistic Dynamic Epistemic Logic Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set

More information

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS Chrs Deeley* Last revsed: September 22, 200 * Chrs Deeley s a Senor Lecturer n the School of Accountng, Charles Sturt Unversty,

More information

The literature on many-server approximations provides significant simplifications toward the optimal capacity

The literature on many-server approximations provides significant simplifications toward the optimal capacity Publshed onlne ahead of prnt November 13, 2009 Copyrght: INFORMS holds copyrght to ths Artcles n Advance verson, whch s made avalable to nsttutonal subscrbers. The fle may not be posted on any other webste,

More information

Slow Fading Channel Selection: A Restless Multi-Armed Bandit Formulation

Slow Fading Channel Selection: A Restless Multi-Armed Bandit Formulation Slow Fadng Channel Selecton: A Restless Mult-Armed Bandt Formulaton Konstantn Avrachenkov INRIA, Maestro Team BP95, 06902 Sopha Antpols, France Emal: k.avrachenkov@sopha.nra.fr Laura Cottatellucc, Lorenzo

More information

+ + + - - This circuit than can be reduced to a planar circuit

+ + + - - This circuit than can be reduced to a planar circuit MeshCurrent Method The meshcurrent s analog of the nodeoltage method. We sole for a new set of arables, mesh currents, that automatcally satsfy KCLs. As such, meshcurrent method reduces crcut soluton to

More information

OPTIMAL INVESTMENT POLICIES FOR THE HORSE RACE MODEL. Thomas S. Ferguson and C. Zachary Gilstein UCLA and Bell Communications May 1985, revised 2004

OPTIMAL INVESTMENT POLICIES FOR THE HORSE RACE MODEL. Thomas S. Ferguson and C. Zachary Gilstein UCLA and Bell Communications May 1985, revised 2004 OPTIMAL INVESTMENT POLICIES FOR THE HORSE RACE MODEL Thomas S. Ferguson and C. Zachary Glsten UCLA and Bell Communcatons May 985, revsed 2004 Abstract. Optmal nvestment polces for maxmzng the expected

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Generalizing the degree sequence problem

Generalizing the degree sequence problem Mddlebury College March 2009 Arzona State Unversty Dscrete Mathematcs Semnar The degree sequence problem Problem: Gven an nteger sequence d = (d 1,...,d n ) determne f there exsts a graph G wth d as ts

More information

Section 5.4 Annuities, Present Value, and Amortization

Section 5.4 Annuities, Present Value, and Amortization Secton 5.4 Annutes, Present Value, and Amortzaton Present Value In Secton 5.2, we saw that the present value of A dollars at nterest rate per perod for n perods s the amount that must be deposted today

More information

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent

More information

Period and Deadline Selection for Schedulability in Real-Time Systems

Period and Deadline Selection for Schedulability in Real-Time Systems Perod and Deadlne Selecton for Schedulablty n Real-Tme Systems Thdapat Chantem, Xaofeng Wang, M.D. Lemmon, and X. Sharon Hu Department of Computer Scence and Engneerng, Department of Electrcal Engneerng

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

Analysis of Premium Liabilities for Australian Lines of Business

Analysis of Premium Liabilities for Australian Lines of Business Summary of Analyss of Premum Labltes for Australan Lnes of Busness Emly Tao Honours Research Paper, The Unversty of Melbourne Emly Tao Acknowledgements I am grateful to the Australan Prudental Regulaton

More information

This paper can be downloaded without charge from the Social Sciences Research Network Electronic Paper Collection: http://ssrn.com/abstract=2694633

This paper can be downloaded without charge from the Social Sciences Research Network Electronic Paper Collection: http://ssrn.com/abstract=2694633 Workng Paper Coordnatng Prcng and Inventory Replenshment wth Nonparametrc Demand Learnng Boxao Chen Department of Industral and Operatons Engneerng Unversty of Mchgan Xul Chao Department of Industral and

More information

In some supply chains, materials are ordered periodically according to local information. This paper investigates

In some supply chains, materials are ordered periodically according to local information. This paper investigates MANUFACTURING & SRVIC OPRATIONS MANAGMNT Vol. 12, No. 3, Summer 2010, pp. 430 448 ssn 1523-4614 essn 1526-5498 10 1203 0430 nforms do 10.1287/msom.1090.0277 2010 INFORMS Improvng Supply Chan Performance:

More information

Online Advertisement, Optimization and Stochastic Networks

Online Advertisement, Optimization and Stochastic Networks Onlne Advertsement, Optmzaton and Stochastc Networks Bo (Rambo) Tan and R. Srkant Department of Electrcal and Computer Engneerng Unversty of Illnos at Urbana-Champagn Urbana, IL, USA 1 arxv:1009.0870v6

More information

How To Calculate An Approxmaton Factor Of 1 1/E

How To Calculate An Approxmaton Factor Of 1 1/E Approxmaton algorthms for allocaton problems: Improvng the factor of 1 1/e Urel Fege Mcrosoft Research Redmond, WA 98052 urfege@mcrosoft.com Jan Vondrák Prnceton Unversty Prnceton, NJ 08540 jvondrak@gmal.com

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

Sngle Snk Buy at Bulk Problem and the Access Network

Sngle Snk Buy at Bulk Problem and the Access Network A Constant Factor Approxmaton for the Sngle Snk Edge Installaton Problem Sudpto Guha Adam Meyerson Kamesh Munagala Abstract We present the frst constant approxmaton to the sngle snk buy-at-bulk network

More information

Dynamic Online-Advertising Auctions as Stochastic Scheduling

Dynamic Online-Advertising Auctions as Stochastic Scheduling Dynamc Onlne-Advertsng Auctons as Stochastc Schedulng Isha Menache and Asuman Ozdaglar Massachusetts Insttute of Technology {sha,asuman}@mt.edu R. Srkant Unversty of Illnos at Urbana-Champagn rsrkant@llnos.edu

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

Allocating Collaborative Profit in Less-than-Truckload Carrier Alliance

Allocating Collaborative Profit in Less-than-Truckload Carrier Alliance J. Servce Scence & Management, 2010, 3: 143-149 do:10.4236/jssm.2010.31018 Publshed Onlne March 2010 (http://www.scrp.org/journal/jssm) 143 Allocatng Collaboratve Proft n Less-than-Truckload Carrer Allance

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS 21 22 September 2007, BULGARIA 119 Proceedngs of the Internatonal Conference on Informaton Technologes (InfoTech-2007) 21 st 22 nd September 2007, Bulgara vol. 2 INVESTIGATION OF VEHICULAR USERS FAIRNESS

More information

INSTITUT FÜR INFORMATIK

INSTITUT FÜR INFORMATIK INSTITUT FÜR INFORMATIK Schedulng jobs on unform processors revsted Klaus Jansen Chrstna Robene Bercht Nr. 1109 November 2011 ISSN 2192-6247 CHRISTIAN-ALBRECHTS-UNIVERSITÄT ZU KIEL Insttut für Informat

More information

Optimal resource capacity management for stochastic networks

Optimal resource capacity management for stochastic networks Submtted for publcaton. Optmal resource capacty management for stochastc networks A.B. Deker H. Mlton Stewart School of ISyE, Georga Insttute of Technology, Atlanta, GA 30332, ton.deker@sye.gatech.edu

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

Cross-Selling in a Call Center with a Heterogeneous Customer Population

Cross-Selling in a Call Center with a Heterogeneous Customer Population OPERATIONS RESEARCH Vol. 57, No. 2, March Aprl 2009, pp. 299 313 ssn 0030-364X essn 1526-5463 09 5702 0299 nforms do 10.1287/opre.1080.0568 2009 INFORMS Cross-Sellng n a Call Center wth a Heterogeneous

More information

PERRON FROBENIUS THEOREM

PERRON FROBENIUS THEOREM PERRON FROBENIUS THEOREM R. CLARK ROBINSON Defnton. A n n matrx M wth real entres m, s called a stochastc matrx provded () all the entres m satsfy 0 m, () each of the columns sum to one, m = for all, ()

More information

Brigid Mullany, Ph.D University of North Carolina, Charlotte

Brigid Mullany, Ph.D University of North Carolina, Charlotte Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte

More information

Feasibility of Using Discriminate Pricing Schemes for Energy Trading in Smart Grid

Feasibility of Using Discriminate Pricing Schemes for Energy Trading in Smart Grid Feasblty of Usng Dscrmnate Prcng Schemes for Energy Tradng n Smart Grd Wayes Tushar, Chau Yuen, Bo Cha, Davd B. Smth, and H. Vncent Poor Sngapore Unversty of Technology and Desgn, Sngapore 138682. Emal:

More information

Cross-Selling in a Call Center with a Heterogeneous Customer Population

Cross-Selling in a Call Center with a Heterogeneous Customer Population OPERATIONS RESEARCH Vol. 57, No. 2, March Aprl 29, pp. 299 313 ssn 3-364X essn 1526-5463 9 572 299 nforms do 1.1287/opre.18.568 29 INFORMS Cross-Sellng n a Call Center wth a Heterogeneous Customer Populaton

More information

2008/8. An integrated model for warehouse and inventory planning. Géraldine Strack and Yves Pochet

2008/8. An integrated model for warehouse and inventory planning. Géraldine Strack and Yves Pochet 2008/8 An ntegrated model for warehouse and nventory plannng Géraldne Strack and Yves Pochet CORE Voe du Roman Pays 34 B-1348 Louvan-la-Neuve, Belgum. Tel (32 10) 47 43 04 Fax (32 10) 47 43 01 E-mal: corestat-lbrary@uclouvan.be

More information

When Network Effect Meets Congestion Effect: Leveraging Social Services for Wireless Services

When Network Effect Meets Congestion Effect: Leveraging Social Services for Wireless Services When Network Effect Meets Congeston Effect: Leveragng Socal Servces for Wreless Servces aowen Gong School of Electrcal, Computer and Energy Engeerng Arzona State Unversty Tempe, AZ 8587, USA xgong9@asuedu

More information

On File Delay Minimization for Content Uploading to Media Cloud via Collaborative Wireless Network

On File Delay Minimization for Content Uploading to Media Cloud via Collaborative Wireless Network On Fle Delay Mnmzaton for Content Uploadng to Meda Cloud va Collaboratve Wreless Network Ge Zhang and Yonggang Wen School of Computer Engneerng Nanyang Technologcal Unversty Sngapore Emal: {zh0001ge, ygwen}@ntu.edu.sg

More information

Underwriting Risk. Glenn Meyers. Insurance Services Office, Inc.

Underwriting Risk. Glenn Meyers. Insurance Services Office, Inc. Underwrtng Rsk By Glenn Meyers Insurance Servces Offce, Inc. Abstract In a compettve nsurance market, nsurers have lmted nfluence on the premum charged for an nsurance contract. hey must decde whether

More information

Value Driven Load Balancing

Value Driven Load Balancing Value Drven Load Balancng Sherwn Doroud a, Esa Hyytä b,1, Mor Harchol-Balter c,2 a Tepper School of Busness, Carnege Mellon Unversty, 5000 Forbes Ave., Pttsburgh, PA 15213 b Department of Communcatons

More information

Fixed income risk attribution

Fixed income risk attribution 5 Fxed ncome rsk attrbuton Chthra Krshnamurth RskMetrcs Group chthra.krshnamurth@rskmetrcs.com We compare the rsk of the actve portfolo wth that of the benchmark and segment the dfference between the two

More information

Minimal Coding Network With Combinatorial Structure For Instantaneous Recovery From Edge Failures

Minimal Coding Network With Combinatorial Structure For Instantaneous Recovery From Edge Failures Mnmal Codng Network Wth Combnatoral Structure For Instantaneous Recovery From Edge Falures Ashly Joseph 1, Mr.M.Sadsh Sendl 2, Dr.S.Karthk 3 1 Fnal Year ME CSE Student Department of Computer Scence Engneerng

More information

An MILP model for planning of batch plants operating in a campaign-mode

An MILP model for planning of batch plants operating in a campaign-mode An MILP model for plannng of batch plants operatng n a campagn-mode Yanna Fumero Insttuto de Desarrollo y Dseño CONICET UTN yfumero@santafe-concet.gov.ar Gabrela Corsano Insttuto de Desarrollo y Dseño

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES The goal: to measure (determne) an unknown quantty x (the value of a RV X) Realsaton: n results: y 1, y 2,..., y j,..., y n, (the measured values of Y 1, Y 2,..., Y j,..., Y n ) every result s encumbered

More information

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and POLYSA: A Polynomal Algorthm for Non-bnary Constrant Satsfacton Problems wth and Mguel A. Saldo, Federco Barber Dpto. Sstemas Informátcos y Computacón Unversdad Poltécnca de Valenca, Camno de Vera s/n

More information

Portfolio Loss Distribution

Portfolio Loss Distribution Portfolo Loss Dstrbuton Rsky assets n loan ortfolo hghly llqud assets hold-to-maturty n the bank s balance sheet Outstandngs The orton of the bank asset that has already been extended to borrowers. Commtment

More information

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008 Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn

More information

NON-CONSTANT SUM RED-AND-BLACK GAMES WITH BET-DEPENDENT WIN PROBABILITY FUNCTION LAURA PONTIGGIA, University of the Sciences in Philadelphia

NON-CONSTANT SUM RED-AND-BLACK GAMES WITH BET-DEPENDENT WIN PROBABILITY FUNCTION LAURA PONTIGGIA, University of the Sciences in Philadelphia To appear n Journal o Appled Probablty June 2007 O-COSTAT SUM RED-AD-BLACK GAMES WITH BET-DEPEDET WI PROBABILITY FUCTIO LAURA POTIGGIA, Unversty o the Scences n Phladelpha Abstract In ths paper we nvestgate

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6 PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has

More information

Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits

Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits Lnear Crcuts Analyss. Superposton, Theenn /Norton Equalent crcuts So far we hae explored tmendependent (resste) elements that are also lnear. A tmendependent elements s one for whch we can plot an / cure.

More information

Software project management with GAs

Software project management with GAs Informaton Scences 177 (27) 238 241 www.elsever.com/locate/ns Software project management wth GAs Enrque Alba *, J. Francsco Chcano Unversty of Málaga, Grupo GISUM, Departamento de Lenguajes y Cencas de

More information

Stochastic Games on a Multiple Access Channel

Stochastic Games on a Multiple Access Channel Stochastc Games on a Multple Access Channel Prashant N and Vnod Sharma Department of Electrcal Communcaton Engneerng Indan Insttute of Scence, Bangalore 560012, Inda Emal: prashant2406@gmal.com, vnod@ece.sc.ernet.n

More information

PAS: A Packet Accounting System to Limit the Effects of DoS & DDoS. Debish Fesehaye & Klara Naherstedt University of Illinois-Urbana Champaign

PAS: A Packet Accounting System to Limit the Effects of DoS & DDoS. Debish Fesehaye & Klara Naherstedt University of Illinois-Urbana Champaign PAS: A Packet Accountng System to Lmt the Effects of DoS & DDoS Debsh Fesehaye & Klara Naherstedt Unversty of Illnos-Urbana Champagn DoS and DDoS DDoS attacks are ncreasng threats to our dgtal world. Exstng

More information

denote the location of a node, and suppose node X . This transmission causes a successful reception by node X for any other node

denote the location of a node, and suppose node X . This transmission causes a successful reception by node X for any other node Fnal Report of EE359 Class Proect Throughput and Delay n Wreless Ad Hoc Networs Changhua He changhua@stanford.edu Abstract: Networ throughput and pacet delay are the two most mportant parameters to evaluate

More information

The Application of Fractional Brownian Motion in Option Pricing

The Application of Fractional Brownian Motion in Option Pricing Vol. 0, No. (05), pp. 73-8 http://dx.do.org/0.457/jmue.05.0..6 The Applcaton of Fractonal Brownan Moton n Opton Prcng Qng-xn Zhou School of Basc Scence,arbn Unversty of Commerce,arbn zhouqngxn98@6.com

More information

Efficient Striping Techniques for Variable Bit Rate Continuous Media File Servers æ

Efficient Striping Techniques for Variable Bit Rate Continuous Media File Servers æ Effcent Strpng Technques for Varable Bt Rate Contnuous Meda Fle Servers æ Prashant J. Shenoy Harrck M. Vn Department of Computer Scence, Department of Computer Scences, Unversty of Massachusetts at Amherst

More information

Joint Optimization of Bid and Budget Allocation in Sponsored Search

Joint Optimization of Bid and Budget Allocation in Sponsored Search Jont Optmzaton of Bd and Budget Allocaton n Sponsored Search Wenan Zhang Shangha Jao Tong Unversty Shangha, 224, P. R. Chna wnzhang@apex.sjtu.edu.cn Yong Yu Shangha Jao Tong Unversty Shangha, 224, P. R.

More information

Research Article Enhanced Two-Step Method via Relaxed Order of α-satisfactory Degrees for Fuzzy Multiobjective Optimization

Research Article Enhanced Two-Step Method via Relaxed Order of α-satisfactory Degrees for Fuzzy Multiobjective Optimization Hndaw Publshng Corporaton Mathematcal Problems n Engneerng Artcle ID 867836 pages http://dxdoorg/055/204/867836 Research Artcle Enhanced Two-Step Method va Relaxed Order of α-satsfactory Degrees for Fuzzy

More information

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy Fnancal Tme Seres Analyss Patrck McSharry patrck@mcsharry.net www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton

More information

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 14 MORE ABOUT REGRESSION CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp

More information

n + d + q = 24 and.05n +.1d +.25q = 2 { n + d + q = 24 (3) n + 2d + 5q = 40 (2)

n + d + q = 24 and.05n +.1d +.25q = 2 { n + d + q = 24 (3) n + 2d + 5q = 40 (2) MATH 16T Exam 1 : Part I (In-Class) Solutons 1. (0 pts) A pggy bank contans 4 cons, all of whch are nckels (5 ), dmes (10 ) or quarters (5 ). The pggy bank also contans a con of each denomnaton. The total

More information

Sketching Sampled Data Streams

Sketching Sampled Data Streams Sketchng Sampled Data Streams Florn Rusu, Aln Dobra CISE Department Unversty of Florda Ganesvlle, FL, USA frusu@cse.ufl.edu adobra@cse.ufl.edu Abstract Samplng s used as a unversal method to reduce the

More information

Efficient Reinforcement Learning in Factored MDPs

Efficient Reinforcement Learning in Factored MDPs Effcent Renforcement Learnng n Factored MDPs Mchael Kearns AT&T Labs mkearns@research.att.com Daphne Koller Stanford Unversty koller@cs.stanford.edu Abstract We present a provably effcent and near-optmal

More information

Nordea G10 Alpha Carry Index

Nordea G10 Alpha Carry Index Nordea G10 Alpha Carry Index Index Rules v1.1 Verson as of 10/10/2013 1 (6) Page 1 Index Descrpton The G10 Alpha Carry Index, the Index, follows the development of a rule based strategy whch nvests and

More information

Joe Pimbley, unpublished, 2005. Yield Curve Calculations

Joe Pimbley, unpublished, 2005. Yield Curve Calculations Joe Pmbley, unpublshed, 005. Yeld Curve Calculatons Background: Everythng s dscount factors Yeld curve calculatons nclude valuaton of forward rate agreements (FRAs), swaps, nterest rate optons, and forward

More information