Multi-armed Bandit Problems with History

Size: px
Start display at page:

Download "Multi-armed Bandit Problems with History"

Transcription

1 Multi-arme Bait Problems with History Paaga Shivaswamy Departmet of Computer Siee Corell Uiversity, Ithaa NY Thorste Joahims Departmet of Computer Siee Corell Uiversity, Ithaa NY Abstrat I this paper we osier the stohasti multi-arme bait problem. However, ulike i the ovetioal versio of this problem, we o ot assume that the algorithm starts from srath. May appliatios offer observatios of some of the arms eve before the algorithm starts. We propose three ovel multi-arme bait algorithms that a exploit this ata. A upper bou o the regret is erive i eah ase. The results show that a logarithmi amout of histori ata a reue regret from logarithmi to ostat. The effetiveess of the propose algorithms are emostrate o a large-sale maliious URL etetio problem. 1 Itroutio May real-worl problems, ragig from the optimizatio of avertisig reveue i searh egies to the sheulig of liial trials, a be moele as multiarme bait problems. At eah time step, the algorithm hooses oe of the possible arms i.e. avertisemets, treatmets a observes its rewars. The goal is to maximize the sum of rewars over all time steps, typially expresse as regret ompare to the best arm i hisight. I the ovetioal formulatio of the problem, the algorithm has o prior kowlege about the arms. May appliatios, however, provie some ata about the arms eve before the algorithm starts. For example: A searh egie ompay has esige K retrieval futios. Histori ata is available from a betatest o a small sample of pai users, but ow the Appearig i Proeeigs of the 15 th Iteratioal Coferee o Artifiial Itelligee a Statistis AISTATS 1, La Palma, Caary Islas. Volume XX of JMLR: W&CP XX. Copyright 1 by the authors. futios shoul be fiele i the proutio system as to maximize likthrough. A olie movie ompay has K ifferet reommeer futios to suggest movies to a user. Whe a ew user sigs up, he is aske to rate a few pivotal movies whih provies histori ata for optimizig the hoie of reommeer futio i the log ru. A liial trial experimet was stoppe ue to a legal hurle. Now the ourts wet i favor of otiuig the liial trial but also war that the losses shoul be miimum from ow o. More geerally, we efie histori ata as ay observatios of the arms that are ollete before the start of the olie learig algorithm. The algorithm itself has o otrol over the hoie of arms i the histori ata, or o all arms have to be sample uiformly. The availability of suh histori ata leas to the questio of how olie learig algorithms a best use it to reue regret. This problem is meaigful oly for the ase of stohasti arms [8, 5], sie o amout of histori ata a help i the aversarial settig [4]. To our best kowlege, this problem has ot bee stuie i the literature. However, the work by [9] o bait problem with sie iformatio is relate. Their work assumes that histori ata ollete via some poliy is available to evaluate a mappig from sie iformatio to arms. I the absee of sie-iformatio, their poliy evaluatio strategy reues to hoosig the arm with the highest mea rewar o the histori ata. Relate is also the Sleepig Baits Problem [7], where oly a subset of the arms is ative at eah time step. While it a mimi histori ata to some extet e.g. it allows the aitio of a ew arm at ay time, algorithms a bous are weaker sie they aot rely o a separatio of histori ata a olie learig. This paper propose three ew olie learig algorithms that are able to exploit histori ata. We erive

2 Multi-arme Bait Problems with History upper bous o the regret for eah of the three algorithms, showig that a logarithmi amout of histori ata allows them to ahieve ostat regret. A esirable property of ay bait algorithm with histori obseravatios is that the regret is zero with ifiite histori ata. All the three algorithms that we propose satisfy this property. We also evaluate the algorithms empirially o a maliious URL etetio problem, fiig that histori ata a make a substatial ifferee o pratial problems. Problem Defiitio a Notatio The stohasti K arme bait problem osiers boue raom variables X,t [,1] for 1 K a time iex t 1. Eah X,t eotes the rewar thatisiurrewhethe th armispullethet th time. For arm, the rewars X,t are iepeet a ietially istribute with a ukow mea µ a a ukow variae σ. The arm with the largest mea rewariseoteby i.e., µ := max 1 i K µ i. Further, for ay arm, eotes µ µ. Ofte, we replae with i ay otatio to eote a quatity that orrespos to. Histori observatios are eote by X,t h [,1] for 1 K a 1 t H iexig the t th istae of histori rewar for arm. H is the umber of histori istaes available for arm, a H is efie as H := K =1 H. The histori rewars for eah arm are assume to be raw iepeetly from the same istributios as the o-histori rewars. T eotes the umber of times the arm is pulle betwee times 1 a this exlues the pulls of the arm i the histori ata. The regret at time is efie as R := µ µ K =1 E[T ], where E[T ] is the expetatio of T. The per-rou regret at time is efie as R /. H The mea rewar from the histori ata for arm is efie as X H h := Xh,t. Mea rewarofarm urig the exeutio of the algorithms util its th pull is efie as X, := X,t. Aalogously, the oit mea rewar of arm iorporatig both the histori a the olie ata is X H, h := Xh,t + X,t. Fi- H + ally, V, h eotes the sample variae of the rewars for arm util its th pull iluig the histori ata a V, eotes the sample variae without history. 3 A Algorithm We first osier the simplest algorithm that makes use of the histori ata: pik the arm with the maximum mea rewar o the histori ata a the to simply play that arm i every iteratio. Ufortuately, this is ot a very goo strategy, sie there is a ostat probability of sufferig regret i eah step. By ostrutig a example, Theorem 1 shows that this algorithm a have regret that grows polyomially with time eve if the arms have a logarithmi amout of histori ata. Theorem 1 Cosier a two arme bait problem. The first arm has a fixe rewar.5+ǫ,.5 > ǫ >, the seo arm has a Beroulli rewar with mea.5. Suppose H = 3δl/16ǫ the the aive strategy has regret growig polyomially with for ay > exp1/δ. Proof We lower bou the probability that the observe mea rewar for the worse seo arm is higher tha the mea rewarfor the better first arm: P[ X h > X 1] h = P[B > H.5+ǫ] [ P Z > 4 H ǫ/ ] 3. 1 I 1 we applie Slu s iequality [1] whih states: [ P[B > t] P Z > t p / ] p1 p, for a biomial raom variable B parametrize by a p suh that p 1/, p t 1 p, a Z N,1i.e. staargaussiaraomvariable. Further, for Z N,1, we have from a result i [6]: P[Z > θ] θexp θ / / π1+θ. For θ > 1, it is easy to verify that θ/1 + θ > exp θ /. Thus, P[Z > θ] exp θ / π. Applyig this to 1, we get, P[ X h > X h 1 ] exp 16H ǫ /3 / π. Substitutig the value of H from the statemet of the theorem, we get, P[ X h > X h 1 ] 1 / π δ. Thus, the regret ahieve by the algorithm i steps is at least ǫ 1 δ / π. From θ > 1, we get > exp1/δ. 4 Algorithms a Aalysis I this setio, we propose three ew algorithms for the stohasti multi-arme bait problem with histori ata. For eah algorithm, we prove a logarithmi regret bou. Iterestigly, these bous show

3 Paaga Shivaswamy, Thorste Joahims Algorithm 1 UCB1 At time t play the arm that maximizes X, + lt, where eotes T t 1. Algorithm HUCB1 At time t play the arm that maximizes X, h lh+t + +H, where eotes T t 1. that a logarithmi amout of histori ata is suffiiet to allow these algorithms to ahieve ostat regret. Moreover, as the umber of histori observatios for every arm tes to ifitiy, the regret ahieve is zero. I partiular, we erive bous for the expete umber of pulls for ay suboptimal arm, i.e., E[T ]. From these, the regret bou a be ompute as : > E[T ]. 4.1 HUCB1: UCB1 with Histori Data Our first algorithm is erive from the UCB1 algorithm [5]. The origial UCB1 algorithm is give i Algorithm 1, while our extesio of UCB1 for histori ata alle HUCB1 is show i Algorithm. For a give amout H of historial ata for eah arm, the followig theorem provies a upper bou for HUCB1 o the expete umber of pulls for ay suboptimal arm. Theorem The expete umber of pulls of ay suboptimal arm, for ay time horizo, satisfies, E[T ] 1+l + + π 1+6H 6H +1 + π 1+6H 6H +1., where, l + = max, 8log+H H. Proof Defie i t,s = lt+h i /H i +s, we the have, for ay iteger l >, T = l+ {I t = } l+ {I t =,T t 1 l} { Xh,T t 1 + t 1,T t 1 X },T h t 1 + t 1,T,T t 1 t 1 l { } l+ mi X,s h <s<t + t 1,s max X,s h l s + t 1,s t l+ s=1 s =l { Xh,s + t,s X h,s + t,s}. Theevet{ Xh,s + t,s X },s h + t,s implies at least oe of the followig hols: 1 X h,s µ t,s, X h,s µ + t,s, µ < µ + t,s. 3 The erivatioso-faris verysimilarto that i the origial UCB1 aalysis. However, from this poit, havig histori ata starts to have a sigifiat impat. The probability that the first two iequalities i 3 hol a be bou usig Hoeffig s iequality; ilusio of histori ata gives sigifiatly tighter bous: P [ Xh,s µ t,s] e 4logt+H = t+h 4, ] P[ Xh µ,s + t,s e 4logt+H = t+h 4. Further, for our hoie of l = l + give i, the thir iequality i 3 is false. We are ow reay to bou the expete umber of pulls for arm. We have, E[T ] l l + + s=1 s=1 s =l + P[ X h,s µ t,s] s =l + P[ X h,s µ + t,s ] s=1 s =l + t+h 4 +t+h 4 1+l + + π 1+6H 6H +1 + π 1+6H 6H +1. I the above, we have use the fat that mm 1π 3m+1 m 1 t mm+π 3m+1, to erive a upper bou for t+h. First, ote that the above bou reues to the bou for the UCB1 algorithm [5] whe H = for all. Next, to see how muh impat histori ata a have o the regret, osier = exp H 8 H. I this situatio E[T ] π 1+6H 6H +1 + π 1+6H 6H +1 for HUCB1 whih is iversely relate to H. However, for UCB1, for the above hoie of, the upper bou o the E[T ] is upper boue by 1+ 8 H log exp 8 H + π 3, whih is approximately liear i H. Rather tha the ofiee iterval show i Algorithm, at first glae oe might thik that 1 It is easy to hek this by egatig this laim.

4 Multi-arme Bait Problems with History logh+t +H is the most atural hoie to use for histori ata. It a be show that this hoie leas to the followig bou: E[T ] max,8 log+h H + π 1+6H 3H +1. It therefore has two isavatages. First, it oes ot take ito aout that there oul be ifferet umbers of pulls for ifferet arms i the histori ata. Seo, wheh issmallbut H isquitelarge,the abovebou a be worse tha the oe erive i Theorem. 4. HUCB3: A ǫ-greey Algorithm Arguably the simplest bait algorithm is UCB3 [5]. We ow explore whether there is a similar algorithm with histori ata. We first preset a slightly moifie versio of UCB3 i Algorithm 3. Istea of havig a sigle rate ǫ for all arms, the followig versio has a ifferet rate ǫ for arm. Despite this hage, the aalysis of this algorithm is aalogous to that of the origial UCB3 algorithm. UCB3 has two parameters, whih is a lowerbouothesmallesto-zero aaother parameter >, but these two parameters always appear together as /. Algorithm 3 UCB3 Parameters > a < < mi Defie a sequee for eah arm: ǫ := mi 1 K, At iteratio, let i be the arm with the highest average rewar with o histori ata, play arm i with probability 1 K =1 ǫ. Play arm with probability ǫ. To erive a algorithm that a exploit histori ata, the key is to set the rates ǫ i a way that aouts for histori ata. It might seem, at first, that replaig +H therate with woulwork. Ufortuately, this approah oes ot lea to strog guaratees. First, observe that i the ase of UCB3, ǫ is 1/K util K/. The amout of exploratio oe by UCB3 betwee times t := K/ + 1 to is lower boue as follows: P[I t = ] = t=t t=t log K O1 To erive a ǫ-greey-like algorithm that a exploit histori ata, we first fi suh that the expete exploratio exees H. This is oe by settig H We igore the floor o K/ for brevity. equal to the lower bou we igore the ostat aitive term i the above equatio. This gives = K exp H. The histori versio of the ǫ- greey algorithm will have the same rates as UCB3 i the first K/ steps. However, after that, the rate use by the histori algorithm at time step t > K/ will be that of UCB3 at step +t K/. Base o these ieas, HUCB3 is presete i Algorithm 4. Note that whe H =, ǫ for HUCB3 reues to /, whih is exatly the same rate as i UCB3. We ow provie a upper bou o the istataeous regret of HUCB3 i Theorem 4. The proof of the followig theorem is provie as as a appeix ue to spae ostraits. The overall iea of the proof is the same as the orrespoig proof for HUCB3. The two ifferees i our proof are the availability of histori ata while applyig oetratio iequalities a the alterate efiitio of ǫ as propose i Algorithm 4. Algorithm 4 HUCB3 Parameters > a < < 1 Defie a sequee for eah arm: ǫ := 1/K for K/ a ǫ := Ke H 1 1+ for > K/. At iteratio, let = argmax Xh,T 1. Play arm with probability 1 K =1 ǫ. Play arm with probability ǫ. Theorem 3 For ay K/, where 1, HUCB3 satisfies, P[I = ] 1 1 +o. K exp H 1+ The followig orollary gives a upper bou o the expete umber of pulls of ay sub-optimal arm. It is obtaie by summig the istataeous regrets for arm give i Theorem 3. Corollary 4 HUCB3 amits the followig bou for ay sub-optimal arm, for ay > K/, E[T ] K log exp H 1+ +O1. K exp H To see how the above bou hages with histori ata, suppose H = log /K, the E[T ] = O1. This agai shows that a logarithmi amout of histori ata suffies to ahieve ostat regret. It is alsoeasyto see from the proofoftheorem 3 that these aitive terms go to zero expoetially with H a H thus showigthattheregretapproaheszeroasthe umber of histori observatios approahes ifiity.

5 Paaga Shivaswamy, Thorste Joahims 4.3 HUCBV: Exploitig Sample Variae Our fial algorithm is base o a reet versio of the UCB algorithm whih also iorporates the sample variae of the rewars [, 3]. I its most basi form, the UCBV algorithm is as show i Algorithm 5. Auibert et al. [] show that a value of θ = 1. is eough for logarithmi overgee. The expete regret of the UCBV algorithm was show to be upper boue by 1 :µ <µ σ / + log. The avatage of UCBV over algorithms that o ot iorporate the sample variae is that the regret bou for UCBV ivolves σ / istea of 1/. The variae σ a be substatially smaller tha 1. our hoie of u esures that, σ + /E t s+h σ + /E u +H + 3Et s+h + 3E u +H σ + 8σ σ +. 6 Cosier the probability i the first term i 5, Algorithm 5 UCBV At timet play the arm that maximizes θv, logt X, + + 3θlogt. The histori versio of the UCBV algorithm is summarize i Algorithm 6. We will ow erive a upper bou o its regret. Algorithm 6 HUCBV At time t play the arm that maximizes B,Tt 1,t with B,s,t = X,s h + θv,s h logt+h s+h + 3θlogt+H s+h. Theorem 5 For θ = 1., HUCBV satisfies, E[T ] 1+v +O1 where v is efie as: v := max { 8 σ E eotes θlog+h. + E H, }. 4 Proof We start with iequality 8 from [3] whih hols for ay iteger u > 1: E[T ] u + + t=u +K 1 t=u +K 1 s=u P[B,s,t > µ ] P[ s : 1 s t 1 s.t. B,s,t µ ] 5 Our hoie of u is the smallest iteger greater tha v efie i 4. Followig [3], for u s t a t, P[B,s,t > µ ] P[ X,s h + V,s h Et + 3Et > µ + ] s+h s+h P[ X,s h σ + + /E t + 3Et > µ + ] s+h s+h +P[V h,s σ + /] P[ X h,s µ > /] +P[V h,s σ + /] e s+h /8σ +4 /3. I the above, the seo step follows from 6. I the last step, Berstei s iequality has bee use twie a the extra term H i the expoet is a result of havig histori ata for arm. Summig the above upper bous from s = u to t 1 a usig the fat that 1 e x x/3 for x 3/4 gives, 4σ P[B,s,t > µ ] + e 4 E s=u Now, osier the last term i 5, usig Theorem 1 empirial Berstei bou of [3], it a be upper boue by, 3 t=u +1 βe t,t, where, βx,t := if 1<α 3 mi e x/α. Therefore, we a write t, logt logα the upper bou o E[T ] as, E[T ] 1+max + 4σ + 4 { 8 σ e E + + E H, t=u +1 βe t,t For the hoie, θ = 1., e E i the thir term above }

6 Multi-arme Bait Problems with History beomes, +H Now osier the last term: t=u +1 mi t=3 O1+ O1+ βet,t βet,t t, t=4 t=4 t=3 logt log 1.1 e θlogt+h /1.1 logt log1.1 e 1.logt+H /1.1 logt/log1.1 = O1. t+h 1.9 I the seo step, we replae ifimum over a rage to a speifi value i the rage. I the thir step, we use the fat that logt/logα < t for t 4 a α = 1.1. I the last step, we use the fat that logt t+h is a overget series; it is easy to 1.9 verify this fat by the itegral test. I the ase of HUCBV, E[T ] = O1 whe = exp H /9.6σ / +/ H. Thus, with logarithmi amout of histori ata, the regret is ostat oe agai. It a agai be see from the proof that the aitive terms approah zero as H a H approah ifiity. I pratie, the performae of HUCBV is sigifiatly better ompare to the other versios of the algorithms that we have propose. This will be a reurrig theme i our experimets. 5 Experimets Experimets were oute o a large-sale realworl ataset [1] otaiig about.4 millio istaes. Eah istae orrespos to a URL a has more tha 3. millio features assoiate with it. The label of a istae iiates whether the URL is maliious or ot. Five ifferet SVM lassifiers were traie usig a subset of twety thousa examples. The ifferet SVMs orrespoe to ifferet C parameter values whih trae-off betwee margi a slak variables i SVM. Preitios were the obtaie o all the remaiig istaes for all the five lassifiers. The istaes use i traiig were ot use i the rest of the experimets. The five lassifiers were the use as the arms of a multi-arme bait problem. The rewar was simply oe whe the preitio of the lassifier mathe the true URL reputatio label a zero whe it i ot. The best arm iffere from the seo best by about.8. Whereas the best arm iffere from the worst by :=.55. We show per-rou regret expresse as a fratio of i.e. R / i our results. Note thatthese valueswereestimatefromabout.4millio examples. All the experimets i this setio were performe by rawig raom samples from this populatio. Hee the above values eote true values for the uerlyig istributio from whih examples were raw. Baselies Obviously, the origial UCB algorithms a the NAIVE strategy Setio 3 are baselies i our experimets. However, we also osiere three other stroger strategies alle BUCB1, BUCBV a BUCB3. These stroger strategies BUCB aot be ru with arbitrary histori ata a were ilue merely for a wier perspetive. These strategies were as follows. I ay experimet, if there were H histori examples for all the arms together, the orrespoig UCB algorithms were ru for extra H rous at the start but the regret aumulate i these first H rous wassimply igore. Note that the arms pulle i the first H iteratios of the BUCB strategies are ompletely etermie by the uerlyig UCB algorithm. I otrast, our algorithms for histori ata a have arbitrary history for ay subset of arms. It is possible to argue that BUCB strategies have higher regret ompare to HUCB algorithms. Suppose, UCB1 3 is ru for + H iteratios, the the umber of pulls of the sub-optimal arm is Ol+ H/. The umber of pulls i the first H steps is OlH/. The worst possible seario is whe ΘlH/ pullsaremaeithe firsth steps. Thus igorig the pulls of arm i the first H steps woul give Ol+H/ lh/ pulls. I otrast, E[T ] for HUCB1 is ofthe orerol+h / H. This shows that the upper bou for our algorithms are muh better eve though these baselie strategies are stroger tha ompletely igorig history. Our experimets ofirm this fiig. While the regret bous we prove for the three algorithms presribe what parameters to use, these parameter hoies are ofte very oservative sie the bous hol for ay istributio. We therefore osiere variats of the propose algorithms where the trae-off betwee exploratio a exploitatio is tue empirially. I the ase of UCB1, HUCB1, UCBV a HUCBV, we put a weight θ o the ofiee iterval; i the ase of UCB3 a HUCB3, the parameter was always set at.8; however the parameter was tue. To tue the values of these parameters, UCB1, UCB3, a UCBV were ru times where the rewars ame from a raom raw of istaes eah time. The parameters orrespoig to the smallest average regret from these rus were fixe 3 We a show similar results for UCB3 a UCBV.

7 Paaga Shivaswamy, Thorste Joahims NAIVE UCB1 HUCB1 BUCB NAIVE UCBV HUCBV BUCBV NAIVE UCB3 HUCB3 BUCB R.4 R.4 R Figure 1: R / vs iteratios with 4 historial examples per-arm. fortherestoftheexperimets. ForUCB1,θ wasetermie to be.. I the ase of UCB3, the parameter was fou to be.3. Fially, i the ase of UCBV, θ was equal to.4. For our propose algorithms e.g. HUCB1 a for the baselies above e.g. BUCB1, we simply use the same value of parametersfou for the orrespoig base algorithm e.g. UCB How oes history affet the regret? The aim of the first experimet was to stuy the behavior of regret i the presee of histori ata. The total amout of histori ata was fixe at, uiformly split ito 4 per arm. The algorithms were the ru o istaes a the per-rou regret was ote after eah iteratio for eah algorithm. The experimets were repeate times by raomly seletig the istaes. A ifferet set of histori ata was selete for eah ru. The results R / a errorbars ofthis experimet areshowifigure1. Examiigtheimpatofhistorial ata o the regret, we see that all algorithms that exploit histori ata iee outperform their outerparts. This experimet shows how a omparably small amout of histori ata a help ahieve a substatial improvemet i regret. As expete, the NAIVE algorithm performs poorly whereas, HUCBV has the best performae amog all the algorithms. It a also be see that the HUCB algorithms perform slightly better tha the orrespoig BUCB strategies for large i the ase of HUB3 a HUCBV. 5. How oes regret hage with the amout of history? I this experimet, the amout of histori ata is varietostuy itseffet othe regret. Thesetup isaalogous to the previous experimets a agai the histori ata is split uiformly amog the arms. Per-rou regret is measure after 5, iteratios. The results of this experimet are show i Figure. The regret at 5, iteratios for UCB1, UCB3, a UCBV is show as a baselie. As the amout of histori ata ireases, the regret ereases as expete. Over most amouts of histori ata, HUCB1, HUCB3, a HUCBV outperform their ovetioal outerparts. We oe agai see a small improvemet over BUCB strategies as well. BUCB3 has slightly better performae over HUCB3 with a large amout of history at 5, iteratios. This is ue to the fat that UCB3 algorithms take a loger time to overge a fat that a be verifie from Figure 1 as well ue to ostat rates i the begiig. For large amout of histori ata, the NAIVE algorithm a reliably pik the best arm usig oly the histori ata. However, whe the amout of history is small, the regret from the aive strategy is sigifiatly higher whe ompare to our algorithms. 5.3 How oes the istributio of histori ata affet regret? The fial experimet was esige to stuy the effet of ubalae amouts of histori ata per arm. Sie the bous we erive i this paper showe that the umber of times a arm is pulle epes o H a H, we fixe the umber of istaes at 4 for the four o-optimal arms i.e. H = 4 whe. The umber of istaes for the optimal arm H was the varie i steps. The BUCB baselies have a avatage i this experimet sie we aot efore a istributio of histori ata over the arms i that ase the algorithms eie whih rewars are reveale to them. The results of this experimet are show i Figure 3. We show the behavior of the algorithms at 5, iteratios. 4 Whe H is large, the sub-optimal arms are uer sample a they te to be pulle more ofte i the begiig. This a be see by almost flat urves for HUCBV a HUCB3 a by a irease i regret i the ase of HUCB1 for larger H values. Obviously, the aive algorithm has the opposite behavior 4 After a large umber of rous e.g. 1 5 there was harly ay ifferee i regret for ifferet H values.

8 Multi-arme Bait Problems with History UCB1 HUCB1 BUCB UCBV HUCBV BUCBV UCB3 HUCB3 BUCB R.4 R.4 R Amout of history Amout of history Amout of history Figure : R / vs the amout of history at 5, iteratios UCBV HUCBV BUCBV.35.3 UCB3 HUCB3 BUCB R. R. R UCB1 HUCB1 BUCB Amout of H * Amout of H * Amout of H * Figure 3: R / vs the amout of H at 5, iteratios. ompare to our algorithms sie the higher H, the more likely it is to hoose the best arm. Amog the three algorithms, HUCB1 seems to be the most sesitive with respet to ubalae history. 6 Disussio logt As we poite out i Setio 4.1, the aive way of iorporatig history is to have logt+h i the ofiee iterval rather tha our hoie of logt+h. We also poite out that the regret bous a be sigifiatly better for our hoie of ofiee iterval ompare to the aive hoie. This leas to a itriguig possibility for the multi-arme bait problems with o histori ata. If we losely examie UCB algorithms UCB1 for istae, the ofiee iterval there is. A atural questio is whether it is possible to replae t isie the logarithm suh that the per-arm history urig a ru of UCB1 is better iorporate? A algorithm of this ki will better exploit the per-arm history urig a ru. Proposig a formal ofiee iterval a provig rigorous upper bous i this ase seem like iterestig iretios of researh to pursue. 7 Colusios We propose three ovel algorithms to exploit histori ata i stohasti multi-arme bait problems. The algorithms themselves have o otrol over the histori ata or o the arms have to be sample uiformly. Logarithmi fiite-time regret bous were erive for eah of the three propose algorithms. The bous showe that alreay a logarithmi amout of histori ata a lea to ostat regret with our algorithms. Experimets were oute o a large-sale ataset. The experimets valiate our theory a showe that eve a little histori ata a make a sigifiat ifferee i terms of regret. Overall, HUCBV has the best performae amog all the algorithms. A properly tue HUCB3 ofte performs better tha HUCB1. A future iretio i this lie of researh is to erive algorithms that a exploit histori ata also for other stohasti bait settigs, suh as baits with a otiuum of arms, uelig baits, et. While we oly showe upper bous o the performae of the propose algorithms i this paper, a atural ext step is to also prove lower bous for bait problems with histori ata. Akowlegmets We thak Bobby Kleiberg for helpful isussios. This work was fue i part uer NSF awar IIS

9 Paaga Shivaswamy, Thorste Joahims Referees [1] M. Athoy a P.L. Bartlett. Neural Network Learig: Theoretial Fouatios. Cambrige Uiversity Press, Cambrige, UK, [] J.-Y. Auibert, R. Muos, a Cs. Szepesvári. Tuig bait algorithms i stohasti eviromets. I ALT, pages Spriger, 7. [3] J.-Y. Auibert, R. Muos, a Cs. Szepesvári. Variae estimates a exploratio futio i multi-arme bait. Researh report 7-31, Certis - Eole es Pots, 7. [4] P. Auer, N. Cesa-Biahi, Y. Freu, a R. Shapire. The o-stohasti multi-arme bait problem. SIAM Joural o Computig, 31:48 77,. [5] Peter Auer, Niolò Cesa-Biahi, a Paul Fisher. Fiite-time aalysis of the multiarme bait problem. Mahie Learig, 47-3:35 56,. [6] P. Boresso a C.-E. Suberg. Simple approximatios of the error futio qx for ommuiatios appliatios. IEEE Trasatios o Commuiatios, 7: , [7] Robert D. Kleiberg, Alexaru Niulesu-Mizil, a Yogeshwer Sharma. Regret bous for sleepig experts a baits. I COLT, pages , 8. [8] T.L. Lai a H. Robbis. Asymptotially effiiet aaptive alloatio rules. Avaes i Applie Mathematis, 6:4, [9] Joh Lagfor, Alexaer Strehl, a Jeifer Wortma. Exploratio savegig. I ICML, 8. [1] Justi Ma, Lawree K. Saul, Stefa Savage, a Geoffrey M. Voelker. Ietifyig suspiious urls: A appliatio of large-sale olie learig. I ICML, 9.

10 A Appeix: Proof of Theorem 3 Proof Oly a sketh of this proof showig the ifferees with the orrespoig steps i a similar erivatio for UCB3 are give. The probability that the arm is hose at time t is give by: Moreover, K P[I = ] = ǫ +1 ǫ P[ X,T h X h 1,T ] 1 =1 P[ X,T X T ] P[ X,T h µ + ]+P[ X,T h µ ]. 1 Deotig 1 ǫ t by x, it a be show that the first term above is upper boue by, P[ X h,t µ + ] x P[T R x ]+ e e x / H /, where, weget the extra fatorexp H /from a appliatioofhoeffig s iequality iorporatigthe histori ata a T R is the umber of times arm is selete at raom i the first raws. Sie for all we a replae exp H / with exp H /. It a further be show that: usig Berstei s iequality. Fially, we a lower bou, x as follows: x = 1 ǫ t = 1 K log P[T R x ] e x /5, 3 1 K + 1 ek t= K +1 K e H / 1+t e H / 1+e K e H / Usig 1,, 3 a 4, it a be show that: P[I = ] K e H / 1+ + P 1 1 log + P 4 + P 1 log P 1 P. 4 e H / + P 4 e H / 5 1

11 where P := K e H / K e H / Thus, for 1, the last four terms i 5 are o 1 sie < 1.

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

Factors of sums of powers of binomial coefficients

Factors of sums of powers of binomial coefficients ACTA ARITHMETICA LXXXVI.1 (1998) Factors of sums of powers of biomial coefficiets by Neil J. Cali (Clemso, S.C.) Dedicated to the memory of Paul Erdős 1. Itroductio. It is well ow that if ( ) a f,a = the

More information

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval

More information

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits

More information

3. Greatest Common Divisor - Least Common Multiple

3. Greatest Common Divisor - Least Common Multiple 3 Greatest Commo Divisor - Least Commo Multiple Defiitio 31: The greatest commo divisor of two atural umbers a ad b is the largest atural umber c which divides both a ad b We deote the greatest commo gcd

More information

Irreducible polynomials with consecutive zero coefficients

Irreducible polynomials with consecutive zero coefficients Irreducible polyomials with cosecutive zero coefficiets Theodoulos Garefalakis Departmet of Mathematics, Uiversity of Crete, 71409 Heraklio, Greece Abstract Let q be a prime power. We cosider the problem

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling Taig DCOP to the Real World: Efficiet Complete Solutios for Distributed Multi-Evet Schedulig Rajiv T. Maheswara, Milid Tambe, Emma Bowrig, Joatha P. Pearce, ad Pradeep araatham Uiversity of Souther Califoria

More information

Convexity, Inequalities, and Norms

Convexity, Inequalities, and Norms Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for

More information

Laws of Exponents. net effect is to multiply with 2 a total of 3 + 5 = 8 times

Laws of Exponents. net effect is to multiply with 2 a total of 3 + 5 = 8 times The Mathematis 11 Competey Test Laws of Expoets (i) multipliatio of two powers: multiply by five times 3 x = ( x x ) x ( x x x x ) = 8 multiply by three times et effet is to multiply with a total of 3

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return EVALUATING ALTERNATIVE CAPITAL INVESTMENT PROGRAMS By Ke D. Duft, Extesio Ecoomist I the March 98 issue of this publicatio we reviewed the procedure by which a capital ivestmet project was assessed. The

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

A Capacity Supply Model for Virtualized Servers

A Capacity Supply Model for Virtualized Servers 96 Iformatia Eoomiă vol. 3, o. 3/009 A apaity upply Model for Virtualized ervers Alexader PINNOW, tefa OTERBURG Otto-vo-Guerike-Uiversity, Magdeburg, Germay {alexader.piow stefa.osterburg}@iti.s.ui-magdeburg.de

More information

ABSTRACT INTRODUCTION MATERIALS AND METHODS

ABSTRACT INTRODUCTION MATERIALS AND METHODS INTENATIONAL JOUNAL OF AGICULTUE & BIOLOGY 156 853/6/8 1 5 9 http://www.fspublishers.org Multiplate Peetratio Tests to Predit Soil Pressure-siage Behaviour uder etagular egio M. ASHIDI 1, A. KEYHANI AND

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Estimating Probability Distributions by Observing Betting Practices

Estimating Probability Distributions by Observing Betting Practices 5th Iteratioal Symposium o Imprecise Probability: Theories ad Applicatios, Prague, Czech Republic, 007 Estimatig Probability Distributios by Observig Bettig Practices Dr C Lych Natioal Uiversity of Irelad,

More information

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

Factoring x n 1: cyclotomic and Aurifeuillian polynomials Paul Garrett <garrett@math.umn.edu>

Factoring x n 1: cyclotomic and Aurifeuillian polynomials Paul Garrett <garrett@math.umn.edu> (March 16, 004) Factorig x 1: cyclotomic ad Aurifeuillia polyomials Paul Garrett Polyomials of the form x 1, x 3 1, x 4 1 have at least oe systematic factorizatio x 1 = (x 1)(x 1

More information

1. MATHEMATICAL INDUCTION

1. MATHEMATICAL INDUCTION 1. MATHEMATICAL INDUCTION EXAMPLE 1: Prove that for ay iteger 1. Proof: 1 + 2 + 3 +... + ( + 1 2 (1.1 STEP 1: For 1 (1.1 is true, sice 1 1(1 + 1. 2 STEP 2: Suppose (1.1 is true for some k 1, that is 1

More information

Chapter 5 O A Cojecture Of Erdíos Proceedigs NCUR VIII è1994è, Vol II, pp 794í798 Jeærey F Gold Departmet of Mathematics, Departmet of Physics Uiversity of Utah Do H Tucker Departmet of Mathematics Uiversity

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis Ruig Time ( 3.) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

Math C067 Sampling Distributions

Math C067 Sampling Distributions Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters

More information

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? JÖRG JAHNEL 1. My Motivatio Some Sort of a Itroductio Last term I tought Topological Groups at the Göttige Georg August Uiversity. This

More information

CHAPTER 3 THE TIME VALUE OF MONEY

CHAPTER 3 THE TIME VALUE OF MONEY CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar i the had today is worth more tha a dollar to be received i the future because, if you had it ow, you could ivest that dollar ad ear iterest. Of all

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

Lecture 2: Karger s Min Cut Algorithm

Lecture 2: Karger s Min Cut Algorithm priceto uiv. F 3 cos 5: Advaced Algorithm Desig Lecture : Karger s Mi Cut Algorithm Lecturer: Sajeev Arora Scribe:Sajeev Today s topic is simple but gorgeous: Karger s mi cut algorithm ad its extesio.

More information

U.C. Berkeley CS270: Algorithms Lecture 9 Professor Vazirani and Professor Rao Last revised. Lecture 9

U.C. Berkeley CS270: Algorithms Lecture 9 Professor Vazirani and Professor Rao Last revised. Lecture 9 U.C. Berkeley CS270: Algorithms Lecture 9 Professor Vazirai a Professor Rao Scribe: Aupam Last revise Lecture 9 1 Sparse cuts a Cheeger s iequality Cosier the problem of partitioig a give graph G(V, E

More information

Lecture 4: Cheeger s Inequality

Lecture 4: Cheeger s Inequality Spectral Graph Theory ad Applicatios WS 0/0 Lecture 4: Cheeger s Iequality Lecturer: Thomas Sauerwald & He Su Statemet of Cheeger s Iequality I this lecture we assume for simplicity that G is a d-regular

More information

FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10

FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10 FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10 [C] Commuicatio Measuremet A1. Solve problems that ivolve liear measuremet, usig: SI ad imperial uits of measure estimatio strategies measuremet strategies.

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006 Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam

More information

Basic Elements of Arithmetic Sequences and Series

Basic Elements of Arithmetic Sequences and Series MA40S PRE-CALCULUS UNIT G GEOMETRIC SEQUENCES CLASS NOTES (COMPLETED NO NEED TO COPY NOTES FROM OVERHEAD) Basic Elemets of Arithmetic Sequeces ad Series Objective: To establish basic elemets of arithmetic

More information

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics

More information

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments Project Deliverables CS 361, Lecture 28 Jared Saia Uiversity of New Mexico Each Group should tur i oe group project cosistig of: About 6-12 pages of text (ca be loger with appedix) 6-12 figures (please

More information

ANNUITIES SOFTWARE ASSIGNMENT TABLE OF CONTENTS... 1 ANNUITIES SOFTWARE ASSIGNMENT... 2 WHAT IS AN ANNUITY?... 2 EXAMPLE 1... 2 QUESTIONS...

ANNUITIES SOFTWARE ASSIGNMENT TABLE OF CONTENTS... 1 ANNUITIES SOFTWARE ASSIGNMENT... 2 WHAT IS AN ANNUITY?... 2 EXAMPLE 1... 2 QUESTIONS... ANNUITIES SOFTWARE ASSIGNMENT TABLE OF CONTENTS ANNUITIES SOFTWARE ASSIGNMENT TABLE OF CONTENTS... 1 ANNUITIES SOFTWARE ASSIGNMENT... WHAT IS AN ANNUITY?... EXAMPLE 1... QUESTIONS... EXAMPLE BRANDON S

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

5 Boolean Decision Trees (February 11)

5 Boolean Decision Trees (February 11) 5 Boolea Decisio Trees (February 11) 5.1 Graph Coectivity Suppose we are give a udirected graph G, represeted as a boolea adjacecy matrix = (a ij ), where a ij = 1 if ad oly if vertices i ad j are coected

More information

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations CS3A Hadout 3 Witer 00 February, 00 Solvig Recurrece Relatios Itroductio A wide variety of recurrece problems occur i models. Some of these recurrece relatios ca be solved usig iteratio or some other ad

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

CS103X: Discrete Structures Homework 4 Solutions

CS103X: Discrete Structures Homework 4 Solutions CS103X: Discrete Structures Homewor 4 Solutios Due February 22, 2008 Exercise 1 10 poits. Silico Valley questios: a How may possible six-figure salaries i whole dollar amouts are there that cotai at least

More information

Section 8.3 : De Moivre s Theorem and Applications

Section 8.3 : De Moivre s Theorem and Applications The Sectio 8 : De Moivre s Theorem ad Applicatios Let z 1 ad z be complex umbers, where z 1 = r 1, z = r, arg(z 1 ) = θ 1, arg(z ) = θ z 1 = r 1 (cos θ 1 + i si θ 1 ) z = r (cos θ + i si θ ) ad z 1 z =

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics Chair for Network Architectures ad Services Istitute of Iformatics TU Müche Prof. Carle Network Security Chapter 2 Basics 2.4 Radom Number Geeratio for Cryptographic Protocols Motivatio It is crucial to

More information

SOLID MECHANICS DYNAMICS TUTORIAL DAMPED VIBRATIONS. On completion of this tutorial you should be able to do the following.

SOLID MECHANICS DYNAMICS TUTORIAL DAMPED VIBRATIONS. On completion of this tutorial you should be able to do the following. SOLID MECHANICS DYNAMICS TUTORIAL DAMPED VIBRATIONS This work overs elemets of the syllabus for the Egieerig Couil Eam D5 Dyamis of Mehaial Systems, C05 Mehaial ad Strutural Egieerig ad the Edeel HNC/D

More information

BINOMIAL EXPANSIONS 12.5. In this section. Some Examples. Obtaining the Coefficients

BINOMIAL EXPANSIONS 12.5. In this section. Some Examples. Obtaining the Coefficients 652 (12-26) Chapter 12 Sequeces ad Series 12.5 BINOMIAL EXPANSIONS I this sectio Some Examples Otaiig the Coefficiets The Biomial Theorem I Chapter 5 you leared how to square a iomial. I this sectio you

More information

Listing terms of a finite sequence List all of the terms of each finite sequence. a) a n n 2 for 1 n 5 1 b) a n for 1 n 4 n 2

Listing terms of a finite sequence List all of the terms of each finite sequence. a) a n n 2 for 1 n 5 1 b) a n for 1 n 4 n 2 74 (4 ) Chapter 4 Sequeces ad Series 4. SEQUENCES I this sectio Defiitio Fidig a Formula for the th Term The word sequece is a familiar word. We may speak of a sequece of evets or say that somethig is

More information

Cooley-Tukey. Tukey FFT Algorithms. FFT Algorithms. Cooley

Cooley-Tukey. Tukey FFT Algorithms. FFT Algorithms. Cooley Cooley Cooley-Tuey Tuey FFT Algorithms FFT Algorithms Cosider a legth- sequece x[ with a -poit DFT X[ where Represet the idices ad as +, +, Cooley Cooley-Tuey Tuey FFT Algorithms FFT Algorithms Usig these

More information

Chapter 14 Nonparametric Statistics

Chapter 14 Nonparametric Statistics Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they

More information

Universal coding for classes of sources

Universal coding for classes of sources Coexios module: m46228 Uiversal codig for classes of sources Dever Greee This work is produced by The Coexios Project ad licesed uder the Creative Commos Attributio Licese We have discussed several parametric

More information

Solving Logarithms and Exponential Equations

Solving Logarithms and Exponential Equations Solvig Logarithms ad Epoetial Equatios Logarithmic Equatios There are two major ideas required whe solvig Logarithmic Equatios. The first is the Defiitio of a Logarithm. You may recall from a earlier topic:

More information

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009) 18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the

More information

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is 0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values

More information

A Faster Clause-Shortening Algorithm for SAT with No Restriction on Clause Length

A Faster Clause-Shortening Algorithm for SAT with No Restriction on Clause Length Joural o Satisfiability, Boolea Modelig ad Computatio 1 2005) 49-60 A Faster Clause-Shorteig Algorithm for SAT with No Restrictio o Clause Legth Evgey Datsi Alexader Wolpert Departmet of Computer Sciece

More information

2-3 The Remainder and Factor Theorems

2-3 The Remainder and Factor Theorems - The Remaider ad Factor Theorems Factor each polyomial completely usig the give factor ad log divisio 1 x + x x 60; x + So, x + x x 60 = (x + )(x x 15) Factorig the quadratic expressio yields x + x x

More information

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio

More information

Perfect Packing Theorems and the Average-Case Behavior of Optimal and Online Bin Packing

Perfect Packing Theorems and the Average-Case Behavior of Optimal and Online Bin Packing SIAM REVIEW Vol. 44, No. 1, pp. 95 108 c 2002 Society for Idustrial ad Applied Mathematics Perfect Packig Theorems ad the Average-Case Behavior of Optimal ad Olie Bi Packig E. G. Coffma, Jr. C. Courcoubetis

More information

Building Blocks Problem Related to Harmonic Series

Building Blocks Problem Related to Harmonic Series TMME, vol3, o, p.76 Buildig Blocks Problem Related to Harmoic Series Yutaka Nishiyama Osaka Uiversity of Ecoomics, Japa Abstract: I this discussio I give a eplaatio of the divergece ad covergece of ifiite

More information

THE ARITHMETIC OF INTEGERS. - multiplication, exponentiation, division, addition, and subtraction

THE ARITHMETIC OF INTEGERS. - multiplication, exponentiation, division, addition, and subtraction THE ARITHMETIC OF INTEGERS - multiplicatio, expoetiatio, divisio, additio, ad subtractio What to do ad what ot to do. THE INTEGERS Recall that a iteger is oe of the whole umbers, which may be either positive,

More information

I. Why is there a time value to money (TVM)?

I. Why is there a time value to money (TVM)? Itroductio to the Time Value of Moey Lecture Outlie I. Why is there the cocept of time value? II. Sigle cash flows over multiple periods III. Groups of cash flows IV. Warigs o doig time value calculatios

More information

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection The aalysis of the Courot oligopoly model cosiderig the subjective motive i the strategy selectio Shigehito Furuyama Teruhisa Nakai Departmet of Systems Maagemet Egieerig Faculty of Egieerig Kasai Uiversity

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

Lecture 5: Span, linear independence, bases, and dimension

Lecture 5: Span, linear independence, bases, and dimension Lecture 5: Spa, liear idepedece, bases, ad dimesio Travis Schedler Thurs, Sep 23, 2010 (versio: 9/21 9:55 PM) 1 Motivatio Motivatio To uderstad what it meas that R has dimesio oe, R 2 dimesio 2, etc.;

More information

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here). BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

Swelling and Mechanical Properties of Hydrogels Composed of. Binary Blends of Inter-linked ph-responsive Microgel Particles

Swelling and Mechanical Properties of Hydrogels Composed of. Binary Blends of Inter-linked ph-responsive Microgel Particles Eletroi Supplemetry Mteril (ESI) for Soft Mtter. This jourl is The Royl Soiety of Chemistry 205 SUPPLEMENTARY INFRMATIN Swellig Mehil Properties of Hyrogels Compose of Biry Bles of Iter-like ph-resposive

More information

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test) No-Parametric ivariate Statistics: Wilcoxo-Ma-Whitey 2 Sample Test 1 Ma-Whitey 2 Sample Test (a.k.a. Wilcoxo Rak Sum Test) The (Wilcoxo-) Ma-Whitey (WMW) test is the o-parametric equivalet of a pooled

More information

Queueing Analysis of Patient Flow in Hospital

Queueing Analysis of Patient Flow in Hospital IOSR Joural of Mathematis (IOSR-JM) e-issn: 2278-5728, p-issn: 2319-765X. Volume 10, Issue 4 Ver. VI (Jul-Aug. 2014), PP 47-53 Queueig Aalysis of Patiet Flow i Hospital Olorusola S. A, Adeleke R. A ad

More information

Research Article Sign Data Derivative Recovery

Research Article Sign Data Derivative Recovery Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 63070, 7 pages doi:0.540/0/63070 Research Article Sig Data Derivative Recovery L. M. Housto, G. A. Glass, ad A. D. Dymikov

More information

NEW HIGH PERFORMANCE COMPUTATIONAL METHODS FOR MORTGAGES AND ANNUITIES. Yuri Shestopaloff,

NEW HIGH PERFORMANCE COMPUTATIONAL METHODS FOR MORTGAGES AND ANNUITIES. Yuri Shestopaloff, NEW HIGH PERFORMNCE COMPUTTIONL METHODS FOR MORTGGES ND NNUITIES Yuri Shestopaloff, Geerally, mortgage ad auity equatios do ot have aalytical solutios for ukow iterest rate, which has to be foud usig umerical

More information

SEQUENCES AND SERIES CHAPTER

SEQUENCES AND SERIES CHAPTER CHAPTER SEQUENCES AND SERIES Whe the Grat family purchased a computer for $,200 o a istallmet pla, they agreed to pay $00 each moth util the cost of the computer plus iterest had bee paid The iterest each

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

Section 11.3: The Integral Test

Section 11.3: The Integral Test Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult

More information

Elementary Theory of Russian Roulette

Elementary Theory of Russian Roulette Elemetary Theory of Russia Roulette -iterestig patters of fractios- Satoshi Hashiba Daisuke Miematsu Ryohei Miyadera Itroductio. Today we are goig to study mathematical theory of Russia roulette. If some

More information

CHAPTER 3 DIGITAL CODING OF SIGNALS

CHAPTER 3 DIGITAL CODING OF SIGNALS CHAPTER 3 DIGITAL CODING OF SIGNALS Computers are ofte used to automate the recordig of measuremets. The trasducers ad sigal coditioig circuits produce a voltage sigal that is proportioal to a quatity

More information

Simple Annuities Present Value.

Simple Annuities Present Value. Simple Auities Preset Value. OBJECTIVES (i) To uderstad the uderlyig priciple of a preset value auity. (ii) To use a CASIO CFX-9850GB PLUS to efficietly compute values associated with preset value auities.

More information

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5

More information

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the

More information

Chatpun Khamyat Department of Industrial Engineering, Kasetsart University, Bangkok, Thailand ocpky@hotmail.com

Chatpun Khamyat Department of Industrial Engineering, Kasetsart University, Bangkok, Thailand ocpky@hotmail.com SOLVING THE OIL DELIVERY TRUCKS ROUTING PROBLEM WITH MODIFY MULTI-TRAVELING SALESMAN PROBLEM APPROACH CASE STUDY: THE SME'S OIL LOGISTIC COMPANY IN BANGKOK THAILAND Chatpu Khamyat Departmet of Idustrial

More information

Class Meeting # 16: The Fourier Transform on R n

Class Meeting # 16: The Fourier Transform on R n MATH 18.152 COUSE NOTES - CLASS MEETING # 16 18.152 Itroductio to PDEs, Fall 2011 Professor: Jared Speck Class Meetig # 16: The Fourier Trasform o 1. Itroductio to the Fourier Trasform Earlier i the course,

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information