Regularized Distance Metric Learning: Theory and Algorithm

Size: px
Start display at page:

Download "Regularized Distance Metric Learning: Theory and Algorithm"

Transcription

1 Regularized Distace Metric Learig: Theory ad Algorithm Rog Ji 1 Shiju Wag 2 Yag Zhou 1 1 Dept. of Computer Sciece & Egieerig, Michiga State Uiversity, East Lasig, MI Radiology ad Imagig Scieces, Natioal Istitutes of Health, Bethesda, MD rogji@cse.msu.edu wagshi@cc.ih.gov zhouyag@msu.edu Abstract I this paper, we examie the geeralizatio error of regularized distace metric learig. We show that with appropriate costraits, the geeralizatio error of regularized distace metric learig could be idepedet from the dimesioality, makig it suitable for hadlig high dimesioal data. I additio, we preset a efficiet olie learig algorithm for regularized distace metric learig. Our empirical studies with data classificatio ad face recogitio show that the proposed algorithm is (i) effective for distace metric learig whe compared to the state-of-the-art methods, ad (ii) efficiet ad robust for high dimesioal data. 1 Itroductio Distace metric learig is a fudametal problem i machie learig ad patter recogitio. It is critical to may real-world applicatios, such as iformatio retrieval, classificatio, ad clusterig. Numerous algorithms have bee proposed ad examied for distace metric learig. They are usually classified ito two categories: usupervised metric learig ad supervised metric learig. Usupervised distace metric learig, or sometimes referred to as maifold learig, aims to lear a uderlyig low-dimesioal maifold where the distace betwee most pairs of data poits are preserved. Example algorithms i this category iclude ISOMAP [10] ad Local Liear Embeddig (LLE) [6]. Supervised metric learig attempts to lear distace metrics from side iformatio such as labeled istaces ad pairwise costraits. It searches for the optimal distace metric that (a) keeps data poits of the same classes close, ad (b) keeps data poits from differet classes far apart. Example algorithms i this category iclude [13, 8, 12, 5, 11, 15, 4]. I this work, we focus o supervised distace metric learig. Although a large umber of studies were devoted to supervised distace metric learig (see the survey i [14] ad refereces therei), few studies address the geeralizatio error of distace metric learig. I this paper, we examie the geeralizatio error for regularized distace metric learig. Followig the idea of stability aalysis [1], we show that with appropriate costraits, the geeralizatio error of regularized distace metric learig is idepedet from the dimesioality of data, makig it suitable for hadlig high dimesioal data. I additio, we preset a olie learig algorithm for regularized distace metric learig, ad show its regret boud. Note that although olie metric learig was studied i [7], our approach is advatageous i that (a) it is computatioally more efficiet i hadlig the costrait of SDP coe, ad (b) it has a proved regret boud while [7] oly shows a mistake boud for the datasets that ca be separated by a Mahalaobis distace. To verify the efficacy ad efficiecy of the proposed algorithm for regularized distace metric learig, we coduct experimets with data classificatio ad face recogitio. Our empirical results show that the proposed olie algorithm is (1) effective for metric learig compared to the state-of-the-art methods, ad (2) robust ad efficiet for high dimesioal data. 1

2 2 Regularized Distace Metric Learig Let D = {z i = (x i, y i ), i = 1,..., } deote the labeled examples, where x k = (x 1 k,..., xd k ) Rd is a vector of d dimesio ad y i {1, 2,..., m} is class label. I our study, we assume that the orm of ay example is upper bouded by R, i.e., sup x x 2 R. Let A S d d + be the distace metric to be leared, where the distace betwee two data poits x ad x is calculated as x x 2 A = (x x ) A(x x ). Followig the idea of maximum margi classifiers, we have the followig framework for regularized distace metric learig: 1 mi A 2 A 2 F + 2C g ( [ y i,j 1 xi x j 2 ]) A : A 0, tr(a) η(d) (1) ( 1) where i<j y i,j is derived from class labels y i ad y j, i.e., y i,j = 1 if y i = y j ad 1 otherwise. g(z) is the loss fuctio. It outputs a small value whe z is a large positive value, ad a large value whe z is large egative. We assume g(z) to be covex ad Lipschitz cotiuous with Lipschitz costat L. A 2 F is the regularizer that measures the complexity of the distace metric A. tr(a) η(d) is itroduced to esure a bouded domai for A. As will be revealed later, this costrait will become active oly whe the costrait costat η(d) is subliear i d, i.e., η O(d p ) with p < 1. We will also show how this costrait could affect the geeralizatio error of distace metric learig. 3 Geeralizatio Error Let A D be the distace metric leared by the algorithm i (1) from the traiig examples D. Let I D (A) deote the empirical loss, i.e., 2 I D (A) = g ( [ y i,j 1 xi x j 2 ]) A (2) ( 1) i<j For the coveiece of presetatio, we also write g ( y i,j (1 x i x j 2 A )) = V (A, z i, z j ) to highlight its depedece o A ad two examples z i ad z j. We deote by I(A) the loss of A over the true distributio, i.e., I(A) = E (zi,z j)[v (A, z i, z j )] (3) Give the empirical loss I D (A) ad the loss over the true distributio I(A), we defie the estimatio error as D D = I(A D ) I D (A D ) (4) I order to show the behavior of estimatio error, we follow the aalysis based o the stability of the algorithm [1]. The uiform stability of a algorithm determies the stability of the algorithm whe oe of the traiig examples is replaced with aother. More specifically, a algorithm A has uiform stability β if (D, z), i, sup V (A D, u, v) V (A D z,i, u, v) β (5) u,v where D z,i stads for the ew traiig set that is obtaied by replacig z j D with a ew example z. We further defie β = κ/ as the uiform stability β behaves like O(1/). The advatage of usig stability aalysis for the geeralizatio error of regularized distace metric learig. This is because the example pair (z i, z j ) used for traiig distace metrics are ot I.I.D. although z i is, makig it difficult to directly utilize the results from statistical learig theory. I the aalysis below, we first show how to derive the geeralizatio error boud for regularized distace metric learig give the uiform stability β (or κ). We the derive the uiform stability costat for the regularized distace metric learig framework i (1). 2

3 3.1 Geeralizatio Error Boud for Give Uiform Stability Aalysis i this sectio follows closely [1], ad we therefore omit the detailed proofs. Our aalysis utilizes the McDiarmid iequality that is stated as follows. Theorem 1. (McDiarmid Iequality) Give radom variables {v i } l i=1, v i, ad a fuctio F : vl R satisfyig sup F (v 1,..., v l ) F (v 1,..., v i 1, v i, v i+1,..., v l ) c i, v 1,...,v l,v i the followig statemet holds Pr ( F (v 1,..., v l ) E(F (v 1,..., v l )) > ɛ) 2 exp 2ɛ P l i=1 c2 i! To use the McDiarmid iequality, we first compute E(D D ). Lemma 1. Give a distace metric learig algorithm A has uiform stability κ/, we have the followig iequality for E(D D ) E(D D ) 2 κ (6) where is the umber of traiig examples i D. The result i the followig lemma shows that the coditio i McDiarmid iequality holds. Lemma 2. Let D be a collectio of radomly selected traiig examples, ad D i,z be the collectio of examples that replaces z i i D with example z. We have D D D D i,z bouded as follows D D D D i,z 2κ + 8Lη(d) + 2g 0 where g 0 = sup z,z V (0, z, z ) measures the largest loss whe distace metric A is 0. (7) Combiig the results i Lemma 1 ad 2, we ca ow derive the the boud for the geeralizatio error by usig the McDiarmid iequality. Theorem 2. Let D deote a collectio of radomly selected traiig examples, ad A D be the distace metric leared by the algorithm i (1) whose uiform stability is κ/. With probability 1 δ, we have the followig boud for I(A D ) I(A D ) I D (A D ) 2κ + (2κ + 4Lη(d) + 2g 0) l(2/δ) 2 (8) 3.2 Geeralizatio Error for Regularized Distace Metric Learig First, we show that the superium of tr(a D ) is O(d 1/2 ), which verifies that η(d) should behave subliear i d. This is summarized by the followig propositio. Propositio 1. The trace costrait i (1) will be activated oly whe where g 0 = sup z,z V (0, z, z ). η(d) 2dg 0 C (9) Proof. It follows directly from [tr(a D )/d] 2 A D 2 F 2C sup z,z V (0, z, z ) Cg 0. To boud the uiform stability, we eed the followig propositio Propositio 2. For ay two distace metrics A ad A, we have the followig iequality hold for ay examples z u ad z v V (A, z u, z v ) V (A, z u, z v ) 4LR 2 A A F (10) 3

4 The above propositio follows directly from the fact that (a) V (A, z, z ) is Lipschitz cotiuous ad (b) x 2 R for ay example x. The followig lemma bouds A D A D F. Lemma 3. Let D deote a collectio of radomly selected traiig examples, ad by z = (x, y) a radomly selected example. Let A D be the distace metric leared by the algorithm i (1). We have A D A D i,z F 8CLR2 (11) The proof of the above lemma ca be foud i Appedix A. By puttig the results i Lemma 3 ad Propositio 2, we have the followig theorem for the stability of the Frobeius orm based regularizer. Theorem 3. The uiform stability for the algorithm i (1) usig the Frobeius orm regularizer, deoted by β, is bouded as follows where κ = 32CL 2 R 4 β = κ 32CL2 R 4 Combig Theorem 3 ad 2, we have the followig theorem for the geeralizatio error of distace metric learig algorithm i (1) usig the Frobeius orm regularizer Theorem 4. Let D be a collectio of radomly selected examples, ad A D be the distace metric leared by the algorithm i (1) with h(a) = A 2 F. With probability 1 δ, we have the followig boud for the true loss fuctio I(A D ) where A D is leared from (1) usig the Frobeius orm regularizer (12) I(A D ) I D (A D ) 32CL2 R 4 where s(d) = mi ( 2dg 0 C, η(d) ). + ( 32CL 2 R 4 + 4Ls(d) + 2g 0 ) l(2/δ) 2 (13) Remark The most importat feature i the estimatio error is that it coverges i the order of O(s(d)/ ). By choosig η(d) to have a low depedece of d (i.e., η(d) d p with p 1), the proposed framework for regularized distace metric learig will be robust to the high dimesioal data. I the extreme case, by settig η(d) to be a costat, the estimatio error will be idepedet from the dimesioality of data. 4 Algorithm I this sectio, we discuss a efficiet algorithm for solvig (1). We assume a hige loss for g(z), i.e., g(z) = max(0, b z), where b is the classificatio margi. To desig a olie learig algorithm for regularized distace metric learig, we follow the theory of gradiet based olie learig [2] by defiig potetial fuctio Φ(A) = A 2 F /2. Algorithm 1 shows the olie learig algorithm. The theorem below shows the regret boud for the olie learig algorithm i Figure 1. Theorem 5. Let the olie learig algorithm 1 ru with learig rate λ > 0 o a sequece (x t, x t), y t, t = 1,...,. Assume x 2 R for all the traiig examples. The, for all distace metric M S+ d d, we have ( 1 L 1 8R 4 L (M) + 1 ) λ/b 2λ M 2 F where Ł (M) = max ( 0, b y t (1 x t x t 2 M ) ), L = ( ) max 0, b y t (1 x t x t 2 A t 1 ) 4

5 Algorithm 1 Olie Learig Algorithm for Regularized Distace Metric Learig 1: INPUT: predefied learig rate λ 2: Iitialize A 0 = 0 3: for t = 1,..., T do 4: Receive a pair of traiig examples {(x 1 t, y 1 t ), (x 2 t, y 2 t )} 5: Compute the class label y t : y t = +1 if y 1 t = y 2 t, ad y t = 1 otherwise. 6: if the traiig pair (x 1 t, x 2 t ), y t is classified correctly, i.e., y t ( 1 x 1 t x 2 t 2 A t 1 ) > 0 the 7: A t = A t 1. 8: else 9: A t = π S+ (A t 1 λy t (x t x t)(x t x t) ), where π S+ (M) projects matrix M ito the SDP coe. 10: ed if 11: ed for The proof of this theorem ca be foud i Appedix B. Note that the above olie learig algorithm require computig π S+ (M), i.e., projectig matrix M oto the SDP coe, which is expesive for high dimesioal data. To address this challege, first otice that M = π S+ (M) is equivalet to the optimizatio problem M = arg mi M 0 M M F. We thus approximate A t = π S+ (A t 1 λy t (x t x t)(x t x t) ) with A t = A t 1 λ t y t (x t x t)(x t x t) where λ t is computed as follows λ t = arg mi { λt λ : λ t [0, λ], A t 1 λ t y t (x t x t)(x t x t) 0 } (14) λ t The followig theorem shows the solutio to the above optimizatio problem. Theorem 6. The optimal solutio λ t to the problem i (14) is expressed as { λ yt = 1 λ t = mi ( λ, [(x t x t) A 1 t 1 (x t x t)] 1) y t = +1 Proof of this theorem ca be foud i the supplemetary materials. Fially, the quatity (x t x t)a 1 t 1 (x t x t) ca be computed by solvig the followig optimizatio problem max u 2u (x t x t) u Au whose optimal value ca be computed efficietly usig the cojugate gradiet method [9]. Note that compared to the olie metric learig algorithm i [7], the proposed olie learig algorithm for metric learig is advatageous i that (i) it is computatioally more efficiet by avoidig projectig a matrix ito a SDP coe, ad (ii) it has a provable regret boud while [7] oly presets the mistake boud for the separable datasets. 5 Experimets We coducted a extesive study to verify both the efficiecy ad the efficacy of the proposed algorithms for metric learig. For the coveiece of discussio, we refer to the propoesd olie distace metric learig algorithm as olie-reg. To examie the efficacy of the leared distace metric, we employed the k Nearest Neighbor (k-nn) classifier. Our hypothesis is that the better the distace metric is, the higher the classificatio accuracy of k-nn will be. We set k = 3 for k-nn for all the experimets accordig to our experiece. We compare our algorithm to the followig six state-of-the-art algorithms for distace metric learig as baselies: (1) Euclidea distace metric; (2) Mahalaobis distace metric, which is computed as the iverse of covariace matrix of traiig samples, i.e., ( i=1 x ix i ) 1 ; (3) Xig s algorithm proposed i [13]; (4) LMNN, a distace metric learig algorithm based o the large margi earest eighbor classifier [12]; (5) ITML, a Iformatio-theoretic metric learig based o [4]; ad (6) Relevace Compoet Aalysis (RCA) [8]. We set the maximum umber of iteratios for Xig s method to be 10, 000. The umber of target eighbors i LMNN ad parameter γ i ITML 5

6 Table 1: Classificatio error (%) of a k-nn (k = 3) classifier o the te UCI data sets usig seve differet metrics. Stadard deviatio is icluded. Dataset Eclidea Mahala Xig LMNN ITML RCA Olie-reg ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± 0.4 Table 2: p-values of the Wilcoxo siged-rak test of the 7 methods o the 9 datasets. Methods Eclidea Mahala Xig LMNN ITML RCA Olie-reg Euclidea Mahala Xig LMNN ITML RCA Olie-reg were tued by cross validatio over the rage from 10 4 to All the algorithms are implemeted ad ru usig Matlab. All the experimet are ru o a AMD Processor 2.8G machie, with 8GMB RAM ad Liux operatio system. 5.1 Experimet (I): Compariso to State-of-the-art Algorithms We coducted experimets of data classificatio over the followig ie datasets from UCI repository: (1) balace-scale, with 3 classes, 4 features, ad 625 istaces; (2) breast-cacer, with 2 classes, 10 features, ad 683 istace; (3) glass, with 6 classes, 9 features, ad 214 istaces; (4) iris, with 3 classes, 4 features, ad 150 istaces; (5) pima, with 2 classes, 8 features, ad 768 istaces; (6) segmetatio, with 7 classes, 19 features, ad 210 istaces; (7)wie, with 3 classes, 13 features, ad 178 istaces; (8) waveform, with 3 classes, 21 features, ad 5000 istaces; (9) optdigits, with 10 classes, 64 features, 3823 istaces. For all the datasets, we radomly select 50% samples for traiig, ad use the remaiig samples for testig. Table 1 shows the classificatio errors of all the metric learig methods over 9 datasets averaged over 10 rus, together with the stadard deviatio. We observe that the proposed metric learig algorithm deliver performace that comparable to the state-of-the-art methods. I particular, for almost all datasets, the classificatio accuracy of the proposed algorithm is close to that of LMNN, which has yielded overall the best performace amog six baselie algorithms. This is cosistet with the results of the other studies, which show LMNN is amog the most effective algorithms for distace metric learig. To further verify if the proposed method performs statistically better tha the baselie methods, we coduct statistical test by usig Wilcoxo siged-rak test [3]. The Wilcoxo siged-rak test is a o-parametric statistical hypothesis test for the comparisos of two related samples. It is kow to be safer tha the Studet s t-test because it does ot assume ormal distributios. From table 2, we fid that the regularized distace metric learig improves the classificatio accuracy sigificatly compared to Mahalaobis distace, Xig s method ad RCA at sigificat level 0.1. It performs slightly better tha ITML ad is comparable to LMNN. 6

7 1 att face 7000 att face Classificatio accuracy Euclidea 0.4 Mahalaobis 0.3 LMNN ITML 0.2 RCA Olie_reg Image resize ratio (a) Ruig time (secods) LMNN ITML RCA Olie_reg Image resize ratio (b) Figure 1: (a) Face recogitio accuracy of knn ad (b) ruig time of LMNN, ITML, RCA ad olie reg algorithms o the att-face dataset with varyig image sizes. 5.2 Experimet (II): Results for High Dimesioal Data To evaluate the depedece of the regularized metric learig algorithms o data dimesios, we tested it by the task of face recogitio. The AT&T face database 1 is used i our study. It cosists of grey images of faces from 40 distict subjects, with te pictures for each subject. For every subject, the images were take at differet times, with varied the lightig coditio ad differet facial expressios (ope/closed-eyes, smilig/ot-smilig) ad facial details (glasses/o-glasses). The origial size of each image is pixels, with 256 grey levels per pixel. To examie the sesitivity to data dimesioality, we vary the data dimesio (i.e., the size of images) by compressig the origial images ito size differet sizes with the image aspect ratio preserved. The image compressio is achieved by bicubic iterpolatio (the output pixel value is a weighted average of pixels i the earest 4-by-4 eighborhood). For each subject, we radomly spit its face images ito traiig set ad test set with ratio 4 : 6. A distace metric is leared from the collectio of traiig face images, ad is used by the knn classifier (k = 3) to predict the subject ID of the test images. We coduct each experimet 10 times, ad report the classificatio accuracy by averagig over 40 subjects ad 10 rus. Figure 1 (a) shows the average classificatio accuracy of the knn classifier usig differet distace metric learig algorithms. The ruig times of differet metric learig algorithms for the same dataset is show i Figure 1 (b). Note that we exclude Xig s method i compariso because its extremely log computatioal time. We observed that with icreasig image size (dimesios), the regularized distace metric learig algorithm yields stable performace, idicatig that the it is resiliet to high dimesioal data. I cotrast, for almost all the baselie methods except ITML, their performace varied sigificatly as the size of the iput image chaged. Although ITML yields stable performace with respect to differet size of images, its high computatioal cost (Figure 1), arisig from solvig a Bregma optimizatio problem i each iteratio, makes it usuitable for high-dimesioal data. 6 Coclusio I this paper, we aalyze the geeralizatio error of regularized distace metric learig. We show that with appropriate costrait, the regularized distace metric learig could be robust to high dimesioal data. We also preset efficiet learig algorithms for solvig the related optimizatio problems. Empirical studies with face recogitio ad data classificatio show the proposed approach is (i) robust ad efficiet for high dimesioal data, ad (ii) comparable to the state-of-theart approaches for distace learig. I the future, we pla to ivestigate differet regularizers ad their effect for distace metric learig

8 ACKNOWLEDGEMENTS The work was supported i part by the Natioal Sciece Foudatio (IIS ) ad the U. S. Army Research Laboratory ad the U. S. Army Research Office (W911NF ). Ay opiios, fidigs, ad coclusios or recommedatios expressed i this material are those of the authors ad do ot ecessarily reflect the views of NSF ad ARO. Appedix A: Proof of Lemma 3 Proof. We itroduce the Bregme divergece for the proof of this lemma. Give a covex fuctio of matrix ϕ(x), the Bregme divergece betwee two matrices A ad B is computed as follows: d ϕ (A, B) = ϕ(b) ϕ(a) tr ( ϕ(a) (B A) ) We defie covex fuctio N(X) ad V D (X) as follows: N(X) = X 2 F, V D (X) = 2 ( 1) V (X, z i, z j ) ad furthermore covex fuctio T D (X) = N(X) + CV D (X). We thus have d N (A D, A D i,z) + d N (A D i,z, A D ) d TD (A D, A D i,z) + d TD i,z (A D i,z, A D ) C = [V (A ( 1) D i,z, z i, z j ) V (A D i,z, z, z j ) + V (A D, z, z j ) V (A D, z i, z j )] j i 8CLR2 A D A D i,z F The first iequality follows from the fact that both N(X) ad V D (X) are covex i X. The secod step holds because matrix A D ad A D i,z miimize the objective fuctio T D (X) ad T D i,z(x), respectively, ad therefore (A D i,z A D ) T D (A D ) 0, (A D A D i,z) T D i,z(a D i,z) 0 Sice d N (A, B) = A B 2 F, we therefore have A D A D i,z 2 F 8CLR2 A D A D i,z F, which leads to the result i the lemma. Appedix B: Proof of Theorem 7 Proof. We deote by A t = A t 1 λy(x t x t)(x t x t) ad A t = π S+ (A t). Followig Theorem 11.1 ad Theorem 11.4 [2], we have L L (M) 1 λ D Φ (M, A 0) + 1 D Φ (A t 1, A λ t) where i<j D Φ (A, B) = 1 2 A B 2 F, Φ(A) = Φ (A) = 1 2 A 2 F Usig the relatio A t = A t 1 λy(x t x t)(x t x t) ad A 0 = 0, we have L L (M) 1 2λ M 2 F + 1 [ ] I y t (1 x t x 2λ t 2 A t 1 ) < 0 x t x t 4 By assumig x 2 R for ay traiig example, we have x t x t R 4. Sice [ ] I y t (1 x t x t 2 A t 1 ) < 0 x t x t 4 max(0, b y t (1 x t x t 2 A t 1 )) 16R4 b we thus have the result i the theorem = 16R4 b L 8

9 Refereces [1] Bousquet, Olivier, ad Adré Elisseeff. Stability ad geeralizatio. Joural of Machie Learig Research, 2: , March [2] Nicolo Cesa-Biachi ad Gabor Lugosi. Predictio, Learig, ad Games. Cambridge Uiversity Press, New York, NY, USA, [3] G.W. Corder ad D.I. Forema. Noparametric Statistics for No-Statisticias: A Step-by-Step Approach. New Jersey: Wiley, [4] J.V. Davis, B. Kulis, P. Jai, S. Sra, ad I.S. Dhillo. Iformatio-theoretic metric learig. I Proceedigs of the 24th iteratioal coferece o Machie Learig, [5] A. Globerso ad S. Roweis. Metric learig by collapsig classes. I Advaces i Neural Iformatio Processig Systems, [6] L. K. Saul ad S. T. Roweis. Thik globally, fit locally: Usupervised learig of low dimesioal maifolds. Joural of Machie Learig Research, 4, [7] Shai Shalev-Shwartz, Yoram Siger, ad Adrew Y. Ng. Olie ad batch learig of pseudometrics. I Proceedigs of the twety-first iteratioal coferece o Machie learig, pages , [8] N. Shetal, T. Hertz, D. Weishall, ad M. Pavel. Adjustmet learig ad relevat compoet aalysis. I Proceedigs of the Seveth Europea Coferece o Computer Visio, volume 4, pages , [9] Joatha R Shewchuk. A itroductio to the cojugate gradiet method without the agoizig pai. Techical report, Caregie Mello Uiversity, Pittsburgh, PA, USA, [10] J.B. Teebaum, V. de Silva, ad J. C. Lagford. A global geometric framework for oliear dimesioality reductio. Sciece, 290, [11] I.W. Tsag, P.M. Cheug, ad J.T. Kwok. Kerel relevat compoet aalysis for distace metric learig. I IEEE Iteratioal Joit Coferece o Neural Networks (IJCNN), [12] K. Weiberger, J. Blitzer, ad L. Saul. Distace metric learig for large margi earest eighbor classificatio. I Advaces i Neural Iformatio Processig Systems, [13] E.P. Xig, A.Y. Ng, M.I. Jorda, ad S. Russell. Distace metric learig, with applicatio to clusterig with side-iformatio. I Advaces i Neural Iformatio Processig Systems, [14] L. Yag ad R. Ji. Distace metric learig: A comprehesive survey. Michiga State Uiversity, Tech. Rep., [15] L. Yag, R. Ji, R. Sukthakar, ad Y. Liu. A efficiet algorithm for local distace metric learig. I the Proceedigs of the Twety-First Natioal Coferece o Artificial Itelligece Proceedigs (AAAI),

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

Plug-in martingales for testing exchangeability on-line

Plug-in martingales for testing exchangeability on-line Plug-i martigales for testig exchageability o-lie Valetia Fedorova, Alex Gammerma, Ilia Nouretdiov, ad Vladimir Vovk Computer Learig Research Cetre Royal Holloway, Uiversity of Lodo, UK {valetia,ilia,alex,vovk}@cs.rhul.ac.uk

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

LECTURE 13: Cross-validation

LECTURE 13: Cross-validation LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

THE ABRACADABRA PROBLEM

THE ABRACADABRA PROBLEM THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design A Combied Cotiuous/Biary Geetic Algorithm for Microstrip Atea Desig Rady L. Haupt The Pesylvaia State Uiversity Applied Research Laboratory P. O. Box 30 State College, PA 16804-0030 haupt@ieee.org Abstract:

More information

Convexity, Inequalities, and Norms

Convexity, Inequalities, and Norms Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for

More information

A Faster Clause-Shortening Algorithm for SAT with No Restriction on Clause Length

A Faster Clause-Shortening Algorithm for SAT with No Restriction on Clause Length Joural o Satisfiability, Boolea Modelig ad Computatio 1 2005) 49-60 A Faster Clause-Shorteig Algorithm for SAT with No Restrictio o Clause Legth Evgey Datsi Alexader Wolpert Departmet of Computer Sciece

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

Coordinating Principal Component Analyzers

Coordinating Principal Component Analyzers Coordiatig Pricipal Compoet Aalyzers J.J. Verbeek ad N. Vlassis ad B. Kröse Iformatics Istitute, Uiversity of Amsterdam Kruislaa 403, 1098 SJ Amsterdam, The Netherlads Abstract. Mixtures of Pricipal Compoet

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

DAME - Microsoft Excel add-in for solving multicriteria decision problems with scenarios Radomir Perzina 1, Jaroslav Ramik 2

DAME - Microsoft Excel add-in for solving multicriteria decision problems with scenarios Radomir Perzina 1, Jaroslav Ramik 2 Itroductio DAME - Microsoft Excel add-i for solvig multicriteria decisio problems with scearios Radomir Perzia, Jaroslav Ramik 2 Abstract. The mai goal of every ecoomic aget is to make a good decisio,

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis Ruig Time ( 3.) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection The aalysis of the Courot oligopoly model cosiderig the subjective motive i the strategy selectio Shigehito Furuyama Teruhisa Nakai Departmet of Systems Maagemet Egieerig Faculty of Egieerig Kasai Uiversity

More information

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009) 18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the

More information

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT Keywords: project maagemet, resource allocatio, etwork plaig Vladimir N Burkov, Dmitri A Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT The paper deals with the problems of resource allocatio betwee

More information

Research Article Sign Data Derivative Recovery

Research Article Sign Data Derivative Recovery Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 63070, 7 pages doi:0.540/0/63070 Research Article Sig Data Derivative Recovery L. M. Housto, G. A. Glass, ad A. D. Dymikov

More information

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

Totally Corrective Boosting Algorithms that Maximize the Margin

Totally Corrective Boosting Algorithms that Maximize the Margin Mafred K. Warmuth mafred@cse.ucsc.edu Ju Liao liaoju@cse.ucsc.edu Uiversity of Califoria at Sata Cruz, Sata Cruz, CA 95064, USA Guar Rätsch Guar.Raetsch@tuebige.mpg.de Friedrich Miescher Laboratory of

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Systems Design Project: Indoor Location of Wireless Devices

Systems Design Project: Indoor Location of Wireless Devices Systems Desig Project: Idoor Locatio of Wireless Devices Prepared By: Bria Murphy Seior Systems Sciece ad Egieerig Washigto Uiversity i St. Louis Phoe: (805) 698-5295 Email: bcm1@cec.wustl.edu Supervised

More information

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

On the Generalization Ability of Online Learning Algorithms for Pairwise Loss Functions

On the Generalization Ability of Online Learning Algorithms for Pairwise Loss Functions O the Geeraliatio Ability of Olie Learig Algorithms for Pairwise Loss Fuctios Purushottam Kar purushot@cse.iitk.ac.i Departmet of Computer Sciece ad Egieerig, Idia Istitute of Techology, Kapur, UP 208

More information

Lecture 2: Karger s Min Cut Algorithm

Lecture 2: Karger s Min Cut Algorithm priceto uiv. F 3 cos 5: Advaced Algorithm Desig Lecture : Karger s Mi Cut Algorithm Lecturer: Sajeev Arora Scribe:Sajeev Today s topic is simple but gorgeous: Karger s mi cut algorithm ad its extesio.

More information

Reliability Analysis in HPC clusters

Reliability Analysis in HPC clusters Reliability Aalysis i HPC clusters Narasimha Raju, Gottumukkala, Yuda Liu, Chokchai Box Leagsuksu 1, Raja Nassar, Stephe Scott 2 College of Egieerig & Sciece, Louisiaa ech Uiversity Oak Ridge Natioal Lab

More information

Automatic Tuning for FOREX Trading System Using Fuzzy Time Series

Automatic Tuning for FOREX Trading System Using Fuzzy Time Series utomatic Tuig for FOREX Tradig System Usig Fuzzy Time Series Kraimo Maeesilp ad Pitihate Soorasa bstract Efficiecy of the automatic currecy tradig system is time depedet due to usig fixed parameters which

More information

Chapter 14 Nonparametric Statistics

Chapter 14 Nonparametric Statistics Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they

More information

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling Taig DCOP to the Real World: Efficiet Complete Solutios for Distributed Multi-Evet Schedulig Rajiv T. Maheswara, Milid Tambe, Emma Bowrig, Joatha P. Pearce, ad Pradeep araatham Uiversity of Souther Califoria

More information

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

SUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE

SUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE SUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE By Guillaume Lecué CNRS, LAMA, Mare-la-vallée, 77454 Frace ad By Shahar Medelso Departmet of Mathematics,

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis

Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis Joural of Machie Learig Research 8 (2007) 1027-1061 Submitted 3/06; Revised 12/06; Published 5/07 Dimesioality Reductio of Multimodal Labeled Data by Local Fisher Discrimiat Aalysis Masashi Sugiyama Departmet

More information

Irreducible polynomials with consecutive zero coefficients

Irreducible polynomials with consecutive zero coefficients Irreducible polyomials with cosecutive zero coefficiets Theodoulos Garefalakis Departmet of Mathematics, Uiversity of Crete, 71409 Heraklio, Greece Abstract Let q be a prime power. We cosider the problem

More information

Review: Classification Outline

Review: Classification Outline Data Miig CS 341, Sprig 2007 Decisio Trees Neural etworks Review: Lecture 6: Classificatio issues, regressio, bayesia classificatio Pretice Hall 2 Data Miig Core Techiques Classificatio Clusterig Associatio

More information

Discriminative Models of Integrating Document Evidence and Document-Candidate Associations for Expert Search

Discriminative Models of Integrating Document Evidence and Document-Candidate Associations for Expert Search Discrimiative Models of Itegratig Documet Evidece ad Documet-Cadidate Associatios for Expert Search Yi Fag Departmet of Computer Sciece Purdue Uiversity West Lafayette, IN 47907, USA fagy@cs.purdue.edu

More information

arxiv:1506.08910v1 [stat.ml] 30 Jun 2015

arxiv:1506.08910v1 [stat.ml] 30 Jun 2015 Learig Sigle Idex Models i High Dimesios Ravi Gati, Nikhil Rao 2, Rebecca M. Willett 3 ad Robert Nowak 3 arxiv:506.0890v [stat.ml] 30 Ju 205 Wiscosi Istitutes for Discovery, 330 N Orchard St, Madiso, WI,

More information

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S CONTROL CHART FOR THE CHANGES IN A PROCESS Supraee Lisawadi Departmet of Mathematics ad Statistics, Faculty of Sciece ad Techoology, Thammasat

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

5 Boolean Decision Trees (February 11)

5 Boolean Decision Trees (February 11) 5 Boolea Decisio Trees (February 11) 5.1 Graph Coectivity Suppose we are give a udirected graph G, represeted as a boolea adjacecy matrix = (a ij ), where a ij = 1 if ad oly if vertices i ad j are coected

More information

TIGHT BOUNDS ON EXPECTED ORDER STATISTICS

TIGHT BOUNDS ON EXPECTED ORDER STATISTICS Probability i the Egieerig ad Iformatioal Scieces, 20, 2006, 667 686+ Prited i the U+S+A+ TIGHT BOUNDS ON EXPECTED ORDER STATISTICS DIMITRIS BERTSIMAS Sloa School of Maagemet ad Operatios Research Ceter

More information

Infinite Sequences and Series

Infinite Sequences and Series CHAPTER 4 Ifiite Sequeces ad Series 4.1. Sequeces A sequece is a ifiite ordered list of umbers, for example the sequece of odd positive itegers: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29...

More information

Chatpun Khamyat Department of Industrial Engineering, Kasetsart University, Bangkok, Thailand ocpky@hotmail.com

Chatpun Khamyat Department of Industrial Engineering, Kasetsart University, Bangkok, Thailand ocpky@hotmail.com SOLVING THE OIL DELIVERY TRUCKS ROUTING PROBLEM WITH MODIFY MULTI-TRAVELING SALESMAN PROBLEM APPROACH CASE STUDY: THE SME'S OIL LOGISTIC COMPANY IN BANGKOK THAILAND Chatpu Khamyat Departmet of Idustrial

More information

Class Meeting # 16: The Fourier Transform on R n

Class Meeting # 16: The Fourier Transform on R n MATH 18.152 COUSE NOTES - CLASS MEETING # 16 18.152 Itroductio to PDEs, Fall 2011 Professor: Jared Speck Class Meetig # 16: The Fourier Trasform o 1. Itroductio to the Fourier Trasform Earlier i the course,

More information

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling) Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact

More information

AMS 2000 subject classification. Primary 62G08, 62G20; secondary 62G99

AMS 2000 subject classification. Primary 62G08, 62G20; secondary 62G99 VARIABLE SELECTION IN NONPARAMETRIC ADDITIVE MODELS Jia Huag 1, Joel L. Horowitz 2 ad Fegrog Wei 3 1 Uiversity of Iowa, 2 Northwester Uiversity ad 3 Uiversity of West Georgia Abstract We cosider a oparametric

More information

Domain 1: Designing a SQL Server Instance and a Database Solution

Domain 1: Designing a SQL Server Instance and a Database Solution Maual SQL Server 2008 Desig, Optimize ad Maitai (70-450) 1-800-418-6789 Domai 1: Desigig a SQL Server Istace ad a Database Solutio Desigig for CPU, Memory ad Storage Capacity Requiremets Whe desigig a

More information

Research Article Crude Oil Price Prediction Based on a Dynamic Correcting Support Vector Regression Machine

Research Article Crude Oil Price Prediction Based on a Dynamic Correcting Support Vector Regression Machine Abstract ad Applied Aalysis Volume 2013, Article ID 528678, 7 pages http://dx.doi.org/10.1155/2013/528678 Research Article Crude Oil Price Predictio Based o a Dyamic Correctig Support Vector Regressio

More information

Application and research of fuzzy clustering analysis algorithm under micro-lecture English teaching mode

Application and research of fuzzy clustering analysis algorithm under micro-lecture English teaching mode SHS Web of Cofereces 25, shscof/20162501018 Applicatio ad research of fuzzy clusterig aalysis algorithm uder micro-lecture Eglish teachig mode Yig Shi, Wei Dog, Chuyi Lou & Ya Dig Qihuagdao Istitute of

More information

(VCP-310) 1-800-418-6789

(VCP-310) 1-800-418-6789 Maual VMware Lesso 1: Uderstadig the VMware Product Lie I this lesso, you will first lear what virtualizatio is. Next, you ll explore the products offered by VMware that provide virtualizatio services.

More information

Recovery time guaranteed heuristic routing for improving computation complexity in survivable WDM networks

Recovery time guaranteed heuristic routing for improving computation complexity in survivable WDM networks Computer Commuicatios 30 (2007) 1331 1336 wwwelseviercom/locate/comcom Recovery time guarateed heuristic routig for improvig computatio complexity i survivable WDM etworks Lei Guo * College of Iformatio

More information

NEW HIGH PERFORMANCE COMPUTATIONAL METHODS FOR MORTGAGES AND ANNUITIES. Yuri Shestopaloff,

NEW HIGH PERFORMANCE COMPUTATIONAL METHODS FOR MORTGAGES AND ANNUITIES. Yuri Shestopaloff, NEW HIGH PERFORMNCE COMPUTTIONL METHODS FOR MORTGGES ND NNUITIES Yuri Shestopaloff, Geerally, mortgage ad auity equatios do ot have aalytical solutios for ukow iterest rate, which has to be foud usig umerical

More information

An Efficient Polynomial Approximation of the Normal Distribution Function & Its Inverse Function

An Efficient Polynomial Approximation of the Normal Distribution Function & Its Inverse Function A Efficiet Polyomial Approximatio of the Normal Distributio Fuctio & Its Iverse Fuctio Wisto A. Richards, 1 Robi Atoie, * 1 Asho Sahai, ad 3 M. Raghuadh Acharya 1 Departmet of Mathematics & Computer Sciece;

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

Lecture 3. denote the orthogonal complement of S k. Then. 1 x S k. n. 2 x T Ax = ( ) λ x. with x = 1, we have. i = λ k x 2 = λ k.

Lecture 3. denote the orthogonal complement of S k. Then. 1 x S k. n. 2 x T Ax = ( ) λ x. with x = 1, we have. i = λ k x 2 = λ k. 18.409 A Algorithmist s Toolkit September 17, 009 Lecture 3 Lecturer: Joatha Keler Scribe: Adre Wibisoo 1 Outlie Today s lecture covers three mai parts: Courat-Fischer formula ad Rayleigh quotiets The

More information

Universal coding for classes of sources

Universal coding for classes of sources Coexios module: m46228 Uiversal codig for classes of sources Dever Greee This work is produced by The Coexios Project ad licesed uder the Creative Commos Attributio Licese We have discussed several parametric

More information

On Formula to Compute Primes. and the n th Prime

On Formula to Compute Primes. and the n th Prime Applied Mathematical cieces, Vol., 0, o., 35-35 O Formula to Compute Primes ad the th Prime Issam Kaddoura Lebaese Iteratioal Uiversity Faculty of Arts ad cieces, Lebao issam.kaddoura@liu.edu.lb amih Abdul-Nabi

More information

MARTINGALES AND A BASIC APPLICATION

MARTINGALES AND A BASIC APPLICATION MARTINGALES AND A BASIC APPLICATION TURNER SMITH Abstract. This paper will develop the measure-theoretic approach to probability i order to preset the defiitio of martigales. From there we will apply this

More information

Spam Detection. A Bayesian approach to filtering spam

Spam Detection. A Bayesian approach to filtering spam Spam Detectio A Bayesia approach to filterig spam Kual Mehrotra Shailedra Watave Abstract The ever icreasig meace of spam is brigig dow productivity. More tha 70% of the email messages are spam, ad it

More information

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is 0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values

More information

Swaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps

Swaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps Swaps: Costat maturity swaps (CMS) ad costat maturity reasury (CM) swaps A Costat Maturity Swap (CMS) swap is a swap where oe of the legs pays (respectively receives) a swap rate of a fixed maturity, while

More information

THIN SEQUENCES AND THE GRAM MATRIX PAMELA GORKIN, JOHN E. MCCARTHY, SANDRA POTT, AND BRETT D. WICK

THIN SEQUENCES AND THE GRAM MATRIX PAMELA GORKIN, JOHN E. MCCARTHY, SANDRA POTT, AND BRETT D. WICK THIN SEQUENCES AND THE GRAM MATRIX PAMELA GORKIN, JOHN E MCCARTHY, SANDRA POTT, AND BRETT D WICK Abstract We provide a ew proof of Volberg s Theorem characterizig thi iterpolatig sequeces as those for

More information

THE HEIGHT OF q-binary SEARCH TREES

THE HEIGHT OF q-binary SEARCH TREES THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average

More information

Finding the circle that best fits a set of points

Finding the circle that best fits a set of points Fidig the circle that best fits a set of poits L. MAISONOBE October 5 th 007 Cotets 1 Itroductio Solvig the problem.1 Priciples............................... Iitializatio.............................

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

Stochastic Online Scheduling with Precedence Constraints

Stochastic Online Scheduling with Precedence Constraints Stochastic Olie Schedulig with Precedece Costraits Nicole Megow Tark Vredeveld July 15, 2008 Abstract We cosider the preemptive ad o-preemptive problems of schedulig obs with precedece costraits o parallel

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL. Auities Uder Radom Rates of Iterest II By Abraham Zas Techio I.I.T. Haifa ISRAEL ad Haifa Uiversity Haifa ISRAEL Departmet of Mathematics, Techio - Israel Istitute of Techology, 3000, Haifa, Israel I memory

More information

A CUSUM TEST OF COMMON TRENDS IN LARGE HETEROGENEOUS PANELS

A CUSUM TEST OF COMMON TRENDS IN LARGE HETEROGENEOUS PANELS A CUSUM TEST OF COMMON TRENDS IN LARGE HETEROGENEOUS PANELS JAVIER HIDALGO AND JUNGYOON LEE A. This paper examies a oparametric CUSUM-type test for commo treds i large pael data sets with idividual fixed

More information

Evaluating Model for B2C E- commerce Enterprise Development Based on DEA

Evaluating Model for B2C E- commerce Enterprise Development Based on DEA , pp.180-184 http://dx.doi.org/10.14257/astl.2014.53.39 Evaluatig Model for B2C E- commerce Eterprise Developmet Based o DEA Weli Geg, Jig Ta Computer ad iformatio egieerig Istitute, Harbi Uiversity of

More information

Overview on S-Box Design Principles

Overview on S-Box Design Principles Overview o S-Box Desig Priciples Debdeep Mukhopadhyay Assistat Professor Departmet of Computer Sciece ad Egieerig Idia Istitute of Techology Kharagpur INDIA -721302 What is a S-Box? S-Boxes are Boolea

More information

Stackelberg Games for Adversarial Prediction Problems

Stackelberg Games for Adversarial Prediction Problems Stackelberg Games for Adversarial Predictio Problems Michael Brücker Departmet of Computer Sciece Uiversity of Potsdam, Germay mibrueck@cs.ui-potsdam.de Tobias Scheffer Departmet of Computer Sciece Uiversity

More information

Statistical Learning Theory

Statistical Learning Theory 1 / 130 Statistical Learig Theory Machie Learig Summer School, Kyoto, Japa Alexader (Sasha) Rakhli Uiversity of Pesylvaia, The Wharto School Pe Research i Machie Learig (PRiML) August 27-28, 2012 2 / 130

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions Chapter 5 Uit Aual Amout ad Gradiet Fuctios IET 350 Egieerig Ecoomics Learig Objectives Chapter 5 Upo completio of this chapter you should uderstad: Calculatig future values from aual amouts. Calculatig

More information

Perfect Packing Theorems and the Average-Case Behavior of Optimal and Online Bin Packing

Perfect Packing Theorems and the Average-Case Behavior of Optimal and Online Bin Packing SIAM REVIEW Vol. 44, No. 1, pp. 95 108 c 2002 Society for Idustrial ad Applied Mathematics Perfect Packig Theorems ad the Average-Case Behavior of Optimal ad Olie Bi Packig E. G. Coffma, Jr. C. Courcoubetis

More information

CHAPTER 3 THE TIME VALUE OF MONEY

CHAPTER 3 THE TIME VALUE OF MONEY CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar i the had today is worth more tha a dollar to be received i the future because, if you had it ow, you could ivest that dollar ad ear iterest. Of all

More information

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics Chair for Network Architectures ad Services Istitute of Iformatics TU Müche Prof. Carle Network Security Chapter 2 Basics 2.4 Radom Number Geeratio for Cryptographic Protocols Motivatio It is crucial to

More information

Chapter 5: Inner Product Spaces

Chapter 5: Inner Product Spaces Chapter 5: Ier Product Spaces Chapter 5: Ier Product Spaces SECION A Itroductio to Ier Product Spaces By the ed of this sectio you will be able to uderstad what is meat by a ier product space give examples

More information

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

, a Wishart distribution with n -1 degrees of freedom and scale matrix. UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that

More information

STUDENTS PARTICIPATION IN ONLINE LEARNING IN BUSINESS COURSES AT UNIVERSITAS TERBUKA, INDONESIA. Maya Maria, Universitas Terbuka, Indonesia

STUDENTS PARTICIPATION IN ONLINE LEARNING IN BUSINESS COURSES AT UNIVERSITAS TERBUKA, INDONESIA. Maya Maria, Universitas Terbuka, Indonesia STUDENTS PARTICIPATION IN ONLINE LEARNING IN BUSINESS COURSES AT UNIVERSITAS TERBUKA, INDONESIA Maya Maria, Uiversitas Terbuka, Idoesia Co-author: Amiuddi Zuhairi, Uiversitas Terbuka, Idoesia Kuria Edah

More information

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. Powers of a matrix We begi with a propositio which illustrates the usefuless of the diagoalizatio. Recall that a square matrix A is diogaalizable if

More information

1 The Gaussian channel

1 The Gaussian channel ECE 77 Lecture 0 The Gaussia chael Objective: I this lecture we will lear about commuicatio over a chael of practical iterest, i which the trasmitted sigal is subjected to additive white Gaussia oise.

More information

INVESTMENT PERFORMANCE COUNCIL (IPC)

INVESTMENT PERFORMANCE COUNCIL (IPC) INVESTMENT PEFOMANCE COUNCIL (IPC) INVITATION TO COMMENT: Global Ivestmet Performace Stadards (GIPS ) Guidace Statemet o Calculatio Methodology The Associatio for Ivestmet Maagemet ad esearch (AIM) seeks

More information

THE problem of fitting a circle to a collection of points

THE problem of fitting a circle to a collection of points IEEE TRANACTION ON INTRUMENTATION AND MEAUREMENT, VOL. XX, NO. Y, MONTH 000 A Few Methods for Fittig Circles to Data Dale Umbach, Kerry N. Joes Abstract Five methods are discussed to fit circles to data.

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy

More information