Regularized Distance Metric Learning: Theory and Algorithm
|
|
- Stephanie Curtis
- 8 years ago
- Views:
Transcription
1 Regularized Distace Metric Learig: Theory ad Algorithm Rog Ji 1 Shiju Wag 2 Yag Zhou 1 1 Dept. of Computer Sciece & Egieerig, Michiga State Uiversity, East Lasig, MI Radiology ad Imagig Scieces, Natioal Istitutes of Health, Bethesda, MD rogji@cse.msu.edu wagshi@cc.ih.gov zhouyag@msu.edu Abstract I this paper, we examie the geeralizatio error of regularized distace metric learig. We show that with appropriate costraits, the geeralizatio error of regularized distace metric learig could be idepedet from the dimesioality, makig it suitable for hadlig high dimesioal data. I additio, we preset a efficiet olie learig algorithm for regularized distace metric learig. Our empirical studies with data classificatio ad face recogitio show that the proposed algorithm is (i) effective for distace metric learig whe compared to the state-of-the-art methods, ad (ii) efficiet ad robust for high dimesioal data. 1 Itroductio Distace metric learig is a fudametal problem i machie learig ad patter recogitio. It is critical to may real-world applicatios, such as iformatio retrieval, classificatio, ad clusterig. Numerous algorithms have bee proposed ad examied for distace metric learig. They are usually classified ito two categories: usupervised metric learig ad supervised metric learig. Usupervised distace metric learig, or sometimes referred to as maifold learig, aims to lear a uderlyig low-dimesioal maifold where the distace betwee most pairs of data poits are preserved. Example algorithms i this category iclude ISOMAP [10] ad Local Liear Embeddig (LLE) [6]. Supervised metric learig attempts to lear distace metrics from side iformatio such as labeled istaces ad pairwise costraits. It searches for the optimal distace metric that (a) keeps data poits of the same classes close, ad (b) keeps data poits from differet classes far apart. Example algorithms i this category iclude [13, 8, 12, 5, 11, 15, 4]. I this work, we focus o supervised distace metric learig. Although a large umber of studies were devoted to supervised distace metric learig (see the survey i [14] ad refereces therei), few studies address the geeralizatio error of distace metric learig. I this paper, we examie the geeralizatio error for regularized distace metric learig. Followig the idea of stability aalysis [1], we show that with appropriate costraits, the geeralizatio error of regularized distace metric learig is idepedet from the dimesioality of data, makig it suitable for hadlig high dimesioal data. I additio, we preset a olie learig algorithm for regularized distace metric learig, ad show its regret boud. Note that although olie metric learig was studied i [7], our approach is advatageous i that (a) it is computatioally more efficiet i hadlig the costrait of SDP coe, ad (b) it has a proved regret boud while [7] oly shows a mistake boud for the datasets that ca be separated by a Mahalaobis distace. To verify the efficacy ad efficiecy of the proposed algorithm for regularized distace metric learig, we coduct experimets with data classificatio ad face recogitio. Our empirical results show that the proposed olie algorithm is (1) effective for metric learig compared to the state-of-the-art methods, ad (2) robust ad efficiet for high dimesioal data. 1
2 2 Regularized Distace Metric Learig Let D = {z i = (x i, y i ), i = 1,..., } deote the labeled examples, where x k = (x 1 k,..., xd k ) Rd is a vector of d dimesio ad y i {1, 2,..., m} is class label. I our study, we assume that the orm of ay example is upper bouded by R, i.e., sup x x 2 R. Let A S d d + be the distace metric to be leared, where the distace betwee two data poits x ad x is calculated as x x 2 A = (x x ) A(x x ). Followig the idea of maximum margi classifiers, we have the followig framework for regularized distace metric learig: 1 mi A 2 A 2 F + 2C g ( [ y i,j 1 xi x j 2 ]) A : A 0, tr(a) η(d) (1) ( 1) where i<j y i,j is derived from class labels y i ad y j, i.e., y i,j = 1 if y i = y j ad 1 otherwise. g(z) is the loss fuctio. It outputs a small value whe z is a large positive value, ad a large value whe z is large egative. We assume g(z) to be covex ad Lipschitz cotiuous with Lipschitz costat L. A 2 F is the regularizer that measures the complexity of the distace metric A. tr(a) η(d) is itroduced to esure a bouded domai for A. As will be revealed later, this costrait will become active oly whe the costrait costat η(d) is subliear i d, i.e., η O(d p ) with p < 1. We will also show how this costrait could affect the geeralizatio error of distace metric learig. 3 Geeralizatio Error Let A D be the distace metric leared by the algorithm i (1) from the traiig examples D. Let I D (A) deote the empirical loss, i.e., 2 I D (A) = g ( [ y i,j 1 xi x j 2 ]) A (2) ( 1) i<j For the coveiece of presetatio, we also write g ( y i,j (1 x i x j 2 A )) = V (A, z i, z j ) to highlight its depedece o A ad two examples z i ad z j. We deote by I(A) the loss of A over the true distributio, i.e., I(A) = E (zi,z j)[v (A, z i, z j )] (3) Give the empirical loss I D (A) ad the loss over the true distributio I(A), we defie the estimatio error as D D = I(A D ) I D (A D ) (4) I order to show the behavior of estimatio error, we follow the aalysis based o the stability of the algorithm [1]. The uiform stability of a algorithm determies the stability of the algorithm whe oe of the traiig examples is replaced with aother. More specifically, a algorithm A has uiform stability β if (D, z), i, sup V (A D, u, v) V (A D z,i, u, v) β (5) u,v where D z,i stads for the ew traiig set that is obtaied by replacig z j D with a ew example z. We further defie β = κ/ as the uiform stability β behaves like O(1/). The advatage of usig stability aalysis for the geeralizatio error of regularized distace metric learig. This is because the example pair (z i, z j ) used for traiig distace metrics are ot I.I.D. although z i is, makig it difficult to directly utilize the results from statistical learig theory. I the aalysis below, we first show how to derive the geeralizatio error boud for regularized distace metric learig give the uiform stability β (or κ). We the derive the uiform stability costat for the regularized distace metric learig framework i (1). 2
3 3.1 Geeralizatio Error Boud for Give Uiform Stability Aalysis i this sectio follows closely [1], ad we therefore omit the detailed proofs. Our aalysis utilizes the McDiarmid iequality that is stated as follows. Theorem 1. (McDiarmid Iequality) Give radom variables {v i } l i=1, v i, ad a fuctio F : vl R satisfyig sup F (v 1,..., v l ) F (v 1,..., v i 1, v i, v i+1,..., v l ) c i, v 1,...,v l,v i the followig statemet holds Pr ( F (v 1,..., v l ) E(F (v 1,..., v l )) > ɛ) 2 exp 2ɛ P l i=1 c2 i! To use the McDiarmid iequality, we first compute E(D D ). Lemma 1. Give a distace metric learig algorithm A has uiform stability κ/, we have the followig iequality for E(D D ) E(D D ) 2 κ (6) where is the umber of traiig examples i D. The result i the followig lemma shows that the coditio i McDiarmid iequality holds. Lemma 2. Let D be a collectio of radomly selected traiig examples, ad D i,z be the collectio of examples that replaces z i i D with example z. We have D D D D i,z bouded as follows D D D D i,z 2κ + 8Lη(d) + 2g 0 where g 0 = sup z,z V (0, z, z ) measures the largest loss whe distace metric A is 0. (7) Combiig the results i Lemma 1 ad 2, we ca ow derive the the boud for the geeralizatio error by usig the McDiarmid iequality. Theorem 2. Let D deote a collectio of radomly selected traiig examples, ad A D be the distace metric leared by the algorithm i (1) whose uiform stability is κ/. With probability 1 δ, we have the followig boud for I(A D ) I(A D ) I D (A D ) 2κ + (2κ + 4Lη(d) + 2g 0) l(2/δ) 2 (8) 3.2 Geeralizatio Error for Regularized Distace Metric Learig First, we show that the superium of tr(a D ) is O(d 1/2 ), which verifies that η(d) should behave subliear i d. This is summarized by the followig propositio. Propositio 1. The trace costrait i (1) will be activated oly whe where g 0 = sup z,z V (0, z, z ). η(d) 2dg 0 C (9) Proof. It follows directly from [tr(a D )/d] 2 A D 2 F 2C sup z,z V (0, z, z ) Cg 0. To boud the uiform stability, we eed the followig propositio Propositio 2. For ay two distace metrics A ad A, we have the followig iequality hold for ay examples z u ad z v V (A, z u, z v ) V (A, z u, z v ) 4LR 2 A A F (10) 3
4 The above propositio follows directly from the fact that (a) V (A, z, z ) is Lipschitz cotiuous ad (b) x 2 R for ay example x. The followig lemma bouds A D A D F. Lemma 3. Let D deote a collectio of radomly selected traiig examples, ad by z = (x, y) a radomly selected example. Let A D be the distace metric leared by the algorithm i (1). We have A D A D i,z F 8CLR2 (11) The proof of the above lemma ca be foud i Appedix A. By puttig the results i Lemma 3 ad Propositio 2, we have the followig theorem for the stability of the Frobeius orm based regularizer. Theorem 3. The uiform stability for the algorithm i (1) usig the Frobeius orm regularizer, deoted by β, is bouded as follows where κ = 32CL 2 R 4 β = κ 32CL2 R 4 Combig Theorem 3 ad 2, we have the followig theorem for the geeralizatio error of distace metric learig algorithm i (1) usig the Frobeius orm regularizer Theorem 4. Let D be a collectio of radomly selected examples, ad A D be the distace metric leared by the algorithm i (1) with h(a) = A 2 F. With probability 1 δ, we have the followig boud for the true loss fuctio I(A D ) where A D is leared from (1) usig the Frobeius orm regularizer (12) I(A D ) I D (A D ) 32CL2 R 4 where s(d) = mi ( 2dg 0 C, η(d) ). + ( 32CL 2 R 4 + 4Ls(d) + 2g 0 ) l(2/δ) 2 (13) Remark The most importat feature i the estimatio error is that it coverges i the order of O(s(d)/ ). By choosig η(d) to have a low depedece of d (i.e., η(d) d p with p 1), the proposed framework for regularized distace metric learig will be robust to the high dimesioal data. I the extreme case, by settig η(d) to be a costat, the estimatio error will be idepedet from the dimesioality of data. 4 Algorithm I this sectio, we discuss a efficiet algorithm for solvig (1). We assume a hige loss for g(z), i.e., g(z) = max(0, b z), where b is the classificatio margi. To desig a olie learig algorithm for regularized distace metric learig, we follow the theory of gradiet based olie learig [2] by defiig potetial fuctio Φ(A) = A 2 F /2. Algorithm 1 shows the olie learig algorithm. The theorem below shows the regret boud for the olie learig algorithm i Figure 1. Theorem 5. Let the olie learig algorithm 1 ru with learig rate λ > 0 o a sequece (x t, x t), y t, t = 1,...,. Assume x 2 R for all the traiig examples. The, for all distace metric M S+ d d, we have ( 1 L 1 8R 4 L (M) + 1 ) λ/b 2λ M 2 F where Ł (M) = max ( 0, b y t (1 x t x t 2 M ) ), L = ( ) max 0, b y t (1 x t x t 2 A t 1 ) 4
5 Algorithm 1 Olie Learig Algorithm for Regularized Distace Metric Learig 1: INPUT: predefied learig rate λ 2: Iitialize A 0 = 0 3: for t = 1,..., T do 4: Receive a pair of traiig examples {(x 1 t, y 1 t ), (x 2 t, y 2 t )} 5: Compute the class label y t : y t = +1 if y 1 t = y 2 t, ad y t = 1 otherwise. 6: if the traiig pair (x 1 t, x 2 t ), y t is classified correctly, i.e., y t ( 1 x 1 t x 2 t 2 A t 1 ) > 0 the 7: A t = A t 1. 8: else 9: A t = π S+ (A t 1 λy t (x t x t)(x t x t) ), where π S+ (M) projects matrix M ito the SDP coe. 10: ed if 11: ed for The proof of this theorem ca be foud i Appedix B. Note that the above olie learig algorithm require computig π S+ (M), i.e., projectig matrix M oto the SDP coe, which is expesive for high dimesioal data. To address this challege, first otice that M = π S+ (M) is equivalet to the optimizatio problem M = arg mi M 0 M M F. We thus approximate A t = π S+ (A t 1 λy t (x t x t)(x t x t) ) with A t = A t 1 λ t y t (x t x t)(x t x t) where λ t is computed as follows λ t = arg mi { λt λ : λ t [0, λ], A t 1 λ t y t (x t x t)(x t x t) 0 } (14) λ t The followig theorem shows the solutio to the above optimizatio problem. Theorem 6. The optimal solutio λ t to the problem i (14) is expressed as { λ yt = 1 λ t = mi ( λ, [(x t x t) A 1 t 1 (x t x t)] 1) y t = +1 Proof of this theorem ca be foud i the supplemetary materials. Fially, the quatity (x t x t)a 1 t 1 (x t x t) ca be computed by solvig the followig optimizatio problem max u 2u (x t x t) u Au whose optimal value ca be computed efficietly usig the cojugate gradiet method [9]. Note that compared to the olie metric learig algorithm i [7], the proposed olie learig algorithm for metric learig is advatageous i that (i) it is computatioally more efficiet by avoidig projectig a matrix ito a SDP coe, ad (ii) it has a provable regret boud while [7] oly presets the mistake boud for the separable datasets. 5 Experimets We coducted a extesive study to verify both the efficiecy ad the efficacy of the proposed algorithms for metric learig. For the coveiece of discussio, we refer to the propoesd olie distace metric learig algorithm as olie-reg. To examie the efficacy of the leared distace metric, we employed the k Nearest Neighbor (k-nn) classifier. Our hypothesis is that the better the distace metric is, the higher the classificatio accuracy of k-nn will be. We set k = 3 for k-nn for all the experimets accordig to our experiece. We compare our algorithm to the followig six state-of-the-art algorithms for distace metric learig as baselies: (1) Euclidea distace metric; (2) Mahalaobis distace metric, which is computed as the iverse of covariace matrix of traiig samples, i.e., ( i=1 x ix i ) 1 ; (3) Xig s algorithm proposed i [13]; (4) LMNN, a distace metric learig algorithm based o the large margi earest eighbor classifier [12]; (5) ITML, a Iformatio-theoretic metric learig based o [4]; ad (6) Relevace Compoet Aalysis (RCA) [8]. We set the maximum umber of iteratios for Xig s method to be 10, 000. The umber of target eighbors i LMNN ad parameter γ i ITML 5
6 Table 1: Classificatio error (%) of a k-nn (k = 3) classifier o the te UCI data sets usig seve differet metrics. Stadard deviatio is icluded. Dataset Eclidea Mahala Xig LMNN ITML RCA Olie-reg ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± 0.4 Table 2: p-values of the Wilcoxo siged-rak test of the 7 methods o the 9 datasets. Methods Eclidea Mahala Xig LMNN ITML RCA Olie-reg Euclidea Mahala Xig LMNN ITML RCA Olie-reg were tued by cross validatio over the rage from 10 4 to All the algorithms are implemeted ad ru usig Matlab. All the experimet are ru o a AMD Processor 2.8G machie, with 8GMB RAM ad Liux operatio system. 5.1 Experimet (I): Compariso to State-of-the-art Algorithms We coducted experimets of data classificatio over the followig ie datasets from UCI repository: (1) balace-scale, with 3 classes, 4 features, ad 625 istaces; (2) breast-cacer, with 2 classes, 10 features, ad 683 istace; (3) glass, with 6 classes, 9 features, ad 214 istaces; (4) iris, with 3 classes, 4 features, ad 150 istaces; (5) pima, with 2 classes, 8 features, ad 768 istaces; (6) segmetatio, with 7 classes, 19 features, ad 210 istaces; (7)wie, with 3 classes, 13 features, ad 178 istaces; (8) waveform, with 3 classes, 21 features, ad 5000 istaces; (9) optdigits, with 10 classes, 64 features, 3823 istaces. For all the datasets, we radomly select 50% samples for traiig, ad use the remaiig samples for testig. Table 1 shows the classificatio errors of all the metric learig methods over 9 datasets averaged over 10 rus, together with the stadard deviatio. We observe that the proposed metric learig algorithm deliver performace that comparable to the state-of-the-art methods. I particular, for almost all datasets, the classificatio accuracy of the proposed algorithm is close to that of LMNN, which has yielded overall the best performace amog six baselie algorithms. This is cosistet with the results of the other studies, which show LMNN is amog the most effective algorithms for distace metric learig. To further verify if the proposed method performs statistically better tha the baselie methods, we coduct statistical test by usig Wilcoxo siged-rak test [3]. The Wilcoxo siged-rak test is a o-parametric statistical hypothesis test for the comparisos of two related samples. It is kow to be safer tha the Studet s t-test because it does ot assume ormal distributios. From table 2, we fid that the regularized distace metric learig improves the classificatio accuracy sigificatly compared to Mahalaobis distace, Xig s method ad RCA at sigificat level 0.1. It performs slightly better tha ITML ad is comparable to LMNN. 6
7 1 att face 7000 att face Classificatio accuracy Euclidea 0.4 Mahalaobis 0.3 LMNN ITML 0.2 RCA Olie_reg Image resize ratio (a) Ruig time (secods) LMNN ITML RCA Olie_reg Image resize ratio (b) Figure 1: (a) Face recogitio accuracy of knn ad (b) ruig time of LMNN, ITML, RCA ad olie reg algorithms o the att-face dataset with varyig image sizes. 5.2 Experimet (II): Results for High Dimesioal Data To evaluate the depedece of the regularized metric learig algorithms o data dimesios, we tested it by the task of face recogitio. The AT&T face database 1 is used i our study. It cosists of grey images of faces from 40 distict subjects, with te pictures for each subject. For every subject, the images were take at differet times, with varied the lightig coditio ad differet facial expressios (ope/closed-eyes, smilig/ot-smilig) ad facial details (glasses/o-glasses). The origial size of each image is pixels, with 256 grey levels per pixel. To examie the sesitivity to data dimesioality, we vary the data dimesio (i.e., the size of images) by compressig the origial images ito size differet sizes with the image aspect ratio preserved. The image compressio is achieved by bicubic iterpolatio (the output pixel value is a weighted average of pixels i the earest 4-by-4 eighborhood). For each subject, we radomly spit its face images ito traiig set ad test set with ratio 4 : 6. A distace metric is leared from the collectio of traiig face images, ad is used by the knn classifier (k = 3) to predict the subject ID of the test images. We coduct each experimet 10 times, ad report the classificatio accuracy by averagig over 40 subjects ad 10 rus. Figure 1 (a) shows the average classificatio accuracy of the knn classifier usig differet distace metric learig algorithms. The ruig times of differet metric learig algorithms for the same dataset is show i Figure 1 (b). Note that we exclude Xig s method i compariso because its extremely log computatioal time. We observed that with icreasig image size (dimesios), the regularized distace metric learig algorithm yields stable performace, idicatig that the it is resiliet to high dimesioal data. I cotrast, for almost all the baselie methods except ITML, their performace varied sigificatly as the size of the iput image chaged. Although ITML yields stable performace with respect to differet size of images, its high computatioal cost (Figure 1), arisig from solvig a Bregma optimizatio problem i each iteratio, makes it usuitable for high-dimesioal data. 6 Coclusio I this paper, we aalyze the geeralizatio error of regularized distace metric learig. We show that with appropriate costrait, the regularized distace metric learig could be robust to high dimesioal data. We also preset efficiet learig algorithms for solvig the related optimizatio problems. Empirical studies with face recogitio ad data classificatio show the proposed approach is (i) robust ad efficiet for high dimesioal data, ad (ii) comparable to the state-of-theart approaches for distace learig. I the future, we pla to ivestigate differet regularizers ad their effect for distace metric learig
8 ACKNOWLEDGEMENTS The work was supported i part by the Natioal Sciece Foudatio (IIS ) ad the U. S. Army Research Laboratory ad the U. S. Army Research Office (W911NF ). Ay opiios, fidigs, ad coclusios or recommedatios expressed i this material are those of the authors ad do ot ecessarily reflect the views of NSF ad ARO. Appedix A: Proof of Lemma 3 Proof. We itroduce the Bregme divergece for the proof of this lemma. Give a covex fuctio of matrix ϕ(x), the Bregme divergece betwee two matrices A ad B is computed as follows: d ϕ (A, B) = ϕ(b) ϕ(a) tr ( ϕ(a) (B A) ) We defie covex fuctio N(X) ad V D (X) as follows: N(X) = X 2 F, V D (X) = 2 ( 1) V (X, z i, z j ) ad furthermore covex fuctio T D (X) = N(X) + CV D (X). We thus have d N (A D, A D i,z) + d N (A D i,z, A D ) d TD (A D, A D i,z) + d TD i,z (A D i,z, A D ) C = [V (A ( 1) D i,z, z i, z j ) V (A D i,z, z, z j ) + V (A D, z, z j ) V (A D, z i, z j )] j i 8CLR2 A D A D i,z F The first iequality follows from the fact that both N(X) ad V D (X) are covex i X. The secod step holds because matrix A D ad A D i,z miimize the objective fuctio T D (X) ad T D i,z(x), respectively, ad therefore (A D i,z A D ) T D (A D ) 0, (A D A D i,z) T D i,z(a D i,z) 0 Sice d N (A, B) = A B 2 F, we therefore have A D A D i,z 2 F 8CLR2 A D A D i,z F, which leads to the result i the lemma. Appedix B: Proof of Theorem 7 Proof. We deote by A t = A t 1 λy(x t x t)(x t x t) ad A t = π S+ (A t). Followig Theorem 11.1 ad Theorem 11.4 [2], we have L L (M) 1 λ D Φ (M, A 0) + 1 D Φ (A t 1, A λ t) where i<j D Φ (A, B) = 1 2 A B 2 F, Φ(A) = Φ (A) = 1 2 A 2 F Usig the relatio A t = A t 1 λy(x t x t)(x t x t) ad A 0 = 0, we have L L (M) 1 2λ M 2 F + 1 [ ] I y t (1 x t x 2λ t 2 A t 1 ) < 0 x t x t 4 By assumig x 2 R for ay traiig example, we have x t x t R 4. Sice [ ] I y t (1 x t x t 2 A t 1 ) < 0 x t x t 4 max(0, b y t (1 x t x t 2 A t 1 )) 16R4 b we thus have the result i the theorem = 16R4 b L 8
9 Refereces [1] Bousquet, Olivier, ad Adré Elisseeff. Stability ad geeralizatio. Joural of Machie Learig Research, 2: , March [2] Nicolo Cesa-Biachi ad Gabor Lugosi. Predictio, Learig, ad Games. Cambridge Uiversity Press, New York, NY, USA, [3] G.W. Corder ad D.I. Forema. Noparametric Statistics for No-Statisticias: A Step-by-Step Approach. New Jersey: Wiley, [4] J.V. Davis, B. Kulis, P. Jai, S. Sra, ad I.S. Dhillo. Iformatio-theoretic metric learig. I Proceedigs of the 24th iteratioal coferece o Machie Learig, [5] A. Globerso ad S. Roweis. Metric learig by collapsig classes. I Advaces i Neural Iformatio Processig Systems, [6] L. K. Saul ad S. T. Roweis. Thik globally, fit locally: Usupervised learig of low dimesioal maifolds. Joural of Machie Learig Research, 4, [7] Shai Shalev-Shwartz, Yoram Siger, ad Adrew Y. Ng. Olie ad batch learig of pseudometrics. I Proceedigs of the twety-first iteratioal coferece o Machie learig, pages , [8] N. Shetal, T. Hertz, D. Weishall, ad M. Pavel. Adjustmet learig ad relevat compoet aalysis. I Proceedigs of the Seveth Europea Coferece o Computer Visio, volume 4, pages , [9] Joatha R Shewchuk. A itroductio to the cojugate gradiet method without the agoizig pai. Techical report, Caregie Mello Uiversity, Pittsburgh, PA, USA, [10] J.B. Teebaum, V. de Silva, ad J. C. Lagford. A global geometric framework for oliear dimesioality reductio. Sciece, 290, [11] I.W. Tsag, P.M. Cheug, ad J.T. Kwok. Kerel relevat compoet aalysis for distace metric learig. I IEEE Iteratioal Joit Coferece o Neural Networks (IJCNN), [12] K. Weiberger, J. Blitzer, ad L. Saul. Distace metric learig for large margi earest eighbor classificatio. I Advaces i Neural Iformatio Processig Systems, [13] E.P. Xig, A.Y. Ng, M.I. Jorda, ad S. Russell. Distace metric learig, with applicatio to clusterig with side-iformatio. I Advaces i Neural Iformatio Processig Systems, [14] L. Yag ad R. Ji. Distace metric learig: A comprehesive survey. Michiga State Uiversity, Tech. Rep., [15] L. Yag, R. Ji, R. Sukthakar, ad Y. Liu. A efficiet algorithm for local distace metric learig. I the Proceedigs of the Twety-First Natioal Coferece o Artificial Itelligece Proceedigs (AAAI),
Modified Line Search Method for Global Optimization
Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o
More informationAsymptotic Growth of Functions
CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll
More informationPlug-in martingales for testing exchangeability on-line
Plug-i martigales for testig exchageability o-lie Valetia Fedorova, Alex Gammerma, Ilia Nouretdiov, ad Vladimir Vovk Computer Learig Research Cetre Royal Holloway, Uiversity of Lodo, UK {valetia,ilia,alex,vovk}@cs.rhul.ac.uk
More information0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5
Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.
More informationChapter 7 Methods of Finding Estimators
Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of
More informationA probabilistic proof of a binomial identity
A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two
More informationDepartment of Computer Science, University of Otago
Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly
More informationOutput Analysis (2, Chapters 10 &11 Law)
B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should
More informationLECTURE 13: Cross-validation
LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M
More informationSoving Recurrence Relations
Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree
More informationTHE ABRACADABRA PROBLEM
THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected
More informationIn nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008
I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces
More informationChapter 6: Variance, the law of large numbers and the Monte-Carlo method
Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value
More informationA Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design
A Combied Cotiuous/Biary Geetic Algorithm for Microstrip Atea Desig Rady L. Haupt The Pesylvaia State Uiversity Applied Research Laboratory P. O. Box 30 State College, PA 16804-0030 haupt@ieee.org Abstract:
More informationConvexity, Inequalities, and Norms
Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for
More informationA Faster Clause-Shortening Algorithm for SAT with No Restriction on Clause Length
Joural o Satisfiability, Boolea Modelig ad Computatio 1 2005) 49-60 A Faster Clause-Shorteig Algorithm for SAT with No Restrictio o Clause Legth Evgey Datsi Alexader Wolpert Departmet of Computer Sciece
More informationSequences and Series
CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their
More informationCoordinating Principal Component Analyzers
Coordiatig Pricipal Compoet Aalyzers J.J. Verbeek ad N. Vlassis ad B. Kröse Iformatics Istitute, Uiversity of Amsterdam Kruislaa 403, 1098 SJ Amsterdam, The Netherlads Abstract. Mixtures of Pricipal Compoet
More informationProperties of MLE: consistency, asymptotic normality. Fisher information.
Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout
More informationAnalyzing Longitudinal Data from Complex Surveys Using SUDAAN
Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical
More informationDAME - Microsoft Excel add-in for solving multicriteria decision problems with scenarios Radomir Perzina 1, Jaroslav Ramik 2
Itroductio DAME - Microsoft Excel add-i for solvig multicriteria decisio problems with scearios Radomir Perzia, Jaroslav Ramik 2 Abstract. The mai goal of every ecoomic aget is to make a good decisio,
More informationRunning Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis
Ruig Time ( 3.) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.
More informationThe analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection
The aalysis of the Courot oligopoly model cosiderig the subjective motive i the strategy selectio Shigehito Furuyama Teruhisa Nakai Departmet of Systems Maagemet Egieerig Faculty of Egieerig Kasai Uiversity
More informationLecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)
18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the
More informationVladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT
Keywords: project maagemet, resource allocatio, etwork plaig Vladimir N Burkov, Dmitri A Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT The paper deals with the problems of resource allocatio betwee
More informationResearch Article Sign Data Derivative Recovery
Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 63070, 7 pages doi:0.540/0/63070 Research Article Sig Data Derivative Recovery L. M. Housto, G. A. Glass, ad A. D. Dymikov
More informationPROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM
PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics
More information1 Computing the Standard Deviation of Sample Means
Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.
More informationTotally Corrective Boosting Algorithms that Maximize the Margin
Mafred K. Warmuth mafred@cse.ucsc.edu Ju Liao liaoju@cse.ucsc.edu Uiversity of Califoria at Sata Cruz, Sata Cruz, CA 95064, USA Guar Rätsch Guar.Raetsch@tuebige.mpg.de Friedrich Miescher Laboratory of
More informationI. Chi-squared Distributions
1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.
More informationConfidence Intervals for One Mean
Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a
More informationSystems Design Project: Indoor Location of Wireless Devices
Systems Desig Project: Idoor Locatio of Wireless Devices Prepared By: Bria Murphy Seior Systems Sciece ad Egieerig Washigto Uiversity i St. Louis Phoe: (805) 698-5295 Email: bcm1@cec.wustl.edu Supervised
More informationSAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx
SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval
More informationSECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES
SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,
More informationOn the Generalization Ability of Online Learning Algorithms for Pairwise Loss Functions
O the Geeraliatio Ability of Olie Learig Algorithms for Pairwise Loss Fuctios Purushottam Kar purushot@cse.iitk.ac.i Departmet of Computer Sciece ad Egieerig, Idia Istitute of Techology, Kapur, UP 208
More informationLecture 2: Karger s Min Cut Algorithm
priceto uiv. F 3 cos 5: Advaced Algorithm Desig Lecture : Karger s Mi Cut Algorithm Lecturer: Sajeev Arora Scribe:Sajeev Today s topic is simple but gorgeous: Karger s mi cut algorithm ad its extesio.
More informationReliability Analysis in HPC clusters
Reliability Aalysis i HPC clusters Narasimha Raju, Gottumukkala, Yuda Liu, Chokchai Box Leagsuksu 1, Raja Nassar, Stephe Scott 2 College of Egieerig & Sciece, Louisiaa ech Uiversity Oak Ridge Natioal Lab
More informationAutomatic Tuning for FOREX Trading System Using Fuzzy Time Series
utomatic Tuig for FOREX Tradig System Usig Fuzzy Time Series Kraimo Maeesilp ad Pitihate Soorasa bstract Efficiecy of the automatic currecy tradig system is time depedet due to usig fixed parameters which
More informationChapter 14 Nonparametric Statistics
Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they
More informationTaking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling
Taig DCOP to the Real World: Efficiet Complete Solutios for Distributed Multi-Evet Schedulig Rajiv T. Maheswara, Milid Tambe, Emma Bowrig, Joatha P. Pearce, ad Pradeep araatham Uiversity of Souther Califoria
More informationSolutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork
Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the
More informationDetermining the sample size
Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors
More informationSUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE
SUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE By Guillaume Lecué CNRS, LAMA, Mare-la-vallée, 77454 Frace ad By Shahar Medelso Departmet of Mathematics,
More informationMaximum Likelihood Estimators.
Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio
More informationIncremental calculation of weighted mean and variance
Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically
More informationDimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis
Joural of Machie Learig Research 8 (2007) 1027-1061 Submitted 3/06; Revised 12/06; Published 5/07 Dimesioality Reductio of Multimodal Labeled Data by Local Fisher Discrimiat Aalysis Masashi Sugiyama Departmet
More informationIrreducible polynomials with consecutive zero coefficients
Irreducible polyomials with cosecutive zero coefficiets Theodoulos Garefalakis Departmet of Mathematics, Uiversity of Crete, 71409 Heraklio, Greece Abstract Let q be a prime power. We cosider the problem
More informationReview: Classification Outline
Data Miig CS 341, Sprig 2007 Decisio Trees Neural etworks Review: Lecture 6: Classificatio issues, regressio, bayesia classificatio Pretice Hall 2 Data Miig Core Techiques Classificatio Clusterig Associatio
More informationDiscriminative Models of Integrating Document Evidence and Document-Candidate Associations for Expert Search
Discrimiative Models of Itegratig Documet Evidece ad Documet-Cadidate Associatios for Expert Search Yi Fag Departmet of Computer Sciece Purdue Uiversity West Lafayette, IN 47907, USA fagy@cs.purdue.edu
More informationarxiv:1506.08910v1 [stat.ml] 30 Jun 2015
Learig Sigle Idex Models i High Dimesios Ravi Gati, Nikhil Rao 2, Rebecca M. Willett 3 ad Robert Nowak 3 arxiv:506.0890v [stat.ml] 30 Ju 205 Wiscosi Istitutes for Discovery, 330 N Orchard St, Madiso, WI,
More informationCOMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS
COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S CONTROL CHART FOR THE CHANGES IN A PROCESS Supraee Lisawadi Departmet of Mathematics ad Statistics, Faculty of Sciece ad Techoology, Thammasat
More informationChapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:
Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries
More information5 Boolean Decision Trees (February 11)
5 Boolea Decisio Trees (February 11) 5.1 Graph Coectivity Suppose we are give a udirected graph G, represeted as a boolea adjacecy matrix = (a ij ), where a ij = 1 if ad oly if vertices i ad j are coected
More informationTIGHT BOUNDS ON EXPECTED ORDER STATISTICS
Probability i the Egieerig ad Iformatioal Scieces, 20, 2006, 667 686+ Prited i the U+S+A+ TIGHT BOUNDS ON EXPECTED ORDER STATISTICS DIMITRIS BERTSIMAS Sloa School of Maagemet ad Operatios Research Ceter
More informationInfinite Sequences and Series
CHAPTER 4 Ifiite Sequeces ad Series 4.1. Sequeces A sequece is a ifiite ordered list of umbers, for example the sequece of odd positive itegers: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29...
More informationChatpun Khamyat Department of Industrial Engineering, Kasetsart University, Bangkok, Thailand ocpky@hotmail.com
SOLVING THE OIL DELIVERY TRUCKS ROUTING PROBLEM WITH MODIFY MULTI-TRAVELING SALESMAN PROBLEM APPROACH CASE STUDY: THE SME'S OIL LOGISTIC COMPANY IN BANGKOK THAILAND Chatpu Khamyat Departmet of Idustrial
More informationClass Meeting # 16: The Fourier Transform on R n
MATH 18.152 COUSE NOTES - CLASS MEETING # 16 18.152 Itroductio to PDEs, Fall 2011 Professor: Jared Speck Class Meetig # 16: The Fourier Trasform o 1. Itroductio to the Fourier Trasform Earlier i the course,
More informationResearch Method (I) --Knowledge on Sampling (Simple Random Sampling)
Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact
More informationAMS 2000 subject classification. Primary 62G08, 62G20; secondary 62G99
VARIABLE SELECTION IN NONPARAMETRIC ADDITIVE MODELS Jia Huag 1, Joel L. Horowitz 2 ad Fegrog Wei 3 1 Uiversity of Iowa, 2 Northwester Uiversity ad 3 Uiversity of West Georgia Abstract We cosider a oparametric
More informationDomain 1: Designing a SQL Server Instance and a Database Solution
Maual SQL Server 2008 Desig, Optimize ad Maitai (70-450) 1-800-418-6789 Domai 1: Desigig a SQL Server Istace ad a Database Solutio Desigig for CPU, Memory ad Storage Capacity Requiremets Whe desigig a
More informationResearch Article Crude Oil Price Prediction Based on a Dynamic Correcting Support Vector Regression Machine
Abstract ad Applied Aalysis Volume 2013, Article ID 528678, 7 pages http://dx.doi.org/10.1155/2013/528678 Research Article Crude Oil Price Predictio Based o a Dyamic Correctig Support Vector Regressio
More informationApplication and research of fuzzy clustering analysis algorithm under micro-lecture English teaching mode
SHS Web of Cofereces 25, shscof/20162501018 Applicatio ad research of fuzzy clusterig aalysis algorithm uder micro-lecture Eglish teachig mode Yig Shi, Wei Dog, Chuyi Lou & Ya Dig Qihuagdao Istitute of
More information(VCP-310) 1-800-418-6789
Maual VMware Lesso 1: Uderstadig the VMware Product Lie I this lesso, you will first lear what virtualizatio is. Next, you ll explore the products offered by VMware that provide virtualizatio services.
More informationRecovery time guaranteed heuristic routing for improving computation complexity in survivable WDM networks
Computer Commuicatios 30 (2007) 1331 1336 wwwelseviercom/locate/comcom Recovery time guarateed heuristic routig for improvig computatio complexity i survivable WDM etworks Lei Guo * College of Iformatio
More informationNEW HIGH PERFORMANCE COMPUTATIONAL METHODS FOR MORTGAGES AND ANNUITIES. Yuri Shestopaloff,
NEW HIGH PERFORMNCE COMPUTTIONL METHODS FOR MORTGGES ND NNUITIES Yuri Shestopaloff, Geerally, mortgage ad auity equatios do ot have aalytical solutios for ukow iterest rate, which has to be foud usig umerical
More informationAn Efficient Polynomial Approximation of the Normal Distribution Function & Its Inverse Function
A Efficiet Polyomial Approximatio of the Normal Distributio Fuctio & Its Iverse Fuctio Wisto A. Richards, 1 Robi Atoie, * 1 Asho Sahai, ad 3 M. Raghuadh Acharya 1 Departmet of Mathematics & Computer Sciece;
More informationTHE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n
We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample
More informationLecture 3. denote the orthogonal complement of S k. Then. 1 x S k. n. 2 x T Ax = ( ) λ x. with x = 1, we have. i = λ k x 2 = λ k.
18.409 A Algorithmist s Toolkit September 17, 009 Lecture 3 Lecturer: Joatha Keler Scribe: Adre Wibisoo 1 Outlie Today s lecture covers three mai parts: Courat-Fischer formula ad Rayleigh quotiets The
More informationUniversal coding for classes of sources
Coexios module: m46228 Uiversal codig for classes of sources Dever Greee This work is produced by The Coexios Project ad licesed uder the Creative Commos Attributio Licese We have discussed several parametric
More informationOn Formula to Compute Primes. and the n th Prime
Applied Mathematical cieces, Vol., 0, o., 35-35 O Formula to Compute Primes ad the th Prime Issam Kaddoura Lebaese Iteratioal Uiversity Faculty of Arts ad cieces, Lebao issam.kaddoura@liu.edu.lb amih Abdul-Nabi
More informationMARTINGALES AND A BASIC APPLICATION
MARTINGALES AND A BASIC APPLICATION TURNER SMITH Abstract. This paper will develop the measure-theoretic approach to probability i order to preset the defiitio of martigales. From there we will apply this
More informationSpam Detection. A Bayesian approach to filtering spam
Spam Detectio A Bayesia approach to filterig spam Kual Mehrotra Shailedra Watave Abstract The ever icreasig meace of spam is brigig dow productivity. More tha 70% of the email messages are spam, ad it
More informationTrigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is
0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values
More informationSwaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps
Swaps: Costat maturity swaps (CMS) ad costat maturity reasury (CM) swaps A Costat Maturity Swap (CMS) swap is a swap where oe of the legs pays (respectively receives) a swap rate of a fixed maturity, while
More informationTHIN SEQUENCES AND THE GRAM MATRIX PAMELA GORKIN, JOHN E. MCCARTHY, SANDRA POTT, AND BRETT D. WICK
THIN SEQUENCES AND THE GRAM MATRIX PAMELA GORKIN, JOHN E MCCARTHY, SANDRA POTT, AND BRETT D WICK Abstract We provide a ew proof of Volberg s Theorem characterizig thi iterpolatig sequeces as those for
More informationTHE HEIGHT OF q-binary SEARCH TREES
THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average
More informationFinding the circle that best fits a set of points
Fidig the circle that best fits a set of poits L. MAISONOBE October 5 th 007 Cotets 1 Itroductio Solvig the problem.1 Priciples............................... Iitializatio.............................
More informationUniversity of California, Los Angeles Department of Statistics. Distributions related to the normal distribution
Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.
More informationPSYCHOLOGICAL STATISTICS
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics
More informationStochastic Online Scheduling with Precedence Constraints
Stochastic Olie Schedulig with Precedece Costraits Nicole Megow Tark Vredeveld July 15, 2008 Abstract We cosider the preemptive ad o-preemptive problems of schedulig obs with precedece costraits o parallel
More informationLesson 15 ANOVA (analysis of variance)
Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi
More informationAnnuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.
Auities Uder Radom Rates of Iterest II By Abraham Zas Techio I.I.T. Haifa ISRAEL ad Haifa Uiversity Haifa ISRAEL Departmet of Mathematics, Techio - Israel Istitute of Techology, 3000, Haifa, Israel I memory
More informationA CUSUM TEST OF COMMON TRENDS IN LARGE HETEROGENEOUS PANELS
A CUSUM TEST OF COMMON TRENDS IN LARGE HETEROGENEOUS PANELS JAVIER HIDALGO AND JUNGYOON LEE A. This paper examies a oparametric CUSUM-type test for commo treds i large pael data sets with idividual fixed
More informationEvaluating Model for B2C E- commerce Enterprise Development Based on DEA
, pp.180-184 http://dx.doi.org/10.14257/astl.2014.53.39 Evaluatig Model for B2C E- commerce Eterprise Developmet Based o DEA Weli Geg, Jig Ta Computer ad iformatio egieerig Istitute, Harbi Uiversity of
More informationOverview on S-Box Design Principles
Overview o S-Box Desig Priciples Debdeep Mukhopadhyay Assistat Professor Departmet of Computer Sciece ad Egieerig Idia Istitute of Techology Kharagpur INDIA -721302 What is a S-Box? S-Boxes are Boolea
More informationStackelberg Games for Adversarial Prediction Problems
Stackelberg Games for Adversarial Predictio Problems Michael Brücker Departmet of Computer Sciece Uiversity of Potsdam, Germay mibrueck@cs.ui-potsdam.de Tobias Scheffer Departmet of Computer Sciece Uiversity
More informationStatistical Learning Theory
1 / 130 Statistical Learig Theory Machie Learig Summer School, Kyoto, Japa Alexader (Sasha) Rakhli Uiversity of Pesylvaia, The Wharto School Pe Research i Machie Learig (PRiML) August 27-28, 2012 2 / 130
More informationConfidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.
Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).
More informationChapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions
Chapter 5 Uit Aual Amout ad Gradiet Fuctios IET 350 Egieerig Ecoomics Learig Objectives Chapter 5 Upo completio of this chapter you should uderstad: Calculatig future values from aual amouts. Calculatig
More informationPerfect Packing Theorems and the Average-Case Behavior of Optimal and Online Bin Packing
SIAM REVIEW Vol. 44, No. 1, pp. 95 108 c 2002 Society for Idustrial ad Applied Mathematics Perfect Packig Theorems ad the Average-Case Behavior of Optimal ad Olie Bi Packig E. G. Coffma, Jr. C. Courcoubetis
More informationCHAPTER 3 THE TIME VALUE OF MONEY
CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar i the had today is worth more tha a dollar to be received i the future because, if you had it ow, you could ivest that dollar ad ear iterest. Of all
More informationChair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics
Chair for Network Architectures ad Services Istitute of Iformatics TU Müche Prof. Carle Network Security Chapter 2 Basics 2.4 Radom Number Geeratio for Cryptographic Protocols Motivatio It is crucial to
More informationChapter 5: Inner Product Spaces
Chapter 5: Ier Product Spaces Chapter 5: Ier Product Spaces SECION A Itroductio to Ier Product Spaces By the ed of this sectio you will be able to uderstad what is meat by a ier product space give examples
More information, a Wishart distribution with n -1 degrees of freedom and scale matrix.
UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that
More informationSTUDENTS PARTICIPATION IN ONLINE LEARNING IN BUSINESS COURSES AT UNIVERSITAS TERBUKA, INDONESIA. Maya Maria, Universitas Terbuka, Indonesia
STUDENTS PARTICIPATION IN ONLINE LEARNING IN BUSINESS COURSES AT UNIVERSITAS TERBUKA, INDONESIA Maya Maria, Uiversitas Terbuka, Idoesia Co-author: Amiuddi Zuhairi, Uiversitas Terbuka, Idoesia Kuria Edah
More informationFIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix
FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. Powers of a matrix We begi with a propositio which illustrates the usefuless of the diagoalizatio. Recall that a square matrix A is diogaalizable if
More information1 The Gaussian channel
ECE 77 Lecture 0 The Gaussia chael Objective: I this lecture we will lear about commuicatio over a chael of practical iterest, i which the trasmitted sigal is subjected to additive white Gaussia oise.
More informationINVESTMENT PERFORMANCE COUNCIL (IPC)
INVESTMENT PEFOMANCE COUNCIL (IPC) INVITATION TO COMMENT: Global Ivestmet Performace Stadards (GIPS ) Guidace Statemet o Calculatio Methodology The Associatio for Ivestmet Maagemet ad esearch (AIM) seeks
More informationTHE problem of fitting a circle to a collection of points
IEEE TRANACTION ON INTRUMENTATION AND MEAUREMENT, VOL. XX, NO. Y, MONTH 000 A Few Methods for Fittig Circles to Data Dale Umbach, Kerry N. Joes Abstract Five methods are discussed to fit circles to data.
More informationNon-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring
No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy
More information