A Latent Variable Pairwise Classification Model of a Clustering Ensemble
|
|
|
- Virginia Oliver
- 10 years ago
- Views:
Transcription
1 A atent Variabe Pairwise Cassification Mode of a Custering Ensembe Vadimir Berikov Soboev Institute of mathematics, Novosibirsk State University, Russia [email protected] Abstract. This paper addresses some theoretica properties of custering ensembes. We consider the probem of custer anaysis from pattern recognition point of view. A atent variabe pairwise cassification mode is proposed for studying the efficiency (in terms of error probabiity ) of the ensembe. The notions of stabiity, homogeneity and correation between ensembe eements are introduced. An upper bound for miscassification probabiity is obtained. Numerica experiment confirms potentia usefuness of the suggested ensembe characteristics. Keywords: custering ensembe, atent variabe mode, miscassification probabiity, error bound, ensembe s homogeneity and correation. 1 Introduction Coective decision-making based on a combination of simpe agorithms is activey used in modern pattern recognition and machine earning. In ast decade, there is growing interest in custering ensembe agorithms [1,2]. In the ensembe design process, the resuts obtained by different agorithms, or by one agorithm with variousparameters settings are used. After construction of partia custering soutions, a fina coective decision is buit. Modern iterature on custering ensembes can be roughy divided into severa main categories. There are a great dea of works in which the ensembe methodoogy is adapted to new appication areas such as magnetic resonance imaging, sateite images anaysis, anaysis of genetic sequences etc (see, for exampe, [3,4,5]). Another direction aims to deveop custering ensembes methods of genera usage and eaborate efficient agorithms using various optimization techniques (e.g., [6]). Other categories of works are of more theoretica nature; their purpose is to study the properties of custering ensembes, improve measures of ensembe quaity, suggest the ways to achieve the best quaity (e.g., [7,8,9]). There is a arge number of experimenta evidences confirming a significant raise in stabiity of custering decisions for ensembe agorithms (see, for exampe, [2,10]). At the same time, theoretica grounds of custering ensembes agorithms, as opposed to the pattern cassifier ensembes theory (e.g., [11]), are sti in C. Sansone, J. Kitter, and F. Roi (Eds.): MCS 2011, NCS 6713, pp , c Springer-Verag Berin Heideberg 2011
2 280 V. Berikov the eary deveopment stage. Existing works consider mainy the asymptotic properties of custering ensembes (e.g., [7]). Custer anaysis probems are characterized by the compexity of formaization caused by substantiay subjective nature of grouping process. For the definition of custering quaity it is necessary to appy additiona a priori information in terms of natura cassification, to use data generation modes etc. In the given work we attempt to corroborate custering ensembe methodoogy by utiizing of a pattern recognition mode with atent cass abes. To avoid probems with cass renumbering, a pairwise cassification approach is used. The rest of the paper is organized as foows. In the next section we give basic definitions and introduce the mode of ensembe custer anaysis. In the third section we receive an upper bound for error probabiity (in cassifying a pair of arbitrary objects according to atent variabe abes) and give some quaitative consequences of the resut. In the forth section we introduce the estimates of ensembe characteristics. The next section describes numerica experiment that demonstrates the usage of these notions. The concusion summaries the work and gives possibe future directions. 2 Ensembe Mode et us consider a sampe s = {o (1),...,o (N) } of objects independenty and randomy seected from a genera popuation. The purpose of the anaysis is to group the objects into K 2 casses in accordance with some custering criterion; the number of casses may be either given beforehand or not (in the atter case an optima number of casses shoud be determined automaticay). et each of the objects be characterized by variabes X 1,...,X n.denoteby x = (x 1,...,x n ) the vector of these variabes for an object o, x j = X j (o), j =1,...,n. In many custering tasks it is aowabe to consider that there exists a ground truth (atent, directy unobserved) variabe Y {1,...,K} that determines to which cass an object beongs. Suppose that the observations of k-th cass are distributed according to the conditiona dencity function p k (x) = p(x Y = k), k =1,...,K. Consider the foowing mode of data generation. et each object be assigned to cass k in accordance with a priori probabiities P k = P(Y = k), k =1,...,K, where K P k = 1. After the assignment, an observabe vaue of x is determined k=1 with use of p k (x). This procedure is repeated independenty for each object. For an arbitrary pair of different objects a, b s, their correspondent observations are denoted by x(a) and x(b). et Z = I(Y (a) Y (b)),
3 A atent Variabe Pairwise Cassification Mode of a Custering Ensembe 281 where I( ) is indicator function. Denote by P Z = P[Z =1 x(a),x(b)] the probabiity of the event a and b beong to different casses, given x(a) and x(b) : P Z =1 P[Y (a) =1 x(a)] P[Y (b) =1 x(b)]... P[Y (a) =K x(a)] P[Y (b) =K x(b)] = 1 where p(x(o)) = K p k (x(o))p k, o = a, b. k=1 K k=1 p k (x(a))p k (x(b))p 2 k p(x(a))p(x(b)) et a custering agorithm μ be run to partition s into K subsets (custers). Because the numberings of custers do not matter, it is convenient to consider the equivaence reation, i.e. to indicate whether the agorithm μ assigns each pair of objects to the same cass or to different casses. et h μ (a, b) =I[μ(a) μ(b)]. et us consider the foowing mode of ensembe custering. Suppose that agorithm μ is randomized, i.e. it depends from a random vaue Ω from a given set of aowabe vaues (parameters or more generay earning settings such as bootstrap sampes, order of input objects etc). In addition to Ω, the agorithm s decisions are dependent from the true status of the pair a, b (i.e., from Z): h μ (a, b) =h μ(ω) (Z, a, b). Hereinafter we wi denote h μ(ω) (Z, a, b) =h(ω,z). Suppose that P[h(Ω,Z) =1 Z =1]=P[h(Ω,Z) =0 Z =0]=q, i.e. the conditiona probabiities of correct decision (either partition or union of objects a,b) coincide. One can say that q refects the stabiity of agorithm under various earning settings. We sha suppose that q>1/2; it means that agorithm μ provides better custering quaity than just random guessing. In machine earning theory, such a condition is known as the condition of weak earnabiity. Denote P h = P[h(Ω,Z) = 1]. This quantity shows the homogeneity of agorithm s decisions: P h cose to 0 or 1; or homogeneity index I h =1 P h (1 P h ) cose to 1 means high agreement between the soutions. Note that P h = P[h(Ω,Z) =1 Z =1]P Z + P[h(Ω,Z) =1 Z =0](1 P Z )= qp Z +(1 q)(1 P Z ). Suppose that agorithm μ is running times under randomy and independenty chosen settings. As a resut, we get random decisions h(ω 1,Z),...,,
4 282 V. Berikov h(ω,z). By Ω 1,...,Ω we denote independent statistica copies of a random vector Ω. For every Ω, agorithm μ is running independenty (it does not use the resuts obtained with other Ω, ). Suppose that the decisions are conditionay independent: P[h(Ω i1,z)=h i1,...,h(ω ij,z)=h ij Z = z] = P[(h(Ω i1,z)=h i1 Z = z] P[h(Ω ij,z) =h ij Z = z], where Ω i1,...,ω ij are arbitrary earning settings, h i1,...,h ij,z {0, 1} (we sha assume that is odd). et P h,h = P[h(Ω,Z)=1,h(Ω,Z)=1],whereΩ,Ω havethesamedistribution as Ω, andω Ω. It foows from the assumptions of independence and stabiity that P h,h = P[h(Ω,Z)=1,h(Ω )=1 Z =1]P Z + P[h(Ω,Z)=1,h(Ω )=1 Z =0](1 P Z )= q 2 P Z +(1 q) 2 (1 P Z ). (1) Denote H = 1 h(ω,z). The function =1 c(h(ω 1,Z),...,h(Ω,Z)) = I[ H > 1 2 ] sha be caed the ensembe soution for a and b, based on the majority voting. For constructing a fina ensembe custering decision, various approaches can be utiized [2]. For exampe, it is possibe to use a methodoogy based on the pairwise dissimiarity matrix H = ( H(o (i 1),o (i2)),whereo (i1),o (i2) s, o (i1) o (i2). This matrix can be considered as a matrix of pairwise distances between objects and used as input information for a dendrogram construction agorithm to form a sampe partition on a desired number of custers. 3 An Upper Bound for Miscassification Probabiity et us consider the margin [11] of the ensembe: mg = 1 { number of votes for Z number of votes against Z}, where Z =0, 1. It is easy to show that the margin equas: mg = mg( H,Z)=(2Z 1)(2 H 1). Using the notion of margin, one can represent the probabiity of wrong prediction of the true vaue of Z: P err = P Ω1,...,Ω,Z[mg( H,Z) < 0].
5 A atent Variabe Pairwise Cassification Mode of a Custering Ensembe 283 It foows from the Tchebychev s inequaity that P[U <0] < VarU (EU) 2, where EU is popuation mean, VarU is variance of random vaue U (it is required that EU >0). Thus, P Ω1,...,Ω,Z[mg( H,Z) < 0] < Var mg( H,Z) (E mg( H,Z)) 2, provided that E mg( H,Z) > 0. Theorem. The expected vaue and variance of the margin are: E mg( H,Z)=2q 1, Var mg( H,Z)= 4 (P h P h,h ). Proof. We have: E mg( H,Z)=E(2Z 1)( 2 h(ω,z) 1) = 4 E Zh(Ω,Z) 2EZ 2 E h(ω,z)+1. Because a h(ω,z) are distributed in the same way as h(ω,z), we get: E mg( H,Z)=4EZh(Ω,Z) 2P Z 2E h(ω,z)+1= 4P[Z =1,h(Ω,Z) =1] 2P Z 2P[h(Ω,Z) =1]+1. As P[h(Ω,Z) =1]=P[Z =1,h(Ω,Z) =1]+P[Z =0,h(Ω,Z) =1]= qp Z +(1 q)(1 P Z )=2qP Z +1 q P Z, we obtain: E mg( H,Z)=4qP Z 2P Z 2(2qP Z +1 q P Z )+1=2q 1. Consider the variance of margin: Var mg( H,Z)=Var(4Z H 2 H 2Z) =E(4Z H 2 H 2Z) 2 (E (4Z H 2 H 2Z)) 2 =E(16Z 2 H2 +4 h 2 +4Z 2 16Z H 2 16Z 2 H +8Z H) (E mg( H,Z) 1) 2 =E(4 H 2 +4Z 8Z H) 4(1 q) 2
6 284 V. Berikov (we appy Z 2 = Z). Next, we have E H 2 = 1 2 E ( 1 E h(ω,z)+ 1 From this, we obtain: ) 2 ( ) h(ω,z) = 1 2 E h 2 (Ω,Z) E(h(Ω,Z)h(Ω,Z)) =, :,Z)h(Ω,Z)= P h + 1 E(h(Ω Ω,Ω : Ω Ω P h,h. Var mg( H,Z) = 4 P h P h,h + 4P Z 8qP Z 4(1 q) 2. Using (1) finay we get: Var mg( H,Z) = 4 P h P h,h 4P h,h = 4 (P h P h,h ). The theorem is proved. Evidenty, the requirement E mg( H,Z) > 0 is fufied if q>1/2. et us consider the correation coefficient ρ between h = h(ω,z)andh = h(ω,z), where Ω Ω.Wehave ρ = ρ h,h = P h,h P 2 h P h (1 P h ). Because P h P h,h = P h P 2 h + P 2 h P h,h, weobtain Var (mg( H,Z)) = 4 (1 ρ) P h(1 P h ). Note that P h P h,h = q(1 q), and after necessary transformations we get the foowing upper bound for error probabiity: P err < 1 ( ) 1 1 4(1 ρ)p h (1 P h ) 1. The obtained expression aows to make some quaitative concusions. Namey, if the mode assumptions are fufied and q>1/2, then under other conditions being equa the foowing statements are vaid: - the probabiity of error decreases with an increase in number of ensembe eements; - an increase in homogeneity of the ensembe and raise of correation between its outputs reduce the probabiity of error (note that a signed vaue of the correation coefficient is meant).
7 A atent Variabe Pairwise Cassification Mode of a Custering Ensembe Estimating Characteristics of a Custering Ensembe To evauate the quaity of a custering ensembe, it is necessary to estimate ensembe s characteristics (in our mode homogeneity and correation) from a finite number of ensembe eements. For an arbitrary pair of different objects a and b, the estimate of the ensembe s homogeneity can be found as foows: where Î h (a, b) =1 ˆP h (a, b)(1 ˆP h (a, b)), ˆP h (a, b) = 1 h (a, b). Unfortunatey, the straightforward estimation of the correation coefficient ρ(a, b) is impossibe: under a fixed sampe, every pair of custering agorithms give conditionay independent decisions. et us introduce a simiar notion: the averaged correation coefficient, where the averaging is done over a pairs of different objects: ρ = cov(h,h ) σ 2 ; (h) where the covariance cov(h,h )=h h h 2, and the variance =1 h h 2 2 = N(N 1) ( 1) a,b: a b 2 1 h = h (a, b) = N(N 1) a,b: a b σ 2 (h) =h 2 h 2 = h h 2. h (a, b) h (a, b),, : 2 ˆP h (a, b), N(N 1) a,b: a b Simiary, it is possibe to introduce the averaged homogeneity index: Ī h = 2 N(N 1) Î h (a, b). a,b: a b 5 Numerica Experiment To verify the appicabiity of the suggested methodoogy for the anaysis of custering ensembe behavior, the statistica modeing approach was used. In the modeing, artificia data sets are repeatedy generated according to certain distribution cass (a type of custering tasks ). Each data set is cassified by the ensembe agorithm. The correct cassification rate, averaged over a given number of trias, determines agorithm s performance for the given type of tasks.
8 286 V. Berikov The foowing experiment was performed. In each tria, two casses are independenty samped according to the Gauss mutivariate distributions where m 1,m 2 are vectors of means, N (m 1,Σ), N (m 2,Σ), Σ = σi is a diagona n-dimensiona covariance matrix, σ =(σ 1,...,σ n ) is a vector of variances. Variabe X i sha be caed noisy, if for some i {1,..., n}, σ i = σ noise >> 1. In our experiment, the set of noisy variabes {X i1,..., X innoize } is chosen at random. For those variabes that are not noisy, we set σ i = σ 0 = const. Both casses have the same sampe size. The mixture of sampes is cassified by the ensembe of k-means custering agorithms. Each agorithm performs custering in the randomy chosen variabe subspace of dimensionaity n ens. The ensembe decision for each pair of objects (i.e., either unite them or assign to different casses) is made by the majority voting procedure. The true overa performance P cor of the ensembe is determined as the proportion of correcty cassified pairs P cor 0.3 P ind 0.2 averaged ρ averaged I h n noise Fig. 1. Exampe of experiment resuts (averaged over 100 trias). Experiment settings: N = 60, n = 10, m 1 = 0, m 2 = 1, σ noise = 10, σ 0 =0.2, n ens = 2, ensembe size = 15.
9 A atent Variabe Pairwise Cassification Mode of a Custering Ensembe 287 An exampe of experiment resuts is shown in Figure 1. The graphs dispay the dependence of averaged vaues of P cor, ρ and Īh from the number of noisy variabes n noise. To demonstrate the effectiveness of the ensembe soution in comparison with individua custering, the averaged performance rate P ind of a singe k-means custering agorithm is given (this agorithm performs custering in the whoe feature space of dimensionaity n). From this exampe, one can concude that a) in average, the ensembe agorithm has better performance than a singe custering agorithm (when a noise presents), and b) the dynamics of estimated ensembe characteristics (averaged homogeneity and correation) reproduces we the behavior of correct cassification rate (note that this rate is directy unobserved in rea custering probems). When averaged correation and homogeneity index are sufficienty arge, one can expect good cassification quaity. Concusion A atent variabe pairwise cassification mode is proposed for studying nonasymptotic properties of custering ensembes. In this mode, the notions of stabiity, homogeneity and correation between ensembe eements are utiized. An upper bound for probabiity of error is obtained. Theoretica anaysis of the suggested mode aows to make a concusion that the probabiity of correct decision increases with an increase in number of ensembe eements. It is aso found that a arge degree of agreement between partia custering soutions (expressed in our mode in terms of homogeneity and correation between ensembe eements), under condition of independence of base custering agorithms, indicates good cassification performance. Numerica experiment aso confirms this concusion. The foowing possibe future directions can be indicated. It is interesting to study intensiona connections between the notions used in the suggested mode (conditiona independence, stabiity, homogeneity and correation) and other known concepts such as mutua information [2] and diversity in custering ensembes (e.g., [8,9]). Another direction coud aim to improve the tightness of the obtained error bound. Acknowedgements This work was partiay supported by the Russian Foundation for Basic Research, projects a, a. References 1. Jain, A.K.: Data Custering: 50 Years Beyond K-Means. Pattern Recognition etters 31(8), (2010) 2. Streh, A., Ghosh, J.: Custering ensembes - a knowedge reuse framework for combining mutipe partitions. The Journa of Machine earning Research 3, (2002)
10 288 V. Berikov 3. Kuncheva,.I., Rodriguez, J.J., Pumpton, C.O., inden, D.E.J., Johnston, S.J.: Random Subspace Ensembes for fmri Cassification. IEEE Transactions on Medica Imaging 29(2), (2010) 4. Pestunov, I.A., Berikov, V.B., Kuikova, E.A.: Grid-based ensembe custering agorithm using sequence of fixed grids. In: Proc. of the 3rd IASTED Intern. Conf. on Automation, Contro, and Information Technoogy, pp ACTA Press, Cagary (2010) 5. Iam-on, N., Boongoen, T., Garrett, S.: CE: a ink-based custer ensembe method for improved gene expression data anaysis. Bioinformatics 26(12), (2010) 6. Hong, Y., Kwong, S.: To combine steady-state genetic agorithm and ensembe earning for data custering. Pattern Recognition etters 29(9), (2008) 7. Topchy, A., aw, M., Jain, A., Fred, A.: Anaysis of Consensus Partition in Custer Ensembe. In: Fourth IEEE Internationa Conference on Data Mining, pp IEEE Press, New York (2004) 8. Hadjitodorov, S.T., Kuncheva,.I., Todorova,.P.: Moderate diversity for better custer ensembes. Information Fusion 7(3), (2006) 9. Azimi, J., Fern, X.: Adaptive Custer Ensembe Seection. In: Proceedings of Internationa Joint Conference on Artificia Inteigence, pp (2009) 10. Kuncheva,.: Combining Pattern Cassifiers. Methods and Agorithms. John Wiey & Sons, Hoboken (2004) 11. Breiman,.: Random Forests. Machine earning 45(1), 5 32 (2001)
Face Hallucination and Recognition
Face Haucination and Recognition Xiaogang Wang and Xiaoou Tang Department of Information Engineering, The Chinese University of Hong Kong {xgwang1, xtang}@ie.cuhk.edu.hk http://mmab.ie.cuhk.edu.hk Abstract.
Secure Network Coding with a Cost Criterion
Secure Network Coding with a Cost Criterion Jianong Tan, Murie Médard Laboratory for Information and Decision Systems Massachusetts Institute of Technoogy Cambridge, MA 0239, USA E-mai: {jianong, medard}@mit.edu
SELECTING THE SUITABLE ERP SYSTEM: A FUZZY AHP APPROACH. Ufuk Cebeci
SELECTING THE SUITABLE ERP SYSTEM: A FUZZY AHP APPROACH Ufuk Cebeci Department of Industria Engineering, Istanbu Technica University, Macka, Istanbu, Turkey - [email protected] Abstract An Enterprise
A Supplier Evaluation System for Automotive Industry According To Iso/Ts 16949 Requirements
A Suppier Evauation System for Automotive Industry According To Iso/Ts 16949 Requirements DILEK PINAR ÖZTOP 1, ASLI AKSOY 2,*, NURSEL ÖZTÜRK 2 1 HONDA TR Purchasing Department, 41480, Çayırova - Gebze,
Fast Robust Hashing. ) [7] will be re-mapped (and therefore discarded), due to the load-balancing property of hashing.
Fast Robust Hashing Manue Urueña, David Larrabeiti and Pabo Serrano Universidad Caros III de Madrid E-89 Leganés (Madrid), Spain Emai: {muruenya,darra,pabo}@it.uc3m.es Abstract As statefu fow-aware services
Multi-Robot Task Scheduling
Proc of IEEE Internationa Conference on Robotics and Automation, Karsruhe, Germany, 013 Muti-Robot Tas Scheduing Yu Zhang and Lynne E Parer Abstract The scheduing probem has been studied extensivey in
Vendor Performance Measurement Using Fuzzy Logic Controller
The Journa of Mathematics and Computer Science Avaiabe onine at http://www.tjmcs.com The Journa of Mathematics and Computer Science Vo.2 No.2 (2011) 311-318 Performance Measurement Using Fuzzy Logic Controer
Art of Java Web Development By Neal Ford 624 pages US$44.95 Manning Publications, 2004 ISBN: 1-932394-06-0
IEEE DISTRIBUTED SYSTEMS ONLINE 1541-4922 2005 Pubished by the IEEE Computer Society Vo. 6, No. 5; May 2005 Editor: Marcin Paprzycki, http://www.cs.okstate.edu/%7emarcin/ Book Reviews: Java Toos and Frameworks
ASYMPTOTIC DIRECTION FOR RANDOM WALKS IN RANDOM ENVIRONMENTS arxiv:math/0512388v2 [math.pr] 11 Dec 2007
ASYMPTOTIC DIRECTION FOR RANDOM WALKS IN RANDOM ENVIRONMENTS arxiv:math/0512388v2 [math.pr] 11 Dec 2007 FRANÇOIS SIMENHAUS Université Paris 7, Mathématiques, case 7012, 2, pace Jussieu, 75251 Paris, France
Teamwork. Abstract. 2.1 Overview
2 Teamwork Abstract This chapter presents one of the basic eements of software projects teamwork. It addresses how to buid teams in a way that promotes team members accountabiity and responsibiity, and
Australian Bureau of Statistics Management of Business Providers
Purpose Austraian Bureau of Statistics Management of Business Providers 1 The principa objective of the Austraian Bureau of Statistics (ABS) in respect of business providers is to impose the owest oad
The Basel II Risk Parameters. Second edition
The Base II Risk Parameters Second edition . Bernd Engemann Editors Robert Rauhmeier The Base II Risk Parameters Estimation, Vaidation, Stress Testing with Appications to Loan Risk Management Editors Dr.
Chapter 3: e-business Integration Patterns
Chapter 3: e-business Integration Patterns Page 1 of 9 Chapter 3: e-business Integration Patterns "Consistency is the ast refuge of the unimaginative." Oscar Wide In This Chapter What Are Integration Patterns?
Risk Margin for a Non-Life Insurance Run-Off
Risk Margin for a Non-Life Insurance Run-Off Mario V. Wüthrich, Pau Embrechts, Andreas Tsanakas February 2, 2011 Abstract For sovency purposes insurance companies need to cacuate so-caed best-estimate
A New Statistical Approach to Network Anomaly Detection
A New Statistica Approach to Network Anomay Detection Christian Caegari, Sandrine Vaton 2, and Michee Pagano Dept of Information Engineering, University of Pisa, ITALY E-mai: {christiancaegari,mpagano}@ietunipiit
COMPARISON OF DIFFUSION MODELS IN ASTRONOMICAL OBJECT LOCALIZATION
COMPARISON OF DIFFUSION MODELS IN ASTRONOMICAL OBJECT LOCALIZATION Františe Mojžíš Department of Computing and Contro Engineering, ICT Prague, Technicá, 8 Prague [email protected] Abstract This
Risk Margin for a Non-Life Insurance Run-Off
Risk Margin for a Non-Life Insurance Run-Off Mario V. Wüthrich, Pau Embrechts, Andreas Tsanakas August 15, 2011 Abstract For sovency purposes insurance companies need to cacuate so-caed best-estimate reserves
Betting Strategies, Market Selection, and the Wisdom of Crowds
Betting Strategies, Market Seection, and the Wisdom of Crowds Wiemien Kets Northwestern University [email protected] David M. Pennock Microsoft Research New York City [email protected]
An Integrated Data Management Framework of Wireless Sensor Network
An Integrated Data Management Framework of Wireess Sensor Network for Agricutura Appications 1,2 Zhao Liang, 2 He Liyuan, 1 Zheng Fang, 1 Jin Xing 1 Coege of Science, Huazhong Agricutura University, Wuhan
ACO and SVM Selection Feature Weighting of Network Intrusion Detection Method
, pp. 129-270 http://dx.doi.org/10.14257/ijsia.2015.9.4.24 ACO and SVM Seection Feature Weighting of Network Intrusion Detection Method Wang Xingzhu Furong Coege Hunan, University of Arts and Science,
Design of Follow-Up Experiments for Improving Model Discrimination and Parameter Estimation
Design of Foow-Up Experiments for Improving Mode Discrimination and Parameter Estimation Szu Hui Ng 1 Stephen E. Chick 2 Nationa University of Singapore, 10 Kent Ridge Crescent, Singapore 119260. Technoogy
An Idiot s guide to Support vector machines (SVMs)
An Idiot s guide to Support vector machines (SVMs) R. Berwick, Viage Idiot SVMs: A New Generation of Learning Agorithms Pre 1980: Amost a earning methods earned inear decision surfaces. Linear earning
STUDY MATERIAL. M.B.A. PROGRAMME (Code No. 411) (Effective from 2009-2010) II SEMESTER 209MBT27 APPLIED RESEARCH METHODS IN MANAGEMENT
St. PETER'S UNIVERSITY St. Peter s Institute of Higher Education and Research (Decared Under Section 3 of the UGC Act, 1956) AVADI, CHENNAI - 600 054 TAMIL NADU STUDY MATERIAL M.B.A. PROGRAMME (Code No.
Simultaneous Routing and Power Allocation in CDMA Wireless Data Networks
Simutaneous Routing and Power Aocation in CDMA Wireess Data Networks Mikae Johansson *,LinXiao and Stephen Boyd * Department of Signas, Sensors and Systems Roya Institute of Technoogy, SE 00 Stockhom,
Chapter 1 Structural Mechanics
Chapter Structura echanics Introduction There are many different types of structures a around us. Each structure has a specific purpose or function. Some structures are simpe, whie others are compex; however
Automatic Structure Discovery for Large Source Code
Automatic Structure Discovery for Large Source Code By Sarge Rogatch Master Thesis Universiteit van Amsterdam, Artificia Inteigence, 2010 Automatic Structure Discovery for Large Source Code Page 1 of 130
Chapter 2 Traditional Software Development
Chapter 2 Traditiona Software Deveopment 2.1 History of Project Management Large projects from the past must aready have had some sort of project management, such the Pyramid of Giza or Pyramid of Cheops,
FRAME BASED TEXTURE CLASSIFICATION BY CONSIDERING VARIOUS SPATIAL NEIGHBORHOODS. Karl Skretting and John Håkon Husøy
FRAME BASED TEXTURE CLASSIFICATION BY CONSIDERING VARIOUS SPATIAL NEIGHBORHOODS Kar Skretting and John Håkon Husøy University of Stavanger, Department of Eectrica and Computer Engineering N-4036 Stavanger,
Virtual trunk simulation
Virtua trunk simuation Samui Aato * Laboratory of Teecommunications Technoogy Hesinki University of Technoogy Sivia Giordano Laboratoire de Reseaux de Communication Ecoe Poytechnique Federae de Lausanne
Learning from evaluations Processes and instruments used by GIZ as a learning organisation and their contribution to interorganisational learning
Monitoring and Evauation Unit Learning from evauations Processes and instruments used by GIZ as a earning organisation and their contribution to interorganisationa earning Contents 1.3Learning from evauations
CONTRIBUTION OF INTERNAL AUDITING IN THE VALUE OF A NURSING UNIT WITHIN THREE YEARS
Dehi Business Review X Vo. 4, No. 2, Juy - December 2003 CONTRIBUTION OF INTERNAL AUDITING IN THE VALUE OF A NURSING UNIT WITHIN THREE YEARS John N.. Var arvatsouakis atsouakis DURING the present time,
Betting on the Real Line
Betting on the Rea Line Xi Gao 1, Yiing Chen 1,, and David M. Pennock 2 1 Harvard University, {xagao,yiing}@eecs.harvard.edu 2 Yahoo! Research, [email protected] Abstract. We study the probem of designing
Capacity of Multi-service Cellular Networks with Transmission-Rate Control: A Queueing Analysis
Capacity of Muti-service Ceuar Networs with Transmission-Rate Contro: A Queueing Anaysis Eitan Atman INRIA, BP93, 2004 Route des Lucioes, 06902 Sophia-Antipois, France aso CESIMO, Facutad de Ingeniería,
Best Practices for Push & Pull Using Oracle Inventory Stock Locators. Introduction to Master Data and Master Data Management (MDM): Part 1
SPECIAL CONFERENCE ISSUE THE OFFICIAL PUBLICATION OF THE Orace Appications USERS GROUP spring 2012 Introduction to Master Data and Master Data Management (MDM): Part 1 Utiizing Orace Upgrade Advisor for
Introduction the pressure for efficiency the Estates opportunity
Heathy Savings? A study of the proportion of NHS Trusts with an in-house Buidings Repair and Maintenance workforce, and a discussion of eary experiences of Suppies efficiency initiatives Management Summary
Optimizing QoS-Aware Semantic Web Service Composition
Optimizing QoS-Aware Semantic Web Service Composition Freddy Lécué The University of Manchester Booth Street East, Manchester, UK {(firstname.astname)@manchester.ac.uk} Abstract. Ranking and optimization
Packet Classification with Network Traffic Statistics
Packet Cassification with Network Traffic Statistics Yaxuan Qi and Jun Li Research Institute of Information Technoogy (RIIT), Tsinghua Uniersity Beijing, China, 100084 Abstract-- Packet cassification on
Big Data projects and use cases. Claus Samuelsen IBM Analytics, Europe [email protected]
Big projects and use cases Caus Samuesen IBM Anaytics, Europe [email protected] IBM Sofware Overview of BigInsights IBM BigInsights Scientist Free Quick Start (non production): IBM Open Patform BigInsights
Advanced ColdFusion 4.0 Application Development - 3 - Server Clustering Using Bright Tiger
Advanced CodFusion 4.0 Appication Deveopment - CH 3 - Server Custering Using Bri.. Page 1 of 7 [Figures are not incuded in this sampe chapter] Advanced CodFusion 4.0 Appication Deveopment - 3 - Server
A Similarity Search Scheme over Encrypted Cloud Images based on Secure Transformation
A Simiarity Search Scheme over Encrypted Coud Images based on Secure Transormation Zhihua Xia, Yi Zhu, Xingming Sun, and Jin Wang Jiangsu Engineering Center o Network Monitoring, Nanjing University o Inormation
A quantum model for the stock market
A quantum mode for the stock market Authors: Chao Zhang a,, Lu Huang b Affiiations: a Schoo of Physics and Engineering, Sun Yat-sen University, Guangzhou 5175, China b Schoo of Economics and Business Administration,
Traffic classification-based spam filter
Traffic cassification-based spam fiter Ni Zhang 1,2, Yu Jiang 3, Binxing Fang 1, Xueqi Cheng 1, Li Guo 1 1 Software Division, Institute of Computing Technoogy, Chinese Academy of Sciences, 100080, Beijing,
Fixed income managers: evolution or revolution
Fixed income managers: evoution or revoution Traditiona approaches to managing fixed interest funds rey on benchmarks that may not represent optima risk and return outcomes. New techniques based on separate
READING A CREDIT REPORT
Name Date CHAPTER 6 STUDENT ACTIVITY SHEET READING A CREDIT REPORT Review the sampe credit report. Then search for a sampe credit report onine, print it off, and answer the questions beow. This activity
Insertion and deletion correcting DNA barcodes based on watermarks
Kracht and Schober BMC Bioinformatics (2015) 16:50 DOI 10.1186/s12859-015-0482-7 METHODOLOGY ARTICLE Open Access Insertion and deetion correcting DNA barcodes based on watermarks David Kracht * and Steffen
Overview of Health and Safety in China
Overview of Heath and Safety in China Hongyuan Wei 1, Leping Dang 1, and Mark Hoye 2 1 Schoo of Chemica Engineering, Tianjin University, Tianjin 300072, P R China, E-mai: [email protected] 2 AstraZeneca
Human Capital & Human Resources Certificate Programs
MANAGEMENT CONCEPTS Human Capita & Human Resources Certificate Programs Programs to deveop functiona and strategic skis in: Human Capita // Human Resources ENROLL TODAY! Contract Hoder Contract GS-02F-0010J
The Comparison and Selection of Programming Languages for High Energy Physics Applications
The Comparison and Seection of Programming Languages for High Energy Physics Appications TN-91-6 June 1991 (TN) Bebo White Stanford Linear Acceerator Center P.O. Box 4349, Bin 97 Stanford, Caifornia 94309
Maintenance activities planning and grouping for complex structure systems
Maintenance activities panning and grouping for compex structure systems Hai Canh u, Phuc Do an, Anne Barros, Christophe Berenguer To cite this version: Hai Canh u, Phuc Do an, Anne Barros, Christophe
Load Balancing in Distributed Web Server Systems with Partial Document Replication *
Load Baancing in Distributed Web Server Systems with Partia Document Repication * Ling Zhuo Cho-Li Wang Francis C. M. Lau Department of Computer Science and Information Systems The University of Hong Kong
Fast b-matching via Sufficient Selection Belief Propagation
Fast b-matching via Sufficient Seection Beief Propagation Bert Huang Computer Science Department Coumbia University New York, NY 127 [email protected] Tony Jebara Computer Science Department Coumbia
Oligopoly in Insurance Markets
Oigopoy in Insurance Markets June 3, 2008 Abstract We consider an oigopoistic insurance market with individuas who differ in their degrees of accident probabiities. Insurers compete in coverage and premium.
Oracle Project Financial Planning. User's Guide Release 11.1.2.2
Orace Project Financia Panning User's Guide Reease 11.1.2.2 Project Financia Panning User's Guide, 11.1.2.2 Copyright 2012, Orace and/or its affiiates. A rights reserved. Authors: EPM Information Deveopment
Sketch-based Network-wide Traffic Anomaly Detection
Sketch-based Network-wide Traffic Anomay Detection Yang Liu, Linfeng Zhang, and Yong Guan Department of Eectrica and Computer Engineering Iowa State University, Ames, Iowa 500 Emai: {yang, zhangf, guan}@iastate.edu
Introduction to XSL. Max Froumentin - W3C
Introduction to XSL Max Froumentin - W3C Introduction to XSL XML Documents Stying XML Documents XSL Exampe I: Hamet Exampe II: Mixed Writing Modes Exampe III: database Other Exampes How do they do that?
Leakage detection in water pipe networks using a Bayesian probabilistic framework
Probabiistic Engineering Mechanics 18 (2003) 315 327 www.esevier.com/ocate/probengmech Leakage detection in water pipe networks using a Bayesian probabiistic framework Z. Pouakis, D. Vaougeorgis, C. Papadimitriou*
Integrating Risk into your Plant Lifecycle A next generation software architecture for risk based
Integrating Risk into your Pant Lifecyce A next generation software architecture for risk based operations Dr Nic Cavanagh 1, Dr Jeremy Linn 2 and Coin Hickey 3 1 Head of Safeti Product Management, DNV
Advantages and Disadvantages of Sampling. Vermont ASQ Meeting October 26, 2011
Advantages and Disadvantages of Samping Vermont ASQ Meeting October 26, 2011 Jeffrey S. Soomon Genera Dynamics Armament and Technica Products, Inc. Wiiston, VT 05495 Outine I. Definition and Exampes II.
Market Design & Analysis for a P2P Backup System
Market Design & Anaysis for a P2P Backup System Sven Seuken Schoo of Engineering & Appied Sciences Harvard University, Cambridge, MA [email protected] Denis Chares, Max Chickering, Sidd Puri Microsoft
Design and Analysis of a Hidden Peer-to-peer Backup Market
Design and Anaysis of a Hidden Peer-to-peer Backup Market Sven Seuken, Denis Chares, Max Chickering, Mary Czerwinski Kama Jain, David C. Parkes, Sidd Puri, and Desney Tan December, 2015 Abstract We present
CLOUD service providers manage an enterprise-class
IEEE TRANSACTIONS ON XXXXXX, VOL X, NO X, XXXX 201X 1 Oruta: Privacy-Preserving Pubic Auditing for Shared Data in the Coud Boyang Wang, Baochun Li, Member, IEEE, and Hui Li, Member, IEEE Abstract With
We focus on systems composed of entities operating with autonomous control, such
Middeware Architecture for Federated Contro Systems Girish Baiga and P.R. Kumar University of Iinois at Urbana-Champaign A federated contro system (FCS) is composed of autonomous entities, such as cars,
GREEN: An Active Queue Management Algorithm for a Self Managed Internet
: An Active Queue Management Agorithm for a Sef Managed Internet Bartek Wydrowski and Moshe Zukerman ARC Specia Research Centre for Utra-Broadband Information Networks, EEE Department, The University of
Minimum Support Size of the Defender s Strong Stackelberg Equilibrium Strategies in Security Games
Minimum Support Size o the Deender s Strong Stackeberg Equiibrium Strategies in Security Games Jiarui Gan University o Chinese Academy o Sciences The Key Lab o Inteigent Inormation Processing, ICT, CAS
The BBC s management of its Digital Media Initiative
The BBC s management of its Digita Media Initiative Report by the Comptroer and Auditor Genera presented to the BBC Trust s Finance and Compiance Committee, 13 January 2011 Department for Cuture, Media
LADDER SAFETY Table of Contents
Tabe of Contents SECTION 1. TRAINING PROGRAM INTRODUCTION..................3 Training Objectives...........................................3 Rationae for Training.........................................3
Ricoh Healthcare. Process Optimized. Healthcare Simplified.
Ricoh Heathcare Process Optimized. Heathcare Simpified. Rather than a destination that concudes with the eimination of a paper, the Paperess Maturity Roadmap is a continuous journey to strategicay remove
Ricoh Legal. ediscovery and Document Solutions. Powerful document services provide your best defense.
Ricoh Lega ediscovery and Document Soutions Powerfu document services provide your best defense. Our peope have aways been at the heart of our vaue proposition: our experience in your industry, commitment
Let s get usable! Usability studies for indexes. Susan C. Olason. Study plan
Let s get usabe! Usabiity studies for indexes Susan C. Oason The artice discusses a series of usabiity studies on indexes from a systems engineering and human factors perspective. The purpose of these
EFFICIENT CLUSTERING OF VERY LARGE DOCUMENT COLLECTIONS
Chapter 1 EFFICIENT CLUSTERING OF VERY LARGE DOCUMENT COLLECTIONS Inderjit S. Dhion, James Fan and Yuqiang Guan Abstract An invauabe portion of scientific data occurs naturay in text form. Given a arge
Leadership & Management Certificate Programs
MANAGEMENT CONCEPTS Leadership & Management Certificate Programs Programs to deveop expertise in: Anaytics // Leadership // Professiona Skis // Supervision ENROLL TODAY! Contract oder Contract GS-02F-0010J
Storing Shared Data on the Cloud via Security-Mediator
Storing Shared Data on the Coud via Security-Mediator Boyang Wang, Sherman S. M. Chow, Ming Li, and Hui Li State Key Laboratory of Integrated Service Networks, Xidian University, Xi an, China Department
DEGREES OF ORDERS ON TORSION-FREE ABELIAN GROUPS
DEGREES OF ORDERS ON TORSION-FREE ABELIAN GROUPS ASHER M. KACH, KAREN LANGE, AND REED SOLOMON Abstract. We show that if H is an effectivey competey decomposabe computabe torsion-free abeian group, then
A train dispatching model based on fuzzy passenger demand forecasting during holidays
Journa of Industria Engineering and Management JIEM, 2013 6(1):320-335 Onine ISSN: 2013-0953 Print ISSN: 2013-8423 http://dx.doi.org/10.3926/jiem.699 A train dispatching mode based on fuzzy passenger demand
The Essence of Research Methodology
The Essence of Research Methodoogy Jan Jonker Bartjan Pennink The Essence of Research Methodoogy A Concise Guide for Master and PhD Students in Management Science Dr. Jan Jonker Nijmegen Schoo of Management
Applying graph theory to automatic vehicle tracking by remote sensing
0 0 Appying graph theory to automatic vehice tracking by remote sensing *Caros Lima Azevedo Nationa Laboratory for Civi Engineering Department of Transportation Av. Do Brasi, Lisbon, 00-0 Portuga Phone:
Finance 360 Problem Set #6 Solutions
Finance 360 Probem Set #6 Soutions 1) Suppose that you are the manager of an opera house. You have a constant margina cost of production equa to $50 (i.e. each additiona person in the theatre raises your
SPOTLIGHT. A year of transformation
WINTER ISSUE 2014 2015 SPOTLIGHT Wecome to the winter issue of Oasis Spotight. These newsetters are designed to keep you upto-date with news about the Oasis community. This quartery issue features an artice
Swingtum A Computational Theory of Fractal Dynamic Swings and Physical Cycles of Stock Market in A Quantum Price-Time Space
Swingtum A Computationa Theory of Fracta Dynamic Swings and Physica Cyces of Stock Market in A Quantum Price-Time Space Heping Pan Schoo of Information Technoogy and Mathematica Sciences, University of
Normalization of Database Tables. Functional Dependency. Examples of Functional Dependencies: So Now what is Normalization? Transitive Dependencies
ISM 602 Dr. Hamid Nemati Objectives The idea Dependencies Attributes and Design Understand concepts normaization (Higher-Leve Norma Forms) Learn how to normaize tabes Understand normaization and database
arxiv:0712.3028v2 [astro-ph] 7 Jan 2008
A practica guide to Basic Statistica Techniques for Data Anaysis in Cosmoogy Licia Verde arxiv:0712.3028v2 [astro-ph] 7 Jan 2008 ICREA & Institute of Space Sciences (IEEC-CSIC), Fac. Ciencies, Campus UAB
DEGREES OF ORDERS ON TORSION-FREE ABELIAN GROUPS
1 DEGREES OF ORDERS ON TORSION-FREE ABELIAN GROUPS 2 ASHER M. KACH, KAREN LANGE, AND REED SOLOMON Abstract. We show that if H is an effectivey competey decomposabe computabe torsion-free abeian group,
Oracle Hyperion Tax Provision. User's Guide Release 11.1.2.2
Orace Hyperion Tax Provision User's Guide Reease 11.1.2.2 Tax Provision User's Guide, 11.1.2.2 Copyright 2013, Orace and/or its affiiates. A rights reserved. Authors: EPM Information Deveopment Team Orace
3.3 SOFTWARE RISK MANAGEMENT (SRM)
93 3.3 SOFTWARE RISK MANAGEMENT (SRM) Fig. 3.2 SRM is a process buit in five steps. The steps are: Identify Anayse Pan Track Resove The process is continuous in nature and handed dynamicay throughout ifecyce
University of Southern California
Master of Science in Financia Engineering Viterbi Schoo of Engineering University of Southern Caifornia Dia 1-866-469-3239 (Meeting number 924 898 113) to hear the audio portion, or isten through your
