Uncertain Data Mining: A New Research Direction

Size: px
Start display at page:

Download "Uncertain Data Mining: A New Research Direction"

Transcription

1 Uncertan Data Mnng: A New Research Drecton Mchael Chau 1, Reynold Cheng, and Ben Kao 3 1: School of Busness, The Unversty of Hong Kong, Pokfulam, Hong Kong : Department of Computng, Hong Kong Polytechnc Unversty Kowloon, Hong Kong 3: Department of Computer Scence, The Unversty of Hong Kong, Pokfulam, Hong Kong Emals: mchau@busness.hku.hk, csckcheng@comp.polyu.edu.hk, kao@cs.hku.hk Abstract Data uncertanty s often found n real-world applcatons due to reasons such as mprecse measurement, outdated sources, or samplng errors. Recently, much research has been publshed n the area of managng data uncertanty n databases. We propose that when data mnng s performed on uncertan data, data uncertanty has to be consdered n order to obtan hgh qualty data mnng results. We call ths the "Uncertan Data Mnng" problem. In ths paper, we present a framework for possble research drectons n ths area. We also present the UK-means clusterng algorthm as an example to llustrate how the tradtonal K-means algorthm can be modfed to handle data uncertanty n data mnng. 1. Introducton Data s often assocated wth uncertanty because of measurement naccuracy, samplng dscrepancy, outdated data sources, or other errors. Ths s especally true for applcatons that requre nteracton wth the physcal world, such as locaton-based servces [15] and sensor montorng [3]. For example, n the scenaro of movng obects (such as vehcles or people), t s mpossble for the database to track the exact locatons of all obects at all tme nstants. Therefore, the locaton of each obect s assocated wth uncertanty between updates [4]. These varous sources of uncertanty have to be consdered n order to produce accurate query and mnng results. In recent years, there has been much research on the management of uncertan data n databases, such as the representaton of uncertanty n databases and queryng data wth uncertanty. However, lttle research work has addressed the ssue of mnng uncertan data. We note that wth uncertanty, data values are no longer atomc. To apply tradtonal data mnng technques, uncertan data has to be summarzed nto atomc values. Takng movng-obect applcatons as an example agan, the locaton of an obect can be summarzed ether by ts last recorded locaton, or by an expected locaton (f the probablty dstrbuton of an obect s locaton s taken nto account). Unfortunately, dscrepancy n the summarzed recorded values and the actual values could serously affect the qualty of the mnng results. Fgure 1 llustrates ths problem when a clusterng algorthm s appled to movng obects wth locaton uncertanty. Fgure 1(a) shows the actual locatons of a set of obects, and Fgure 1(b) shows the recorded locaton of these obects, whch are already outdated. The clusters obtaned from these outdated values could be sgnfcantly dfferent from those obtaned as f the actual locatons were avalable (Fgure 1(b)). If we solely rely on the recorded values, many obects could possbly be put nto wrong clusters. Even worse, each member of a cluster would change the cluster centrods, thus resultng n more errors. Chau, M., Cheng, R., and Kao, B., "Uncertan Data Mnng: A New Research Drecton," n Proceedngs of the Workshop on the Scences of the Artfcal, Hualen, Tawan, December 7-8, 005.

2 Fgure 1. (a) The real-world data are parttoned nto three clusters (a, b, c). (b) The recorded locatons of some obects (shaded) are not the same as ther true locaton, thus creatng clusters a, b, c and c. Note that a has one fewer obect than a, and b has one more obect than b. Also, c s mstakenly splt nto c and c. (c) Lne uncertanty s consdered to produce clusters a, b and c. The clusterng result s closer to that of (a) than (b). We suggest ncorporatng uncertanty nformaton, such as the probablty densty functons (pdf) of uncertan data, nto exstng data mnng methods so that the mnng results could resemble closer to the results obtaned as f actual data were avalable and used n the mnng process (Fgure (c)). In ths paper we study how uncertanty can be ncorporated n data mnng by usng data clusterng as a motvatng example. We call ths the Uncertan Data Mnng problem. In ths paper, we present a framework for possble research drectons n ths area. The rest of the paper s structured as follows. Related work s revewed n Secton. In Secton 3 we defne the problem of clusterng on data wth uncertanty and present our proposed algorthm. Secton 4 presents the applcaton of our algorthm to a movng-obect database. Detaled experment results are shown n Secton 5. We conclude our paper and suggest possble research drectons n Secton 6.. Research Background In recent years, there s sgnfcant research nterest n data uncertanty management. Data uncertanty can be categorzed nto two types, namely exstental uncertanty and value uncertanty. In the frst type t s uncertan whether the obect or data tuple exsts or not. For example, a tuple n a relatonal database could be assocated wth a probablty value that ndcates the confdence of ts presence [1,11]. In value uncertanty, a data tem s modelled as a closed regon whch bounds ts possble values, together wth a probablty densty functon (pdf) of ts value [3,4,1,15]. Ths model can be used to quantfy the mprecson of locaton and sensor data n a constantly-evolvng envronment. Most works n ths area have been devoted to mprecse queres, whch provde probablstc guarantees over correctness of answers. For example, n [5], ndexng solutons for range queres over uncertan data have been proposed. The same authors also proposed solutons for aggregate queres such as nearest-neghbor queres n [4]. Notce that all these works have appled the study of uncertan data management to smple database queres, nstead of to the relatvely more complcated data analyss and mnng problems. The clusterng problem has been well studed n data mnng research. A standard clusterng process conssts of fve maor steps: pattern representaton, defnton of a pattern smlarty metrc, clusterng or groupng, data abstracton, and output assessment [10]. Only a few studes on data mnng or data clusterng for uncertan data have been reported. Hamdan and Govaert have addressed the problem of fttng mxture denstes to uncertan data for clusterng usng the EM algorthm [8]. However, the model cannot be readly appled to other clusterng algorthms and s rather customzed for EM. Clusterng on nterval data also has been studed. Dfferent dstance measures, lke cty-block dstance or Mnkowsk dstance, have been used n measurng the smlarty between two ntervals [6,9]. The pdf of the nterval s not taken nto account n most of these metrcs. Another related area of research s fuzzy clusterng. Fuzzy clusterng has been long studed n fuzzy logc [13]. In fuzzy clusterng, a cluster s represented by a fuzzy subset of a set of obects. Each obect has a degree of belongngness for each cluster. In other words, an obect can belong to more than one cluster, each

3 wth a dfferent degree. The fuzzy c-means algorthm was one of the most wdely used fuzzy clusterng method [,7]. Dfferent fuzzy clusterng methods have been appled on normal data or fuzzy data to produce fuzzy clusters [14]. Whle ther work s based on a fuzzy data model, our work s developed based on the uncertanty model of movng obects. 3. Taxonomy of Uncertan Data Mnng In Fgure, we propose a taxonomy to llustrate how data mnng methods can be classfed based on whether data mprecson s consdered. There are a number of common data mnng technques, e.g., assocaton rule mnng, data classfcaton, data clusterng, that need to be modfed n order to handle uncertan data. Moreover, we dstngush two types of data clusterng: hard clusterng and fuzzy clusterng. Hard clusterng ams at mprovng the accuracy of clusterng by consderng expected data values after data uncertanty s consdered. On the other hand, fuzzy clusterng presents the clusterng result n a fuzzy form. An example of a fuzzy clusterng result s that each data tem s gven a probablty of beng assgned to each member n a set of clusters [14]. Fgure. A taxonomy of data mnng on data wth uncertanty For example, when uncertanty s consdered, there s an nterestng problem on how each tuple and the uncertanty assocated should be represented n the dataset. Moreover, the noton of support and other metrcs would need to be redefned. Well-known assocaton rule mnng algorthm (such as Apror) has to be revsed n order to take ths nto account. Smlarly, n data classfcaton and data clusterng, tradtonal algorthms may not work any more because uncertanty was not taken nto account. Important metrcs, lke cluster centrods, dstance between two obects, or dstance between an obect and a centrod, have to be redefned and further studed. 4. Example on Uncertan Data Clusterng In ths secton, we present our work on uncertan data clusterng as an example of uncertan data mnng. Ths llustrates our dea of adaptng tradtonal data mnng algorthm for uncertan data. 4.1 Problem Defnton Let S be a set of V-dmensonal vectors x, where = 1 to n, representng the attrbute values of all the records to be consdered n the clusterng applcaton. Each record o s assocated wth a probablty densty functon (pdf), f (x), whch s the probablty densty functon of o s attrbute values x at tme t. We do not lmt how the uncertanty functon evolves over tme, or what the probablty densty functon of a record s. An example pdf s the unform densty functon, whch depcts the worst-case or most uncertan scenaro [3]. Another pdf commonly used s the Gaussan dstrbuton, whch can be used to descrbe measurement errors [1,15].

4 The clusterng problem s to fnd a set C of clusters C, where = 1 to K, wth cluster means c based on smlarty. Dfferent clusterng algorthms have dfferent obectve functons, but the general dea s to mnmze the dstance between obects n the same cluster whle maxmzng the dstance between obects n dfferent clusters. Mnmzaton of ntra-cluster dstance can also be vewed as the mnmzaton of the dstance between each data pont x and the cluster means c of the cluster C that x s assgned to. In ths paper, we only consder hard clusterng,.e., every obect s assgned to one and only one cluster. 4. K-means Clusterng for Precse Data The classcal K-means clusterng algorthm whch ams at fndng a set C of K clusters C wth cluster mean c to mnmze the sum of squared errors (SSE). The SSE s usually calculated as follows: K = 1 x C c x (1) where. s a dstance metrc between a data pont x and a cluster mean c. For example, the Eucldean dstance s defned as: V x y = x y () =1 The mean (centrod) of a cluster C s defned by the followng vector: 1 c = x (3) C C The K-means algorthm s as follows: 1. Assgn ntal values for cluster means c 1 to c K. repeat 3. for = 1 to n do 4. Assgn each data pont x to cluster C where c - x s the mnmum. 5. end for 6. for = 1 to K do 7. Recalculate cluster mean c of cluster C 8. end for 9. untl convergence 10. return C Convergence can be defned based on dfferent crtera. Some example convergence crtera nclude: (1) when the change n the sum of squared errors s smaller than a certan user-specfed threshold, () when no obects are reassgned to a dfferent cluster n an teraton and (3) when the number of teratons has reached a pre-defned maxmum number. 4.3 K-means Clusterng for Uncertan Data In order to take nto account data uncertanty n the clusterng process, we propose a clusterng algorthm wth the goal of mnmzng the expected sum of squared errors E(SSE). Notce that a data obect x s specfed by an uncertanty regon wth an uncertanty pdf f(x ). Gven a set of clusters, C s the expected SSE can be calculated as follow: k E c x = 1 C k (4) = E c x = 1 C ( ) = k = 1 C c x f ( x ) dx

5 Cluster means are gven by: 1 c = E x C C 1 = C 1 = C C C E( x ) x f ( x ) dx (5) We now propose a new K-means algorthm, called UK-means, for clusterng uncertan data. 1. Assgn ntal values for cluster means c 1 to c K. repeat 3. for = 1 to n do 4. Assgn each data pont x to cluster C where E( c - x ) s the mnmum. 5. end for 6. for = 1 to K do 7. Recalculate cluster mean c of cluster C 8. end for 9. untl convergence 10. return C The man dfference between UK-mean clusterng and K-means clusterng les n the computaton of dstance and clusters. In partcular, UK-means compute the expected dstance and cluster centrods based on the data uncertanty model. Agan, convergence can be defned based on dfferent crtera. Note that f the convergence s based on squared error, E(SSE) as n Equaton (4) should be used nstead of SSE. In Step 4, t s often dffcult to determne E( c - x ) algebracally. In partcular, the varety of geometrc shapes of uncertanty regons (e.g., lne, crcle) and dfferent uncertanty pdf mply that numercal ntegraton methods are necessary. In vew of ths, E( c - x ), whch s easer to obtan, s used nstead. Ths allows us to determne the cluster assgnment (.e., Step 4) usng a smple algebrac expresson. 5. A Case Study and Evaluaton 5.1 Clusterng Data wth Lne-movng Uncertanty The UK-means algorthm presented n the last secton s applcable to any uncertanty regon and pdf. To demonstrate the feasblty of the approach, we descrbe how the proposed algorthm can be appled to uncertanty models specfc to movng obects that are movng n a two-dmensonal space. We also present the evaluaton results of the algorthm. The algorthm was appled to a model wth the undrectonal lne-movng uncertanty, whch requres that each obect s locaton s unformly dstrbuted n a lne segment along the lne of movement n one drecton. Suppose we have a centrod c = (p, q) and a data obect x specfed by a lne uncertanty regon wth a unform dstrbuton. Let the end ponts of the lne segment uncertanty be (a,b) and (c,d). The lne equaton can be parametrzed by (a + t (c - a), b + t (d - b)), where t s between [0,1]. Let the uncertanty pdf be f(t). Also, let the dstance of the lne segment uncertanty be D = ( c a) + ( d b). We have: E( c - x ) f ( t)( D t + Bt + C) dt (6) = 1 0

6 where B = [(c - a) (a - p) + (d - b) (b - q)] C = (p - a) + (q - b) If f(t) s unform, then f(t) = 1, and the above becomes: E(dstance of lne uncertanty from centrod )= D B + + C (7) 3 We are thus able to compute the expected squared dstance easly for lne-movng uncertanty for unform dstrbuton. These formulae can be readly used by the UK-means algorthm to decde the assgnment of clusters. Nonetheless, the use of unform dstrbuton s only a specfc example here. When the pdf s are not unform (e.g., Gaussan), samplng technques can be used to estmate E( c - x ). 5. Experments Experments were conducted to evaluate the performance of UK-means. The goal s to study whether the ncluson of data uncertanty mproves clusterng qualty. We smulate the followng scenaro: a system that tracks the locatons of a set of movng obects has taken a snapshot of the whereabouts of the obects. Ths locaton data s stored n a set called recorded. Each obect assumes an uncertanty model. Let uncertanty captures such uncertanty nformaton. We compare two clusterng approaches: (1) apply K-means to recorded and () apply UK-means to recorded + uncertanty. More specfcally, we frst generated a set of random data ponts n a 100 x 100 D space as recorded. For each data pont, we then randomly generated ts uncertanty accordng to the undrectonal lneuncertanty model. The uncertanty specfcaton (uncertanty) of an obect contans the type of the uncertanty (bdrectonal lne), the maxmum dstance d that the obect can move, and the drecton that the obect can move. The actual locatons of the obects were then generated based on recorded and uncertanty, smulatng the scenaro that the obects have moved away from ther orgnal locatons as regstered n recorded. Specfcally, for each data pont, we took ts poston n recorded and then generated a random number to decde the dstance that the obect should have moved. If t s free-movng (crcle) uncertanty or bdrectonal uncertanty, we generated another random number to see whch drecton the obect should move. We use actual to denote the set of actual locatons of the obects. Ideally, a system should know actual and apply K-means on the actual locatons. Although not practcal, such clusterng result serves as a good reference for the qualty of clusterng result. Hence, we compute and compare the cluster output of the followng data sets: (1) recorded (usng classcal K-means) () recorded + uncertanty (usng UK-means) (3) actual (usng classcal K-means) In order to verfy the ablty of the UK-means algorthm n generatng a set of clusters close to the ones generated from actual, we apply a wdely-used metrc known as the Adusted Rand Index (ARI), whch measures the smlarty between clusterng results [16]. A hgher ARI value ndcates a hgher degree of smlarty between two clusters. We wll compare the ARI between the sets of clusters created n () and (3) and the ARI between those created n (1) and (3). Three parameters, namely number of obects (n), number of clusters (K), and the maxmum dstance an obect can move (d), were vared durng the experment. Table 1 shows the dfferent experment results by varyng d whle keepng n = 1000 and K = 0. Under each set of dfferent parameter settngs, 500 rounds were run. In each round, the sets of recorded, uncertanty, and actual were frst generated and the same set of data was used for the three clusterng processes. The same set of ntal centrods were also used n each of the three processes n order to avod any bas resultng from the ntal condtons of the K-means and UK-means algorthms. In each round, both K-means (n (1) and

7 (3)) and UK-means (n ()) were allowed to run untl there was no change n cluster membershp for all obects n two consecutve teratons, or when the number of teratons reached The ARI values and tme elapsed were averaged across the 500 runs for UK-means and K-means, respectvely. As can be seen from Table 1, the UK-means algorthm consstently showed a hgher ARI than the tradtonal K-means algorthm appled on the recorded data. Parwse t-tests were conducted and the results showed that the dfference n the ARI values of the two methods was sgnfcant for all settng (p < for every case). The results demonstrated that the clusters produced by the UK-means algorthm were more smlar to those clusters obtaned from the real-world data. In other words, the UK-means algorthm can gve a set of clusters that could be a better predcton of the clusters that would be produced f the real-world data were avalable. Table 1. Experment results D ARI (UK-means) ARI (K-means) Improvement % of mprovement % 10.03% 13.84% 0.8% 44.34% % In terms of effcency, we found that the UK-means algorthm requres more computatonal tme than K-means, but t often only requred a reasonable amount of extra tme, whch s well ustfed snce the clusterng qualty s hgher when uncertanty s consdered. We further conducted experments by varyng n, K, and d for dfferent values, whle keepng the other varables constant. In all cases, we found that the UK-means algorthm showed mprovement over the tradtonal K-means algorthm, and the dfferences were all statstcally sgnfcant (as shown by the t- test result n each case). Our prelmnary results showed that the mprovement of the UK-means algorthm s hgher when the degree of uncertanty (as represented by d) ncreases. On the other hand, the number of obects and number of clusters do not have a large effect on the performance of the UK-means algorthm, except when the number of clusters s very small. 6. Concluson and Future Work Tradtonal data mnng algorthms do not consder uncertanty nherent n a data tem and can produce ncorrect mnng results that do not correspond to the real-world data. In ths paper we propose a taxonomy of research n the area of uncertan data mnng. We also present the UK-means algorthm as a case study and llustrate how the proposed algorthm can be appled. Wth the ncreasng complexty of real-world data brought by advanced sensor devces, we beleve that uncertan data mnng s an mportant and sgnfcant research area. Acknowledgement We would lke to thank Jackey Ng (Unversty of Hong Kong), Davd Cheung (Unversty of Hong Kong), Edward Hung (Hong Kong Polytechnc Unversty), and Kevn Yp (Yale Unversty) for ther valuable suggestons.

8 References 1. Barbara, D., Garca-Molna, H. and Porter, D. The Management of Probablstc Data, IEEE Transactons on Knowledge and Data Engneerng, 4(5), Bezdek, J. C. Pattern Recognton wth Fuzzy Obectve Functon Algorthms. Plenum Press, New York (1981). 3. Cheng, R., Kalashnkov, D., and Prabhakar, S. Evaluatng Probablstc Queres over Imprecse Data, Proceedngs of the ACM SIGMOD Internatonal Conference on Management of Data, June Cheng, R., Kalashnkov, D., and Prabhakar, S. Queryng Imprecse Data n Movng Obect Envronments, IEEE Transactons on Knowledge and Data Engneerng, 16(9) (004) Cheng, R., Xa, X., Prabhakar, S., Shah, R. and Vtter, J. Effcent Indexng Methods for Probablstc Threshold Queres over Uncertan Data, Proceedngs of VLDB, de Souza, R. M. C. R. and de Carvalho, F. de A. T. Clusterng of Interval Data Based on Cty Block Dstances, Pattern Recognton Letters, 5 (004) Dunn, J. C. A Fuzzy Relatve of the ISODATA Process and Its Use n Detectng Compact Well-Separated Clusters, Journal of Cybernetcs, 3 (1973) Hamdan, H. and Govaert, G. Mxture Model Clusterng of Uncertan Data, IEEE Internatonal Conference on Fuzzy Systems (005) Ichno, M., Yaguch, H. Generalzed Mnkowsk Metrcs for Mxed Feature Type Data Analyss, IEEE Transactons on Systems, Man and Cybernetcs, 4(4) (1994) Jan, A. and Dubes, R. Algorthms for Clusterng Data. Prentce Hall, New Jersey (1988). 11. Nlesh N. D. and Sucu, D. Effcent Query Evaluaton on Probablstc Databases, VLDB (004) Pfoser D. and Jensen, C. Capturng the Uncertanty of Movng-obects Representatons, Proceedngs of the SSDBM Conference, 13 13, Ruspn, E. H. A New Approach to Clusterng, Informaton Control, 15(1) (1969) Sato, M., Sato, Y., and Jan, L. Fuzzy Clusterng Models and Applcatons. Physca-Verlag, Hedelberg (1997). 15. Wolfson, O., Sstla, P., Chamberlan, S. and Yesha, Y. Updatng and Queryng Databases that Track Moble Unts, Dstrbuted and Parallel Databases, 7(3), Yeung, K. and Ruzzo, W. An Emprcal Study on Prncpal Component Analyss for Clusterng Gene Expresson Data, Bonformatcs, 17(9) (001)

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications CMSC828G Prncples of Data Mnng Lecture #9 Today s Readng: HMS, chapter 9 Today s Lecture: Descrptve Modelng Clusterng Algorthms Descrptve Models model presents the man features of the data, a global summary

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

A DATA MINING APPLICATION IN A STUDENT DATABASE

A DATA MINING APPLICATION IN A STUDENT DATABASE JOURNAL OF AERONAUTICS AND SPACE TECHNOLOGIES JULY 005 VOLUME NUMBER (53-57) A DATA MINING APPLICATION IN A STUDENT DATABASE Şenol Zafer ERDOĞAN Maltepe Ünversty Faculty of Engneerng Büyükbakkalköy-Istanbul

More information

Conversion between the vector and raster data structures using Fuzzy Geographical Entities

Conversion between the vector and raster data structures using Fuzzy Geographical Entities Converson between the vector and raster data structures usng Fuzzy Geographcal Enttes Cdála Fonte Department of Mathematcs Faculty of Scences and Technology Unversty of Combra, Apartado 38, 3 454 Combra,

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm Document Clusterng Analyss Based on Hybrd PSO+K-means Algorthm Xaohu Cu, Thomas E. Potok Appled Software Engneerng Research Group, Computatonal Scences and Engneerng Dvson, Oak Rdge Natonal Laboratory,

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble

More information

"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *

Research Note APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES * Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

A Secure Password-Authenticated Key Agreement Using Smart Cards

A Secure Password-Authenticated Key Agreement Using Smart Cards A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms

Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms Internatonal Journal of Appled Informaton Systems (IJAIS) ISSN : 2249-0868 Foundaton of Computer Scence FCS, New York, USA Volume 7 No.7, August 2014 www.jas.org Cluster Analyss of Data Ponts usng Parttonng

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

Mining Multiple Large Data Sources

Mining Multiple Large Data Sources The Internatonal Arab Journal of Informaton Technology, Vol. 7, No. 3, July 2 24 Mnng Multple Large Data Sources Anmesh Adhkar, Pralhad Ramachandrarao 2, Bhanu Prasad 3, and Jhml Adhkar 4 Department of

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

A machine vision approach for detecting and inspecting circular parts

A machine vision approach for detecting and inspecting circular parts A machne vson approach for detectng and nspectng crcular parts Du-Mng Tsa Machne Vson Lab. Department of Industral Engneerng and Management Yuan-Ze Unversty, Chung-L, Tawan, R.O.C. E-mal: edmtsa@saturn.yzu.edu.tw

More information

A Performance Analysis of View Maintenance Techniques for Data Warehouses

A Performance Analysis of View Maintenance Techniques for Data Warehouses A Performance Analyss of Vew Mantenance Technques for Data Warehouses Xng Wang Dell Computer Corporaton Round Roc, Texas Le Gruenwald The nversty of Olahoma School of Computer Scence orman, OK 739 Guangtao

More information

A Dynamic Load Balancing for Massive Multiplayer Online Game Server

A Dynamic Load Balancing for Massive Multiplayer Online Game Server A Dynamc Load Balancng for Massve Multplayer Onlne Game Server Jungyoul Lm, Jaeyong Chung, Jnryong Km and Kwanghyun Shm Dgtal Content Research Dvson Electroncs and Telecommuncatons Research Insttute Daejeon,

More information

A Comparative Study of Data Clustering Techniques

A Comparative Study of Data Clustering Techniques A COMPARATIVE STUDY OF DATA CLUSTERING TECHNIQUES A Comparatve Study of Data Clusterng Technques Khaled Hammouda Prof. Fakhreddne Karray Unversty of Waterloo, Ontaro, Canada Abstract Data clusterng s a

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

Properties of Indoor Received Signal Strength for WLAN Location Fingerprinting

Properties of Indoor Received Signal Strength for WLAN Location Fingerprinting Propertes of Indoor Receved Sgnal Strength for WLAN Locaton Fngerprntng Kamol Kaemarungs and Prashant Krshnamurthy Telecommuncatons Program, School of Informaton Scences, Unversty of Pttsburgh E-mal: kakst2,prashk@ptt.edu

More information

Brigid Mullany, Ph.D University of North Carolina, Charlotte

Brigid Mullany, Ph.D University of North Carolina, Charlotte Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background: SPEE Recommended Evaluaton Practce #6 efnton of eclne Curve Parameters Background: The producton hstores of ol and gas wells can be analyzed to estmate reserves and future ol and gas producton rates and

More information

A Simple Approach to Clustering in Excel

A Simple Approach to Clustering in Excel A Smple Approach to Clusterng n Excel Aravnd H Center for Computatonal Engneerng and Networng Amrta Vshwa Vdyapeetham, Combatore, Inda C Rajgopal Center for Computatonal Engneerng and Networng Amrta Vshwa

More information

Implementations of Web-based Recommender Systems Using Hybrid Methods

Implementations of Web-based Recommender Systems Using Hybrid Methods Internatonal Journal of Computer Scence & Applcatons Vol. 3 Issue 3, pp 52-64 2006 Technomathematcs Research Foundaton Implementatons of Web-based Recommender Systems Usng Hybrd Methods Janusz Sobeck Insttute

More information

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

Fault tolerance in cloud technologies presented as a service

Fault tolerance in cloud technologies presented as a service Internatonal Scentfc Conference Computer Scence 2015 Pavel Dzhunev, PhD student Fault tolerance n cloud technologes presented as a servce INTRODUCTION Improvements n technques for vrtualzaton and performance

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS 21 22 September 2007, BULGARIA 119 Proceedngs of the Internatonal Conference on Informaton Technologes (InfoTech-2007) 21 st 22 nd September 2007, Bulgara vol. 2 INVESTIGATION OF VEHICULAR USERS FAIRNESS

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsn-yng Wu b a Professor (Management Scence), Natonal Chao

More information

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining Rsk Model of Long-Term Producton Schedulng n Open Pt Gold Mnng R Halatchev 1 and P Lever 2 ABSTRACT Open pt gold mnng s an mportant sector of the Australan mnng ndustry. It uses large amounts of nvestments,

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

Abstract. Clustering ensembles have emerged as a powerful method for improving both the

Abstract. Clustering ensembles have emerged as a powerful method for improving both the Clusterng Ensembles: {topchyal, Models jan, of punch}@cse.msu.edu Consensus and Weak Parttons * Alexander Topchy, Anl K. Jan, and Wllam Punch Department of Computer Scence and Engneerng, Mchgan State Unversty

More information

An interactive system for structure-based ASCII art creation

An interactive system for structure-based ASCII art creation An nteractve system for structure-based ASCII art creaton Katsunor Myake Henry Johan Tomoyuk Nshta The Unversty of Tokyo Nanyang Technologcal Unversty Abstract Non-Photorealstc Renderng (NPR), whose am

More information

Period and Deadline Selection for Schedulability in Real-Time Systems

Period and Deadline Selection for Schedulability in Real-Time Systems Perod and Deadlne Selecton for Schedulablty n Real-Tme Systems Thdapat Chantem, Xaofeng Wang, M.D. Lemmon, and X. Sharon Hu Department of Computer Scence and Engneerng, Department of Electrcal Engneerng

More information

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy 4.02 Quz Solutons Fall 2004 Multple-Choce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multple-choce questons. For each queston, only one of the answers s correct.

More information

APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT

APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT Toshhko Oda (1), Kochro Iwaoka (2) (1), (2) Infrastructure Systems Busness Unt, Panasonc System Networks Co., Ltd. Saedo-cho

More information

Efficient Striping Techniques for Variable Bit Rate Continuous Media File Servers æ

Efficient Striping Techniques for Variable Bit Rate Continuous Media File Servers æ Effcent Strpng Technques for Varable Bt Rate Contnuous Meda Fle Servers æ Prashant J. Shenoy Harrck M. Vn Department of Computer Scence, Department of Computer Scences, Unversty of Massachusetts at Amherst

More information

Performance Analysis of Energy Consumption of Smartphone Running Mobile Hotspot Application

Performance Analysis of Energy Consumption of Smartphone Running Mobile Hotspot Application Internatonal Journal of mart Grd and lean Energy Performance Analyss of Energy onsumpton of martphone Runnng Moble Hotspot Applcaton Yun on hung a chool of Electronc Engneerng, oongsl Unversty, 511 angdo-dong,

More information

RELIABILITY, RISK AND AVAILABILITY ANLYSIS OF A CONTAINER GANTRY CRANE ABSTRACT

RELIABILITY, RISK AND AVAILABILITY ANLYSIS OF A CONTAINER GANTRY CRANE ABSTRACT Kolowrock Krzysztof Joanna oszynska MODELLING ENVIRONMENT AND INFRATRUCTURE INFLUENCE ON RELIABILITY AND OPERATION RT&A # () (Vol.) March RELIABILITY RIK AND AVAILABILITY ANLYI OF A CONTAINER GANTRY CRANE

More information

Realistic Image Synthesis

Realistic Image Synthesis Realstc Image Synthess - Combned Samplng and Path Tracng - Phlpp Slusallek Karol Myszkowsk Vncent Pegoraro Overvew: Today Combned Samplng (Multple Importance Samplng) Renderng and Measurng Equaton Random

More information

Extending Probabilistic Dynamic Epistemic Logic

Extending Probabilistic Dynamic Epistemic Logic Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set

More information

Implementation of Deutsch's Algorithm Using Mathcad

Implementation of Deutsch's Algorithm Using Mathcad Implementaton of Deutsch's Algorthm Usng Mathcad Frank Roux The followng s a Mathcad mplementaton of Davd Deutsch's quantum computer prototype as presented on pages - n "Machnes, Logc and Quantum Physcs"

More information

Traffic State Estimation in the Traffic Management Center of Berlin

Traffic State Estimation in the Traffic Management Center of Berlin Traffc State Estmaton n the Traffc Management Center of Berln Authors: Peter Vortsch, PTV AG, Stumpfstrasse, D-763 Karlsruhe, Germany phone ++49/72/965/35, emal peter.vortsch@ptv.de Peter Möhl, PTV AG,

More information

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION NEURO-FUZZY INFERENE SYSTEM FOR E-OMMERE WEBSITE EVALUATION Huan Lu, School of Software, Harbn Unversty of Scence and Technology, Harbn, hna Faculty of Appled Mathematcs and omputer Scence, Belarusan State

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

Inter-Ing 2007. INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007.

Inter-Ing 2007. INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007. Inter-Ing 2007 INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007. UNCERTAINTY REGION SIMULATION FOR A SERIAL ROBOT STRUCTURE MARIUS SEBASTIAN

More information

v a 1 b 1 i, a 2 b 2 i,..., a n b n i.

v a 1 b 1 i, a 2 b 2 i,..., a n b n i. SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 455 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces we have studed thus far n the text are real vector spaces snce the scalars are

More information

Cluster Analysis. Cluster Analysis

Cluster Analysis. Cluster Analysis Cluster Analyss Cluster Analyss What s Cluster Analyss? Types of Data n Cluster Analyss A Categorzaton of Maor Clusterng Methos Parttonng Methos Herarchcal Methos Densty-Base Methos Gr-Base Methos Moel-Base

More information

Calculating the high frequency transmission line parameters of power cables

Calculating the high frequency transmission line parameters of power cables < ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,

More information

Ants Can Schedule Software Projects

Ants Can Schedule Software Projects Ants Can Schedule Software Proects Broderck Crawford 1,2, Rcardo Soto 1,3, Frankln Johnson 4, and Erc Monfroy 5 1 Pontfca Unversdad Católca de Valparaíso, Chle FrstName.Name@ucv.cl 2 Unversdad Fns Terrae,

More information

Learning from Multiple Outlooks

Learning from Multiple Outlooks Learnng from Multple Outlooks Maayan Harel Department of Electrcal Engneerng, Technon, Hafa, Israel She Mannor Department of Electrcal Engneerng, Technon, Hafa, Israel maayanga@tx.technon.ac.l she@ee.technon.ac.l

More information

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000 Problem Set 5 Solutons 1 MIT s consderng buldng a new car park near Kendall Square. o unversty funds are avalable (overhead rates are under pressure and the new faclty would have to pay for tself from

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

SIMPLE LINEAR CORRELATION

SIMPLE LINEAR CORRELATION SIMPLE LINEAR CORRELATION Smple lnear correlaton s a measure of the degree to whch two varables vary together, or a measure of the ntensty of the assocaton between two varables. Correlaton often s abused.

More information

1. Measuring association using correlation and regression

1. Measuring association using correlation and regression How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a

More information

Damage detection in composite laminates using coin-tap method

Damage detection in composite laminates using coin-tap method Damage detecton n composte lamnates usng con-tap method S.J. Km Korea Aerospace Research Insttute, 45 Eoeun-Dong, Youseong-Gu, 35-333 Daejeon, Republc of Korea yaeln@kar.re.kr 45 The con-tap test has the

More information

A Replication-Based and Fault Tolerant Allocation Algorithm for Cloud Computing

A Replication-Based and Fault Tolerant Allocation Algorithm for Cloud Computing A Replcaton-Based and Fault Tolerant Allocaton Algorthm for Cloud Computng Tork Altameem Dept of Computer Scence, RCC, Kng Saud Unversty, PO Box: 28095 11437 Ryadh-Saud Araba Abstract The very large nfrastructure

More information

Improved SVM in Cloud Computing Information Mining

Improved SVM in Cloud Computing Information Mining Internatonal Journal of Grd Dstrbuton Computng Vol.8, No.1 (015), pp.33-40 http://dx.do.org/10.1457/jgdc.015.8.1.04 Improved n Cloud Computng Informaton Mnng Lvshuhong (ZhengDe polytechnc college JangSu

More information

A Fast Incremental Spectral Clustering for Large Data Sets

A Fast Incremental Spectral Clustering for Large Data Sets 2011 12th Internatonal Conference on Parallel and Dstrbuted Computng, Applcatons and Technologes A Fast Incremental Spectral Clusterng for Large Data Sets Tengteng Kong 1,YeTan 1, Hong Shen 1,2 1 School

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

Evaluation of Coordination Strategies for Heterogeneous Sensor Networks Aiming at Surveillance Applications

Evaluation of Coordination Strategies for Heterogeneous Sensor Networks Aiming at Surveillance Applications Evaluaton of Coordnaton Strateges for Heterogeneous Sensor Networs Amng at Survellance Applcatons Edson Pgnaton de Fretas, *, Tales Hemfarth*, Carlos Eduardo Perera*, Armando Morado Ferrera, Flávo Rech

More information

IT09 - Identity Management Policy

IT09 - Identity Management Policy IT09 - Identty Management Polcy Introducton 1 The Unersty needs to manage dentty accounts for all users of the Unersty s electronc systems and ensure that users hae an approprate leel of access to these

More information

Efficient mining of group patterns from user movement data

Efficient mining of group patterns from user movement data Data & Knowledge Engneerng 57 (2006) 240 282 www.elsever.com/locate/datak Effcent mnng of group patterns from user movement data Yda Wang a, Ee-Peng Lm a, *, San-Yh Hwang b a Centre for Advanced Informaton

More information

BUSINESS PROCESS PERFORMANCE MANAGEMENT USING BAYESIAN BELIEF NETWORK. 0688, dskim@ssu.ac.kr

BUSINESS PROCESS PERFORMANCE MANAGEMENT USING BAYESIAN BELIEF NETWORK. 0688, dskim@ssu.ac.kr Proceedngs of the 41st Internatonal Conference on Computers & Industral Engneerng BUSINESS PROCESS PERFORMANCE MANAGEMENT USING BAYESIAN BELIEF NETWORK Yeong-bn Mn 1, Yongwoo Shn 2, Km Jeehong 1, Dongsoo

More information

Enterprise Master Patient Index

Enterprise Master Patient Index Enterprse Master Patent Index Healthcare data are captured n many dfferent settngs such as hosptals, clncs, labs, and physcan offces. Accordng to a report by the CDC, patents n the Unted States made an

More information

RESEARCH ON DUAL-SHAKER SINE VIBRATION CONTROL. Yaoqi FENG 1, Hanping QIU 1. China Academy of Space Technology (CAST) yaoqi.feng@yahoo.

RESEARCH ON DUAL-SHAKER SINE VIBRATION CONTROL. Yaoqi FENG 1, Hanping QIU 1. China Academy of Space Technology (CAST) yaoqi.feng@yahoo. ICSV4 Carns Australa 9- July, 007 RESEARCH ON DUAL-SHAKER SINE VIBRATION CONTROL Yaoq FENG, Hanpng QIU Dynamc Test Laboratory, BISEE Chna Academy of Space Technology (CAST) yaoq.feng@yahoo.com Abstract

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

MAC Layer Service Time Distribution of a Fixed Priority Real Time Scheduler over 802.11

MAC Layer Service Time Distribution of a Fixed Priority Real Time Scheduler over 802.11 Internatonal Journal of Software Engneerng and Its Applcatons Vol., No., Aprl, 008 MAC Layer Servce Tme Dstrbuton of a Fxed Prorty Real Tme Scheduler over 80. Inès El Korb Ecole Natonale des Scences de

More information

Research Article Enhanced Two-Step Method via Relaxed Order of α-satisfactory Degrees for Fuzzy Multiobjective Optimization

Research Article Enhanced Two-Step Method via Relaxed Order of α-satisfactory Degrees for Fuzzy Multiobjective Optimization Hndaw Publshng Corporaton Mathematcal Problems n Engneerng Artcle ID 867836 pages http://dxdoorg/055/204/867836 Research Artcle Enhanced Two-Step Method va Relaxed Order of α-satsfactory Degrees for Fuzzy

More information

Optimal Choice of Random Variables in D-ITG Traffic Generating Tool using Evolutionary Algorithms

Optimal Choice of Random Variables in D-ITG Traffic Generating Tool using Evolutionary Algorithms Optmal Choce of Random Varables n D-ITG Traffc Generatng Tool usng Evolutonary Algorthms M. R. Mosav* (C.A.), F. Farab* and S. Karam* Abstract: Impressve development of computer networks has been requred

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

HowHow to Find the Best Online Stock Broker

HowHow to Find the Best Online Stock Broker A GENERAL APPROACH FOR SECURITY MONITORING AND PREVENTIVE CONTROL OF NETWORKS WITH LARGE WIND POWER PRODUCTION Helena Vasconcelos INESC Porto hvasconcelos@nescportopt J N Fdalgo INESC Porto and FEUP jfdalgo@nescportopt

More information

320 The Internatonal Arab Journal of Informaton Technology, Vol. 5, No. 3, July 2008 Comparsons Between Data Clusterng Algorthms Osama Abu Abbas Computer Scence Department, Yarmouk Unversty, Jordan Abstract:

More information

Enabling P2P One-view Multi-party Video Conferencing

Enabling P2P One-view Multi-party Video Conferencing Enablng P2P One-vew Mult-party Vdeo Conferencng Yongxang Zhao, Yong Lu, Changja Chen, and JanYn Zhang Abstract Mult-Party Vdeo Conferencng (MPVC) facltates realtme group nteracton between users. Whle P2P

More information

Bayesian Cluster Ensembles

Bayesian Cluster Ensembles Bayesan Cluster Ensembles Hongjun Wang 1, Hanhua Shan 2 and Arndam Banerjee 2 1 Informaton Research Insttute, Southwest Jaotong Unversty, Chengdu, Schuan, 610031, Chna 2 Department of Computer Scence &

More information

Performance Management and Evaluation Research to University Students

Performance Management and Evaluation Research to University Students 631 A publcaton of CHEMICAL ENGINEERING TRANSACTIONS VOL. 46, 2015 Guest Edtors: Peyu Ren, Yancang L, Hupng Song Copyrght 2015, AIDIC Servz S.r.l., ISBN 978-88-95608-37-2; ISSN 2283-9216 The Italan Assocaton

More information

Efficient Project Portfolio as a tool for Enterprise Risk Management

Efficient Project Portfolio as a tool for Enterprise Risk Management Effcent Proect Portfolo as a tool for Enterprse Rsk Management Valentn O. Nkonov Ural State Techncal Unversty Growth Traectory Consultng Company January 5, 27 Effcent Proect Portfolo as a tool for Enterprse

More information

Software project management with GAs

Software project management with GAs Informaton Scences 177 (27) 238 241 www.elsever.com/locate/ns Software project management wth GAs Enrque Alba *, J. Francsco Chcano Unversty of Málaga, Grupo GISUM, Departamento de Lenguajes y Cencas de

More information

Fast Fuzzy Clustering of Web Page Collections

Fast Fuzzy Clustering of Web Page Collections Fast Fuzzy Clusterng of Web Page Collectons Chrstan Borgelt and Andreas Nürnberger Dept. of Knowledge Processng and Language Engneerng Otto-von-Guercke-Unversty of Magdeburg Unverstätsplatz, D-396 Magdeburg,

More information

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information