Size: px
Start display at page:

Download ""

Transcription

1 320 The Internatonal Arab Journal of Informaton Technology, Vol. 5, No. 3, July 2008 Comparsons Between Data Clusterng Algorthms Osama Abu Abbas Computer Scence Department, Yarmouk Unversty, Jordan Abstract: Clusterng s a dvson of data nto groups of smlar objects. Each group, called a cluster, conssts of objects that are smlar between themselves and dssmlar compared to objects of other groups. Ths paper s ntended to study and compare dfferent data clusterng algorthms. The algorthms under nvestgaton are: k-means algorthm, herarchcal clusterng algorthm, self-organzng maps algorthm, and expectaton maxmzaton clusterng algorthm. All these algorthms are compared accordng to the followng factors: sze of dataset,, type of dataset and type of software used. Some conclusons that are extracted belong to the performance, qualty, and accuracy of the clusterng algorthms. Keywords: Clusterng, k-means algorthm, herarchcal clusterng algorthm, self-organzng maps algorthm, expectaton maxmzaton clusterng algorthm. Receved January 18, 2007 ; accepted May 2, Introducton Clusterng s a dvson of data nto groups of smlar objects. Each group, called cluster, conssts of objects that are smlar amongst themselves and dssmlar compared to objects of other groups. Representng data by fewer necessarly loses certan fne detals, but acheves smplfcaton. It represents many data objects by few, and hence, t models data by ts [3]. Cluster analyss s the organzaton of a collecton of patterns (usually represented as a vector of measurements, or a pont n a multdmensonal space) nto based on smlarty. Patterns wthn a vald cluster are more smlar to each other than they are to a pattern belongng to a dfferent cluster. It s mportant to understand the dfference between clusterng (unsupervsed classfcaton) and dscrmnate analyss (supervsed classfcaton). In supervsed classfcaton, we are provded wth a collecton of labeled (preclassfed) patterns; the problem s to label a newly encountered, yet unlabeled, pattern. Typcally, the gven labeled (tranng) patterns are used to learn the descrptons of classes whch n turn are used to label a new pattern. In the case of clusterng, the problem s to group a gven collecton of unlabeled patterns nto meanngful. In a sense, labels are assocated wth also, but these category labels are data drven; that s, they are obtaned solely from the data [5, 7, 8, 13]. Some researchers mproved some data clusterng algorthms, others mplemented new ones, and some others studed and compared dfferent data clusterng algorthms. Followng are some of the prevous studes that consdered the effect of dfferent factors on the performance of some data clusterng algorthms and compared the results. However, these studes dffer from my analyss n the algorthms and the factors: [1] appled several ndces to evaluate the performance of clusterng algorthms, ncludng herarchcal clusterng, k-means, PAM and SOM. The ndces were homogenety and separaton scores, slhouette wdth, redundant score (based on redundant genes), and WADP (testng the robustness of clusterng results after small perturbaton). [9] descrbed the mplementaton of an out-of-core technque for the data analyss of very large data sets wth the sequental and parallel verson of the clusterng algorthm AutoClass. They dscussed the out-of-core technque and showed performance results n terms of executon tme and speed up. [8] employed an agglomeratve algorthm to construct a dendrogram and used a smple dstnctness heurstc to extract a partton of the data. They studed the performance of Smlarty- Based Agglomeratve Clusterng (SBAC) algorthm on real and artfcally generated data sets. They demonstrated the effectveness of ths algorthm n unsupervsed dscovery tasks. They llustrated the superor performance of ths approach by makng comparsons wth other clusterng schemes. [10] showed how to perform some typcal mnng tasks usng conceptual graphs as formal but meanngful representatons of texts. Ther methods nvolved qualtatve and quanttatve comparsons of conceptual graphs, conceptual clusterng, buldng a conceptual herarchy, and applcaton of data mnng technques to ths herarchy n order to detect nterestng assocatons and devatons. Ther experments showed that, despte wdespread

2 Comparsons Between Data Clusterng Algorthms 321 dsbelef, detaled meanngful mnng wth conceptual graphs s computatonally affordable. [12] compared two graph-colorng programs: one exact and another based on heurstcs whch can gve, however, provably exact results on some types of graphs. They proved that the exact graph colorng s not necessary for hgh-qualty functonal decomposers. Comparson of ther expermental results wth competng decomposers shows that for nearly all benchmarks ther solutons are the best and tme s usually not too hgh. Jan and Dubes [1988] and Dubes [1993] used a relatve test to compare two structures and to measure ther relatve mert. They also dscussed n detal the ndces that are used for ths comparson. In ths paper dfferent data clusterng algorthms that have not been consdered before are compared accordng to dfferent factors that haven't been studed yet. 2. How Algorthms are Implemented? An extensve web search s done to fnd some data clusterng algorthms mplementaton to test on. After selecton, I ended up wth two of them: Software: It s publc doman software made avalable from MIT Lncoln Laboratory [6]. It s located at the followng ste: ( TreeVew Software: TreeVew are programs that provde a computatonal and graphcal envronment for analyzng data from dfferent datasets [2]. It s located at the followng ste: ( The reasons behnd choosng these two software are: They are the most popular software for mplementng dfferent data clusterng algorthms. They are very powerful n mplementng data clusterng algorthms. They mplement the four data clusterng algorthms that are chosen n ths paper. They have deal dataset as a part of them whch can be used for testng and mplementng the algorthms Data Sample The dataset that s used to test the clusterng algorthms and compare among them s obtaned from the ste: ( or from another ste, whch s, ( Ths s a good dataset to test tme seres clusterng algorthms because eucldean dstance wll not be able to acheve perfect accuracy. In partcular the followng pars of classes wll often be confused (normal/ cyclc) (decreasng trend/ downward shft) and (ncreasng trend/ upward shft). Ths dataset s stored n an ASCII fle, 600 rows, 60 columns, wth a sngle chart per lne. The classes are organzed as follows: Normal Cyclc Increasng trend Decreasng trend Upward shft Downward shft However, ths format s not totally sutable for the two packages used to compare the clusterng algorthms ( Software and TreeVew Software). So some reasonable changes are done n the dataset format to be acceptable for the packages. The dataset formats of the two software are revewed by referrng to the manuals of the software and the changes that are done to the dataset formats do not affect the dataset tself at all. Also, a part of ths data set (200 rows and 20 columns) s taken as an nput fle for the clusterng algorthms to study the sze of the datasets (huge and small datasets) on these algorthms. Fnally, the clusterng algorthms are tested usng the dataset stored n the two packages themselves to study the effect of dfferent datasets on the algorthms Whch Algorthms are Compared? Four dfferent clusterng algorthms are chosen to nvestgate, study, and compare them. The algorthms that are chosen are: k-means algorthm, herarchcal clusterng algorthm, Self-Organzaton Map (SOM) algorthm and Expectaton Maxmzaton (EM) clusterng algorthm. The general reasons for selectng these four algorthms are: Popularty. Flexblty. Applcablty. Handlng hgh dmensonalty. However, detaled reasons behnd selectng every algorthm are lsted n the context. In ths secton, for every algorthm some dea s gven about t; how t works and the reasons for choosng t K-means Algorthm K-means s a well-known parttonng method. Objects are classfed as belongng to one of k groups, k chosen a pror. Cluster membershp s determned by calculatng the centrod for each group (the multdmensonal verson of the mean) and assgnng each object to the group wth the closest centrod. Ths approach mnmzes the overall wthn-cluster dsperson by teratve reallocaton of cluster members.

3 322 The Internatonal Arab Journal of Informaton Technology, Vol. 5, No. 3, July 2008 In a general sense, a k-parttonng algorthm takes as nput a set S of objects and an nteger k, and outputs a partton of S nto subsets S 1,...S 2, S k. It uses the sum of squares as the optmzaton crteron. Let x r be the r th element of S, S be the elements n S, and d(x r, x s) be the dstance between x r and x s. The sum of squares crteron s defned by the cost functon: c( S S S = ( d( xr,xs 2 (1) r= 1x= 1 ) )) In partcular, k-means works by calculatng the centrod of each cluster S, denoted x, and optmzng the cost functon: c( S S = ( d( x,xr 2 (2) r= 1 ) )) The goal of the algorthm s to mnmze the total cost: c( S ) +... c( S ) (3) + Here, the pseudo code of the k-means algorthm s to explan how t works: A. Choose K as the. B. Intalze the codebook vectors of the K (randomly, for nstance) C. For every new sample vector: C.1. Compute the dstance between the new vector and every cluster's codebook vector. C.2. Recompute the closest codebook vector wth the new vector, usng a learnng rate that decreases n tme. The reason behnd choosng the k-means algorthm to study s ts popularty for the followng reasons: Its tme complexty s O(nkl), where n s the number of patterns, k s the, and l s the teratons taken by the algorthm to converge. Its space complexty s O(k+n). It requres addtonal space to store the data matrx. It s order-ndependent; for a gven ntal seed set of cluster centers, t generates the same partton of the data rrespectve of the order n whch the patterns are presented to the algorthm Herarchcal Clusterng Algorthm Parttonng algorthms are based on specfyng an ntal groups, and teratvely reallocatng objects among groups to convergence. In contrast, herarchcal algorthms combne or dvde exstng groups, creatng a herarchcal structure that reflects the order n whch groups are merged or dvded. In an agglomeratve method, whch bulds the herarchy by mergng, the objects ntally belong to a lst of sngleton sets S 1,..., S 2, S n. Then a cost functon s used to fnd the par of sets {S, S j } from the lst that s the k cheapest to merge. Once merged, S and S j are removed from the lst of sets and replaced wth S S j. Ths process terates untl all objects are n a sngle group. Dfferent varants of agglomeratve herarchcal clusterng algorthms may use dfferent cost functons. Complete lnkage, average lnkage, and sngle lnkage methods use maxmum, average, and mnmum dstances between the members of two, respectvely. Followng s the pseudo code of the herarchcal clusterng algorthm to explan how t works: Compute the proxmty matrx contanng the dstance between each par of patterns. Treat each pattern as a cluster. Fnd the most smlar par of usng the proxmty matrx. Merge these two nto one cluster. Update the proxmty matrx to reflect ths merge operaton. If all patterns are n one cluster, stop. Otherwse, go to step 2. The advantages of the herarchcal clusterng algorthms are the reason ths algorthm was chosen for dscusson. These advantages nclude: Embedded flexblty regardng a level of granularty. Ease of handlng of any forms of smlarty or dstance. Consequently applcablty to any attrbutes types. Herarchcal clusterng algorthms are more versatle Self-Organzaton Map Algorthm Inspred by neural networks n the bran, Self- Organzaton Map (SOM) uses a competton and cooperaton mechansm to acheve unsupervsed learnng. In the classcal SOM, a set of nodes s arranged n a geometrc pattern, typcally 2- dmensonal lattce. Each node s assocated wth a weght vector wth the same dmenson as the nput space. The purpose of SOM s to fnd a good mappng from the hgh dmensonal nput space to the 2-D representaton of the nodes. One way to use SOM for clusterng s to regard the objects n the nput space represented by the same node as grouped nto a cluster. Durng the tranng, each object n the nput s presented to the map and the best matchng node s dentfed. Formally, when nput and weght vectors are normalzed, for nput sample x(t) the wnner ndex c (best match) s dentfed by the condton: for all, x( t ) m ( t ) x( t ) m ( t ) (4) c where t s the tme step n the sequental tranng, m s the weght vector of the th node. After that, weght vectors of nodes around the best-matchng node c= c(x) are updated as:

4 Comparsons Between Data Clusterng Algorthms 323 m ( t + 1) = m ( t ) + ah ( x( t ) m ( t )) (5) c( x ), where α s the learnng rate and h c(x), s the neghborhood functon, a decreasng functon of the dstance between the th and c th nodes on the map grd. To make the map converge quckly, the learnng rate and neghborhood radus are often decreasng functons of t. After the learnng process fnshes, each object s assgned to ts closest node. There are varants of SOM to the above classcal scheme. Followng s the pseudo code of the SOM algorthm to explan how t works: A. Choose the dmenson and sze of the map. B. For every new sample vector: B.1. Compute the dstance between the new vector and every cluster's codebook vector. B.2. Recompute all codebook vectors wth the new vector, usng both a dstance radus on the map and learnng rate that decrease n tme. The followng advantages of SOM are behnd choosng ths algorthm for studyng: Whle the vorono regons of the map unts are convex, the combnaton of several map unts allows the constructon of non-convex. Dfferent knds of dstance measures and jonng crtera can be utlzed to form the bg. It has been successfully used for vector quantzaton and speech recognton. The SOM generates a sub-optmal partton f the ntal weghts are not chosen properly The Expectaton Maxmzaton Clusterng Algorthm Expectaton Maxmzaton (EM) s a well-establshed clusterng algorthm n the statstcs communty. EM s a dstance-based algorthm that assumes the data set can be modeled as a lnear combnaton of multvarate normal dstrbutons and the algorthm fnds the dstrbuton parameters that maxmze a model qualty measure, called log lkelhood. EM s chosen to cluster data for the followng reasons among others: It has a strong statstcal bass. It s lnear n database sze. It s robust to nosy data. It can accept the desred as nput. It can handle hgh dmensonalty. It converges fast gven a good ntalzaton. 3. How Algorthms are Compared? The four clusterng algorthms are compared accordng to the followng factors: The sze of the dataset. Number of the. Type of dataset. Type of software. For each factor, four tests are made, one for each algorthm. For example, accordng to the sze of data, each of the four algorthms: k-means, Herarchcal Clusterng, SOM, and EM s executed twce; frst by tryng a huge dataset and then by tryng a small dataset. Table 1 explans how the four algorthms are compared. The total tmes the algorthms have been executed s 32. For each 8-runs group, the results of the executons are studed and compared. The conclusons are wrtten down. Ths step s repeated for all the factors. Table 1. The factors accordng to whch the algorthms are compared. k- means HC SOM EM Sze of Number of Clusters Type of Ideal Ideal Ideal Ideal Type of Software TreeVew TreeVew TreeVew TreeVew Accordng to the, k (see Table 2), except for herarchcal clusterng, all clusterng algorthms compared here requre settng k n advance (for SOM, k s the nodes n the lattce). Here, the performance of dfferent algorthms for dfferent k s s compared n order to test the performances that are related to k. To smplfy the stuaton and to make the comparsons easer, k s chosen equal to 8, 16, 32, and 64, and the lattces for SOM are the square of them. To compare herarchcal clusterng wth other algorthms, the herarchcal tree s cut at two dfferent levels to obtan correspondng numbers of (8, 16, 32 and 64). As a result, as the value of k becomes greater the performance of SOM algorthm becomes lower. However, the performance of k-means and EM

5 324 The Internatonal Arab Journal of Informaton Technology, Vol. 5, No. 3, July 2008 algorthms become better than herarchcal clusterng algorthm. Table 2. The relatonshp between and the performance of the algorthms. Number Of Performance Clusters (K) SOM k-means EM HCA Accordng to the accuracy (see Table 3), SOM shows more accuracy n classfyng most the objects to ther than other algorthms. But as the number of k becomes greater the accuracy of herarchcal clusterng becomes better untl t reaches the accuracy of SOM algorthm. k-means and EM algorthms have less qualty (accuracy) than the others. However, all the algorthms have some ambguty n some nosy data to be clustered. Table 3. The relatonshp between and the qualty of the algorthms. Number Of Qualty Clusters (K) SOM K-Means EM HCA Accordng to the sze of dataset (see Table 4), a huge dataset s used consstng of 600 rows and 60 columns and a small dataset usng 200 rows and 20 columns. The small dataset s extracted as a subset of the huge dataset. The qualty of EM and k-means algorthms becomes very good when usng a huge dataset. The other two algorthms herarchcal clusterng and SOM algorthms show good results when usng a small dataset. As a concluson, parttonng algorthms (lke k-means and EM) are used for huge dataset whle herarchcal clusterng algorthms are used for small dataset. Table 4. The affect of the data sze on the algorthms. K=32 Data Sze SOM K-Means EM HCA Accordng to the type of dataset (see Table 5), a random dataset s used whch s extracted from the nternet and used for dfferent jobs. On the other hand, an deal dataset s used whch s part of the software ( and TreeVew). It s deal because t s desgned to be sutable for testng and tranng the software tself and havng less nosy data whch leads to ambguty. As a result, herarchcal clusterng and SOM algorthms gve better results than k-means and EM algorthms when usng random dataset and the vce versa. Ths ndcates that k-means and EM algorthms are very senstve for nose n the dataset. Ths nose makes t dffcult for the algorthm to nclude an object n a certan cluster. Ths wll affect the results of the algorthm. However, herarchcal clusterng algorthm s more senstve for nosy dataset than SOM algorthm. Table 5. The affect of the data type on the algorthms. K=32 Data Type SOM K-Means EM HCA Ideal Accordng to the type of software, two packages are used to compare between the algorthms: (UNIX envronment) and cluster and TreeVew (WINDOWS envronment). However, runnng the clusterng algorthms usng any one of them gves almost the same results even when changng any of the other three factors (dataset sze, number and dataset type). Ths, I beleve, s because most software use the same procedures and deas n any algorthm mplemented by them. 4. Conclusons After analyzng the results of testng the clusterng algorthms and runnng them under dfferent factors and stuaton, the followng conclusons are obtaned: As the, k becomes greater; the performance of SOM algorthm becomes lower. The performance of k-means and EM algorthms s better than herarchcal clusterng algorthm. SOM algorthm shows more accuracy n classfyng most the objects nto ther sutable than other algorthms. As the value of k becomes greater, the accuracy of herarchcal clusterng becomes better untl t reaches the accuracy of SOM algorthm. k-means and EM algorthms have less qualty (accuracy) than the others. All the algorthms have some ambguty n some (nosy) data when clustered. The qualty of EM and k-means algorthms become very good when usng huge dataset. Herarchcal clusterng and SOM algorthms show good results when usng small dataset. As a general concluson, parttonng algorthms (lke k-means and EM) are recommended for huge dataset whle herarchcal clusterng algorthms are recommended for small dataset. Herarchcal clusterng and SOM algorthms gve better results compared to k-means and EM algorthms when usng random dataset and the vce versa. k-means and EM algorthms are very senstve for nose n dataset. Ths nose makes t dffcult for the algorthm to cluster an object nto ts sutable cluster. Ths wll affect the results of the algorthm.

6 Comparsons Between Data Clusterng Algorthms 325 Herarchcal clusterng algorthm s more senstve for nosy dataset than SOM algorthm. Runnng the clusterng algorthms usng any software gves almost the same results even when changng any of the factors because most software use the same procedures and deas n any algorthm mplemented by them. 5. Future Work Ths paper was ntended to compare between some data clusterng algorthms. Through my extensve search, I was unable to fnd any study that attempts to compare between the four clusterng algorthms under nvestgaton. As a future work, comparsons between these four algorthms (or may other algorthms) can be attempted accordng to dfferent factors other than those consdered n ths paper. One mportant factor s normalzaton. Comparng between the results of algorthms usng normalzed data or non-normalzed data wll gve dfferent results. Of course normalzaton wll affect the performance of the algorthm and the qualty of the results. Another approach may consder usng data clusterng algorthms n applcatons such as object and character recognton or nformaton retreval whch s concerned wth automatc storage and retreval of documents. References [1] Chen G., Jaradat S., Banerjee N., Tanaka T., Ko M., and Zhang M., Evaluaton and Comparson of Clusterng Algorthms n Analyzng ES Cell Gene Expresson Data, Statstca Snca, vol. 12, pp , [2] Esen M., Tree Vew Manual, Stanford Unversty, [3] Han J. and Kamber M., Data Mnng: Concepts and Technques, Morgan Kaufmann Publshers, [4] Jan A., Murty M., and Flynn P., Data Clusterng: A Revew, ACM Computng Surveys, vol. 31, no. 3, [5] Keogh E., Chakrabart K., Pazzan M., and Mehrotra S., Dmensonalty Reducton for Fast Smlarty Search n Tme Seres Databases, Knowledge and Informaton Systems, vol. 3, pp , [6] Kukolch L. and Lppmann R., User s Gude, MIT Lncoln Laboratory, [7] Lepere R. and Trystram D., A New Clusterng Algorthm for Communcaton Delays, n Proceedngs of 16 th IEEE-ACM Annual Internatonal Parallel and Dstrbuted Processng Symposum (IPDPS 02), Fort Lauderdale, USA, [8] L C. and Bswas G., Unsupervsed Learnng wth Mxed Numerc and Nomnal Data, IEEE Transactons on Knowledge and Data Engneerng, vol. 14, no. 4, pp , [9] Mascar E., Pzzut C., and Ramondo G., Usng an Out-of-Core Technque for Clusterng Data Sets, n Proceedngs of 12 th Internatonal Workshop on Database and Expert Systems Applcatons (DEXA), Munch, Germany, pp , [10] Montes-Y-G Mez M., Gelbukh A., and L Pez- L Pez A., Text Mnng at Detal Level Usng Conceptual Graphs, Lecture Notes n Computer Scence, vol. 2393, pp , [11] Ordonez C. and Cereghn P., SQLEM: Fast Clusterng n SQL usng the EM Algorthm, n Proceedngs of the 2000 ACM SIGMOD Internatonal Conference on Management of Data, Dallas, Unted States, pp , [12] Perkowsk M., Malv R., Grygel S., Burns M., and Mshchenko A., Graph Colorng Algorthms for Fast Evaluaton Of Curts Decompostons, n Proceedngs of the 36 th Desgn Automaton Conference(DAC), ACM, Lousana, pp , [13] Rabov A., Lu Z., Wolf L., Yu S. and Zhang L., Clusterng Algorthms for Content-Based Publcaton-Subscrpton Systems, n Proceedngs of the 22 nd Internatonal Conference on Dstrbuted Computng Systems (ICDCS 02), USA, pp. 133, [14] Zha H., He X., Dng C., Smon H., and Gu M., Bpartte Graph Parttonng and Data Clusterng, n Proceedngs of the 10 th Internatonal Conference on Informaton and Knowledge Management, ACM Press, pp , Osama Abu Abbas s an nstructor n the department of Computer Scence at Yarmouk Unversty. He got the BSc n Computer Scence from Yarmouk Unversty, Jordan n He then got the Master degree n Computer Scence and Informaton from Yarmouk Unversty, Jordan n Hs teachng nterest focus on algorthms desgn and analyss, data structure, compler constructon, and artfcal ntellgent programmng.

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications CMSC828G Prncples of Data Mnng Lecture #9 Today s Readng: HMS, chapter 9 Today s Lecture: Descrptve Modelng Clusterng Algorthms Descrptve Models model presents the man features of the data, a global summary

More information

A DATA MINING APPLICATION IN A STUDENT DATABASE

A DATA MINING APPLICATION IN A STUDENT DATABASE JOURNAL OF AERONAUTICS AND SPACE TECHNOLOGIES JULY 005 VOLUME NUMBER (53-57) A DATA MINING APPLICATION IN A STUDENT DATABASE Şenol Zafer ERDOĞAN Maltepe Ünversty Faculty of Engneerng Büyükbakkalköy-Istanbul

More information

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm Document Clusterng Analyss Based on Hybrd PSO+K-means Algorthm Xaohu Cu, Thomas E. Potok Appled Software Engneerng Research Group, Computatonal Scences and Engneerng Dvson, Oak Rdge Natonal Laboratory,

More information

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

A Simple Approach to Clustering in Excel

A Simple Approach to Clustering in Excel A Smple Approach to Clusterng n Excel Aravnd H Center for Computatonal Engneerng and Networng Amrta Vshwa Vdyapeetham, Combatore, Inda C Rajgopal Center for Computatonal Engneerng and Networng Amrta Vshwa

More information

Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms

Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms Internatonal Journal of Appled Informaton Systems (IJAIS) ISSN : 2249-0868 Foundaton of Computer Scence FCS, New York, USA Volume 7 No.7, August 2014 www.jas.org Cluster Analyss of Data Ponts usng Parttonng

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

Mining Multiple Large Data Sources

Mining Multiple Large Data Sources The Internatonal Arab Journal of Informaton Technology, Vol. 7, No. 3, July 2 24 Mnng Multple Large Data Sources Anmesh Adhkar, Pralhad Ramachandrarao 2, Bhanu Prasad 3, and Jhml Adhkar 4 Department of

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

Web Object Indexing Using Domain Knowledge *

Web Object Indexing Using Domain Knowledge * Web Object Indexng Usng Doman Knowledge * Muyuan Wang Department of Automaton Tsnghua Unversty Bejng 100084, Chna (86-10)51774518 Zhwe L, Le Lu, We-Yng Ma Mcrosoft Research Asa Sgma Center, Hadan Dstrct

More information

A Dynamic Load Balancing for Massive Multiplayer Online Game Server

A Dynamic Load Balancing for Massive Multiplayer Online Game Server A Dynamc Load Balancng for Massve Multplayer Onlne Game Server Jungyoul Lm, Jaeyong Chung, Jnryong Km and Kwanghyun Shm Dgtal Content Research Dvson Electroncs and Telecommuncatons Research Insttute Daejeon,

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION NEURO-FUZZY INFERENE SYSTEM FOR E-OMMERE WEBSITE EVALUATION Huan Lu, School of Software, Harbn Unversty of Scence and Technology, Harbn, hna Faculty of Appled Mathematcs and omputer Scence, Belarusan State

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES Zuzanna BRO EK-MUCHA, Grzegorz ZADORA, 2 Insttute of Forensc Research, Cracow, Poland 2 Faculty of Chemstry, Jagellonan

More information

Fast Fuzzy Clustering of Web Page Collections

Fast Fuzzy Clustering of Web Page Collections Fast Fuzzy Clusterng of Web Page Collectons Chrstan Borgelt and Andreas Nürnberger Dept. of Knowledge Processng and Language Engneerng Otto-von-Guercke-Unversty of Magdeburg Unverstätsplatz, D-396 Magdeburg,

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

Lecture 2: Single Layer Perceptrons Kevin Swingler

Lecture 2: Single Layer Perceptrons Kevin Swingler Lecture 2: Sngle Layer Perceptrons Kevn Sngler kms@cs.str.ac.uk Recap: McCulloch-Ptts Neuron Ths vastly smplfed model of real neurons s also knon as a Threshold Logc Unt: W 2 A Y 3 n W n. A set of synapses

More information

A Comparative Study of Data Clustering Techniques

A Comparative Study of Data Clustering Techniques A COMPARATIVE STUDY OF DATA CLUSTERING TECHNIQUES A Comparatve Study of Data Clusterng Technques Khaled Hammouda Prof. Fakhreddne Karray Unversty of Waterloo, Ontaro, Canada Abstract Data clusterng s a

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

Cluster Analysis. Cluster Analysis

Cluster Analysis. Cluster Analysis Cluster Analyss Cluster Analyss What s Cluster Analyss? Types of Data n Cluster Analyss A Categorzaton of Maor Clusterng Methos Parttonng Methos Herarchcal Methos Densty-Base Methos Gr-Base Methos Moel-Base

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

Performance Analysis and Coding Strategy of ECOC SVMs

Performance Analysis and Coding Strategy of ECOC SVMs Internatonal Journal of Grd and Dstrbuted Computng Vol.7, No. (04), pp.67-76 http://dx.do.org/0.457/jgdc.04.7..07 Performance Analyss and Codng Strategy of ECOC SVMs Zhgang Yan, and Yuanxuan Yang, School

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

Conversion between the vector and raster data structures using Fuzzy Geographical Entities

Conversion between the vector and raster data structures using Fuzzy Geographical Entities Converson between the vector and raster data structures usng Fuzzy Geographcal Enttes Cdála Fonte Department of Mathematcs Faculty of Scences and Technology Unversty of Combra, Apartado 38, 3 454 Combra,

More information

Gender Classification for Real-Time Audience Analysis System

Gender Classification for Real-Time Audience Analysis System Gender Classfcaton for Real-Tme Audence Analyss System Vladmr Khryashchev, Lev Shmaglt, Andrey Shemyakov, Anton Lebedev Yaroslavl State Unversty Yaroslavl, Russa vhr@yandex.ru, shmaglt_lev@yahoo.com, andrey.shemakov@gmal.com,

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

IMPACT ANALYSIS OF A CELLULAR PHONE

IMPACT ANALYSIS OF A CELLULAR PHONE 4 th ASA & μeta Internatonal Conference IMPACT AALYSIS OF A CELLULAR PHOE We Lu, 2 Hongy L Bejng FEAonlne Engneerng Co.,Ltd. Bejng, Chna ABSTRACT Drop test smulaton plays an mportant role n nvestgatng

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

Estimating the Number of Clusters in Genetics of Acute Lymphoblastic Leukemia Data

Estimating the Number of Clusters in Genetics of Acute Lymphoblastic Leukemia Data Journal of Al Azhar Unversty-Gaza (Natural Scences), 2011, 13 : 109-118 Estmatng the Number of Clusters n Genetcs of Acute Lymphoblastc Leukema Data Mahmoud K. Okasha, Khaled I.A. Almghar Department of

More information

Ants Can Schedule Software Projects

Ants Can Schedule Software Projects Ants Can Schedule Software Proects Broderck Crawford 1,2, Rcardo Soto 1,3, Frankln Johnson 4, and Erc Monfroy 5 1 Pontfca Unversdad Católca de Valparaíso, Chle FrstName.Name@ucv.cl 2 Unversdad Fns Terrae,

More information

Data Visualization by Pairwise Distortion Minimization

Data Visualization by Pairwise Distortion Minimization Communcatons n Statstcs, Theory and Methods 34 (6), 005 Data Vsualzaton by Parwse Dstorton Mnmzaton By Marc Sobel, and Longn Jan Lateck* Department of Statstcs and Department of Computer and Informaton

More information

Software project management with GAs

Software project management with GAs Informaton Scences 177 (27) 238 241 www.elsever.com/locate/ns Software project management wth GAs Enrque Alba *, J. Francsco Chcano Unversty of Málaga, Grupo GISUM, Departamento de Lenguajes y Cencas de

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

Design and Development of a Security Evaluation Platform Based on International Standards

Design and Development of a Security Evaluation Platform Based on International Standards Internatonal Journal of Informatcs Socety, VOL.5, NO.2 (203) 7-80 7 Desgn and Development of a Securty Evaluaton Platform Based on Internatonal Standards Yuj Takahash and Yoshm Teshgawara Graduate School

More information

Survey on Virtual Machine Placement Techniques in Cloud Computing Environment

Survey on Virtual Machine Placement Techniques in Cloud Computing Environment Survey on Vrtual Machne Placement Technques n Cloud Computng Envronment Rajeev Kumar Gupta and R. K. Paterya Department of Computer Scence & Engneerng, MANIT, Bhopal, Inda ABSTRACT In tradtonal data center

More information

Improved SVM in Cloud Computing Information Mining

Improved SVM in Cloud Computing Information Mining Internatonal Journal of Grd Dstrbuton Computng Vol.8, No.1 (015), pp.33-40 http://dx.do.org/10.1457/jgdc.015.8.1.04 Improved n Cloud Computng Informaton Mnng Lvshuhong (ZhengDe polytechnc college JangSu

More information

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Internatonal Journal of Electronc Busness Management, Vol. 3, No. 4, pp. 30-30 (2005) 30 THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Yu-Mn Chang *, Yu-Cheh

More information

Abstract. Clustering ensembles have emerged as a powerful method for improving both the

Abstract. Clustering ensembles have emerged as a powerful method for improving both the Clusterng Ensembles: {topchyal, Models jan, of punch}@cse.msu.edu Consensus and Weak Parttons * Alexander Topchy, Anl K. Jan, and Wllam Punch Department of Computer Scence and Engneerng, Mchgan State Unversty

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Statistical Approach for Offline Handwritten Signature Verification

Statistical Approach for Offline Handwritten Signature Verification Journal of Computer Scence 4 (3): 181-185, 2008 ISSN 1549-3636 2008 Scence Publcatons Statstcal Approach for Offlne Handwrtten Sgnature Verfcaton 2 Debnath Bhattacharyya, 1 Samr Kumar Bandyopadhyay, 2

More information

Enterprise Master Patient Index

Enterprise Master Patient Index Enterprse Master Patent Index Healthcare data are captured n many dfferent settngs such as hosptals, clncs, labs, and physcan offces. Accordng to a report by the CDC, patents n the Unted States made an

More information

Dynamic Fuzzy Pattern Recognition

Dynamic Fuzzy Pattern Recognition Dynamc Fuzzy Pattern Recognton Von der Fakultät für Wrtschaftswssenschaften der Rhensch-Westfälschen Technschen Hochschule Aachen zur Erlangung des akademschen Grades enes Doktors der Wrtschafts- und Sozalwssenschaften

More information

Calculating the high frequency transmission line parameters of power cables

Calculating the high frequency transmission line parameters of power cables < ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,

More information

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION

More information

PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS

PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS Yunhong Xu, Faculty of Management and Economcs, Kunmng Unversty of Scence and Technology,

More information

Automated information technology for ionosphere monitoring of low-orbit navigation satellite signals

Automated information technology for ionosphere monitoring of low-orbit navigation satellite signals Automated nformaton technology for onosphere montorng of low-orbt navgaton satellte sgnals Alexander Romanov, Sergey Trusov and Alexey Romanov Federal State Untary Enterprse Russan Insttute of Space Devce

More information

Network Aware Load-Balancing via Parallel VM Migration for Data Centers

Network Aware Load-Balancing via Parallel VM Migration for Data Centers Network Aware Load-Balancng va Parallel VM Mgraton for Data Centers Kun-Tng Chen 2, Chen Chen 12, Po-Hsang Wang 2 1 Informaton Technology Servce Center, 2 Department of Computer Scence Natonal Chao Tung

More information

Fault tolerance in cloud technologies presented as a service

Fault tolerance in cloud technologies presented as a service Internatonal Scentfc Conference Computer Scence 2015 Pavel Dzhunev, PhD student Fault tolerance n cloud technologes presented as a servce INTRODUCTION Improvements n technques for vrtualzaton and performance

More information

Using Series to Analyze Financial Situations: Present Value

Using Series to Analyze Financial Situations: Present Value 2.8 Usng Seres to Analyze Fnancal Stuatons: Present Value In the prevous secton, you learned how to calculate the amount, or future value, of an ordnary smple annuty. The amount s the sum of the accumulated

More information

On-Line Fault Detection in Wind Turbine Transmission System using Adaptive Filter and Robust Statistical Features

On-Line Fault Detection in Wind Turbine Transmission System using Adaptive Filter and Robust Statistical Features On-Lne Fault Detecton n Wnd Turbne Transmsson System usng Adaptve Flter and Robust Statstcal Features Ruoyu L Remote Dagnostcs Center SKF USA Inc. 3443 N. Sam Houston Pkwy., Houston TX 77086 Emal: ruoyu.l@skf.com

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

A Data Mining-Based OLAP Aggregation of. Complex Data: Application on XML Documents

A Data Mining-Based OLAP Aggregation of. Complex Data: Application on XML Documents 1 Runnng head: A DATA MINING-BASED OLAP AGGREGATION A Data Mnng-Based OLAP Aggregaton of Complex Data: Applcaton on XML Documents Radh Ben Messaoud, Omar Boussad, Sabne Loudcher Rabaséda {rbenmessaoud

More information

"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *

Research Note APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES * Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC

More information

MATHEMATICAL ENGINEERING TECHNICAL REPORTS. Sequential Optimizing Investing Strategy with Neural Networks

MATHEMATICAL ENGINEERING TECHNICAL REPORTS. Sequential Optimizing Investing Strategy with Neural Networks MATHEMATICAL ENGINEERING TECHNICAL REPORTS Sequental Optmzng Investng Strategy wth Neural Networks Ryo ADACHI and Akmch TAKEMURA METR 2010 03 February 2010 DEPARTMENT OF MATHEMATICAL INFORMATICS GRADUATE

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Using Content-Based Filtering for Recommendation 1

Using Content-Based Filtering for Recommendation 1 Usng Content-Based Flterng for Recommendaton 1 Robn van Meteren 1 and Maarten van Someren 2 1 NetlnQ Group, Gerard Brandtstraat 26-28, 1054 JK, Amsterdam, The Netherlands, robn@netlnq.nl 2 Unversty of

More information

Master s Thesis. Configuring robust virtual wireless sensor networks for Internet of Things inspired by brain functional networks

Master s Thesis. Configuring robust virtual wireless sensor networks for Internet of Things inspired by brain functional networks Master s Thess Ttle Confgurng robust vrtual wreless sensor networks for Internet of Thngs nspred by bran functonal networks Supervsor Professor Masayuk Murata Author Shnya Toyonaga February 10th, 2014

More information

An Enhanced Super-Resolution System with Improved Image Registration, Automatic Image Selection, and Image Enhancement

An Enhanced Super-Resolution System with Improved Image Registration, Automatic Image Selection, and Image Enhancement An Enhanced Super-Resoluton System wth Improved Image Regstraton, Automatc Image Selecton, and Image Enhancement Yu-Chuan Kuo ( ), Chen-Yu Chen ( ), and Chou-Shann Fuh ( ) Department of Computer Scence

More information

Machine Learning and Software Quality Prediction: As an Expert System

Machine Learning and Software Quality Prediction: As an Expert System I.J. Informaton Engneerng and Electronc Busness, 2014, 2, 9-27 Publshed Onlne Aprl 2014 n MECS (http://www.mecs-press.org/) DOI: 10.5815/jeeb.2014.02.02 Machne Learnng and Software Qualty Predcton: As

More information

A Replication-Based and Fault Tolerant Allocation Algorithm for Cloud Computing

A Replication-Based and Fault Tolerant Allocation Algorithm for Cloud Computing A Replcaton-Based and Fault Tolerant Allocaton Algorthm for Cloud Computng Tork Altameem Dept of Computer Scence, RCC, Kng Saud Unversty, PO Box: 28095 11437 Ryadh-Saud Araba Abstract The very large nfrastructure

More information

A Prefix Code Matching Parallel Load-Balancing Method for Solution-Adaptive Unstructured Finite Element Graphs on Distributed Memory Multicomputers

A Prefix Code Matching Parallel Load-Balancing Method for Solution-Adaptive Unstructured Finite Element Graphs on Distributed Memory Multicomputers Ž. The Journal of Supercomputng, 15, 25 49 2000 2000 Kluwer Academc Publshers. Manufactured n The Netherlands. A Prefx Code Matchng Parallel Load-Balancng Method for Soluton-Adaptve Unstructured Fnte Element

More information

PAS: A Packet Accounting System to Limit the Effects of DoS & DDoS. Debish Fesehaye & Klara Naherstedt University of Illinois-Urbana Champaign

PAS: A Packet Accounting System to Limit the Effects of DoS & DDoS. Debish Fesehaye & Klara Naherstedt University of Illinois-Urbana Champaign PAS: A Packet Accountng System to Lmt the Effects of DoS & DDoS Debsh Fesehaye & Klara Naherstedt Unversty of Illnos-Urbana Champagn DoS and DDoS DDoS attacks are ncreasng threats to our dgtal world. Exstng

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

Optimal Choice of Random Variables in D-ITG Traffic Generating Tool using Evolutionary Algorithms

Optimal Choice of Random Variables in D-ITG Traffic Generating Tool using Evolutionary Algorithms Optmal Choce of Random Varables n D-ITG Traffc Generatng Tool usng Evolutonary Algorthms M. R. Mosav* (C.A.), F. Farab* and S. Karam* Abstract: Impressve development of computer networks has been requred

More information

Distributed Column Subset Selection on MapReduce

Distributed Column Subset Selection on MapReduce Dstrbuted Column Subset Selecton on MapReduce Ahmed K. arahat Ahmed Elgohary Al Ghods Mohamed S. Kamel Unversty of Waterloo Waterloo, Ontaro, Canada N2L 3G1 Emal: {afarahat, aelgohary, aghodsb, mkamel}@uwaterloo.ca

More information

A neuro-fuzzy collaborative filtering approach for Web recommendation. G. Castellano, A. M. Fanelli, and M. A. Torsello *

A neuro-fuzzy collaborative filtering approach for Web recommendation. G. Castellano, A. M. Fanelli, and M. A. Torsello * Internatonal Journal of Computatonal Scence 992-6669 (Prnt) 992-6677 (Onlne) Global Informaton Publsher 27, Vol., No., 27-39 A neuro-fuzzy collaboratve flterng approach for Web recommendaton G. Castellano,

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Customer Segmentation Using Clustering and Data Mining Techniques

Customer Segmentation Using Clustering and Data Mining Techniques Internatonal Journal of Computer Theory and Engneerng, Vol. 5, No. 6, December 2013 Customer Segmentaton Usng Clusterng and Data Mnng Technques Kshana R. Kashwan, Member, IACSIT, and C. M. Velu fronter

More information

Mining Feature Importance: Applying Evolutionary Algorithms within a Web-based Educational System

Mining Feature Importance: Applying Evolutionary Algorithms within a Web-based Educational System Mnng Feature Importance: Applyng Evolutonary Algorthms wthn a Web-based Educatonal System Behrouz MINAEI-BIDGOLI 1, and Gerd KORTEMEYER 2, and Wllam F. PUNCH 1 1 Genetc Algorthms Research and Applcatons

More information

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent

More information

ERP Software Selection Using The Rough Set And TPOSIS Methods

ERP Software Selection Using The Rough Set And TPOSIS Methods ERP Software Selecton Usng The Rough Set And TPOSIS Methods Under Fuzzy Envronment Informaton Management Department, Hunan Unversty of Fnance and Economcs, No. 139, Fengln 2nd Road, Changsha, 410205, Chna

More information

Using Supervised Clustering Technique to Classify Received Messages in 137 Call Center of Tehran City Council

Using Supervised Clustering Technique to Classify Received Messages in 137 Call Center of Tehran City Council Usng Supervsed Clusterng Technque to Classfy Receved Messages n 137 Call Center of Tehran Cty Councl Mahdyeh Haghr 1*, Hamd Hassanpour 2 (1) Informaton Technology engneerng/e-commerce, Shraz Unversty (2)

More information

APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT

APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT Toshhko Oda (1), Kochro Iwaoka (2) (1), (2) Infrastructure Systems Busness Unt, Panasonc System Networks Co., Ltd. Saedo-cho

More information

Detecting Credit Card Fraud using Periodic Features

Detecting Credit Card Fraud using Periodic Features Detectng Credt Card Fraud usng Perodc Features Alejandro Correa Bahnsen, Djamla Aouada, Aleksandar Stojanovc and Björn Ottersten Interdscplnary Centre for Securty, Relablty and Trust Unversty of Luxembourg,

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

A Secure Password-Authenticated Key Agreement Using Smart Cards

A Secure Password-Authenticated Key Agreement Using Smart Cards A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,

More information

Implementations of Web-based Recommender Systems Using Hybrid Methods

Implementations of Web-based Recommender Systems Using Hybrid Methods Internatonal Journal of Computer Scence & Applcatons Vol. 3 Issue 3, pp 52-64 2006 Technomathematcs Research Foundaton Implementatons of Web-based Recommender Systems Usng Hybrd Methods Janusz Sobeck Insttute

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce

WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce WSE-ntegrator: An Automatc ntegrator of Web Search nterfaces for E-Commerce Ha He, Wey Meng Dept. of Computer Scence SUNY at Bnghamton Bnghamton, NY 13902 {hahe,meng}@cs.bnghamton.edu Clement Yu Dept.

More information

Search Efficient Representation of Healthcare Data based on the HL7 RIM

Search Efficient Representation of Healthcare Data based on the HL7 RIM 181 JOURNAL OF COMPUTERS, VOL. 5, NO. 12, DECEMBER 21 Search Effcent Representaton of Healthcare Data based on the HL7 RIM Razan Paul Department of Computer Scence and Engneerng, Bangladesh Unversty of

More information

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000 Problem Set 5 Solutons 1 MIT s consderng buldng a new car park near Kendall Square. o unversty funds are avalable (overhead rates are under pressure and the new faclty would have to pay for tself from

More information

Overview of monitoring and evaluation

Overview of monitoring and evaluation 540 Toolkt to Combat Traffckng n Persons Tool 10.1 Overvew of montorng and evaluaton Overvew Ths tool brefly descrbes both montorng and evaluaton, and the dstncton between the two. What s montorng? Montorng

More information

Rate Monotonic (RM) Disadvantages of cyclic. TDDB47 Real Time Systems. Lecture 2: RM & EDF. Priority-based scheduling. States of a process

Rate Monotonic (RM) Disadvantages of cyclic. TDDB47 Real Time Systems. Lecture 2: RM & EDF. Priority-based scheduling. States of a process Dsadvantages of cyclc TDDB47 Real Tme Systems Manual scheduler constructon Cannot deal wth any runtme changes What happens f we add a task to the set? Real-Tme Systems Laboratory Department of Computer

More information