Comparing Improved Versions of K-Means and Subtractive Clustering in a Tracking Application
|
|
|
- Frederick Rodney Franklin
- 10 years ago
- Views:
Transcription
1 Comparing Improved Versions of K-Means and Subtractive Clustering in a Tracking Application Marta Marrón Romera, Miguel Angel Sotelo Vázquez, and Juan Carlos García García Electronics Department, University of Alcala, Edificio Politécnico, Campus Universitario Alcalá de Henares, 28871, Madrid, Spain {marta,sotelo,jcarlos}@depeca.uah.es Abstract. A partitional and a fuzzy clustering algorithm are compared in this paper in terms of accuracy, robustness and efficiency. 3D position data extracted from a stereo-vision system have to be clustered to use them in a tracking application in which a particle filter is the kernel of the estimation task. K-Means and Subtractive algorithms have been modified and enriched with a validation process in order improve its functionality in the tracking system. Comparisons and conclusions of the clustering results both in a stand-alone process and in the proposed tracking task are shown in the paper. Keywords: clustering, probabilistic-deterministic, particle-filters, tracking. 1 Introduction Clustering algorithms are used in a large amount of applications ([1], [2], [3]). In artificial vision, these processes are particularly useful as compress visual data generating classes that contain more accurate and robust environmental information. In the application developed by the authors in [4], information about the 3D position of a variable number of objects in an indoor environment is obtained from a stereo-vision system and used as measurement vector of a probabilistic estimation process, in a multi-obstacle detection and tracking process. This information is clustered in 2D before using it in the probabilistic tracker, as it is explained in [4]. It has been proven ([5]) that an association algorithm is needed in order to make robust the multi-tracking process. Most of the association solutions are based on the Probabilistic Data Association (PDA) theory [6], such as the Joint Probabilistic Particle Filter (JPDAF) like in [7], but the computational load of these techniques is generally a problem for the real time execution of the tracking algorithms. On the other hand, some of the solutions presented by other authors ([8]) lack of robustness if the measurements used in the estimation process are not wisely preprocessed. The use of a clustering process solves the association problems. Fig. 1 includes a flowchart describing the global tracking application. In the same figure, an image showing the 3D position data extracted from the objects to track in a real experiment is also included, as white dots. It has to be remarked that, though 3D position information of the objects to be tracked is available, the height coordinate of these data is not useful in the clustering, and thus, the position measurements are cluster in a 2D space. R. Moreno-Díaz et al. (Eds.): EUROCAST 2007, LNCS 4739, pp , Springer-Verlag Berlin Heidelberg 2007
2 718 M. Marrón Romera, M.A. Sotelo Vázquez, and J.C. García García Estimation Process: Re-initialization Step Estimation Process: Prediction Step Stereo-Vision System Estimation Process: Correction Step Classification Process Estimation Process: Selection Step Fig. 1. Functional description of the multimodal estimation tracker, and the clustering process implication in it. A frame extracted from a real experiment is also included. In the image the position data points to be clustered are plotted in white. On the other hand, as it can be notice in Fig. 1, position measurements extracted by the stereo-vision system are not equally distributed among all objects in the environment, and hence, the tracking algorithm, and what is of interest in this paper, clustering proposals should be able to manage clusters with very different likelihood. In this paper, two different clustering algorithms are tested and compared in order to find the most adequate solution for the tracking task exposed. The contributions of the clustering proposal to the multi-tracking process are also shown in the paper. 2 Clustering Algorithms As mentioned two different clustering algorithms are compared in their application to the objective pursuit. They are described in the following paragraphs: Modified K-Means : The deterministic solution presented by MacQueen in [9], and afterwards improved ([10]), is modified to classify a variable number of clusters. A flowchart of this clustering proposal is presented in Fig. 2, in which the functionality included to achieve an adaptive behavior of the algorithm to the number of clusters generated by the algorithm is marked with a dashed line. On the other hand, in order to improve the algorithm speed in its recursive performance ([11]), the K-Means segmentation process starts looking for clusters centroides near the predicted values that are calculated with the clustering results in the previous execution of the algorithm. This other improvement is also shown in dashed line in Fig. 2. Modified Subtractive : This algorithm is a fuzzy solution for the clustering task based on the Mountain algorithm [12], and presented afterwards in [13]. The Subtractive algorithm is unlinked, and therefore a process to obtain the each cluster centroide and the members has to be included in the standard algorithm if
3 Comparing Improved Versions of K-Means and Subtractive Clustering 719 Initialization: Cluster centroids prediction Calculate distance from each data to all cluster centroids (dist) dist>distmax Yes New cluster No Assign data to the most similar cluster (nearest centroid) => becomes in one of its members Cluster centroid =data(dist) Recalculate cluster centroids as its members mean location For all data Yes Changes in cluster centroids/number? No Erase clusters without members End Fig. 2. Flowchart of the modified K-Means proposed by the authors. Dashed lines remark the main improvements included in the standard version of the algorithm. needed, as it is in the present application. This improved functionality is shown with dashed lines in Fig. 3. Moreover, the probability of each cluster members is reset in order to decrease the cluster duplication error rate of the standard fuzzy classifier. This process is also included in Fig. 3 with dashed lines. All the modifications described increase the efficiency and accuracy of the basic classifiers. To improve their robustness, a validation process is also added to both clustering algorithms. This procedure is useful when noisy measurements or outliers produce a cluster creation or deletion erroneously. In these situations none of the modified algorithms shown in previous figures behave robustly. The validation process is based in two parameters: Distance between the estimated and the resulting cluster centroide. The clusters centroide estimation process in the modified K-Means is also developed for the Subtractive version of Fig. 3. Estimated centroides are then compared within two consecutive iterations, to obtain a confidence value for clusters validation.
4 720 M. Marrón Romera, M.A. Sotelo Vázquez, and J.C. García García Initialization: Calculate each data density (pdf) Selection of the data with biggest density: pdfmax pdfmax>limit Yes New cluster No End Cluster centroide=data(pdfmax) Obtain cluster members according to the distance to the cluster centroide & Reset members pdf Subtract pdfmax to all densities Fig. 3. Flowchart of the modified Subtractive algorithm proposed by the authors. Dashed lines remark the main improvements included in the standard version of the algorithm. Cluster likelihood. The clusters probability information inherent in Subtractive algorithm is also calculated in K-Means, as a function of the data agglomeration in each cluster. This likelihood value is also used as a validation or confidence parameter in the cluster validation process. In the following sections the results obtained with the clustering proposals described are deeply commented. 3 Comparison In order to extract comparative conclusions about the accuracy, robustness and efficiency of the clustering methods presented, they have been run with different data sets containing 2D position measurements obtained from the objects in the environment during the tracking task. The objective in all situations is that a cluster is created for each object in the scene by the algorithms if some data points related to it are present in the clustered data set. The main conclusions extracted from the global set of experiments (2931 frames of 3 situations of different complexity) are the following: In general K-Means shows higher reliability than Subtractive, with a mean error rate of 3.9% against 6.7%. High accuracy is obtained in the whole testing process, since less than a 1% of errors is due to shifts if K-Means is used, and 1.6% if it is Subtractive. The most recurrent error (half of the global error rate with both algorithms) appears when an object is poorly measured and only a few data points related to it are included in the set. This type of error is generally called a missing error.
5 Comparing Improved Versions of K-Means and Subtractive Clustering 721 The generation of a duplicated class related to the same object is a typical error generated by Subtractive (1.2% error rate against less than 1% in K-Means ). Nevertheless this error rate is doubled if the reset process included as an improvement in the Subtractive algorithm is not used. The validation process proposed rejects almost the 90% of noisy measurements when included in any of the clustering algorithms. In any case, Subtractive clustering behaves better with noisy measurements than K-Means as almost 50% of the global error rate when using K-Means is due to noise. K-Means is less time consuming than Subtractive (a mean execution time of 1.5ms for K-Means versus 17ms for Subtractive 1 ). The execution time of K- Means is decreased almost in a 50% if using the centroide estimation process. Table 1 shows the main errors generated by both clustering proposals in a complex experiment of 1054 frames, in which situations of until 6 static and dynamic classes are included. The results displayed in the table validate the comments exposed in previous paragraphs and confirm the better behavior of the clustering proposal based on the standard K-Means. Table 1. Rate of different types of errors obtained with the proposed versions of K-Means and Subtractive algorithms in a 1054 frames experiment of none until 6 classes K-Means (% frames with error) Subtract (% frames with error) Missing Duplicated Shift as Total Fig. 4 shows the results of two sequential iterations of the clustering proposals in the complex experiment mentioned. Numbers at the top and at the bottom of the images indicate the number of clusters generated by K-Means and Subtractive, respectively. Regular rectangles display the clusters generated by K-Means, and dashed the ones generated with Subtractive. In the image of the right, the shift error generated by Subtractive clustering can be observed in person tacked with number 3. In the left image, a missing error generated by Subtractive with person number 4 that is partially occluded by person number 2, is shown. Results generated by K- Means clustering are right in both cases. The error rate is, in any case, too high to use the output of the clustering process as output of the multi-tracking process. The difference in the likelihood of the measurements sensed by the stereo-vision system for each object in the environment cannot be properly handled by the clustering proposals. The use of a probabilistic estimator that reinforces the probability of poorly sensed objects is therefore justified. 1 The algorithms are run in an Intel Dual Core processor at 1.8GHz with 1Gbyte RAM.
6 722 M. Marrón Romera, M.A. Sotelo Vázquez, and J.C. García García Fig. 4. Two sequential images extracted from a complex experiment. Clustering results are shown with regular rectangles for the K-Means proposal and dashed for the Subtractive one. 4 Application of the Clustering Algorithms to the Tracking Process As mentioned in section 1, the clustering algorithms developed were designed to be used in a multi-tracking application ([4]), in order to increase the tracker robustness. A particle filter (PF) is used as position estimator, and the algorithm is extended and the clustering process is used as association process in order to achieve the desired functionality and efficiency of the multi-tracker. The final algorithm is called Extended Particle Filter with Clustering Process (XPFCP). The functionality of the clustering algorithms in the tracking process is analyzed in detail in [4]. The general idea is to use the process in two different steps of the multimodal estimator (see Fig. 1) as follows: At the re-initialization step, position information compressed within the clusters is used to create new hypotheses in the multimodal estimator. The clustering process is thus used to adapt the tracker functionality to situations with a variable number of objects. At the correction step, the cluster probability is used to reinforce or weaken each object position estimation hypothesis. This procedure increases the robustness of the global tracking task. Both algorithms have been used within the probabilistic tracker mentioned, to test which one is the best solution to increase the multimodal estimator robustness. Table 2 summarizes the results obtaining when comparing both clustering proposals in the XPFCP designed, in the same 1054 frames experiment used to extract results in Table 1. As it can be notice in the table, the solution based on K-Means has better performance than the one based on Subtractive. In a more general set of tests (2931 frames of 3 situations of different complexity) the results are similar to those shown in Table 2: The mean error rate is 1.8% if using K-Means and 3.7% if using Subtractive in the multi-tracking algorithm. Moreover, only missing errors appear if using the K- Means algorithm, while the Subtractive proposal shows also duplicating errors. The mean execution time of the multimodal tracker is 19.1ms when using K- Means, almost four times lower than if Subtractive is used (73.3ms). The reason
7 Comparing Improved Versions of K-Means and Subtractive Clustering 723 for this difference is that the execution time of the estimation process without clustering is comparable to the one of the stand alone version of Subtractive (a mean value of 17.8ms). Outliers do not appear in the tracker results, so the problem of noise has been erased from the tracking application, thanks to the clustering process. Table 2. Rate of different types of errors obtained with the XPFCP proposed in a 1054 frames tracking experiment of none until 6 classes K-Means (% frames with error) Subtract (% frames with error) Missing Duplicated Total All exposed results recommend the use of K-Means in the multi-tracking process. An example of the results achieved with the XPFCP is shown in Fig. 5, where the modified K-Means is used as association process in the multi-object tracker. Tracked objects are displayed with rectangles in the images. This figure proofs the correct functionality of the global proposed tracker also in crowd situations where multiple objects cross and are partially occluded during various iterations. Fig. 5. Three frames in a real tracking sequence showing results generated by the XPFCP in the tracking process. Clusters are plotted with rectangles. 5 Conclusions In this paper two new clustering proposals based on the standards K-Means and Subtractive, are presented. The improved algorithms are thought to be applied in the multi-object tracking system described by the authors in [4], called XPFCP. The clustering process is used as association algorithm in the multimodal estimator. On the other hand, the use of clustered measurements increases the tracker robustness. The two clustering proposals performance is compared and some interesting conclusions are extracted: The proposal based on K-Means shows higher reliability than the one based on Subtractive.
8 724 M. Marrón Romera, M.A. Sotelo Vázquez, and J.C. García García Subtractive behaves better with noisy measurements than K-Means, but the validation process included in both clustering proposals solves successfully the robustness problems generated by outliers. The execution time of Subtractive is almost 10 times bigger than the K-Means one, and comparable to the multimodal estimator one. All these conclusions recommend the use of the modified K-Means in the XPFCP, and the success of the resultant multi-tracking algorithm is also demonstrated in the paper. Acknowledgments. This work has been financed by the Spanish administration (CICYT: DPI C03-02). References 1. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31(nº 3), (1999) 2. Berkhin, P.: Survey of clustering data mining techniques. Technical Report (2002) 3. Everitt, B., Landau, S., Leese, M.: Cluster analysis. Edward Arnold Publishers, 4th Edition, London, ISBN: (2001) 4. Marrón, M., Sotelo, M.A., García, J.C., Fernández, D., Pizarro, D.: XPFCP: An extended particle filter for tracking multiple and dynamic objects in complex environments. In: Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS05), Edmonton, pp (2005), ISBN: Isard, M., Blake, C.A.: Conditional density propagation for visual tracking. International Journal of Computer Vision 29(nº1), 5 28 (1998) 6. Bar-Shalom, Y., Fortmann, T.: Tracking and data association. Mathematics in Science and Engineering, vol Academic Press, London (1988) 7. Schulz, D., Burgard, W., Fox, D., Cremers, A.B.: People tracking with mobile robots using sample-based joint probabilistic data association filters. International Journal of Robotics Research 22 (nº 2), (2003) 8. Koller-Meier, E.B., Ade, F.: Tracking multiple objects using a condensation algorithm. Journal of Robotics and Autonomous Systems 34, (2001) 9. MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Proceedings of Fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, vol. 1, pp (1967) 10. Pelleg, D., Moore, A.: X-means: Extending k-means with efficient estimation of the number of clusters. In: Proceedings of the Seventeenth International Conference on Machine Learning (ICML00), San Francisco, pp (2000) (ISBN: ) 11. Weiss, Y.: Belief propagation and revision in networks with loops. Technical Report 1616, Artificial Intelligence Laboratory, Massachusetts Institute of Technology (1997) 12. Yager, R.R., Filev, D.: Generation of fuzzy rules by mountain clustering. Journal of Intelligent and Fuzzy Systems 2(nº 3), (1994) 13. Chiu, S.L.: Fuzzy model identification based on cluster estimation. Journal of Intelligent & Fuzzy Systems 2(nº 3), (1994)
Vision-Based Blind Spot Detection Using Optical Flow
Vision-Based Blind Spot Detection Using Optical Flow M.A. Sotelo 1, J. Barriga 1, D. Fernández 1, I. Parra 1, J.E. Naranjo 2, M. Marrón 1, S. Alvarez 1, and M. Gavilán 1 1 Department of Electronics, University
Speed Performance Improvement of Vehicle Blob Tracking System
Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA [email protected], [email protected] Abstract. A speed
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
A Comparative Study of clustering algorithms Using weka tools
A Comparative Study of clustering algorithms Using weka tools Bharat Chaudhari 1, Manan Parikh 2 1,2 MECSE, KITRC KALOL ABSTRACT Data clustering is a process of putting similar data into groups. A clustering
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS Aswin C Sankaranayanan, Qinfen Zheng, Rama Chellappa University of Maryland College Park, MD - 277 {aswch, qinfen, rama}@cfar.umd.edu Volkan Cevher, James
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Context-aware taxi demand hotspots prediction
Int. J. Business Intelligence and Data Mining, Vol. 5, No. 1, 2010 3 Context-aware taxi demand hotspots prediction Han-wen Chang, Yu-chin Tai and Jane Yung-jen Hsu* Department of Computer Science and Information
A Performance Study of Load Balancing Strategies for Approximate String Matching on an MPI Heterogeneous System Environment
A Performance Study of Load Balancing Strategies for Approximate String Matching on an MPI Heterogeneous System Environment Panagiotis D. Michailidis and Konstantinos G. Margaritis Parallel and Distributed
Robust Outlier Detection Technique in Data Mining: A Univariate Approach
Robust Outlier Detection Technique in Data Mining: A Univariate Approach Singh Vijendra and Pathak Shivani Faculty of Engineering and Technology Mody Institute of Technology and Science Lakshmangarh, Sikar,
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Deterministic Sampling-based Switching Kalman Filtering for Vehicle Tracking
Proceedings of the IEEE ITSC 2006 2006 IEEE Intelligent Transportation Systems Conference Toronto, Canada, September 17-20, 2006 WA4.1 Deterministic Sampling-based Switching Kalman Filtering for Vehicle
Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization
Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization Wolfram Burgard, Maren Bennewitz, Diego Tipaldi, Luciano Spinello 1 Motivation Recall: Discrete filter Discretize
Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin
Data Mining for Customer Service Support Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin Traditional Hotline Services Problem Traditional Customer Service Support (manufacturing)
Real-time Visual Tracker by Stream Processing
Real-time Visual Tracker by Stream Processing Simultaneous and Fast 3D Tracking of Multiple Faces in Video Sequences by Using a Particle Filter Oscar Mateo Lozano & Kuzahiro Otsuka presented by Piotr Rudol
Using Data Mining for Mobile Communication Clustering and Characterization
Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer
Tracking and Recognition in Sports Videos
Tracking and Recognition in Sports Videos Mustafa Teke a, Masoud Sattari b a Graduate School of Informatics, Middle East Technical University, Ankara, Turkey [email protected] b Department of Computer
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
K-Means Clustering Tutorial
K-Means Clustering Tutorial By Kardi Teknomo,PhD Preferable reference for this tutorial is Teknomo, Kardi. K-Means Clustering Tutorials. http:\\people.revoledu.com\kardi\ tutorial\kmean\ Last Update: July
MapReduce Approach to Collective Classification for Networks
MapReduce Approach to Collective Classification for Networks Wojciech Indyk 1, Tomasz Kajdanowicz 1, Przemyslaw Kazienko 1, and Slawomir Plamowski 1 Wroclaw University of Technology, Wroclaw, Poland Faculty
Tracking Groups of Pedestrians in Video Sequences
Tracking Groups of Pedestrians in Video Sequences Jorge S. Marques Pedro M. Jorge Arnaldo J. Abrantes J. M. Lemos IST / ISR ISEL / IST ISEL INESC-ID / IST Lisbon, Portugal Lisbon, Portugal Lisbon, Portugal
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS 1 AND ALGORITHMS Chiara Renso KDD-LAB ISTI- CNR, Pisa, Italy WHAT IS CLUSTER ANALYSIS? Finding groups of objects such that the objects in a group will be similar
A Study of Web Log Analysis Using Clustering Techniques
A Study of Web Log Analysis Using Clustering Techniques Hemanshu Rana 1, Mayank Patel 2 Assistant Professor, Dept of CSE, M.G Institute of Technical Education, Gujarat India 1 Assistant Professor, Dept
Clustering Marketing Datasets with Data Mining Techniques
Clustering Marketing Datasets with Data Mining Techniques Özgür Örnek International Burch University, Sarajevo [email protected] Abdülhamit Subaşı International Burch University, Sarajevo [email protected]
Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network
General Article International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347-5161 2014 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Impelling
A Novel Approach for Network Traffic Summarization
A Novel Approach for Network Traffic Summarization Mohiuddin Ahmed, Abdun Naser Mahmood, Michael J. Maher School of Engineering and Information Technology, UNSW Canberra, ACT 2600, Australia, [email protected],[email protected],M.Maher@unsw.
Determining optimal window size for texture feature extraction methods
IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec
Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining
Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 8 by Tan, Steinbach, Kumar 1 What is Cluster Analysis? Finding groups of objects such that the objects in a group will
Tracking of Small Unmanned Aerial Vehicles
Tracking of Small Unmanned Aerial Vehicles Steven Krukowski Adrien Perkins Aeronautics and Astronautics Stanford University Stanford, CA 94305 Email: [email protected] Aeronautics and Astronautics Stanford
Clustering. 15-381 Artificial Intelligence Henry Lin. Organizing data into clusters such that there is
Clustering 15-381 Artificial Intelligence Henry Lin Modified from excellent slides of Eamonn Keogh, Ziv Bar-Joseph, and Andrew Moore What is Clustering? Organizing data into clusters such that there is
DEA implementation and clustering analysis using the K-Means algorithm
Data Mining VI 321 DEA implementation and clustering analysis using the K-Means algorithm C. A. A. Lemos, M. P. E. Lins & N. F. F. Ebecken COPPE/Universidade Federal do Rio de Janeiro, Brazil Abstract
Data Mining Analytics for Business Intelligence and Decision Support
Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing
Standardization of Components, Products and Processes with Data Mining
B. Agard and A. Kusiak, Standardization of Components, Products and Processes with Data Mining, International Conference on Production Research Americas 2004, Santiago, Chile, August 1-4, 2004. Standardization
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
SoSe 2014: M-TANI: Big Data Analytics
SoSe 2014: M-TANI: Big Data Analytics Lecture 4 21/05/2014 Sead Izberovic Dr. Nikolaos Korfiatis Agenda Recap from the previous session Clustering Introduction Distance mesures Hierarchical Clustering
Standardization and Its Effects on K-Means Clustering Algorithm
Research Journal of Applied Sciences, Engineering and Technology 6(7): 399-3303, 03 ISSN: 040-7459; e-issn: 040-7467 Maxwell Scientific Organization, 03 Submitted: January 3, 03 Accepted: February 5, 03
Data Mining Project Report. Document Clustering. Meryem Uzun-Per
Data Mining Project Report Document Clustering Meryem Uzun-Per 504112506 Table of Content Table of Content... 2 1. Project Definition... 3 2. Literature Survey... 3 3. Methods... 4 3.1. K-means algorithm...
Tracking of Moving Objects from a Moving Vehicle Using a Scanning Laser Rangefinder
Tracking of Moving Objects from a Moving Vehicle Using a Scanning Laser Rangefinder Robert A. MacLachlan, Member, IEEE, Christoph Mertz Abstract The capability to use a moving sensor to detect moving objects
STOCK MARKET TRENDS USING CLUSTER ANALYSIS AND ARIMA MODEL
Stock Asian-African Market Trends Journal using of Economics Cluster Analysis and Econometrics, and ARIMA Model Vol. 13, No. 2, 2013: 303-308 303 STOCK MARKET TRENDS USING CLUSTER ANALYSIS AND ARIMA MODEL
Visualizing non-hierarchical and hierarchical cluster analyses with clustergrams
Visualizing non-hierarchical and hierarchical cluster analyses with clustergrams Matthias Schonlau RAND 7 Main Street Santa Monica, CA 947 USA Summary In hierarchical cluster analysis dendrogram graphs
Edge tracking for motion segmentation and depth ordering
Edge tracking for motion segmentation and depth ordering P. Smith, T. Drummond and R. Cipolla Department of Engineering University of Cambridge Cambridge CB2 1PZ,UK {pas1001 twd20 cipolla}@eng.cam.ac.uk
A Robust Multiple Object Tracking for Sport Applications 1) Thomas Mauthner, Horst Bischof
A Robust Multiple Object Tracking for Sport Applications 1) Thomas Mauthner, Horst Bischof Institute for Computer Graphics and Vision Graz University of Technology, Austria {mauthner,bischof}@icg.tu-graz.ac.at
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca
Clustering Adrian Groza Department of Computer Science Technical University of Cluj-Napoca Outline 1 Cluster Analysis What is Datamining? Cluster Analysis 2 K-means 3 Hierarchical Clustering What is Datamining?
Multi-Modal Acoustic Echo Canceller for Video Conferencing Systems
Multi-Modal Acoustic Echo Canceller for Video Conferencing Systems Mario Gazziro,Guilherme Almeida,Paulo Matias, Hirokazu Tanaka and Shigenobu Minami ICMC/USP, Brazil Email: [email protected] Wernher
Distributed Framework for Data Mining As a Service on Private Cloud
RESEARCH ARTICLE OPEN ACCESS Distributed Framework for Data Mining As a Service on Private Cloud Shraddha Masih *, Sanjay Tanwani** *Research Scholar & Associate Professor, School of Computer Science &
PLAANN as a Classification Tool for Customer Intelligence in Banking
PLAANN as a Classification Tool for Customer Intelligence in Banking EUNITE World Competition in domain of Intelligent Technologies The Research Report Ireneusz Czarnowski and Piotr Jedrzejowicz Department
Use of Data Mining Techniques to Improve the Effectiveness of Sales and Marketing
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 4, April 2015,
Linear programming approach for online advertising
Linear programming approach for online advertising Igor Trajkovski Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Rugjer Boshkovikj 16, P.O. Box 393, 1000 Skopje,
Cluster Analysis. Alison Merikangas Data Analysis Seminar 18 November 2009
Cluster Analysis Alison Merikangas Data Analysis Seminar 18 November 2009 Overview What is cluster analysis? Types of cluster Distance functions Clustering methods Agglomerative K-means Density-based Interpretation
STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
There are a number of different methods that can be used to carry out a cluster analysis; these methods can be classified as follows:
Statistics: Rosie Cornish. 2007. 3.1 Cluster Analysis 1 Introduction This handout is designed to provide only a brief introduction to cluster analysis and how it is done. Books giving further details are
Detecting E-mail Spam Using Spam Word Associations
Detecting E-mail Spam Using Spam Word Associations N.S. Kumar 1, D.P. Rana 2, R.G.Mehta 3 Sardar Vallabhbhai National Institute of Technology, Surat, India 1 [email protected] 2 [email protected]
BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL
The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
Dynamic Data in terms of Data Mining Streams
International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining
Visualization of Breast Cancer Data by SOM Component Planes
International Journal of Science and Technology Volume 3 No. 2, February, 2014 Visualization of Breast Cancer Data by SOM Component Planes P.Venkatesan. 1, M.Mullai 2 1 Department of Statistics,NIRT(Indian
An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset
P P P Health An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset Peng Liu 1, Elia El-Darzi 2, Lei Lei 1, Christos Vasilakis 2, Panagiotis Chountas 2, and Wei Huang
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
Specific Usage of Visual Data Analysis Techniques
Specific Usage of Visual Data Analysis Techniques Snezana Savoska 1 and Suzana Loskovska 2 1 Faculty of Administration and Management of Information systems, Partizanska bb, 7000, Bitola, Republic of Macedonia
PERFORMANCE ANALYSIS OF CLUSTERING ALGORITHMS IN DATA MINING IN WEKA
PERFORMANCE ANALYSIS OF CLUSTERING ALGORITHMS IN DATA MINING IN WEKA Prakash Singh 1, Aarohi Surya 2 1 Department of Finance, IIM Lucknow, Lucknow, India 2 Department of Computer Science, LNMIIT, Jaipur,
Comparing large datasets structures through unsupervised learning
Comparing large datasets structures through unsupervised learning Guénaël Cabanes and Younès Bennani LIPN-CNRS, UMR 7030, Université de Paris 13 99, Avenue J-B. Clément, 93430 Villetaneuse, France [email protected]
INTERNET TRAFFIC CLUSTERING USING PACKET HEADER INFORMATION
INTERNET TRAFFIC CLUSTERING USING PACKET HEADER INFORMATION Pekka Kumpulainen 1, Kimmo Hätönen 2, Olli Knuuti 3 and Teemu Alapaholuoma 1 1 Pori Unit, Tampere University of Technology, Pori, Finland 2 CTO
ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION
ISSN 9 X INFORMATION TECHNOLOGY AND CONTROL, 00, Vol., No.A ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION Danuta Zakrzewska Institute of Computer Science, Technical
Support Vector Machine-Based Human Behavior Classification in Crowd through Projection and Star Skeletonization
Journal of Computer Science 6 (9): 1008-1013, 2010 ISSN 1549-3636 2010 Science Publications Support Vector Machine-Based Human Behavior Classification in Crowd through Projection and Star Skeletonization
Research of Postal Data mining system based on big data
3rd International Conference on Mechatronics, Robotics and Automation (ICMRA 2015) Research of Postal Data mining system based on big data Xia Hu 1, Yanfeng Jin 1, Fan Wang 1 1 Shi Jiazhuang Post & Telecommunication
A Review of Anomaly Detection Techniques in Network Intrusion Detection System
A Review of Anomaly Detection Techniques in Network Intrusion Detection System Dr.D.V.S.S.Subrahmanyam Professor, Dept. of CSE, Sreyas Institute of Engineering & Technology, Hyderabad, India ABSTRACT:In
An Introduction to Data Mining
An Introduction to Intel Beijing [email protected] January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
Removing Moving Objects from Point Cloud Scenes
1 Removing Moving Objects from Point Cloud Scenes Krystof Litomisky [email protected] Abstract. Three-dimensional simultaneous localization and mapping is a topic of significant interest in the research
E-Learning Using Data Mining. Shimaa Abd Elkader Abd Elaal
E-Learning Using Data Mining Shimaa Abd Elkader Abd Elaal -10- E-learning using data mining Shimaa Abd Elkader Abd Elaal Abstract Educational Data Mining (EDM) is the process of converting raw data from
How To Cluster
Data Clustering Dec 2nd, 2013 Kyrylo Bessonov Talk outline Introduction to clustering Types of clustering Supervised Unsupervised Similarity measures Main clustering algorithms k-means Hierarchical Main
LVQ Plug-In Algorithm for SQL Server
LVQ Plug-In Algorithm for SQL Server Licínia Pedro Monteiro Instituto Superior Técnico [email protected] I. Executive Summary In this Resume we describe a new functionality implemented
Summary Data Mining & Process Mining (1BM46) Content. Made by S.P.T. Ariesen
Summary Data Mining & Process Mining (1BM46) Made by S.P.T. Ariesen Content Data Mining part... 2 Lecture 1... 2 Lecture 2:... 4 Lecture 3... 7 Lecture 4... 9 Process mining part... 13 Lecture 5... 13
Real-Time Camera Tracking Using a Particle Filter
Real-Time Camera Tracking Using a Particle Filter Mark Pupilli and Andrew Calway Department of Computer Science University of Bristol, UK {pupilli,andrew}@cs.bris.ac.uk Abstract We describe a particle
Modeling and Design of Intelligent Agent System
International Journal of Control, Automation, and Systems Vol. 1, No. 2, June 2003 257 Modeling and Design of Intelligent Agent System Dae Su Kim, Chang Suk Kim, and Kee Wook Rim Abstract: In this study,
Automatic Traffic Estimation Using Image Processing
Automatic Traffic Estimation Using Image Processing Pejman Niksaz Science &Research Branch, Azad University of Yazd, Iran [email protected] Abstract As we know the population of city and number of
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS B.K. Mohan and S. N. Ladha Centre for Studies in Resources Engineering IIT
Real time vehicle detection and tracking on multiple lanes
Real time vehicle detection and tracking on multiple lanes Kristian Kovačić Edouard Ivanjko Hrvoje Gold Department of Intelligent Transportation Systems Faculty of Transport and Traffic Sciences University
Learning outcomes. Knowledge and understanding. Competence and skills
Syllabus Master s Programme in Statistics and Data Mining 120 ECTS Credits Aim The rapid growth of databases provides scientists and business people with vast new resources. This programme meets the challenges
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Tracking People with a 360-Degree Lidar
Tracking People with a 360-Degree Lidar John Shackleton and Brian VanVoorst Raytheon BBN Technologies Saint Louis Park MN 55416 [email protected] Joel Hesch University of Minnesota Minneapolis, MN
A comparison of various clustering methods and algorithms in data mining
Volume :2, Issue :5, 32-36 May 2015 www.allsubjectjournal.com e-issn: 2349-4182 p-issn: 2349-5979 Impact Factor: 3.762 R.Tamilselvi B.Sivasakthi R.Kavitha Assistant Professor A comparison of various clustering
IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS
IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS Alexander Velizhev 1 (presenter) Roman Shapovalov 2 Konrad Schindler 3 1 Hexagon Technology Center, Heerbrugg, Switzerland 2 Graphics & Media
Real-Time Tracking of Pedestrians and Vehicles
Real-Time Tracking of Pedestrians and Vehicles N.T. Siebel and S.J. Maybank. Computational Vision Group Department of Computer Science The University of Reading Reading RG6 6AY, England Abstract We present
Real-Time Indoor Mapping for Mobile Robots with Limited Sensing
Real-Time Indoor Mapping for Mobile Robots with Limited Sensing Ying Zhang, Juan Liu Gabriel Hoffmann Palo Alto Research Center Palo Alto, California 94304 Email: [email protected] Mark Quilling, Kenneth
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Enhancing Quality of Data using Data Mining Method
JOURNAL OF COMPUTING, VOLUME 2, ISSUE 9, SEPTEMBER 2, ISSN 25-967 WWW.JOURNALOFCOMPUTING.ORG 9 Enhancing Quality of Data using Data Mining Method Fatemeh Ghorbanpour A., Mir M. Pedram, Kambiz Badie, Mohammad
A Survey on Intrusion Detection System with Data Mining Techniques
A Survey on Intrusion Detection System with Data Mining Techniques Ms. Ruth D 1, Mrs. Lovelin Ponn Felciah M 2 1 M.Phil Scholar, Department of Computer Science, Bishop Heber College (Autonomous), Trichirappalli,
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING
AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations
Large-Scale Data Sets Clustering Based on MapReduce and Hadoop
Journal of Computational Information Systems 7: 16 (2011) 5956-5963 Available at http://www.jofcis.com Large-Scale Data Sets Clustering Based on MapReduce and Hadoop Ping ZHOU, Jingsheng LEI, Wenjun YE
