Labeling of Textured Data with Co-training and Active Learning

Size: px
Start display at page:

Download "Labeling of Textured Data with Co-training and Active Learning"

Transcription

1 M. Turtinen and M. Pietikäinen: Labeling of textured data with co-training and active learning. In Texture 2005: Proceedings of the 4th International Workshop on Texture Analysis and Synthesis, pp , Labeling of Textured Data with Co-training and Active Learning Markus Turtinen and Matti Pietikäinen Machine Vision Group, Electrical and Information Engineering Department P.O.Box 4500, FI University of Oulu, Finland Abstract In this paper, we present a robust texture labeling method that requires minimum user interaction. Initially only a fraction of the textures needs to be manually labeled, and then a co-training procedure is used to automatically label most of the unlabeled samples. Simultanously an active learning framework is used to learn those unlabeled samples that would provide much information for the system if labeled. Samples found by active learning are labeled explicitly with a visualization-based approach which provides a very user-friendly view into the data and enables possibility to learn new classes. In the experiments, the labeling framework is applied to real texture image data for building a training set for a classifier. 1. Introduction A major problem when building a practical computer vision system is to collect an extensive labeled data set for training. Typically it is not difficult to acquire a large amount of images from targets utilizing, for example, multiple cameras or video streaming. Problems arise when one is asked to describe the content of them and label images according to their appearance. Manual labeling of each image is laborious, time consuming and prone to human made errors that make the labeling procedure very critical task in many computer vision applications. Real surfaces are not usually flat but contain variations in some scale. Several real-world computer vision applications might utilize proper texture information derived from images. Much research has been done to describe, model and recognize textural properties of digital images but only little effort has been devoted to the design of robust and user friendly texture data labeling systems. A typical example of application where labeling is required is texture classification. The performance of the analysis is directly connected to the success of labeling. For a human, it is typically very difficult to label textures reliably. Instead of manual labeling it would be preferable if the labeling can be automated as much as possible. This defines a relevant learning problem: consider we want to learn interesting models from the current texture representation and provide labels for the data automatically using this information [14]. On the other hand, we should be able to determine the level of labeling confidence and prevent the system to learn wrong models for textures. To understand better the difficulties of manual labeling of textured samples, let us consider a digital image taken with a typical four megapixel camera from outdoor view. One should put individual labels to each pixel or region indicating classes like sky, grass or trees, for example. It should not be too difficult to manually segment few images into predefined categories. But huge within class variations caused by illumination changes, non-homogeneity of objects, foreshortening, shadowing or occlusions, for example, require a lot of labeled samples per class and one should manually analyze several input images. A straighforward approach for labeling textures would be to ask a human to give labels for a small subset of samples and then automatically provide labels for the rest of the data with supervised learning. This requires that the features used are very effective and initial samples represent all classes reliably. Often these are too strict restrictions in the real-world applications. The use of invariant feature sets, see for example [10], are effective for tacling against intraclass variation and reducing the number of models needed per class, but still labeling is required. We approach the texture labeling problem from the view of learning, utilizing both active learning [18] and cotraining [2]. In active learning, those unlabeled samples that provide most additional information are automatically selected for explicit labeling. In practice, these samples are usually the ones that are difficult to classify. Simultaneously, a great number of samples can be automatically labeled with high confidence utilizing co-training. Several approaches for both active learning and learning from labeled and unlabeled data have been proposed. In active learning new samples to be labeled are typically

2 selected to maximize the performance of the classifier in some way [4, 8]. Some attempts for learning with both labeled and unlabeled data sets have also been done. See, for example, a review of Seeger [15]. Our goal is label textures with only a small human effort. In our framework two different texture feature sets are extracted from images. The feature sets should complement each other. We select to use local binary pattern (LBP) [13] and Gabor [11] features. LBP features are used to capture microstructural information of the textures. Gabor features offer larger spatial support than LBP and describe more macrostructural elements. Feature sets are then used in the co-training algorithm [2] for labeling a part of the samples automatically. Active learning, resembling the one proposed in [18], is used to select the most informative and ambiguous samples for manual labeling. The actual labeling in this phase is made with the visualization-based apporoach [16] utilizing the self-organizing map [9] to cluster the feature data and represent it on a low-dimensional space. Our major contribution is to propose a multi-class texture labeling system that is robust, easy to use and flexible. Textures are modeled with complementary features providing two separate representations of the images. Confidence based co-training is used to automatically label those samples that can be predicted with high confidence. Visualization-based approach can be used to learn new classes during labeling, which makes the framework very flexible. The proposed method is applied for analyzing outdoor scene images and for training a classifier to classify image regions into classes like road, tree and grass, for example. In addition, the approach is validated with three other real-world texture sets. 2. Texture Labeling Framework In our labeling system two different texture feature sets, LBP [13] and Gabor [11], are extracted from the image data. Small amount of textures are selected to be initially labeled while most of the data is unlabeled. At this point, we have a small labeled data set D l having labels from finite set Y of size k. We also have a larger unlabeled data set D u obtained with both features. Multi-class co-training and active learning is applied to the data. In the co-training, part of those unlabeled samples that are correctly labeled with high confidence are automatically added to the labeled set D l. In the active learning step some of those samples that have ambiguously predicted labels are selected for explicit labeling. The labeling is made with visualization-based approach and labeled samples are added to the D l. We are able to learn new texture classes and increse the size of the Y with visualization. The following subsections give more detailed explanation of the components used Texture Feature Extraction Textures are modeled with two separate feature sets providing an extensive representation of the textures. The local binary pattern (LBP) [13] distributions and Gabor filtering [11] are selected as texture measures because they both have performed very well in various texture analysis problems. They also give different types of representations of the textured images, which is useful in the co-training and active learning phases. LBP detects microstructures like curved edges, spots, flat areas etc. from the images. LBP is invariant against monotonic gray-scale variations and it has been extended to have rotation invariant and multiscale properties. Gabor filtering offers larger spatial support than LBP and is more likely to capture the macrostructures of a texture. The image is convolved with Gabor filters at different scales and orientations providing a multiresolution representation of the texture. As a texture measure, the means and the standard deviations of the filtered images are used. There are also rotation invariant extensions to Gabor filtering approach. Basically, the proposed approach does not set strict limits to the feature sets used, but they should provide complementary information. It is also possible to combine texture with some other cue, like color Co-training Based on Labeling Confidence The co-training algorithm proposed by Blum and Mitchell [2] utilizes two different views from the data and trains a pair of learning algorithms to classify samples in the two class problem. They assume that the feature sets used are statistically independent and samples that are confidently labeled with another feature set might be mislabeled with the other learner. The learners can therefore train each other by allowing each one to label some amount of the confidently labeled samples in both negative and positive categories. The assumption that feature sets are conditionally independent rarely holds with real-world data distributions. Regardless of that, co-training has been succesfully used, for example, in text classification [2]. Suprisingly, co-training has beed seldom applied to multi-class problems. Ghani [6] presented a multi-class text categorization approach decomposing the original classification problem into multiple binary problems using errorcorrecting output codes (ECOC). Co-training was then applied separately to each binary classification problem. In our approach the original classification task is also decomposed into multiple binary problems. Allwein et al. [1] proposed a unified approach for reducing multi-class problem into multiple binary problems to be solved by a margin classifier, like the support vec-

3 tor machine (SVM) [17] which is used in this paper. The decomposition can be performed via coding matrix M { 1, 0, +1} k l, where k is the number of classes and l is the number of binary problems. M(r, s) = 1 if the learner s should consider samples from the class r as positive examples, M(r, s) = -1 if the samples from the class r should be considered as negative examples, and if M(r, s) = 0 there is no matter how the learner considers samples from the class r. For example, for the popular one-against-all coding, M is a k k matrix in which all diagonal elements are +1 and all other elements are -1. As there are several methods for constructing a coding matrix, there is also more than one way to find the predicted label for the sample after classified it with multiple binary classifiers. Allwein et al. [1] proposed two different decoding methods. In the first approach the Hamming distance between the predicted labels obtained with different binary classifiers and codebook rows indicating classes is calculated. Hamming coding scheme ignores entirely the magnitude of the predictions which can often be an indication of a level of confidence. That is the reason they also proposed the loss-based decoding method d L (M(r), f(x)) = l L(M(r, s)f s (x)). (1) s=1 r After calculating the distances with some distance measure, the predicted label ŷ 1,..., k is attained with ŷ = arg min dist(m(r), f(x)). Allwein et al. [1] suggested to use the same loss function L in Eq. 1 than used in the learning algorithm. They also obtained better results with loss-based decoding than with Hamming decoding. In our co-training algorithm the loss-based approach is used. Even though the texture features used measure different properties of texture they still are pure texture statistics. For this reason there is some lack of statistical independence of the features and it might lead to errornous label predictions if the basic co-training is used. Therefore, we slightly modify the labeling rule typically used in co-training algorithms where both feature sets are able to label some amount of confidently labeled samples [2]. At the beginning we have two labeled data sets Dl 1 and Dl 2. Multi-class learners are applied to the data sets and estimated output ŷ and the level of confidence (estimated loss) d L (M(y), f D (x)) for every x in the unlabeled set D u are calculated. The total loss of prediction is obtained by summing individual losses of separate learning algorithms. Then n samples with highest confidence (smallest d L (M(y), f(x))) are labeled and added into the labeled sets only if the both predictions ŷ 1 and ŷ 2 are the same. If there is no similarly predicted labels with high confidence then both learners are allowed to label some of the most confidently predicted samples, like in the traditional co-training. This modification improves the labeling especially in the beginning of the learning when there are only a limited number of the training samples Active Learning of Texture Models Active learning model used in this work is based on the approach of Yan et al. [18] where they labeled video frames utilizing color features to identify different humans in the scene. The approach uses margin-based learner and decomposes a multiclass problem into a set of binary-class problems. So settings used in the co-training phase can be directly extended and utilized in the active learning step. The goal is to find such unlabeled samples which can minimize the expected loss on the data set. Consider we have a labeled data set D l of samples to be used in the training of the learner to classify unlabeled samples. P (x y) is the conditional distribution over a sample x, and P (x) is the marginal distribution of x. Multi-class learner gives an estimated output ŷ and an estimated loss d L (M(y), f D l (x)) for each x in the unlabeled set D u. The expected risk function of such a learner is R(f D l ) = E x E y x (d L (M(y), f D l (x)) = y Y d L(M(y), f D l (x))p (y x)p (x)dx. X The task of an active learner is to select some of the unlabeled samples and ask labels for them and add labeled samples into the D l to form new labeled data set D l. Yan et al. [18] showed that the optimal query set can be found by maximizing R(f D l ) R(f D l ). The maximization is usually impractical and computationally intensive due the enormous amount of combinations. But there exist some simple heuristic strategies that can be applied for sample selection without requiring to re-learn the classifier to estimate the expected risk for each combination. One solution is to use the best worst case model, which chooses the most ambiguous samples for labeling [3]. It is based on the assumption that the loss function can be expected to be small if the confidently labeled samples are most likely correctly labeled. Now, let y x be the predicted label for sample x. If Eq. 1 is used for calculting the confidence of each labeling, in the best worst case we choose the sample with the maximum expected loss for the predicted label with arg max x l L(M(y x, s), f s (x)). (2) s=1 In our case, we have two feature sets extracted from the same images. The total loss for the predicted label is formed by summing the individual losses like in the cotraining phase. The ambiguous samples (different predicted label ŷ for both learners and high loss value) are selected for explicit labeling. When the learning process progresses the predicted labels are most likely to be similar with both learners and only Eq. 2 is used for selecting samples.

4 2.4. Visualization-based Learning Explicit labeling is made with the visualization-based method that resembles the approach presented in [16]. In that approach, the self-organizing map (SOM) [9] was applied for scene image data obtained by dividing original images into non-overlapping image patches. Two-phase visualization for the data extracted from each training image was made and the training set was incrementally built. Suspicious and hardly visualizable samples were mapped back to the original image providing contextual information for image patches and helping to determine the correct class for them. The SOM represents the original high-dimensional data on a low-dimensional, typically 2D, grid. It preserves the topology of the data, i.e. samples close to each other in the high-dimensional space locate close to each other in the projection space also. This is very attractive property in terms of visualization-based learning: similar data construct clusters on the 2D plane and user is able to visualize them and rapidly label number of samples. The user could select nodes representing some class and label all the samples inside them or select them under more careful visualization. During visualization new classes can be revealed and introduced, making the whole learning process very flexible. Compared to [16] we do not use all the data available to train SOMs. Only those samples are fed to the map that active learner requires. Visualization is thus easier because the map is not so dense and only one-stage visualization is required. Determining the label for individual image patches may be difficult without contextual information [7] and in such an application the back-projection of patches into the original image is useful. 3. Experiments A. Outdoor Scene Image Analysis Natural scene images from the Outex texture database [12] were used in the experiments. Test suite ID Outex NS consists of 22 sequential outdoor scene images of pixels taken by a human walking on the street. Half of them were selected into the training image set to be used in the labeling framework and others were left for the validation set. There are manually defined ground truth regions for each image (sky, trees, grass, road and buildings). To study the performance of the labeling framework only these ground truth regions were first used and manual labeling with visualization-based approach was not used at all. We selected the first and the last training image from the sequence and manually defined ground truth regions for them. The ground truth regions of the other nine training images were then used in the co-training and active learning and the ground truth regions of the remaining eleven images were used for testing. Images were divided into the blocks of pixels and only those blocks whose every pixel belonged to some ground truth class were considered. With a block-based approach the amount of data reduces dramatically compared to the case when all pixels in the images are covered using sliding blocks. The radial basis function (RBF) support vector machine (SVM) was used as a learner. The grid-search and 5-fold cross-validation using the initial training data obtained from the two manually labeled images were used to estimate the SVM parameters C and γ. Multi-class co-training and active learning were then made using the two manually labeled images as initial training data D l. The remaining nine training images were used to generate the sample pool D u. LBP8,1 (dimensionality 59) and Gabor filtering at four scales and six orientations (dimensionality 48) texture features were extracted from pixels sub-images. Two different coding schemes, one-against-all and pairwise, were used in multi-class to binary coding. The algorithm was iterated 10 rounds and 100 samples in each round were labeled with co-training and other 100 with active learning process. Fig. 1 shows the classification results obtained with both feature sets using separate testing data. The classification is made with the similar SVM classifier that is used in the labeling phase using the same parameters C and γ as before. The last results (iteration 11) are achieved by optimizing C and γ with the new training set. With fully supervised learning using all the training data available and optimized SVM classifier the baseline results for LBP8,1 and Gabor features were 93.5% and 89.3%. The results obtained with co-training and active learning indicate that the classification rate reaches close to the optimum during learning. There is not much difference between coding schemes used. C lassification rate [%] LB P 8,1 OneV sall Gabor OneVsAll LB P P airwise 8,1 Gabor Pairwise Iterations Figure 1. Classification rates during learning. To study more about the effects of co-training and active learning phases, a separate experiment was run where only co-training or active learning was used. In each iteration maximum 50 samples were labeled automatically with cotraining or explicitly with active learning and added to the

5 Dl. The classification performance converges slower with co-training compared to active learning. This is reasonable because with co-training the easiest samples are labeled first and added to the training set. Active learning can boost very fast the classification rate by discovering the most ambiguous samples in the beginning of the training. This is a very important property in the data labeling system. For this reason, an experiment was run where active learning was first applied four times to the data and 50 samples were explicitly labeled. After that the co-training was used to label remaining samples automatically and active learning step was applied in every fifth iteration to imporeve the labeling. With these settings the classification rate incresed from 85.5% to 90.3% during first five iterations with LBP8,1 and from 83.8% to 86.9% with Gabor features, respectively. In the end of the learning the rates were 93.3% and 89.1% that are very close to the optimum. It can be noted that very good classification performance can be achieved using only a small labeling effort. In the last experiment the whole data available was used and visualization-based learning was also applied in the labeling. The initial training set consisted of the same two labeled scene images as before. The samplepool was created dividing the rest of the training images into pixels subimages. Totally there were 8190 unlabeled subimages. For the first four iterations active learning alone was used to select 100 samples to be explicitly labeled in each round. Visualization-based learning was applied for manual labeling. After that the co-training was used to automatically label maximum 200 samples in each iteration. Active learning with visualization-based labeling was performed in every fifth iteration to improve the labeling. Those samples whose class was difficult to determine with visualization (i.e. several classes mixed) were left out of the training set. Fig. 2 shows examples of segmented training image (top) and testing image (bottom) created by classifying pixels regions in the images using SVM classifier and cre ated training set (LBP8,1 ). Most of the regions are correctly classified but especially blocks on the category boundaries are difficult to classify. B. Other Texture Classification Problems Three texture classification problems from Outex [12] and CUReT [5] texture databases were used to study the performance of the learning framework in a more general way. Datasets 1 and 2 are from the Outex database. Dataset 1 (ID Outex TC 00004) represents a typical texture classification problem. It has 24 texture classes with 88 images of 64*64 pixels in each. Half of them (N=1056) were selected to the training set and the other half for testing. Dataset 2 (ID Outex TC 00013) is a color texture classification problem having *128 pixels images from 68 classes. Figure 2. Examples of training and testing image classified into five categories. Half of the images (N=680) were used in the training phase. Dataset 3 consists of CUReT textures. 118 images from 20 classes taken under varying viewpoint and illumination with a viewing angle less than 60 degrees were selected to the set. Half of the images were considered as training images (N=1180). With the first dataset, 70 images of 1056 were selected in the initial training set. LBP8,1 and Gabor (4 scales, 6 orientations) features were extracted from the images. Active learning (without visualization-based labeling) was used to explicitly label samples on the first four iterations (50 samples in each iteration). After that co-training was used to automatically label 50 samples in each iteration and active learning was applied in every fifth round. Five different coding schemes were experimented: one-against-all, pairwise, ECOC with 15bit BCH code, ECOC with 31bit BCH code, and ECOC with 63bit BCH code. There was no significant difference in the performance between coding schemes. During the first five iterations the classification rate increased from 73% to near 97% with Gabor features and from 89% to near 97% with LBP8,1 with all coding schemes. The baseline classification result obtained with the full training data and supervised learning were 98.5% with Gabor and 98.8 with LBP8,1. All coding schemes were able to achieve very good performance during learning. It can also be noted that active learning was able to find the most informative samples very fast (during the first few iterations). and Gabor filthe multiresolution LBP8,1+16,3+24,5 tering at four scales and six orientations were applied to the Outex TC textures. Initially 225 images were put to the labeled training set. Similar settings as before were used, expect only 25 samples were labeled in each iteration and one vs. all coding was used. On the first five rounds the classication rates increased from 68.1% to 77.8% with LBP8,1+16,3+24,5 and from 68.1% to 73.4%

6 with Gabor, respectively. During learning the classification rates exceeded 80.2% and 74.0%, while using supervised learning with full training data the rates were 84.4% and 74.9%, recpectively. Because the Outex TC images are color textures, we also experimented how texture features and separate color features can be used together in the learning framework. The same LBP8,1+16,3+24,5 features as before were used as the other feature set. The color feature set was constructed by quantizing each color channel into four levels in the RGB colorcube. Using the same kind of learning procedure than before, the classification rates increased from 68.1% to 76.1% with LBP8,1+16,3+24,5 and 67.1% to 71.8% with color features on the first four iterations. During learning, the rates reached 84.4% and 76.6% which are close to the rates obtained with fully supervised learning (84.4% and 76.8%). With CUReT textures 20% of the training images were selected into the initial training set. LBP8,1+16,3+24,5 ri and Gabor features were extracted and similar settings than with Outex TC textures were used in the labeling procedure. During first five iterations the classification rate increased from 70.9% to 92.6% with LBP8,1+16,3+24,5 ri and from 79.8% to 92.0% with Gabor features. During learning the rates increased to 94.5% and 97.3% which are close to the ones obtained with supervised learning (95.0% and 97.6%). 4. Conclusions Collection of labeled training data is typically laborious and prone to human made errors in many computer vision tasks. In this paper, we proposed a robust texture data labeling framework utilizing efficient texture features, cotraining and active learning in order to ease human work in labeling. Visualization-based learning was used as a userinterface in actual labeling task providing very useful view into the data. We experimented the labeling framework with various texture image sets for creating a labeled data set for texture classifier. Active learning was able to select such unlabeled samples that would provide useful information to the training set and increased the testing classification rate very rapidly. Co-training was used to automatically label those samples whose label can be known with high confidence. Those unlabeled samples that active learner suggested to be labeled were analyzed and labeled with visualization-based user interface. With this kind of approach the amount of human work needed in the labeling reduces dramatically, but the labeling performance remains very good. In addition, the user can introduce totally new classes in the visualization phase which makes the labeling procedure very flexible. Acknowledgments Financial support of the Infotech Oulu Graduate Schools and Academy of Finland is gratefully acknowledged. References [1] E. Allwein, R. Schapire, and Y. Singer. Reducing multiclass to binary: a unifying approach for margin classifiers. Journal of Machine Learning Research, 1: , [2] A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In Computational Learning Theory, pages , [3] C. Campbell, N. Cristianini, and A. Smola. Query learning with large margin classifiers. In ICML 00, pages , [4] D. Cohn, Z. Ghahramani, and M. Jordan. Active learning with statistical models. Journal of Artificial Intelligence Research, 4: , [5] K. Dana, B. van Ginneken, S. Nayar, and J. Koenderink. Reflectance and texture of real world surfaces. ACM Trans. Graphichs, 18(1):1 34, [6] R. Ghani. Combining labeled and unlabeled data for multiclass text categorization. In ICML 02, pages , [7] X. He, R. Zemel, and M. Carreira-Perpinan. Multiscale conditional random fields for image labeling. In IEEE Conf. CVPR 04, pages , [8] V. Iyengar, C. Apte, and T. Zhang. Active learning using adaptive resampling. In Int l Conf. on Knowledge Discovery and Data Mining, pages 91 98, [9] T. Kohonen. Self-organizing Maps. Springer-Verlag, Berlin, Germany, [10] S. Lazebnik, C. Schmid, and J. Ponce. A sparse texture representation using local affine region. IEEE Trans. PAMI, 27(8): , [11] B. Manjunath and W. Ma. Texture features for browsing and retrieval of image data. IEEE Trans. PAMI, 18(8): , [12] T. Ojala, T. Mäenpää, M. Pietikäinen, J. Viertola, J. Kyllönen, and S. Huovinen. Outex - new framework for empirical evaluation of texture analysis algorithms. In ICPR 02, volume 1, pages , [13] T. Ojala, M. Pietikäinen, and T. Mäenpää. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. PAMI, 24(7): , [14] R. Picard and T. Minka. Vision texture for annotation. Multimedia Systems, 3(1):3 14, [15] M. Seeger. Learning with labeled and unlabeled data. Technical report, Institute of Adaptive and Neural Computation, University of Edinburgh, [16] M. Turtinen and M. Pietikäinen. Visual training and classification of textured scene images. In The 3rd International Workshop on Texture Analysis and Synthesis, pages , [17] V. Vapnik. The Nature of Statistical Learning Theory. Springer-Verlag, New York, [18] R. Yan, J. Yang, and A. Hauptmann. Automatically labeling video data using multi-class active learning. In IEEE Conf. ICCV 03, pages , 2003.

Local features and matching. Image classification & object localization

Local features and matching. Image classification & object localization Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to

More information

Probabilistic Latent Semantic Analysis (plsa)

Probabilistic Latent Semantic Analysis (plsa) Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg Rainer.Lienhart@informatik.uni-augsburg.de www.multimedia-computing.{de,org} References

More information

Object Recognition. Selim Aksoy. Bilkent University saksoy@cs.bilkent.edu.tr

Object Recognition. Selim Aksoy. Bilkent University saksoy@cs.bilkent.edu.tr Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Image classification Image (scene) classification is a fundamental

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

Sub-class Error-Correcting Output Codes

Sub-class Error-Correcting Output Codes Sub-class Error-Correcting Output Codes Sergio Escalera, Oriol Pujol and Petia Radeva Computer Vision Center, Campus UAB, Edifici O, 08193, Bellaterra, Spain. Dept. Matemàtica Aplicada i Anàlisi, Universitat

More information

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic

More information

Visualization of large data sets using MDS combined with LVQ.

Visualization of large data sets using MDS combined with LVQ. Visualization of large data sets using MDS combined with LVQ. Antoine Naud and Włodzisław Duch Department of Informatics, Nicholas Copernicus University, Grudziądzka 5, 87-100 Toruń, Poland. www.phys.uni.torun.pl/kmk

More information

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network Proceedings of the 8th WSEAS Int. Conf. on ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING & DATA BASES (AIKED '9) ISSN: 179-519 435 ISBN: 978-96-474-51-2 An Energy-Based Vehicle Tracking System using Principal

More information

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition Send Orders for Reprints to reprints@benthamscience.ae The Open Electrical & Electronic Engineering Journal, 2014, 8, 599-604 599 Open Access A Facial Expression Recognition Algorithm Based on Local Binary

More information

Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics

Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics Part I: Factorizations and Statistical Modeling/Inference Amnon Shashua School of Computer Science & Eng. The Hebrew University

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015 RESEARCH ARTICLE OPEN ACCESS Data Mining Technology for Efficient Network Security Management Ankit Naik [1], S.W. Ahmad [2] Student [1], Assistant Professor [2] Department of Computer Science and Engineering

More information

Social Media Mining. Data Mining Essentials

Social Media Mining. Data Mining Essentials Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers

More information

A Learning Based Method for Super-Resolution of Low Resolution Images

A Learning Based Method for Super-Resolution of Low Resolution Images A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 emre.ugur@ceng.metu.edu.tr Abstract The main objective of this project is the study of a learning based method

More information

Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05

Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05 Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification

More information

Learning with Local and Global Consistency

Learning with Local and Global Consistency Learning with Local and Global Consistency Dengyong Zhou, Olivier Bousquet, Thomas Navin Lal, Jason Weston, and Bernhard Schölkopf Max Planck Institute for Biological Cybernetics, 7276 Tuebingen, Germany

More information

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University

More information

Mining Signatures in Healthcare Data Based on Event Sequences and its Applications

Mining Signatures in Healthcare Data Based on Event Sequences and its Applications Mining Signatures in Healthcare Data Based on Event Sequences and its Applications Siddhanth Gokarapu 1, J. Laxmi Narayana 2 1 Student, Computer Science & Engineering-Department, JNTU Hyderabad India 1

More information

Methodology for Emulating Self Organizing Maps for Visualization of Large Datasets

Methodology for Emulating Self Organizing Maps for Visualization of Large Datasets Methodology for Emulating Self Organizing Maps for Visualization of Large Datasets Macario O. Cordel II and Arnulfo P. Azcarraga College of Computer Studies *Corresponding Author: macario.cordel@dlsu.edu.ph

More information

Learning with Local and Global Consistency

Learning with Local and Global Consistency Learning with Local and Global Consistency Dengyong Zhou, Olivier Bousquet, Thomas Navin Lal, Jason Weston, and Bernhard Schölkopf Max Planck Institute for Biological Cybernetics, 7276 Tuebingen, Germany

More information

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree. Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table

More information

The Data Mining Process

The Data Mining Process Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data

More information

CLOUD CHARACTERIZATION USING LOCAL TEXTURE INFORMATION. Antti Isosalo, Markus Turtinen and Matti Pietikäinen

CLOUD CHARACTERIZATION USING LOCAL TEXTURE INFORMATION. Antti Isosalo, Markus Turtinen and Matti Pietikäinen CLOUD CHARACTERIZATION USING LOCAL TEXTURE INFORMATION Antti Isosalo, Markus Turtinen and Matti Pietikäinen Machine Vision Group Department of Electrical and Information Engineering University of Oulu,

More information

Mean-Shift Tracking with Random Sampling

Mean-Shift Tracking with Random Sampling 1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of

More information

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Automatic Photo Quality Assessment Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Estimating i the photorealism of images: Distinguishing i i paintings from photographs h Florin

More information

Tracking Moving Objects In Video Sequences Yiwei Wang, Robert E. Van Dyck, and John F. Doherty Department of Electrical Engineering The Pennsylvania State University University Park, PA16802 Abstract{Object

More information

Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms

Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail

More information

Character Image Patterns as Big Data

Character Image Patterns as Big Data 22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,

More information

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow , pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices

More information

The Role of Size Normalization on the Recognition Rate of Handwritten Numerals

The Role of Size Normalization on the Recognition Rate of Handwritten Numerals The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,

More information

Simple and efficient online algorithms for real world applications

Simple and efficient online algorithms for real world applications Simple and efficient online algorithms for real world applications Università degli Studi di Milano Milano, Italy Talk @ Centro de Visión por Computador Something about me PhD in Robotics at LIRA-Lab,

More information

Blog Post Extraction Using Title Finding

Blog Post Extraction Using Title Finding Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School

More information

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Mobile Phone APP Software Browsing Behavior using Clustering Analysis Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis

More information

Novelty Detection in image recognition using IRF Neural Networks properties

Novelty Detection in image recognition using IRF Neural Networks properties Novelty Detection in image recognition using IRF Neural Networks properties Philippe Smagghe, Jean-Luc Buessler, Jean-Philippe Urban Université de Haute-Alsace MIPS 4, rue des Frères Lumière, 68093 Mulhouse,

More information

Semi-Supervised Learning for Blog Classification

Semi-Supervised Learning for Blog Classification Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Semi-Supervised Learning for Blog Classification Daisuke Ikeda Department of Computational Intelligence and Systems Science,

More information

Environmental Remote Sensing GEOG 2021

Environmental Remote Sensing GEOG 2021 Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class

More information

CS 534: Computer Vision 3D Model-based recognition

CS 534: Computer Vision 3D Model-based recognition CS 534: Computer Vision 3D Model-based recognition Ahmed Elgammal Dept of Computer Science CS 534 3D Model-based Vision - 1 High Level Vision Object Recognition: What it means? Two main recognition tasks:!

More information

Visualization of Breast Cancer Data by SOM Component Planes

Visualization of Breast Cancer Data by SOM Component Planes International Journal of Science and Technology Volume 3 No. 2, February, 2014 Visualization of Breast Cancer Data by SOM Component Planes P.Venkatesan. 1, M.Mullai 2 1 Department of Statistics,NIRT(Indian

More information

A Study of Web Log Analysis Using Clustering Techniques

A Study of Web Log Analysis Using Clustering Techniques A Study of Web Log Analysis Using Clustering Techniques Hemanshu Rana 1, Mayank Patel 2 Assistant Professor, Dept of CSE, M.G Institute of Technical Education, Gujarat India 1 Assistant Professor, Dept

More information

Self Organizing Maps for Visualization of Categories

Self Organizing Maps for Visualization of Categories Self Organizing Maps for Visualization of Categories Julian Szymański 1 and Włodzisław Duch 2,3 1 Department of Computer Systems Architecture, Gdańsk University of Technology, Poland, julian.szymanski@eti.pg.gda.pl

More information

CI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore.

CI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore. CI6227: Data Mining Lesson 11b: Ensemble Learning Sinno Jialin PAN Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore Acknowledgements: slides are adapted from the lecture notes

More information

Multiclass Classification. 9.520 Class 06, 25 Feb 2008 Ryan Rifkin

Multiclass Classification. 9.520 Class 06, 25 Feb 2008 Ryan Rifkin Multiclass Classification 9.520 Class 06, 25 Feb 2008 Ryan Rifkin It is a tale Told by an idiot, full of sound and fury, Signifying nothing. Macbeth, Act V, Scene V What Is Multiclass Classification? Each

More information

The Delicate Art of Flower Classification

The Delicate Art of Flower Classification The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC pvicol@sfu.ca Note: The following is my contribution to a group project for a graduate machine learning

More information

Incremental PCA: An Alternative Approach for Novelty Detection

Incremental PCA: An Alternative Approach for Novelty Detection Incremental PCA: An Alternative Approach for Detection Hugo Vieira Neto and Ulrich Nehmzow Department of Computer Science University of Essex Wivenhoe Park Colchester CO4 3SQ {hvieir, udfn}@essex.ac.uk

More information

Towards better accuracy for Spam predictions

Towards better accuracy for Spam predictions Towards better accuracy for Spam predictions Chengyan Zhao Department of Computer Science University of Toronto Toronto, Ontario, Canada M5S 2E4 czhao@cs.toronto.edu Abstract Spam identification is crucial

More information

Image Classification for Dogs and Cats

Image Classification for Dogs and Cats Image Classification for Dogs and Cats Bang Liu, Yan Liu Department of Electrical and Computer Engineering {bang3,yan10}@ualberta.ca Kai Zhou Department of Computing Science kzhou3@ualberta.ca Abstract

More information

Speed Performance Improvement of Vehicle Blob Tracking System

Speed Performance Improvement of Vehicle Blob Tracking System Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA sungchun@usc.edu, nevatia@usc.edu Abstract. A speed

More information

Human behavior analysis from videos using optical flow

Human behavior analysis from videos using optical flow L a b o r a t o i r e I n f o r m a t i q u e F o n d a m e n t a l e d e L i l l e Human behavior analysis from videos using optical flow Yassine Benabbas Directeur de thèse : Chabane Djeraba Multitel

More information

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear

More information

Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca

Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca Clustering Adrian Groza Department of Computer Science Technical University of Cluj-Napoca Outline 1 Cluster Analysis What is Datamining? Cluster Analysis 2 K-means 3 Hierarchical Clustering What is Datamining?

More information

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015 An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content

More information

Data Mining - Evaluation of Classifiers

Data Mining - Evaluation of Classifiers Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010

More information

Data Mining Practical Machine Learning Tools and Techniques

Data Mining Practical Machine Learning Tools and Techniques Ensemble learning Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 8 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Combining multiple models Bagging The basic idea

More information

Support Vector Machines with Clustering for Training with Very Large Datasets

Support Vector Machines with Clustering for Training with Very Large Datasets Support Vector Machines with Clustering for Training with Very Large Datasets Theodoros Evgeniou Technology Management INSEAD Bd de Constance, Fontainebleau 77300, France theodoros.evgeniou@insead.fr Massimiliano

More information

COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS

COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS B.K. Mohan and S. N. Ladha Centre for Studies in Resources Engineering IIT

More information

Geometric-Guided Label Propagation for Moving Object Detection

Geometric-Guided Label Propagation for Moving Object Detection MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Geometric-Guided Label Propagation for Moving Object Detection Kao, J.-Y.; Tian, D.; Mansour, H.; Ortega, A.; Vetro, A. TR2016-005 March 2016

More information

Steven C.H. Hoi School of Information Systems Singapore Management University Email: chhoi@smu.edu.sg

Steven C.H. Hoi School of Information Systems Singapore Management University Email: chhoi@smu.edu.sg Steven C.H. Hoi School of Information Systems Singapore Management University Email: chhoi@smu.edu.sg Introduction http://stevenhoi.org/ Finance Recommender Systems Cyber Security Machine Learning Visual

More information

Journal of Industrial Engineering Research. Adaptive sequence of Key Pose Detection for Human Action Recognition

Journal of Industrial Engineering Research. Adaptive sequence of Key Pose Detection for Human Action Recognition IWNEST PUBLISHER Journal of Industrial Engineering Research (ISSN: 2077-4559) Journal home page: http://www.iwnest.com/aace/ Adaptive sequence of Key Pose Detection for Human Action Recognition 1 T. Sindhu

More information

Determining optimal window size for texture feature extraction methods

Determining optimal window size for texture feature extraction methods IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec

More information

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,

More information

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

Face Model Fitting on Low Resolution Images

Face Model Fitting on Low Resolution Images Face Model Fitting on Low Resolution Images Xiaoming Liu Peter H. Tu Frederick W. Wheeler Visualization and Computer Vision Lab General Electric Global Research Center Niskayuna, NY, 1239, USA {liux,tu,wheeler}@research.ge.com

More information

Azure Machine Learning, SQL Data Mining and R

Azure Machine Learning, SQL Data Mining and R Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:

More information

Building an Advanced Invariant Real-Time Human Tracking System

Building an Advanced Invariant Real-Time Human Tracking System UDC 004.41 Building an Advanced Invariant Real-Time Human Tracking System Fayez Idris 1, Mazen Abu_Zaher 2, Rashad J. Rasras 3, and Ibrahiem M. M. El Emary 4 1 School of Informatics and Computing, German-Jordanian

More information

Colour Image Segmentation Technique for Screen Printing

Colour Image Segmentation Technique for Screen Printing 60 R.U. Hewage and D.U.J. Sonnadara Department of Physics, University of Colombo, Sri Lanka ABSTRACT Screen-printing is an industry with a large number of applications ranging from printing mobile phone

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

Subspace Analysis and Optimization for AAM Based Face Alignment

Subspace Analysis and Optimization for AAM Based Face Alignment Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China zhaoming1999@zju.edu.cn Stan Z. Li Microsoft

More information

Fast Matching of Binary Features

Fast Matching of Binary Features Fast Matching of Binary Features Marius Muja and David G. Lowe Laboratory for Computational Intelligence University of British Columbia, Vancouver, Canada {mariusm,lowe}@cs.ubc.ca Abstract There has been

More information

How To Use Neural Networks In Data Mining

How To Use Neural Networks In Data Mining International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and

More information

Video OCR for Sport Video Annotation and Retrieval

Video OCR for Sport Video Annotation and Retrieval Video OCR for Sport Video Annotation and Retrieval Datong Chen, Kim Shearer and Hervé Bourlard, Fellow, IEEE Dalle Molle Institute for Perceptual Artificial Intelligence Rue du Simplon 4 CH-190 Martigny

More information

Multi-Class Active Learning for Image Classification

Multi-Class Active Learning for Image Classification Multi-Class Active Learning for Image Classification Ajay J. Joshi University of Minnesota Twin Cities ajay@cs.umn.edu Fatih Porikli Mitsubishi Electric Research Laboratories fatih@merl.com Nikolaos Papanikolopoulos

More information

Simultaneous Gamma Correction and Registration in the Frequency Domain

Simultaneous Gamma Correction and Registration in the Frequency Domain Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong a28wong@uwaterloo.ca William Bishop wdbishop@uwaterloo.ca Department of Electrical and Computer Engineering University

More information

Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with Panchromatic Textural Features

Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with Panchromatic Textural Features Remote Sensing and Geoinformation Lena Halounová, Editor not only for Scientific Cooperation EARSeL, 2011 Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with

More information

An Analysis on Density Based Clustering of Multi Dimensional Spatial Data

An Analysis on Density Based Clustering of Multi Dimensional Spatial Data An Analysis on Density Based Clustering of Multi Dimensional Spatial Data K. Mumtaz 1 Assistant Professor, Department of MCA Vivekanandha Institute of Information and Management Studies, Tiruchengode,

More information

Active Learning with Boosting for Spam Detection

Active Learning with Boosting for Spam Detection Active Learning with Boosting for Spam Detection Nikhila Arkalgud Last update: March 22, 2008 Active Learning with Boosting for Spam Detection Last update: March 22, 2008 1 / 38 Outline 1 Spam Filters

More information

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION Saurabh Asija 1, Rakesh Singh 2 1 Research Scholar (Computer Engineering Department), Punjabi University, Patiala. 2 Asst.

More information

Feature Tracking and Optical Flow

Feature Tracking and Optical Flow 02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve

More information

A Real Time Hand Tracking System for Interactive Applications

A Real Time Hand Tracking System for Interactive Applications A Real Time Hand Tracking System for Interactive Applications Siddharth Swarup Rautaray Indian Institute of Information Technology Allahabad ABSTRACT In vision based hand tracking systems color plays an

More information

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin

More information

Introduction to Support Vector Machines. Colin Campbell, Bristol University

Introduction to Support Vector Machines. Colin Campbell, Bristol University Introduction to Support Vector Machines Colin Campbell, Bristol University 1 Outline of talk. Part 1. An Introduction to SVMs 1.1. SVMs for binary classification. 1.2. Soft margins and multi-class classification.

More information

Low-resolution Character Recognition by Video-based Super-resolution

Low-resolution Character Recognition by Video-based Super-resolution 2009 10th International Conference on Document Analysis and Recognition Low-resolution Character Recognition by Video-based Super-resolution Ataru Ohkura 1, Daisuke Deguchi 1, Tomokazu Takahashi 2, Ichiro

More information

Scalable Developments for Big Data Analytics in Remote Sensing

Scalable Developments for Big Data Analytics in Remote Sensing Scalable Developments for Big Data Analytics in Remote Sensing Federated Systems and Data Division Research Group High Productivity Data Processing Dr.-Ing. Morris Riedel et al. Research Group Leader,

More information

DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS

DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS 1 AND ALGORITHMS Chiara Renso KDD-LAB ISTI- CNR, Pisa, Italy WHAT IS CLUSTER ANALYSIS? Finding groups of objects such that the objects in a group will be similar

More information

3D Model based Object Class Detection in An Arbitrary View

3D Model based Object Class Detection in An Arbitrary View 3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/

More information

Data Mining and Neural Networks in Stata

Data Mining and Neural Networks in Stata Data Mining and Neural Networks in Stata 2 nd Italian Stata Users Group Meeting Milano, 10 October 2005 Mario Lucchini e Maurizo Pisati Università di Milano-Bicocca mario.lucchini@unimib.it maurizio.pisati@unimib.it

More information

Active Learning SVM for Blogs recommendation

Active Learning SVM for Blogs recommendation Active Learning SVM for Blogs recommendation Xin Guan Computer Science, George Mason University Ⅰ.Introduction In the DH Now website, they try to review a big amount of blogs and articles and find the

More information

Edge tracking for motion segmentation and depth ordering

Edge tracking for motion segmentation and depth ordering Edge tracking for motion segmentation and depth ordering P. Smith, T. Drummond and R. Cipolla Department of Engineering University of Cambridge Cambridge CB2 1PZ,UK {pas1001 twd20 cipolla}@eng.cam.ac.uk

More information

The Artificial Prediction Market

The Artificial Prediction Market The Artificial Prediction Market Adrian Barbu Department of Statistics Florida State University Joint work with Nathan Lay, Siemens Corporate Research 1 Overview Main Contributions A mathematical theory

More information

Data Mining: A Preprocessing Engine

Data Mining: A Preprocessing Engine Journal of Computer Science 2 (9): 735-739, 2006 ISSN 1549-3636 2005 Science Publications Data Mining: A Preprocessing Engine Luai Al Shalabi, Zyad Shaaban and Basel Kasasbeh Applied Science University,

More information

INTERACTIVE DATA EXPLORATION USING MDS MAPPING

INTERACTIVE DATA EXPLORATION USING MDS MAPPING INTERACTIVE DATA EXPLORATION USING MDS MAPPING Antoine Naud and Włodzisław Duch 1 Department of Computer Methods Nicolaus Copernicus University ul. Grudziadzka 5, 87-100 Toruń, Poland Abstract: Interactive

More information

MapReduce Approach to Collective Classification for Networks

MapReduce Approach to Collective Classification for Networks MapReduce Approach to Collective Classification for Networks Wojciech Indyk 1, Tomasz Kajdanowicz 1, Przemyslaw Kazienko 1, and Slawomir Plamowski 1 Wroclaw University of Technology, Wroclaw, Poland Faculty

More information

Face Recognition For Remote Database Backup System

Face Recognition For Remote Database Backup System Face Recognition For Remote Database Backup System Aniza Mohamed Din, Faudziah Ahmad, Mohamad Farhan Mohamad Mohsin, Ku Ruhana Ku-Mahamud, Mustafa Mufawak Theab 2 Graduate Department of Computer Science,UUM

More information

Image Segmentation and Registration

Image Segmentation and Registration Image Segmentation and Registration Dr. Christine Tanner (tanner@vision.ee.ethz.ch) Computer Vision Laboratory, ETH Zürich Dr. Verena Kaynig, Machine Learning Laboratory, ETH Zürich Outline Segmentation

More information

HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER

HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER Gholamreza Anbarjafari icv Group, IMS Lab, Institute of Technology, University of Tartu, Tartu 50411, Estonia sjafari@ut.ee

More information

HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT

HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012 1 HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT Akhil Gupta, Akash Rathi, Dr. Y. Radhika

More information

Supervised Feature Selection & Unsupervised Dimensionality Reduction

Supervised Feature Selection & Unsupervised Dimensionality Reduction Supervised Feature Selection & Unsupervised Dimensionality Reduction Feature Subset Selection Supervised: class labels are given Select a subset of the problem features Why? Redundant features much or

More information

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations

More information

Using Data Mining for Mobile Communication Clustering and Characterization

Using Data Mining for Mobile Communication Clustering and Characterization Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer

More information

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

In this presentation, you will be introduced to data mining and the relationship with meaningful use. In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine

More information

LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING. ----Changsheng Liu 10-30-2014

LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING. ----Changsheng Liu 10-30-2014 LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING ----Changsheng Liu 10-30-2014 Agenda Semi Supervised Learning Topics in Semi Supervised Learning Label Propagation Local and global consistency Graph

More information