Mitosis Detection in Breast Cancer Histology Images with Deep Neural Networks
|
|
|
- Henry Ramsey
- 9 years ago
- Views:
Transcription
1 Mitosis Detection in Breast Cancer Histology Images with Deep Neural Networks Dan C. Cireşan, Alessandro Giusti, Luca M. Gambardella, Jürgen Schmidhuber IDSIA, Dalle Molle Institute for Artificial Intelligence, USI-SUPSI, Lugano, Switzerland Abstract. We use deep max-pooling convolutional neural networks to detect mitosis in breast histology images. The networks are trained to classify each pixel in the images, using as context a patch centered on the pixel. Simple postprocessing is then applied to the network output. Our approach won the ICPR 2012 mitosis detection competition, outperforming other contestants by a significant margin. 1 Introduction The number of mitotic figures visible in histology sections is an important indicator for cancer screening and assessment. Normally, the count is performed manually by histologists, but automating the process could reduce its time and costs (thus making it more accessible), minimize errors, and improve the comparability of results obtained in different labs. Mitosis detection is very hard. In fact, mitosis is a complex process during which a cell nucleus undergoes various transformations. In addition, different image areas are characterized by different tissue types, which exhibit highly variable appearance. A large amount of different structures can be observed in histology images stained with Hematosin & Eosin, and in particular many dark-blue spots, most of which correspond to cell nuclei. Only a subset of them is in a mitotic phase and must be detected. In most stages a mitotic nucleus looks very much like a non-mitotic one, or like other dark-blue spots, to the point that a human observer without extensive training cannot differentiate them (Figure 1). As an additional complication, in later stages of the mitosis process a nucleus may appear to split in two dark-blue spots, to be counted as one single mitosis. Our approach is conceptually very simple. We use a supervised Deep Neural Network (DNN) as a powerful pixel classifier. The DNN is a max-pooling (MP) convolutional neural network (CNN). It directly operates on raw RGB data sampled from a square patch of the source image, centered on the pixel itself. The DNN is trained to differentiate patches with a mitotic nucleus close to the center from all other windows. Mitosis in unseen images are detected by applying the classifier on a sliding window, and postprocessing its outputs with simple techniques. Because the DNN operates on raw pixel values, no human input is needed : on the contrary, the DNN automatically learns a set of visual features from the training data. Our main contribution is a new, important, practical application of DNN, which recently produced outstanding results in image classification, segmentation and detection. Our approach is tested on a publicly available dataset. It significantly outperforms
2 2 all competing techniques, with manageable computational effort: processing a 4MPixel image requires few minutes on a standard laptop. Supplementary material for this paper is available at Related Work Different flavors of CNN have been used for decades to classify objects. Introduced in 1980 [6] and gradually improved over the next two decades [11,1,17], they unfold their full potential when combined with MP and made both deep and wide [3,4]. They excel on data sets ranging from handwritten characters [3] to complex cluttered images (NORB) [4], faces, and natural color images. DNN-based pattern recognition is not limited to object classification, but can be used for detection as well. Recently, DNN were used to segment images of neural tissue in Electron Microscopy [2] and natural scenes [5]. Many detection problems in biomedical images are solved by means of pixel classifiers, and are characterized by the relatively obvious appearance of the objects to be detected. Difficulties may arise due to clumping/touching objects which may be hard to separate and count [13]. Mitosis detection is different. While mitosis are normally rare and well-separated, they are very hard to differentiate from non-mitotic nuclei. 2 Methods Given an input RGB image I, the problem is to find a set D = {d 1, d 2,..., d N } of detections, each reporting the centroid coordinates for a single mitosis. The problem is solved by training a detector on training images with given ground truth information about the centroid of each visible mitosis. Each pixel is assigned one of two possible classes, mitosis or non-mitosis, the former to pixels at (or close to) mitosis centroids, the latter to all other pixels. Our detector is a DNN-based pixel classifier. For any given pixel p, the DNN predicts its class using raw RGB values in a square image window centered on p (Figure 1). Windows of class mitosis contain a visible mitosis around the window s center. Others contain off-center or no mitosis. Deep Neural Netwok architecture A DNN [4] is a feed-forward net made of successive pairs of convolutional and max-pooling layers, followed by several fully connected layers. Raw pixel intensities of the input image are passed through this general, hierarchical feature extractor. The feature vector it produces is classified by the fully connected layers. All weights are optimized through minimization of the misclassification error over the training set. Each convolutional layer performs a 2D convolution of its input maps with a rectangular filter. The filter is applied in every possible position of the input map. If the previous layer contains more than one map, the activations of the corresponding convolutions are summed up, then passed through a nonlinear activation function. One of the architectural differences between our DNN and previous CNN [11] are max-pooling (MP) layers [14,16] instead of sub-sampling layers. Their outputs are given by the maximum activations over non-overlapping square regions. MP layers are fixed, non-trainable layers selecting the winning features. Typical DNN also are much wider than previous CNN, with many more connections, weights and non-linearities.
3 3 Fig. 1. Top left: one image (4 MPixels) corresponding to one of the 50 high power fields represented in the dataset. Our detected mitosis are circled green (true positives) and red (false positives); cyan denotes mitosis not detected by our approach. Top right: details of three areas (fullsize results on the whole dataset in supplementary material). Note the challenging appearance of mitotic nuclei and other very similar non-mitotic structures. Bottom: overview of our detection approach. After several pairs of convolutional and MP layers, one fully connected layer further mixes the outputs into a feature vector. The output layer is a simple fully connected layer with one neuron per class (two for this problem), activated by a softmax function, thus ensuring that each neuron s output activation can be interpreted as the probability of a particular input belonging to that class. Training a detector Using ground truth data, we label each pixel of each training image as either mitosis (when closer than d pixels to the centroid of a mitosis) or nonmitosis (elsewhere). Then, we build a training set in which each instance maps a square window of RGB values sampled from the original image to the class of the central pixel. If a window lies partly outside of the image boundary, the missing pixels are synthesized by mirroring. The mitosis detection problem is rotationally invariant. Therefore, additional training instances are generated by transforming windows within the training set by applying arbitrary rotations and/or mirroring. This is especially important considering that there are extremely few mitosis examples in the training set. Processing a testing image To process an unseen image I, we apply the DNN to all windows whose central pixel is within the image boundaries. Pixels outside of the image boundaries are again synthesized by mirroring. This yields a probability map
4 4 M, in which each pixel is assigned a probability of being close to the centroid of a mitosis. Ideally, we expect M to be zero everywhere except within d-pixel-radius disks centered on each mitosis. In practice, M is extremely noisy. Therefore, M is convolved with a d-pixel-radius disk kernel, which yields a smoothed probability map M f ; the local maxima of M f are expected to lie at the disk centers in M, i.e., at the centroids of each mitosis. To obtain a set of detections D I for image I, we first initialize D I, then iterate the following two steps until no pixel in M f exceeds a given threshold t. Let p m be the pixel with the largest value in M f ; D I = D I (p m, M f (p m )). M f (p) 0 for each p : p p m < 2d (non-maxima suppression). This yields a (possibly empty) set D I (depending on threshold t) containing the detected centroids of all mitosis in image I, as well as their respective score. Exploiting multiple nets and rotational invariance Because the DNN classifier is very flexible and has many degrees of freedom, it is expected to exhibit large variance and low bias [7]. In fact, in related work [2] it was observed that large nets with different architectures, even when trained on the same dataset, tend to yield significantly different outputs, especially for challenging image parts. We reduce such variance by averaging the outputs of multiple classifiers with different architectures. Moreover, we exploit rotational invariance by separately processing rotated and mirrored versions of each input image, and averaging their results. 3 Materials, Experiments and Results Dataset and Performance Measures We evaluate our method on the public MITOS dataset including 50 images corresponding to 50 high-power fields in 5 different biopsy slides stained with Hematosin & Eosin. A total of about 300 mitosis are visible in MITOS. Each field represents a µm 2 area, and is acquired using three different setups: two slide scanners and a multispectral microscope. Here we consider images acquired by the Aperio XT scanner, the most widespread and accessible solution among the three. It has a resolution of µm per pixel, resulting in a RGB image for each field. Expert pathologists manually annotated all visible mitosis. We partition the 50 images into three subsets: T1 (26 images), T2 (9 images), and T3 (15 images). T3 coincides with the evaluation images for the 2012 ICPR Mitosis Detection Contest. Its ground truth was withheld from contestants until the end of the contest. T3 is exclusively used for computing our performance measures once, to ensure a fair comparison with other algorithms. Given a set of detections for dataset T3, according to the contest criteria, we count the number N TP of True Positives (i.e. detections whose coordinates are closer than 5µm(20 px) from the ground truth centroid), False Positives (N FP ) and False Negatives (N FN ). We compute the following performance measures: recall (R = N TP /(N TP + N FN )), precision (P = N TP /(N TP + N FP )) and F 1 score (F 1 = 2P R/(P + R)). We randomly split the remaining 35 images, for which ground truth was available, in two disjoint sets T1 (training) and T2 (validation). Detectors trained on the former are evaluated on the latter for determining the threshold yielding the largest F-score.
5 5 Building the detector For images in T1 and T2, the mitosis class is assigned to all windows whose center pixel is closer than d = 10 pixels to the centroid of a ground-truth mitosis; all remaining windows are given the non-mitosis class. This results in a total of roughly mitosis pixels and 151 million non-mitosis pixels. Note that, among all non-mitosis pixels, only a tiny fraction (i.e. those close to non-mitotic nuclei and similarly looking structures) represent interesting instances. In contrast, the largest part of the image area is covered by background pixels far from any nucleus, whose class (non-mitosis) is quite trivial to determine. If training instances for class non-mitosis were uniformly sampled from images, most of the training effort would be wasted. Other approaches [18,8,12,20] address this issue by first detecting all nuclei, then classifying each nucleus separately as mitotic or non-mitotic. We follow a different, simpler approach, which does not need any additional ground-truth information and relies on a single trained detector. In particular, we build our training set so that the relatively rare challenging non-mitosis instances are well represented, whereas instances obviously belonging to class non-mitosis (which prevail in the input images) appear only rarely. This approach, loosely inspired by boosting techniques, allows us to spend most of the training time in learning important differences among mitotic and nonmitotic nuclei. We adopt a general approach to building such a training set, without relying on problem-specific heuristics. We build a small training set Sd, which includes all mitosis instances and the same number of non-mitosis instances, uniformly sampled from the 151 million non-mitosis pixels. We use Sd to briefly train a simple DNN classifier Cd. Because Cd is trained on a limited training set in which challenging non-mitosis instances are severely underrepresented, it tends to misclassify most non-mitotic nuclei as class mitosis. We apply Cd to all images in T1 and T2. Let D(p) denote the mitosis probability that Cd assigns to pixel p. D(p) will be large for challenging non-mitosis pixels. We build the actual training set, composed by 1 million instances, which includes all mitosis pixels (6.6% of the training instances). The remaining 95.4% is sampled from non-mitosis pixels by assigning to each pixel p a weight D(p). The resulting optimized training set is used for learning two nets DNN1 and DNN2 (architectures outlined in Table 1). Because the problem is rotationally invariant, during each training epoch each patch is subject to a random rotation around its center and a 50% chance of mirroring, in order to artificially augment the training set. Each unseen image I is processed 16 times: each of the two nets is applied to each of 8 variations of the input image, namely: rotations by k 90, k = 0, 1, 2, 3, with and without mirroring. For each variation, the resulting map is subject to the inverse transformation, to match the input image. The resulting 16 maps are averaged, yielding M, from which a set of detections D I is determined as described in Section 2. The whole procedure is first performed by training the nets on data from T1 and detecting mitosis in T2 images. The threshold yielding the largest F 1 score (t = 0.35) is then determined. The final detector is obtained by training the two nets on data from T1 and T2, and evaluated on T3. Training each network requires one day of computation with an optimized GPU implementation. Less than 30 epochs are needed to reach a minimum on validation
6 6 Table layer architecture for network DNN1 (left) and 11-layer architecture for network DNN2 (right). Layer type: I - input, C - convolutional, MP - max-pooling, FC - fully-connected. Layer Type Maps Filter Weights Connections and neurons size 0 I 3M x 101x101N 1 C 16M x 100x100N 2x MP 16M x 50x50N 2x2 3 C 16M x 48x48N 3x MP 16M x 24x24N 2x2 5 C 16M x 22x22N 3x MP 16M x 11x11N 2x2 7 C 16M x 10x10N 2x MP 16M x 5x5N 2x2 9 C 16M x 4x4N 2x MP 16M x 2x2N 2x2 11 FC 100N 1x FC 2N 1x Layer Type Maps Filter Weights Connections and neurons size 0 I 3M x 101x101N 1 C 16M x 98x98N 4x MP 16M x 49x49N 2x2 3 C 16M x 46x46N 4x MP 16M x 23x23N 2x2 5 C 16M x 20x20N 4x MP 16M x 10x10N 2x2 7 C 16M x 8x8N 3x MP 16M x 4x4N 2x2 9 FC 100N 1x FC 2N 1x data. For detecting mitosis in a single image, our MATLAB implementation required 31 seconds to apply each network on each input variation, which amounts to a total time of roughly 8 minutes per image. Significantly faster results can obtained by averaging fewer variations with minimal performance loss (see Table 2). Performance and Comparison Performance results on the T3 dataset are reported in Table 2. Our approach yields an F-score of 0.782, significantly higher than the F-score obtained by the closest competitor (0.718). The same data is plotted in the Precision- Recall plane in Figure 3. The choice of the detection threshold t affects the resulting F-score: Figure 3 (right) shows that the parameter is not particularly critical, since even significant deviations from t result in limited performance loss. Table 2. Performance results for our approach (DNN) compared with competing approaches [15]. We also report the performance of faster but less accurate versions of our approach, namely: DNNf12, which averages the results of nets DNN1 and DNN2 without computing input variations (1 minute per image), and DNNf1, which is computed only from the result of DNN1 (31s per image). method precision recall F 1 score method precision recall F 1 score DNN NUS DNNf ISIK [19] DNNf ETH-HEILDERBERG [18] IPAL [9] OKAN-IRISA-LIAMA SUTECH IITG NEC [12] DREXEL UTRECHT [20] BII WARWICK [10] QATAR
7 7 Fig. 2. All 143 detections (29 per row) on T3 with scores larger than 0.1, sorted by descending score. For each, we report the corresponding image patch, score, and whether it is a mitosis (TRUE, bright green background) or a non-mitosis (FALSE, dark red background). Vertical dotted line for score 0.35 reports the detection threshold t0 determined on T2. Fig. 3. Left: Performance of our approach compared to others in the PR plane. Right: sensitivity to the choice of threshold. 4 Conclusion and Future Works We presented an approach for mitosis detection that outperformed all competitors on the first public annotated dataset of breast cancer histology images. Future work will aim at validating our approach on larger datasets, and comparing its performance to the one of expert histologists, with the ultimate goal of gradually bringing automated mitosis detection into clinical practice. Acknowledgments This work was partially supported by the Supervised Deep / Recurrent Nets SNF grant, Project Code
8 8 References 1. Behnke, S.: Hierarchical Neural Networks for Image Interpretation, Lecture Notes in Computer Science, vol Springer (2003) 2 2. Ciresan, D.C., Giusti, A., Gambardella, L.M., Schmidhuber, J.: Deep Neural Networks Segment Neuronal Membranes in Electron Microscopy Images. In: Neural Information Processing Systems. pp (2012) 2, 4 3. Ciresan, D.C., Meier, U., Masci, J., Gambardella, L.M., Schmidhuber, J.: Flexible, High Performance Convolutional Neural Networks for Image Classification. In: International Joint Conference on Artificial Intelligence. pp (2011) 2 4. Ciresan, D.C., Meier, U., Schmidhuber, J.: Multi-column Deep Neural Networks for Image Classification. In: Computer Vision and Pattern Recognition. pp (2012) 2 5. Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Learning Hierarchical Features for Scene Labeling. IEEE Transactions on Pattern Analysis and Machine Intelligence (2013), in press 2 6. Fukushima, K.: Neocognitron: A self-organizing neural network for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics 36(4), (1980) 2 7. Geman, S., Bienenstock, E., Doursat, R.: Neural networks and the bias/variance dilemma. Neural computation 4(1), 1 58 (1992) 4 8. Huang, C., Hwee, K.: Automated Mitosis Detection Based on Exclusive Independent Component Analysis. In: Proc. ICPR 2012 (2012) 5 9. Irshad, H.: Automated mitosis detection in histopathology using morphological and multichannel statistics features. Journal of Pathology Informatics 4(1) (2013) Khan, A., ElDaly, H., Rajpoot, N.: A gamma-gaussian mixture model for detection of mitotic cells in breast cancer histopathology images. Journal of Pathology Informatics 4(1) (2013) LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE 86(11), (November 1998) Malon, C., Cosatto, E.: Classification of mitotic figures with convolutional neural networks and seeded blob features. Journal of Pathology Informatics 4(1) (2013) 5, Pan, J., Kanade, T., Chen, M.: Heterogeneous conditional random field: Realizing joint detection and segmentation of cell regions in microscopic images. In: Computer Vision and Pattern Recognition, 2010 IEEE Conference on. pp IEEE (2010) Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nat. Neurosci. 2(11), (1999) Roux, L., Racoceanu, D., Lomnie, N., Kulikova, M., Irshad, H., Klossa, J., Capron, F., Genestie, C., Naour, G.L., Gurcan, M.N.: Mitosis detection in breast cancer histological images An ICPR 2012 contest. Journal of Pathology Informatics 4(1) (2013) Scherer, D., Müller, A., Behnke, S.: Evaluation of Pooling Operations in Convolutional Architectures for Object Recognition. In: International Conference on Artificial Neural Networks (2010) Simard, P.Y., Steinkraus, D., Platt, J.C.: Best practices for convolutional neural networks applied to visual document analysis. In: Seventh International Conference on Document Analysis and Recognition. pp (2003) Sommer, C., Fiaschi, L., Heidelberg, H., Hamprecht, F., Gerlich, D.: Learning-based mitotic cell detection in histopathological images. In: Proc. ICPR 2012 (2012) 5, Tek, F.: Mitosis detection using generic features and an ensemble of cascade adaboosts. Journal of Pathology Informatics 4(1) (2013) Veta, M., van Diestb, P., Pluim, J.: Detecting mitotic figures in breast cancer histopathology images. In: Proc. of SPIE Medical Imaging (2013) 5, 6
Multi-Column Deep Neural Network for Traffic Sign Classification
Multi-Column Deep Neural Network for Traffic Sign Classification Dan Cireşan, Ueli Meier, Jonathan Masci and Jürgen Schmidhuber IDSIA - USI - SUPSI Galleria 2, Manno - Lugano 6928, Switzerland Abstract
Transfer Learning for Latin and Chinese Characters with Deep Neural Networks
Transfer Learning for Latin and Chinese Characters with Deep Neural Networks Dan C. Cireşan IDSIA USI-SUPSI Manno, Switzerland, 6928 Email: [email protected] Ueli Meier IDSIA USI-SUPSI Manno, Switzerland, 6928
Supporting Online Material for
www.sciencemag.org/cgi/content/full/313/5786/504/dc1 Supporting Online Material for Reducing the Dimensionality of Data with Neural Networks G. E. Hinton* and R. R. Salakhutdinov *To whom correspondence
Novelty Detection in image recognition using IRF Neural Networks properties
Novelty Detection in image recognition using IRF Neural Networks properties Philippe Smagghe, Jean-Luc Buessler, Jean-Philippe Urban Université de Haute-Alsace MIPS 4, rue des Frères Lumière, 68093 Mulhouse,
Lecture 6: Classification & Localization. boris. [email protected]
Lecture 6: Classification & Localization boris. [email protected] 1 Agenda ILSVRC 2014 Overfeat: integrated classification, localization, and detection Classification with Localization Detection. 2 ILSVRC-2014
3D Object Recognition using Convolutional Neural Networks with Transfer Learning between Input Channels
3D Object Recognition using Convolutional Neural Networks with Transfer Learning between Input Channels Luís A. Alexandre Department of Informatics and Instituto de Telecomunicações Univ. Beira Interior,
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
Pedestrian Detection with RCNN
Pedestrian Detection with RCNN Matthew Chen Department of Computer Science Stanford University [email protected] Abstract In this paper we evaluate the effectiveness of using a Region-based Convolutional
Online Evolution of Deep Convolutional Network for Vision-Based Reinforcement Learning
Online Evolution of Deep Convolutional Network for Vision-Based Reinforcement Learning Jan Koutník, Jürgen Schmidhuber, and Faustino Gomez IDSIA, USI-SUPSI Galleria 2 Manno-Lugano, CH 6928 {hkou,juergen,tino}@idsia.ch
Stochastic Pooling for Regularization of Deep Convolutional Neural Networks
Stochastic Pooling for Regularization of Deep Convolutional Neural Networks Matthew D. Zeiler Department of Computer Science Courant Institute, New York University [email protected] Rob Fergus Department
Flexible, High Performance Convolutional Neural Networks for Image Classification
Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Flexible, High Performance Convolutional Neural Networks for Image Classification Dan C. Cireşan, Ueli Meier,
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,
Applying Deep Learning to Car Data Logging (CDL) and Driver Assessor (DA) October 22-Oct-15
Applying Deep Learning to Car Data Logging (CDL) and Driver Assessor (DA) October 22-Oct-15 GENIVI is a registered trademark of the GENIVI Alliance in the USA and other countries Copyright GENIVI Alliance
Introduction to Machine Learning CMU-10701
Introduction to Machine Learning CMU-10701 Deep Learning Barnabás Póczos & Aarti Singh Credits Many of the pictures, results, and other materials are taken from: Ruslan Salakhutdinov Joshua Bengio Geoffrey
Morphological segmentation of histology cell images
Morphological segmentation of histology cell images A.Nedzved, S.Ablameyko, I.Pitas Institute of Engineering Cybernetics of the National Academy of Sciences Surganova, 6, 00 Minsk, Belarus E-mail [email protected]
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Supervised and unsupervised learning - 1
Chapter 3 Supervised and unsupervised learning - 1 3.1 Introduction The science of learning plays a key role in the field of statistics, data mining, artificial intelligence, intersecting with areas in
Tattoo Detection for Soft Biometric De-Identification Based on Convolutional NeuralNetworks
1 Tattoo Detection for Soft Biometric De-Identification Based on Convolutional NeuralNetworks Tomislav Hrkać, Karla Brkić, Zoran Kalafatić Faculty of Electrical Engineering and Computing University of
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
CS 1699: Intro to Computer Vision. Deep Learning. Prof. Adriana Kovashka University of Pittsburgh December 1, 2015
CS 1699: Intro to Computer Vision Deep Learning Prof. Adriana Kovashka University of Pittsburgh December 1, 2015 Today: Deep neural networks Background Architectures and basic operations Applications Visualizing
3 An Illustrative Example
Objectives An Illustrative Example Objectives - Theory and Examples -2 Problem Statement -2 Perceptron - Two-Input Case -4 Pattern Recognition Example -5 Hamming Network -8 Feedforward Layer -8 Recurrent
Open-Set Face Recognition-based Visitor Interface System
Open-Set Face Recognition-based Visitor Interface System Hazım K. Ekenel, Lorant Szasz-Toth, and Rainer Stiefelhagen Computer Science Department, Universität Karlsruhe (TH) Am Fasanengarten 5, Karlsruhe
L25: Ensemble learning
L25: Ensemble learning Introduction Methods for constructing ensembles Combination strategies Stacked generalization Mixtures of experts Bagging Boosting CSCE 666 Pattern Analysis Ricardo Gutierrez-Osuna
6.2.8 Neural networks for data mining
6.2.8 Neural networks for data mining Walter Kosters 1 In many application areas neural networks are known to be valuable tools. This also holds for data mining. In this chapter we discuss the use of neural
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus 1. Introduction Facebook is a social networking website with an open platform that enables developers to extract and utilize user information
Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection
CSED703R: Deep Learning for Visual Recognition (206S) Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection Bohyung Han Computer Vision Lab. [email protected] 2 3 Object detection
Image Classification for Dogs and Cats
Image Classification for Dogs and Cats Bang Liu, Yan Liu Department of Electrical and Computer Engineering {bang3,yan10}@ualberta.ca Kai Zhou Department of Computing Science [email protected] Abstract
Self Organizing Maps: Fundamentals
Self Organizing Maps: Fundamentals Introduction to Neural Networks : Lecture 16 John A. Bullinaria, 2004 1. What is a Self Organizing Map? 2. Topographic Maps 3. Setting up a Self Organizing Map 4. Kohonen
Steven C.H. Hoi School of Information Systems Singapore Management University Email: [email protected]
Steven C.H. Hoi School of Information Systems Singapore Management University Email: [email protected] Introduction http://stevenhoi.org/ Finance Recommender Systems Cyber Security Machine Learning Visual
Comparison of Supervised and Unsupervised Learning Classifiers for Travel Recommendations
Volume 3, No. 8, August 2012 Journal of Global Research in Computer Science REVIEW ARTICLE Available Online at www.jgrcs.info Comparison of Supervised and Unsupervised Learning Classifiers for Travel Recommendations
III. SEGMENTATION. A. Origin Segmentation
2012 International Conference on Frontiers in Handwriting Recognition Handwritten English Word Recognition based on Convolutional Neural Networks Aiquan Yuan, Gang Bai, Po Yang, Yanni Guo, Xinting Zhao
Data Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
Supervised Learning (Big Data Analytics)
Supervised Learning (Big Data Analytics) Vibhav Gogate Department of Computer Science The University of Texas at Dallas Practical advice Goal of Big Data Analytics Uncover patterns in Data. Can be used
Neural Networks and Support Vector Machines
INF5390 - Kunstig intelligens Neural Networks and Support Vector Machines Roar Fjellheim INF5390-13 Neural Networks and SVM 1 Outline Neural networks Perceptrons Neural networks Support vector machines
Data Mining Practical Machine Learning Tools and Techniques
Ensemble learning Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 8 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Combining multiple models Bagging The basic idea
Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with Panchromatic Textural Features
Remote Sensing and Geoinformation Lena Halounová, Editor not only for Scientific Cooperation EARSeL, 2011 Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
COMBINED NEURAL NETWORKS FOR TIME SERIES ANALYSIS
COMBINED NEURAL NETWORKS FOR TIME SERIES ANALYSIS Iris Ginzburg and David Horn School of Physics and Astronomy Raymond and Beverly Sackler Faculty of Exact Science Tel-Aviv University Tel-A viv 96678,
arxiv:1312.6034v2 [cs.cv] 19 Apr 2014
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps arxiv:1312.6034v2 [cs.cv] 19 Apr 2014 Karen Simonyan Andrea Vedaldi Andrew Zisserman Visual Geometry Group,
Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05
Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification
A Dynamic Convolutional Layer for Short Range Weather Prediction
A Dynamic Convolutional Layer for Short Range Weather Prediction Benjamin Klein, Lior Wolf and Yehuda Afek The Blavatnik School of Computer Science Tel Aviv University [email protected], [email protected],
Tracking and Recognition in Sports Videos
Tracking and Recognition in Sports Videos Mustafa Teke a, Masoud Sattari b a Graduate School of Informatics, Middle East Technical University, Ankara, Turkey [email protected] b Department of Computer
Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
Local features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Efficient online learning of a non-negative sparse autoencoder
and Machine Learning. Bruges (Belgium), 28-30 April 2010, d-side publi., ISBN 2-93030-10-2. Efficient online learning of a non-negative sparse autoencoder Andre Lemme, R. Felix Reinhart and Jochen J. Steil
Selecting Receptive Fields in Deep Networks
Selecting Receptive Fields in Deep Networks Adam Coates Department of Computer Science Stanford University Stanford, CA 94305 [email protected] Andrew Y. Ng Department of Computer Science Stanford
The Delicate Art of Flower Classification
The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC [email protected] Note: The following is my contribution to a group project for a graduate machine learning
Poker Vision: Playing Cards and Chips Identification based on Image Processing
Poker Vision: Playing Cards and Chips Identification based on Image Processing Paulo Martins 1, Luís Paulo Reis 2, and Luís Teófilo 2 1 DEEC Electrical Engineering Department 2 LIACC Artificial Intelligence
Signature Segmentation from Machine Printed Documents using Conditional Random Field
2011 International Conference on Document Analysis and Recognition Signature Segmentation from Machine Printed Documents using Conditional Random Field Ranju Mandal Computer Vision and Pattern Recognition
WATER BODY EXTRACTION FROM MULTI SPECTRAL IMAGE BY SPECTRAL PATTERN ANALYSIS
WATER BODY EXTRACTION FROM MULTI SPECTRAL IMAGE BY SPECTRAL PATTERN ANALYSIS Nguyen Dinh Duong Department of Environmental Information Study and Analysis, Institute of Geography, 18 Hoang Quoc Viet Rd.,
T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN
T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN Goal is to process 360 degree images and detect two object categories 1. Pedestrians,
ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)
ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING) Gabriela Ochoa http://www.cs.stir.ac.uk/~goc/ OUTLINE Preliminaries Classification and Clustering Applications
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht [email protected] 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht [email protected] 539 Sennott
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
Random Forest Based Imbalanced Data Cleaning and Classification
Random Forest Based Imbalanced Data Cleaning and Classification Jie Gu Software School of Tsinghua University, China Abstract. The given task of PAKDD 2007 data mining competition is a typical problem
Image Normalization for Illumination Compensation in Facial Images
Image Normalization for Illumination Compensation in Facial Images by Martin D. Levine, Maulin R. Gandhi, Jisnu Bhattacharyya Department of Electrical & Computer Engineering & Center for Intelligent Machines
Automated Image Forgery Detection through Classification of JPEG Ghosts
Automated Image Forgery Detection through Classification of JPEG Ghosts Fabian Zach, Christian Riess and Elli Angelopoulou Pattern Recognition Lab University of Erlangen-Nuremberg {riess,elli}@i5.cs.fau.de
Robust Real-Time Face Detection
Robust Real-Time Face Detection International Journal of Computer Vision 57(2), 137 154, 2004 Paul Viola, Michael Jones 授 課 教 授 : 林 信 志 博 士 報 告 者 : 林 宸 宇 報 告 日 期 :96.12.18 Outline Introduction The Boost
Machine Learning Final Project Spam Email Filtering
Machine Learning Final Project Spam Email Filtering March 2013 Shahar Yifrah Guy Lev Table of Content 1. OVERVIEW... 3 2. DATASET... 3 2.1 SOURCE... 3 2.2 CREATION OF TRAINING AND TEST SETS... 4 2.3 FEATURE
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles [email protected] and lunbo
A Study of Automatic License Plate Recognition Algorithms and Techniques
A Study of Automatic License Plate Recognition Algorithms and Techniques Nima Asadi Intelligent Embedded Systems Mälardalen University Västerås, Sweden [email protected] ABSTRACT One of the most
SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING
AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations
Speed Performance Improvement of Vehicle Blob Tracking System
Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA [email protected], [email protected] Abstract. A speed
Visualization of Breast Cancer Data by SOM Component Planes
International Journal of Science and Technology Volume 3 No. 2, February, 2014 Visualization of Breast Cancer Data by SOM Component Planes P.Venkatesan. 1, M.Mullai 2 1 Department of Statistics,NIRT(Indian
A Stock Pattern Recognition Algorithm Based on Neural Networks
A Stock Pattern Recognition Algorithm Based on Neural Networks Xinyu Guo [email protected] Xun Liang [email protected] Xiang Li [email protected] Abstract pattern respectively. Recent
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS B.K. Mohan and S. N. Ladha Centre for Studies in Resources Engineering IIT
Blood Vessel Classification into Arteries and Veins in Retinal Images
Blood Vessel Classification into Arteries and Veins in Retinal Images Claudia Kondermann and Daniel Kondermann a and Michelle Yan b a Interdisciplinary Center for Scientific Computing (IWR), University
Towards better accuracy for Spam predictions
Towards better accuracy for Spam predictions Chengyan Zhao Department of Computer Science University of Toronto Toronto, Ontario, Canada M5S 2E4 [email protected] Abstract Spam identification is crucial
Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence
Artificial Neural Networks and Support Vector Machines CS 486/686: Introduction to Artificial Intelligence 1 Outline What is a Neural Network? - Perceptron learners - Multi-layer networks What is a Support
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Make and Model Recognition of Cars
Make and Model Recognition of Cars Sparta Cheung Department of Electrical and Computer Engineering University of California, San Diego 9500 Gilman Dr., La Jolla, CA 92093 [email protected] Alice Chu Department
Image Analysis Using the Aperio ScanScope
Image Analysis Using the Aperio ScanScope Allen H. Olson, PhD Algorithm Development Engineer Aperio Technologies INTRODUCTION Why should I choose the Aperio ScanScope over competing systems for image analysis?
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
The Artificial Prediction Market
The Artificial Prediction Market Adrian Barbu Department of Statistics Florida State University Joint work with Nathan Lay, Siemens Corporate Research 1 Overview Main Contributions A mathematical theory
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES Contents 1. Random variables and measurable functions 2. Cumulative distribution functions 3. Discrete
Monotonicity Hints. Abstract
Monotonicity Hints Joseph Sill Computation and Neural Systems program California Institute of Technology email: [email protected] Yaser S. Abu-Mostafa EE and CS Deptartments California Institute of Technology
A Learning Algorithm For Neural Network Ensembles
A Learning Algorithm For Neural Network Ensembles H. D. Navone, P. M. Granitto, P. F. Verdes and H. A. Ceccatto Instituto de Física Rosario (CONICET-UNR) Blvd. 27 de Febrero 210 Bis, 2000 Rosario. República
STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
Evaluation & Validation: Credibility: Evaluating what has been learned
Evaluation & Validation: Credibility: Evaluating what has been learned How predictive is a learned model? How can we evaluate a model Test the model Statistical tests Considerations in evaluating a Model
Introduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
Unit 9 Describing Relationships in Scatter Plots and Line Graphs
Unit 9 Describing Relationships in Scatter Plots and Line Graphs Objectives: To construct and interpret a scatter plot or line graph for two quantitative variables To recognize linear relationships, non-linear
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network
, pp.67-76 http://dx.doi.org/10.14257/ijdta.2016.9.1.06 The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network Lihua Yang and Baolin Li* School of Economics and
Colour Image Segmentation Technique for Screen Printing
60 R.U. Hewage and D.U.J. Sonnadara Department of Physics, University of Colombo, Sri Lanka ABSTRACT Screen-printing is an industry with a large number of applications ranging from printing mobile phone
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Performance Metrics for Graph Mining Tasks
Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics Supervised Learning Performance Metrics Unsupervised Learning Performance Metrics Optimizing Metrics Statistical
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
Advanced Ensemble Strategies for Polynomial Models
Advanced Ensemble Strategies for Polynomial Models Pavel Kordík 1, Jan Černý 2 1 Dept. of Computer Science, Faculty of Information Technology, Czech Technical University in Prague, 2 Dept. of Computer
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
Implementation of Canny Edge Detector of color images on CELL/B.E. Architecture.
Implementation of Canny Edge Detector of color images on CELL/B.E. Architecture. Chirag Gupta,Sumod Mohan K [email protected], [email protected] Abstract In this project we propose a method to improve
arxiv:1505.04597v1 [cs.cv] 18 May 2015
U-Net: Convolutional Networks for Biomedical Image Segmentation Olaf Ronneberger, Philipp Fischer, and Thomas Brox arxiv:1505.04597v1 [cs.cv] 18 May 2015 Computer Science Department and BIOSS Centre for
An Early Attempt at Applying Deep Reinforcement Learning to the Game 2048
An Early Attempt at Applying Deep Reinforcement Learning to the Game 2048 Hong Gui, Tinghan Wei, Ching-Bo Huang, I-Chen Wu 1 1 Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan
Blog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
Clustering & Visualization
Chapter 5 Clustering & Visualization Clustering in high-dimensional databases is an important problem and there are a number of different clustering paradigms which are applicable to high-dimensional data.
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
Recurrent Neural Networks
Recurrent Neural Networks Neural Computation : Lecture 12 John A. Bullinaria, 2015 1. Recurrent Neural Network Architectures 2. State Space Models and Dynamical Systems 3. Backpropagation Through Time
Taking Inverse Graphics Seriously
CSC2535: 2013 Advanced Machine Learning Taking Inverse Graphics Seriously Geoffrey Hinton Department of Computer Science University of Toronto The representation used by the neural nets that work best
Università degli Studi di Bologna
Università degli Studi di Bologna DEIS Biometric System Laboratory Incremental Learning by Message Passing in Hierarchical Temporal Memory Davide Maltoni Biometric System Laboratory DEIS - University of
