Preprocessing of Screening Mammograms Based on Local Statistical Models
|
|
|
- Jean Underwood
- 10 years ago
- Views:
Transcription
1 Preprocessing of Screening Mammograms Based on Local Statistical Models Jiří Grim Institute of Information Theory and Automation Czech Academy of Sciences P.O. BOX 18, Prague 8 Czech Republic [email protected] ABSTRACT In a recent paper we have proposed evaluation of screening mammograms by means of local statistical model. The model describes local statistical properties of internal pixels of a small search window - as they occur when scanning the mammogram. It is defined as a mixture of Gaussian densities which can be estimated by EM algorithm from data obtained by shifting the search window. The estimated Gaussian mixture is used to compute at all window positions the log-likelihood values which can be displayed as grey levels at the respective window centers. The resulting log-likelihood image closely correlates with the structural details of the original mammogram and emphasizes potential malignant findings as untypical locations of high novelty. In this paper we discuss the possibilities to enhance the log-likelihood image for diagnostic purposes. Categories and Subject Descriptors I.4 [Image Processing And Computer Vision]: Image processing software; J.3 [Life And Medical Sciences]: Health General Terms Algorithms, Theory 1. INTRODUCTION Mammographic screening is currently the most effective and widely accepted strategy to detect early stages of breast cancer and to reduce the related mortality rates by appropriate treatment. In many countries there are numerous screening programs producing millions of mammograms to be evaluated by specially trained radiologists. Reading of mammograms is a difficult task underlying very strong reliability requirements. Moreover, the early subtle stage of pathological finding is of key importance since palpable malignant lesions already can make metastases. One can use Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Copyright 20XX ACM X-XXXXX-XX-X/XX/XX...$ double reading to reduce the risk of possible false negative evaluation of mammograms but, as a standard of care, this approach becomes too resource demanding. For obvious reasons there is a strong motivation to help radiologists by using computer-aided decision-supporting systems. However, despite of long history of research in this area, there is no satisfactory solution of the early breast cancer detection problem. From the point of view of statistical decision-making the evaluation of screening mammograms is known to be extremely complex. The malignant abnormalities may be essentially of two kinds, so called masses and calcifications, but they may have different shapes, margins or distributions, may differ in size and, unlike typical pattern recognition problems, their locations in mammograms may be nearly arbitrary. The aim of breast cancer screening is to detect malignant lesions at very early stages, i.e. of very small size and not fully developed form. Simultaneously, the location of the suspect finding is of basic importance. In this connection there is a serious question if the diagnostic evaluation of screening mammograms can reasonably be solved as a classification problem. Let us recall that, considering a statistical decision problem, we assume that some multivariate observations have to be classified with respect to a finite number of classes. In order to design a statistical decision rule we need sufficiently large set of training data and a comparably large test-data set having identical properties. According to general experience the size of training data set should be much higher than the data dimension but, unfortunately, the dimension of screening mammogram (considered as a data vector) is given by the very high number of pixels. Since any subsampling could cause an unacceptable information loss, the widely accepted approach is to reduce the decision-making only to a limited region of interest (ROI). Of course, in such a case the classifier must evaluate the content of a sliding window in all possible mammogram positions to specify ROI or the suspect locations have to be chosen by radiologist or found by another special procedure. (Note that two ROI s in adjacent positions produce completely different data vectors since the internal pixel values are shifted into different dimensions.) Nevertheless, even a reasonably small ROI must contain hundreds or rather thousands of pixels in order to encompass possible greater findings and therefore some feature extraction method would be indispensable. We recall that, even if we would succeed to extract a small number of informative features (invariant with respect to
2 the size, shape and location of the finding), it would be very difficult to create a reasonably representative training set (or training sets for all possible types of malignant lesions). The reason is the extreme diversity of malignant abnormalities and high natural variability of mammograms differing by age and type (dense, fatty) and additionally influenced by the positional noise due to manual placement of the breasts by the radiographer. It is probable that, under strongly independent test conditions and without any preselection of mammograms the classification performance would not be very optimistic. At present the widely used computer-aided detection systems (CADe) just only place prompts on mammogram to direct the attention of the radiologists toward potential abnormalities. According to different studies (cf. [3, 2, 4, 6]), CADe systems can improve breast cancer detection by more than 20% but the false positive rates of placing prompts are high, usually in units per image [3, 11]. Thus, because of the low incidence of malignant findings ( %), standard CADe systems generate hundreds or even thousands of false positive prompts in order to correctly mark one malignant lesion. 2. LOCAL STATISTICAL MODEL In a recent paper [8] we have proposed preprocessing of screening mammograms by means of local statistical models. The idea of the method is to emphasize diagnostically important details as unusual locations of high novelty. We estimate the local statistical model of a mammogram as a joint probability density of pixel values in a suitably chosen search window. In particular, we have used a square window of 13x13 pixels with trimmed corners with the resulting dimension N=145 (cf. [8] for more details). Denoting by x = (x 1, x 2,..., x N ) R N the real vector of grey levels of the window inside in a fixed pixel arrangement, we assume the joint probability density P (x) in the form of a mixture of Gaussian components P (x) = w mf (x µ m, σ m), x R N. (1) Here we denote M = {1,..., M}, and N = {1,..., N} the index sets of components and variables respectively, and define the mixture components as products of univariate Gaussian densities F (x µ m, σ m) = f n(x n µ mn, σ mn), x R N, (2) f n(x n µ mn, σ mn) = n N 1 (xn µmn)2 exp{ }. (3) 2πσmn 2σmn 2 The standard way to estimate Gaussian mixture is to apply EM algorithm. We use the data set S obtained by pixelwise scanning the original mammogram with the search window S = {x (1), x (2),... }, x (i) R N. (4) The corresponding log-likelihood function [ ] L = 1 log w mf (x µ S m, σ m) (5) can be maximized by the following EM iteration equations (cf. [7, 9]) (m M, n N, x S) q(m x) = wmf (x µ m, σm) j M wjf (x µ j, σ j), (6) w m = 1 S q(m x), (7) µ mn = 1 w m S (σ mn) 2 = 1 w m S x nq(m x), (8) (x n µ mn) 2 q(m x). (9) Here the apostrophe denotes the new parameter values in each iteration. In our experiments we have used M = 36 components and the component parameters have been initialized randomly (cf. [8] for more details). The local statistical model is estimated from each mammogram individually and therefore the method is not confronted with the natural variability of mammograms. This circumstance is relevant in view of the extreme diversity of malignant lesions. We recall that the mixture model (1) is invariant with respect to the linear transform of the underlying grey scale (cf. [8], Appendix I). Let us remark that the estimation of Gaussian mixtures may cause computational difficulties in multidimensional spaces. For dimensions of order N 10 2 the component values (2) may become too small to be correctly represented in memory and therefore the EM algorithm may become instable. The problem may be easily removed if we compute the components in logarithmic form. By adding a suitably chosen constant we actually multiply the components by a coefficient which can be reduced in the fraction (6). In this way only very small component values will be neglected against the current maximum. 3. LOG-LIKELIHOOD IMAGE Having estimated the parameters we evaluate the mixture density P (x) at each window position for the underlying window-patch vector x and display the corresponding loglikelihood value log P (x) as grey level at the central reference pixel of the window. In this sense the light grey levels correspond to the typical highly probable parts of the image and the dark values reflect the less-probable, untypical or unusual locations. In this way, the resulting log likelihood image should facilitate the identification of malignant abnormalities as locations of high novelty - exactly in the sense of the idea proposed by Rose and Taylor and others (cf. [13, 12]). A frequent subject of different CADe techniques is the detection of microcalcifications usually based on some kind of thresholding (cf. e.g., [5] for extensive references). In case of log-likelihood image, the detection of microcalcifications is closely related to the underlying Gaussian mixture. At all window positions containing a disturbing light pixel we obtain lower log-likelihood values (cf. (2)). As a result, the log-likelihood image will contain a dark spot of window size centered on the light pixel. The size and darkness of the spot continuously depend on the size and contrast of the underlying micro-calcification, respectively.
3 Figure 1: Original image of the mammogram B (right- and left medio-lateral-oblique parts) from the DDSM database [1]. In the left upper part of the image there is a malignant circumscribed mass with lobulated margins. Comparing the original mammogram and the corresponding log-likelihood image, we can see that structural details closely correlate in both images but the boundaries of different regions are emphasized. The tendency of the local statistical model to create contour lines is an artefact having a simple theoretical reason. The log-likelihood values log P (x) are typically dominated by a single component of the mixture, which is most adequate to the underlying region. A detailed numerical observation shows that the switching of dominating mixture components is responsible for the arising dark contour lines at the boundaries of regions having different textural properties. The most apparent demonstration of this mechanism can be seen at the margins of the breast region which are characterized by continuously decreasing grey levels (cf. Fig. 1 and 2). We observe that, unlike iso-intensity contours (cf. e.g., [10]), the contour lines produced by mixture components should generally emphasize regions of similar textural properties. 4. MODIFIED STATISTICAL MODEL Let us recall that the log-likelihood image is a purely statistical tool based on the local statistical model. There is no specific relation between the log-likelihood image and screening mammography, except for the hypotheses that the malignant lesions can be identified as unusual locations of high novelty. The advantage of this principle is that the method need not be trained, it is generally applicable and, unlike statistical pattern recognition, the high natural variability of mammograms and diversity of malignant abnormalities do not cause any specific problems. The resulting preprocessed mammogram may be helpful to the radiologists because any asymmetry of structural details becomes well visible, the regions of identical textural properties are emphasized by contour lines and microcalcifications are displayed as dark well visible spots. On the other hand, the most dominant features of the loglikelihood image typically correspond to the light structural details which represent a small part of the mammogram and therefore occur as untypical. Consequently, they correspond to dark grey levels but they are usually quite normal without any pathological meaning. We would like modify the log-likelihood image with the aim to suppress the irrelevant features and increase the sensitivity of the method with respect to the potentially pathological findings. There are different possibilities to modify the method by changing the structure of the search window or by including additional variables to the data vector (cf. [8]) but also the EM iteration equations can be modified. A promising approach provides the weighted version of EM algorithm. In particular, denoting γ(x) the relative frequency of x in the training data set S : { N(x)/ S, for x S, γ(x) = (10) 0, for x / S
4 Figure 2: The log-likelihood image of the original mammogram from Fig.1. (in the upper part) has the same resolution. Each pixel value is defined by log-likelihood value log P (x) where x is the 145-dimensional vector defined by the search window. The malignant lesion is partly emphasized by contour lines. The lower part of the figure shows the modified (weighted) version of the log-likelihood image. The resulting weighted log-likelihood image (in the lower part) contains finer details and additional contour lines in the light region.
5 and X the subset of R N with γ(x) > 0: X = {x R N : γ(x) > 0}, ( γ(x) = 1) we can rewrite the criterion (5) and the EM iteration equations (7) - (9) equivalently in the following form: L = [ ] γ(x) log w mf (x µ m, σ m), (11) w m = γ(x)q(m x), (12) µ mn = 1 w m (σ mn) 2 = 1 w m x nγ(x)q(m x), (13) (x n µ mn) 2 γ(x)q(m x). (14) Note that, in the above equations, we may specify arbitrary weights γ(x) which can be interpreted as a repeated occurrence of the respective vectors in the data set. In this way the weighted EM algorithm can be used to increase the meaning of different data vectors individually. In this paper we derive the weight function directly from the original mammogram (cf. Fig.1). In order to address the problem of the too emphasized light normal regions we define the weight function by Eq. γ(x) = 1 x n. (15) N n N In other words, if we use the mean value of the grey levels x n in the window as a weight, then the presence of light locations in the data set will artificially increase. Consequently, they will be less unusual and therefore less emphasized. In the experiment we have used the modified EM algorithm (11) - (14) with the weight function (15). It can be seen that the resulting weighted log-likelihood image (cf. Fig.2, lower part) contains finer details and additional contour lines in the light region. In this sense the weighted modification of our method could be useful in case of dense mammograms when the identification of malignant masses is particularly difficult. 5. CONCLUSION We propose a weighted modification of a recently proposed method of diagnostic enhancement of screening mammograms. In our experiments we have used the mean grey level of the window inside as a weighting function. In comparison with the original log-likelihood image the modified image shows finer structural details and additional contour lines. Again, there is no specific relation between the modified method and screening mammography, only the underlying high-novelty hypotheses has been slightly modified. Nevertheless, we recall that there is wide variety of other possibilities to specify the weight function which could improve the diagnostic conspicuity of the final log-likelihood image. Acknowledgment This work was supported by the MŠMT projects 2C06019 ZI- MOLEZ and 1M0572 DAR. 6. REFERENCES [1] K. W. Bowyer, D. Kopans, W. P. Kegelmeyer, R. Moore, and M. Sallam. The digital database for screening mammography. Kluwer Academic Publishers, [2] R. F. Brem, J. K. Baum, S. Kaplan, S. Sounders, L. G. Naul, and J. Hoffmeister. Improvement in sensitivity screening mammography with computer-aided detection (cad): a multi-institutional trial. American Journal of Roentgenology, 181: , [3] R. F. Brem, J. A. Rapelyea, G. Zisman, J. W. Hoffmeister, and M. P. DeSimio. Evaluation of breast cancer with a computer-aided detection system by mammographic appearance and histopathology. Cancer, 104(5): , [4] W. L. J. Burhenne, S. A. Wood, and C. J. D. et al. Potential contribution of computer-aided detection to the sensitivity of screening mammography. Radiology, 215: , [5] H. D. Cheng, X. J. Shi, R. Min, L. M. Hu, X. P. Cai, and H. N. Du. Approaches for automated detection and classification of masses in mammograms. Pattern Recognition, 39(11): , November [6] T. W. Freer and M. J. Ulissey. Screening mammography with computer-aided detection: prospective study of 12,860 patients in a community breast center. Radiology, 220: , [7] J. Grim, M. Haindl, P. Somol, and P. Pudil. A subspace approach to texture modelling by using gaussian mixtures. In Proceedings of the 18th IAPR International Conference on Pattern Recognition ICPR 2006, volume 2, pages , Los Alamitos, CA, USA, IEEE Computer Society. [8] J. Grim, P. Somol, M. Haindl, and J. Danes. Computer-aided evaluation of screening mammograms based on local texture models. Image Processing, IEEE Transactions on, 18(4): , april [9] J. Grim, P. Somol, M. Haindl, and P. Pudil. A statistical approach to local evaluation of a single texture image. In Proceedings of the 16-th Annual Symposium PRASA 2005, pages , [10] D. Guliato, R. M. Rangayyan, J. A. Zuffo,, and J. E. L. Desautels. Detection of breast tumor boundaries using iso-intensity contours and dynamic thresholding. In Proc. 4th Int. Workshop on Digital Mammography IWDM 1998, pages , Nijmegen, The Netherlands, [11] N. R. Mudigonda, R. M. Rangayyan, and J. E. L. Desautels. Detection of breast masses in mammograms by density slicing and texture flow-field analysis. IEEE Trans. Med. Imaging, 20: , November [12] C. Spence, L. Parra, and P. Sajda. Detection, synthesis and compression in mammographic image analysis with a hierarchical image probability model. In Proc. of the IEEE Workshop on Mathematical Methods in Biomedical Image Analysis, [13] L. Tarassenko, P. Hayton, and N. C. et al. Novelty detection for the identification of masses in mammograms. In Proceedings of the 4th IEE International Conference on Artificial Neural Networks, page 442Ű447, Cambridge, UK, 1995.
Determining optimal window size for texture feature extraction methods
IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
DYNAMIC RANGE IMPROVEMENT THROUGH MULTIPLE EXPOSURES. Mark A. Robertson, Sean Borman, and Robert L. Stevenson
c 1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or
Component Ordering in Independent Component Analysis Based on Data Power
Component Ordering in Independent Component Analysis Based on Data Power Anne Hendrikse Raymond Veldhuis University of Twente University of Twente Fac. EEMCS, Signals and Systems Group Fac. EEMCS, Signals
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. [email protected]
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA
Hologic Selenia Dimensions C-View Software Module. October 24, 2012
Hologic Selenia Dimensions C-View Software Module October 24, 2012 Introduction and Agenda Peter Soltani, Ph.D. Senior VP & GM, Breast Health Hologic, Inc. Agenda Technology Overview Clinical Overview
Multivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
Data Mining. Cluster Analysis: Advanced Concepts and Algorithms
Data Mining Cluster Analysis: Advanced Concepts and Algorithms Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 1 More Clustering Methods Prototype-based clustering Density-based clustering Graph-based
ADVANCED MACHINE LEARNING. Introduction
1 1 Introduction Lecturer: Prof. Aude Billard ([email protected]) Teaching Assistants: Guillaume de Chambrier, Nadia Figueroa, Denys Lamotte, Nicola Sommer 2 2 Course Format Alternate between: Lectures
Robert Collins CSE598G. More on Mean-shift. R.Collins, CSE, PSU CSE598G Spring 2006
More on Mean-shift R.Collins, CSE, PSU Spring 2006 Recall: Kernel Density Estimation Given a set of data samples x i ; i=1...n Convolve with a kernel function H to generate a smooth function f(x) Equivalent
Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression
Logistic Regression Department of Statistics The Pennsylvania State University Email: [email protected] Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max
Data, Measurements, Features
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
Statistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
Morphological segmentation of histology cell images
Morphological segmentation of histology cell images A.Nedzved, S.Ablameyko, I.Pitas Institute of Engineering Cybernetics of the National Academy of Sciences Surganova, 6, 00 Minsk, Belarus E-mail [email protected]
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles [email protected] and lunbo
Visualization of Breast Cancer Data by SOM Component Planes
International Journal of Science and Technology Volume 3 No. 2, February, 2014 Visualization of Breast Cancer Data by SOM Component Planes P.Venkatesan. 1, M.Mullai 2 1 Department of Statistics,NIRT(Indian
Clustering & Visualization
Chapter 5 Clustering & Visualization Clustering in high-dimensional databases is an important problem and there are a number of different clustering paradigms which are applicable to high-dimensional data.
Personalized Breast Screening Service
Frequently Asked Questions WHAT IS BREAST DENSITY? Breasts are made up of a mixture of fibrous, glandular and fatty tissue. Your breasts are considered if you have predominantly fibrous or glandular tissue
Breast Imaging Made Brief and Simple. Jane Clayton MD Associate Professor Department of Radiology LSUHSC New Orleans, LA
Breast Imaging Made Brief and Simple Jane Clayton MD Associate Professor Department of Radiology LSUHSC New Orleans, LA What women are referred for breast imaging? Two groups of women are referred for
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 123 CHAPTER 7 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 7.1 Introduction Even though using SVM presents
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
for breast cancer detection
Microwave imaging for breast cancer detection Sara Salvador Summary Epidemiology of breast cancer The early diagnosis: State of the art Microwave imaging: The signal processing algorithm 2D/3D models and
ECE 533 Project Report Ashish Dhawan Aditi R. Ganesan
Handwritten Signature Verification ECE 533 Project Report by Ashish Dhawan Aditi R. Ganesan Contents 1. Abstract 3. 2. Introduction 4. 3. Approach 6. 4. Pre-processing 8. 5. Feature Extraction 9. 6. Verification
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS B.K. Mohan and S. N. Ladha Centre for Studies in Resources Engineering IIT
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
Lecture 9: Introduction to Pattern Analysis
Lecture 9: Introduction to Pattern Analysis g Features, patterns and classifiers g Components of a PR system g An example g Probability definitions g Bayes Theorem g Gaussian densities Features, patterns
Object Recognition and Template Matching
Object Recognition and Template Matching Template Matching A template is a small image (sub-image) The goal is to find occurrences of this template in a larger image That is, you want to find matches of
Image Segmentation and Registration
Image Segmentation and Registration Dr. Christine Tanner ([email protected]) Computer Vision Laboratory, ETH Zürich Dr. Verena Kaynig, Machine Learning Laboratory, ETH Zürich Outline Segmentation
CS229 Lecture notes. Andrew Ng
CS229 Lecture notes Andrew Ng Part X Factor analysis Whenwehavedatax (i) R n thatcomesfromamixtureofseveral Gaussians, the EM algorithm can be applied to fit a mixture model. In this setting, we usually
RUN-LENGTH ENCODING FOR VOLUMETRIC TEXTURE
RUN-LENGTH ENCODING FOR VOLUMETRIC TEXTURE Dong-Hui Xu, Arati S. Kurani, Jacob D. Furst, Daniela S. Raicu Intelligent Multimedia Processing Laboratory, School of Computer Science, Telecommunications, and
New Hash Function Construction for Textual and Geometric Data Retrieval
Latest Trends on Computers, Vol., pp.483-489, ISBN 978-96-474-3-4, ISSN 79-45, CSCC conference, Corfu, Greece, New Hash Function Construction for Textual and Geometric Data Retrieval Václav Skala, Jan
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
Document Image Retrieval using Signatures as Queries
Document Image Retrieval using Signatures as Queries Sargur N. Srihari, Shravya Shetty, Siyuan Chen, Harish Srinivasan, Chen Huang CEDAR, University at Buffalo(SUNY) Amherst, New York 14228 Gady Agam and
How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm
IJSTE - International Journal of Science Technology & Engineering Volume 1 Issue 10 April 2015 ISSN (online): 2349-784X Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode
The File-Card-Browser View for Breast DCE-MRI Data
The File-Card-Browser View for Breast DCE-MRI Data Sylvia Glaßer 1, Kathrin Scheil 1, Uta Preim 2, Bernhard Preim 1 1 Department of Simulation and Graphics, University of Magdeburg 2 Department of Radiology,
Signal Detection. Outline. Detection Theory. Example Applications of Detection Theory
Outline Signal Detection M. Sami Fadali Professor of lectrical ngineering University of Nevada, Reno Hypothesis testing. Neyman-Pearson (NP) detector for a known signal in white Gaussian noise (WGN). Matched
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Subspace Analysis and Optimization for AAM Based Face Alignment
Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China [email protected] Stan Z. Li Microsoft
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
STATISTICAL ANALYSIS OF ULTRASOUND ECHO FOR SKIN LESIONS CLASSIFICATION HANNA PIOTRZKOWSKA, JERZY LITNIEWSKI, ELŻBIETA SZYMAŃSKA *, ANDRZEJ NOWICKI
STATISTICAL ANALYSIS OF ULTRASOUND ECHO FOR SKIN LESIONS CLASSIFICATION HANNA PIOTRZKOWSKA, JERZY LITNIEWSKI, ELŻBIETA SZYMAŃSKA *, ANDRZEJ NOWICKI Institute of Fundamental Technological Research, Department
Unsupervised Data Mining (Clustering)
Unsupervised Data Mining (Clustering) Javier Béjar KEMLG December 01 Javier Béjar (KEMLG) Unsupervised Data Mining (Clustering) December 01 1 / 51 Introduction Clustering in KDD One of the main tasks in
Mammography Education, Inc.
Mammography Education, Inc. 2011 LÁSZLÓ TABÁR, M.D.,F.A.C.R (Hon) 3D image of a milk duct MULTIMODALITY DETECTION and DIAGNOSIS of BREAST DISEASES PRAGUE, Czech Republic Crown Plaza, Prague June 29 - July
The Big Data mining to improve medical diagnostics quality
The Big Data mining to improve medical diagnostics quality Ilyasova N.Yu., Kupriyanov A.V. Samara State Aerospace University, Image Processing Systems Institute, Russian Academy of Sciences Abstract. The
Euler Vector: A Combinatorial Signature for Gray-Tone Images
Euler Vector: A Combinatorial Signature for Gray-Tone Images Arijit Bishnu, Bhargab B. Bhattacharya y, Malay K. Kundu, C. A. Murthy fbishnu t, bhargab, malay, [email protected] Indian Statistical Institute,
Norbert Schuff Professor of Radiology VA Medical Center and UCSF [email protected]
Norbert Schuff Professor of Radiology Medical Center and UCSF [email protected] Medical Imaging Informatics 2012, N.Schuff Course # 170.03 Slide 1/67 Overview Definitions Role of Segmentation Segmentation
Statistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
Application Note. The Optimization of Injection Molding Processes Using Design of Experiments
The Optimization of Injection Molding Processes Using Design of Experiments PROBLEM Manufacturers have three primary goals: 1) produce goods that meet customer specifications; 2) improve process efficiency
Predict the Popularity of YouTube Videos Using Early View Data
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
A Method of Caption Detection in News Video
3rd International Conference on Multimedia Technology(ICMT 3) A Method of Caption Detection in News Video He HUANG, Ping SHI Abstract. News video is one of the most important media for people to get information.
Supervised and unsupervised learning - 1
Chapter 3 Supervised and unsupervised learning - 1 3.1 Introduction The science of learning plays a key role in the field of statistics, data mining, artificial intelligence, intersecting with areas in
Mixtures of Robust Probabilistic Principal Component Analyzers
Mixtures of Robust Probabilistic Principal Component Analyzers Cédric Archambeau, Nicolas Delannay 2 and Michel Verleysen 2 - University College London, Dept. of Computer Science Gower Street, London WCE
Breast Ultrasound: Benign vs. Malignant Lesions
October 25-November 19, 2004 Breast Ultrasound: Benign vs. Malignant Lesions Jill Steinkeler,, Tufts University School of Medicine IV Breast Anatomy Case Presentation-Patient 1 62 year old woman with a
Digital Mammogram National Database
Digital Mammogram National Database Professor Michael Brady FRS FREng Medical Vision Laboratory Oxford University Chairman: Mirada Solutions Ltd PharmaGrid 2/7/03 ediamond aims construct a federated database
Making Sense of the Mayhem: Machine Learning and March Madness
Making Sense of the Mayhem: Machine Learning and March Madness Alex Tran and Adam Ginzberg Stanford University [email protected] [email protected] I. Introduction III. Model The goal of our research
Probabilistic Latent Semantic Analysis (plsa)
Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg [email protected] www.multimedia-computing.{de,org} References
Palmprint Recognition. By Sree Rama Murthy kora Praveen Verma Yashwant Kashyap
Palmprint Recognition By Sree Rama Murthy kora Praveen Verma Yashwant Kashyap Palm print Palm Patterns are utilized in many applications: 1. To correlate palm patterns with medical disorders, e.g. genetic
Convolution. 1D Formula: 2D Formula: Example on the web: http://www.jhu.edu/~signals/convolve/
Basic Filters (7) Convolution/correlation/Linear filtering Gaussian filters Smoothing and noise reduction First derivatives of Gaussian Second derivative of Gaussian: Laplacian Oriented Gaussian filters
Diagram of Security. - define the attributes of Diagram of security that make it possible to evaluate security properties of modeled elements,
Diagram of Security Marek Vysoký Department of Computers and Informatics Faculty of Electrical Engineering and Informatics Technical University of Košice Letná 9, 042 00 Košice, Slovakia [email protected]
Mammography. What is Mammography?
Scan for mobile link. Mammography Mammography is a specific type of breast imaging that uses low-dose x-rays to detect cancer early before women experience symptoms when it is most treatable. Tell your
SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING
AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations
Intrusion Detection via Machine Learning for SCADA System Protection
Intrusion Detection via Machine Learning for SCADA System Protection S.L.P. Yasakethu Department of Computing, University of Surrey, Guildford, GU2 7XH, UK. [email protected] J. Jiang Department
Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit
Statistics in Retail Finance Chapter 7: Fraud Detection in Retail Credit 1 Overview > Detection of fraud remains an important issue in retail credit. Methods similar to scorecard development may be employed,
SOLVING LINEAR SYSTEMS
SOLVING LINEAR SYSTEMS Linear systems Ax = b occur widely in applied mathematics They occur as direct formulations of real world problems; but more often, they occur as a part of the numerical analysis
11 Linear and Quadratic Discriminant Analysis, Logistic Regression, and Partial Least Squares Regression
Frank C Porter and Ilya Narsky: Statistical Analysis Techniques in Particle Physics Chap. c11 2013/9/9 page 221 le-tex 221 11 Linear and Quadratic Discriminant Analysis, Logistic Regression, and Partial
CCNY. BME I5100: Biomedical Signal Processing. Linear Discrimination. Lucas C. Parra Biomedical Engineering Department City College of New York
BME I5100: Biomedical Signal Processing Linear Discrimination Lucas C. Parra Biomedical Engineering Department CCNY 1 Schedule Week 1: Introduction Linear, stationary, normal - the stuff biology is not
Support Vector Machines with Clustering for Training with Very Large Datasets
Support Vector Machines with Clustering for Training with Very Large Datasets Theodoros Evgeniou Technology Management INSEAD Bd de Constance, Fontainebleau 77300, France [email protected] Massimiliano
Low-resolution Character Recognition by Video-based Super-resolution
2009 10th International Conference on Document Analysis and Recognition Low-resolution Character Recognition by Video-based Super-resolution Ataru Ohkura 1, Daisuke Deguchi 1, Tomokazu Takahashi 2, Ichiro
How To Decide If You Should Get A Mammogram
American Medical Women s Association Position Paper on Principals of Breast Cancer Screening Breast cancer affects one woman in eight in the United States and is the most common cancer diagnosed in women
The Artificial Prediction Market
The Artificial Prediction Market Adrian Barbu Department of Statistics Florida State University Joint work with Nathan Lay, Siemens Corporate Research 1 Overview Main Contributions A mathematical theory
GENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS
GENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS Feras Nassaj Johann Christoph Strelen Rheinische Friedrich-Wilhelms-Universitaet Bonn Institut fuer Informatik IV Roemerstr. 164, 53117 Bonn, Germany
Comparing Methods to Identify Defect Reports in a Change Management Database
Comparing Methods to Identify Defect Reports in a Change Management Database Elaine J. Weyuker, Thomas J. Ostrand AT&T Labs - Research 180 Park Avenue Florham Park, NJ 07932 (weyuker,ostrand)@research.att.com
Visualization of large data sets using MDS combined with LVQ.
Visualization of large data sets using MDS combined with LVQ. Antoine Naud and Włodzisław Duch Department of Informatics, Nicholas Copernicus University, Grudziądzka 5, 87-100 Toruń, Poland. www.phys.uni.torun.pl/kmk
Predict Influencers in the Social Network
Predict Influencers in the Social Network Ruishan Liu, Yang Zhao and Liuyu Zhou Email: rliu2, yzhao2, [email protected] Department of Electrical Engineering, Stanford University Abstract Given two persons
Solution of Linear Systems
Chapter 3 Solution of Linear Systems In this chapter we study algorithms for possibly the most commonly occurring problem in scientific computing, the solution of linear systems of equations. We start
Infrared Thermography Not a Useful Breast Cancer Screening Tool
Contact: Jeanne-Marie Phillips Sharon Grutman HealthFlash Marketing The American Society of Breast Surgeons 203-977-3333 877-992-5470 Infrared Thermography Not a Useful Breast Cancer Screening Tool Mammography
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Introduction to Logistic Regression
OpenStax-CNX module: m42090 1 Introduction to Logistic Regression Dan Calderon This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Abstract Gives introduction
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
The CUSUM algorithm a small review. Pierre Granjon
The CUSUM algorithm a small review Pierre Granjon June, 1 Contents 1 The CUSUM algorithm 1.1 Algorithm............................... 1.1.1 The problem......................... 1.1. The different steps......................
MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos [email protected]
Machine Learning for Computer Vision 1 MVA ENS Cachan Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos [email protected] Department of Applied Mathematics Ecole Centrale Paris Galen
Application of Data Mining Techniques in Intrusion Detection
Application of Data Mining Techniques in Intrusion Detection LI Min An Yang Institute of Technology [email protected] Abstract: The article introduced the importance of intrusion detection, as well as
By choosing to view this document, you agree to all provisions of the copyright laws protecting it.
This material is posted here with permission of the IEEE Such permission of the IEEE does not in any way imply IEEE endorsement of any of Helsinki University of Technology's products or services Internal
Introduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
MapReduce Approach to Collective Classification for Networks
MapReduce Approach to Collective Classification for Networks Wojciech Indyk 1, Tomasz Kajdanowicz 1, Przemyslaw Kazienko 1, and Slawomir Plamowski 1 Wroclaw University of Technology, Wroclaw, Poland Faculty
Automatic parameter regulation for a tracking system with an auto-critical function
Automatic parameter regulation for a tracking system with an auto-critical function Daniela Hall INRIA Rhône-Alpes, St. Ismier, France Email: [email protected] Abstract In this article we propose
