How To Analyze Medical Image Data With A Feature Based Approach To Big Data Medical Image Analysis


A Feature-based Approach to Big Data Medical Image Analysis
Matthew Toews, Christian Wachinger, Raul San Jose Estepar, William Wells III
École de Technologie Supérieure, Montreal, Canada
BWH, Harvard Medical School
CSAIL, Massachusetts Institute of Technology
http://www.matthewtoews.com
July 3, 2015

Context
- Big data
  - Massive digital memories, rapid data transmission
  - Large-scale data mining, novel discoveries, ...
- Big medical image data sets
  - E.g. 10K subjects, 20K lung CTs, 3.8 TB
  - Per-subject labels, disease stage, ...
- Can we leverage this data?
  - Computer-assisted diagnosis
  - Image biomarker discovery

Challenge
- Efficient image-to-image correspondence
- E.g. N = 20K lung CT volumes
- Comparing every image with every other image costs O(N^2): intractable
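To make the scale concrete, the short sketch below counts the unordered image pairs for N = 20K. The one-minute cost per pairwise comparison is a hypothetical placeholder chosen only for illustration; it is not a figure from the talk.

```python
def pairwise_comparisons(n: int) -> int:
    """Number of unordered pairs among n images: n * (n - 1) / 2."""
    return n * (n - 1) // 2


n_volumes = 20_000  # "20K lung CT volumes" from the slide
pairs = pairwise_comparisons(n_volumes)
print(f"{pairs:,} image pairs")  # 199,990,000 image pairs

# Hypothetical cost per pairwise comparison (assumption, illustration only).
minutes_per_pair = 1.0
years = pairs * minutes_per_pair / (60 * 24 * 365)
print(f"about {years:.0f} years of compute at {minutes_per_pair} min/pair")
```

Even under this optimistic assumption, exhaustive pairwise matching is out of reach, which motivates the indexed feature lookup discussed in the rest of the talk.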

Most Relevant Prior Work
- Nearest neighbor classification (Cover & Hart 1967)
  - As N → ∞ (big data), the error is upper-bounded by twice the optimal Bayes error
- Scale-invariant feature transform, SIFT (Lowe 2004)
  - Identify and match distinctive keypoints in images
  - Efficient NN correspondence via random KD-trees: O(N log N)
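As a rough illustration of tree-based nearest-neighbor lookup, here is a minimal pure-Python sketch. It uses a single exact KD-tree, not the randomized KD-tree forest the slide refers to, and all names (`Node`, `build`, `nearest`) are hypothetical. Exact KD-trees lose their advantage in high-dimensional descriptor spaces, which is one reason approximate, randomized variants are used in practice.

```python
import math
import random


class Node:
    """One KD-tree node holding a training point and its class label."""

    def __init__(self, point, label, axis, left, right):
        self.point, self.label, self.axis = point, label, axis
        self.left, self.right = left, right


def build(items, depth=0):
    """Build a KD-tree from a list of (point, label) pairs."""
    if not items:
        return None
    axis = depth % len(items[0][0])  # cycle through the dimensions
    items = sorted(items, key=lambda it: it[0][axis])
    mid = len(items) // 2  # median split keeps the tree balanced
    point, label = items[mid]
    return Node(point, label, axis,
                build(items[:mid], depth + 1),
                build(items[mid + 1:], depth + 1))


def nearest(node, query, best=None):
    """Return (distance, point, label) of the nearest neighbor of query."""
    if node is None:
        return best
    d = math.dist(query, node.point)
    if best is None or d < best[0]:
        best = (d, node.point, node.label)
    diff = query[node.axis] - node.point[node.axis]
    near, far = (node.left, node.right) if diff < 0 else (node.right, node.left)
    best = nearest(near, query, best)
    # Visit the far side only if the splitting plane is closer than the best hit.
    if abs(diff) < best[0]:
        best = nearest(far, query, best)
    return best


# Toy data (randomly generated, not from the talk): 200 labeled 3-D points.
random.seed(0)
train = [([random.random() for _ in range(3)], i % 2) for i in range(200)]
tree = build(train)
dist, point, label = nearest(tree, [0.5, 0.5, 0.5])
print(f"1-NN label: {label}, distance: {dist:.3f}")
```

The predicted label of a query is simply the label of its nearest training point, which is the nearest-neighbor rule analysed by Cover & Hart.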

3D SIFT Features
[Figure: lung CT volume with a feature of scale σ]
- Geometry: location, scale, orientation
- Appearance descriptor: gradient orientation histogram, 64 elements, rank-ordering
- Efficient and Robust Model-to-Image Alignment using 3D Scale-Invariant Features. M. Toews, W.M. Wells III, MedIA 2013
- SIFT-Rank: Ordinal Description for Invariant Feature Correspondence. M. Toews, W.M. Wells III, CVPR 2009
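The rank-ordering step mentioned above can be sketched as follows: each histogram element is replaced by its rank among all elements of the descriptor. This is a minimal illustration only; the function name `rank_transform` and the tie-breaking rule (ties resolved by element index) are assumptions, not details taken from the talk.

```python
def rank_transform(descriptor):
    """Replace each descriptor element by its rank (0 = smallest value)."""
    # Indices sorted by value; Python's sort is stable, so ties keep index order.
    order = sorted(range(len(descriptor)), key=lambda k: descriptor[k])
    ranks = [0] * len(descriptor)
    for rank, idx in enumerate(order):
        ranks[idx] = rank
    return ranks


# Toy 4-element histogram (a real descriptor would have 64 elements).
hist = [0.10, 0.70, 0.30, 0.05]
print(rank_transform(hist))  # [1, 3, 2, 0]

# Ranks are unchanged by any strictly increasing transform of the values.
print(rank_transform([v ** 3 for v in hist]) == rank_transform(hist))  # True
```

Because only the ordering of the bins matters, the ranked descriptor is unaffected by monotonic changes in the underlying histogram values.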

3D SIFT Features
- Classifying Alzheimer's disease, discovering image biomarkers
- Modeling infant brain development
- Aligning images: robust, multi-modal, group-wise
- Segmenting organs in full-body CT

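Analysis: Kernel Density Estimation
- Estimate the maximum a-posteriori (MAP) subject label C given the feature descriptor set F = {f_i}:

\[ p(C \mid F) \propto p(C) \prod_i p(f_i \mid C) \]

\[ p(f_i \mid C) \propto \sum_{j \,:\, C_j = C,\; f_j \in \mathrm{KNN}_i} \frac{N}{N_C} \exp\left( -\frac{\lVert f_i - f_j \rVert^2}{\alpha_i^2 + 1} \right) \]

- Adaptive kernel bandwidth: distance to the nearest neighbor (NN):

\[ \alpha_i = \min_j \lVert f_i - f_j \rVert \]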
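The per-feature likelihood on this slide might be computed along the following lines. This is a sketch under stated assumptions: N is read as the total number of training features and N_C as the number labeled C, the kernel is taken to be a Gaussian with denominator α_i² + 1, and the K nearest neighbors are found by brute force rather than with a KD-tree. The function name and the toy data are hypothetical.

```python
import math
from collections import Counter


def class_likelihoods(f_i, train, k=3):
    """Unnormalized p(f_i | C) for every class C seen in the training set.

    train is a list of (descriptor, label) pairs.
    """
    n_total = len(train)                                  # N (assumed meaning)
    n_per_class = Counter(label for _, label in train)   # N_C (assumed meaning)

    # Brute-force K nearest neighbors of f_i, sorted by distance.
    neighbors = sorted(
        ((math.dist(f_i, f_j), c_j) for f_j, c_j in train),
        key=lambda pair: pair[0],
    )[:k]

    alpha = neighbors[0][0]  # adaptive bandwidth: distance to the nearest neighbor

    scores = {c: 0.0 for c in n_per_class}
    for dist, c_j in neighbors:
        weight = n_total / n_per_class[c_j]
        scores[c_j] += weight * math.exp(-dist ** 2 / (alpha ** 2 + 1.0))
    return scores


# Toy 2-D descriptors with two labels (illustrative values only).
train = [([0.0, 0.0], "a"), ([0.0, 1.0], "a"), ([5.0, 5.0], "b"), ([5.0, 6.0], "b")]
print(class_likelihoods([0.1, 0.1], train, k=2))
```

Classes with no training feature among the K nearest neighbors receive a score of zero here, so a practical implementation would need some floor or background term before taking products or logarithms.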

Analysis: Kernel Density Estimation
- On-the-fly parameter estimation
- Lazy learning, easy to incorporate new data
- MAP estimation: for each feature f_i ∈ F:
  1) Identify the KNN correspondence set
  2) Compute p(f_i | C) and update the posterior product

\[ p(C \mid F) \propto p(C) \prod_i p(f_i \mid C) \]

- Cost: O(log N) per feature
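The posterior product in step 2 is usually accumulated in the log domain, since multiplying many small likelihoods underflows quickly. The sketch below assumes the per-feature likelihoods have already been computed; the function name `map_label` and the small `floor` value that guards against log(0) are assumptions introduced for this example.

```python
import math


def map_label(prior, per_feature_likelihoods, floor=1e-12):
    """Return (MAP label, log-posterior per label), up to an additive constant.

    prior: dict label -> p(C)
    per_feature_likelihoods: list of dicts label -> p(f_i | C), one per feature
    floor: hypothetical lower bound that avoids log(0) for unsupported labels
    """
    log_post = {c: math.log(p) for c, p in prior.items()}
    for likelihood in per_feature_likelihoods:
        for c in log_post:
            log_post[c] += math.log(max(likelihood.get(c, 0.0), floor))
    best = max(log_post, key=log_post.get)
    return best, log_post


# Toy values (illustrative only): two labels, three features.
prior = {0: 0.5, 1: 0.5}
likelihoods = [{0: 0.9, 1: 0.1}, {0: 0.6, 1: 0.4}, {0: 0.2, 1: 0.8}]
label, log_post = map_label(prior, likelihoods)
print(label)  # 0
```

Because each feature contributes one additive term, new features or new training data can be folded in without retraining, which matches the lazy-learning point on the slide.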

COPD
- Chronic Obstructive Pulmonary Disease
- Major cause of chronic morbidity and mortality
- COPDGene data: 21 sites, 10K subjects, 20K images, 95M features
- 5-category disease stage labels (GOLD score)
- Regan, Elizabeth A., et al. "Genetic epidemiology of COPD (COPDGene) study design." COPD: Journal of Chronic Obstructive Pulmonary Disease 7.1 (2011)

COPD Classification
- Label C ∈ {0, ..., 4}: GOLD disease stage
- Maximum a-posteriori estimation: C* = argmax_C { p(C | F) }
- < 1 second per image
- State-of-the-art GOLD prediction accuracy
[Figure: GOLD labels vs. predicted GOLD]

COPD
- Distinct phenotypes
Source: Frank H. Netter, MD and Artist

COPD
- Distinct phenotypes: Blue Bloaters, Pink Puffers

COPD
- Phenotype-informative features?
- Musculoskeletal features

Other Aspects
- Same-subject identification
  - Label C = subject ID
  - Perfect identification across breathing state
  - 65 highly similar images identified
  - 20 known duplicate subjects identified via DNA

Other Aspects
- Significant data reduction

Other Aspects
- Feature geometry unused
  - Estimation from appearance descriptors only
  - Subject images are unaligned, bag-of-features
- Software implementation available

References
1) M. Toews, C. Wachinger, R. S. et al. "A Feature-based Approach to Big Data Analysis of Medical Images." Information Processing in Medical Imaging (IPMI), 2015.
2) C. Wachinger, M. Toews, et al. "Keypoint Transfer Segmentation." Information Processing in Medical Imaging (IPMI), 2015.
3) Gill, G., Toews, M. and Beichel, R. R. 2014. "Robust initialization of active shape models for lung segmentation in CT scans: a feature-based atlas approach." International Journal of Biomedical Imaging, p. 479154.
4) Toews, Matthew and Wells III, William M. 2013. "Efficient and robust model-to-image alignment using 3D scale-invariant features." Medical Image Analysis, vol. 17, no. 3, p. 271-282.
5) Toews, Matthew, Wells III, William M. and Zöllei, Lilla. 2012. "A feature-based developmental model of the infant brain in structural MRI." In Medical Image Computing and Computer-Assisted Intervention MICCAI 2012. Lecture Notes in Computer Science, vol. 7511, p. 204-211. Springer Berlin Heidelberg.
6) Toews, Matthew, Wells III, William M., Collins, D. Louis and Arbel, Tal. 2010. "Feature-based morphometry: discovering group-related anatomical patterns." NeuroImage, vol. 49, no. 3, p. 2318-2327.
7) Toews, Matthew and Wells III, William M. 2009. "SIFT-Rank: Ordinal description for invariant feature correspondence." In IEEE Conference on Computer Vision and Pattern Recognition, 2009. CVPR 2009 (Miami, FL, USA, June 20-25, 2009), p. 172-177.
8) Toews, Matthew and Arbel, T. 2007. "A statistical parts-based model of anatomical variability." IEEE Transactions on Medical Imaging, vol. 26, no. 4, p. 497-508.
http://www.matthewtoews.com

Thank You