An introduction to OBJECTIVE ASSESSMENT OF IMAGE QUALITY. Harrison H. Barrett University of Arizona Tucson, AZ



Similar documents
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

STA 4273H: Statistical Machine Learning

Exploiting A Constellation of Narrowband RF Sensors to Detect and Track Moving Targets

MACHINE LEARNING IN HIGH ENERGY PHYSICS

CCNY. BME I5100: Biomedical Signal Processing. Linear Discrimination. Lucas C. Parra Biomedical Engineering Department City College of New York

Advanced Signal Processing and Digital Noise Reduction

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall

COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS

Lecture 8: Signal Detection and Noise Assumption

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Kristine L. Bell and Harry L. Van Trees. Center of Excellence in C 3 I George Mason University Fairfax, VA , USA kbell@gmu.edu, hlv@gmu.

Data Mining - Evaluation of Classifiers

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Analecta Vol. 8, No. 2 ISSN

Algorithms for the resizing of binary and grayscale images using a logical transform

Data, Measurements, Features

Linear Classification. Volker Tresp Summer 2015

Statistics Graduate Courses

Computational Optical Imaging - Optique Numerique. -- Deconvolution --

The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications

Component Ordering in Independent Component Analysis Based on Data Power

Cross Validation. Dr. Thomas Jensen Expedia.com

Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

Applications to Data Smoothing and Image Processing I

Classification Problems

Signal Detection. Outline. Detection Theory. Example Applications of Detection Theory

Image Compression through DCT and Huffman Coding Technique

2DI36 Statistics. 2DI36 Part II (Chapter 7 of MR)

Methodologies for Evaluation of Standalone CAD System Performance

Logistic Regression. Vibhav Gogate The University of Texas at Dallas. Some Slides from Carlos Guestrin, Luke Zettlemoyer and Dan Weld.

VARIANCE REDUCTION TECHNIQUES FOR IMPLICIT MONTE CARLO SIMULATIONS

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University

Environmental Remote Sensing GEOG 2021

Measuring Line Edge Roughness: Fluctuations in Uncertainty

The Fundamentals of MTF, Wiener Spectra, and DQE. Motivation

11 Linear and Quadratic Discriminant Analysis, Logistic Regression, and Partial Least Squares Regression

Chapter 6. The stacking ensemble approach

Detection Sensitivity and Response Bias

Parallel Data Selection Based on Neurodynamic Optimization in the Era of Big Data

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS

Predict Influencers in the Social Network

Norbert Schuff Professor of Radiology VA Medical Center and UCSF

Image Compression and Decompression using Adaptive Interpolation

Object Recognition and Template Matching

Lecture 14. Point Spread Function (PSF)

Assessment of Camera Phone Distortion and Implications for Watermarking

Another Look at Sensitivity of Bayesian Networks to Imprecise Probabilities

The Scientific Data Mining Process

Java Modules for Time Series Analysis

Question 2 Naïve Bayes (16 points)

Determining optimal window size for texture feature extraction methods

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

PHASE ESTIMATION ALGORITHM FOR FREQUENCY HOPPED BINARY PSK AND DPSK WAVEFORMS WITH SMALL NUMBER OF REFERENCE SYMBOLS

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.

Monte Carlo testing with Big Data

A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions

Time Domain and Frequency Domain Techniques For Multi Shaker Time Waveform Replication

Communication on the Grassmann Manifold: A Geometric Approach to the Noncoherent Multiple-Antenna Channel

Variations of Statistical Models

Automated Stellar Classification for Large Surveys with EKF and RBF Neural Networks

1. Classification problems

APPM4720/5720: Fast algorithms for big data. Gunnar Martinsson The University of Colorado at Boulder

Supporting Online Material for

MUSICAL INSTRUMENT FAMILY CLASSIFICATION

Nonlinear Iterative Partial Least Squares Method

CHAPTER VII CONCLUSIONS

Supervised Learning (Big Data Analytics)

Low-resolution Image Processing based on FPGA

The Wondrous World of fmri statistics

High Quality Image Magnification using Cross-Scale Self-Similarity

Machine Learning Final Project Spam Filtering

MATHEMATICAL METHODS OF STATISTICS

Neovision2 Performance Evaluation Protocol

Ericsson T18s Voice Dialing Simulator

Probabilistic Methods for Time-Series Analysis

Health Care and Life Sciences

1816 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 7, JULY Principal Components Null Space Analysis for Image and Video Classification

SAS Software to Fit the Generalized Linear Model

Bootstrapping Big Data

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Imputing Missing Data using SAS

SAS Certificate Applied Statistics and SAS Programming

Method for detecting software anomalies based on recurrence plot analysis

Stability of the LMS Adaptive Filter by Means of a State Equation

New Work Item for ISO Predictive Analytics (Initial Notes and Thoughts) Introduction

RF Network Analyzer Basics

What mathematical optimization can, and cannot, do for biologists. Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University, NL

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Performance Metrics for Graph Mining Tasks

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

A secure face tracking system

Part 4 fitting with energy loss and multiple scattering non gaussian uncertainties outliers

APPLICATION OF DATA MINING TECHNIQUES FOR DIRECT MARKETING. Anatoli Nachev

PTE505: Inverse Modeling for Subsurface Flow Data Integration (3 Units)

Subspace Analysis and Optimization for AAM Based Face Alignment

CS Introduction to Data Mining Instructor: Abdullah Mueen

Software and Hardware Solutions for Accurate Data and Profitable Operations. Miguel J. Donald J. Chmielewski Contributor. DuyQuang Nguyen Tanth

Transcription:

An introduction to OBJECTIVE ASSESSMENT OF IMAGE QUALITY Harrison H. Barrett University of Arizona Tucson, AZ

Outline! Approaches to image quality! Why not fidelity?! Basic premises of the task-based approach! Classification tasks Figures of merit Observers Computational issues Examples from current research! Estimation tasks Figures of merit Estimation methods! Summary, challenges for the future

Approaches from the literature Preference Potter Stewart s method Structured preference (panels, check lists) Fidelity measures The best image is the one that looks the most like the object (By what measure?) Perceptual difference measures Information content The best image is the one that gives the most information (Information about what? To whom?)

It s obvious...

... isn t it? (Hint: The good one is always on the right!)

Is fidelity a virtue? If we think the best image is the one that looks most like the object, then......mean-square error is a good figure of merit Problem: Objects are functions Digital images are discrete vectors How do you compare them?

Three ways to make object and image commensurable: Discretize the object Interpolate the image to a function Ignore the problem and do a simulation

Three ways to define mean-square error: Spatial average for one image object image 2 Average over noise realizations object image 2 noise Average over noise realizations and object class 2 object image noise object

3 ways of making object and image commensurable 3 ways of defining squared error = 9 definitions of MSE AND THEY GIVE VERY DIFFERENT ANSWERS

OTHER PROBLEMS WITH FIDELITY METRICS Sensitive to small changes in scale or orientation Sensitive to trivial grey-level mappings Insensitive to tradeoffs between noise and blur Very different images can have same MSE

Five Lenas, one MSE! Z. Wang, A. C. Bovik and L. Lu, IEEE International Conference on Acoustics, Speech, & Signal Processing, May 2002.

The actual object Lena Soderberg

But the biggest problem with MSE is that...... it is independent of what you want to do with the image

BASIC PREMISE OF OBJECTIVE ASSESSMENT A definition of image quality must specify: The task What information do you want from the image? The observer How will you extract the information? Object and image statistics What set of objects are you looking at? What is the measurement noise?

TASKS CLASSIFICATION (outcome is a label for each image) Signal detection Discrimination Pattern recognition ESTIMATION (outcome is one or more numbers) Absolute brightness, changes in brightness Integral over a region of interest Mensuration Localization Image reconstruction?

IMPORTANCE OF OBJECT AND IMAGE STATISTICS IN SIGNAL DETECTION Signal detection performance degrades with more Poisson noise (top to bottom) or more structure in object background (left to right). Work of Jannick Rolland

OBSERVERS FOR CLASSIFICATION TASKS HUMANS IDEAL (BAYESIAN) OBSERVER IDEAL LINEAR OBSERVER (HOTELLING OBSERVER) MODEL OBSERVERS THAT PREDICT HUMAN PERFORMANCE (ANTHROPOMORPHIC)

Binary (two-alternative) classification tasks Example: signal detection True-positive fraction (TPF) = probability of detection False-positive fraction (FPF) = false-alarm rate Key points: There is always some decision threshold for detection there is always a tradeoff between TPF and FPF

The receiver operating characteristic (ROC) Different points on the ROC curve are generated by applying different decision thresholds to some test statistic Ideal system (for a binary detection task): True-positive fraction = 1 False-positive fraction = 0 Area under the curve (AUC) = 1 Worthless system: AUC = 0.5 AUC is a figure of merit for detection

USING AUC TO COMPARE SYSTEMS System A is better than B, and B is better than C (for this particular detection task and observer)

HUMAN OBSERVERS Performance on binary classification tasks can be measured by psychophysical experiments Results can be plotted as ROC curves Area under the ROC curve can be used as FOM

PROBLEMS WITH HUMAN OBSERVERS Psychophysical studies are time-consuming and difficult to control Many observers and images are required for good statistical accuracy Tests of statistical significance are controversial ALTERNATIVE: Mathematical model observers Ideal Hotelling Anthropomorphic

One that: WHAT IS THE IDEAL OBSERVER? Maximizes area under the ROC curve Maximizes true-positive fraction at any specified false-positive fraction Minimizes Bayes risk Neat result: By any of these criteria, the test statistic is always the likelihood ratio

THE LIKELIHOOD RATIO H 0 = null hypothesis (e.g., signal absent) H = alternative hypothesis (e.g., signal present) 1 Points on ROC curve for ideal observer are generated by comparing (g) to different thresholds

DIFFICULTIES WITH THE IDEAL OBSERVER Requires immense amount of prior knowledge (huge-dimensional PDFs) Usually a nonlinear function of the image data Difficult to compute except for a few textbook cases Not obviously related to humans or other practical observers

NEW COMPUTATIONAL TOOLS FOR IDEAL OBSERVER Likelihood-generating function All statistical properties of LR in one scalar function! Realistic but tractable object models Analytic forms for object statistics Characteristic functionals Propagate object statistics through noisy, low-resolution imaging systems Markov-chain Monte Carlo Dimensionality reduction with minimal information loss Channelized ideal observer

STATISTICAL MODELS FOR RANDOM OBJECTS These simulations look very much like real medical images, but their complete statistics are known analytically to all orders! Question: Can we do the same for astronomy?

THE HOTELLING OBSERVER Harold Hotelling Has full knowledge of the mean vectors and covariance matrices Uses that knowledge to compute an optimum linear discriminant t(g) = wg t Maximizes a detectability measure called the Hotelling trace or Hotelling SNR Also maximizes AUC if t(g) is normally distributed

COMPUTATIONAL ISSUES Good news Hotelling doesn t need image PDFs Bad news It does need image covariance matrix Worse news You gotta invert it!

Hotelling observer, signal-known-exactly (SKE) task Basic equations boil down to: t(g) = w t g w = K -1 s 2 t -1 SNR = s K s s = signal vector K = covariance matrix

Basic problem at a glance: SNR = sk 2 t Example from tomography: -1 s 128 128 detector, 64 projection angles 6 s has 10 elements 6 6 K is a 10 10 matrix 12 K has 10 elements!

MEGALOPINAKOPHOBIA Megalopinakophobia (m g -l p n -k f b - ) n. 1. Fear of large matrices. 2. Abnormality characterized by excessive fondness for Fourier methods. 3. Fear of large paintings [< Gk. megas, great + pinakas, matrix or painting + phobos, fear ] (Courtesy of Georgios Kastis)

WHAT TO DO INSTEAD OF HUGE MATRIX INVERSIONS: Use decomposition of the covariance: K = K + K noise obj Use simulated noise-free images to estimate K obj Solve for Hotelling template iteratively Fiete et al., J. Opt. Soc. Am., 1987 Use matrix-inversion lemma Reduces N N problem to J J, where N = number of pixels, J = number of image samples Use Neumann series for K -1 Reduce dimensionality by judicious feature extraction Barrett et al., Proc. SPIE 3340, 1998

ANTHROPOMORPHIC MODEL OBSERVERS (MODELS THAT PREDICT HUMAN PERFORMANCE) USUALLY LINEAR OBSERVERS INCORPORATE SPATIAL-FREQUENCY CHANNELS David Hubel Torsten Wiesel REQUIRE INTERNAL NOISE TO ACCOUNT FOR HUMAN UNCERTAINTIES OTHERWISE IDEAL!

Why do we use anthropomorphic models? Basic perceptual studies Compare psychophysical studies with model, gain insights into human observers Objective assessment of imaging hardware Computationally simpler than ideal or Hotelling Faster and more accurate than psychophysical Assessment of processing algorithms and displays Ideal or Hotelling observer gives no information about algorithms Purpose of algorithm or display is to match the raw data to the characteristics of the human observer

VALIDATION OF HUMAN OBSERVER MODELS Artificial backgrounds Phantoms Real clinical images Procedure: Generate many images Estimate low-dimensional covariance on channel outputs Compute Hotelling SNR Compare to psychophysical studies

Abbey et al., J. Opt. Soc. Am. A, 2001 Regularization study (Variation of filter cutoff) Exposure-time study (Trades off Poisson and anatomical noise) Anatomical-slope study (Varies fractal dimension of anatomical background) Model observer: Channelized Hotelling Observer (CHO) with and without internal noise (Myers et al., JOSA A, 1987)

ESTIMATION TASKS f(r) = object distribution {f(r)} = parameter of interest g = data vector g = {f(r)} + n f(r) = f meas(r) + f null(r) where f null(r) = 0 {f(r)}is estimable if and only if {f(r)} = {f meas(r)} For estimable parameters, can define bias, variance, MSE Pixel values are almost never estimable

MAXIMUM-LIKELIHOOD ESTIMATES For an estimable parameter, can define bias, variance, MSE Cramer-Rao bound defines minimum achievable variance Estimator that achieves this bound is efficient ML estimate is: Asymptotically unbiased Asymptotically efficient Efficient if an efficient estimator exists But ML estimate uses no prior information

FIGURES OF MERIT FOR ESTIMATION # MSE of ML estimator # Cramer-Rao bound # Fisher information # Bayes risk

CONCLUSIONS Logical definitions of image quality must specify the task and observer as well as object and image statistics Anthropomorphic or real human observers should be used to evaluate algorithms and displays Ideal or ideal-linear observers should be used to evaluate imaging hardware on detection tasks Estimation task performance meaningful only for estimable params. Computation of observer performance is challenging, but methodology is developing rapidly Realism in system and object modeling is paramount The ultimate goal is full system optimization