Environmental Remote Sensing GEOG 2021

Similar documents
Calculation of Minimum Distances. Minimum Distance to Means. Σi i = 1

ENVI Classic Tutorial: Classification Methods

Some elements of photo. interpretation

Remote Sensing and Land Use Classification: Supervised vs. Unsupervised Classification Glen Busch

An Assessment of the Effectiveness of Segmentation Methods on Classification Performance

Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with Panchromatic Textural Features

WATER BODY EXTRACTION FROM MULTI SPECTRAL IMAGE BY SPECTRAL PATTERN ANALYSIS

CROP CLASSIFICATION WITH HYPERSPECTRAL DATA OF THE HYMAP SENSOR USING DIFFERENT FEATURE EXTRACTION TECHNIQUES

Supervised Classification workflow in ENVI 4.8 using WorldView-2 imagery

Image Analysis CHAPTER ANALYSIS PROCEDURES

VCS REDD Methodology Module. Methods for monitoring forest cover changes in REDD project activities

SAMPLE MIDTERM QUESTIONS

How To Cluster

Digital image processing

The Scientific Data Mining Process

Land Use/Land Cover Map of the Central Facility of ARM in the Southern Great Plains Site Using DOE s Multi-Spectral Thermal Imager Satellite Images

A KNOWLEDGE-BASED APPROACH FOR REDUCING CLOUD AND SHADOW ABSTRACT

Review for Introduction to Remote Sensing: Science Concepts and Technology

Data Mining. Cluster Analysis: Advanced Concepts and Algorithms

Norbert Schuff Professor of Radiology VA Medical Center and UCSF

The Role of SPOT Satellite Images in Mapping Air Pollution Caused by Cement Factories

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

Introduction to data mining. Example of remote sensing image analysis

Cafcam: Crisp And Fuzzy Classification Accuracy Measurement Software

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Feature Selection vs. Extraction

BIDM Project. Predicting the contract type for IT/ITES outsourcing contracts

Mapping coastal landscapes in Sri Lanka - Report -

Clustering. Danilo Croce Web Mining & Retrieval a.a. 2015/201 16/03/2016

DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS

Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

Understanding Raster Data

Introduction to Pattern Recognition

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

CLUSTER ANALYSIS FOR SEGMENTATION

Galaxy Morphological Classification

Azure Machine Learning, SQL Data Mining and R

TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:

The Delicate Art of Flower Classification

Data Exploration and Preprocessing. Data Mining and Text Mining (UIC Politecnico di Milano)

: Introduction to Machine Learning Dr. Rita Osadchy

Extraction of Satellite Image using Particle Swarm Optimization

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

Machine Learning using MapReduce

Neural Networks Lesson 5 - Cluster Analysis

Machine Learning: Overview

Introduction to Imagery and Raster Data in ArcGIS

Anomaly Detection and Predictive Maintenance

Objectives. Raster Data Discrete Classes. Spatial Information in Natural Resources FANR Review the raster data model

Topographic Change Detection Using CloudCompare Version 1.0

Medical Information Management & Mining. You Chen Jan,15, 2013 You.chen@vanderbilt.edu

Knowledge Discovery and Data Mining. Bootstrap review. Bagging Important Concepts. Notes. Lecture 19 - Bagging. Tom Kelsey. Notes

Knowledge Discovery and Data Mining. Structured vs. Non-Structured Data

Hyperspectral Data Analysis and Supervised Feature Reduction Via Projection Pursuit

Social Media Mining. Data Mining Essentials

Institute of Natural Resources Departament of General Geology and Land use planning Work with a MAPS

Object-Oriented Approach of Information Extraction from High Resolution Satellite Imagery

Files Used in this Tutorial

Lecture 9: Introduction to Pattern Analysis

Performance Metrics for Graph Mining Tasks

What is GIS? Geographic Information Systems. Introduction to ArcGIS. GIS Maps Contain Layers. What Can You Do With GIS? Layers Can Contain Features

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Applying MapCalc Map Analysis Software

Class #6: Non-linear classification. ML4Bio 2012 February 17 th, 2012 Quaid Morris

Digital Classification and Mapping of Urban Tree Cover: City of Minneapolis

Pixel-based and object-oriented change detection analysis using high-resolution imagery

Data Processing Flow Chart

K-Means Clustering Tutorial

Crater detection with segmentation-based image processing algorithm

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

CI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore.

An Image Classification Sampling Methodology Based on the Integration of IP/GIS Capabilities

RULE INHERITANCE IN OBJECT-BASED IMAGE CLASSIFICATION FOR URBAN LAND COVER MAPPING INTRODUCTION

Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING

Nature Values Screening Using Object-Based Image Analysis of Very High Resolution Remote Sensing Data

Dimensionality Reduction: Principal Components Analysis

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall

Component Ordering in Independent Component Analysis Based on Data Power

Cluster Analysis. Isabel M. Rodrigues. Lisboa, Instituto Superior Técnico

Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus

Determining optimal window size for texture feature extraction methods

Final Project Report

Object Recognition and Template Matching

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Additional sources Compilation of sources:

UNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS

MACHINE LEARNING IN HIGH ENERGY PHYSICS

A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION

Decision Support System Methodology Using a Visual Approach for Cluster Analysis Problems

Application of Face Recognition to Person Matching in Trains

ADVANCED MACHINE LEARNING. Introduction

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

3D Building Roof Extraction From LiDAR Data

Lab #8: Introduction to ENVI (Environment for Visualizing Images) Image Processing

Transcription:

Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification

2

Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class as a surrogate for other information of interest (i.e. assign relevant information/characteristics to a land cover class) 3

Purpose 4

Multispectral image classification Very widely used method of extracting thematic information Use multispectral (and other) information Separate different land cover classes based on spectral response, texture,. i.e. separability in feature space 5

Basis for 'classifying' method: pattern recognition use any/all of the following properties in an image to differentiate between land cover classes: spectral spatial temporal directional [time / distance-resolved (LIDAR)] 6

Image properties Temporal pattern recognition the ability to distinguish based on spectral or spatial considerations may vary over the year use variations in image DN (or derived data) over time to distinguish between different cover types e.g. variations in VI over agricultural crops Directional pattern recognition surface with different structures will tend to give different trends in reflectance as a function of view and illumination angles 7

Image properties Spatial pattern recognition use spatial context to distinguish between different classes e.g. measures of image texture, spatial context of 'objects ' derived from data. Spectral pattern recognition most widely used distinguish between different land cover classes from differences in the spectral reflectance (or more typically, image DN) in different wavebands 8

Raster Data to Classes 1 1 1 1 1 7 7 1 1 1 1 1 1 1 1 1 1 1 2 7 7 7 1 1 1 1 1 1 1 3 1 1 2 2 7 9 9 1 1 1 1 1 1 3 3 2 1 1 2 9 9 9 7 1 1 1 1 1 3 1 2 1 9 9 9 1 1 7 5 3 3 1 3 1 1 2 2 9 9 9 9 3 7 7 3 3 3 1 9 9 9 9 9 Grid cells or pixels Each pixel has a digital number (DN) which represents: a class identifier (here) a color value as with a gray scale or color photo image quantitative value such as land surface elevation LEGEND 1 Deep Water 2 Residential 3 Shallow Water 5 Island 7 Wetland 9 Forest 9

NIR i.e. separate in feature space Use different spectral response of different materials to separate e.g. plot red v NIR DN values. red 10

Approaches to Classification We need some form of automated (rule-based) classification algorithm to allow us to distinguish one surface type from another Supervised Classification Unsupervised Classification 11

2D Feature Space 12

Pixel Clouds in Feature Space 13

Supervised classification Training stage (significant user input/expertise) Identify areas of cover types of interest (map, ground survey, spectral characteristics) in bands of an image CCRS, 1998 & http://portal.survey.ntua.gr/main/labs/rsens/deceti/ntua/examples.htm 14

Supervised classification: training stage Regions of interest (ROIs) delineated by user Spectral information on the cover types is gathered for these areas Training data (subset of whole) These are classes we will place all remaining pixels in according to their DN values Can plot in feature space do we see clusters? Separability : 0-2 in envi 15

Supervised classification: classification stage Need rule(s) to decide into which class we put given pixel e.g. Traditional Hard Classifiers Minimum distance to means (MDM) Parallelepiped Maximum Likelihood 16

Supervised classification Feature space clusters E.g. 2 channels of information Are all clusters separate? 17

Supervised classification: MDM Minimum distance to means (MDM) for each land cover class, calculate the mean vector in feature space (i.e. the mean value in each waveband) Put every pixel into nearest class/cluster define a limit beyond which a pixel remains unclassified 18

Supervised classification: MDM Simple & quick, but what about point 1& 2? Insensitive to variance of clusters Can we improve? Based on proximity to training sites means Use of normalized distances (standard scores) rather than raw distances alleviates this. ( x x) z Unknown classes can be accommodated by setting a maximum distance 19

Supervised classification: parallelepiped ( box ) Assign boundaries around the spread of a class in feature space i.e. take account of variance Typically use minimum/maximum of DN in a particular class to define limits: Giving a rectangle in 2D, Box in 3D (if we have > 2 bands), etc. 20

Supervised classification: parallelepiped ( box ) Based on min and max thresholds on each band from training site data Threshold Definition: - using signature min and max - using standard scores Accounts for variance but the presence of inter-band correlation leads to potential ambiguity in classification (overlaps) Recognizes unknown classes 21

Supervised classification: parallelepiped ( box ) Simple boxes defined by min/max limits of each training class. But overlaps..? so use stepped boxes 22

Supervised classification: Gaussian maximum likelihood Now we use probability rather than distance in feature space Which class is each pixel most likely to belong to?? 23

Supervised classification: Gaussian maximum likelihood Based on evaluation of probability for each pixel of belonging to every category Not based on distance, but on probability Assumes normal distribution in the training sites Accounts for both variance and covariance in the training sites data Assigns the pixel to a category in which it possesses the highest probability Computationally expensive 1. Dashed lines are contours of equal probability 2. Their shape expresses the sensitivity of this classifier 24 to covariance

Supervised classification: Gaussian maximum likelihood A few concepts encountered in our practical ROIs Spectral signatures Output rules for Maximum Likelihood Classifier A series of images of probabilities 25

Supervised classification: Gaussian maximum likelihood 26

Supervised classification: Decision Tree Classify in steps, where the classifier has only to be able to distinguish between two or more classes at each step Can combine various types of classifiers as appropriate using such methods 27

Classification Accuracy How do we tell if classification is any good? Classification error matrix (aka confusion matrix or contingency table) Need truth data sample pixels of known classes How many pixels of KNOWN class X are incorrectly classified as anything other than X (errors of omission)?» So-called Type 2 error, or false negative Divide correctly classified pixels in each class of truth data by COLUMN totals (Producer s Accuracy) How many pixels are incorrectly classified as class X when they should be some other known class (errors of commission)?» So-called Type 1 error, or false positive Divide correctly classified pixels in each class by ROW totals (User s Accuracy) 28

Classification Accuracy Errors of comission for class U Errors of omission for class U 29

Can use original training data to test BUT. this only tells us how well the classifier can classify the training areas Ideally, use an independent set of samples to give a better 'overall' accuracy estimate 30

Unsupervised Classification (clustering) Little input from user required (few assumptions) BUT means results hard to interpret (may not represent classes we recognise) Cluster pixels in feature space based on some measure of their proximity Interpretation of results / assigned classes can be useful, e.g. in picking up variations within what would otherwise be distinguished as a single class e.g. stressed/unstressed crop in a single field) clusters can be of little intrinsic value in themselves e.g. sunlit trees, shaded trees is perhaps not a useful discrimination if one simply wants to classify 'trees', and so clusters may have to be combined 31

Unsupervised Classification: K-means A large number of clustering algorithms exist K-means input number of clusters desired algorithm typically initiated with arbitrarily-located 'seeds' for cluster means each pixel then assigned to closest cluster mean revised mean vectors are then computed for each cluster repeat until some convergence criterion is met (e.g. cluster means don't move between iterations) computationally-expensive because it is iterative 32

Unsupervised classification: ISODATA (Iterative self-organising data analysis) algorithm Same as K-means but now we can vary number of clusters (by splitting / merging) Start with (user-defined number) randomly located clusters Assign each pixel to nearest cluster (mean spectral distance) Re-calculate cluster means and standard deviations If distance between two clusters < some threshold, merge them If standard deviation in any one dimension > some threshold, split into two clusters Delete clusters with small number of pixels Re-assign pixels, re-calculate cluster statistics etc. until changes of clusters < some fixed threshold 33

ISODATA example: 2 classes, 2 bands DN Ch 1 a Pixel 1 Initial cluster means Pixel 2 Assign pixel 1 to cluster a, 2 to b etc. DN Ch 1 Cluster means move towards pixels 1 and 2 respectively a Pixel 1 Pixel 2 b b DN Ch 2 DN Ch 2 DN Ch 1 New positions of cluster means All pixels assigned to a or b - update stats DN Ch 2 SD of cluster a too large? DN Ch 1 New positions of cluster means Split a into 2, recalculate. Repeat. DN Ch 2 34

Hybrid Approaches Useful if large variability in the DN of individual classes Use clustering concepts from unsupervised classification to derive subclasses for individual classes, followed by standard supervised methods. Can apply e.g. K-means algorithm to (test) subareas, to derive class statistics and use the derived clusters to classify the whole scene Requirement that all classes of interest are represented in these test areas Clustering algorithms may not always determine all relevant classes in an image e.g. linear features (roads etc.) may not be picked-up by the textural methods described above 35

Post-classification filtering The result of a classification from RS data can often appear rather 'noisy' Can we aggregate information in some way? Simplest & most common way is majority filtering a kernel is passed over the classification result and the class which occurs most commonly in the kernel is used May not always be appropriate; the particular method for spatial aggregation of categorical data of this sort depends on the particular application to which the data are to be put e.g. successive aggregations will typically lose scattered data of a certain class, but keep tightly-clustered data 36

Post-classification filtering Majority filter 37

What s wrong with pixels? 38

Geographical entities vs. Pixels Pixel size (resolution) <= Target object sizes Object-based Image Analysis (OBIA) in RS Segmentation Post-segmentation Classification 39

Region-growing Segmentation Baatz & Schape (2000) 40

http://www.eecs.berkeley.edu/~lbourdev/hat/ 41