Quantifying Seasonal Variation in Cloud Cover with Predictive Models



Similar documents
SATELLITE IMAGES IN ENVIRONMENTAL DATA PROCESSING

16 th IOCCG Committee annual meeting. Plymouth, UK February mission: Present status and near future

Time Domain and Frequency Domain Techniques For Multi Shaker Time Waveform Replication

Ensemble Data Mining Methods

APPLICATION OF TERRA/ASTER DATA ON AGRICULTURE LAND MAPPING. Genya SAITO*, Naoki ISHITSUKA*, Yoneharu MATANO**, and Masatane KATO***

MODIS IMAGES RESTORATION FOR VNIR BANDS ON FIRE SMOKE AFFECTED AREA

Satellite Remote Sensing of Volcanic Ash

y = Xβ + ε B. Sub-pixel Classification

A comparison of NOAA/AVHRR derived cloud amount with MODIS and surface observation

Using Remote Sensing to Monitor Soil Carbon Sequestration

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Estimating Firn Emissivity, from 1994 to1998, at the Ski Hi Automatic Weather Station on the West Antarctic Ice Sheet Using Passive Microwave Data

Remote Sensing of Clouds from Polarization

Component Ordering in Independent Component Analysis Based on Data Power

Gerard Mc Nulty Systems Optimisation Ltd BA.,B.A.I.,C.Eng.,F.I.E.I

Adaptive HSI Data Processing for Near-Real-time Analysis and Spectral Recovery *

Selecting the appropriate band combination for an RGB image using Landsat imagery

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier

Probability and Random Variables. Generation of random variables (r.v.)

Satellite remote sensing using AVHRR, ATSR, MODIS, METEOSAT, MSG

QUALITY ENGINEERING PROGRAM

2.3 Spatial Resolution, Pixel Size, and Scale

Automotive Applications of 3D Laser Scanning Introduction

MODULATION TRANSFER FUNCTION MEASUREMENT METHOD AND RESULTS FOR THE ORBVIEW-3 HIGH RESOLUTION IMAGING SATELLITE

CROP CLASSIFICATION WITH HYPERSPECTRAL DATA OF THE HYMAP SENSOR USING DIFFERENT FEATURE EXTRACTION TECHNIQUES

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Blind Deconvolution of Barcodes via Dictionary Analysis and Wiener Filter of Barcode Subsections

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

GOES-R AWG Cloud Team: ABI Cloud Height

The Scientific Data Mining Process

Efficient implementation of Full Matrix Capture in real time ultrasonic damage imaging

Predictive Modeling and Big Data

REQUIREMENTS TRACEABILITY

Environmental Remote Sensing GEOG 2021

Performance Evaluation of Artificial Neural. Networks for Spatial Data Analysis

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

A remote sensing instrument collects information about an object or phenomenon within the

APPM4720/5720: Fast algorithms for big data. Gunnar Martinsson The University of Colorado at Boulder

Dimensionality Reduction: Principal Components Analysis

The Effect of Droplet Size Distribution on the Determination of Cloud Droplet Effective Radius

Digital image processing

Machine Learning and Data Mining. Regression Problem. (adapted from) Prof. Alexander Ihler

STA 4273H: Statistical Machine Learning

Passive Remote Sensing of Clouds from Airborne Platforms

Knowledge Discovery from patents using KMX Text Analytics

Numerical Analysis An Introduction

Advances in Cloud Imager Remote Sensing

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.

Accurate and robust image superresolution by neural processing of local image representations

Sky Monitoring Techniques using Thermal Infrared Sensors. sabino piazzolla Optical Communications Group JPL

4F7 Adaptive Filters (and Spectrum Estimation) Least Mean Square (LMS) Algorithm Sumeetpal Singh Engineering Department sss40@eng.cam.ac.

Statistical Analysis. NBAF-B Metabolomics Masterclass. Mark Viant

Master of Mathematical Finance: Course Descriptions

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques Page 1 of 11. EduPristine CMA - Part I

Least Squares Estimation

Java Modules for Time Series Analysis

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

EVALUATING SOLAR ENERGY PLANTS TO SUPPORT INVESTMENT DECISIONS

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell

Cross Validation. Dr. Thomas Jensen Expedia.com

PIXEL-LEVEL IMAGE FUSION USING BROVEY TRANSFORME AND WAVELET TRANSFORM

RESOLUTION MERGE OF 1: SCALE AERIAL PHOTOGRAPHS WITH LANDSAT 7 ETM IMAGERY

Cloud detection and clearing for the MOPITT instrument

DISCOVERING SYSTEM HEALTH ANOMALIES USING DATA MINING TECHNIQUES 1

The Effect of Network Cabling on Bit Error Rate Performance. By Paul Kish NORDX/CDT

OPTIMIZATION OF VENTILATION SYSTEMS IN OFFICE ENVIRONMENT, PART II: RESULTS AND DISCUSSIONS

Artificial Neural Network and Non-Linear Regression: A Comparative Study

Machine Learning and Data Mining. Fundamentals, robotics, recognition

The Applied and Computational Mathematics (ACM) Program at The Johns Hopkins University (JHU) is

A Partially Supervised Metric Multidimensional Scaling Algorithm for Textual Data Visualization

Linear Classification. Volker Tresp Summer 2015

TerraColor White Paper

D-optimal plans in observational studies

Multisensor Data Fusion and Applications

Analysis of Wing Leading Edge Data. Upender K. Kaul Intelligent Systems Division NASA Ames Research Center Moffett Field, CA 94035

Assay Development and Method Validation Essentials

REDUCING UNCERTAINTY IN SOLAR ENERGY ESTIMATES

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague.

Support Vector Machines with Clustering for Training with Very Large Datasets

Scanners and How to Use Them

Solar Irradiance Forecasting Using Multi-layer Cloud Tracking and Numerical Weather Prediction

Step-by-Step Analytical Methods Validation and Protocol in the Quality System Compliance Industry

A Wavelet Based Prediction Method for Time Series

VALIDATION OF ANALYTICAL PROCEDURES: TEXT AND METHODOLOGY Q2(R1)

Statistical Modeling by Wavelets

CONTROL, COMMUNICATION & SIGNAL PROCESSING (CCSP)

LANDSAT 8 Level 1 Product Performance

SVM Ensemble Model for Investment Prediction

Transcription:

Quantifying Seasonal Variation in Cloud Cover with Predictive Models Ashok N. Srivastava, Ph.D. ashok@email.arc.nasa.gov Deputy Area Lead, Discovery and Systems Health Group Leader, Intelligent Data Understanding NASA Ames Research Center Joint Work with Rama Nemani, Ph.D. (NASA Ames) Other Collaborators: Nikunj Oza, Ph.D., Mike Way, Ph.D., Jeff Scargle, Ph.D. (NASA Ames), Julienne Stroeve, Ph.D. (NSIDC, University of Colorado)

Outline of Talk Motivating problem* General Virtual Sensors problem Results Related accomplishments Summary * Nearly all of this work was presented at the 2005 AGU conference. 2

Motivating Problem Many remote sensing problems data analysis problems can be broken down into two components: (1) Seasonal variation (2) Variation induced by the model class The purpose of this study is to develop algorithms that help us identify these variations separately. It is necessary to understand this variation in order to develop stable models that can predict energy in one spectral band based on the energy in other spectral bands. We address these problems through the development of a Virtual Sensor. 3

Emulating Sensor Signals back in Time for Cloud Trending MODIS 1.6 micron Channel 6 (Black) MODIS Spectral Measurements AVHRR/2 Spectral Measurements No Black Signal for AVHRR/2. Need to emulate AVHRR/2 cloud signal 1980 1985 1990 1995 2000 2005 year 4

Cloud Detection back in Time Solution: Predict 1.6µm channel using a Virtual Sensor MODIS Spectral Measurements MODIS 1.6 micron Signal (Black) AVHRR/2 Spectral Measurements Black Signal for AVHRR/2 estimated by a Virtual Sensor. 5 1980 1985 1990 1995 2000 2005 year

Multi Resolution Analysis The Virtual Sensors concept can also be used to deal with multi-resolution analysis: MODIS Channels 1 & 2: 250 m resolution MODIS Channel 6: 500 m resolution. Note: Channel 6 (at 1.6 microns) is not available at the 250 m resolution. 6

Virtual Sensors Approach Given: MODIS channels 1, 2, 20, 31, 32 correspond to five AVHRR/2 channels Develop: Model MODIS channel 6 (1.6µm) as a function of five MODIS channels Apply: Use function to construct estimate of 1.6µm channel for AVHRR/2 MODIS 1,2,20,31,32 model MODIS 6 Model Construction AVHRR 1,2,3,4,5 model AVHRR 6 7 Model Application

Virtual Sensors Virtual Sensors predict the historical record for spectral measurements using relationships found from existing sensors and inputs from historical record. Useful for simulating sensors back in time or multi resolution analyses. Accuracy of learned models for MODIS data: 70%-90% (over 2 weeks) Channel 6 MLP prediction 8

Model Classes Linear Models: Least squares regression (used as a very simple baseline) Nonlinear Models: Neural nets (used as a simple baseline) Gaussian Processes & Kernel Methods Original Data without Kernel Mapped Data using Kernel F(x) Kernel Map 9 Data in original space: highly complex decision boundaries. Data in high dimensional feature space can yield simple decision boundaries.

10 Linear Correlation Matrix for MODIS Channels over Fresno CA in 2005

Correlation Matrix over Greenland, Year 2004 Linear Correlation Matrix MODIS Channel # 2 4 6 1 2 3 4 5 6 7 MODIS Channel # Mutual Information Matrix Correlation Matrices Changes with location MODIS Channel # 2 4 6 11 1 2 3 4 5 6 7 MODIS Channel #

12 Seasonal Variation for 5 input channels

13 Seasonal Variation for 2 input channels

Seasonal Variation for 2 input channels Black Diamonds: Linear Model 14

15 Multi-Resolution Predictions

16 Distributions of Multiresolution Predictions

Conclusions The model class significantly affects the model s stability: Linear Models: Produce higher overall error and are less robust to seasonal variation. Nonlinear Models: Produce lower overall error and are more robust with respect to seasonal variations. These results support the idea that a Virtual Sensor can be used to characterize sensor measurements through time or at different resolutions. Can be used to reduce processing times significantly for some applications. 17