DIVER (Data Integration Visualization Exploration and Reporting)



Similar documents
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

The Application of Synthetic Aperture Radar (SAR) to Natural Resource Damage Assessment

GeoKettle: A powerful open source spatial ETL tool

Introduction to Natural Resource Damage Assessment NRDA

Introduction to Natural Resource Damage Assessment

Sextant. Spatial Data Infrastructure for Marine Environment. C. Satra Le Bris, E. Quimbert, M. Treguer

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros

Overview of Atlantic Offshore Renewable Energy Studies Program. Brian Hooker Office of Renewable Energy Programs

Lecture 8. Online GIS

There are various ways to find data using the Hennepin County GIS Open Data site:

The Integration of Hydrographic and Oceanographic Data in a Marine Geographic Information System U.S. Hydro 2015

Natural Resource Damage Assessment and Restoration

The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Expansion of metadata management, visualization and data processing functionality of OBIS-SEAMAP for passive acoustic monitoring data

Rakesh Tej Kumar Kalahasthi and Benson Hilbert SAP BI Practice, Bangalore, India

Deepwater Horizon Oil Spill: FWC s Response with a focus on wildlife

Deepwater Horizon Oil Spill Phase I Early Restoration Plan and Environmental Assessment

Florida Institute of Oceanography

Status of Restoration in Mississippi

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Open Source Business Intelligence Intro

Overview of the Division of Water Restoration Assistance

NCDC Strategic Vision

Data Integration Checklist

Business Intelligence for Big Data

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

STATEMENT OF WORK. NETL Cooperative Agreement DE-FC26-02NT41476

Geospatial Data Stewardship at an Interdisciplinary Data Center

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Three Open Blueprints For Big Data Success

ArcGIS. Server. A Complete and Integrated Server GIS

In ediscovery and Litigation Support Repositories MPeterson, June 2009

Big Data Spatial Analytics An Introduction

CHAPTER 13: PUBLIC COMMENT ON THE DRAFT PHASE III ERP/PEIS AND RESPONSES Introduction Organization of this Chapter

Harvard Data Visualization Project

Virginia Commonwealth University Rice Rivers Center Data Management Plan

The True Cost of the BP Oil Spill for People, Communities, and the Environment

IBM BigInsights for Apache Hadoop

Environment Canada Data Management Program. Paul Paciorek Corporate Services Branch May 7, 2014

Conservation Workshop ArcGIS Explorer

3. Provide the capacity to analyse and report on priority business questions within the scope of the master datasets;

Build a Streamlined Data Refinery. An enterprise solution for blended data that is governed, analytics-ready, and on-demand

Developing Business Intelligence and Data Visualization Applications with Web Maps

6 Steps to Faster Data Blending Using Your Data Warehouse

Big Data Solutions. Portal Development with MongoDB and Liferay. Solutions

How To Write An Nccwsc/Csc Data Management Plan

Oracle Architecture, Concepts & Facilities

AN INTEGRATED SOLUTION FOR MANAGING EXPLORATION DATA

Pentaho BI Capability Profile

Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days

OMAO Data Management Roadmap

Data Governance for Regulated Industries

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

An Introduction to Open Source Geospatial Tools

Integrated Information Management System, Development of Web Interface, a.k.a. Online Data Portal (ODP)

Implementing a Data Warehouse with Microsoft SQL Server

Final Report - HydrometDB Belize s Climatic Database Management System. Executive Summary

Investigating Hadoop for Large Spatiotemporal Processing Tasks

2010 Oracle Corporation 1

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

The Enterprise Data Hub and The Modern Information Architecture

Extracting and Preparing Metadata to Make Video Files Searchable

Making MAGIC with Your Data: Interactive Maps, Map Mashups, and Data Visualization Tools

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg

CrossPoint for Managed Collaboration and Data Quality Analytics

Apache Hadoop: The Big Data Refinery

Ganzheitliches Datenmanagement

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

Guidelines on Information Deliverables for Research Projects in Grand Canyon National Park

North Highland Data and Analytics. Data Governance Considerations for Big Data Analytics

Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps

Global Earth Observation Integrated Data Environment (GEO-IDE) Presentation to the Data Archiving and Access Requirements Working Group (DAARWG)

Enterprise Data Quality Dashboards and Alerts: Holistic Data Quality

GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING

desert conservation program Data Management Guidelines

Oracle BI 11g R1: Create Analyses and Dashboards

Geographic Information Systems GIS at UCSD. Here to help you explore our world

Transcription:

DIVER (Data Integration Visualization Exploration and Reporting) Data Warehouse and Query Tools For the Deepwater Horizon Natural Resource Damage Assessment Data and Beyond Jay Coady I.M Systems Group Ben Shorr Spatial Data Branch Assessment & Restoration Division NOAA National Ocean Service Office of Response and Restoration 4/24/2015

How to effectively manage unprecedented amounts of environmental data and analysis? Leverage big data techniques Data warehouse and information portal Ingest, integrate and organize information. Business Intelligence Question Environmental Intelligence 2

Presentation Overview Background on NRDA and Data Sources Variations in data sources with the need to bring together across the NRDA case Data Warehouse Solution Flexible/scalable framework; data models and standards; related information/data DIVER Explorer (Data Query and Delivery) Query, reporting and export tools- supporting scientific analysis and reports for the Damage Assessment case 3

Natural Resource Damage Assessment (NRDA) 1) Preliminary Assessment (exposure assessment) 2) Injury Assessment/Restoration Planning Field Studies Data Evaluation Modeling Injury Quantification 3) Restoration Implementation

Marsh Assessment Shoreline Data Toxicity Data Oyster Collections Water Column Telemetry Data Seafood Safety Marine Mammal & Turtle Assessment

How did we get here? Vast amount of NRDA and Response data collected under different authorities, different formats, different destinations and management We (NOAA OR&R and partners) were part of key NRDA and Response data streams early and created: On-line repositories including File Collections Secure FTP (File Transfer Protocol) Site National Oceanographic Data Center (NODC) Archive

File Collections (aka NOAANRDA website)

Signal to Noise 1.5 years into NRDA case Priority of Measure Implementation Preliminary Measures and Dimensions; Priority of "Questions to Answer"; Data Sources that can be used to Answer Questions Dimensions (Ways to slice the Question) Measures (The Question to Answer) Time Spatial Depth Sample Type Habitat Site Study Workplan/Method Instrument Type Oiling Species Hypothesis Status Lab Current Data Warehouse Pulling Data From: 1 Contaminant Lab Results X X X X X X X X X X X X X Validated EDD, QM 2 Observation Data X X X X X X X X X X X X nn.org 3 Additional Lab Data Results X X X X X X X X X X X X X nn.org, 50+ labs 4 Response Activities Count and Duration X X X X X X X Spatial Data Team 5 Species Count X X X X X X X X X Observation (nn.org), Telemetry 6 Instrument Results X X X X X X X X X X X TBD: - Photographs (Photologger) - Video Clips (TBD-"Kaltura?") - Acoustic Clips (TBD-"Kaltura?") Multiple: NODC, Source, Database per Instrument (Currently does not exist)

Data Warehouse Approach Ingest Data Bring in data from different sources; flexible and scalable Adopt or adapt existing standards; develop and document new standards Manage structured and unstructured data/information Litigation quality Documented processes Relate Information Examples: samples and observations; field data and photographs 9

Common Data Model Examples (schemas) Samples: chemistry (QM), biological, more Oceanographic: cruise-collected sensor data Observations: shoreline, marsh, birds and mammals Telemetry: location tracking devices Photographs: keywords, location Restoration data: potential and implemented projects 10

Data Warehouse and Standardization Collate Source Data Apply Business Intelligence / ETL * Methods DIVER Data Warehouse Data Integration DIVER Explorer Visualization, Exploration, and Reporting Samples Samples Oceanographic Ocean Data Steps include: 1. Define the common model Telemetry Observations DIVER S COMMON DATA MODELS Restoration Photos 2. Accommodate additional data 3. STANDARDIZE Related Information Observations 4. Incorporate QA/QC, Validation and Auditing Visualization (ERMA, GIS) Export Photos *Extract-Transform-Load 11 Data for analysis Reports Technical Memos Publications Litigation Distribution

Data Integration Visualization Exporting and Reporting: DIVER Explorer Application Queries: Guided, Custom & Saved Download Data Packages Map & Legend Query by Shape Data Summary Data Tables Charts Photos Metadata Study Notes Export 12

DIVER Explorer: Guided Queries 13

Data Summary Data Table Charts Metadata Study Notes Export

DIVER Explorer: Dashboard Approach Data Summary

DIVER Explorer: Dashboard Approach Data Table

DIVER Explorer: Dashboard Approach Data Table

DIVER Explorer: Dashboard Approach

DIVER Explorer: Dashboard Approach

DIVER Explorer: Dashboard Approach

DIVER Explorer: Dashboard Approach

DIVER Explorer: Export

DIVER Explorer: Export

DIVER Explorer: Export

DIVER Explorer: Query By Shape Draw and Edit; Buffers; Standard Query Shapes

Export Packages: DIVER Explorer Exports Includes full FGDC Metadata (Federal Geographic Data Committee) Spreadsheet; Shapefile (GIS); KML (Google Earth) Electronic field data (spreadsheets) Automated output of updated data to: Gulf Spill Restoration http://www.gulfspillrestoration.noaa.gov/ and ERMA Gulf Response: http://gomex.erma.noaa.gov/erma.html

Public DIVER for Deepwater Horizon

DIVER Strategy for Data Management and Public DWH DIVER site Query Tools Make validated data (and approach) accessible to scientists, academia and public audience National DIVER OR&R developing public Regional DIVER sites, Contaminant Chemistry, Photos, Restoration, Response & Restoration data models Developing field data collection capability based on DWH techniques and tools

15 minutes goes quickly when you re talking big data! Technical Details: Amazon AWS; FedRAMP; NIST 800-53 security standards compliant Liferay Portal Pentaho Data Integration tools PostgreSQL/PostGIS Infobright (Hadoop integration) Mapserver/OpenLayers Dojo Toolkit Javascript library Custom Java API and query engine Agile development approach: (data management and tool development) Senior Team (and co-authors): Dr. Amy Merten (Spatial Data Branch Chief) Ben Shorr (Spatial Data Branch) Jay Coady (I.M Systems Group Spatial Data Branch) Dan Hudgens (IEc Inc.) Neal Etre (IEc, Inc.) Jim Anderton (Solea Consulting) Jerry Bower (Sirius Computer Solutions)