Seeing the Big Picture

Seeing the Big Picture: Sensing, Linking, Analyzing and Visualizing Big Data
Dr. Paul Janecek

- Introduction: Seeing Details and Context in Big Data
- Example: Real-Time Monitoring
- Sensing: In-Situ Data
- Linking: IOOS
- Analyzing: Pattern Recognition & Analysis
- Visualizing: Data Portal
- Challenges

Detecting Hazardous Algal Blooms (HAB)
- In-Situ Sensors: Environmental Sample Processor (ESP)
- Networked Ocean Data: Integrated Ocean Observing System (IOOS), Ocean Observatories Initiative (OOI)
- Data Visualization Portal: Spyglass Data Portal, developed by Think Blue Data, Thailand
- Impact: detection time reduced from 3-5 days to 3.5 hours

- Variety: structured and unstructured data (time series, log files, images)
- Volume: massive historical archives of open data
- Velocity: real-time and near-real-time data streams

ESP: an Underwater Ecogenomic Robot
- Detects DNA and toxins
- Sends result as image
- Near real-time data
Image Source: Center for Environmental Visualization, UW, in (Scholin, 2008)

Image Source: WHOI

Image Source: OOI, Center for Environmental Visualization, UW

Image Source: IOOS

Data Sources Data Capture Data Storage Standards Metadata Quality Data Access Networking Security

Image Source: IOOS

Regional IOOS portals: Integrate, Catalog, Provide Public Access
- AOOS (Alaska)
- NANOOS (Northwest Pacific)
- GLOS (Great Lakes)
- NERACOOS (Northeast)
- CeNCOOS (Central California)
- GCOOS (Gulf of Mexico)
- MARACOOS (Mid-Atlantic)
- PacIOOS (Pacific)
- Southern California
- CariCOOS (Caribbean)
- SECOORA (Southeast)
Image Source: Regional IOOS Portals

Example: Tracking Hazardous Algal Blooms

... and CO-OPS in IOOS DIF format for gridded data, i.e. netCDF/CF. In Phase 2, the surface currents modeled by CSDL were used instead of observations. The CSDL model output was provided in the same netCDF/CF format and was accessible via the IOOS DIF standard access services, i.e. OPeNDAP/WCS. However, the netCDF files that feed the particle tracker were retrieved via FTP because of limitations on the client side. The project architecture and data flow diagram is presented in Figure 3-1.

Image Source: IOOS Data Integration Framework, Final Assessment Report
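
The excerpt above mentions access to gridded model output via OPeNDAP. As a rough illustration of what such a request looks like, the sketch below builds a DAP2-style URL with a hyperslab constraint (`variable[start:stride:stop]` per dimension). The server URL, the variable name `u`, and the helper `opendap_subset_url` are hypothetical; the report does not give the actual endpoints.

```python
def opendap_subset_url(base_url, variable, index_ranges, response="ascii"):
    """Build a DAP2 request URL for a hyperslab of one gridded variable.

    index_ranges holds one (start, stride, stop) tuple per dimension.
    In DAP2 constraint expressions the stop index is inclusive.
    """
    if not index_ranges:
        raise ValueError("need at least one (start, stride, stop) range")
    for start, stride, stop in index_ranges:
        if start < 0 or stride < 1 or stop < start:
            raise ValueError(f"invalid range: {(start, stride, stop)}")
    # One bracketed range per dimension, e.g. [0:1:0][10:1:20]
    hyperslab = "".join(f"[{a}:{b}:{c}]" for a, b, c in index_ranges)
    return f"{base_url}.{response}?{variable}{hyperslab}"


# Hypothetical dataset: eastward surface current "u" on a (time, lat, lon) grid.
url = opendap_subset_url(
    "http://example.org/thredds/dodsC/model/surface_currents.nc",
    "u",
    [(0, 1, 0), (10, 1, 20), (30, 1, 40)],
)
print(url)
```

Requesting only the needed time step and bounding box in this way avoids transferring whole files, which is the practical difference from the FTP retrieval described in the excerpt.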

Data Access Data Discovery Standards Data Integration Vocabularies Data Preparation

Actionable Data from In-Situ Sensors
- Parsing, Data Preparation
- Pattern Recognition, Signal Detection
- Update Monitoring Dashboard
Image Source (top right): Metfies et al., Ecology of Harmful Algae, 2006
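
The steps named on this slide (parsing, data preparation, signal detection) could be sketched as below. This is a minimal illustration, not the Spyglass implementation: the `timestamp,value` line format, the function names `parse_readings` and `detect_signals`, and the trailing-window z-score rule are all assumptions chosen for the example.

```python
from statistics import mean, stdev


def parse_readings(lines):
    """Parse hypothetical 'timestamp,value' lines; skip blank or malformed ones."""
    values = []
    for line in lines:
        parts = line.strip().split(",")
        if len(parts) != 2:
            continue
        try:
            values.append(float(parts[1]))
        except ValueError:
            continue  # non-numeric value field
    return values


def detect_signals(values, window=5, z_threshold=3.0):
    """Return indices whose value rises more than z_threshold standard
    deviations above the mean of the preceding `window` values."""
    alerts = []
    for i in range(window, len(values)):
        baseline = values[i - window:i]
        mu = mean(baseline)
        # Floor sigma so a perfectly flat baseline does not divide by zero.
        sigma = max(stdev(baseline), 1e-9)
        if (values[i] - mu) / sigma > z_threshold:
            alerts.append(i)
    return alerts


readings = parse_readings([
    "t0,10", "t1,11", "t2,10", "t3,12", "t4,11",
    "t5,10", "t6,11", "t7,95", "t8,12", "t9,11",
])
print(detect_signals(readings))  # index of the spike at t7
```

The returned indices are what a dashboard update step would consume, for example to raise an alert for the matching timestamps.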

ESP Sensor Data Time Series Data Histogram Time Series Data Image Data Image Spot Map Spot Box Plot Images Organism Concentration Organism Summary Data Spot Detail Data Diagnostic Data Sensor Components Log File Details Time Series Scatterplot Matrix Calendars Horizon Chart Details Timeline of Events

Portal Data ESP Data Organisms Images CTD, Can Diagnostics IOOS Data Context Metadata Facets Spatial Facet Time Series Data Overview Result Sets
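
Faceted filtering of portal records, as the facets on this slide suggest, can be sketched as follows. The record fields (`source`, `kind`, `region`) and both function names are hypothetical; the actual portal schema is not given in the slides.

```python
from collections import Counter


def filter_by_facets(records, selections):
    """Keep records that match every selected facet.

    selections maps a facet name to the set of allowed values, e.g.
    {"kind": {"time series"}}. A record missing a selected facet is dropped.
    """
    return [
        r for r in records
        if all(r.get(facet) in allowed for facet, allowed in selections.items())
    ]


def facet_counts(records, facet):
    """Count how many records carry each value of one facet."""
    return Counter(r[facet] for r in records if facet in r)


# Hypothetical result set mixing ESP and IOOS data.
records = [
    {"source": "ESP", "kind": "image", "region": "CeNCOOS"},
    {"source": "ESP", "kind": "time series", "region": "CeNCOOS"},
    {"source": "IOOS", "kind": "time series", "region": "NANOOS"},
    {"source": "IOOS", "kind": "time series", "region": "CeNCOOS"},
]
hits = filter_by_facets(records, {"kind": {"time series"}, "region": {"CeNCOOS"}})
print(len(hits), dict(facet_counts(hits, "source")))
```

Recomputing the counts on the filtered set is what lets a portal show, next to each remaining facet value, how many results it would return.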

Web-based visualization
- Massive distributed data sets
- Bandwidth constraints
- Highly interactive graphics
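
One common way to reconcile large time series with limited bandwidth is to downsample on the server before sending points to the browser. The sketch below uses min/max bucket decimation, which is an assumption for illustration; the slides do not say which technique the portal uses. The function name `minmax_downsample` is likewise hypothetical.

```python
def minmax_downsample(points, buckets):
    """Reduce a time-sorted list of (t, value) tuples to at most 2 * buckets
    points by keeping the minimum and maximum of each bucket, so that
    spikes stay visible in the plotted line."""
    if buckets < 1:
        raise ValueError("buckets must be >= 1")
    n = len(points)
    if n <= 2 * buckets:
        return list(points)  # already small enough to send as-is
    out = []
    for b in range(buckets):
        # Integer arithmetic keeps bucket edges exact and covers every point.
        chunk = points[b * n // buckets:(b + 1) * n // buckets]
        if not chunk:
            continue
        lo = min(chunk, key=lambda p: p[1])
        hi = max(chunk, key=lambda p: p[1])
        # Emit both extremes in time order; a flat chunk yields one point.
        out.extend(sorted({lo, hi}))
    return out


series = list(enumerate([0, 1, 5, 1, 0, 0, 9, 0, 1, 2, 1, 0]))
print(minmax_downsample(series, buckets=2))
```

Keeping bucket extremes rather than averages matters for monitoring use: an averaged bucket can hide exactly the short peak an operator needs to see.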

Variety Volume Velocity Challenges Research & Technology Applications Structured: (Time Series) Unstructured: (Log Files) Massive Data Sets Real-Time Data Data and Metadata Standards Data Services & Discovery Data Integration Data Mining Process Mining Faceted Search Distributed Databases Distributed Processing Distributed Architecture Sensor Networks Process Integration Visualization Architecture Data Analysis Process Analysis Business Intelligence Decision Support Monitoring

Thank You
Dr. Paul Janecek
CEO, Think Blue Data
Visiting Faculty, Computer Science and Information Management
paul@thinkbluedata.com