Earth Data Science in The Era of Big Data and Compute



Similar documents
Development of an Impervious-Surface Database for the Little Blackwater River Watershed, Dorchester County, Maryland

Can GIS Help You Manage Water Resources? Erika Boghici Texas Natural Resources Information Systems

The following was presented at DMT 14 (June 1-4, 2014, Newark, DE).

Michigan Tech Research Institute Wetland Mitigation Site Suitability Tool

AUTOMATION OF FLOOD HAZARD MAPPING BY THE FEDERAL EMERGENCY MANAGEMENT AGENCY ABSTRACT INTRODUCTION

Request for Proposals for Topographic Mapping. Issued by: Teton County GIS and Teton County Engineering Teton County, Wyoming

Business Plan for Orthoimagery in North Carolina

A Method Using ArcMap to Create a Hydrologically conditioned Digital Elevation Model

USGS QUADRANGLES IN GOOGLE EARTH

Remote Sensing, GPS and GIS Technique to Produce a Bathymetric Map

delorme.com/earthmate Earthmate A Guide to the Complete GPS Navigation Solution for Smartphones and Tablets.

Geospatial Positioning Accuracy Standards Part 3: National Standard for Spatial Data Accuracy

North Dakota GIS Program Report To Governor Jack Dalrymple. July 1, 2011 June 30, 2012

ArcGIS Reference Document

DEVELOPING AN INUNDATION MAP STANDARD FOR THE U.S. ARMY CORPS OF ENGINEERS

GIS Data Discovery Workshop

Create a folder on your network drive called DEM. This is where data for the first part of this lesson will be stored.

Geospatial Information for disaster risk reduction and natural resources management. Rolando Ocampo Alcántar

White Paper. PlanetDEM 30. PlanetObserver 25/11/ Update

Appendix J Online Questionnaire

TerraColor White Paper

STATE OF NEVADA Department of Administration Division of Human Resource Management CLASS SPECIFICATION

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)

Guidance for Flood Risk Analysis and Mapping. Changes Since Last FIRM

Notable near-global DEMs include

A HYDROLOGIC NETWORK SUPPORTING SPATIALLY REFERENCED REGRESSION MODELING IN THE CHESAPEAKE BAY WATERSHED

Qatar National Geospatial Infrastructure

REGIONAL SEDIMENT MANAGEMENT: A GIS APPROACH TO SPATIAL DATA ANALYSIS. Lynn Copeland Hardegree, Jennifer M. Wozencraft 1, Rose Dopsovic 2 INTRODUCTION

Utah State General Records Retention Schedule SCHEDULE 1 GEOSPATIAL DATA SETS

A Brief Explanation of Basic Web Services

CityGML goes to Broadway

Finding GIS Data and Preparing it for Use

The Status of Geospatial Information Management in China

Site-specific management at Bowles Farming Company. UC Davis Precision Ag Workshop 7/14/2010 Cannon Michael Bowles Farming Company, Inc.

Mapping Solar Energy Potential Through LiDAR Feature Extraction

Advanced Image Management using the Mosaic Dataset

Learning about GPS and GIS

Enterprise GIS Business Plan July 4, 2008

Metadata for Big River Watershed Geologic and Geomorphic Data

A Geospatial Solution for Minimizing Risk. Pipeline Hazard Categorization

Opportunities for the generation of high resolution digital elevation models based on small format aerial photography

Version 3.0, April 16, 2012, updated for ArcGIS 10.0 Produced by the Geographic Information Network of Alaska

GIS Data Quality and Evaluation. Tomislav Sapic GIS Technologist Faculty of Natural Resources Management Lakehead University

Making Geospatial Data Available and Accessible in Jamaica

Managing Lidar (and other point cloud) Data. Lindsay Weitz Cody Benkelman

Landsat Monitoring our Earth s Condition for over 40 years

AUTOMATED DEM VALIDATION USING ICESAT GLAS DATA INTRODUCTION

Creating a More Resilient Future. Friday 30 May, 11:00 to 12:30, Rooms S29-31

MSDI: Workflows, Software and Related Data Standards

Creating the US Topo

Data Integration Strategies

LIDAR and Digital Elevation Data

IMPERVIOUS SURFACE MAPPING UTILIZING HIGH RESOLUTION IMAGERIES. Authors: B. Acharya, K. Pomper, B. Gyawali, K. Bhattarai, T.

Homeland Security Infrastructure Program HSIP Gold 2012 September 2012

2D Modeling of Urban Flood Vulnerable Areas

Geographic Information Systems

Principles and Practices of Data Integration

MINE MAP DIGITIZATION & GIS IMPLIMENTATION

Numerical Modeling and Simulation of Extreme Flood Inundation to Assess Vulnerability of Transportation Infrastructure Assets

AERIAL PHOTOGRAPHS. For a map of this information, in paper or digital format, contact the Tompkins County Planning Department.

3-D Object recognition from point clouds

Natural Resource-Based Planning*

The USGS Landsat Big Data Challenge

Pima Regional Remote Sensing Program

Publishing Hosted 3D Feature Layers. An Esri White Paper September 2015

Inter Swath Data Quality Measures to Assess Quality of Calibration of Lidar System. U.S. Department of the Interior U.S.

Review for Introduction to Remote Sensing: Science Concepts and Technology

Coastal Engineering Indices to Inform Regional Management

Application of Google Earth for flood disaster monitoring in 3D-GIS

Description of the table of the in-situ data requirements of GMES services

Applying GIS in seismic hazard assessment and data integration for disaster management

Implementation of information system to respond to a nuclear emergency affecting agriculture and food products - Case of Morocco

RiMONITOR. Monitoring Software. for RIEGL VZ-Line Laser Scanners. Ri Software. visit our website Preliminary Data Sheet

A GIS helps you answer questions and solve problems by looking at your data in a way that is quickly understood and easily shared.

APLS GIS Data: Classification, Potential Misuse, and Practical Limitations

Getting Started With LP360

LiDAR Data Management Lessons for Geospatial Data Managers

GEOENGINE MSc in Geomatics Engineering (Master Thesis) Anamelechi, Falasy Ebere

Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps

EO based glacier monitoring

The X100. Safe and fully automatic. Fast and with survey accuracy. revolutionary mapping. create your own orthophotos and DSMs

Transcription:

Earth Data Science in The Era of Big Data and Compute E. Lynn Usery U.S. Geological Survey usery@usgs.gov http://cegis.usgs.gov U.S. Department of the Interior U.S. Geological Survey Board on Earth Sciences and Resources April 29, 2015

Panel Questions How do we collect, access, manage, process, analyze, visualize, interpret and curate Earth big data? What are the novel approaches to the science of Earth big data and needs for future development of data cyberinfrastructure?

The National Map The National Map includes eight data layers: land cover, structures, boundaries, hydrography, geographic names, transportation, elevation, orthoimagery Public domain data to support USGS topographic maps at 1:24,000-scale Products and services at multiple scales and resolutions Analysis, modeling and other applications at multiple scales and resolutions The National Map is built on partnerships and standards

Products and Services of The National Map Data Products National databases of base geospatial data content National Hydrography Dataset Best Practices Databases: Transportation, Structures, Boundaries (Gov Units), Elevation data and other lidar derivatives NAIP and High Resolution Orthoimagery National Land Cover Dataset (developed and maintained under a separate USGS program) Geographic Names (Geographic Names Information System and Gaz-Vector Integrated Data) Derived Products US Topo Historical Topographic Map Collection 4

The National Map- Hydrography The National Hydrography Dataset is the hydrography component of The National Map Represents the surface water of the United States Complete, national, seamless coverage at 1:100,000-scale (2001); 1:24,000-scale (2007); 1:4,800-scale level of content ongoing National coverage at 1:24,000 scale is greater than 600 Gb of data

The National Map- Orthoimagery National Agriculture Imagery Program Acquired by the US Dept of Agriculture, Farm Services Agency; USGS, other Federal agencies and States are partners Nationwide coverage 4-band; Natural color 1-meter 3.75 X 3.75 min tiles Data at full resolution for Dent County, MO is 800 Gb 6

The National Map- Elevation Complete national coverage of 10- meter resolution or better elevation data; substantial data at 3 m (1/9 th arc-second) New data collected are lidar (target resolution 1/9 th arc second) IfSAR data in Alaska (5 m) Lidar point cloud data also delivered as a product USGS Base lidar Acquisition specification

The National Map - Elevation: Quality Levels Quality Level Horizontal Point Spacing (meters) Vertical Accuracy (centimeters) 1 0.35 9.25 Description High accuracy and resolution lidar example: lidar data collected in the Pacific Northwest 2 0.7 9.25 Medium-high accuracy and resolution lidar 3 1-2 <18.5 4 5 46-139 5 5 93-185 Medium accuracy and resolution lidar analogous to USGS specification v. 13 and most data collected to date Early or lower quality lidar and photogrammetric elevations produced from aerotriangulated NAIP imagery Lower accuracy and resolution, primarily from IfSAR

3D Elevation Program (3DEP) The 3DEP initiative implements one of the 10 program scenarios resulting from the National Enhanced Elevation Assessment (NEEA) study Key 3DEP goals: Lidar data QL-2 over the conterminous United States, Hawaii, and the territories on an eight-year cycle IfSAR data QL-5 over Alaska Lidar point cloud data to be publically accessible Multiple derivative products will be supported as services and will be freely available 3DEP is a program initiative of the USGS with operational distribution begun in 2015

3DEP Data Volumes For the purposes of the infrastructure assessment, 3DEP data volume is estimated at 9.4 PB.

Example Areas of Application of 3DEP Elevation Data Precision Farming Land Navigation and Safety Geologic Resources and Hazards Mitigation Natural Resource Conservation Infrastructure Management Flood Risk Mitigation

USGS Big Data Big data is not simply volume of data USGS has collected, processed, and distributed petabytes of data for decades We process and use these data for earth science applications with a divide and conquer approach Quadrangle mapping is a divide and conquer approach Staging data and scale thresholds for viewing are a divide and conquer approach 12

Big Data Big data occurs when divide and conquer will not work and a requirement to handle all data to get solution exists Similar to global operations in image processing vs local or neighborhood processes Example with 3DEP data Watershed modeling and analysis must handle data for the entire watershed and with lidar data at QL 2 resolutions, this is a big data problem. 13

Novel approaches Parallel computing, but for our problems parallel input/output operations are critical Geospatial data are generally well-suited to parallel approaches Segment geographic space and send each spatial component to a different processor 14

Novel approaches Move processing to the data Once data are loaded on our servers, we do not move the data again. Use server-side processing on the computer on which the data are stored Build cyberinfrastructure to support big data processing Network speeds are critical Parallel operations requires rethinking how we build systems and software 15

Earth Data Science in The Era of Big Data and Compute E. Lynn Usery U.S. Geological Survey usery@usgs.gov http://cegis.usgs.gov U.S. Department of the Interior U.S. Geological Survey Board on Earth Sciences and Resources April 29, 2015