USGS Community for Data Integration



Similar documents
Data Integration Strategies

Databases & Data Infrastructure. Kerstin Lehnert

REACCH PNA Data Management Plan

Data Management Best Practices for Landscape Conservation Cooperatives Part 1: LCC Funded Science

CDI SSF Category 1: Management, Policy and Standards

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21)

The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team

OpenAIRE Research Data Management Briefing paper

Metadata Hierarchy in Integrated Geoscientific Database for Regional Mineral Prospecting

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington

DISCIPLINES LIST DIVISION 2

The Digicene: the Age of Big Data in the Geosciences

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration

Big data from Space : esperienze in GEOSS e altre iniziative internazionali

Task AR-09-01a Progress and Contributions

School Department Subject Course #

Data Management Considerations for the Data Life Cycle

A Capability Maturity Model for Scientific Data Management

Description and Testing of the Geo Data Portal: A Data Integration Framework and Web Processing Services for Environmental Science Collaboration

NATIONAL CLIMATE CHANGE & WILDLIFE SCIENCE CENTER & CLIMATE SCIENCE CENTERS DATA MANAGEMENT PLAN GUIDANCE

National LCC Meeting. ensuring maximum efficiency of LCC conservation delivery

NOAA Environmental Data Management Update for Unidata SAC

How To Write An Nccwsc/Csc Data Management Plan

Luc Declerck AUL, Technology Services Declan Fleming Director, Information Technology Department

Environment Canada Data Management Program. Paul Paciorek Corporate Services Branch May 7, 2014

EUDAT. Towards a pan-european Collaborative Data Infrastructure

How To Become A Scientist

Global Scientific Data Infrastructures: The Big Data Challenges. Capri, May, 2011

Big Data R&D Initiative

BIG DATA Funding Opportunities

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)

Research Data Alliance: Current Activities and Expected Impact. SGBD Workshop, May 2014 Herman Stehouwer

Standard Big Data Architecture and Infrastructure

National Geothermal Data System and Global Geosciences Data Integration

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Project Title: Project PI(s) (who is doing the work; contact Project Coordinator (contact information): information):

Computational Science and Informatics (Data Science) Programs at GMU

SHARING RESEARCH DATA POLICY, INFRASTRUCTURE, PEOPLE

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / %

Data Grids. Lidan Wang April 5, 2007

Nevada NSF EPSCoR Track 1 Data Management Plan

National Big Data R&D Initiative

Development of Enhanced Feature Recognition Software for the Extraction of Mine Features from USGS Topographic Maps

USGS Data Management Training Modules

Workprogramme

Standard 5: Use a consistent data management framework in accordance with internal and partner organization data standards.

Workspaces Concept and functional aspects

CDI/THREDDS Interoperability: the SeaDataNet developments. P. Mazzetti 1,2, S. Nativi 1,2, 1. CNR-IMAA; 2. PIN-UNIFI

Coastal Waters Consortium (CWC) Data Management Plan

Pan-European infrastructure for management of marine and ocean geological and geophysical data

Global Earth Observation Integrated Data Environment (GEO-IDE) Presentation to the Data Archiving and Access Requirements Working Group (DAARWG)

GOSIC NEXRAD NIDIS NOMADS

11-12 June 2015, Bari-Italy. Stefano Nativi CNR-IIA

3. CyAir Recommendations

BYODs & FAIR Data Stewardship

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Technical Layer (Technical Interoperability) Information Layer (Information Interoperability. Business Layer (Business Process Interoperability)

Creating a More Resilient Future. Friday 30 May, 11:00 to 12:30, Rooms S29-31

Data Publishing Workflows with Dataverse

National Land Cover Database Visualization and Information Tool

CYBERINFRASTRUCTURE FRAMEWORK $143,060,000 FOR 21 ST CENTURY SCIENCE, ENGINEERING, +$14,100,000 / 10.9% AND EDUCATION (CIF21)

Managing Idaho s Landscapes for Ecosystem Services (MILES) Idaho EPSCoR Research Data Management Plan Version 5.0 (October 12, 2015)

How To Teach Data Science

Introduction to Service Oriented Architectures (SOA)

Meeting Increasing Demands for Metadata

Data Driven Discovery In the Social, Behavioral, and Economic Sciences

Region 10 Sharing Analytical Data

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Interoperable Solutions in Web-based Mapping

ICSTI 2014 General Assembly October 18-19, 2014

Carlos Iglesias, Open Data Consultant.

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Assessment Plan Mission

COGNITIVE SCIENCE AND NEUROSCIENCE

CAREER TRACKS PHASE 1 UCSD Information Technology Family Function and Job Function Summary

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

The EcoTrends Web Portal: An Architecture for Data Discovery and Exploration

Open Access to Manuscripts, Open Science, and Big Data

ESS EA TF Item 2 Enterprise Architecture for the ESS

NOS for Data Analysis (802) September 2014 V1.3

Global Research Data Infrastructure: Path Forward for Progress. Dr. Chris Greer Senior Executive for Cyber Physical Systems

STUDENT APPLICATION FOR SLOAN SIGP SCHOLARSHIP

CASRAI, eurocris, Lattes, and VIVO: Four Perspectives on Research Information Standards

Databases in Organizations

Graduate Degree Granting Units and Programs at Michigan State University Accounting - Master of Science Advertising - Master of Arts African American

A Web services solution for Work Management Operations. Venu Kanaparthy Dr. Charles O Hara, Ph. D. Abstract

NCDC Strategic Vision

SAHRA Metadata Guidelines

Data at NIST: A View from the Office of Data and Informatics

Data Curation for the Long Tail of Science: The Case of Environmental Sciences

THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy Page 1 of 8

Geospatial Data Stewardship at an Interdisciplinary Data Center

data.bris: collecting and organising repository metadata, an institutional case study

Potential standardization items for the cloud computing in SC32

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Enabling Semantic Search in Geospatial Metadata Catalogue to Support Polar Sciences

STUDENT APPLICATION FOR SLOAN SCHOLARSHIP

Report to the NOAA Science Advisory Board

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.

Transcription:

Community of Science: Strategies for Coordinating Integration of Data USGS Community for Data Integration Kevin T. Gallagher USGS Core Science Systems January 11, 2013 U.S. Department of the Interior U.S. Geological Survey

United Nations & World Bank Global Issues 2

The USGS Science Strategy Reiterates the Need to Be Interdisciplinary The inherent complexity of nature The need to explore questions and problems that are not confined to a single discipline The need to solve societal problems The power of new technologies (Committee on Facilitating Interdisciplinary Research, National Academy of Sciences, 2004) 3

The Specialization or Balkanization of Science Microbiology Geology/Geochemistry/Hy drology Landscape Ecology Community Ecology Epidemiology Ornithology Mammalogy Entomology Botany Sc cale Population Ecology Behavior Physiology Cell Biology Molecular Biology Agriculture Health Natural Resources Application

The President s Science Policy Quotes from the Office of Science and Technology: Data-Enabled Science ( Big Data ) Improved approaches are needed to derive science and social value from the vast amount of data we are now acquiring. Research and development in such approaches as algorithms, data mining, analytics, and visualization tools should be priorities. 5

U.S. Department of the Interior U.S. Geological Survey Data Integration

Data Integration: What is it? It appears intangible. Is it a concept? A picture? An Architecture? APortal? Data Standards? A Data Warehouse? Will we know it when we see it? How will we measure progress towards it? 7

Data Integration: What is it? Access & Discovery Tools Data Extraction, Transformation, Load Interoperable Data Semantic Standards, Vocabulary, Portals, Registries, Catalogs Taxonomy, Schema or Dictionary Applications Geospatial Analysis, Modeling, Visualization Standards for Exchange Protocols, Metadata, Standards 8

Data Integration: Examples Access & Discovery Search Tools Indexed d Science Content t (by geographic location, science topic, or time) Comprehensive Science Catalog (Data, Publications, Specimens, s, etc.) Web Services Portals Applications Geospatial Viewers Quantitative and Analytical Models 3-D and 4-D Visualization High Performance Computing Standards d & Tools XML, and evolving international standards (GeoSciML) Data Loading, Metadata, and Data Exchange Interoperable Data Semantics, Integrated Data Modeling, Archiving and Retrieval Data Hosting Capability Extraction, Load, and Query 9

What Does Data Integration Look Like? Eye Test Data -or - Data What do you see? 10

After Many s, What do you See? -or - Data 11

The Integrated Data View Many participate in the data resource The Data is: Visible Indexed and Accessible Developed According to Science Planning Commonly defined Quality Controlled Designed and Standardized Yet Continually Evolving How do we Get There? Data 12

The Community for Data Integration Today: Di Driving i the Agenda, Defining i Scope, Priority, Pi it and dtasks Volunteers! Chartered with over 500 members Sharing our stories once a month * Unified behind a long-term vision * Identifying and prioritizing projects that can deliver value or savings in the short term Sharing best practices Leveraging investments Serving on working groups (e.g., semantics, citizen science, data management, technology) Evolving solutions through pilot development & testing 13

Community for Data Integration Selection Guidelines Focus on targeted efforts that solve a short-term science challenge Leverage existing capabilities ( bottom up ) Develop solutions or methodology that can be replicated or repeated as well as scaled Create sustainable infrastructure Seek efficiencies or substantial return on investment Expose corporate data Organize science models and outputs t Preserve and access project data 14

The John Wesley Powell Center http://powellcenter.usgs.gov/ Cross-disciplinary scientific collaboration through Working Groups, now partnering with NSF, EPA and others. JWP Center and Community for Data Integration Use JWP Center as an incubator to test the viability of CDI tools and guidance Use Science Working Groups to identify new data management, modeling and tool requirements Apply CDI tools to Science Working Groups, ensuring models and data are preserved and available using standards 15

Working with Partners Earth Science Information ESRI Partnership (ESIP) Fish and Wildlife Service DataONE Lamont-Doherty Earth Earthcube Observatory Geoscience Information University of Auckland, New Network (GIN) Zealand OneGeology University of Idaho CUASI University of Wisconsin Arizona Geological Survey Rensselaer Polytechnic Colorado State University Institute (RPI) DiscoverLife EPA Tufts University Department of Agriculture

USGS Community for Data Integration 17 https://my.usgs.gov/confluence/display/cdi/home

Community for Data Integration Science Support Framework 18

Big Picture Computational Tools & Services Management, Policy & Standards Data & Information Assets Communities of Practice 19

Upper Level Categories Data & Information Assets Data Persistent archives, data registries, catalogs, data, metadata, derived information products, knowledge bases, vocabularies/ontologies. Technology Applications, web services, data discovery tools, models, semantic services & tools, infrastructure, data brokers and visualization tools Computational Tools & Services and visualization tools. Management, Policy & Standards Communities of Practice Business Data stewardship, implementation ti of the Data Management Life Cycle, knowledge management, data standards, governance and policy. People Scientists, CDI as a whole, CDI Working Groups, external partners, and the human network of scientific domain collaborators. 20

CDI Science Support Framework The foundation for sharing science 21

CDI s Mapped to the Science Support Framework 22

National Water Information System

CDI Data Upload, Registry and Access

Mobile Applications

Semantic Technologies for Integrating USGS Data

Data Management Website

GeoDataPortal

To Join the Community for Data Integration Contact: cdi@usgs.govgov Thank You! For more information, contact: Kevin T. Gallagher (kgallagher@usgs.gov) Cheryl Morris (cmorris@usgs.gov) gov) Jennifer Carlino (jcarlino@usgs.gov)

Geo Data Portal (GDP) A simple Web application and set of processing services that t access downscaled d climate projections and other data resources (such as geospatial datasets) that are otherwise difficult to access and manipulate Currently, historic weather and downscaled climate projections are available (CMIP3, RegCM3, PRISM, and others), more climate, weather and geographic data resources will be added 30