Biodiversity Data Exchange Using PRAGMA Cloud

Similar documents
GIS Data Models for INSPIRE and ELF

Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace

Building PRAGMA Grid

Report to the NOAA Science Advisory Board

National LCC Meeting. ensuring maximum efficiency of LCC conservation delivery

Network Analysis with Python. Deelesh Mandloi

Why a single source for assets should be. the backbone of all your digital activities

INTRODUCTION TO DATA MANAGEMENT

Using Big Data and GIS to Model Aviation Fuel Burn

Top Ten Security and Privacy Challenges for Big Data and Smartgrids. Arnab Roy Fujitsu Laboratories of America

LIBER Case Study: Author: Mijke Jetten, University Library, Radboud University,

Overview of state of art in Data management. Stefano Cozzini CNR/IOM and exact lab srl

NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons

EXPLORING AND SHARING GEOSPATIAL INFORMATION THROUGH MYGDI EXPLORER

Report of the DTL focus meeting on Life Science Data Repositories

Intro to Data Management. Chris Jordan Data Management and Collections Group Texas Advanced Computing Center

Project Title: Project PI(s) (who is doing the work; contact Project Coordinator (contact information): information):

An Esri White Paper June 2011 ArcGIS for INSPIRE

Enterprise GIS Solutions to GIS Data Dissemination

CIP s Open Data & Data Management Guidelines and Procedures

Technical. Overview. ~ a ~ irods version 4.x

EBONE. European Biodiversity Observation Network: Design of a plan for an integrated biodiversity observing system in space and time

Data quality Vision at SBBr Danny Vélez

Biology Institute: 7 PhD programs Expertise in all areas of biological sciences

DMBI: Data Management for Bio-Imaging.

EVERYTHING YOU NEED FOR BRANDING ON MULTIPLE CHANNELS

Data Models For Interoperability. Rob Atkinson

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

National Geothermal Data System and Global Geosciences Data Integration

Paxata Security Overview

Digital Asset Management Developing your Institutional Repository

Task AR-09-01a Progress and Contributions

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

USGS Community for Data Integration

A Service for Data-Intensive Computations on Virtual Clusters

Scalable Services for Digital Preservation

Coastal Waters Consortium (CWC) Data Management Plan

A Binary Tree SMART CTO Webinar. Analyzing and Rationalizing Big Data in Messaging

ediscovery Solutions

GeoManitoba Spatial Data Infrastructure Update. Presented by: Jim Aberdeen Shawn Cruise

Extend Business Scope and Improve Governance with SAP Content Management

Building Platform as a Service for Scientific Applications

REACCH PNA Data Management Plan

IBM ediscovery Identification and Collection

Integrated Rule-based Data Management System for Genome Sequencing Data

BIG DATA: DATA EVERYWHERE

How To Write A Blog Post On Globus

Research Data Management

Spatial Data Infrastructure. A Collaborative Network

Document Management. Document Management for the Agile Enterprise. AuraTech Pte Ltd

UK Location Programme

TRTML - A Tripleset Recommendation Tool based on Supervised Learning Algorithms

Sextant. Spatial Data Infrastructure for Marine Environment. C. Satra Le Bris, E. Quimbert, M. Treguer

A Binary Tree SMART Migration Webinar. SMART Solutions for Notes- to- Exchange Migrations

A Radicati Group Webconference

Processing Biological Data in i-marine

Data Management. Facility Access Challenges: Rudi Eigenmann NEES Operations Headquarters NEEScomm Center Purdue University

Data access and management

THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy Page 1 of 8

Best Practices for Data Management. RMACC HPC Symposium, 8/13/2014

Survey of Canadian and International Data Management Initiatives. By Diego Argáez and Kathleen Shearer

Content Management Integrated with DAM for Marketing Communication

INTEGRATING RECORDS SYSTEMS WITH DIGITAL ARCHIVES CURRENT STATUS AND WAY FORWARD

SURFsara Data Services

Cloud-based Infrastructures. Serving INSPIRE needs

Meister Going Beyond Maven

GOSIC NEXRAD NIDIS NOMADS

Virginia Commonwealth University Rice Rivers Center Data Management Plan

I.R.I.S. Solutions for Invoice Processing

Data Management. Courtesy of Calisphere: SUSAN BORDA DIGITAL CURATION LIBRARIAN

Information Migration

Autonomy Consolidated Archive

IO Informatics The Sentient Suite

Functional Requirements for Digital Asset Management Project version /30/2006

How To Write An Nccwsc/Csc Data Management Plan

Managing Bathymetry in the Cloud with GIS

E- Discovery in Criminal Law

Distribution Services: How to Automate and Publish Personalized Analytics via , File and Print. Suhrud Atre

ICT Perspectives on Big Data: Well Sorted Materials

An ESRI White Paper July 2009 Creating and Maintaining a Geoportal Management Considerations

Profiling as a Service

Mindshare Studios Introductory Guide to Content Management Systems

From Geoportal to Spatial Data Service Platform. Jani Kylmäaho National Land Survey of Finland Development Centre


EPrints Preservation Update

Achieving a Step Change in Digital Preservation Capability

Enabling embedded maps

The Czech Digital Library and Tools for the Management of Complex Digitization Processes

Portal for ArcGIS. Satish Sankaran Robert Kircher

Networking Library Services:

EEOS Spatial Databases and GIS Applications

Real-Time Analytics on Large Datasets: Predictive Models for Online Targeted Advertising

Building community clouds to support access to scholarship. Michele Kimpton CEO, DuraSpace Jonathan Markow CSO, DuraSpace

From classical web mapping publica2on to INSPIRE service architecture in the Cloud InGeoCloudS BRGM, Pierre LAGARDE

California Department of Fish and Game (Wildlife) GIS Data and Services

Data Lifecycle Management

Archive I. Metadata. 26. May 2015

Global Scientific Data Infrastructures: The Big Data Challenges. Capri, May, 2011

Big Data in OpenTopography

Data Management using irods

Transcription:

Biodiversity Data Exchange Using PRAGMA Cloud Mount Kinabalu biodiversity interoperability experiment Umashanthi Pavalanathan, Aimee Stewart, Reed Beaman, Shahir Shamsir C. J. Grady, Beth Plale

Experimenters, infrastructure, and U. Pavalanathan B. Plale A. Stewart C.J. Grady S. Shamsir S.N. Azmy C.T. Han R. Beaman A. Weischselbaumer data providers

Biodiversity Research Examines variajon and interacjon among living things and complex systems Fundamental to a healthy and sustainable planet Loss is a leading environmental and social issue.

MoJvaJon Biodiversity applicajons are data driven by nature DistribuJon paoerns can be revealed through analysis of large volumes of species occurrence data using techniques such as species distribujon modeling Analysis tools, data discovery methods, and cloud compujng all contribute to the solujon

RaJonale for the interoperability experiment Opening opportunijes to do biodiversity research with scalable infrastructure Improving access to shared data Forming a Community of PracJce through collaborajons in biology, informajon sciences, computer science, engineering

Experiment Proof of concept biodiversity applicajon ujlizing distributed data and doing useful data exchange in the PRAGMA cloud Basic applicajon of species distribujon modeling using Lifemapper LmSDM

Data Specimen collecjon records illustrajng plant diversity on Mount Kinabalu, notable for its high diversity and endemism of species and ultramafic environments Metadata files describing nine species distribujon data sets are uploaded to a GeoPortal server running at UniversiJ Teknologi Malaysia (UTM)

Workflow

Lifemapper: LmSDM: Species DistribuJon Modeling Species Occurrence Data SDM Modeling Algorithm Environmental Data Predicted Habitat

Biodiversity ExpediJon Data Prep Input data Requirements for Occurrence points Requirements for Environmental Layers ModificaJons for Mt Kinabalu data Extensions to Lifemapper core

PAM Basics The world is divided in an equal-area grid of cells The PAM is a binary matrix. δ i,j notes presence or absence of each species j in each cell i The marginals provide siterichnesses (α i ) and the species-range sizes (ω j ) β W = 1/ω Sites Species A 1 0 1 1 3 B 1 1 0 0 2 C 1 0 0 0 1 3 1 1 1 6 Ranges Richness

Terrestrial Mammals ProporJonal Species Richness High Yellow Moderate Red Low Blue Per- site Range Size

Design for Collaboration Data Archive 13

Cataloging Metadata Metadata repositories are crucial to preserving scienjfic investments in data by enabling metadata collecjon, long- term preservajon, and reuse of scienjfic data

Esri GeoPortal Server Open source metadata server that enables discovery and use of geospajal resources Uses emerging standards such as Open GeospaJal ConsorJum (OGC)'s Catalog Service for the Web (CSW) Simplifies the cataloging and avoids staleness of metadata

The workflow (Demo)

Open Problems PRAGMA Cloud Security Data are sensijve in that they reveal ecologically sensijve informajon. What are the cloud security measures to be taken for controlled access of sensijve data? Agreements on Core Metadata Discovery and reuse of scienjfic outcomes from these applicajons depend on automated or manual extracjon of rich metadata about the datasets and predicjon outputs. For this to happen, some agreement must exist on core metadata.

Open Problems Ownership of Results When analysis is carried out on PRAGMA cloud, the resuljng dataset can contribute to enriching the data of the cloud. How is ownership and sharing tracked?

Open Problems Metadata Catalog FederaEon: We demonstrated use of two GeoPortal instances. What is the PRAGMA- wide solujon for metadata catalog federajon? - Using GeoGrid? - Discussion during Resources and Data Working Group Breakout Session Thursday 11:00 12:00

Future PRAGMA Biodiversity ExpediJon Extend for muljple Mt. Kinabalu species High resolujon grid Extend metadata To automate data ingesjon To more fully capture provenance of outputs For transparent, reproducible science

Thank You!