Designing A Data Commons for Sustainability Science: Lessons Learned from a World Data Center


Alex de Sherbinin
Center for International Earth Science Information Network (CIESIN), The Earth Institute, Columbia University

Presentation to the International Workshop on Open Access to Scientific Literature and other Digital Scientific Information Resources in Central America and the Caribbean: Focus on Education and Health for Sustainable Development, 3-4 September 2008, Havana, Cuba

Overview
- Conceptual approach: the modern data center
- The World Data Center for Human Interactions in the Environment
- The case for open access (OA)
- International principles supporting OA
- Issues around OA
- The growing movement towards OA

Evolution of the Data Provider Model
[Diagram. Labels: Researcher (repeated), Research community, Cyberinfrastructure, Data Repository, Public, Decision-making community]

[Diagram. Labels:]
- Reliability and continuity
- Discovery
- Access
- Stewardship
- Catalogs
- On-line databases
- Standards; georeferencing; time-stamping
- Data
- Documentation, visualization
- Integration
- Domain Understanding (What do these data mean? Are they suitable for my use?)
- Interoperability
- Linkage Understanding (How do these data relate to other data?)

What data centers provide
- Reliability and continuity
- Discovery
- Access
- Data
- Domain Understanding (What do these data mean? Are they suitable for my use?)
- Interoperability
- Linkage Understanding (How do these data relate to other data?)

CIESIN's World Data Center
- CIESIN is a data and research center that was established in Michigan in 1989 and joined The Earth Institute at Columbia University in 1998
- CIESIN runs the Socioeconomic Data and Applications Center (SEDAC), one of eight Distributed Active Archive Centers (DAACs) in NASA's Earth Observing System Data and Information System (EOSDIS)
- CIESIN's World Data Center (WDC) for Human Interactions in the Environment is part of the International Council for Science (ICSU) WDC system
  - A virtual data center
  - Supported by SEDAC
  - Focus on global-scale geospatial data

CIESIN's World Data Center
Climate, Conservation, Governance, Hazards, Health, Population, Poverty, Sustainability

Discovery, Domain Understanding: Sustainability Thematic Portal

Discovery: Catalog
- Uses the open-source GeoNetwork software from FAO
- Distributed search using the Z39.50 protocol
- FGDC metadata
- Short profiles with thumbnails
- Full metadata records available
- All data freely available for download

Domain Understanding: Map Gallery

Domain Understanding: Sample Map
Maps are distributed using a Creative Commons license.

Domain Understanding, Linkage Understanding: 2008 TerraViva! SEDAC
- Stand-alone data exploration tool
- Visualization and interpretation
- Carry out simple spatial analyses
http://sedac.ciesin.columbia.edu/terravivauserweb/

Interoperability, Domain Understanding, Linkage Understanding: OGC-compliant Web Mapping
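As a rough illustration of what OGC-compliant web mapping means in practice, the sketch below builds a Web Map Service (WMS) 1.1.1 GetMap request URL. The endpoint `https://example.org/wms` and the layer name `population_density` are hypothetical placeholders, not CIESIN services; only the query parameter names come from the WMS standard.

```python
from urllib.parse import urlencode


def build_getmap_url(base_url, layer, bbox, width=800, height=400):
    """Build a WMS 1.1.1 GetMap URL.

    bbox is (min_lon, min_lat, max_lon, max_lat) in EPSG:4326.
    """
    params = {
        "SERVICE": "WMS",
        "VERSION": "1.1.1",
        "REQUEST": "GetMap",
        "LAYERS": layer,
        "STYLES": "",  # empty string requests the server's default style
        "SRS": "EPSG:4326",
        "BBOX": ",".join(str(v) for v in bbox),
        "WIDTH": str(width),
        "HEIGHT": str(height),
        "FORMAT": "image/png",
    }
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint and layer name, for illustration only.
url = build_getmap_url(
    "https://example.org/wms", "population_density", (-180, -90, 180, 90)
)
print(url)
```

Because every compliant server accepts the same parameters, a client written against one WMS endpoint can in principle request maps from any other, which is the interoperability benefit the slide refers to.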

Linkage Understanding: Helping Users Make Wise Choices
- Traditional documentation is not enough
- Multi-faceted approach required
- Ph.D., B.A., High School, 5th Grade
- Comparative Guides, Visualizations, Common Pitfalls, Examples, Citations

http://sedac.ciesin.org/citations/CitationIndex.html

Poor Livestock Keepers at the Global Level for Research and Development Targeting. Land Use Policy, 20(4): 311-322.
Wilson, S.J., Steenhuisen, F., Pacyna, J.M., and Pacyna, E.G. (2006) Mapping the Spatial Distribution of Global
Tobler, W., Deichmann, U., Gottsegen, J., and Maloy, K. (1997) World Population in a Grid of Spherical Quadrilaterals. International Journal of Population Geography, 3: 203-225.
van Lieshout, M., Kovats, R.S., Livermore, M.T.J., and Martens, P. (2004) Climate Change and Malaria: Analysis of the SRES Climate and Socio-Economic Scenarios. Global Environmental Change, 14(1): 87-99.
Vassolo, S. and Döll, P. (2005) Global-scale Gridded Estimates of Thermoelectric Power and Manufacturing Water Use. Water Resources Research, 41.
Verburg, P.H., and Chen, Y. (2000) Multiscale Characterization of Land-Use Patterns in China. Ecosystems, 3(4): 369-385.
Viviroli, D. and Weingartner, R. (2004) The Hydrological Significance of Mountains: from Regional to Global Scale. Hydrology and Earth System Sciences, 8(6): 1016-1029.
Vorosmarty, C.J., Green, P., Salisbury, J., and Lammers, R.B. (2000) Global Water Resources: Vulnerability From Climate Change and Population Growth. Science, 289(5477): 284-288.
Vorosmarty, C.J., and Sahagian, D. (2000) Anthropogenic Disturbance of the Terrestrial Water Cycle. Bioscience, 50(9): 753-765.
White, M.A., Hoffman, F., Hargrove, W.W., and Nemani, R.R. (2005) A Global Framework for Monitoring Phenological Responses to Climate Change. Geophysical Research Letters, 32(L04705): 4pp.
White, M.A., Nemani, R.R., Thornton, P.E., and Running, S.W. (2002) Satellite Evidence of Phenological Differences Between Urbanized and Rural Areas of the Eastern United States Deciduous Broadleaf Forest. Ecosystems, 5(3): 260-273.

Information on data quality is critical to judging appropriate uses

Sample Analysis
- Data documentation helps to determine fitness for use for a given analysis
- How is the global distribution of poverty associated with the distribution of natural hazards?

Exposure to Multiple Natural Hazards
Seismic hazards include earthquakes and volcanoes; hydrological hazards include floods, cyclones, and landslides.
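One simple way to approach the sample analysis question is to overlay two co-registered grids, one counting poor people per cell and one flagging hazard exposure, and compute the share of the poor population living in exposed cells. The sketch below is only a minimal illustration of that overlay idea; the function name and the five-cell values are invented for the example and are not real data or the method used in the presentation.

```python
def poor_share_in_hazard_cells(poor_counts, hazard_flags):
    """Return the fraction of the poor population living in hazard-exposed cells.

    poor_counts:  number of poor people per grid cell
    hazard_flags: True where the matching cell is exposed to a hazard
    Both sequences must describe the same cells in the same order.
    """
    if len(poor_counts) != len(hazard_flags):
        raise ValueError("grids must have the same number of cells")
    total = sum(poor_counts)
    if total == 0:
        return 0.0
    exposed = sum(p for p, h in zip(poor_counts, hazard_flags) if h)
    return exposed / total


# Invented values for five grid cells, for illustration only.
poor = [100, 50, 0, 250, 100]
hazard = [True, False, True, True, False]
share = poor_share_in_hazard_cells(poor, hazard)
print(f"{share:.0%} of the poor population is in hazard-exposed cells")
```

The result is only meaningful if both grids share the same resolution, extent, and reference year, which is exactly the kind of information data documentation needs to supply.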

Helping users make wise choices is a community-building and community-strengthening task

Reliability & Continuity: Need for Long-Term Archiving
- Much information is now born digital
- Data are being lost at an alarming rate: either not preserved at all, or unreadable due to changing computer systems
- It is important to preserve the discovery and access capabilities, not just back up the data on offline tapes
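One common building block of long-term archiving is a fixity check: a checksum is recorded when a file is ingested and recomputed later to detect silent corruption. The sketch below shows the idea with SHA-256 from the Python standard library. It is an assumption-laden illustration rather than a description of CIESIN's archive, and the function names and sample bytes are hypothetical.

```python
import hashlib


def sha256_digest(data: bytes) -> str:
    """Return the SHA-256 checksum of the data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def verify_fixity(data: bytes, recorded_digest: str) -> bool:
    """Return True if the data still matches the checksum recorded at ingest."""
    return sha256_digest(data) == recorded_digest


# At ingest: store the checksum alongside the archived object.
original = b"gridded population data, version 1"
recorded = sha256_digest(original)

# At a later audit: recompute the checksum and compare.
intact = verify_fixity(original, recorded)
corrupted = verify_fixity(original + b"\x00", recorded)
print(intact, corrupted)
```

A fixity check confirms only that the bits are unchanged; it does not guarantee that the format is still readable or that the data can still be discovered, which is why the slide stresses preserving discovery and access capabilities as well.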

Reliability & Continuity: LTA Architecture Review
- CIESIN reviewed commercial and open source systems to facilitate ingest, preservation, and access
  - Digital asset management systems
  - Electronic records management systems
  - Document management systems
  - Digital repository systems
- We decided to focus on open source approaches to avoid proprietary dependencies
  - DSpace
  - EPrints
  - Fedora
  - Greenstone
- We selected the Flexible Extensible Digital Object Repository Architecture (Fedora)
  - Developed by Cornell University and the University of Virginia
  - Modular approach to facilitate enhancement
  - Active user community of developers and implementers
- We purchased VITAL with Fedora from VTLS
Source: Downs, R. and Chen, R. (2008). Implementing a Digital Repository for the Preservation of Interdisciplinary Data. Presented to the International Association for Social Science Information Services & Technology (IASSIST) 2008 Conference, Palo Alto, CA.

The Case for Open Access (1)
- The US set a benchmark by dedicating much federal data to the public domain
- Government-funded data are often required to be distributed freely, even though creators may retain copyright
- Europe is gradually following suit
- Some developing countries are remarkably far along

The Case for Open Access (2)
- OECD Principles and Guidelines for Access to Research Data from Public Funding
- GEOSS data sharing principles:
  - There will be full and open exchange of data, metadata, and products shared within GEOSS, recognizing relevant international instruments and national policies and legislation.
  - All shared data, metadata, and products will be made available with minimum time delay and at minimum cost.
  - All shared data, metadata, and products being free of charge or no more than cost of reproduction will be encouraged for research and education.

Regional Examples
- Inter-American Institute's Data and Information Service (IAI-DIS)
- NASA's SERVIR for hazards & meteorological data
- Centro Internacional de Agricultura Tropical (CIAT)
- Comisión Económica para América Latina y el Caribe (CEPAL) databases and statistics
- Spatial Data Infrastructure (SDI): Central American Geographic Information Project (PROCIG)
- Brazilian Institute of Geography and Statistics (IBGE)
- China-Brazil satellite data (CBERS)

Barriers to Open Access
- Researchers may not want to release data for fear of findings being scooped
- Data limitations are too costly or time-consuming to correct and would reflect poorly on the researcher
- The transaction costs of data documentation and metadata creation may be too high
- Distributing data freely will undermine institutional objectives:
  - Cost recovery
  - Power
  - Quid pro quo

OA Data Creates Opportunities
- Creates opportunities for new scientific discovery
- Creates opportunities for sustainable development
  - By providing data that solve real problems
  - By creating economic opportunities for small or medium-sized enterprises
- Creates opportunities for international exchange and collaboration

Gracias!
CIESIN World Data Center
http://sedac.ciesin.columbia.edu/wdc/