The ORIENTGATE data platform
|
|
|
- Charleen Eaton
- 10 years ago
- Views:
Transcription
1 Research Papers Issue RP0195 December 2013 The ORIENTGATE data platform SCO Scientific Computing and Operations Division By Alessandra Nuzzo University of Salento and Scientific Computing and Operations Division, CMCC Sandro Fiore University of Salento and Scientific Computing and Operations Division, CMCC and Giovanni Aloisio University of Salento and Scientific Computing and Operations Division, CMCC SUMMARY The ORIENTGATE project fosters concerted and coordinated climate adaptation actions across the SEE region by exploring climate risks faced by coastal, rural and urban communities; contributing to a better understanding of the impact of climate variability and change on water regimes, forests and agro-ecosystems; and analysing specific adaptation needs in the hydroelectricity, agro-alimentary and tourism sectors. The principal project results include six pilot studies of specific climate adaptation exercises, a data platform connected to the EU Clearinghouse on Climate Adaptation, capacity enhancing seminars and workshops, working partnership among the hydro-meteorological offices of the SEE countries. In particular, this document provide an overview on the activities carried out on the ORIENTGATE data platform, designed to store and manage data produced by the project partners. Keywords: climate change adaptation, vulnerabilities and risks, impact indicators, data platform, metadata, climate datasets
2 CMCC Research Papers 02 INTRODUCTION The management of large volumes of data, a secure and efficient access to environmental datasets, the definition and adoption of metadata schemas represent some of the most relevant challenges that must be faced up by scientists and researchers. A key architectural building block of an operational environment is represented by the data access layer, which has to provide an efficient, robust and secure access to the data stored on the storage devices. The data platform aims to offer a single entry point to access data produced during the project, both regarding climate simulations and impact indicators. The planned activities for the data platform are: data platform design: in the context of this project, the data platform, will provide access to a large volume of heterogeneous data. NetCDF, GRIB, CSV, ASCII Grid Data, JPG, GIF will be some of the main formats that will be managed by this platform. The design of this platform will take into consideration users needs and requirements; setup of the virtual machine based environment: a virtual machine based environment to host climate change datasets will represent the proper infrastructure to integrate several software like OPeNDAP, THREDDS, RAMADDA, Integrated Data Viewer, as well as ad hoc software developed in this task/project. The data platform will run on a virtual machine-based environment with a large amount of RAM, fast disk storage and processing power to efficiently manage the ORIENTGATE datasets; data sub-setting functionality and data compression: data sub-setting functionalities related to NetCDF data will be also provided to allow partial download of huge climate datasets. Datasets compression will be also taken into account to reduce disk-space allocation as well as the time for data delivery; deployment of the main data services: OPeNDAP/THREDDS and RAMADDA represent two middleware tools bridging the gap between data providers and data users and that will play a key role in the data platform. While the goal of the former is to simplify the discovery and use of scientific data and to allow scientific publications and educational materials to reference scientific data, the latter represents a comprehensive content repository, publishing platform and collaboration environment for the scientific domains. Adopting well-known and de-facto standard access interfaces, will make easier the integration and sharing of the data repository hosted by the ORIENTGATE data platform with other projects and institutions at an international level; enhanced configurations for the data services: to meet the efficiency requirement, different architectural solutions for the OPENDAP/THREDDS software will be evaluated and tested against the standard configuration with a single server to evaluate performance and load balancing issues; design and implementation of a dashboard based monitoring and browsing tool for the data platform: the data platform will need operational tools to check the status of the data services, as well as advanced data browsing tools providing integrated views about the available OPeN- DAP/THREDDS services. A dashboard-
3 The ORIENTGATE data platform based system providing monitoring and multi-server browsing capabilities will be designed and developed to address both system management (through the service monitor) and user access needs (through the browsing system). From a monitoring point of view, the dashboard system will provide a data platform monitor with several charts and reports about the services availability and other relevant statistics concerning the data platform health. On the other hand, the browsing interface will allow to manage into the same context several THREDDS installations that will be deployed, configured and tuned during the project. Browsing capabilities across the THREDDS catalogues will be also provided. THE DATA PLATFORM ARCHITECTURE A key action in the ORIENTGATE project is the provision of a data platform which enables, at the project level, secure, efficient and transparent data sharing, access and management. The design of the data platform will take into account key requirements like: efficient, scalable and transparent access to large volumes of scientific data; metadata management. With a data centric approach, the same data is available through different access interfaces providing different features. The data platform is a collection of services, running on multiple virtual machines, that are: general purpose services, such as HTTP and FTP; domain oriented services, like OPeN- DAP/THREDDS, ESGF data node and GeoNetwork. Figure 1 shows the overall data platform architecture. The figure schematically shows the architecture of the data platform as consisting of a series of services, deployed on virtual machines. Schematically, HTTP, FTP, OPeNDAP and THREDDS provide support in terms of data access and download. GeoNetwork, on the other hand, allows filling out a metadata sheet for each dataset to enable data search and discovery as well as metadata browsing. Regarding the ESGF Data Node, it was not planned in the original proposal of the project, but we re evaluating the possibility to integrate it into the platform since it could be useful for managing climate data. The latest release of this software has been made available last week and we re testing the installation on a virtual machine of the project. DATA ARCHIVE A DRS (Data Reference Syntax) provides a common naming system to be used in files, directories and URLs to identify datasets wherever they might be located within a distributed archive. The DRS should provide a clear and structured set of conventions to facilitate the naming of data entities within the data archive and files delivered to users. The DRS should use controlled vocabularies to facilitate documentation and discovery. Providing users with data in files with well-structured names will facilitate management of the data on the users file systems and simplify communication among users and between users and user support. The controlled vocabularies will be useful in developing category-based data discovery services. In order to better organise the data on the data platform, specific directory structures have been defined both in terms of climate simulations and impact indicators. A hierarchical directory structure allows an easy management 03
4 CMCC Research Papers 04 and a simple organisation of the data, so the data regarding the output of simulations and impact indicators will be located according to the directory structure which starts from the root folder that is the project, orientgate and is split into two paths which will store the two different types of data. DATA ACCESS SERVICE The Data Access Service (DAS) provides several functionalities through the integration of different services. This service must provide the coexistence of various data services for data delivery, sub-setting and browsing. The role of the data is central, which implies that this component must necessarily be "data-centric"; the data has to be accessible from different points, although being stored in a single location on the data archive. The following services have been installed on Figure 1: ORIENTGATE Data Platform a virtual machine of the data platform: OPeNDAP; THREDDS; HTTP; FTP. A brief overview of these services will be presented in the following. OPENDAP OPeNDAP (Figure 2) [1] is a framework that aims to simplify the sharing of scientific data on the web by making local data accessible to remote connections. It is a freely available software and allows the transformation of existing applications into OPeNDAP clients, enabling them to remote access. The advantage is that the user may not be aware of the different
5 The ORIENTGATE data platform format in which different dataset are stored, but can equally access to the information. The uniformity with which the data are presented make the system very useful for the analysis, transfer and automatic manipulation of data. It is possible to choose the way to get the information by selecting: Get ASCII, which displays the required data in the browser window; Binary (DAP) Object, which locally stores the data in binary format; Get as NetCDF, which locally saves the file in NetCDF format. THREDDS THREDDS Data Server (TDS) [2] is a web server designed to provide access to scientific data and metadata using different remote access protocols. THREDDS uses Dataset Inventory Catalog in order to provide virtual directories of available data and metadata associated with them. Each catalog describes the datasets and services through which access to the data. The services offered by THREDDS provide access to the full dataset, carry out subsetting, graphically display the information, analyze the associated metadata and regenerate them according to specific standards. The TDS provides the user a series of services that represent different modes of access to data such as: OGC Web Coverage Service (WCS) that supports the retrieval of geospatial data representing phenomena with spatial and temporal variations; OpenGIS Web Map Service Interface Standard (WMS) that provides a simple HTTP interface for the request of maps from one or more distributed geospatial databases; FTP OPeNDAP that provides access to portions of the dataset by extending the functionality of the HTTP protocol; HTTP Server that allows user to access the entire contents of the file. File Transfer Protocol (FTP) is an application layer protocol based on the TCP protocol for data transmission between hosts. The main objectives of FTP are: promote sharing of files (programs or data); encourage the use of indirect or implicit remote computers; solve in a transparent manner any incompatibilities between different file storage systems; transfer data in a reliably and efficiently way. A FTP server provides several features that allow the client to interact with the remote file system: download / upload files; recovery of interrupted file transfers; removal of files; change of the file name; creation of directories; navigation between directories. 05
6 CMCC Research Papers 06 HTTP Figure 2: OPeNDAP Dataset Access Form authentication based on digital certificates. HyperText Transfer Protocol (HTTP) is the primary system for the transmission of information in Internet. The HTTP protocol is based on TCP protocol and adopts the client-server architecture. The client sends a request message to the server for a certain resource (such as HTML files or web pages objects) or for the execution of an operation and the server returns a response message which containing information specifying whether the request was fulfilled or possible error codes. It may also include the requested contents. HTTP provides a simple authentication service based on sending the login credentials (username and password). A more advanced security service is offered by HTTPS protocol, which integrates the HTTP protocol with an encryption system like Transport Layer Security (SSL/TLS). The service includes encryption of messages exchanged between client and server and server METADATA SERVICE Metadata are information about the data and describe the content, the quality and features of the data like origin, scope, point of contact, credits, etc. Metadata helps to search and discovery data, to better organise and use data and finally enables interoperability. The metadata service is the heart of the information system for the data stored on the data archive of the project. It manages the metadata that are used to carry out search and discovery operations on the data archive. Of course, the adoption of standards will allow the interoperability and the possibility to interact with other projects and share and compare results. To manage metadata in this project we were inspired by the international standard ISO
7 The ORIENTGATE data platform ISO This standard defines the schema required for describing geographic information and services. It provides information about the identification, the extent, the quality, the spatial and temporal schema, spatial reference, and distribution of digital geographic data. ISO standard [3] is part of the standards produced by ISO TC 211 Geographic Information/Geomatics and defines a common set of metadata describing information relating to the identification, the extent, the quality, the temporal and spatial pattern and the distribution of geographic data. This international standard was then developed in order to classify and describe geographic datasets, facilitate the creation and management of metadata related to geospatial data, facilitate discovery, access, evaluation and use of geographic data. An important characteristic is the possibility to apply this standard to a single dataset, to aggregations of single datasets at to single geographic features. ISO is an abstract standard because it does not define "how to" fill in metadata, but "what" a metadata should contain. In particular, the standard establishes a common core metadata that is a minimum set of elements required to describe the "geographic dataset"; this minimum set includes mandatory elements (Mandatory, M), conditional elements (mandatory under certain conditions - Conditional; C ) and optional elements (optional, O), as shown in the table in Figure 3. The minimum set of metadata allows the search, access, transfer and exchange of geographic data; in addition to this, the international standard also provides some extensions, in order to allow a more extensive description of geographic data and to satisfy more specialist needs. The main features that differentiate ISO from other metadata standard used for geographic data are as follows: adoption of UML as a standard language for modelling; adoption of the XML standard format for creation and management of metadata; ability to create profiles; definition of a minimum common set (core metadata); multi-level metadata; ability to integrate with other ISO standards. The standard ISO organises the list of available metadata into several "UML packages." Each package contains one or more entities (or UML classes) that can be divided into subclasses or aggregated in superclasses. Entities contain elements identifying the unit of metadata; each entity may have different logical relationships with other entities. The following Figure 3 shows the organisation of the packages. ISO ISO is a technical specification that defines the implementation schema of ISO according the "extensible Markup Language" (XML) format and can be used to describe, validate and exchange geospatial metadata. The standard ISO 19115, in fact, did not provide the specific information to generate metadata "XML compliant", that were standard not only in content but also in its structure. The implementation standard of ISO has, therefore, developed schemas in XML format in order to improve interoperability, by providing a common specification for describing, validating and exchanging metadata about geographic datasets, individual dataset, individual geographic features, attributes, types of characteristic function, etc.. 07
8 CMCC Research Papers 08 Figure 3: Minimum set of metadata for geographic datasets
9 The ORIENTGATE data platform 09 Figure 4: Structure of metadata packages
10 CMCC Research Papers 10 GEONETWORK GeoNetwork opensource [4] is a standard based and decentralised spatial information management system, designed to enable access to geo-referenced databases and cartographic products from a variety of data providers through descriptive metadata, enhancing the spatial information exchange and sharing between organisations and their audience, using the capacities and the power of the Internet. The system provides a broad community of users with easy and timely access to available spatial data and thematic maps from multidisciplinary sources, that may in the end support informed decision making. The main goal of the software is to increase collaboration within and between organisations for reducing duplication and enhancing information consistency and quality and to improve the accessibility of a wide variety of geographic information along with the associated information, organised and documented in a standard and consistent way. GeoNetwork s main features are: instant search on local and distributed geospatial catalogues; uploading and downloading of data, documents, PDF s and any other content; an interactive Web map viewer that combines Web Map Services from distributed servers around the world; online map layout generation and export in PDF format; online editing of metadata with a powerful template system; scheduled harvesting and synchronisation of metadata between distributed catalogues; groups and users management; fine grained access control. The metadata profile is organised in sections and the most important, illustrated in Figure 4, are: Identification Section, Distribution Section, Reference System Section, Data Quality Section and Metadata Section. MAIN FILE FORMATS The back-end platform of the project will manage data in different formats. Below a brief overview of the main used formats. NETCDF FORMAT NetCDF format (Network Common Data Form) [5] is a set of software libraries and portable data format supporting creation, access, and sharing of array-oriented scientific data. It was developed and is still a project by UNIDATA, commonly used in climatological, meteorological, oceanographic and GIS applications. In NetCDF format, the dataset is stored in a binary file consists of two main parts: header, containing the information about dimensions, attributes and variables; data, which includes "fixed-size data", containing the data for variables that have limited size, and record-data, containing the data of the variables that have unlimited size, like the weather; The notation used to describe an object in NetCDF format is called CDL (Common Data Language) and it is a simple tool to define the contents of a NetCDF dataset. Figure 6 shown an example of NetCDF file format.
11 The ORIENTGATE data platform 11 ESRI SHAPEFILE FORMAT ESRI shapefile [6], or simply a shapefile, is a popular geospatial vector data format for geographic information system. It is developed and regulated by ESRI as an open specification for data interoperability among ESRI and other GIS software products. Shape files spatially describe vector features: points, lines and polygons, representing, for example, water wells, rivers and lakes. Each item usually has attributes that describe it, such as name or temperature. ESRI is an international supplier of Geographic Information System (GIS) software and geo-database management applications. A shape file stores non-topological geometry and attribute information for the spatial features in a data set. The geometry for a feature is stored as a shape comprising a set of vector coordinates. Since shape files do not have the processing overhead of a topological data structure, they have advantages over other Figure 5: Main metadata section data sources such as faster drawing speed and edit ability. Shape files handle single features that overlap or that are non-contiguous. They also typically require less disk space and are easier to read and write. Shape files are simple because they store the primitive geometric data types of points, lines, and polygons. Therefore, a table of records will store properties/attributes for each primitive shape in the shape file. Shapes (points/lines/polygons) together with data attributes can create infinitely many representations about geographic data. Representation provides the ability for powerful and accurate computations. While the term "shape file" is quite common, a "shape file" is actually a set of several files. Three individual files are mandatory to store the core data that comprises a shape file:.shp,.shx and.dbf. The shape file relates specifically to.shp files but alone is incomplete for distribution, as the other supporting files are required. There are further optional files which store primarily index data to
12 CMCC Research Papers 12 Figure 6: Example of NetCDF file
13 The ORIENTGATE data platform improve performance, locate under the same folder. Mandatory files are:.shp - shape format: geometry; stores the feature.shx - shape index format: a positional index of the feature geometry;.dbf - attribute format: columnar attributes for each shape. Optional files are:.prj - projection format: the coordinate system and projection information;.sbn and.sbx - projection format: coordinate system and projection information;.fbn and.fbx - a spatial index of the features for shapefiles that are read-only;.ain and.aih - an attribute index of the active fields in a table;.ixs - geocoding index for read-write shapefiles;.mxs - a geocoding index for read-write shapefiles (ODB format);.shp.xml - geospatial metadata in xml format;.atx - attribute index for the.dbf file in the form of shapefile.columnname.atx;.cpg - used to identify the code page for identifing the character encoding to be used. GRID RASTER FORMAT An ESRI grid s a raster GIS file format developed by ESRI, which defines geographic space as an array of equally sized square. grid points arranged in rows and columns. Each grid point stores a numeric value that represents a geographic attribute (such as elevation or surface slope) for that unit of space. Each grid cell is referenced by its x,y coordinate location. An ESRI grid has two formats: a proprietary binary format, also known as an ARC/INFO GRID, ARC GRID and many other variations; a non-proprietary ASCII format, also known as an ARC/INFO ASCII GRID. The formats were introduced for ARC/INFO. The binary format is widely used within ESRI programs, such as ArcGIS, while the ASCII format is used as an exchange, or export format, due to the simple and portable ASCII file structure. In particular, a binary ESRI grid is stored in several files contained in at least two directories: the name directory and an info directory, where name has strict naming conventions. The name subdirectory is composed of different types of files containing information on the geographical location and attributes of the relative grid: dblbnd.adf: - contains the geographical boundaries of the raster grid; hdr.adf - contains information on the resolution of the grid, the compression, the number and size of cells; sta.adf - contains the statistical values of the raster; vat.adf - contains the attributes associated with the areas of the grid; 13
14 CMCC Research Papers 14 w adf - contains the values of the cells; w001001x.adf - index file containing pointers to each tile of the raster file w adf. The info subdirectory, instead, contains other information on the grid: arcnnnn.dir (where N is any integer): list of records; arcnnnn.dat (where N is any integer): tabular data; arcnnnn.nit (where N is any integer): fields definition. CONCLUSIONS This report described the ORIENTGATE data platform. It is composed of a set of services through which data produced during the different work packages will be stored, managed and published providing multiple, specific functionalities. Moreover, this report provides an overview of the data platform architecture, describing the main services and data formats as well as the metadata part responsible for enabling the data search and discovery.
15 The ORIENTGATE data platform Bibliography [1] OPeNDAP - [2] THREDDS - /thredds/current/tds/ [3] detail.htm?csnumber=26020/ [4] [5] NetCDF - [6] shapefile.pdf 15 c 2014 Visit for information on our activities and publications. The Euro-Mediteranean Centre on Climate Change is a Ltd Company with its registered office and administration in Lecce and local units in Bologna, Venice, Capua, Sassari, Viterbo, Benevento and Milan. The society doesn t pursue profitable ends and aims to realize and manage the Centre, its promotion, and research coordination and different scientific and applied activities in the field of climate change study.
The ORIENTGATE data platform
Seminar on Proposed and Revised set of indicators June 4-5, 2014 - Belgrade (Serbia) The ORIENTGATE data platform WP2, Action 2.4 Alessandra Nuzzo, Sandro Fiore, Giovanni Aloisio Scientific Computing and
Structure? Integrated Climate Data Center How to use the ICDC? Tools? Data Formats? User
Integrated Climate Data Center? Data Formats?? Tools???? visits Structure???? User Contents Which Data Formats do we offer? What is the Structure of our data center? Which Tools do we provide? Our Aims
INTRODUCTION TO ARCGIS SOFTWARE
INTRODUCTION TO ARCGIS SOFTWARE I. History of Software Development a. Developer ESRI - Environmental Systems Research Institute, Inc., in 1969 as a privately held consulting firm that specialized in landuse
ES341 Overview of key file formats and file extensions in ArcGIS
ES341 Overview of key file formats and file extensions in ArcGIS Commonly Encountered File Types/Extensions in ArcGIS.mxd A file containing a map, its layers, display information, and other elements used
PART 1. Representations of atmospheric phenomena
PART 1 Representations of atmospheric phenomena Atmospheric data meet all of the criteria for big data : they are large (high volume), generated or captured frequently (high velocity), and represent a
J9.6 GIS TOOLS FOR VISUALIZATION AND ANALYSIS OF NEXRAD RADAR (WSR-88D) ARCHIVED DATA AT THE NATIONAL CLIMATIC DATA CENTER
J9.6 GIS TOOLS FOR VISUALIZATION AND ANALYSIS OF RADAR (WSR-88D) ARCHIVED DATA AT THE NATIONAL CLIMATIC DATA CENTER Steve Ansari * STG Incorporated, Asheville, North Carolina Stephen Del Greco NOAA National
The THREDDS Data Repository: for Long Term Data Storage and Access
8B.7 The THREDDS Data Repository: for Long Term Data Storage and Access Anne Wilson, Thomas Baltzer, John Caron Unidata Program Center, UCAR, Boulder, CO 1 INTRODUCTION In order to better manage ever increasing
GIS Databases With focused on ArcSDE
Linköpings universitet / IDA / Div. for human-centered systems GIS Databases With focused on ArcSDE Imad Abugessaisa [email protected] 20071004 1 GIS and SDBMS Geographical data is spatial data whose
How To Use The Alabama Data Portal
113 The Alabama Metadata Portal: http://portal.gsa.state.al.us By Philip T. Patterson Geological Survey of Alabama 420 Hackberry Lane P.O. Box 869999 Tuscaloosa, AL 35468-6999 Telephone: (205) 247-3611
GeoNetwork User Manual
GeoNetwork User Manual Release 2.10.4-0 GeoNetwork July 06, 2015 Contents 1 Preface 3 1.1 About this Project..................................... 3 1.2 License Information....................................
13.2 THE INTEGRATED DATA VIEWER A WEB-ENABLED APPLICATION FOR SCIENTIFIC ANALYSIS AND VISUALIZATION
13.2 THE INTEGRATED DATA VIEWER A WEB-ENABLED APPLICATION FOR SCIENTIFIC ANALYSIS AND VISUALIZATION Don Murray*, Jeff McWhirter, Stuart Wier, Steve Emmerson Unidata Program Center, Boulder, Colorado 1.
INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER
INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER Qian Liu Environmental Systems Research Institute 380 New York Street Redlands, CA92373, U.S.A - [email protected] KEY WORDS: OGC, Standard, Interoperability,
CDI/THREDDS Interoperability: the SeaDataNet developments. P. Mazzetti 1,2, S. Nativi 1,2, 1. CNR-IMAA; 2. PIN-UNIFI
CDI/THREDDS Interoperability: the SeaDataNet developments P. Mazzetti 1,2, S. Nativi 1,2, 1. CNR-IMAA; 2. PIN-UNIFI Outline Interoperability Issues in SeaDataNet A broker solution for CDI/THREDDS interoperability
A GIS helps you answer questions and solve problems by looking at your data in a way that is quickly understood and easily shared.
A Geographic Information System (GIS) integrates hardware, software, and data for capturing, managing, analyzing, and displaying all forms of geographically referenced information. GIS allows us to view,
ArcGIS Data Models Practical Templates for Implementing GIS Projects
ArcGIS Data Models Practical Templates for Implementing GIS Projects GIS Database Design According to C.J. Date (1995), database design deals with the logical representation of data in a database. The
GIS AS A DECISION SUPPORT FOR SUPPLY CHAIN MANAGEMENT
Paper Reference No.: PN-253 GIS AS A DECISION SUPPORT FOR SUPPLY CHAIN MANAGEMENT Sanjay Kumar 1 and Suneeta Agrawal 2 1. M. Tech. (GIS & Remote Sensing); GIS Cell; MNNIT, Allahabad, India (E-mail: [email protected])
Publishing geospatial data to the web using the EEA infrastructure
Publishing geospatial data to the web using the EEA infrastructure *Note: Map Services should be published using the EEA Map Services Tick Sheet for guidance. Contact Sebastien Petit ([email protected])
Open Source Visualisation with ADAGUC Web Map Services
Open Source Visualisation with ADAGUC Web Map Services Maarten Plieger Ernst de Vreede John van de Vegte, Wim Som de Cerff, Raymond Sluiter, Ian van der Neut, Jan Willem Noteboom 1 ADAGUC project Cooperative
Sextant. Spatial Data Infrastructure for Marine Environment. C. Satra Le Bris, E. Quimbert, M. Treguer
Sextant On-Line information system for marine geographical information E. Quimbert, M. Bellouis, F. Lecuy, M. Treguer Centre de Bretagne BP 70, Plouzané 29280 France E-mail: [email protected] Sextant
Scientific Data Management and Dissemination
Federal GIS Conference February 9 10, 2015 Washington, DC Scientific Data Management and Dissemination John Fry Solution Engineer, Esri [email protected] Agenda Background of Scientific Data Management through
The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team
The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team Photo courtesy Andrew Mahoney NSF Vision What is AON? a
GIS Data in ArcGIS. Pay Attention to Data!!!
GIS Data in ArcGIS Pay Attention to Data!!! 1 GIS Data Models Vector Points, lines, polygons, multi-part, multi-patch Composite & secondary features Regions, dynamic segmentation (routes) Raster Grids,
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and
A Java Tool for Creating ISO/FGDC Geographic Metadata
F.J. Zarazaga-Soria, J. Lacasta, J. Nogueras-Iso, M. Pilar Torres, P.R. Muro-Medrano17 A Java Tool for Creating ISO/FGDC Geographic Metadata F. Javier Zarazaga-Soria, Javier Lacasta, Javier Nogueras-Iso,
Guidelines on Information Deliverables for Research Projects in Grand Canyon National Park
INTRODUCTION Science is playing an increasing role in guiding National Park Service (NPS) management activities. The NPS is charged with protecting and maintaining data and associated information that
MyOcean Copernicus Marine Service Architecture and data access Experience
MyOcean Copernicus Marine Service Architecture and data access Experience Sophie Besnard CLS, Toulouse, France February 2015 MyOcean Story MyOcean Challenge & Success MyOcean Service MyOcean System MyOcean
GIS Architecture and Data Management Practices Boone County GIS Created and Maintained by the Boone County Planning Commission GIS Services Division
GIS Architecture and Data Management Practices Boone County GIS Created and Maintained by the Boone County Planning Commission June 2008 Background Boone County s GIS program has been in existence for
A Web services solution for Work Management Operations. Venu Kanaparthy Dr. Charles O Hara, Ph. D. Abstract
A Web services solution for Work Management Operations Venu Kanaparthy Dr. Charles O Hara, Ph. D Abstract The GeoResources Institute at Mississippi State University is leveraging Spatial Technologies and
NetCDF and HDF Data in ArcGIS
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop NetCDF and HDF Data in ArcGIS Nawajish Noman Kevin Butler Esri UC2013. Technical Workshop. Outline NetCDF
Introduction to GIS (Basics, Data, Analysis) & Case Studies. 13 th May 2004. Content. What is GIS?
Introduction to GIS (Basics, Data, Analysis) & Case Studies 13 th May 2004 Content Introduction to GIS Data concepts Data input Analysis Applications selected examples What is GIS? Geographic Information
GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington
GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise
Interoperable Solutions in Web-based Mapping
ISPRS SIPT IGU UCI CIG ACSG Table of contents Table des matières Authors index Index des auteurs Search Recherches Exit Sortir Interoperable Solutions in Web-based Mapping Marta Wojnarowska and Bridget
Norwegian Satellite Earth Observation Database for Marine and Polar Research http://normap.nersc.no USE CASES
Norwegian Satellite Earth Observation Database for Marine and Polar Research http://normap.nersc.no USE CASES The NORMAP Project team has prepared this document to present functionality of the NORMAP portal.
Description and Testing of the Geo Data Portal: A Data Integration Framework and Web Processing Services for Environmental Science Collaboration
Description and Testing of the Geo Data Portal: A Data Integration Framework and Web Processing Services for Environmental Science Collaboration Open-File Report 2011 1157 U.S. Department of the Interior
UK Location Programme
Location Information Interoperability Board Data Publisher How To Guide Understand the background to establishing an INSPIRE View Service using GeoServer DOCUMENT CONTROL Change Summary Version Date Author/Editor
Data access and management
B Data access and management CONTENTS B.1 Introduction... B-1 B.2 Data requirements and availability... B-1 B.3 Data access... B-2 B.4 Overall procedures... B-2 B.5 Data tools and management... B-4 Appendix
GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)
GIS Initiative: Developing an atmospheric data model for GIS Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP) Unidata seminar August 30, 2004 Presentation Outline Overview
Nevada NSF EPSCoR Track 1 Data Management Plan
Nevada NSF EPSCoR Track 1 Data Management Plan August 1, 2011 INTRODUCTION Our data management plan is driven by the overall project goals and aims to ensure that the following are achieved: Assure that
Portal Version 1 - User Manual
Portal Version 1 - User Manual V1.0 March 2016 Portal Version 1 User Manual V1.0 07. March 2016 Table of Contents 1 Introduction... 4 1.1 Purpose of the Document... 4 1.2 Reference Documents... 4 1.3 Terminology...
SRS BIO OPTICAL WORKFLOW
SRS BIO OPTICAL WORKFLOW Version 2.0 22 nd March 2013 Data Workflows emii, the data management facility for IMOS, has developed workflows for each IMOS sub facility to describe the flow of IMOS data from
Catalogue or Register? A Comparison of Standards for Managing Geospatial Metadata
Catalogue or Register? A Comparison of Standards for Managing Geospatial Metadata Gerhard JOOS and Lydia GIETLER Abstract Publication of information items of any kind for discovery purposes is getting
Quick Start Guide to. ArcGISSM. Online
Quick Start Guide to ArcGISSM Online ArcGIS Online Quick Start Guide ArcGIS SM Online is a cloud-based mapping platform for organizations. Users get access to dynamic, authoritative content to create,
Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel
Big Data and Analytics: Getting Started with ArcGIS Mike Park Erik Hoel Agenda Overview of big data Distributed computation User experience Data management Big data What is it? Big Data is a loosely defined
TIBCO Spotfire Automation Services 6.5. User s Manual
TIBCO Spotfire Automation Services 6.5 User s Manual Revision date: 17 April 2014 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO
Managing a Geographic Database From Mobile Devices Through OGC Web Services
Managing a Geographic Database From Mobile Devices Through OGC Web Services Nieves R. Brisaboa 1, Miguel R. Luaces 1, Jose R. Parama 1, and Jose R. Viqueira 2 1 Database Laboratory, University of A Coruña,
Lecture 8. Online GIS
Lecture 8 Online GIS Lecture 8: Outline I. Online GIS 1. Google Earth 2. MSN Live Maps II. Open Source GIS III. ArcGIS Server and the ESRI suite of online software utility options IV. Advanced Data Mining
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata Ing. Emanuele Tajariol, GeoSolutions Ing. Simone Giannecchini, GeoSolutions GeoSolutions GeoSolutions GeoNetwork
RS MDM. Integration Guide. Riversand
RS MDM 2009 Integration Guide This document provides the details about RS MDMCenter integration module and provides details about the overall architecture and principles of integration with the system.
DATA SHARING AND SPATIAL QUERY
Technical Memorandum No. 2 DATA SHARING AND SPATIAL QUERY Raghavan Srinivasan Spatial Sciences Laboratory Texas Agricultural Experiment Station, Texas A&M University Submitted to El Paso Water Utilities
ISO 19119 and OGC Service Architecture
George PERCIVALL, USA Keywords: Geographic Information, Standards, Architecture, Services. ABSTRACT ISO 19119, "Geographic Information - Services," has been developed jointly with the Services Architecture
Bentley ArcGIS. Connector
Bentley ArcGIS Connector Introduction ESRI, as a GIS software products company, and Bentley Systems, Incorporated, as a developer of solutions for architecture/engineering/construction (AEC) professionals,
smespire - Exercises for the Hands-on Training on INSPIRE Network Services April 2014 Jacxsens Paul SADL KU Leuven
smespire - Exercises for the Hands-on Training on INSPIRE Network Services April 2014 Jacxsens Paul SADL KU Leuven These exercises aim at people who already have some basic knowledge of INSPIRE Network
REACCH PNA Data Management Plan
REACCH PNA Data Management Plan Regional Approaches to Climate Change (REACCH) For Pacific Northwest Agriculture 875 Perimeter Drive MS 2339 Moscow, ID 83844-2339 http://www.reacchpna.org [email protected]
HELCOM Data and Map Service. User Manual
HELCOM Data and Map Service User Manual Version 2.2 - February 2015 1 Table of contents 1. General Information... 3 1.1 Background... 3 1.2 Technical requirements... 3 1.3 Contact... 3 2. Accessing HELCOM
Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps
Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps Presented by: Mike Sharp State Geographic Information Coordinator Oklahoma Office of Geographic Information MAGIC 2014 Symposium April 28-May1,
Managing Large Imagery Databases via the Web
'Photogrammetric Week 01' D. Fritsch & R. Spiller, Eds. Wichmann Verlag, Heidelberg 2001. Meyer 309 Managing Large Imagery Databases via the Web UWE MEYER, Dortmund ABSTRACT The terramapserver system is
Choosing the right GIS framework for an informed Enterprise Web GIS Solution
13 ANNUAL INTERNATIONAL CONFERENCE AND EXHIBITION ON GEOSPATIAL INFORMATION TECHNOLOGY AND APPLICATIONS Epicentre; Gurgaon, India; 19-21 January, 2010 Choosing the right GIS framework for an informed Enterprise
Documentation of open source GIS/RS software projects
Contract no. Workpackage Delivery Delivery Date 030776 WP1 D1.6 2007-07-02 CASCADOSS Development of a trans-national cascade training programme on Open Source GIS&RS Software for environmental applications
CatMDEdit Metadata editor
Advanced Information Systems Laboratory GeoSpatiumLab S.L. CatMDEdit Metadata editor University of Zaragoza Computer Science and Systems Engineering Department Advanced Information Systems Laboratory (IA3)
GEOGRAPHIC INFORMATION SYSTEMS CERTIFICATION
GEOGRAPHIC INFORMATION SYSTEMS CERTIFICATION GIS Syllabus - Version 1.2 January 2007 Copyright AICA-CEPIS 2009 1 Version 1 January 2007 GIS Certification Programme 1. Target The GIS certification is aimed
An Esri White Paper June 2011 ArcGIS for INSPIRE
An Esri White Paper June 2011 ArcGIS for INSPIRE Esri, 380 New York St., Redlands, CA 92373-8100 USA TEL 909-793-2853 FAX 909-793-5953 E-MAIL [email protected] WEB esri.com Copyright 2011 Esri All rights reserved.
MrSID Plug-in for 3D Analyst
LizardTech MrSID Plug-in for 3D Analyst User Manual Copyrights Copyright 2009 2010 LizardTech. All rights reserved. Information in this document is subject to change without notice. The software described
Data-Intensive Science and Scientific Data Infrastructure
Data-Intensive Science and Scientific Data Infrastructure Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 13 April 2011 Overview Data-intensive science Publishing scientific
User's Guide. Product Version: 2.5.0 Publication Date: 7/25/2011
User's Guide Product Version: 2.5.0 Publication Date: 7/25/2011 Copyright 2009-2011, LINOMA SOFTWARE LINOMA SOFTWARE is a division of LINOMA GROUP, Inc. Contents GoAnywhere Services Welcome 6 Getting Started
Reading Questions. Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented?
Reading Questions Week two Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented? 2. What sort of problems are GIS designed to solve?
Final Report - HydrometDB Belize s Climatic Database Management System. Executive Summary
Executive Summary Belize s HydrometDB is a Climatic Database Management System (CDMS) that allows easy integration of multiple sources of automatic and manual stations, data quality control procedures,
DATA QUALITY AND SCALE IN CONTEXT OF EUROPEAN SPATIAL DATA HARMONISATION
DATA QUALITY AND SCALE IN CONTEXT OF EUROPEAN SPATIAL DATA HARMONISATION Katalin Tóth, Vanda Nunes de Lima European Commission Joint Research Centre, Ispra, Italy ABSTRACT The proposal for the INSPIRE
CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES
CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES M. Laka-Iñurrategi a, I. Alberdi a, K. Alonso b, M. Quartulli a a Vicomteh-IK4, Mikeletegi pasealekua 57,
Software. PowerExplorer. Information Management and Platform DATA SHEET
DATA SHEET PowerExplorer Software Information Management and Platform KEY FEATURES Web-enabled Advanced, ad-hoc query capabilities Spatial E&P data presentation ZGF file import/export Spatializer Tabular
Introduction to GIS. http://libguides.mit.edu/gis
Introduction to GIS http://libguides.mit.edu/gis 1 Overview What is GIS? Types of Data and Projections What can I do with GIS? Data Sources and Formats Software Data Management Tips 2 What is GIS? 3 Characteristics
Portal for ArcGIS. Satish Sankaran Robert Kircher
Portal for ArcGIS Satish Sankaran Robert Kircher ArcGIS A Complete GIS Data Management Planning & Analysis Field Mobility Operational Awareness Constituent Engagement End to End Integration Collect, Organize,
INTRODUCTION to ESRI ARCGIS For Visualization, CPSC 178
INTRODUCTION to ESRI ARCGIS For Visualization, CPSC 178 1) Navigate to the C:/temp folder 2) Make a directory using your initials. 3) Use your web browser to navigate to www.library.yale.edu/mapcoll/ and
SeaDataNet pan-european infrastructure for ocean and marine data management. Dick M.A. Schaap MARIS
SeaDataNet pan-european infrastructure for ocean and marine data management By Dick M.A. Schaap MARIS SeaDataNet II Technical Coordinator Co-authors: Michele Fichaut (Ifremer) and Giuseppe Manzella (ENEA)
Data interchange between Web client based task controllers and management information systems using ISO and OGC standards
Data interchange between Web client based task controllers and management information systems using ISO and OGC standards Michael Nørremark 1*, Ole Jørgensen 1, Jens Bligaard 2 and Claus G. Sørensen 1
Jozef Matula. Visualisation Team Leader IBL Software Engineering. 13 th ECMWF MetOps Workshop, 31 th Oct - 4 th Nov 2011, Reading, United Kingdom
Visual Weather web services Jozef Matula Visualisation Team Leader IBL Software Engineering Outline Visual Weather in a nutshell. Path from Visual Weather (as meteorological workstation) to Web Server
INSPIRE Dashboard. Technical scenario
INSPIRE Dashboard Technical scenario Technical scenarios #1 : GeoNetwork catalogue (include CSW harvester) + custom dashboard #2 : SOLR + Banana dashboard + CSW harvester #3 : EU GeoPortal +? #4 :? + EEA
Integrated Data Viewer (IDV) a visualization framework Yuan Ho Unidata Program Center Boulder, CO Presentation Outline Integrated Data Viewer (IDV) overview The IDV features IDV examples and customized
Cloud application for water resources modeling. Faculty of Computer Science, University Goce Delcev Shtip, Republic of Macedonia
Cloud application for water resources modeling Assist. Prof. Dr. Blagoj Delipetrev 1, Assist. Prof. Dr. Marjan Delipetrev 2 1 Faculty of Computer Science, University Goce Delcev Shtip, Republic of Macedonia
DATABASE MANAGEMENT FILES GIS06
DATABASE MANAGEMENT Last day we looked at spatial data structures for both vector and raster data models. When working with large amounts of data, it is important to have good procedures for managing the
EXPLORING AND SHARING GEOSPATIAL INFORMATION THROUGH MYGDI EXPLORER
EXPLORING AND SHARING GEOSPATIAL INFORMATION THROUGH MYGDI EXPLORER Subashini Panchanathan Malaysian Centre For Geospatial Data Infrastructure ( MaCGDI ) Ministry of National Resources and Environment
How To Write An Nccwsc/Csc Data Management Plan
Guidance and Requirements for NCCWSC/CSC Plans (Required for NCCWSC and CSC Proposals and Funded Projects) Prepared by the CSC/NCCWSC Working Group Emily Fort, Data and IT Manager for the National Climate
Steve Ansari *, Stephen Del Greco, Brian Nelson, and Helen Frederick NOAA National Climatic Data Center, Asheville, North Carolina 2.
11.4 THE SEVERE WEATHER DATA INVENTORY (SWDI): SPATIAL QUERY TOOLS, WEB SERVICES AND DATA PORTALS AT NOAA S NATIONAL CLIMATIC DATA CENTER (NCDC) Steve Ansari *, Stephen Del Greco, Brian Nelson, and Helen
Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan
Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:
RingStor User Manual. Version 2.1 Last Update on September 17th, 2015. RingStor, Inc. 197 Route 18 South, Ste 3000 East Brunswick, NJ 08816.
RingStor User Manual Version 2.1 Last Update on September 17th, 2015 RingStor, Inc. 197 Route 18 South, Ste 3000 East Brunswick, NJ 08816 Page 1 Table of Contents 1 Overview... 5 1.1 RingStor Data Protection...
The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA
The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA Dr. Conor Delaney 9 April 2014 GeoMaritime, London
FileMaker Server 9. Custom Web Publishing with PHP
FileMaker Server 9 Custom Web Publishing with PHP 2007 FileMaker, Inc. All Rights Reserved. FileMaker, Inc. 5201 Patrick Henry Drive Santa Clara, California 95054 FileMaker is a trademark of FileMaker,
VISUAL INSPECTION OF EO DATA AND PRODUCTS - OVERVIEW
WMS services from the EUMETSAT real-time Image Gallery Uwe Voges (1), Michael Schick (2), Udo Einspanier (1) (1) con terra GmbH Martin-Luther-King-Weg 24, 48155, Münster, Germany EMail: (U.Voges U.Einspanier)@conterra.de
Soil Data Viewer 5.1 User Guide
Soil Data Viewer 5.1 User Guide Soil Survey Division October 2006 INTRODUCTION... 5 WHAT IS SOIL DATA VIEWER... 5 WHAT SOFTWARE AND CAPABILITIES DO I NEED TO USE SOIL DATA VIEWER 5.1?... 5 WHAT DATA DO
What's new in gvsig Desktop 2.0
What's new in gvsig Desktop 2.0 What are the novelties? 2.0 1.12 Migrating and building... Some examples... Please pardon our appearance during construction Pie and bar chart legends Table in layout 1.12
Activity: Using ArcGIS Explorer
Activity: Using ArcGIS Explorer Requirements You must have ArcGIS Explorer for this activity. Preparation: Download ArcGIS Explorer. The link below will bring you to the ESRI ArcGIS Explorer download page.
(Geo)database and Data Management
(Geo)database and Data Management An introduction to the possibilities of a centralized (spatial) data storing and access as well as its data management constraints (GLOWA Volta Project) GLOWA Volta Project
