Creating Geospatial Metadata Kim Durante Geo4Lib Camp
Overview
  - Metadata requirements for geospatial data
  - Workflows for creating layer-level ISO metadata
  - Auto-generating metadata using templates, XSL, Python, and data extraction
  - Transforming and exporting metadata
Collection-level Metadata Workflow Identify data collection Layers Create metadata Assign rights policy register Repository
Check Data Properties
  - Filenames
  - Spatial Reference
  - Objects
  - Attributes
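The filename and spatial reference checks can be partly scripted. The sketch below is one possible approach, not the workflow used in this deck: it assumes each layer is a shapefile and reports which of the usual sidecar files (.shx, .dbf, and the .prj that carries the spatial reference) are missing. The function name and the optional `existing_files` argument are hypothetical, added only so the check can be tried without touching the disk.

```python
import os

# Files normally expected alongside a shapefile; .prj holds the spatial reference.
EXPECTED_EXTENSIONS = (".shp", ".shx", ".dbf", ".prj")


def missing_components(shp_path, existing_files=None):
    """Return the expected extensions that are absent for one shapefile.

    existing_files is an optional set of paths (hypothetical helper argument);
    when it is omitted, the file system is checked instead.
    """
    base, _ = os.path.splitext(shp_path)
    missing = []
    for ext in EXPECTED_EXTENSIONS:
        candidate = base + ext
        if existing_files is not None:
            found = candidate in existing_files
        else:
            found = os.path.exists(candidate)
        if not found:
            missing.append(ext)
    return missing
```

A layer that returns ".prj" here has no projection file, so its spatial reference would need to be confirmed by other means before metadata is written.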
Layer-level Metadata Workflow Check data properties layer Items with no existing metadata Template XSLT Items with existing metadata localized Assign collection and rights policy register auto-generate deposit Submit for accessioning
http://purl.stanford.edu/vh286rq6087.mods
Required and Recommended Elements Title Originator Distributor Publisher Date Point of Contact Distribution Format Scale Resolution Abstract Identifier Distribution Protocol Lineage Language Theme keyword Distribution Name Status Metadata Contact Metadata DateStamp ISO Topic Category Place keyword Geographic Extent Temporal Extent Aggregate Dataset Title Aggregate Dataset Identifier Update Frequency Access Constraints Use Constraints Metadata Identifier Parent Metadata Identifier Credit Hierarchy Level
XML Template
  Thesauri Citations: lcsh, lcnaf, geonames
  Metadata Contact: Stanford
  Distributor Contact: Stanford

  Citation
    Thesaurus Name: lcsh
    Identifier: http://id.loc.gov/authorities/subjects.html
    Date: 2011-04-26T00:00:00

  Contact
    Organisation Name: Stanford
    City: Stanford
    State: California
    Zip Code: 94305
    Email: brannerlibrary@gmail.com
XML Template (element: placeholder)
  Title: TITLE
  Abstract: ABSTRACT
  Purpose: PURPOSE
  Theme keyword: THEME
  Place keyword: PLACE
  Originator: ORIGINATOR
  Publisher: PUBLISHER
  Point of Contact: POINT OF CONTACT
XML Template (element: placeholder)
  Identifier (Online linkage): URL
  Online Distribution URL: URL
  Online distribution filename: FILENAME
  Parent Metadata Identifier: PARENTID
  Aggregate Dataset Title: COLLECTIONTITLE
  Aggregate Dataset Identifier: COLLECTIONID
  Credit: CREDIT
  Feature Catalog Title: FCTITLE
  Feature Catalog Identifier: UUID
  Metadata file identifier*: METADATAID*
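Placeholder tokens such as TITLE and COLLECTIONTITLE lend themselves to scripted substitution. The following is a minimal sketch under stated assumptions: the template is handled as plain text, the element names in the usage example are invented for illustration, and `fill_template` is a hypothetical helper rather than part of the workflow shown here. Whole-word matching matters because TITLE is a substring of COLLECTIONTITLE and FCTITLE.

```python
import re
from xml.sax.saxutils import escape


def fill_template(template_text, values):
    """Replace whole-word placeholder tokens with XML-escaped values.

    values maps a placeholder (e.g. "TITLE") to the text that should replace it.
    """
    if not values:
        return template_text
    # Longest tokens first, and \b so TITLE never matches inside COLLECTIONTITLE.
    tokens = sorted(values, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(t) for t in tokens) + r")\b")
    return pattern.sub(lambda m: escape(values[m.group(1)]), template_text)
```

For example, with a made-up fragment:

```python
template = "<title>TITLE</title><collection>COLLECTIONTITLE</collection>"
print(fill_template(template, {
    "TITLE": "Global 2G Mobile Communications Network Coverage, 2007",
    "COLLECTIONTITLE": "Collins Mobile Coverage Explorer, 2007-2013",
}))
```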
Collection Template
  Originator: Collins Bartholomew Ltd.
  Publisher: Collins Bartholomew Ltd.
  Point of Contact: Collins Bartholomew Ltd.
  Publication Date: 2014-01-01
  ISO Topic Category: Utilities and Communication
  Place keyword: Earth
  Theme keyword: Global system for mobile communications
  Parent Metadata Identifier: http://purl.stanford.edu/vh286rq6087.mods
Collection Template
  Title: Global XG Mobile Communications Network Coverage, YYYY
  Abstract: This polygon shapefile is a representation of the coverage area for XG mobile communications networks around the world in YYYY. The data is created from submissions made directly to Collins Bartholomew or the GSMA from mobile operators, which provide roaming detail for inclusion in the online mapping application known as Collins Mobile Coverage Web. Polygons representing the network coverage have been split into country areas using the country polygons from the Collins World Explorer vector product. Polygons are supplied as multipart, and include any country ISO or FIPS codes where known. The data is supplied with a nominal resolution of approximately 1km on the ground.
  Update Frequency: Annually
  Status: Completed
Collection Template
  Aggregate Dataset Title: Collins Mobile Coverage Explorer, 2007-2013
  Aggregate Dataset Identifier: http://purl.stanford.edu/vh286rq6087
  Supplemental information: Operators are asked to submit strong (>= -92dBm) and variable (>= -100dBm and < -92dBm) signal strengths as part of their submissions. The data here includes both types but does not make a distinction between the two.
  Access Constraints (Legal): Restricted
  Use Constraints (Legal): Restricted
  Temporal Extent Description: ground condition
  Purpose: This map can be used to locate and analyze mobile communications coverage on a global scale.
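Import Collection Template
Geoprocessing (Python)

    import os
    import arcpy
    from arcpy import env

    env.workspace = "C:/Desktop/Geo4Lib/Mobile"

    # Walk the workspace and import the collection template into every layer.
    for dirname, subdirs, filenames in os.walk(env.workspace):
        for f in filenames:
            if f.endswith('.shp') or f.endswith('.tif'):
                layer = os.path.join(dirname, f)
                # The import type keyword is an assumption (the slide omits it);
                # use the one that matches the format of mobile.xml.
                arcpy.ImportMetadata_conversion('mobile.xml', 'FROM_ARCGIS', layer)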
Localized Editing
  Title: Global 2G Mobile Communications Network Coverage, 2007
  Abstract: This polygon shapefile is a representation of the coverage area for 2G mobile communications networks around the world in 2007. The data is created from submissions made directly to Collins Bartholomew or the GSMA from mobile operators, which provide roaming detail for inclusion in the online mapping application known as Collins Mobile Coverage Web. Polygons representing the network coverage have been split into country areas using the country polygons from the Collins World Explorer vector product. Polygons are supplied as multipart, and include any country ISO or FIPS codes where known. The data is supplied with a nominal resolution of approximately 1km on the ground.
  Temporal extent: begin 2007-01-01, end 2007-12-31
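For a series like this one, the localized values follow a pattern that could be derived from the filename. The sketch below is illustrative only: it assumes filenames shaped like Global_GSM_2007Q1.shp, assumes the GSM prefix corresponds to "2G" (as in the titles shown in this deck), and treats the temporal extent as the full calendar year. The `GENERATIONS` table and `localize_layer` function are hypothetical names.

```python
import re

# Assumption: filename prefix -> generation label; only GSM appears in the deck.
GENERATIONS = {"GSM": "2G"}

FILENAME_PATTERN = re.compile(r"Global_([A-Za-z0-9]+)_(\d{4})")


def localize_layer(filename):
    """Derive the layer title and temporal extent from a source filename."""
    match = FILENAME_PATTERN.match(filename)
    if match is None:
        raise ValueError("Unexpected filename: " + filename)
    technology, year = match.group(1), match.group(2)
    generation = GENERATIONS[technology]  # KeyError flags an unmapped prefix
    return {
        "title": "Global {} Mobile Communications Network Coverage, {}".format(
            generation, year),
        "begin": year + "-01-01",
        "end": year + "-12-31",
    }
```

Values produced this way still need a human check; the abstract, for instance, carries the same XG and YYYY substitutions and may need wording changes that a pattern cannot supply.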
Attribute Definitions

  ATTRIBUTE LABEL   ATTRIBUTE DEFINITION
  CNTRY_CODE        a unique numerical identifier for each country
  COUNTRY           country name, in English
  CNTRY_ISO         a unique numerical code based on the ISO 3166-1 standard
  CNTRY_ISO2        a unique alpha-2 code based on the ISO 3166-1 standard
  CNTRY_ISO3        a unique alpha-3 code based on the ISO 3166-1 standard
  CNTRY_FIPS        a unique alpha-2 code based on the FIPS 10-4 standard
Registering Layers

  Source_ID               Title
  Global_GSM_2007Q1.shp   Global 2G Mobile Communications Network Coverage, 2007
  Global_GSM_2008Q1.shp   Global 2G Mobile Communications Network Coverage, 2008
  Global_GSM_2009Q1.shp   Global 2G Mobile Communications Network Coverage, 2009
  Global_GSM_201112.shp   Global 2G Mobile Communications Network Coverage, 2011
  Global_GSM_201212.shp   Global 2G Mobile Communications Network Coverage, 2012
  Global_GSM_201312.shp   Global 2G Mobile Communications Network Coverage, 2013
Registering Layers

  Source_ID               Title                                                    DRUID
  Global_GSM_2007Q1.shp   Global 2G Mobile Communications Network Coverage, 2007   nm879wg1606
  Global_GSM_2008Q1.shp   Global 2G Mobile Communications Network Coverage, 2008   tv718rb3367
  Global_GSM_2009Q1.shp   Global 2G Mobile Communications Network Coverage, 2009   rc929pg4438
  Global_GSM_201112.shp   Global 2G Mobile Communications Network Coverage, 2011   cm290sm0643
  Global_GSM_201212.shp   Global 2G Mobile Communications Network Coverage, 2012   fq595rd0954
Identifiers

  Source_ID               Title                                                    DRUID         UUID
  Global_GSM_2007Q1.shp   Global 2G Mobile Communications Network Coverage, 2007   nm879wg1606   f3a3374e-3327-45fd-98c7-c2fbfb524284
  Global_GSM_2008Q1.shp   Global 2G Mobile Communications Network Coverage, 2008   tv718rb3367   ab18d0cc-0ff8-472c-831c-448696bd808e
  Global_GSM_2009Q1.shp   Global 2G Mobile Communications Network Coverage, 2009   rc929pg4438   78c04bc9-f289-47b0-a3d2-3f2c67aabb67
  Global_GSM_201112.shp   Global 2G Mobile Communications Network Coverage, 2011   cm290sm0643   c3e68635-b7cb-4aae-8b06-34664de92bf5
  Global_GSM_201212.shp   Global 2G Mobile Communications Network Coverage, 2012   fq595rd0954   67a298ca-6bd3-497e-8790-b3f6995f9c65
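The UUIDs listed here have the form of random (version 4) UUIDs, which Python's standard library can generate. A minimal sketch, assuming one new UUID is wanted per Source_ID; the function name is hypothetical, and DRUIDs are not generated here because they come from registration in the repository.

```python
import uuid


def assign_uuids(source_ids):
    """Return a dict mapping each source ID to a newly generated random UUID."""
    return {source_id: str(uuid.uuid4()) for source_id in source_ids}


if __name__ == "__main__":
    for source_id, value in assign_uuids(
            ["Global_GSM_2007Q1.shp", "Global_GSM_2008Q1.shp"]).items():
        print(source_id, value)
```

Because the values are random, each run produces different identifiers, so they should be generated once and recorded alongside the DRUID.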
Auto-generating Metadata
  Dataset Identifier: http://purl.stanford.edu/nm879wg1606
  Dataset URI: http://purl.stanford.edu/nm879wg1606
  Distribution URL: http://purl.stanford.edu/nm879wg1606
  Distribution Filename: Global_GSM_2007Q1.shp
  Metadata File Identifier: edu.stanford.purl:nm879wg1606
  Feature Catalog Title (ISO 19110): Feature catalog for Global 2G Mobile Communications Network Coverage, 2007
  Feature Catalog Identifier (ISO 19110): f3a3374e-3327-45fd-98c7-c2fbfb524284
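Every value on this slide can be derived from the layer's DRUID, source filename, title, and UUID. The sketch below shows that derivation as a plain function; the function name and dictionary keys are hypothetical, while the URL prefix and the "edu.stanford.purl:" prefix are taken from the example values above.

```python
PURL_BASE = "http://purl.stanford.edu/"


def layer_identifiers(druid, source_id, title, feature_catalog_uuid):
    """Build the auto-generated identifier values for one registered layer."""
    purl = PURL_BASE + druid
    return {
        "dataset_identifier": purl,
        "dataset_uri": purl,
        "distribution_url": purl,
        "distribution_filename": source_id,
        "metadata_file_identifier": "edu.stanford.purl:" + druid,
        "feature_catalog_title": "Feature catalog for " + title,
        "feature_catalog_identifier": feature_catalog_uuid,
    }
```

Keeping the derivation in one place means a registration spreadsheet with these four columns is enough to populate the identifier fields for a whole collection.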
Creating ISO 19110 Metadata
ArcGIS to ISO 19110 using XSLT (arc219110.xsl):
  - Feature Catalog Title
  - Feature Catalog Identifier (uuid)
  - Originator
  - Attribute Labels
  - Attribute Definitions
Updating Existing Metadata
  - Varies according to collection
  - Preserves existing metadata
  - XSLT adds required and recommended elements
  - Normalizes existing values (themes, places, names, etc.)
  - May require localized editing
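Update Metadata
Geoprocessing (Python)

    import os
    import arcpy
    from arcpy import env

    env.workspace = "C:/Desktop/Geo4Lib/NationalAtlas"

    # Walk the workspace and run modify.xsl over every existing metadata file.
    for dirname, subdirs, filenames in os.walk(env.workspace):
        for f in filenames:
            if (f.endswith('.shp.xml') or f.endswith('.tif.xml')
                    or f.endswith('metadata.xml')):
                record = os.path.join(dirname, f)
                # As on the slide, the output path is the same as the input,
                # so the transformed record replaces the original.
                arcpy.XSLTransform_conversion(record, 'modify.xsl', record)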
Updating Existing Metadata - Titles

  Existing: WIND_FARMS.shp
  Updated:  Wind Farms, United States, 2015

  Existing: Average Annual Rainfall, Monterey County, California
  Updated:  Average Annual Rainfall, Monterey County, California, 2010

  Existing: Current Distributions of Selected Amphibians in the United States
  Updated:  Current Distributions of Selected Amphibians in the United States, 1940-2005

  Existing: USGS Small-scale Dataset - 1:1,000,000-Scale Urban Areas 201309 Shapefile
  Updated:  1:1,000,000-Scale Urban Areas, United States, 2013

  Existing: North American Bat Ranges - Direct Download
  Updated:  North American Bat Ranges, 1830-2005
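The updated titles above all end in a year or a year range. A quick way to find records that still need a title edit is to test for that ending. This is a rough screening sketch, not a rule stated in the deck: the pattern assumes the date is the final element, preceded by a comma, and the function name is hypothetical.

```python
import re

# A trailing ", YYYY" or ", YYYY-YYYY" at the end of the title.
DATE_SUFFIX = re.compile(r",\s\d{4}(-\d{4})?$")


def needs_date(title):
    """Return True when a title does not end with a year or year range."""
    return DATE_SUFFIX.search(title.strip()) is None


if __name__ == "__main__":
    for title in ("North American Bat Ranges - Direct Download",
                  "North American Bat Ranges, 1830-2005"):
        print(title, "->", "edit" if needs_date(title) else "ok")
```

The check only flags candidates; deciding the place name and the correct date still requires looking at the data.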
Other Considerations
  Temporal Extents: ground condition vs. publication date
  Related items: lineage metadata for source materials
  Custom Projections: use free text to describe projections when no codes exist
  Rights (Access and Use Conditions): Restricted, Public domain, Creative Commons, GNU
Other Considerations
  Geometry Types: Point, Line, Polygon
  Distribution Formats: Shapefile, Raster Dataset (GeoTIFF, ArcGRID, ...)
  Collection Development: $$$$$
Thanks