ModestR: a software tool for managing and analyzing species distribution map databases



Similar documents
Required and Recommended Supporting Information for IUCN Red List Assessments

Introduction to GIS software

Working with climate data and niche modeling I. Creation of bioclimatic variables

EMODnet Biology. bio.emodnet.eu

MicroStrategy Desktop

There are various ways to find data using the Hennepin County GIS Open Data site:

RISKSCAPE TUTORIAL 4: 200 YEAR ANNUAL RETURN INTERVAL (ARI) HEATHCOTE RIVER FLOOD EVENT: MITIGATING IMPACTS ON CHRISTCHURCH BUILDINGS

ModEco: an integrated software package for ecological niche modeling

Quick and Easy Web Maps with Google Fusion Tables. SCO Technical Paper

SAS BI Dashboard 3.1. User s Guide

ALTAIR SOFTWARE ASSET OPTIMIZATION USER GUIDE

Cookbook 23 September 2013 GIS Analysis Part 1 - A GIS is NOT a Map!

HELCOM Data and Map Service. User Manual

University of Arkansas Libraries ArcGIS Desktop Tutorial. Section 4: Preparing Data for Analysis

Data source, type, and file naming convention

CyberTracker. Step-by-Step Tutorial. How to Customise a Wildlife Monitoring Application

ArcGIS online Introduction Module 1: How to create a basic map on ArcGIS online Creating a public account with ArcGIS online...

Guidelines on Information Deliverables for Research Projects in Grand Canyon National Park

Institute of Natural Resources Departament of General Geology and Land use planning Work with a MAPS

Figure 2: System Flow Diagram for Workflow Management

A METHODOLOGY FOR GIS INTERFACING OF MARINE DATA

From GPS Data Collection to GIS Data Display A Walk-Through Example

Activity: Using ArcGIS Explorer

WFP Liberia Country Office

Developing Business Intelligence and Data Visualization Applications with Web Maps

What s new in Carmenta Server 4.2

DIVA-GIS v5 - Tutorial February 2005

Virginia Commonwealth University Rice Rivers Center Data Management Plan

Tutorial 8 Raster Data Analysis

Latin American and Caribbean Flood and Drought Monitor Tutorial Last Updated: November 2014

Chapter 6: Data Acquisition Methods, Procedures, and Issues

NETWORK MONITORING. Network Monitoring. Product brief. NETWORK MONITORING Logger Only

SAS BI Dashboard 4.3. User's Guide. SAS Documentation

Week 7 - Geographic Analysis of Living Species

HydroDesktop Overview

Application of GIS Analyzes with Cloud Computing

Data access and management

Guide to Viewing Maps in Google Earth

Tutorial 3 - Map Symbology in ArcGIS

SimplyMap Canada Tutorial

Modeling Fire Hazard By Monica Pratt, ArcUser Editor

GIS Tools for Land Managers

ReceivablesVision SM Getting Started Guide

GEM global earthquake model. User guide: Android mobile tool for field data collection. Rosser, J., J.G. Morley, A. Vicini. Data capture tools

Arctic Network SQL Server Data Analysis Using Microsoft Access

Portal Version 1 - User Manual

MicroStrategy Analytics Express User Guide

Publishing KML Services Tutorial

Create a folder on your network drive called DEM. This is where data for the first part of this lesson will be stored.

Expert Review and Questionnaire (PART I)

INTRODUCTION to ESRI ARCGIS For Visualization, CPSC 178

UCM-MACB 2.0: A COMPLUTENSE UNIVERSITY VIRTUAL HERBARIUM PROJECT

S&C IntelliTeam CNMS Communication Network Management System Table of Contents Overview Topology

Enterprise GIS Solutions to GIS Data Dissemination

GEOGRAPHIC INFORMATION GATEWAY New York Department of State (NYDOS) data acceptance & metadata standards

How To Write An Nccwsc/Csc Data Management Plan

Product Navigator User Guide

A Tutorial on dynamic networks. By Clement Levallois, Erasmus University Rotterdam

Managing Software Updates with System Center 2012 R2 Configuration Manager

Custom Reporting System User Guide

AHUDesigner. The Air Handling Units selection software. Product description

Tutorial Segmentation and Classification

GEM global earthquake model. User guide: Tool for spatial inventory data development. Hu, Z., C. Huyck, M. Eguchi, J. Bevington. Data capture tools

ArcGIS. Image Server tutorial

Introduction to GIS.

MARXAN Conservation Planning. Decision Support Software. T u t o r i a l

InterCall Online. Administrator Reports and Invoices. Basic Reports CONFERENCING & COLLABORATION

NATIONAL GENETICS REFERENCE LABORATORY (Manchester)

BusinessObjects Enterprise InfoView User's Guide

DATA VISUALIZATION WITH TABLEAU PUBLIC. (Data for this tutorial at

Data Quality Working Group

Managing Bathymetry in the Cloud with GIS

CRMS Website Training

Census Data: Access, Mapping and Visualization

GIS-Integrated Emissions Inventory Software Solution

Implementing Geospatial Data in Parametric Environment Elçin ERTUĞRUL*

Creating Figure Ground Maps in ArcMap 10.x: Basic procedures to download, open, manipulate and print spatial data

Information technology to assist in conserving and using crop wild relatives and landrace diversity (the boring version without pictures)

An Esri White Paper October 2010 Esri Production Mapping Product Library: Spatially Enabled Document Management System

Visualization with Excel Tools and Microsoft Azure

User Guide. Analytics Desktop Document Number:

The Olympus stereology system. The Computer Assisted Stereological Toolbox

Discover PA An example session displayed in Discover Viewer, which was created in Discover PA Professional.

ES341 Overview of key file formats and file extensions in ArcGIS

Lesson 15 - Fill Cells Plugin

Earthquakes, volcanoes and plate tectonics in ArcGIS Online

Promoting Your Location Platform

mysensors mysensors Wireless Sensors and and Cellular Gateway Quick Start Guide Information to Users Inside the Box

CHAPTER 2: APPROACH AND METHODS APPROACH

Tutorial 2 Online and offline Ship Visualization tool Table of Contents

WebSphere Business Monitor

Colligo Manager 6.0. Connected Mode - User Guide

Producing Listings and Reports Using SAS and Crystal Reports Krishna (Balakrishna) Dandamudi, PharmaNet - SPS, Kennett Square, PA

RS/GIS Quick Start Guides Collaborative training materials available from the Biodiversity Informatics & Geospatial Innovation Facilities

Transcription:

Ecography 36: 1202 1207, 2013 doi: 10.1111/j.1600-0587.2013.00374.x 2013 The Authors. Ecography 2013 Nordic Society Oikos Subject Editor: Thiago Rangel. Accepted 21 June 2013 ModestR: a software tool for managing and analyzing species distribution map databases Emilio Garc í a-rosell ó, C á stor Guisande, Jacinto Gonz á lez-dacosta, Juergen Heine, Patricia Pelayo-Villamil, Ana Manjarr é s-hern á ndez, Antonio Vaamonde and Carlos Granado-Lorencio E. Garc í a-rosell ó (erosello@uvigo.es), J. Gonz á lez-dacosta and J. Heine, Computer Science Dept, Univ. of Vigo, Campus Lagoas-Marcosende s/n, ES-36310 Vigo, Spain. C. Guisande, Faculty of Sea Sciences, Univ. of Vigo, Campus Lagoas-Marcosende s/n, ES-36310 Vigo, Spain. P. Pelayo-Villamil, Ichthyology group, Univ. of Antioquia, A.A. 1226, Medell í n, Colombia. A. Manjarr é s-hern á ndez, Amazonia Research Inst., Univ. National of Colombia, Km 2 v í a Tarapac á, Leticia, Colombia. A. Vaamonde, Faculty of Economy, Univ. of Vigo, Torrecedeira 105, ES-36208 Vigo, Spain. C. Granado-Lorencio, Faculty of Biology, Univ. of Sevilla, Avenida Reina Mercedes s/n, ES-41012 Sevilla, Spain. Th e ModestR package consists of three applications: MapMaker, DataManager and MRFinder. MapMaker facilitates making range maps by drawing the areas, by importing existing data or using the Global Biodiversity Information Facility portal. It can discriminate between different habitats, thereby making data cleaning tasks easier. DataManager allows the management of taxonomically structured databases for range maps. MRFinder supports querying ModestR databases to find the species present in specific areas. Possible applications include the compilation and management of species distribution databases, cleaning data and computing aggregated data to perform subsequent analyses in other packages thanks to emphasized interoperability. ModestR package has been developed with the primary aim of providing the scientific community with an easy-to-use but powerful tool for managing species distribution data. It is designed to be simple and intuitive even for users not familiar with general-purpose Geographical Information Systems tools (GIS) that are broadly used for species distribution mapping. ModestR supports databases structured on hierarchical taxonomy. It provides features to easily clean, manage, analyse and summarise large species distribution datasets. It offers high interoperability with the other software tools widely used for pre-post processing data related to distributions, such as spreadsheets, GIS software, and SDM software. Particular attention, however, has been paid to facilitate data exchange with R statistical environment (R Development Core Team) and subsequently with packages widely used in distribution analysis, such as SDMtools (VanDerWal et al. 2012) or dismo (Hijmans et al. 2012). Moreover, a new interface is being developed, which is specifically designed to link the output of ModestR with R environment. Data exchange, however, is also possible with other stand-alone applications such as SAM spatial analysis software (Rangel et al. 2010) and SDM software such as Maxent (Phillips et al. 2006). ModestR software design The ModestR package consists of three applications: MapMaker, DataManager and MRFinder. MapMaker allows users to easily draw range maps by means of an intuitive interface. A map made with MapMaker will be stored in a ModestR database linked to taxonomic data. This database can be created and managed with DataManager, which also allows maps to be processed, and information to be aggregated and exported for subsequent analysis using other packages. MRFinder allows querying a ModestR database to retrieve the species that are present in a specific area, and subsequently calculate and export aggregated data from those species. To be useful to the broadest range of users, it was decided to prioritise the ease of use and the innovative and useful features in ModestR, together with the data interoperability with existing tools that already provide many other capabilities that different users may need. The following sections provide further details about each of the applications in ModestR. MapMaker description Th e MapMaker application provides a simple but powerful user interface for building precise individual species distribution maps. ModestR incorporates habitats, a concept that is specific for species distribution maps. Five habitats are currently supported in MapMaker: sea, land, small rivers, large rivers and lentic habitats. Th ere are several ways of using MapMaker to create a distribution map. First, by importing a CSV file with 1202

occurrence coordinates, which can be useful when analysing field samples. Second, by selecting a species (synonyms of the species can also be included) and importing data from the Global Biodiversity Information Facility (GBIF 2010) online portal. The data is cleaned as all duplicates are removed and those locations that are not in the habitat selected by the user are considered invalid, with a current error of 0.1 minutes. As an example, the upper map of Fig. 1 shows the geographical records for the white shark Carcharodon carcharias in MapMaker, obtained from the GBIF database. MapMaker automatically classifies valid (red) and invalid samples (black) in Fig. 1 depending on whether coordinates are within or outside the habitat of the species (this shark is a marine species). Data cleaning can be reinforced by applying user-defined rules involving environmental variables. This requires providing MapMaker environmental data files in CSV or ESRI ASC format. The user can always manually reclassify samples. Figure 1. Range maps for the white shark Carcharodon carcharias made with MapMaker. The upper map was made with data obtained from the online GBIF database; red and black points correspond to samples classified as valid and invalid by MapMaker, respectively. The lower map was made using the freehand tool, with information about the hypothetical distribution of this species (Compagno et al. 2005). 1203

Third, MapMaker also supports expert-drawn maps that usually consist of several close areas that specify species presence. The user can select an area with the freehand tool and choose the type of habitats within this area occupied by the species. MapMaker will then fill in the corresponding portion of the map. For example, the lower map of Fig. 1 shows the distribution of Carcharodon carcharias made using the freehand tool with information about the potential distribution of this species, according to Compagno et al. (2005). This feature makes drawing distribution maps very intuitive, as the user can immediately see the real area selected. Four, MapMaker can also import range maps from files in KML or shapefile formats commonly used by GIS tools. Finally, MapMaker can create maps by importing ESRI ASCII grid probability files generated by distribution modelling software such as Maxent (Phillips et al. 2006) and dismo (Hijmans et al. 2012). In this case, a probability threshold has to be entered, and MapMaker will build a range map with all the areas where the probability of species presence is higher than this threshold. Maps drawn with MapMaker can be saved in standalone files or in a ModestR database by assigning the map to a particular species. They can also be exported to several formats, such as high-resolution BMP or JPEG, CSV, KML, or shapefile (a free Geospatial Data Abstraction Library package is used for the latter option). DataManager description DataManager provides integrated management of taxonomic information and distribution maps in ModestR databases. Although there is an option for manually introducing the taxa, a ModestR database will usually be populated by importing existing taxonomic data from files in CSV or in phyloxml formats. Once the taxonomic data is introduced, distribution maps for any species can be stored in the database, either by saving the map directly to the database from MapMaker, or by importing data maps from DataManager. Map import/export features in DataManager are the same as in MapMaker, with the difference being that bulk tasks can be done straightforwardly for all species from any branch of the taxonomy. For example, a single operation in DataManager can download data for all species of an order from GBIF, or import samples for several species from a CSV file, or export all species of a taxonomic group to JPEG files, or import several ESRI probability files generated by an SDM software and building and storing the corresponding maps in the database. This feature allows users to easily complete tasks that involve the import, processing and export of large amounts of data (i.e. Fig. 2). Th e database contents are presented in DataManager as a hierarchical taxonomy tree with classes at the highest level, and species at the lowest level. An icon signals species that have an associated map. Several filtering and searching options are available to make it easier to find taxa and maps (Fig. 3, upper panel). Moreover, any modifications in the taxonomy can be directly made onto the tree. This taxonomic tree facilitates the understanding and management of the data structure, as well as the taxonomic relationships among the species. It also allows users to explore the species distribution patterns at higher taxonomic ranks than species, which is a valuable feature in species distribution analysis (Gruenwald and Zhang 2008). DataManager also performs the rasterization of distribution maps, currently to a 1 1 minute cell, which can be Figure 2. Steps used to obtain a latitudinal range index in cells of 5 5 of fishes belong to orders with only marine species, using maps created by importing the predictions of the distribution models for all the species inferred with Maxent ver. 3.3.3, based on occurrence records previously obtained from the GBIF by DataManager, and using the Bio-ORACLE global dataset as environmental factors (Tyberghein et al. 2012). A record was considered valid when the probability of the distribution models obtained was higher than 0.75 (this setting can be modified by the user). Results are drawn with R environment software. Each step described on the left flowchart required only one single operation in DataManager (provided that required ModestR taxonomy database previously existed). Steps with shaded background indicate operations performed with other applications and importing/exporting data to/from DataManager. 1204

Figure 3. Top: the menu of DataManager showing the taxonomy of IPez (Guisande et al. 2010). The left tree displays taxonomic data. A world icon beside the species indicates that there is a map assigned to the species. Several filtering and searching options are available in the right panel. Middle: the DataManager dialog box used to configure exportation settings; the left tree allows selecting which metrics will be calculated as well as environmental variables. Bottom: the MRFinder dialog box where species found in an area are shown. Filters can be added to select specific taxonomic groups from the species found and several export tools are available. 1205

considered an appropriate resolution for a broad range of usages. The resulting matrix is stored in the database to be used to perform subsequent calculations. As rasterization is a laborious task that can take a significant amount of time, particularly for expert-drawn maps (far less for sample-based maps), the user can decide when this task can be carried out, thus avoiding hindering his/her work. Once maps have been rasterized, all the data exporting options of DataManager are fully available. It is possible to export aggregated data from any branch of the taxonomy, from the class to the species level, which is a feature that is not usually available in software not specifically designed for species distribution mapping purposes (Gruenwald and Zhang 2008). In order to generate metrics from one species or from any branch of the taxonomy, output cell resolution can be selected from 1 1 to 60 60 minutes. A tree shows the available metrics that DataManager can calculate, which include several biogeographical indices and the area of each type of habitat for each considered cell (Fig. 3, middle panel). DataManager also allows configuring environmental variables to be included in the output. This data can be easily imported from files in CSV or ESRI ASC format as for example a Bi-Oracle dataset (Tyberghein et al. 2012). The purpose of this feature is to integrate exportation of aggregated data for species distributions and environmental variables. Output data can be exported to CSV format, arranged as a matrix or as a list of values, to a raster in ESRI ASCII format, or to a Excel-xlm file (Fig. 3, middle panel). These files can also be easily imported to spreadsheets, the R environment software and GIS tools. Latitudinal gradients for the supported metrics can also be calculated for bands of any height from 1 to 10 of latitude. These are particularly useful to analyse distribution patterns. Samples from any taxonomic group can be exported to a single CSV file, making it easy to use them in other packages, including spreadsheets, statistical or SDM software. When existing maps are expert-drawn range maps, pseudosamples may also be generated. Finally, samples from maps can also be exported to a CSV format designed to be easily combined with available GBIF tools to generate Darwin Core archives. MRFinder description MRFinder is the final application in ModestR package. Its user interface is very similar to that of MapMaker (Fig. 1), but MRFinder allows the user to select an arbitrary zone in the world map, whether by drawing it freehand for an irregular zone, or by entering precise coordinates for a rectangular zone. MRFinder performs a search for species in the selected area, showing the results of the query in a taxonomic tree where filters can be applied to select specific taxonomic groups (Fig. 3, lower panel). It is also possible to select only rare species from those found in an area. The criteria for considering a species as rare may vary among studies, but they are usually associated to small range size, and this was the criteria used in ModestR. A species is considered to be rare when it is only found in a particular region and nowhere else, and its geographical distribution is restricted to an area smaller than a size defined by the user. As well as the same data aggregation and exportation features already available in DataManager (exporting metrics, latitudinal gradients, taxonomy lists, samples, etc.), statistical data such as the area of occupancy, maximum, mean and minimum latitude and longitude, and the reference area/occupied area ratio, can also be generated. These results are calculated and summarized for each taxonomic level of the species, from species up to classes, using the selected area boundaries as reference (even in the case of an irregular area). In this way MRFinder allows different analyses to DataManager: whereas in DataManager the analyses can only be performed by aggregating the species of a taxonomic group, in MRFinder they can include species from different groups that are present in the same area, allowing for more complex analyses. Package installation Th e ModestR software package is free for academic noncommercial use. It can be downloaded from ModestR website at www.ipez.es/modestr or from persistent URL www.purl.org/modestr. It requires Microsoft Windows XP or later and the.net framework 4.0. Additional materials, including a tutorial, an R package with several functions to make maps with data exported from ModestR, and sample databases, are available at the ModestR website. Geographic data about country borders were obtained from the web page www.divagis.org (Hijmans et al. 2001), while geographic data for the freshwater habitats were obtained from www.openstreetmap.org/. To cite ModestR or acknowledge its use, cite this Software note as follows, substituting the version of the application that you used for version 0 : Garc í a-rosell ó, E., Guisande, C., Gonz á lez-dacosta, J., Heine, J., Pelayo-Villamil, P., Manjarr é s-hern á ndez, A., Vaamonde, A. and Granado-Lorencio, C. 2013. ModestR: a software tool for managing and analyzing species distribution map databases. Ecography 36: 1202 1207 (ver. 0). References Compagno, L. et al. 2005. Sharks of the world. Princeton Univ. Press. GBIF 2010. GBIF data portal. http://data.gbif.org. Gruenwald, L. and Zhang, J. 2008. Embedding and extending GIS for exploratory analysis of large-scale species distribution data. In: Proceedings of the 16th ACM SIGSPATIAL conference on advances in geographic information systems. New York, USA, article no. 28. Guisande, C. et al. 2010. IPez: an expert system for the taxonomic identification of fishes based on machine learning techniques. Fish. Res. 102: 240 247. Hijmans, R. et al. 2001. Computer tools for spatial analysis of plant genetic resources data: 1. DIVA-GIS. Plant Genet. Res. Newslett. 27: 15 19. 1206

Hijmans, R. J. et al. 2012. dismo: species distribution modeling. R package ver. 0.7-23, CRAN.R-project.org/package dismo. Phillips, S. J. et al. 2006. Maximum entropy modeling of species geographic distributions. Ecol. Model. 190: 231 259. Rangel, T. F. L. V. B. et al. 2010. SAM: a comprehensive application for spatial analysis in macroecology. Ecography 33: 46 50. Tyberghein, L. et al. 2012. Bio-ORACLE: a global environmental dataset for marine species distribution modelling. Global Ecol. Biogeogr. 21: 272 281. VanDerWal, J. et al. 2012. SDMTools: tools for processing data associated with species distribution modelling exercises. R package, ver. 1.1-13, http://cran.r-project.org/ package SDMTools. 1207