Name-based Approach to Build a Hub for Biodiversity LOD
|
|
|
- Ursula Webster
- 10 years ago
- Views:
Transcription
1 Name-based Approach to Build a Hub for Biodiversity LOD Yoshitaka Minami a,, Hideaki Takeda a*, Fumihiro Kato a, Ikki Ohmukai a, Noriko Arai a, Utsugi Jinbo b, Shoko Kawamoto c, Satoshi Kobayashi d, and Motomi Ito e a National Institute of Informatics, Hitotsubashi, Chiyoda-ku, Tokyo, Japan b National Museum of Nature and Science, 7-20, Ueno-Koen, Taito-ku, Tokyo, Japan c Database Center for Life Science, Research Organization of Information and Systems, , Yayoi, Bunkyo-ku, Tokyo, Japan d Transdisciplinary Research Integration Center, National Institute of Polar Research, 10-3, Midori-cho, Tachikawa-shi, Tokyo, Japan e Department of General Systems Studies, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1, Komaba, Meguro-ku, Tokyo, Japan Abstract. Because of a huge variety of biological studies focused on different targets, i.e., from molecules to ecosystem, data produced and used in each field is also managed independently so that it is difficult to know the relationship among them. We aim to build a data hub with LOD to connect data in different biological fields to enhance search and use of data across the fields. We build a prototype data hub on taxonomic information on species, which is a key to retrieve data and link to databases in different fields. The core of this hub is the dataset for species and taxa. We adopted the database called Building Dictionary for Life Science (BDLS) that contains relationship between scientific names and common Japanese names. Based on this dataset, we integrate various datasets such as domain-specific taxonomies and specimen databases. Keywords: Linked Open Data (LOD), biodiversity, taxonomy, data integration 1. Introduction Biodiversity [1] becomes a big scientific and social problem according to global awareness to the environmental problems. Biodiversity is related to many research fields, in particular biology, but biology itself consists of many research fields focused on various targets, from molecules to ecosystem. Thus, there are many biological disciplines, from molecular and cell biology, to ecology, evolution and taxonomy. Data collected from biological studies are highly diverse in contents and formats. Each research field can yield and use data for own field, but such data often lacks information on relationships to one another. As collaborative research projects across different fields are developing, demands for a data coordination system is increasing. When focusing on data, diversity can be categorized in the following three ways. Firstly, there is diversity in subjects of biological researches. There are different fields depending on hierarchical level of focus ranging from molecular biology to ecology, and analysis using multi-scale data is often required. There are large databases for molecular data (e.g. DDBJ, NCBI) and for specimen and observation data (GBIF), but their relationships are rather weak. On the other hand, some specialists of specific groups of organisms build their own specific databases. Such databases contain valuable data but are often independent from each other. Secondly, representation in local languages is needed for wide range of people, for example, governmental people working on biological resource management or biodiversity conservation in individual countries. * Corresponding author. [email protected].
2 Thirdly, there is diversity by people. In addition to researchers and governmental people, general people are also looking for biological data for their activities. Recently, Citizen Science programs, namely, researches and studies in collaboration with general people, are emerging and biology is in its forefront. We aim to build a data hub to connect data collected in various biological fields to enhance researchers to search and use data across fields. Therefore, the information infrastructure for biodiversity should be required to treat heterogeneous, multi-scale and multilingual data in scattered databases and to provide them for various people. We aim to build a data hub to absorb the above diversity. 2. Related work In biodiversity informatics [2], ensuring interoperability of the various databases specialized in individual purpose, is one of the most important issues [3]. Several researchers and groups have started to research about Linked Data in biodiversity information but they have not reached standard or consensus fully [4]. Peterson et al. [5] emphasized that the integration of scientific names using linked data approach has a big potential and enables to create rich services that biologist can benefit. Darwin Core [6] is a well-known standard for metadata for biodiversity information, but Linked Data for Darwin Core is under discussion. TaxonConcept 1 provides ontology and data for species but data is limited to the specific geographical area. 3. The basic policies for integration In order to fulfill the requirements mentioned in Section 1, we set up two basic policies. The first one is that we focus on taxonomic information on species since they are common and mandatory fields for most biological information. It provides very basic information on each species including classification, and scientific and general (English and Japanese) names. It also provides links to entries on other database such as NCBI (National Center for Biotechnology Information), EOL (Encyclopedia of Life), and DBpedia based on scientific name. The second is that we treat names as the first class entities. A taxon can be represented as a set of names 1 that are linked to each other. It is a different approach in other studies like TaxonConcept and Darwin Core [6] where taxa are treated as the first class entities. There are two main benefits for our approach. The first is that it is easy to build and maintain the database since identification of the authorized names can be postponed when building the database. The second is that linking to other databases is relatively easy including non research-based databases like those maintained by citizen since variation of names is naturally included. There are also the drawbacks. It is not easy to provide authorized names since it needs some processing on the network. The other is that homonymies, i.e., some names are used for different taxa. 4. The core dataset: BDLS (Building Dictionary for Life Science) We selected the dataset called BDLS (Building Dictionary for Life Science) for a core dataset for taxa. BDLS 2 is an integrated dictionary for biology which is built from nearly 100 sources which include various illustrated books and specimen dataset in museums, the latter of that are provided by the Science Museum Net 3 in Japan. There are two main parts, i.e., one is the dictionary for taxa (mainly species) and the other is the dictionary for terminology. We used the former mainly. It contains scientific names, Japanese common names, and common names in other languages (mainly English) for taxa and their relations (relation between scientific names and common names). Every relation is annotated with provenance, i.e., the source like a name of book or database. It contains 55,759 scientific names and 57,929 Japanese common names. Among them, there are 55,245 relations between scientific and Japanese common names. It is probably the largest dictionary for Japanese common names for taxa. The analysis 2 It is developed by Database Center for Life Science (DBCLS), Research Organization of Information and Systems (ROIS), Japan and available from 3 The network is maintained by the National Museum of Nature and Science, Tokyo.
3 about correspondence with other databases like Species2000 and NCBI is available Other databases There exist various datasets even just concerning species information. Among them we selected mainly three datasets to examine feasibility of integration as a necessary condition to publish and to contain species data. We used the following two databases to test integration. 1. A domain-specific dataset for taxon names: the Current Checklist of Japanese Butterflies We selected the Current Checklist of Japanese Butterflies [7] as a domain-specific dataset. It is a checklist (a list of species names) created and authorized by the butterfly taxonomists. It covers all butterfly species which number is 327 found in Japan, and describes each species by scientific and Japanese general names and higher taxa. 2. A domain-specific dataset for specimens: Bryophytes Specimen Collection We selected the Bryophytes Specimen Collection that National Institute of Polar Research developed and maintained as another domain-specific dataset. This data has 56,590 specimen data. 6. The data model for species information These dataset that we have selected have common information about scientific name and taxon. Then, we made a data model shown in Fig.1 in order to link to entries among the databases through the common information and to be used from outside conveniently. This data model was expressed in Named Graph for the data sources, i.e., the sub datasets in BDLS, Butterflies and Bryophytes. One of the big issues in species information is treatment on various names. Each species has its scientific name but can be represented differently. One case is caused by different citation forms (e.g., use of abbreviation for genus, omission of authors). The other case is derived from multiple names for c one species. The valid species name and the combination of genus and species might be changed as taxonomic studies proceed. Furthermore a species may have multiple general names in local languages. It is a delicate problem in taxonomy to choose a unique valid name for each species and it is beyond our scope 5. Rather we represent each name as a node and associate nodes by relationship such as synonyms. A basic model is as follows. We provide a class for taxon name and classes for scientific name and common name as its subclass. All nodes on names are instances of these classes. A node can have a hastaxonrank property of which value is an instance of Class TaxonRank, i.e., either kingdom, phylum, class, order, family, subfamily, tribe, subtribe, genus, subgenus or species. Another type is a node for specimen which is an instance of Class Specimen providing specific properties on specimen. All triples on instances are associated to data source URIs in Named Graph. The benefit of name-based approach is rapid integration of data from different data sources. Its drawback is complexness of representation since a single specimen is represented as a network but it can be compensated by inference in RDF. Representation of species name common to three datasets is defined as follows. First, nodes of the ScientificName type and CommonName type are generated for species name and for common name respectively. Next, the hascommonname property links a node of the ScientificName to a node of the CommonName, and the hasscientificname property vice versa. A node representing taxon is linked to a node of the TaxonRank type by the hastaxonrank property. And, other items are literal. Nodes representing specimen in Bryophytes dataset describe ID, collected date, collector, latitude, longitude, floral region, floral subsection, locality, sporophyte, altitude, determiner, and herbarium housed 6. Nodes representing specimen are defined as follows. First, the node is generated assigning a URI to each specimen because there can multiple specimens for one species. Next, a specimen node is linked to the node of ScientificName by the species property. A specimen node has a link by the 5 The exception in our dataset is Butterflies dataset where a list of valid scientific and common names is authorized by taxonomic experts. 6 The list of herbarium index is available from Index Herbariorum: ( sp)
4 crm:has_current_location property to a node representing a facility to represent herbarium housed relationship. And, other Items are literal. Source information in BDLS dataset is represented by name in Named Graph. RDF triples representing data in a data source has a name which has a link by the dcterms:source property to the node representing the data source. Then it has properties such as rdfs:label and dcterms:publisher. 7. The Results In accordance with the data model, we generated LOD from the selected data. As a result, the number of taxon name is 443,248, scientific name of species is 226,141, common name of species is 219,865, hasscientificname property node is 87,160 and hascommonname property node is 84,610. The numbers of names become roughly four times larger than those in BDLS due to introduction of other databases. But the number of the relation between scientific and common names does not increase so much. It indicates that many of the relations between them are left to be added. Our approach is successful in integration basically. But it causes some problems when the dataset is used. For example, we implemented a simple taxon search interface by name. We can show the results by matching names but we can just a set of taxon names but not show representative names. One of the potential problems of our approach is homonymy, i.e., two taxa may share a name. Though the naming rule of scientific name is not essentially permitted a name sharing two or more taxa, there are some exceptional cases e.g., an animal and a plant can share one genus name. We checked how it is in the real dataset by using the NCBI taxonomy. We found 1797 homonymical names. By using this data, we can distinguish taxon names properly. 8. Publishing data as Linked Data We created the model for species information and translated the data into Linked Data according to the model. The whole data is available from SPARQL Endpoint 7 and a simple search interface 8. It also includes 7 the out-going links to LOD such as DBpedia (en) and to non-lod data sites such as NCBI taxonomy and Encyclopedia of Life. But since licensing for some of the original datasets is not clear as open, the whole dataset itself has not been registered to the Data Hub yet. We are currently working towards open license for them. BDLS dataset is clearly open with CC-BY-SA license. It is registered in the Data Hub 9 and available as the dumped data and the SPARQL Endpoint 10. It is alone valuable as the database for species with scientific names and Japanese common names, which can work a hub of biodiversity information by interlinking various datasets by not only scientists but ordinary people. 9. Conclusion We described the concept of the data bub for species based on names and the prototype system with translating the datasets into Linked Data. Namebased approach is well suited to Linked Data since different names for species which may appear in different datasets can be linked to each other. We publish the integrated dataset as a prototype and also the core dataset as LOD to prompt integration with more datasets. References [1] Bisby, F.A.: The quiet revolution: biodiversity informatics and the Internet, Science, Vol. 289, No. 5488, pp , (2000) Edwards, J.L., Meredith A.L., and Nielsen E.S.: Interoperability of Biodiversity Databases: Biodiversity Information on Every Desktop. Science, Vol. 289, No. 5488, pp , (2000) [2] Jenkins M (2003) Prospects in biodiversity. Science 302: [3] Global Biodiversity Information Facility (2011) Recommendations for the Use of Knowledge Organisation Systems by GBIF. accessible online at [4] Patterson DJ, Cooper J, Kirk PM, Pyle RL and Remsen DP (2010) Names are key to the big new 8 Access from by selecting SPECIES as dataset
5 biology. Trends in Ecology and Evolution 25: doi: /j.tree [5] Wieczorek J, Bloom D, Guralnick R, Blum S, Döring M, et al. (2012) Darwin Core: An Evolving Community-Developed Biodiversity Data Standard. PLoS ONE 7(1): e doi: /journal.pone [6] Inomata T, Uémura Y, Yago M, Jinbo U and Ueda K (2010) The Current Checklist of Japanese Butterflies. Fig. 1. Data model
Data quality Vision at SBBr Danny Vélez
Data quality Vision at SBBr Danny Vélez 4th workshop SiBBr: data quality and ecological data 25-29 August 2014 SiBBr: national level Community Universities NGO s Government agencies Research centers Citizens
UCM-MACB 2.0: A COMPLUTENSE UNIVERSITY VIRTUAL HERBARIUM PROJECT
UCM-MACB 2.0: A COMPLUTENSE UNIVERSITY VIRTUAL HERBARIUM PROJECT A. Avalos, A. Barrera, J.M. Gabriel y Galán, T. Gallardo, A. Gómez, J.M. Hernández, R. Lahoz, P. López, N. Marcos, L. Martin, A. G.Moreno,
Linked Open Data Infrastructure for Public Sector Information: Example from Serbia
Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 26-30, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.
Digitization in the Pacific. Larry M. Page PD, idigbio Curator, FLMNH
Digitization in the Pacific Larry M. Page PD, idigbio Curator, FLMNH Advancing Digitization of Biodiversity Collections (ADBC) Coordinating center for the nationaleffort to digitize natural history collections
RESPONSE FROM GBIF TO QUESTIONS FOR FURTHER CONSIDERATION
RESPONSE FROM GBIF TO QUESTIONS FOR FURTHER CONSIDERATION A. Policy support tools and methodologies developed or used under the Convention and their adequacy, impact and obstacles to their uptake, as well
Required and Recommended Supporting Information for IUCN Red List Assessments
Required and Recommended Supporting Information for IUCN Red List Assessments This is Annex 1 of the Rules of Procedure IUCN Red List Assessment Process 2013-2016 as approved by the IUCN SSC Steering Committee
Semantic Interoperability
Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from
The Digital Atlas of Ordovician Life: digitizing and mobilizing data for paleontologists and the public
doi: 10.3176/earth.2014.36 The Digital Atlas of Ordovician Life: digitizing and mobilizing data for paleontologists and the public Alycia L. Stigall a,b, Jennifer E. Bauer a* and Hannah-Maria R. Brame
LinkZoo: A linked data platform for collaborative management of heterogeneous resources
LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research
DISCOVERING RESUME INFORMATION USING LINKED DATA
DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India [email protected] 2 Department of Computer
Interactive Information Visualization in the Digital Flora of Texas
Interactive Information Visualization in the Digital Flora of Texas Teong Joo Ong 1, John J. Leggett 1, Hugh D. Wilson 2, Stephan L. Hatch 3, Monique D. Reed 2 1 Center for the Study of Digital Libraries,
QUALITY CONTROL PROCESS FOR TAXONOMY DEVELOPMENT
AUTHORED BY MAKOTO KOIZUMI, IAN HICKS AND ATSUSHI TAKEDA JULY 2013 FOR XBRL INTERNATIONAL, INC. QUALITY CONTROL PROCESS FOR TAXONOMY DEVELOPMENT Including Japan EDINET and UK HMRC Case Studies Copyright
Mining the Web of Linked Data with RapidMiner
Mining the Web of Linked Data with RapidMiner Petar Ristoski, Christian Bizer, and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {petar.ristoski,heiko,chris}@informatik.uni-mannheim.de
Information technology to assist in conserving and using crop wild relatives and landrace diversity (the boring version without pictures)
Information technology to assist in conserving and using crop wild relatives and landrace diversity (the boring version without pictures) Theo van Hintum Centre for Genetic Resources, The Netherlands (CGN)
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011)
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011) Key Dates Release Date: June 6, 2013 Response Date: June 25, 2013 Purpose This Request
Global Ecology and Wildlife Conservation
Vaughan Centre for Lifelong Learning Part-Time Certificate of Higher Education in Global Ecology and Wildlife Conservation Delivered via Distance Learning FAQs What are the aims of the course? This course
Data Quality Working Group
Data Quality Working Group In order to get an overview of the issues that have been discussed so far, I have summarized them a little and added some links and references. The interesting discussion is
Publishing Linked Data Requires More than Just Using a Tool
Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,
CitationBase: A social tagging management portal for references
CitationBase: A social tagging management portal for references Martin Hofmann Department of Computer Science, University of Innsbruck, Austria [email protected] Ying Ding School of Library and Information Science,
Assign: Unit 1: Preparation Activity page 4-7. Chapter 1: Classifying Life s Diversity page 8
Assign: Unit 1: Preparation Activity page 4-7 Chapter 1: Classifying Life s Diversity page 8 1.1: Identifying, Naming, and Classifying Species page 10 Key Terms: species, morphology, phylogeny, taxonomy,
Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction
Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Nelson Piedra, Jorge López, Janneth Chicaiza, Universidad Técnica Particular de Loja, Ecuador [email protected],
Assessment by Land Use Change using SI models in Khon Kaen, Thailand
Assessment by Land Use Change using SI models in Khon Kaen, Thailand Hideyuki ITO Keiji WATANABE Ministry of Economy, Trade and Industry, Japan Kiichiro HAYASHIT Institute of Materials and Systems for
Host specificity and the probability of discovering species of helminth parasites
Host specificity and the probability of discovering species of helminth parasites 79 R. POULIN * and D. MOUILLOT Department of Zoology, University of Otago, P.O. Box 6, Dunedin, New Zealand UMR CNRS-UMII
Presente e futuro del Web Semantico
Sistemi di Elaborazione dell informazione II Corso di Laurea Specialistica in Ingegneria Telematica II anno 4 CFU Università Kore Enna A.A. 2009-2010 Alessandro Longheu http://www.diit.unict.it/users/alongheu
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan
Semantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
Industry 4.0 and Big Data
Industry 4.0 and Big Data Marek Obitko, [email protected] Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and
Towards a Sales Assistant using a Product Knowledge Graph
Towards a Sales Assistant using a Product Knowledge Graph Haklae Kim, Jungyeon Yang, and Jeongsoon Lee Samsung Electronics Co., Ltd. Maetan dong 129, Samsung-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do 443-742,
Linked Statistical Data Analysis
Linked Statistical Data Analysis Sarven Capadisli 1, Sören Auer 2, Reinhard Riedl 3 1 Universität Leipzig, Institut für Informatik, AKSW, Leipzig, Germany, 2 University of Bonn and Fraunhofer IAIS, Bonn,
LDIF - Linked Data Integration Framework
LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany [email protected],
How To Write A Network Analysis
FishGraph: A Network-Driven Data Analysis Patrícia Cavoto*, Victor Cardoso*, Régine Vignes Lebbe, André Santanchè* *UNICAMP University ofcampinas, São Paulo, Brasil ISYEB - UMR 7205 CNRS, MNHN, UPMC, EPHE
Interpretation of Data (IOD) Score Range
These Standards describe what students who score in specific score ranges on the Science Test of ACT Explore, ACT Plan, and the ACT college readiness assessment are likely to know and be able to do. 13
eflora and DialGraph, tools for enhancing identification processes in plants Fernando Sánchez Laulhé, Cecilio Cano Calonge, Antonio Jiménez Montaño
Nimis P. L., Vignes Lebbe R. (eds.) Tools for Identifying Biodiversity: Progress and Problems pp. 163-169. ISBN 978-88-8303-295-0. EUT, 2010. eflora and DialGraph, tools for enhancing identification processes
Mobilising Vegetation Plot Data: the National Vegetation Survey Databank. Susan Wiser April 2016 http://nvs.landcareresearch.co.nz
Mobilising Vegetation Plot Data: the National Vegetation Survey Databank Susan Wiser April 2016 http://nvs.landcareresearch.co.nz Nationally Significant Databases and Collections http://natsigdc.landcareresearch.co.nz/natsigdc_list.html
Web NDL Authorities: Authority Data of the National Diet Library, Japan, as Linked Data
Submitted on: 6/20/2014 Web NDL Authorities: Authority Data of the National Diet Library, Japan, as Linked Data Tadahiko Oshiba Library Support Division, Kansai-kan of the National Diet Library, Kyoto,
Understanding by Design. Title: BIOLOGY/LAB. Established Goal(s) / Content Standard(s): Essential Question(s) Understanding(s):
Understanding by Design Title: BIOLOGY/LAB Standard: EVOLUTION and BIODIVERSITY Grade(s):9/10/11/12 Established Goal(s) / Content Standard(s): 5. Evolution and Biodiversity Central Concepts: Evolution
Joan Thorne (Editor, Zoological Record) BIOSIS UK, 54 Micklegate, York, North Yorkshire YO1 6WF, U.K. (e-mail: [email protected].
Zoological Record and registration of new names in zoology Joan Thorne (Editor, Zoological Record) BIOSIS UK, 54 Micklegate, York, North Yorkshire YO1 6WF, U.K. (e-mail: [email protected]) Abstract.
LINKED OPEN DRUG DATA FROM THE HEALTH INSURANCE FUND OF MACEDONIA
LINKED OPEN DRUG DATA FROM THE HEALTH INSURANCE FUND OF MACEDONIA Milos Jovanovik, Bojan Najdenov, Dimitar Trajanov Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University Skopje,
Name Class Date. binomial nomenclature. MAIN IDEA: Linnaeus developed the scientific naming system still used today.
Section 1: The Linnaean System of Classification 17.1 Reading Guide KEY CONCEPT Organisms can be classified based on physical similarities. VOCABULARY taxonomy taxon binomial nomenclature genus MAIN IDEA:
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article
A Survey on: Efficient and Customizable Data Partitioning for Distributed Big RDF Data Processing using hadoop in Cloud.
A Survey on: Efficient and Customizable Data Partitioning for Distributed Big RDF Data Processing using hadoop in Cloud. Tejas Bharat Thorat Prof.RanjanaR.Badre Computer Engineering Department Computer
COLINDA: Modeling, Representing and Using Scientific Events in the Web of Data
COLINDA: Modeling, Representing and Using Scientific Events in the Web of Data Selver Softic 1, Laurens De Vocht 2, Martin Ebner 1, Erik Mannens 2, and Rik Van de Walle 2 1 Graz University of Technology,
Teacher s Guide For. Core Biology: Environmental Sciences
Teacher s Guide For Core Biology: Environmental Sciences For grade 7 - College Programs produced by Centre Communications, Inc. for Ambrose Video Publishing, Inc. Executive Producer William V. Ambrose
BYODs & FAIR Data Stewardship
BYODs & FAIR Data Stewardship Luiz Olavo Bonino [email protected] www.elixir-europe.org Summary FAIR Data stewardship Approach in NL BYOD FAIR Data tooling ecosystem Way of working (FAIR) Data Stewardship
Open Data Integration Using SPARQL and SPIN
Open Data Integration Using SPARQL and SPIN A Case Study for the Tourism Domain Antonino Lo Bue, Alberto Machi ICAR-CNR Sezione di Palermo, Italy Research funded by Italian PON SmartCities Dicet-InMoto-Orchestra
Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution
Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany [email protected]
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Amar-Djalil Mezaour 1, Julien Law-To 1, Robert Isele 3, Thomas Schandl 2, and Gerd Zechmeister
GetLOD - Linked Open Data and Spatial Data Infrastructures
GetLOD - Linked Open Data and Spatial Data Infrastructures W3C Linked Open Data LOD2014 Roma, 20-21 February 2014 Stefano Pezzi, Massimo Zotti, Giovanni Ciardi, Massimo Fustini Agenda Context Geoportal
Environment and Natural Resources Trust Fund 2016 Request for Proposals (RFP)
Project Title: Total Project Budget: Environment and Natural Resources Trust Fund 2016 Request for Proposals (RFP) Scientific Asset Management: Digital Preservation for Future Generations Category: Proposed
RDF Dataset Management Framework for Data.go.th
RDF Dataset Management Framework for Data.go.th Pattama Krataithong 1,2, Marut Buranarach 1, and Thepchai Supnithi 1 1 Language and Semantic Technology Laboratory National Electronics and Computer Technology
An Ontology in Project Management Knowledge Domain
An Ontology in Knowledge Domain T.Sheeba, Muscat College, P.O.Box:2910, P.C:112, Ruwi, Sultanate of Oman Reshmy Krishnan, PhD. Muscat College, P.O.Box:2910, P.C:112, Ruwi, Sultanate of Oman M.Justin Bernard,
San Diego Zoo Global Advanced Inquiry Program Course Descriptions*
San Diego Zoo Global Advanced Inquiry Program Course Descriptions* Foundations of Inquiry 3 credits; Required as first course, Summer Year 1 5 in person discussion dates combined with online content In
Application of ontologies for the integration of network monitoring platforms
Application of ontologies for the integration of network monitoring platforms Jorge E. López de Vergara, Javier Aracil, Jesús Martínez, Alfredo Salvador, José Alberto Hernández Networking Research Group,
Secure Semantic Web Service Using SAML
Secure Semantic Web Service Using SAML JOO-YOUNG LEE and KI-YOUNG MOON Information Security Department Electronics and Telecommunications Research Institute 161 Gajeong-dong, Yuseong-gu, Daejeon KOREA
It s all around the domain ontologies - Ten benefits of a Subject-centric Information Architecture for the future of Social Networking
It s all around the domain ontologies - Ten benefits of a Subject-centric Information Architecture for the future of Social Networking Lutz Maicher and Benjamin Bock, Topic Maps Lab at University of Leipzig,
Environment and Natural Resources Trust Fund (ENRTF) M.L. 2015 Work Plan
Date of Report: 15 October 2015 Date of Next Status Update Report: 31 December 2015 Date of Work Plan Approval: Project Completion Date: 30 June 2018 Does this submission include an amendment request?
ITIS. ITIS Data Model. Entity Relationships and Element Definitions. Revision 2.1 7/20/2015
ITIS ITIS Data Model Entity Relationships and Element Definitions Revision 2.1 7/20/2015 This document is designed to provide information on ITIS data semantics. It describes what is stored, their characteristics,
The Ontological Approach for SIEM Data Repository
The Ontological Approach for SIEM Data Repository Igor Kotenko, Olga Polubelova, and Igor Saenko Laboratory of Computer Science Problems, Saint-Petersburg Institute for Information and Automation of Russian
HadoopSPARQL : A Hadoop-based Engine for Multiple SPARQL Query Answering
HadoopSPARQL : A Hadoop-based Engine for Multiple SPARQL Query Answering Chang Liu 1 Jun Qu 1 Guilin Qi 2 Haofen Wang 1 Yong Yu 1 1 Shanghai Jiaotong University, China {liuchang,qujun51319, whfcarter,yyu}@apex.sjtu.edu.cn
MBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world.
MBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world. Pierre Venuat, University of Poitiers Mentors: Brian Schlining and Nancy Jacobsen Stout
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint Christian Fillies 1 and Frauke Weichhardt 1 1 Semtation GmbH, Geschw.-Scholl-Str. 38, 14771 Potsdam, Germany {cfillies,
BIO 182 General Biology (Majors) II with Lab. Course Package
BIO 182 General Biology (Majors) II with Lab (Title change ONLY Oct. 2013) Course Package Modification Approved February 23, 2005 Modified April 3, 2009 COURSE INFORMATION New Course Course Modification
BSc Botany and Zoology For students entering Part 1 in October 2006
BSc Botany and Zoology For students entering Part 1 in October 2006 Awarding Institution: Teaching Institution: Relevant QAA subject benchmarking group(s): Faculty of Life Sciences Date of specification:
Converging Web-Data and Database Data: Big - and Small Data via Linked Data
DBKDA/WEB Panel 2014, Chamonix, 24.04.2014 DBKDA/WEB Panel 2014, Chamonix, 24.04.2014 Reutlingen University Converging Web-Data and Database Data: Big - and Small Data via Linked Data Moderation: Fritz
Modeling an Ontology for Managing Contexts in Smart Meeting Space
Modeling an Ontology for Managing Contexts in Smart Meeting Space Mohammad Rezwanul Huq, Nguyen Thi Thanh Tuyen, Young-Koo Lee, Byeong-Soo Jeong and Sungyoung Lee Department of Computer Engineering Kyung
Data Registry Workshop Report
Data Registry Workshop Report Background A Joint Working Group on Data Sharing and Archiving (JWG), representing major professional societies that publish ecology, evolution, and organismal biology journals,
