ipg2p Data Integration: Semantic Web Services Ground Assessment

Similar documents
Describing Web Services for user-oriented retrieval

arxiv: v1 [cs.dl] 20 May 2013

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

CONTEMPORARY SEMANTIC WEB SERVICE FRAMEWORKS: AN OVERVIEW AND COMPARISONS

Service Oriented Architecture

Using Ontology with Semantic Web Services to Support Modeling in Systems Biology

On the Standardization of Semantic Web Services-based Network Monitoring Operations

SmartLink: a Web-based editor and search environment for Linked Services

A Collaborative System Software Solution for Modeling Business Flows Based on Automated Semantic Web Service Composition

A QoS-aware Method for Web Services Discovery

A Service-oriented Data Integration and Analysis Environment for In Silico Experiments and Bioinformatics Research

Ontological Identification of Patterns for Choreographing Business Workflow

Vertical Integration of Enterprise Industrial Systems Utilizing Web Services

Extending SOA Infrastructure for Semantic Interoperability

Soaplab - a unified Sesame door to analysis tools

Protecting the Hosted Application Server

Incorporating Semantic Discovery into a Ubiquitous Computing Infrastructure

Semantic and Personalised Service Discovery

Data integration options using the Semantic Web. David Shotton

Research on the Model of Enterprise Application Integration with Web Services

NIST s Guide to Secure Web Services

Evaluating Semantic Web Service Tools using the SEALS platform

Service-Oriented Architecture: Analysis, the Keys to Success!

Semantification of Query Interfaces to Improve Access to Deep Web Content

Enabling Business Experts to Discover Web Services for Business Process Automation. Emerging Web Service Technologies

BioMOBY: An open source biological web services proposal Mark D. Wilkinson and Matthew Links Date received (in revised form): 24th September 2002

Web 3.0 image search: a World First

The Ontology and Architecture for an Academic Social Network

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Semantic Transformation of Web Services

Semantically Enhanced Web Personalization Approaches and Techniques

A Survey on Approaches for Interoperability and Portability of Cloud Computing Services

I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION

e-science Technologies in Synchrotron Radiation Beamline - Remote Access and Automation (A Case Study for High Throughput Protein Crystallography)

LDaaSWS: Toward Linked Data as a Semantic Web Service

ACE GIS Project Overview: Adaptable and Composable E-commerce and Geographic Information Services

Ontology-based Web Service Composition: Part 1. Rolland Brunec Betreuerin: Sabine Maßmann Universität Leipzig, Abteilung Datenbanken

Emerging Web Services Technologies WiSe 2009/2010. Tools for Semantic Web Services

The open source ISA sooware suite and its internaqonal user community:

WHITE PAPER SPLUNK SOFTWARE AS A SIEM

SERVICE-ORIENTED ARCHITECTURE FOR INTEGRATION OF BIOINFORMATIC DATA AND APPLICATIONS. A Dissertation. Submitted to the Graduate School

Secure Semantic Web Service Using SAML

Semantic Enterprise Services Platform: Motivation, Potential, Functionality and Application Scenarios

Toward a Semantic Web service discovery and dynamic orchestration based on the formal specification of functional domain knowledge

agriopenlink: Semantic Services for Adaptive Processes in Livestock Farming

A QoS-Aware Web Service Selection Based on Clustering

Semantic and Personalised Service Discovery

Towards Trusted Semantic Service Computing

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM

PUR1311/19. Request for Information (RFI) Provision of an Enterprise Service Bus. to the. European Bank for Reconstruction and Development

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques

Application of ontologies for the integration of network monitoring platforms

Scientific versus Business Workflows

Building Semantic Content Management Framework

Semantic Information on Electronic Medical Records (EMRs) through Ontologies

1 Introduction FEDERATED THROUGH-LIFE SUPPORT, ENABLING ONLINE INTEGRATION OF SYSTEMS WITHIN THE PLM DOMAIN. Abstract. Jonas Rosén

A Semantic Web of Know-How: Linked Data for Community-Centric Tasks

Ontology based Recruitment Process

Presente e futuro del Web Semantico

Visionet IT Modernization Empowering Change

A Semantic Approach for Access Control in Web Services

Logical Framing of Query Interface to refine Divulging Deep Web Data

Exploiting Semantic of Web Services through ebxml Registries

Christoph Bussler. B2B Integration. Concepts and Architecture. With 165 Figures and 4 Tables. IIIBibliothek. Springer

Contract No. IST Hydra

ONTODESIGN; A DOMAIN ONTOLOGY FOR BUILDING AND EXPLOITING PROJECT MEMORIES IN PRODUCT DESIGN PROJECTS

Bio-DSGS: An Automated Bioinformatics Data Service Generation System

Enhancing E-Learning Architectures A Case Study

Automating Cloud Service Level Agreements using Semantic Technologies

Explorer's Guide to the Semantic Web

A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS

Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition

Run-time Service Oriented Architecture (SOA) V 0.1

Internationalization and Web Services

Usage of Business Process Choreography

<Insert Picture Here> The Evolution Of Clinical Data Warehousing

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

zen Platform technical white paper

Integration of Human Services into Technical GI Service Chains

Fundamentals of Web Programming a

Ontology-Based Discovery of Workflow Activity Patterns

Web-Based Genomic Information Integration with Gene Ontology

Service-Oriented Architectures

Agents and Web Services

An ARIS-based Transformation Approach to Semantic Web Service Development

Module 1. Sequence Formats and Retrieval. Charles Steward

Semantically-enabled Service Oriented Architecture : Concepts, Technology and Application

HEALTH INFORMATION MANAGEMENT ON SEMANTIC WEB :(SEMANTIC HIM)

2 (18) - SOFTWARE ARCHITECTURE Service Oriented Architecture - Sven Arne Andreasson - Computer Science and Engineering.

Ontology construction on a cloud computing platform

Multi-Agent Based Semantic Web Service Model and Ontological Description for Medical Health Planning

BUSINESS PROCESS AND EBXML - WEB SERVICES INTEGRATION PLATFORM, REQUIREMENTS, ARCHITECTURES, SECURITY

A Web Services Created Online Training and Assessment Scheme

Data Integration of Bioinformatics Database Based on Web Services

Distributed Systems Architectures

Delivering the power of the world s most successful genomics platform

Distributed Database for Environmental Data Integration

Annotation for the Semantic Web during Website Development

Transcription:

November Version11.16A 09 ipg2pdataintegration: SemanticWebServices GroundAssessment DamianGessler,Ph.D. SemanticWebArchitect Weassessthestateofsemanticwebservices,withspecialemphasisontheavailabilityofservicesforgenomic sequences and ortholog identification. This report is a request from the ipg2p Data Integration Working Group,November12,2009. Wereportthatdeploymentsofsemanticwebservices,initiallyrestrictedaswebservicesthatadheretoone ofthetwow3crecommendationsofeithersawsdl(semanticannotationsforwsdl[webservicedefinition Language]) or POWDER (Protocol for Web Description Resources) are essentially non existent. While numerousresearch gradeimplementationsofsawsdlexist(perhapsafewdozentoafewhundred),neither W3Crecommendedtechnologyhasbeenadoptedbyanybroadcommunity. Recognizing that the broader informatic community has not yet standardized on a semantic web service technology,wenote: 1. Webservices astraditionalwebserviceimplementationsusingsoap,wsdl,and/orrest interfaces,butexplicitlywithoutsemanticsupport arenumerous.communityexamplesinclude NCBIwebservices,EBIwebservices,andGramenewebservices. 2. SemanticwebservicesusingtheNSF fundedtechnologysswap(simplesemanticwebarchitecture andprotocol)aredeployedatgramene,soybase,andthelegumeinformationsystem.these servicescovervariousgenomic,transcriptomic,geneticandphysicalmapping,andqtl(quantitative TraitLocus)services.Orthologsupportcurrentlyexistsonlyfordiscovery.

2iPG2P SemanticWebServicesGroundAssessment Purposeofthisdocument To assess the current state of semantic web services relevant to ipg2p goals, with specific attention to genomic sequences and ortholog identification. Audience ipg2p Data Integration Working Group Synopsis W3C semantic web service technology recommendations (SAWSDL and POWDER) have yet to receive broad implementation or adoption. Non W3C technologies exist; of these the author s work in SSWAP (Simple Semantic Web Architecture and Protocol) currently has the largest deployment in biology as restricted specifically to semantic web services. Introduction Semantic web services are an emerging technology. The need for enabling computers to assess information based on context (in contradistinction to more simplistic lexical or syntactical equivalency) is widely recognized. If computers could do this, they could, for example, discover, assess, retrieve, aggregate, assimilate, and even aid in integrating data and services from a broad and disparate range of sources. Today, context determination can be done in a lowthroughput, one off manner (i.e., with some component of directed, human intervention at one or more points in the data flow), but no system is yet widely deployed such that this can be done in a high throughput manner across the web. By high throughput we mean an automated, scalable process that can be applied to both existing and new entrants. Specifically, we note that widely deployed web service installations have nonexistent to poor explicit semantics amenable to high throughput machine reasoning. (Non semantic) web services, such as those at NCBI, EBI, Amazon, Google, and others, are high throughput in terms of syntactical, but not semantic, processing. This means that web services while enabling high throughput data transfer once engaged remain low throughput in the areas of discovery and automated determination of suitability for purpose. Web services thus remain lowthroughput (non scalable) as data integration technologies. For example, there is virtually no support for machines to determine if one web service s use of the token Gene is semantically equivalent to another web service s use of Gene. Blind aggregation across semantically dissimilar tokens will result in erroneous service engagement, information flow, and data assimilation. Semantic web technologies excel in empowering high throughput semantic determination, but lack sufficient web service protocols and supporting informatics. Semantic web technologies such as OWL (the W3C recommended Web Ontology Language) are theoretically applicable to web service applications, but to date largely lack the specifics to allow for off the shelf implementation. Semantic web services aims to address the limitations of separate pure play web service or semantic web technologies; i.e., they aim to bring either semantics to web services, or web service capabilities to semantics, but regardless of the approach, they aim to enable semantically aware, high throughput resource discovery and engagement.

SemanticWebServicesGroundAssessment ipg2p3 Methods To assess the state of semantic web services we: 1. Identified W3C recommended technologies for semantic web services. There are two, somewhat competing, technologies reflecting the influence, contributions, and legacy of two schools of thought: a. Bring semantics to web services The first is a response by the web services community to bring semantics to web services. This is formalized in a W3C recommended technology called SAWSDL (Semantic Annotations for WSDL) 1. WSDL (Web Service Description Language) is a well recognized, non semantic technology for web service description. b. Bring web services to semantics The second is a response by the semantic web community to bring web service support to the semantic web. This is formalized in a W3C recommended technology called POWDER (Protocol for Web Description Resources) 2. POWDER is instantiated in a semantic POWDER S document, which grounds POWDER in the well recognized semantic web language of OWL (Web Ontology Language) 3. 2. To assess semantic web service adoption, we searched for published reports of SAWSDL and POWDER implementations across: a. Two hundred and five BMC Central journals, notably including BMC Bioinformatics; b. Ninety Oxford Journals notably including Bioinformatics, Briefings in Bioinformatics, and Nucleic Acids Research; c. Two thousand Elsevier journals notably including the Journal of Web Semantics; d. Over 2000 Springer journals notably including LNCS (Lecture Notes in Computer Science); 3. Notable systems (not all true semantic web services) that do not use SAWSDL or POWDER include: Results a. BioMoby b. SADI c. MyGrid d. SSWAP SAWSDL and POWDER show virtually no implementation in our search across BMC Central, Oxford Journals, and Elsevier journals. In LNCS there are over 140 papers referencing SAWSDL to some extent, though few implementations. Of the implementations reported, virtually all are of research grade trials and prototypes. POWDER had substantially fewer reports than SAWSDL. These results 1 http://www.w3.org/2002/ws/sawsdl 2 http://www.w3.org/2007/powder 3 http://www.w3.org/2007/owl

4iPG2P SemanticWebServicesGroundAssessment support an expert opinion reflected in various points in the literature and on the web that there exists no de facto standard for semantic web services. The relative lack of W3C semantic web service implementations reflects the early state of this technology development. Similarly, there are relatively few non W3C models. We briefly note four: 1. BioMoby (also known as MOBY Services) has no support for formal semantics (e.g., it does not require OWL compliance or compliance with the framework of any formal logic system), and thus is not a semantic web service technology in the sense of refs. [4, 5, 6], but it does support limited semantics in its use of controlled vocabularies and subclass relations. BioMoby has hundreds of implementations. BioMoby s use of controlled vocabularies in subsumption hierarchies supports an ad hoc application of subsumption logic. BioMoby s successor (pre release MOBY 2) is SADI (Semantic Automated Discovery and Integration). 2. SADI (Semantic Automated Discovery and Integration) (MOBY 2) is a substantial technological change from BioMoby. Unlike BioMoby, SADI is OWL compliant but does not use POWDER. SADI is not a new technology per se, but proposes a model for how to use OWL to achieve some of the goals of semantic web services. Its use is focused on allowing the distribution of SPARQL queries across OWL compliant sites. SADI is newly released in 2009; more information is available at http://sadiframework.org. 3. The European escience program and specifically the mygrid project had a notable and early history in ontologies and semantic web service models (viz., the mygrid Ontology, Feta; see also www.semanticgrid.org). In more recent years, mygrid appears to have de emphasized semantic web services and has focused more heavily in maturing workflow tools (Taverna), social networking for workflows (myexperiment), web service discovery (BioCatalogue). 4. SSWAP 7 (Simple Semantic Web Architecture and Protocol; http://sswap.info) is an NSF funded project of the author. SSWAP is a full featured semantic web service architecture and protocol supporting resource description, publication, discovery, querying, invocation, and data return. SSWAP supports a model of third party, extensible ontologies under the formal logic of OWL DL. SSWAP is implemented at Gramene (10 services), Soybase (13 services), and the Legume Information System (3 services). Plans are underway to implement four services at the Plant Ontology. The LIS sequence retrieval service returns over 2,000,000 DNA and RNA sequences keyed on accession IDs. Gramene, Soybase, and LIS services have full ontological markup on both input and output data types. From a knowledge base of over 2400 resources, SSWAP discovers 57 phylogeny reconstruction services (relevant for 4 McIlraith et al. Semantic Web services. Intelligent Systems, IEEE (2001) 16(2):46 53 5 Bruijn et al. Semantic Web Services. Modeling Semantic Web Services (2008) Springer Berlin Heidelberg. pp. 9 21 6 Haniewicz et al. Semantic Web Services Applications a Reality Check. Wirtschaftsinformatik (2008) 50(1): 39 46 7 Gessler et al. SSWAP: A Simple Semantic Web Architecture and Protocol for semantic web services. BMC Bioinformatics (2009) 10:309.

SemanticWebServicesGroundAssessment ipg2p5 ortholog identification or retrieval) as classified by the NAR (Nucleic Acid Research) categories. These have markup only on the service classification, not on the input and output data types. SSWAP is OWL compliant but does not use POWDER. Conclusion It is hopeful that a W3C recommendation will lead to standardization for semantic web services. Yet historical examples of technologies such as CORBA, ebxml, UDDI, and many others shows that industry support and official standards are no guarantee for wide spread adoption. Indeed, early use of standards may carry far greater risk in terms of a lower ROI (return on investment) than an early adoption of simply something that works. Conversely, use of something that works may drive and create a de facto standard that becomes a recommendation exactly because of its real world problem solving ability (e.g.,. XML). In light of this, at the time of this writing we should neither adopt (nor dismiss) SAWSDL or POWDER because they are W3C recommendations: we should assess their use in iplant with heavy emphasis on how they solve our real world problems, while being cognizant of changes in their trend for broader adoption.