A semantic overlay for service discovery across Web Information Systems



Similar documents
Using Peer to Peer Dynamic Querying in Grid Information Services

WSPDS: Web Services Peer-to-peer Discovery Service

INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS

Automated Retrieval of Semantic Web Services: A Matching Based on Conceptual Indexation

Semantics-enabled Web API Organization and Recommendation

Digital resource discovery: semantic annotation and matchmaking techniques

Enhance UDDI and Design Peer-to-Peer Network for UDDI to Realize Decentralized Web Service Discovery

A Collaborative and Semantic Data Management Framework for Ubiquitous Computing Environment

Grid Data Integration based on Schema-mapping

Super-Consumer Based Reputation Management for Web Service Systems

Distributed Database for Environmental Data Integration

Learning Domain Ontology for Web Services

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Varalakshmi.T #1, Arul Murugan.R #2 # Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam

International journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online

Concept-Based Discovery of Mobile Services

XOP: Sharing XML Data Objects through Peer-to-Peer Networks

GRIDB: A SCALABLE DISTRIBUTED DATABASE SHARING SYSTEM FOR GRID ENVIRONMENTS *

Cloud-dew architecture: realizing the potential of distributed database systems in unreliable networks

Application of ontologies for the integration of network monitoring platforms

Supporting Change-Aware Semantic Web Services

Schema Mappings and Agents Actions in P2P Data Integration System 1

Load Balancing in Structured Peer to Peer Systems

Towards P2P-based Computer Network Management *

Load Balancing in Structured Peer to Peer Systems

An Overview of Search Strategies in Distributed Environments

RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT

Autoconfiguration and maintenance of the IP address in ad-hoc mobile networks

Semantic-based Smart Homes: a Multi-Agent Approach

EXPLOITING FOLKSONOMIES AND ONTOLOGIES IN AN E-BUSINESS APPLICATION

Requirements for Context-dependent Mobile Access to Information Services

Semantic Search in Portals using Ontologies

Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems

Towards Trusted Semantic Service Computing

A P2P SERVICE DISCOVERY STRATEGY BASED ON CONTENT

Incorporating Semantic Discovery into a Ubiquitous Computing Infrastructure

Architecture-based QoS Prediction for Service-oriented Computing

Content Delivery Network (CDN) and P2P Model

Service Oriented Architecture

A Semantic Marketplace of Peers Hosting Negotiating Intelligent Agents

Data Management in P2P Systems: Challenges and Research Issues. Timos Sellis National Technical Univ. of Athens

Information Services for Smart Grids

A QoS-Aware Web Service Selection Based on Clustering

Using Ontologies to Enhance Data Management in Distributed Environments

A PROXIMITY-AWARE INTEREST-CLUSTERED P2P FILE SHARING SYSTEM

Simulating a File-Sharing P2P Network

Query Reformulation over Ontology-based Peers (Extended Abstract)

Ontological Identification of Patterns for Choreographing Business Workflow

Composite Process Oriented Service Discovery in Preserving Business and Timed Relation

Some Research Challenges for Big Data Analytics of Intelligent Security

OWL based XML Data Integration

Costs and Benefits of Reputation Management Systems

Interacting the Edutella/JXTA Peer-to-Peer Network with Web Services

SmartLink: a Web-based editor and search environment for Linked Services

CLOUD BASED PEER TO PEER NETWORK FOR ENTERPRISE DATAWAREHOUSE SHARING

Dynamic Web Service Calls for Data Integration

WSPeer - An Interface to Web Service Hosting and Invocation

A Hybrid Approach for Ontology Integration

UNIVERSITY OF TRENTO A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM. Albena Roshelova. June Technical Report # DIT

LOAD BALANCING WITH PARTIAL KNOWLEDGE OF SYSTEM

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

An ARIS-based Transformation Approach to Semantic Web Service Development

A Case Study of Question Answering in Automatic Tourism Service Packaging

The Role of Ontologies in Data Integration

A Method of Web Service Discovery based on Semantic Message Bipartite Matching for Remote Medical System

Exploiting peer group concept for adaptive and highly available services

Analysis on Leveraging social networks for p2p content-based file sharing in disconnected manets

PROPOSAL AND EVALUATION OF A COOPERATIVE MECHANISM FOR HYBRID P2P FILE-SHARING NETWORKS

On the Standardization of Semantic Web Services-based Network Monitoring Operations

CONTEMPORARY SEMANTIC WEB SERVICE FRAMEWORKS: AN OVERVIEW AND COMPARISONS

Digital libraries of the future and the role of libraries

SERENITY Pattern-based Software Development Life-Cycle

Optimised Realistic Test Input Generation

How To Model Web Service Quality Composition

The Role and uses of Peer-to-Peer in file-sharing. Computer Communication & Distributed Systems EDA 390

SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

A Workbench for Prototyping XML Data Exchange (extended abstract)

Evaluating Semantic Web Service Tools using the SEALS platform

D1.1 Service Discovery system: Load balancing mechanisms

Model-driven Adapter Development for Web Services Interactions

Mobile App Discovery through Conceptual Models

Balancing Gossip Exchanges in Networks with Firewalls

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

Automatic Annotation Wrapper Generation and Mining Web Database Search Result

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

AN EFFICIENT DISTRIBUTED CONTROL LAW FOR LOAD BALANCING IN CONTENT DELIVERY NETWORKS

Semantically Enhanced Web Personalization Approaches and Techniques

An Ants Algorithm to Improve Energy Efficient Based on Secure Autonomous Routing in WSN

A Secure Mediator for Integrating Multiple Level Access Control Policies

Collaboration on the Social Semantic Desktop. Groza, Tudor; Handschuh, Siegfried

Super-Agent Based Reputation Management with a Practical Reward Mechanism in Decentralized Systems

Grid Data Integration Based on Schema Mapping

Efficient Intelligent Secure for Web Service Composition

Introduction to UDDI: Important Features and Functional Concepts

PEER-TO-PEER (P2P) systems have emerged as an appealing

Topic Communities in P2P Networks

Locality Based Protocol for MultiWriter Replication systems

A Semantic Approach for Access Control in Web Services

A Semantically Enriched Competency Management System to Support the Analysis of a Web-based Research Network

Transcription:

A semantic overlay for service discovery across Web Information Systems Devis Bianchini, Valeria De Antonellis, Michele Melchiori and Denise Salvi Università di Brescia Dip. Elettronica per l Automazione Via Branze, 38 25123 Brescia - Italy bianchin deantone melchior salvi@ing.unibs.it Abstract. Nowadays many collaborative organizations require advanced semantic interoperability tools to enable cooperation and communication across distributed Web Information Systems (WIS). Adopting the service oriented technology, they have improved interoperability at the application level by using services for WIS functionalities provisioning. Furthermore, in order to support effective peer-to-peer collaboration, they require semantic interoperability techniques for service discovery and sharing. In this paper, we focus on semantic interoperability issues for distributed collaboration and provide techniques for building a service semantic overlay, across heterogeneous WIS. In particular, semantic links among peers that offer comparable services in a given domain are defined and maintained over the time. The service semantic overlay is ontologybased for formally organizing the shared services to enhance the capability of service interchange and interoperation among the collaborative systems. The approach and prototype have been proposed to demonstrate practical benefits in the framework of the ESTEEM (Emergent Semantics and cooperation in multi-knowledge EnvironMents) project. 1 Introduction Nowadays many collaborative organizations require advanced semantic interoperability tools to enable cooperation and communication across distributed Web Information Systems (WIS). Methods and tools devoted to the interconnection of the systems according to a peer-to-peer approach have enabled collaboration at the technological level. Adoption of service oriented technology has improved interoperability at the application level by using services for WIS functionalities provisioning. Furthermore, in order to support effective peer-to-peer collaboration, they require semantic interoperability techniques for service discovery and sharing. We focus on the application level and, specifically, provide techniques This work has been partially supported by the ESTEEM PRIN Project (http://www.dis.uniroma1.it/ esteem/index.html) and TEKNE FIRB Project (http://www.tekne-project.it/), funded by the Italian Ministry of Education, University and Research.

for service analysis and organization in order to build a service semantic overlay, across heterogeneous Web Information Systems, by means of semantic links among peers offering comparable services (semantic neighbors). For service analysis and automated matchmaking, we have already proposed the fc(functional Comparison)-match approach characterized by a novel hybrid matchmaking and discovery environment [7]. In this paper, we propose a P2P-based Semantic Driven Service Discovery (P2P-SDSD) approach, where an ontological framework supports the service semantic overlay construction, devoted to the definition and maintenance of semantic links among services belonging to different peers. Semantic links between two service specifications are computed with respect to the service interfaces, described in terms of the entities exchanged with the service during its execution (input and output parameters) and the operations performed on these entities. Two services are similar with respect to their functionality if they perform the same or similar operations on the same or similar entities. Following this perspective, we have defined an ontology-based approach to establish semantic links between services belonging either to a single peer (intra-peer semantic links) or to different peers (inter-peer semantic links); in this way it is possible to identify possible synergic service centres over the network. Each peer maintains semantic links of interest (preferential semantic links, selected according to threshold based criteria) together with local service descriptions. When a peer receives a service request, semantic links are exploited to optimize local search within a single peer and distributed search of possible candidate services over the P2P network. Contributions of this work are: the development of techniques for semantic link definition and organization without heavily affecting the P2P network workload; their exploitation process, according to optimization policies, to support effective and efficient service discovery in the collaborative scenario. The approach descrided in this paper is part of the ESTEEM project [14], where peers of a P2P network are organized in semantic communities to improve contextaware and trust-based data and service discovery. The paper is organized as follows: Section 2 introduces the considered application scenario; Section 3 describes the ontology-based framework for service semantic overlay management; Section 4 illustrates the service discovery process based on the semantic overlay; Section 5 presents some experimental results; Section 6 discusses related work and points out some differences with our approach; finally, Section 7 presents concluding remarks and future work. 2 Application scenario Let s consider a community of peers in a given application domain which aim at providing and retrieving services. In our vision, each service has an abstract description (abstract service) in terms of service functionalities (operations) and input/output messages (parameters), based on the WSDL standard. For each abstract service, a set of concrete implementations are provided on the network. Two distinct peers could register the same abstract service, for which different

implementations could be provided, or they could advertise different abstract services that could partially overlap. When a peer is looking for a service, it formulates the service request by specifying the expected functional interface and sends the request to one of the peers of the community. According to this vision and to the service-oriented paradigm, each peer can play three different roles: (i) it can store abstract services and references to corresponding concrete implementations (broker); (ii) it can publish on a broker of the community the implementation of a service, described by its functional interface (provider); (iii) it can look for a service (requester). When a peer joins the collaborative community, it can act both as a broker/provider or a requester. When a peer P X joins the community site, it obtains the list of brokers IP currently available in the community and starts to contact them. When a broker P Y replies, P X can be connected to it. If P X is a provider, it publishes its services on broker P Y and waits for a request to serve. If P X is a requester, it sends a request to the broker P Y and waits for the answer. Finally, if P X is a broker in the community, it is connected to P Y, from which it receives a list of brokers known by P Y through a shuffling-based mechanism [16] carried out by technological level, that allows more effective information diffusion among brokers. In fact, according to this mechanism, two brokers periodically select and exchange a subset of brokers connected to them, randomly chosen. Brokers are in charge of establishing and maintaining semantic links towards other brokers on the basis of similarity between abstract services they store. Semantic links constitute a semantic overlay over the community of collaborative peers. In the rest of the paper, we will explain how the semantic overlay is built, maintained and exploited for service discovery purposes. 3 Ontology-based model for the service semantic overlay Broker service registry. Each broker in the network has its local: (i) UDDI Registry, where concrete service implementations are registered and associated to their abstract services through tmodels; (ii) a peer ontology, that provides a conceptualization of abstract service operations (e.g., flight reservation) and I/O parameters (e.g., electronic ticket) through concepts and semantic relationships between them; (iii) a standard service categorization, denoted with Service Category Taxonomy (SCT), extracted from available standard taxonomies (e.g., UNSPSC, NAiCS) to conceptualize service domain. Abstract services are associated to SCT categories in the UDDI Registry. Both the peer ontology and SCT are represented using OWL-DL formalism. The peer ontology and the SCT are augmented by a thesaurus containing terms that are related by terminological relationships (as synonymy or hypernymy) to the names of concepts and categories. The thesaurus is automatically built from the general, domain independent source of lexical information Word- Net [8]. By means of the thesaurus, peer matching capabilities based on the peer ontology and categories are extended. More details about the combined use of ontologies and thesaurus can be found in [7].

Fig.1. Service semantic overlay built on the P2P logical network. Broker service matchmaker. Each broker in the community is endowed with a matchmaker. In [7] we defined a hybrid matchmaking strategy, where a deductive matchmaking model is combined with a similarity-based model to compare services on the basis of their functional interface. The deductive matchmaking is used to qualify the kind of match MatchType (S 1,S 2 ) between two abstract services S 1 and S 2. According to this matchmaking model, it is possible to state if S 1 and S 2 provide the same functionalities (S 1 Exact S 2 ), if S 1 provide additional functionalities with respect to S 2 (S 1 Extends S 2 ) or viceversa, if there is a non empty intersection between functionalities provided by S 1 and S 2 (S 1 Intersects S 2 ) or if S 1 and S 2 have nothing in common (S 1 Mismatch S 2 ). In case of partial overlapping among service functionalities (Extends or Intersects) the similarity-based matchmaking model is used to quantify service similarity Sim(S 1, S 2 ) [0, 1] through coefficients properly defined to compare service interfaces. Otherwise, if S 1 Exact S 2 or S 1 Mismatch S 2, then Sim(S 1, S 2 ) = 1.0 or Sim(S 1, S 2 ) = 0.0, respectively. A detailed presentation of the hybrid matchmaking model with presentation of experimental results about the matchmaker is given in [7]. In this paper, we show how to exploit the information derived from the service matchmaking to improve a semantic-driven search of services among collaborative peers. Service semantic overlay. Matchmaking information (MatchType and Sim) are used to define semantic links among abstract services when advertised on brokers. A semantic link between two services S 1 and S 2 is established if they are similar (denoted with S 1 S 2 ), that is, whenmatchtype(s 1,S 2 ) is not Mismatch and Sim(S 1, S 2 ) δ, where δ is a similarity threshold. If S 1 and S 2 are published on the same broker P and S 1 S 2, then an intra-peer semantic link is established

between them, denoted with sl P (S 1, S 2 ). For example, in Figure 1 on broker P X, abstract services S1 X and S2 X are connected to S3 X. In particular, S3 X and S2 X have overlapping functionalities, while S1 X adds functionalities with respect to those of S3 X. Semantic links are labeled with the similarity degree. For example, similarity between S3 X and S2 X is 0.75. If S 1 and S 2 are published on two different brokers P 1 and P 2 and S 1 S 2, an inter-peer semantic link is established between them. To establish inter-peer semantic links, the broker P X sends a probe service request for each service Si X it wants to make sharable to other brokers connected at the logical network overlay in the community. The probe service request contains the interface of Si X and the IP address of P X. The broker P Y that receives the probe service request, matches it against its own abstract services Sj Y by applying the matchmaking techniques based on its own peer ontology and SCT and obtains for each comparison the MatchType and the similarity value. P Y then replies to P X with the list of Sj Y such that Si X Sj Y, together with the MatchType and the similarity value. P X establishes an inter-peer semantic link isl PX P Y (Si X, Sj Y ) for each Sj Y ) and P Y is recognized as a semantic neighbor of P X. Inter-peer semantic links are stored in the Semantic Neighbors Knowledge of broker P X. For example, in Figure 1, service S3 X on broker P X adds functionalities with respect to service S2 Y on broker P Y and the similarity degree is 0.8. Probe service requests are sent according to a Time-To-Live (TTL) mechanism with a low TTL value to avoid network overload. In fact, brokers that cannot be reached from P X, due to the low TTL value, will be reached thanks to the shuffling-based mechanism. Experimentation is being performed to establish the best value of TTL. Logical network overlay evolution. In the considered collaborative scenario there is the need to guarantee the communication among peers and manage the dynamic aspects featuring the P2P network organization. The logical network overlay collects all the peer participating the system: each node represents a peer and each link is a logical connection between two peers. In order to guarantee connection between peers an Overlay Management Protocol (OMP) is used, which defines specific procedures to join, leave and modify the logical network overlay. In our approach a shuffling-based OMP is chosen in order to allow more effective information diffusion among peers [16]. This kind of OMP arranges the logical network overlay as a graph in which each peer is directly connected to a small portion of the entire peer population, that is their logical neighbors. The shuffling protocol is quite simple: each peer continuously changes the set of its logical neighbors by occasionally contacting a random neighbor, then they exchange some of their neighbors. The obtained result is inexpensive overlay membership management, in the sense that any joining or leaving of nodes is quickly and efficiently managed. Moreover, a peer eventually become aware of new peers that are made available on the network. Service semantic overlay evolution. The service semantic overlay has to be managed with respect to changes that occur to the participating brokers:

acquisition of the IP address of a new broker in the logical network overlay, disconnection of a semantic neighbor from the community, publication/cancellation of a service in the broker registry. When the IP of a new broker P Y is acquired due to the shuffling-based protocol at the logical network overlay, then probe service requests are sent to P Y and possibly inter-peer semantic links between P X and P Y are established based on the received answers. A broker P X recognizes that a semantic neighbor P X becomes unavailable if a given number of messages sent to P Y are not answered. In this case the inter-peer semantic links toward abstract services published on P Y are removed from P X. Finally, if a new service Si X is published on a broker P X, a probe service request is sent to semantic neighbors of P X and inter-peer semantic links are established on the basis of obtained answers. If a service is removed, the interpeer semantic links based on it are removed on the broker on which it has been published. 4 Service discovery based on the service semantic overlay A service request S R is formulated by specifying one or more service categories and the desired functional interface. It is sent to broker P X, to which the requester is connected, and it is matched against abstract services published on P X (local search). Afterwards, S R is sent to the other brokers that are semantic neighbors of P X, according to different forwarding policies (distributed search). Each broker, which receives the request, applies the same matching process. Each broker collects its local concrete implementations for the abstract services matching S R with those received from semantic neighbors and sends them back to the broker from which the request came, up to the requester. matchtype(s R,Si X) matchtype(si X,Sj X) Mismatch matchtype(s R,Sj X) R1) Si X Exact S R Si X <matchtype> Sj X Sj X <matchtype> S R R2a) Si X Extends S R Sj X Exact Extends Si X Sj X Extends S R R2b) Sj X Intersects Si X evaluate MatchType(Sj X,S R) R2c) Si X Extends Sj X evaluate MatchType(Sj X,S R) R3a) S R Extends Si X Sj X Exact Si X S R Extends Sj X R3b) Si X Extends Sj X S R Extends Sj X R3c) otherwise evaluate MatchType(Sj X,S R) R4a) Si X Intersects S R Sj X Exact Si X Sj X Intersects S R R4b) otherwise evaluate MatchType(Sj X,S R) Table 1. Exploitation of intra-peer semantic links during local search.

Local search. When the service request S R reaches the broker P X, the broker, according to the categories, searches for abstract services in its own registry to build a list MS(S R ) = { S1 X, Sim 1, mt 1,... Sn X, Sim n, mt n } of matching services such that S R Si X, where Sim i = Sim(S R, Si X ) and mt i =MatchType(S R, Si X ). Intra-peer semantic links on broker P X can be exploited to efficiently find all available services which match locally. In fact, if Si X is found such that S R Si X, then Si X is included into MS(S R ) and only abstract services Sj X related with intra-peer semantic links to Si X are considered as candidate services for the request S R, according to the rules presented in Table 1. After performing local search, broker P X forwards the request to its semantic neighbors. If no matching service has been found, P X selects randomly a set of brokers connected at the logical network overlay and forwards S R to them. Example 1. If S R S3 X such that S3 X Extends S R (Figure 1), then also S R S1 X (S1 X Extends S R ) according to rule R2a on Table 1. The kind of match between S R and S2 X must be evaluated (according to rule R2b), while S4 X is not considered for S R, since no intra-peer semantic links exist between S4 X and the other abstract services relevant for S R. Distributed search. Once matching abstract services have been found on peer P X, two forwarding policies based on inter-peer semantic links can be applied. Minimal policy. Search over the semantic overlay stops when matching services which fully satisfy the request have been found; this strategy is performed according to the following rules: for each Si X MS(S R ) such that Si X Exact Extends S R, it is not necessary to forward the request to semantic neighbors, since concrete implementations of Si X already satisfy completely the request; for each Si X MS(S R ) such that S R Intersects Extends Si X, the request is not completely satisfied by Si X and is forwarded to semantic neighbors, that could add further functionalities to those already provided by Si X ; however, broker P X does not consider semantic neighbors that provide services Sj Y such that Si X Extends Exact Sj Y, because this means that Sj Y does not provide additional functionalities with respect to those already provided by Si X ; if no semantic neighbors exist for any abstract service Si X MS(S R ), the request is forwarded to one of the brokers that are connected to P X in the logical network overlay (randomly chosen); Example 2. According to the first forwarding rule, since S3 X Extends S R on P X (Figure 1), the request is not forwarded to P Y ; moreover, according to rule R2a on Table 1, S1 X Extends S R, then S R is not forwarded towards P Z too. On the other hand, since S R Extends S1 X, the request is forwarded to P Z, where S2 Z Intersects S1 X could provide additional functionalities with respect to S1 X ; moreover, according to rule R3b on Table 1, S R Extends S3 X, but the request is not forwarded to P Y, where no further functionalities are provided with respect to those of S3 X.

Exhaustive policy. This policy follows the same rules, but it does not stop when matching services that fully satisfy the request are found; according to this policy, also for each Si X MS(S R ) such that Si X Exact Extends S R, the request S R is forwarded to semantic neighbors to find other services that could present, for example, better non functional features (not discussed in this paper). A broker which receives S R from a semantic neighbor applies one of the two forwarding policies; the search stops according to a Time To Live mechanism. Example 3. In Figure 1, since S R Extends S1 X (and S R Extends S3 X according to rule R3b on Table 1), the request is forwarded both to broker P Y and broker P Z. 5 Experimental evaluation In this section, we present a preliminary evaluation of performances of our approach. To this purpose, we performed a set of simulations based on Neuro- Grid [11], an extensible network overlay simulator in which we have implemented the P2P-SDSD service semantic overlay network and the minimal forwarding policy explained in Section 4. Simulation Parameter Description N Total number of brokers on the network NS Total number of available services Initial TTL Initial TTL associated to a request message Simulation Type P2P-SDSD or Gnutella γ Probability that two brokers has established semantic links ( (0,1)) ASP Average number of services per broker Fig. 2. Simulation Parameters The simulations we have run compares P2P-SDSD forwarding policy with the Gnutella one [9] both in terms of efficiency and scalability. The choice of a comparison with Gnutella is due to the fact that both P2P-SDSD and Gnutella define an overlay network built on top of an unstructured P2P network. Apart from the architectural similarities, we have considered Gnutella since its message forwarding strategy is well-known and it is frequently considered as a reference example. Some other P2P forwarding strategies have been also considered for a comparison with P2P-SDSD and we plan to perform additional experiments in future work. The experiments have been performed: (i) to demonstrate the better recall results of our distributed search with respect to Gnutella search; (ii) to confirm that the use of the P2P-SDSD request forwarding policy results in an improved scalability. In particular, P2P-SDSD and Gnutella service discovery have been compared on these parameters: Recall: with respect to a submitted request it is the ratio among the number of services retrieved by the distributed search policy and the number of

services retrieved if all offered services would be available on the node posing the request and the local search applied; the higher the recall, the more effective the distributed search by providing an answer similar to a local search. Generated Messages: it is the number of overall messages generated to answer a service request; this parameter depends on the TTL (Time To Live) value and on the peer s average number of connections to its neighbors; the lower the generated messages the better the scalability, since a lower number of messages per request reduces the possibility of network congestion. The different experimental settings for the simulation are obtained by setting some operative parameters shown in Figure 2. These parameters determine how the simulated logical overlay network is built and run. The simulator generates a random P2P network [11] of N peers and NS artificial services. Services are assigned to each peer according to the ASP parameter. The distribution of probability P() of the kind of match between a pair of services has been set as follows: P(Mismatch) = 50%, P(Intersect) = 35%, P(Extends) = 10%, P(Exact) = 5%. The γ value is the probability that a pair of peers has already exchanged probe queries and therefore have possibly established inter-peer semantic links. The Initial TTL determines the maximum number of times a message can be forwarded on the network and therefore the higher this value, the higher the number of peers involved in answering the request. In particular, the results discussed in the following have been obtained with these simulation parameter values: the Initial TTL has been varied between 1 and 12, N=100, γ = 0.60, NS=50 and ASPP=20. 110 100 90 80 70 Recall (%) 60 50 40 30 20 10 P2P SDSD Gnutella 0 0 2 4 6 8 10 12 Initial TTL Fig.3. Recall vs. Initial TTL

Figure 3 compares P2P-SDSD and Gnutella approaches through the Recall curves by setting different values for the Initial TTL parameter. The figure shows how P2P-SDSD overperforms the Gnutella even with low TTL values. This can be explained by the fact that inter-peer semantic links allow to selectively reach the most of the peers that provide relevant services with a low number of request forwardings. As TTL gets higher also the Gnutella performs better since the request reaches the most of the peers in the network. The analysis of generated messages has been performed on a series of 13 simulations with Initial TTL value set to 5 and a network of 100 peers. In each simulation we ran, a request has been submitted and we have collected the results obtained with both P2P-SDSD and Gnutella on the same simulated network. In these simulations the average number of generated messages for P2P-SDSD is nearly 19.31, that is about 50% lower than the value of 38.32 obtained by the Gnutella approach. 6 Related work In literature, P2P semantic-driven resource discovery has attracted much attention from Web services and Semantic Web area and relies on several efforts in related research fields, such as data integration and emergent semantics in P2P environments [1, 6, 10] to go beyond limitations of centralized service-oriented architectures. However, in P2P environments semantic heterogeneity and scalability issues must be addressed. Recent works involved structured [2, 13] and unstructured P2P architectures [12]. The first ones are less flexible and are mainly based on DHT (Distributed Hash Tables) structures, which require much efforts for maintenance. In [5] a P2P-based system to support efficient access to e-catalogs is provided, where scalability issues are solved by organizing information space in communities that are inter-related using peer relationships, defined as mappings between ontologies. Selection of relevant peers to which queries must be forwarded is based on a query rewriting algorithm. Our approach focuses on unstructured P2P architectures, featured by scalability issues, applied to the service discovery problem. Other approaches addressed the same problem. METEOR-S [15] uses a centralized registry ontology to classify peer registries. During the discovery process, registry ontology is browsed to find the proper registry to which the request must be submitted. ARTEMIS [3] defines a P2P network, where each peer has an ontology, based on medical information standards, to annotate services. Peers store the services they provide in a mediator super-peer. A peer sends a request to its reference mediator expressed in terms of its own ontologies; mediator uses ontology mappings to find matching services in its local registry and forwards the request to other mediators. WSPDS [4] describes a P2P network where peers have local DAML-S ontologies to provide service semantics and links with other peers based on an average similarity between services they provide. When a request is submitted to a peer, it searches for local matching results and forwards the request to all the linked peers, independently from the current request or the

local results of the query. In our approach a service semantic overlay is built by relating similar services through inter-peer semantic links in a community of collaborative peers. Like WSPDS, no common ontology is required and no super-peer is defined. However, in our approach, combined use of search results found locally and inter-peer semantic links allows for application of fine-grained request forwarding strategies. 7 Conclusions In this paper, we proposed a P2P service discovery approach to enable cooperation and communication across distributed Web Information Systems, where Web Services are used for WIS functionalities provisioning. Semantic interoperability is based on a service semantic overlay, over the logical network overlay, built by establishing semantic links among peers that offer comparable services. The semantic overlay is exploited to fasten service discovery and improve its efficacy without affecting network overload. Preliminary experiments have been performed to confirm the advantages derived from the exploitation of semantic overlay if compared with traditional Gnutella approach. Further experimentation will evaluate the impact of the proposed approach on open P2P networks, where additional issues must be considered during the construction of semantic overlay, and concrete applications in those kinds of networks will be investigated. References 1. K. Aberer, P. Cudre-Mauroux, and M. Hauswirth. The Chatty Web: Emergent Semantics Through Gossiping. In Proc. of the 12th International World Wide Web Conference, pages 197 206, Budapest, Hungary, 2003. 2. K. Arabshian and H. Schulzrinne. An Ontology-based Hierarchical Peer-to-Peer Global Service Discovery System. Journal of Ubiquitous Computing and Intelligence (JUCI), 2006. 3. The ARTEMIS Project: A Semantic Web Service-based P2P Infrastructure for the Interoperability of Medical Information Systems. http://www.srdc.metu.edu.tr/webpage/ projects/artemis/. 4. F. Banaei-Kashani, C. Chen, and C. Shahabi. WSPDS: Web Services Peer-to-Peer Discovery Service. In Proc. of the Int. Conference on Internet Computing (IC 04), pages 733 743, Las Vegas, Nevada, USA, 2004. 5. B. Benatallah, M.S. Hacid, H.Y. Paik, C. Rey, and F. Toumani. Towards semanticdriven, flexible and scalable framework for peering and querying e-catalog communities. Information Systems, 31(4-5):266 294, 2008. 6. P. Bernstein, F. Giunchigiloa, A. Kementsietsidis, J. Mylopoulos, L. Serafini, and I. Zaihrayeu. Data Management for Peer-to-Peer Computing: A Vision. In Proc. of the 5th International Workshop on the Web and Databases (WebDB 02), Madison, Wisconsin, 2002. 7. D. Bianchini, V. De Antonellis, and M. Melchiori. Flexible Semanticbased Service Matchmaking and Discovery. World Wide Web Journal (http://www.springerlink.com/content/ 4514473142836174/), 2008. 8. C. Fellbaum. Wordnet: An Electronic Lexical Database. MIT Press, 1998.

9. The Gnutella Protocol Specification v.0.4. http://www.clip2.com/gnutellaprotocol04.pdf. 10. A. Halevy, Z. Ives, D. Suciu, and I. Tatarinov. Schema Mediation in Peer Data Management Systems. In Proc. of the 19th International Conference on Data Engineering (ICDE 03), Bangalore, India, 2003. 11. NeuroGrid Home Page. http://www.neurogrid.net. 12. M. Paolucci, K.P. Sycara, T. Nishimura, and N. Srinivasan. Using daml-s for p2p discovery. In Proceedings of the Int. Conference on Web Services (ICWS2003), 2003. 13. C. Schmidt and M. Parashar. A peer-to-peer approach to web service discovery. World Wide Web Journal, 7(2):211 229, 2004. 14. The ESTEEM Team. Emergent Semantics and Cooperation in Multi-Knowledge Environments: the ESTEEM Architecture. In VLDB Int. Workshop on Semantic Data and Service Integration (SDSI07), pages 1 12, Vienna, Austria, 2007. 15. K. Verma, K. Sivashanmugam, A. Sheth, A. Patil, S. Oundhakar, and J. Miller. METEOR-S WSDI: A Scalable Infrastructure of Registries for Semantic Publication and Discovery of Web Services. Journal of Information Technology and Management, Special Issue on Universal Global Integration, 6(1):17 39, 2005. 16. S. Voulgaris, D. Gavidia, and M. van Steen. CYCLON: Inexpensive Membership Management for Unstructured P2P Overlays. Journal of Network and Systems Management, 13(2):197 217, 2005.