RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT



Similar documents
Research Issues in Peer-to-Peer Data Management

An Introduction to Peer-to-Peer Networks

The Role and uses of Peer-to-Peer in file-sharing. Computer Communication & Distributed Systems EDA 390

A PROXIMITY-AWARE INTEREST-CLUSTERED P2P FILE SHARING SYSTEM

GRIDB: A SCALABLE DISTRIBUTED DATABASE SHARING SYSTEM FOR GRID ENVIRONMENTS *

Varalakshmi.T #1, Arul Murugan.R #2 # Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam

Counteracting free riding in Peer-to-Peer networks q

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

Architectures and protocols in Peer-to-Peer networks

A Reputation Management System in Structured Peer-to-Peer Networks

Multicast vs. P2P for content distribution

Load Balancing in Structured Overlay Networks. Tallat M. Shafaat

Chord - A Distributed Hash Table

A Review on Efficient File Sharing in Clustered P2P System

A Peer-to-Peer File Sharing System for Wireless Ad-Hoc Networks

Raddad Al King, Abdelkader Hameurlain, Franck Morvan

Adapting Distributed Hash Tables for Mobile Ad Hoc Networks

A Survey of Peer-to-Peer File Sharing Technologies

Information Searching Methods In P2P file-sharing systems

PSON: A Scalable Peer-to-Peer File Sharing System Supporting Complex Queries

New Structured P2P Network with Dynamic Load Balancing Scheme

Peer-to-Peer Systems: "A Shared Social Network"

Using Peer to Peer Dynamic Querying in Grid Information Services

Clustering in Peer-to-Peer File Sharing Workloads

Distributed Computing over Communication Networks: Topology. (with an excursion to P2P)

PEER TO PEER FILE SHARING USING NETWORK CODING

5. Peer-to-peer (P2P) networks

SCALABLE RANGE QUERY PROCESSING FOR LARGE-SCALE DISTRIBUTED DATABASE APPLICATIONS *

P2P: centralized directory (Napster s Approach)

Peer-to-Peer Computing

LOOKING UP DATA IN P2P SYSTEMS

Acknowledgements. Peer to Peer File Storage Systems. Target Uses. P2P File Systems CS 699. Serving data with inexpensive hosts:

Simulating a File-Sharing P2P Network

How To Create A P2P Network

PROPOSAL AND EVALUATION OF A COOPERATIVE MECHANISM FOR HYBRID P2P FILE-SHARING NETWORKS

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. File Sharing

Tornado: A Capability-Aware Peer-to-Peer Storage Network

File sharing using IP-Multicast

Principles of Distributed Database Systems

A P2P SERVICE DISCOVERY STRATEGY BASED ON CONTENT

P2P Storage Systems. Prof. Chun-Hsin Wu Dept. Computer Science & Info. Eng. National University of Kaohsiung

International journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online

Efficient Content Location Using Interest-Based Locality in Peer-to-Peer Systems

PEER-TO-PEER (P2P) systems have emerged as an appealing

XOP: Sharing XML Data Objects through Peer-to-Peer Networks

Load Balancing in Peer-to-Peer Data Networks

query enabled P2P networks Park, Byunggyu

Evolution of Peer-to-Peer Systems

p2p: systems and applications Internet Avanzado, QoS, Multimedia Carmen Guerrero

Distributed Hash Tables in P2P Systems - A literary survey

DUP: Dynamic-tree Based Update Propagation in Peer-to-Peer Networks

How To Make A Network Overlay More Efficient

Peer-to-Peer Replication

Enhance UDDI and Design Peer-to-Peer Network for UDDI to Realize Decentralized Web Service Discovery

Decentralized supplementary services for Voice-over-IP telephony

HollyShare: Peer-to-Peer File Sharing Application

Peer-to-Peer: an Enabling Technology for Next-Generation E-learning

Distributed file system in cloud based on load rebalancing algorithm

A NEW FULLY DECENTRALIZED SCALABLE PEER-TO-PEER GIS ARCHITECTURE

CS5412: TIER 2 OVERLAYS

P2P Characteristics and Applications

Load Balancing in Structured P2P Systems

Clustering in Peer-to-Peer File Sharing Workloads

A Survey of Peer-to-Peer Content Distribution Technologies

IMPACT OF DISTRIBUTED SYSTEMS IN MANAGING CLOUD APPLICATION

Peer to peer networks: sharing between peers. Trond Aspelund

Optimizing and Balancing Load in Fully Distributed P2P File Sharing Systems

Common P2P Examples. Peer to Peer Networks. Client-Server Architecture. Uses of P2P. Napster Morpheus Gnutella Freenet BitTorrent Skype

Super-Agent Based Reputation Management with a Practical Reward Mechanism in Decentralized Systems

A Measurement Study of Peer-to-Peer File Sharing Systems

GISP: Global Information Sharing Protocol a distributed index for peer-to-peer systems

Professor Yashar Ganjali Department of Computer Science University of Toronto.

CSCI-1680 CDN & P2P Chen Avin

Enhancing Secure File Transfer by Analyzing Repeated Server Based Strategy using Gargantuan Peers (G-peers)

Krunal Patel Department of Information Technology A.D.I.T. Engineering College (G.T.U.) India. Fig. 1 P2P Network

Mapping the Gnutella Network: Macroscopic Properties of Large-Scale Peer-to-Peer Systems

8 Conclusion and Future Work

Attacks Against Peer-to-peer Networks and Countermeasures

Research on P2P-SIP based VoIP system enhanced by UPnP technology

LOAD BALANCING WITH PARTIAL KNOWLEDGE OF SYSTEM

Peer-to-Peer Networks 02: Napster & Gnutella. Christian Schindelhauer Technical Faculty Computer-Networks and Telematics University of Freiburg

Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk.

Improving Data Availability through Dynamic Model-Driven Replication in Large Peer-to-Peer Communities

Accessing XML Documents using Semantic Meta Data in a P2P Environment

Scalable Source Routing

Data Integration Hub for a Hybrid Paper Search

Transcription:

RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT Bilkent University 1

OUTLINE P2P computing systems Representative P2P systems P2P data management Incentive mechanisms Concluding remarks Bilkent University 2

PEER-TO-PEER COMPUTING Peer-to-Peer (P2P) is a distributed computing paradigm that enables a collection of nodes (peers) to share computer resources in a decentralized manner. Communication is symmetric; i.e., peers act as both clients and servers. Bilkent University 3

PEER-TO-PEER SYSTEMS Benefits of P2P model: self-organization adaptation scalability autonomy load-balancing fault-tolerance availability resource aggregation Bilkent University 4

SOME P2P STATISTICS At its peak, about 60 million people joined the Napster community. It managed to reach a peak population of approximately 1.5 million simultaneous users. [http://www.slyck.com/story1314.html] The number of simultaneous users participating on various P2P Networks increased from 3.8 million (Aug. 2003) to 9.0 million (Sept. 2006). [http://www.slyck.com/story1314.html] P2P population in Japan has increased from 1.3 million (2005) 1.8 million (2006) individuals. [http://www.slyck.com/story1249.html] P2P traffic in Germany consumes approximately 30% of available bandwidth during the day, and 70% at night. [http://www.slyck.com/story1320.html] Bilkent University 5

P2P DATA MANAGEMENT Initial motivation of P2P systems: file-sharing. To support sharing of semantically rich data, some data management issues need to be addressed: data integration query processing data consistency replication clustering Bilkent University 6

P2P DATA MANAGEMENT Data management in P2P systems is challenging due to the large scale of the network and highly-transient peer population. Decentralized and self-adaptive data management is required. Bilkent University 7

P2P NETWORK STRUCTURE Each peer maintains links with a selected subset of other peers, forming an overlay network. A message between any two peers is routed through the overlay network. Overlay networks can be distinguished in terms of their centralization and structure. Bilkent University 8

OVERLAY NETWORK CENTRALIZATION Purely Decentralized Architectures Partially Centralized Architectures Hybrid Decentralized Architectures Bilkent University 9

Purely Decentralized No central coordination of activities. download request (5) (1) (6) (1) (1) (2) requester (1) (1) (4) Peer IP (2) (3) Peer IP target Bilkent University 10

Partially Centralized requester Super Peer Super Peer Super Peer Superpeers act as local central indices for files shared by local peers. target Bilkent University 11

Hybrid Decentralized A central server facilitates the interaction between peers by maintaining directories of metadata. target (4) download Server (1) (2) Peer IP requester (3) request Bilkent University 12

OVERLAY NETWORK STRUCTURE Classification of overlay networks in terms of structure: Unstructured: overlay network is created nondeterministically as peers and files are added. Structured: creation of overlay networks is based on specific rules. Bilkent University 13

UNSTRUCTURED P2P SYSTEMS There is no restriction on data placement. Searching mechanisms employed can range from flooding to those that use indexing. Topologies are usually not restricted to some regular structure. Unstructured systems are more appropriate for accommodating highly-transient peer populations. Bilkent University 14

STRUCTURED P2P SYSTEMS Overlay topology is tightly controlled and pointers to files are placed at precisely specified locations. A mapping is provided between files and location in the form of a distributed routing table. It is hard to maintain the structure in the face of a highlytransient peer population. Bilkent University 15

OUTLINE P2P computing systems Representative P2P systems P2P data management Incentive mechanisms Concluding remarks Bilkent University 16

NAPSTER First P2P file sharing system. Enables sharing of music files over the Internet. Uses a centralized directory to maintain the list of music files. Keyword based querying. Centralized index, distributed data. Bilkent University 17

NAPSTER Napster Server Search (1) (2) requester abc.mp3? (3) request download (4) abc.mp3 Bilkent University 18

GNUTELLA Purely decentralized architecture. A file sharing protocol. Users can search for and download files from the other users connected to the Internet. Bilkent University 19

GNUTELLA (3) (3) (2) (2) (2) (2) (4) (1) (3) abc.mp3 download (6) (2) (2) (5) request (1) requester abc.mp3? Bilkent University 20

GNUTELLA Flooding to distribute messages. Time-to-live (TTL) to limit the spread of messages. Superpeers in more recent versions of Gnutella. Shared files are indexed at superpeers. Queries are initially directed to superpeers. Bilkent University 21

CAN (Content Addressable Network) A distributed hash table. Uses a d-dimensional coordinate space. Each peer is responsible for a portion of data space, called zone. A key is mapped onto a point in the coordinate space, and is stored at the peer whose zone contains the point s coordinates. Bilkent University 22

CAN (Content Addressable Network) (0, 1) (1, 1) Peer 2 Peer 1 Peer 4 ((0, 0.5), (0.5, 1)) Peer 3 Peer 7 Peer 5 Peer 6 Peer 8 (0, 0) (1, 0) Bilkent University 23

CAN (Content Addressable Network) Each peer maintains a routing table of its neighbors. Query messages are forwarded along a path to the source zone containing the key. Bilkent University 24

CAN (Content Addressable Network) (0, 1) (1, 1) search(0.9, 0.2) Peer 2 Peer 1 Peer 4 ((0, 0.5), (0.5, 1)) Peer 3 Peer 7 Peer 5 Peer 6 ((0.75,0),(1,0.25)) Peer 8 (0, 0) (1, 0) Key (0.9, 0.2) is stored in this peer Bilkent University 25

CHORD Peers form a ring called identifier ring or Chord ring. File keys are distributed over the same identifier space. Successor(k): The peer whose identifier is equal to or follows k in the identifier space. The data file with key k is assigned to Successor(k). Bilkent University 26

CHORD 7 id = 0 1 6 2 Successor(1) = 3 Successor(5) = 0 5 4 3 Successor(4) = 4 Keys 1, 4, 5 are located at peers 3, 4, 0 respectively. Bilkent University 27

CHORD A skiplist-like routing table, called finger table, is used by peers. Each peer keeps track of log(n) other peers in its finger table. Bilkent University 28

CHORD 28 2 5 Finger Table 25 +16 +1 of Peer 5 5+1 12 23 +8 +4 +2 5+2 5+4 5+8 12 12 16 5+16 23 12 16 Bilkent University 29

CHORD Search operation is performed through finger tables. A peer forwards a query for key k to the peer in its finger table with the highest id not exceeding k. Bilkent University 30

K26 28 CHORD 2 25 5 search(26) 23 12 16 Bilkent University 31

OUTLINE P2P computing systems Representative P2P systems P2P data management Incentive mechanisms Concluding remarks Bilkent University 32

OUTLINE P2P data management Indexing Data integration Query processing Replication Clustering Bilkent University 33

INDEXING Traditional distributed systems use centralized or distributed indices. Indices in P2P systems: must support frequent updates, need to be highly scalable. Index types: Local Centralized Distributed Bilkent University 34

LOCAL INDEX Peers index only their own data. Flooding is used for routing. Large volume of query traffic with no guarantee for matching. Scalability problem due to large network overhead. To improve scalability: fixed time-to-leave (TTL) rings, expanding rings, random walk, random k-walkers. Bilkent University 35

CENTRALIZED INDEX A single peer maintains references to data on all peers. Index is centralized but the data is distributed. The index server becomes a bottleneck and a single point of failure. Replicas of the centralized index may be maintained. Higher reliability and scalability. Huge number of updates is not handled efficiently. Bilkent University 36

DISTRIBUTED INDEX Index distribution can vary depending on the structure of overlay network, and the degree of centralization of the P2P system. Partially centralized P2P systems (e.g., Kazaa) A superpeer maintains index information about a number of other peers it is responsible for. Bilkent University 37

DISTRIBUTED INDEX Structured P2P systems Each peer maintains index for the data assigned to it. Distributed Hash Tables (DHTs) greedy routing robustness low diameter Bilkent University 38

OUTLINE P2P data management Indexing Data integration Query processing Replication Clustering Bilkent University 39

DATA INTEGRATION Schema Mappings Required as different names or formalisms are used to describe data. Aims to provide uniform querying environment that hides heterogeneity. Traditional approach is to use a global unified schema. Queries are specified interms of the global schema. Bilkent University 40

DATA INTEGRATION Schema Mappings (contd.) A global unified schema is not applicable to P2P systems due to: Autonomous nature of peers Volatility of the system Scalability of the system Bilkent University 41

DATA INTEGRATION P2P Schema Mappings Pair Mappings: only between pairs of peers. Peer-Mediated Mappings: a generalization of pair mappings. (e.g., Piazza, PeerDB). Super-Peer Mediated Mappings: at the super-peer level (e.g., Edutella). Bilkent University 42

Pair Mappings Schema B Map(C) peer Map(B) Schema A peer peer Schema C Bilkent University 43

Peer-Mediated Mappings Schema B Map(B,A,C) peer Schema A peer peer Schema C Bilkent University 44

SuperPeer-Mediated Mappings peer B2 B1 peer Super Peer B Map(B1, B2, B3) peer A1 peer B3 C1 peer peer A2 Super Peer A Map(B) peer C2 Super Peer C peer C3 peer A3 peer C4 Bilkent University 45

OUTLINE P2P data management Indexing Data integration Query processing Replication Clustering Bilkent University 46

QUERY PROCESSING Querying shared files Queries are routed to peers to locate files. Querying structured data More expressive queries are allowed. Current research in P2P systems: key lookups in structured networks keyword queries in unstructured networks Key lookups and keyword queries are not expressive enough. Bilkent University 47

QUERYING SHARED FILES Aim is to locate the source peers. Popular files are replicated. Unpopular files may not be found. Routing schemes: Blind searches Informed searches Bilkent University 48

QUERYING SHARED FILES Blind Search Query is propagated arbitrarily to peers. The query may not be satisfied. No information about file locations is utilized. Routing methods: Flooding, TTL rings, random walks. Bilkent University 49

QUERYING SHARED FILES Informed Search Each peer maintains some form of routing table containing file placement information. The chances for locating file increases as the hop count in the route increases. Bilkent University 50

QUERYING SHARED FILES Informed Search (contd.) Search methods differ in the information collected about the peers and file placements. Query Routing Protocol (QRP) Keywords describing the file contents that a peer offers are summarized in routing tables and exchanged with neighbors. Routing Indices FreeNet s routing scheme Bilkent University 51

QUERYING STRUCTURED DATA Support for complex query types is desirable (range queries, multi-attribute queries, join queries, aggregation queries). Multi-Attribute Addressable Network (MAAN) Supports multi-attribute and range queries. Built on CHORD structured P2P network. A locality preserving hash function is used to map attribute values. Bilkent University 52

COMPLEX QUERIES Multi-Attribute Addressable Network (contd.) A range query searching for files with attribute value in the range [l, u] is forwarded to peer pl where l has been hashed to. All the files of pl that satisfy the range query are gathered in the result. Next, the query is forwarded to the immediate successor of peer pl in the ring. This process continues until the query reaches peer pu where u has been hashed to. Bilkent University 53

COMPLEX QUERIES Multi-Attribute Addressable Network (contd.) In multi-attribute setting, data consists of a set of attributed and their respective values. A multi-attribute query is treated as a combination of subqueries on each attribute dimension. Subqueries are executed at appropriate peers and results are merged at query initiator. Bilkent University 54

COMPLEX QUERIES Supporting SQL in P2P Systems PIER, PeerDB projects. A great deal of additional research is needed to advance current work. Bilkent University 55

OUTLINE P2P data management Indexing Data integration Query processing Replication Clustering Bilkent University 56

REPLICATION Aims to improve system performance, and increase data availability and reliability in case of peer failures. Classical issues of interest: what to replicate, granularity of replicas, where to place them, how to keep them consistent. Research challenges specific to P2P systems: high rates of peer joins and failures, lack of global knowledge, low online probability. Bilkent University 57

REPLICATION ISSUES IN P2P Data Consistency and Data Continuity In Chord, k replicas are stored at k successors in the ring. If the primary replica fails, the successor immediately takes over. Push update to all the other peers storing the replicas, similar to a flooding scheme. When become online, pull update from the most recent replica. Bilkent University 58

REPLICATION ISSUES IN P2P Number of Replicas Uniform replication: same number of replicas are created for all data items. Proportional replication: number of replicas created is proportional to the popularity of items. Square-root replication: for any two data items, the ratio of replication is the square root of the ratio of their query rates. Bilkent University 59

REPLICATION ISSUES IN P2P Placing Replicas Unstructured Networks Owner replication: whenever a peer issues a successful query about a data item, a replica of the item is created at that peer. (e.g., Gnutella) Path replication: copies of the requested data are stored at all peers along the path in the overlay network from the provider to the requestor. (e.g., FreeNet) Bilkent University 60

REPLICATION ISSUES IN P2P Placing Replicas Structured Networks Issues addressed by replication: load balancing, data availability. Some peers may be assigned with many popular data items by distributed hashing, and thus may become hot spots. CAN uses multiple hash functions to assign each item to more than one peer. Bilkent University 61

OUTLINE P2P data management Indexing Data integration Query processing Replication Clustering Bilkent University 62

CLUSTERING Clustering data: similar data are placed in neighboring peers. Clustering peers: peers with similar interests or similar data items are placed nearby in the overlay network. Bilkent University 63

CLUSTERING DATA Order preserving hash functions can be used to store similar documents at the same or neighboring peers. Content-based clustering is achieved by using a semantic vector describing the contents of files, as the input of the hash function. Bilkent University 64

CLUSTERING PEERS Intra-cluster organization: specification of how the peers within a cluster are connected. Inter-cluster organization: specification of how the clusters are connected. Two-step routing: identification of appropriate cluster, and then routing the query within that cluster. Bilkent University 65

CLUSTERING PEERS Peer communities Membership depends on the relationship between peers that share common interests. Clustering is implemented through sets of attributes that the peers choose. Bilkent University 66

CLUSTERING PEERS Semantic overlay clustering Based on superpeer networks. Clusters of peers with similar semantic profiles are formed. Each superpeer acts as cluster representative, which is in charge of the management of the cluster for query processing. Bilkent University 67

CLUSTERING P2P Challenges of Clustering Autonomy of peers Data clustering violates storage autonomy. Dynamic nature of connections and content Clustering method must be dynamic and incremental. Lack of global knowledge Clustering algorithms assume global knowledge of data. Bilkent University 68

OUTLINE P2P computing systems Representative P2P systems P2P data management Incentive mechanisms Concluding remarks Bilkent University 69

FREE RIDING Free riding: exploiting P2P network resources without contributing to the network at an acceptable level. Effects on P2P networks A small number of peers serves a large number of requests: scalability problem degeneration to client-server like paradigm Renewal or presentation of new content may decrease in time: limited grow in the number of shared files Bilkent University 70

FREE RIDING Effects on P2P networks (contd.) Quality of search process may degrade: less satisfaction in query results decrease in peer population Network traffic increases: degradation of P2P services delay, congestion and loss of messages Bilkent University 71

FREE RIDING STATISTICS Gnutella 85% of peers do not share any files at all. Top 1% of the sharing peers provide 50% of all query responses, while the top 25% provide 98%. Napster About 20-40% of the Napster peers share little or no files. 30% of the users that reported their bandwidth as 64 Kbps or less, but they actually had a significantly higher bandwidth. Similar findings were reported for KaZaA, Maze, edonkey networks. Bilkent University 72

INCENTIVE MECHANISMS Methods to encourage peer cooperation Micropayment-based Reciprocity-based Reputation-based Bilkent University 73

INCENTIVE MECHANISMS Micropayment-based Approaches Peers are required to pay for the services they get or resources they consume. Incentive: peers are charged for every download and paid for every upload. A trusted third-party is required for tracking accounts, distributing virtual currency, and providing security. Bilkent University 74

INCENTIVE MECHANISMS Reciprocity-Based Approaches Each peer decides how to serve any other peer based on the direct service exchange with this peer in the past. Example: Tit-for-Tat mechanism of BitTorrent A peer uploads to the peers that have given him the best downloading rate. The other peers are disallowed from downloading. Bilkent University 75

INCENTIVE MECHANISMS Reputation-Based Approaches A reputation rating is produced for the peers. Peers with low reputation are excluded from the network. Reputation information is locally generated, and it is spread throughout the network. Implementation issues: security and availability of reputation information, identity management, communication overhead. Bilkent University 76

OUTLINE P2P computing systems Representative P2P systems P2P data management Incentive mechanisms Concluding remarks Bilkent University 77

CONCLUDING REMARKS Addressed in this talk: Key research issues relevant to P2P data management Associated challenges Open research problems and directions: Design of effective incentive mechanisms and reputation systems. Extension of the key-value lookup and building application flexibility into Distributed Hash Tables (DHTs). Creation of semantic mappings to handle heterogeneity in shared data sets. Adaptation of more expressive queries. Methods to maintain replica consistency in the face of P2P specific challenges. Bilkent University 78

THANK YOU Bilkent University 79

REFERENCES R. Blanco et al. A Survey of Data Management in Peer-to-Peer Systems, Technical Report CS-2006-18, David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Ontario, Canada, 2006. M. Cai et al. Maan: A Multi-Attribute Addressable Network for Grid Information Services, IEEE International Workshop on Grid Computing (GRID), pp.184-191, 2003. E. Cohen, S. Shenker, Replication Strategies in Unstructured Peer-to-Peer Networks, ACM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), pp.177-190, 2002. B. Cohen, Incentives Build Robustness in Bittorrent, Workshop on Economics of Peer-to-Peer Systems, 2003. F. Dabek et al. Building Peer-to-Peer Systems with Chord, a Distributed Lookup Service, IEEE Workshop on Hot Topics in Operating Systems (HoTOS), pp.81-86, 2001. Bilkent University 80

REFERENCES A. Datta et al. Updates in Highly Unreliable, Replicated Peer-to-Peer Systems, International Conference on Distributed Computing Systems (ICDCS), pp.76-85, 2003. D. Hughes et al. Free Riding on Gnutella Revisited: the Bell Tolls?, IEEE Distributed Systems Online, vol.6, no.6, 2005. M. Karakaya, I. Korpeoglu, Ö. Ulusoy, A Distributed and Measurement- Based Framework Against Free Riding in Peer-to-Peer Networks, IEEE International Conference on Peer-to-Peer Computing, 2004. M. Karakaya, I. Korpeoglu, Ö. Ulusoy, A Connection Management Protocol for Promoting Cooperation in Peer-to-Peer Networks, Computer Communications, to appear, 2007. M. Khambatti et al. Structuring Peer-to-Peer Networks Using Interest-Based Communities. International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P), pp.48-63, 2003. Bilkent University 81

REFERENCES G. Koloniari, E. Pitoura, Peer-to-Peer Management of XML Data: Issues and Research Challenges, ACM SIGMOD Record, vol.34, no.2, pp.6-17, 2005. G. Koloniari et al. Overlay Networks and Query Processing: A Survey, Technical Report TR2006-08, Computer Science Department, University of Ioannina, 2006. W. S. Ng et al. Peerdb: A P2P-Based System for Distributed Data Sharing, International Conference on Data Engineering (ICDE), pp.633-644, 2003. S. Ratnasamy et al. A Scalable Content-Addressable Network, ACM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), pp.161-172, 2001. J. Risson, T. Moors, Survey of Research towards Robust Peer-to-Peer Networks: Search Methods, Computer Networks, vol.50, no.17, pp.3485-3521, 2006. C. Rohrs, Query Routing for the Gnutella Network, Unpublished Work. Available at http://rfc-gnutella.sourceforge.net/src/qrp.html, 2002. Bilkent University 82

REFERENCES S. Saroiu et al. Measuring and Analyzing the Characteristics of Napster and Gnutella Hosts, Multimedia Systems, vol.9, no.2, pp.170-184, 2003. I. Stoica et al. Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications, ACM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), 2001. I. Tatarinov et al. The Piazza Peer Data Management Project, ACM SIGMOD Record, vol.32, no.3, 2003. S. Androutsellis-Theotokis, D.Spinellis, A Survey of Peer-to-Peer Content Distribution Technologies, ACM Computing Surveys, vol.36, no.4, pp.335-371, 2004. P. Valduriez, E. Pacitti, Data Management in Large-Scale P2P Systems, High Performance Computing for Computational Science (VECPAR), pp.104-118, 2004. Bilkent University 83