Consistency of Replicated Datasets in Grid Computing

Size: px
Start display at page:

Download "Consistency of Replicated Datasets in Grid Computing"

Transcription

1 Consistency of Replicated Datasets in Grid Computing Gianni Pucciani, Flavia Donno CERN, European Organization for Nuclear Research, CH 1211 Geneva 23, Switzerland Andrea Domenici DIIEIT, University of Pisa, v. Diotisalvi 2, I Pisa, Italy Heinz Stockinger Swiss Institute of Bioinformatics, Quartier Sorge, CH 1015 Lausanne, Switzerland

2 Consistency of Replicated Datasets in Grid Computing I N T R O D U C T I O N Replica consistency is the property exhibited by a set of data items, such as files or databases located at different nodes of a Grid, that contain the same information; when these data items are modifiable, all of them should be updated (or synchronized) so that consistency is maintained. Replica consistency is a very well studied research topic and has its roots in distributed systems as well as in distributed database management systems, where it is sometimes referred to as external consistency (Cellary et al., 1988). Replica consistency is obviously related to data replication, a technique that is used pervasively in Grids to achieve fast data access, high availability, increased fault tolerance, and better load balancing. Data replication involves databases, files, and possibly other units of information, such as objects or records, and relies on the functions provided by plain file systems, storage systems, database management systems, and middleware services. Currently, existing Grids offer scarce support, if any, for data consistency. Often, data is considered to be read only, i.e. data is consistent by definition since no updates are allowed on existing data items. The rest of this entry presents an introduction to the problem in the Background section, where the key concepts are introduced. Furthermore, the data management capabilities provided by middleware services currently used in some of the largest Grids are reviewed, pointing out their approach to replica management and support for replica synchronization. The core analysis of the problem is presented in the Main Focus section, where the main issues in the development of a Replica Consistency Service for Data Grids are discussed. B A C K G R O U N D

3 Data replication Data replication is a technique that is most commonly used in distributed database management systems and is tightly coupled with the transaction system. For example, a relational database management system can have identical copies at three geographically different sites, each holding a full copy of the data. End users can place their SQL queries to any of the replicas: distributed transactions are then used to make sure that data does not get corrupted by multiple writers. Simply put, data consistency makes sure that different data copies are synchronized, i.e., have the same values. In Grid computing, data replication is done at different levels of granularity than in traditional, distributed relational database management systems. In particular, Grids often replicate entire files rather than database objects. Furthermore, data synchronization and therefore consistency has to be managed by external services which often do not provide a unique interface for reading and writing data based on traditional database transactions. In the rest of this article we concentrate on the specific issues of Data Grids. However, before we go into the details of replica consistency in Grids, let us first review typical data replication components and services that are commonly used in Grid computing. A Data Grid typically offers a Replica Management Service (RMS), a middleware component that creates replicas of files (rather than relational databases) on request by applications or possibly in a transparent way in order to optimize data access. This service uses a Replica Catalogue (RC) to keep track of the replicas. The RMS may also rely on a Replica Optimization Service (ROS) to select dynamically the best 1 replicas to be accessed by a given application. Such file replication tools must then implement policies concerning the following major issues: 1. When and where to create or remove replicas? A replication service should perform dynamic replication (Ranganathan, 2001), that is the automatic creation and removal of replicas based on different system parameters and/or user needs. 2. Data location and cataloguing. Replicas can be created and removed in the course of time: they are created somewhere when needed and they must be deleted when they are no longer used. How does a user or application, or the RMS itself know where a replica is, at a certain point in time? To this end, replica catalogues (Chervenak et al., 2002) are normally used. 3. Replica synchronization. When a replica must be updated, how are the other replicas synchronized with the new content? How is replica consistency enforced throughout the system? This is the topic of this entry. Users of a RMS need not be aware of the existence of replicas. Normally, they refer to a file by a logical name that identifies the information carried by the file, independently of the physical location of its replicas. Applications relying on the RMS pass the logical name to the service, that retrieves from the RC the physical names specifying the actual locations of the replicas. Replicas are created using RMS client tools, such as the Globus Data Replication Service (Chervenak et al., 2005) or LCG Data Management tools (Peris et al., 2004) (which most of the times rely on lower level services like GridFTP), and kept on Storage Elements (SE) of different types. A storage element is a complex system that may support a hierarchy of storage systems, such as fast disk 1 The best replica in this case is chosen considering access speed and supported protocols.

4 caches, long term and high capacity disks, and tapes. Different types of storage elements exist, providing support for different access protocols. Efforts in promoting the usage of a standard interface for heterogeneous SEs are present (Shoshani, A., 2003). A storage element may replicate data internally to optimize file access, but this kind of replication is independent of the RMS and will not be discussed further. Replica management services usually replicate files and possibly file collections, but they offer no support for the replication of data stored in relational or object oriented databases. Database replication relies on the proprietary mechanisms provided by the database management systems. For example, in the project WLCG, Worldwide LHC Computing Grid (WLCG, 2007), Oracle Streams is used for unidirectional replication of Oracle Databases. Replication of databases is especially important for the availability and reliability of the Grid middleware services since most of them use databases to keep track of service specific metadata that is frequently updated. Database replication has different and usually more complex requirements with respect to file replication. Databases can be large, they must be accessed through their management systems, and they cannot just be copied as a whole, but need to be installed with a rather complex procedure. Furthermore, different sites may want to keep copies of the same data in database systems provided by different vendors: this is called heterogeneous replication. In the remainder of this article we will discuss the requirements and features for both, file and database replication and their synchronization.. Key concepts Depending on the application (counting also middleware services as applications), data to be replicated may be stored in a file or a file collection, a database or a database table, or an object stored in a file or in a database. We will use the term dataset to cover these possibilities. Datasets may be structured or unstructured. A dataset is structured if a user/application accesses it by means of record oriented operations. A relational database is a typical example of a structured dataset that is accessed through SQL commands. Unstructured datasets are those whose internal structure is either unknown or ignored for the purposes of replication, and are accessed by users/applications by means of file management operations, local or remote file I/O protocols. Unstructured datasets will also be referred to as flat files. We distinguish between the logical contents of a dataset, i.e., the information it carries, and its physical instances, called replicas. A logical dataset (or dataset for short) may then be defined as an abstract entity composed of its contents and its logical name. To each logical dataset is associated a set of replicas, each identified by a physical name. Replicas can be stored at a particular location (on a file system, database or mass storage system) and accessed by the users/applications with some sort of access protocol. A replica contains a physical representation of the dataset contents and, as observed before, different representations are possible for a given dataset. A semantic function maps a representation to the contents: for flat files the semantic function is the identity function, while for structured datasets it is the mechanism that extracts information from the replica (in a database, it is simply the query

5 processing interpreter). Replica Synchronization Protocols A replica synchronization protocol is performed by any system (such as a distributed database manager or a Grid middleware service for file replication) whose purpose is to maintain a set of replicas consistent. Several such protocols have been proposed, each aimed at satisfying different sets of requirements that arise in different environments. For example, in Grid environments it is often not possible to keep all replicas up to date, and at any given time one or more replicas might be outdated. Depending on the application, certain more relaxed consistency requirements and states can be defined (Düllmann, 2002; Gray, 1997; Breitbart, 1997). For instance, certain applications can deal with datasets that are outdated for several minutes, sometimes even hours. If this feature is known a priori, adequate consistency models can be chosen. In particular, we can distinguish between two main approaches: Synchronous, or eager synchronization. In this approach, all replicas for a given logical dataset are updated within the same transaction, with a protocol that usually is a variation of the basic two phase commit protocol (Özsu et al., 1999). This has the consequence that no single replica can be accessed during the update process but after the transaction all replicas have the same physical state and they are consistent. Although high data consistency is a desirable feature, this approach has important limitations for distributed systems and in particular for Data Grids: replicas need to be locked which can result in long down times of replicas due to update contention. When no timeout or quorum systems are used, disconnected sites can block an update operation indefinitely. Asynchronous, or lazy synchronization. The second approach tries to overcome the problem of distributed locks by updating only a subset of replicas during an update transaction and propagating the update to the other replicas at a later time. Certainly, some of the replicas will be outdated for a certain period, which is the price for speeding up the write access and increasing data availability. In order to further characterize replica consistency mechanisms, we introduce more definitions. When using lazy synchronization, a simple and reliable solution is the one where a replica is designated as master or primary replica. In single master systems, the unique master replica is the only one that can be modified by users, while the other replicas (slave or secondary replicas) are updated by the replica synchronization protocol. Secondary replicas are useful to speed up read operations. In case of failures at the master site that compromise the use of the master replicas, an election algorithm (Garcia Molina, 1982) can be used among secondary replicas to elect a new master replica. Multi master solutions can expose the system to update conflicts. Conflict resolution is a highly application specific problem. With low write access rates or when the semantics of the application allows the resolution of a conflict without affecting the normal behavior and performance of the system, multi master solutions can be implemented, increasing data availability and speeding up both read and write operations.

6 Depending on how the update of a replica is performed, we can further classify synchronization protocols distinguishing between push versus pull based and log versus content transfer systems (Saito et al., 2005). Existing support for replica consistency Today, most of the commercial database management systems provide replication features with mechanisms enforcing consistency; Oracle Streams, IBM DB Replication and Microsoft SQL Server are some of the best known ones. However, in this case the replication is homogeneous in that it regards databases of the same vendor, with a few exceptions 2. As regards Grid environments, no consistency service has been yet developed in important middleware solutions such as the Globus Toolkit and the LHC Computing Grid. Both these solutions do provide file replication features, but the automatic management of replica consistency is not supported. A prototype Grid service for maintaining consistency of replicated files and databases can be found in (Domenici et al., 2006). The SDSC Storage Resource Broker (SRB, 2007) instead provides a rather complete set of replication and consistency management features including synchronous and asynchronous approaches. Other studies in replica consistency management can be found in (Yu et al., 2002) and (Susarla et al., 2005) but their application in a real Grid environment has not yet been considered. M A I N F O C U S The need for replica consistency mechanisms in Grid environments has been pointed out early in (Stockinger, 2001), (Düllmann et al., 2002), and (Casey et al., 2003), but few solutions have been proposed so far. This is partly due to the fact that many applications that are driving the development of Grid middleware 3 expect to use modifiable datasets in the future, but currently use mostly read only data. As a consequence, requirements for replica consistency are still unclear. More precise requirements will be defined when users begin to try new models of computation and data access. Issues in designing a Replica Consistency Service The design of a Replica Consistency Service (RCS) as part of a Grid middleware is faced with many difficult issues that derive from specific properties of a Grid environment. 2 Oracle Streams can use Oracle Heterogeneous Connectivity technology to replicate data from an Oracle system to a non Oracle (Informix, MS SQL Server and Sybase) system. IBM DB2 can share and replicate data with an Informix database. 3 It is the case, for example, in the WLCG middleware, where High Energy Physics experiments mainly use the Grid to perform analysis on read only files.

7 In general, being replica consistency a highly application specific problem, designing one consistency management mechanism for different applications requires finding trade offs on many different design choices. In the next paragraphs we review some of the most important issues that need to be dealt with, providing hints for the design of a Replica Consistency Service. Scalability First of all, any Grid infrastructure involves the management of many sites, hence, in case of flat files 4, it is likely to have to deal with several thousands of replicas, some of which could not be continuously available. Thus, update propagation algorithms must be properly designed to provide good performance also with large numbers of replicas. Keeping the design simple can be the key to success; whenever possible, single master solutions are the recommended way to provide fast read access and high data availability. Security Security issues must be considered in the development of a RCS. Communication with the service should be secure; this means that the service should deal with authentication, authorization, privacy, and integrity issues. The Grid Security Infrastructure (GSI) provided by the Globus Toolkit is widely adopted as an integrated solution to security problems, and it is based on the public key infrastructure. The GSI can be easily integrated in a Grid service. Replica Location Replica location services and replica catalogues are used in Grid middleware to store the association between a logical dataset and all its replicas. Among the most used implementations we cite the Globus Replica Location Service (RLS) (Chervenak et al., 2002) and the LCG File Catalogue (LFC). The RCS has two options: interfacing with this catalogue or implementing its own replica catalogue. Both options have advantages and disadvantages. Using an external replica catalogue would avoid duplicating information and complicating the system. On the other hand, the integration with an external service should be carefully planned and would require such catalogues to be modified. For example, not all the logical datasets registered in a replica catalogue need the consistency management, like read only datasets. For datasets that do require consistency management, some new attributes (e.g. master/slaves, fresh/stale, version number) should be added to each replica's metadata. Efficient file transfer An efficient file transfer tool should be used for update propagation. File transfer services for Grid computing are normally built on top of the GridFTP protocol. The RCS should use either GridFTP or higher level services to efficiently propagate updates to possibly thousand of replicas. Most of the Storage Elements support the GridFTP protocol, making it a good choice to solve the file transfer issue in the RCS. Note that GridFTP is optimized for transferring rather big amounts of data with relatively big file sizes. This is partially due to the fact that TCP/IP works more efficiently with larger than with smaller file sizes due to the TCP window size tuning and a slow start up with smaller window sizes. Performance tests have shown that transfers of smaller data items (up to about 5 MB) can be achieved more efficiently using alternative approaches such as SOAP with attachment (Sciolla, 2007). 4 In case of replicated databases the number of replicas can range from a few units to a few tens of replicas.

8 SE heterogeneity A Grid connects many different resources. Storage Elements, where datasets are stored, can have different implementations and different access protocols. Although a standard interface could be available in the next few years (Shoshani, 2003), a RCS should interface with different SEs. Lock management functionalities should be provided by the SE since, in certain scenarios, the access to a replica may need to be blocked to avoid concurrent accesses. Disconnected nodes The RCS should be able to complete the synchronization of replicated datasets even when some of them are not available. Quorum mechanisms could be used to ensure that an update propagation process can execute when at least a given number of replicas are available, and it should be possible to select this number depending on the application requirements. Synchronization of unavailable replicas should be retried as soon as they become available. Metadata Consistency The RCS should provide synchronization capabilities both for application and middleware services. Many middleware services in fact use replication for fault tolerance and reliability. One example can be found in the Globus RLS, where catalogues are replicated but consistency management is not supported. This leads us to consider, as already stated in this article, the consistency of both files and databases, that is the subject of the next paragraph. Database Consistency A Replica Consistency Service to be used in a Grid middleware should be able to manage the consistency of both applications' data and middleware services' data. Many Grid services in fact use persistent data stores, usually relational databases, to save critical information. In order to provide fault tolerance and increase the performance of these services, such data are often replicated over several sites, and hence a consistency mechanism is needed to enforce consistency among these replicas. Practical examples of replicated services that use relational database are the Globus Replica Location Service and the LCG File Catalogue. Such services usually can be implemented using backend databases from different vendors. Oracle databases are a common choice for larger sites. In other cases, open source databases (often MySQL and PostgreSQL) are good alternatives. Thus, crossvendor replication also needs to be supported by a Replica Consistency Service. Cross vendor or heterogeneous database synchronization requires that the RCS is built using pluggable modules to interface with many different software packages. Differences in the SQL dialects used by different database vendors must be matched, both by limiting the use of non standard SQL, and by providing some translation capabilities. Unidirectional Oracle to MySQL synchronization has been tested in the CONStanza project (Domenici et al., 2006). Another open source software that provides heterogeneous replication through a Java based data extraction, transformation and loading tool is Enhydra Octopus (Octopus, 2007). A third example for a Grid database replication system is presented in (Chen, 2007). The problem of concurrency control in distributed heterogeneous databases in a Grid environment is studied in (Taniar,

9 2007). Although they present different characteristics, file synchronization and database synchronization have common features that should be exploited to provide a general and flexible Replica Consistency Service. F U T U R E T R E N D S In general, the Grid software developers deal more with the efficient replication and replica selection of read only datasets than with update synchronization and consistency. One of the reasons for that is that there are not many use cases of the latter kind in classical Grid applications. On the other hand, database research has shown that update replication comes at some cost in terms of data availability, so that only certain applications can fully profit from replicated data with update features. Just as replica consistency has become an essential property in distributed databases and certain file systems, the same will occur in Grid infrastructures. Further, considering that Grid computing is a rapidly emerging domain, it is likely that new applications, outside the scientific field, will arise in the next few years, providing more requirements for the implementation of a Replica Consistency Service. C O N C L U S I O N We have presented the problem of replica consistency in Grid environments and discussed possible solutions to be considered when implementing such a system. Nowadays many Grid applications deal with read only replicas; for this reason Grid middleware frameworks do not provide any support for replica synchronization. Another reason is that replica synchronization is a highly application specific domain, and providing a universal solution suitable for multiple dataset types and access patterns is very difficult. In this entry we analyzed the main issues in developing a Replica Consistency Service (RCS) in a Grid environment, suggesting practical approaches. Some of these approaches have been implemented and tested in a prototype service described in (Domenici et al., 2006), that allows for the synchronization of both files and heterogeneous database replicas. We expect that future Grid applications will have more stringent requirements for replica consistency; this will help to better characterize the design of the RCS and will also speed up the implementation of reliable solutions. R E F E R E N C E S Baud, J.P, Casey, J., Lemaitre, S., Nicholson, C., Smith, D., Stewart, G. (2005). LCG Data Management: from EDG to EGEE. GLAS PPE/

10 Breitbart Y., & Korth, H. F. (1997). Replication and consistency: Being lazy helps sometimes. In Proc. of the 16th ACM SIGACT SIGMOD SIGART Symposium on Principles of Database Systems. Casey, J. et al. (2003). Next Generation EU DataGrid Data Management Services. In Proc. Conference for Computing in High Energy and Nuclear Physics (CHEP 2003), La Jolla, California. Cellary, W., Gelenbe, E. and Morzy, T. (1988). Concurrency Control in Distributed Database Systems. Amsterdam: North Holland. Chen, Y., Berry D., Dantressangle P., (2007). Transaction Based Grid Database Replication, In Proc. of the UK e Science All Hands Meeting Chervenak, A., Deelman, E., Foster, I., Guy, L., Hoschek, W., Iamnitchi, A., Kesselman, C., Kunszt, P., Ripeanu, M., Schwarz, B., Stockinger, H., Stockinger, K., & Tierney, B. (2002). Giggle: A Framework for Constructing Scalable Replica Location Services. In Proc. of the Int'l. ACM/IEEE Supercomputing Conference (SC 2002), IEEE Computer Society Press. Chervenak, A., Schuler, R., Kesselman, C., Koranda, S., Moe, B. Wide (2005). Area Data Replication for Scientific Collaboration. In Proc. of 6th IEEE/ACM Int'l Workshop on GridComputing (Grid2005). Domenici, A., Donno, F., Pucciani, G., Stockinger, H. (2006). Relaxed Data Consistency with CONStanza. Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid06), Singapore May 2006, IEEE Computer Society. Domenici, A., Donno, F., Pucciani, G., Stockinger, H., Stockinger, K. (2003). Replica consistency in a Data Grid. In Proc. of the IX International Workshop on Advanced Computing and Analysis Techniques in Physics Research, Tsukuba, Japan. Düllmann, D., Hoschek, W., Jean Martinez, J., Samar, A., Stockinger, H., & Stockinger, K. (2002). Models for Replica Synchronisation and Consistency in a Data Grid. In Proc. of 10th IEEE Symposium on High Performance and Distributed Computing (HPDC 10), IEEE Computer Society Press. Garcia Molina, H. (1982). Elections in a Distributed Computing System. IEEE Transaction on Computers, vol. 32. Gray, J., Helland, P., O'Neil, P., & Shasha. D. (1997). The dangers of replication and a solution. In Proc. of the 1996 ACM SIGMOD International Conference on Management of Data, pp LCG 3D (2007). Distributed Deployment of Databases for LCG. From Octopus (2007). Enhydra Octopus, JDBC Data Transformation. From Özsu, M.T., Valduriez, P. (1999). Principles of Distributed Database Systems, Prentice Hall. Peris, A.D., Lorenzo, P.M, Donno, F., Sciabà, A., Campana, S., Santinelli, R. (2004). LCG 2 User Guide, v2.1. Ranganathan, K. and Foster, I. (2001). Identifying Dynamic Replication Strategies for a High Performance Data Grid. In Proc. of the International Grid Computing Workshop, Denver, CO. RLS (2007). Data Management: Key Concepts of RLS. From Saito, Y., Shapiro, M. (2005). Optimistic Replication. ACM Computing Surveys.

11 Sciolla, C. (2007), Implementazione e valutazione di un sistema di trasferimento file basato su SOAP in ambiente GRID, In Italian, Master s Thesis at the University of Pisa. Shoshani, A. (2003). Storage Resource Managers: Essential Components for the Grid. Chapter in book: Grid Resource management: State of the Art and Future Trends, Edited by Jarek Nabrzyski, Jennifer M. Schopf, Jan Weglarz, Kluwer Academic Publishers. SRB (2007). The SDSC Storage Resource Broker. From Stockinger, H. (2001). Database Replication in World Wide Distributed Data Grids. Ph.D. Thesis, Institute of Computer Science and Business Informatics, University of Vienna, Austria. Susarla, S., Carter, J. (2005). Flexible Consistency for Wide area Peer Replication. In Proc. of the 25th International Conference on Distributed Computing Systems. Taniar D., Goel S. (2007), Concurrency control issues in grid databases, Future Generation Computer Systems 23:1 WLCG (2007). Worlwide LHC Computing Grid. From Yu, H., Vahdat, A. (2002). Design and Evaluation of a Conit based Continuous Consistency Model for Replicated Services. ACM Transactions on Computer Systems (TOCS). Terms and Definitions Data Replication Having and managing more copies of datasets. These copies are typically synchronized. Replica Catalogue Used to locate replicas (physical locations) which are mapped to logical file names. Logical File Name A name used to identify a set of replicated files. Physical File Name The name of a replicated file which defines his location. Replica Management System A Grid service that takes care of replicating datasets and keeping track of locations in a Replica Catalogue. Replica Consistency The property exhibited by a set of replicas that contain the same information. Replica Synchronization The task of updating replicas in order to enforce their consistency. Strict Synchronization Updating all the replicas of the same dataset in a single transaction to make sure that replicas are never outdated.

12 Lazy Synchronization Allowing for certain delays in the update process, i.e. replicas can be outdated for a certain time. Heterogeneous Database Synchronization Used to enforce consistency among replicated databases of different vendors.

Heterogeneous Database Replication Gianni Pucciani

Heterogeneous Database Replication Gianni Pucciani LCG Database Deployment and Persistency Workshop CERN 17-19 October 2005 Heterogeneous Database Replication Gianni Pucciani A.Domenici andrea.domenici@iet.unipi.it F.Donno flavia.donno@cern.ch L.Iannone

More information

Replica Management Services in the European DataGrid Project

Replica Management Services in the European DataGrid Project Replica Management Services in the European DataGrid Project David Cameron 2, James Casey 1, Leanne Guy 1, Peter Kunszt 1, Sophie Lemaitre 1, Gavin McCance 2, Heinz Stockinger 1, Kurt Stockinger 1, Giuseppe

More information

A Reputation Replica Propagation Strategy for Mobile Users in Mobile Distributed Database System

A Reputation Replica Propagation Strategy for Mobile Users in Mobile Distributed Database System A Reputation Replica Propagation Strategy for Mobile Users in Mobile Distributed Database System Sashi Tarun Assistant Professor, Arni School of Computer Science and Application ARNI University, Kathgarh,

More information

Distributed Database Management Systems and the Data Grid

Distributed Database Management Systems and the Data Grid Distributed Database Management Systems and the Data Grid Heinz Stockinger CERN, European Organization for Nuclear Research, Geneva, Switzerland Institute for Computer Science and Business Informatics,

More information

Web Service Based Data Management for Grid Applications

Web Service Based Data Management for Grid Applications Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications

More information

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007 Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the

More information

Towards Heterogeneous Grid Database Replication. Kemian Dang

Towards Heterogeneous Grid Database Replication. Kemian Dang Towards Heterogeneous Grid Database Replication Kemian Dang Master of Science Computer Science School of Informatics University of Edinburgh 2008 Abstract Heterogeneous database replication in the Grid

More information

EDG Project: Database Management Services

EDG Project: Database Management Services EDG Project: Database Management Services Leanne Guy for the EDG Data Management Work Package EDG::WP2 Leanne.Guy@cern.ch http://cern.ch/leanne 17 April 2002 DAI Workshop Presentation 1 Information in

More information

Atomic Commitment in Grid Database Systems

Atomic Commitment in Grid Database Systems Atomic Commitment in Grid Database Systems Sushant Goel 1 Hema Sharda 2 David Taniar 3 1,2 School of Electrical and Computer Systems Engineering, Royal Melbourne Institute of Technology, Australia 1 s2013070@student.rmit.edu.au

More information

The glite File Transfer Service

The glite File Transfer Service The glite File Transfer Service Peter Kunszt Paolo Badino Ricardo Brito da Rocha James Casey Ákos Frohner Gavin McCance CERN, IT Department 1211 Geneva 23, Switzerland Abstract Transferring data reliably

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

GridFTP: A Data Transfer Protocol for the Grid

GridFTP: A Data Transfer Protocol for the Grid GridFTP: A Data Transfer Protocol for the Grid Grid Forum Data Working Group on GridFTP Bill Allcock, Lee Liming, Steven Tuecke ANL Ann Chervenak USC/ISI Introduction In Grid environments,

More information

Transaction-Based Grid Database Replication

Transaction-Based Grid Database Replication Transaction-Based Grid Database Replication Yin Chen 1, Dave Berry 1, Patrick Dantressangle 2 1 National e-science Centre, Edinburgh, UK 2 IBM, Hursley Lab, Winchester, UK Abstract We present a framework

More information

Survey on Comparative Analysis of Database Replication Techniques

Survey on Comparative Analysis of Database Replication Techniques 72 Survey on Comparative Analysis of Database Replication Techniques Suchit Sapate, Student, Computer Science and Engineering, St. Vincent Pallotti College, Nagpur, India Minakshi Ramteke, Student, Computer

More information

Database Replication with Oracle 11g and MS SQL Server 2008

Database Replication with Oracle 11g and MS SQL Server 2008 Database Replication with Oracle 11g and MS SQL Server 2008 Flavio Bolfing Software and Systems University of Applied Sciences Chur, Switzerland www.hsr.ch/mse Abstract Database replication is used widely

More information

Real-time Data Replication

Real-time Data Replication Real-time Data Replication from Oracle to other databases using DataCurrents WHITEPAPER Contents Data Replication Concepts... 2 Real time Data Replication... 3 Heterogeneous Data Replication... 4 Different

More information

A High-Performance Virtual Storage System for Taiwan UniGrid

A High-Performance Virtual Storage System for Taiwan UniGrid Journal of Information Technology and Applications Vol. 1 No. 4 March, 2007, pp. 231-238 A High-Performance Virtual Storage System for Taiwan UniGrid Chien-Min Wang; Chun-Chen Hsu and Jan-Jan Wu Institute

More information

THE CCLRC DATA PORTAL

THE CCLRC DATA PORTAL THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims

More information

Deploying a distributed data storage system on the UK National Grid Service using federated SRB

Deploying a distributed data storage system on the UK National Grid Service using federated SRB Deploying a distributed data storage system on the UK National Grid Service using federated SRB Manandhar A.S., Kleese K., Berrisford P., Brown G.D. CCLRC e-science Center Abstract As Grid enabled applications

More information

DATABASE REPLICATION A TALE OF RESEARCH ACROSS COMMUNITIES

DATABASE REPLICATION A TALE OF RESEARCH ACROSS COMMUNITIES DATABASE REPLICATION A TALE OF RESEARCH ACROSS COMMUNITIES Bettina Kemme Dept. of Computer Science McGill University Montreal, Canada Gustavo Alonso Systems Group Dept. of Computer Science ETH Zurich,

More information

Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications

Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications White Paper Table of Contents Overview...3 Replication Types Supported...3 Set-up &

More information

A Metadata Catalog Service for Data Intensive Applications

A Metadata Catalog Service for Data Intensive Applications A Metadata Catalog Service for Data Intensive Applications Gurmeet Singh, Shishir Bharathi, Ann Chervenak, Ewa Deelman, Carl Kesselman, Mary Manohar, Sonal Patil, Laura Pearlman Information Sciences Institute,

More information

A Survey Study on Monitoring Service for Grid

A Survey Study on Monitoring Service for Grid A Survey Study on Monitoring Service for Grid Erkang You erkyou@indiana.edu ABSTRACT Grid is a distributed system that integrates heterogeneous systems into a single transparent computer, aiming to provide

More information

Data Grids. Lidan Wang April 5, 2007

Data Grids. Lidan Wang April 5, 2007 Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural

More information

TECHNIQUES FOR DATA REPLICATION ON DISTRIBUTED DATABASES

TECHNIQUES FOR DATA REPLICATION ON DISTRIBUTED DATABASES Constantin Brâncuşi University of Târgu Jiu ENGINEERING FACULTY SCIENTIFIC CONFERENCE 13 th edition with international participation November 07-08, 2008 Târgu Jiu TECHNIQUES FOR DATA REPLICATION ON DISTRIBUTED

More information

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data David Minor 1, Reagan Moore 2, Bing Zhu, Charles Cowart 4 1. (88)4-104 minor@sdsc.edu San Diego Supercomputer Center

More information

Presentation of Multi Level Data Replication Distributed Decision Making Strategy for High Priority Tasks in Real Time Data Grids

Presentation of Multi Level Data Replication Distributed Decision Making Strategy for High Priority Tasks in Real Time Data Grids Presentation of Multi Level Data Replication Distributed Decision Making Strategy for High Priority Tasks in Real Time Data Grids Naghmeh Esmaieli Esmaily.naghmeh@gmail.com Mahdi Jafari Ser_jafari@yahoo.com

More information

A Distributed Architecture for Multi-dimensional Indexing and Data Retrieval in Grid Environments

A Distributed Architecture for Multi-dimensional Indexing and Data Retrieval in Grid Environments A Distributed Architecture for Multi-dimensional Indexing and Data Retrieval in Grid Environments Athanasia Asiki, Katerina Doka, Ioannis Konstantinou, Antonis Zissimos and Nectarios Koziris National Technical

More information

Data Management in an International Data Grid Project

Data Management in an International Data Grid Project Data Management in an International Data Grid Project Wolfgang Hoschek 1,3, Javier Jaen-Martinez 1, Asad Samar 1,4, Heinz Stockinger 1,2, and Kurt Stockinger 1,2 1 CERN, European Organization for Nuclear

More information

GORDA: An Open Architecture for Database Replication

GORDA: An Open Architecture for Database Replication GORDA: An Open Architecture for Database Replication Alfrânio Correia Jr. University of Minho Nuno Carvalho University of Lisboa José Pereira University of Minho Ricardo Vilaça University of Minho Susana

More information

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 ICOM 6005 Database Management Systems Design Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 Readings Read Chapter 1 of text book ICOM 6005 Dr. Manuel

More information

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware R. Goranova University of Sofia St. Kliment Ohridski,

More information

Data Management Services Design and Development

Data Management Services Design and Development Next-Generation EU DataGrid Data Management Services Diana Bosio, James Casey, Akos Frohner, Leanne Guy, Peter Kunszt, Erwin Laure, Sophie Lemaitre, Levi Lucio, Heinz Stockinger, Kurt Stockinger CERN,

More information

An Overview of Distributed Databases

An Overview of Distributed Databases International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 2 (2014), pp. 207-214 International Research Publications House http://www. irphouse.com /ijict.htm An Overview

More information

Web Service Robust GridFTP

Web Service Robust GridFTP Web Service Robust GridFTP Sang Lim, Geoffrey Fox, Shrideep Pallickara and Marlon Pierce Community Grid Labs, Indiana University 501 N. Morton St. Suite 224 Bloomington, IN 47404 {sblim, gcf, spallick,

More information

It takes know-how to retrieve large files over public networks

It takes know-how to retrieve large files over public networks It takes know-how to retrieve large files over public networks Adam H. Villa and Elizabeth Varki University of New Hampshire Department of Computer Science Durham, NH, 03824 USA Abstract Retrieving large

More information

MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper

MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper Migrating Desktop and Roaming Access Whitepaper Poznan Supercomputing and Networking Center Noskowskiego 12/14 61-704 Poznan, POLAND 2004, April white-paper-md-ras.doc 1/11 1 Product overview In this whitepaper

More information

In Memory Accelerator for MongoDB

In Memory Accelerator for MongoDB In Memory Accelerator for MongoDB Yakov Zhdanov, Director R&D GridGain Systems GridGain: In Memory Computing Leader 5 years in production 100s of customers & users Starts every 10 secs worldwide Over 15,000,000

More information

16th International Conference on Control Systems and Computer Science (CSCS16 07)

16th International Conference on Control Systems and Computer Science (CSCS16 07) 16th International Conference on Control Systems and Computer Science (CSCS16 07) TOWARDS AN IO INTENSIVE GRID APPLICATION INSTRUMENTATION IN MEDIOGRID Dacian Tudor 1, Florin Pop 2, Valentin Cristea 2,

More information

Information Sciences Institute University of Southern California Los Angeles, CA 90292 {annc, carl}@isi.edu

Information Sciences Institute University of Southern California Los Angeles, CA 90292 {annc, carl}@isi.edu _ Secure, Efficient Data Transport and Replica Management for High-Performance Data-Intensive Computing Bill Allcock 1 Joe Bester 1 John Bresnahan 1 Ann L. Chervenak 2 Ian Foster 1,3 Carl Kesselman 2 Sam

More information

Status and Evolution of ATLAS Workload Management System PanDA

Status and Evolution of ATLAS Workload Management System PanDA Status and Evolution of ATLAS Workload Management System PanDA Univ. of Texas at Arlington GRID 2012, Dubna Outline Overview PanDA design PanDA performance Recent Improvements Future Plans Why PanDA The

More information

Tier Architectures. Kathleen Durant CS 3200

Tier Architectures. Kathleen Durant CS 3200 Tier Architectures Kathleen Durant CS 3200 1 Supporting Architectures for DBMS Over the years there have been many different hardware configurations to support database systems Some are outdated others

More information

Database Replication with MySQL and PostgreSQL

Database Replication with MySQL and PostgreSQL Database Replication with MySQL and PostgreSQL Fabian Mauchle Software and Systems University of Applied Sciences Rapperswil, Switzerland www.hsr.ch/mse Abstract Databases are used very often in business

More information

A Brief Analysis on Architecture and Reliability of Cloud Based Data Storage

A Brief Analysis on Architecture and Reliability of Cloud Based Data Storage Volume 2, No.4, July August 2013 International Journal of Information Systems and Computer Sciences ISSN 2319 7595 Tejaswini S L Jayanthy et al., Available International Online Journal at http://warse.org/pdfs/ijiscs03242013.pdf

More information

Data Management System for grid and portal services

Data Management System for grid and portal services Data Management System for grid and portal services Piotr Grzybowski 1, Cezary Mazurek 1, Paweł Spychała 1, Marcin Wolski 1 1 Poznan Supercomputing and Networking Center, ul. Noskowskiego 10, 61-704 Poznan,

More information

The Step from Grid Research to Successful Grid Businesses

The Step from Grid Research to Successful Grid Businesses The Step from Grid Research to Successful Grid Businesses Jörn Altmann School of Information Technology International University in Germany 76646 Bruchsal, Germany jorn.altmann@acm.org Abstract Resent

More information

Designing a Cloud Storage System

Designing a Cloud Storage System Designing a Cloud Storage System End to End Cloud Storage When designing a cloud storage system, there is value in decoupling the system s archival capacity (its ability to persistently store large volumes

More information

The EU DataGrid Data Management

The EU DataGrid Data Management The EU DataGrid Data Management The European DataGrid Project Team http://www.eu-datagrid.org DataGrid is a project funded by the European Union Grid Tutorial 4/3/2004 n 1 EDG Tutorial Overview Workload

More information

Technical. Overview. ~ a ~ irods version 4.x

Technical. Overview. ~ a ~ irods version 4.x Technical Overview ~ a ~ irods version 4.x The integrated Ru e-oriented DATA System irods is open-source, data management software that lets users: access, manage, and share data across any type or number

More information

Load Balancing in Distributed Data Base and Distributed Computing System

Load Balancing in Distributed Data Base and Distributed Computing System Load Balancing in Distributed Data Base and Distributed Computing System Lovely Arya Research Scholar Dravidian University KUPPAM, ANDHRA PRADESH Abstract With a distributed system, data can be located

More information

A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment

A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment Arshad Ali 3, Ashiq Anjum 3, Atif Mehmood 3, Richard McClatchey 2, Ian Willers 2, Julian Bunn

More information

Survey and Taxonomy of Grid Resource Management Systems

Survey and Taxonomy of Grid Resource Management Systems Survey and Taxonomy of Grid Resource Management Systems Chaitanya Kandagatla University of Texas, Austin Abstract The resource management system is the central component of a grid system. This paper describes

More information

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms Distributed File System 1 How do we get data to the workers? NAS Compute Nodes SAN 2 Distributed File System Don t move data to workers move workers to the data! Store data on the local disks of nodes

More information

Distributed Database Design

Distributed Database Design Distributed Databases Distributed Database Design Distributed Database System MS MS Web Web data mm xml mm dvanced Database Systems, mod1-1, 2004 1 Advanced Database Systems, mod1-1, 2004 2 Advantages

More information

DSS. Data & Storage Services. Cloud storage performance and first experience from prototype services at CERN

DSS. Data & Storage Services. Cloud storage performance and first experience from prototype services at CERN Data & Storage Cloud storage performance and first experience from prototype services at CERN Maitane Zotes Resines, Seppo S. Heikkila, Dirk Duellmann, Geoffray Adde, Rainer Toebbicke, CERN James Hughes,

More information

Grid Technology and Information Management for Command and Control

Grid Technology and Information Management for Command and Control Grid Technology and Information Management for Command and Control Dr. Scott E. Spetka Dr. George O. Ramseyer* Dr. Richard W. Linderman* ITT Industries Advanced Engineering and Sciences SUNY Institute

More information

Model-Driven Cloud Data Storage

Model-Driven Cloud Data Storage Model-Driven Cloud Data Storage Juan Castrejón 1, Genoveva Vargas-Solar 1, Christine Collet 1, and Rafael Lozano 2 1 Université de Grenoble, LIG-LAFMIA, 681 rue de la Passerelle, Saint Martin d Hères,

More information

Analisi di un servizio SRM: StoRM

Analisi di un servizio SRM: StoRM 27 November 2007 General Parallel File System (GPFS) The StoRM service Deployment configuration Authorization and ACLs Conclusions. Definition of terms Definition of terms 1/2 Distributed File System The

More information

Chapter 7. Using Hadoop Cluster and MapReduce

Chapter 7. Using Hadoop Cluster and MapReduce Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in

More information

An approach to grid scheduling by using Condor-G Matchmaking mechanism

An approach to grid scheduling by using Condor-G Matchmaking mechanism An approach to grid scheduling by using Condor-G Matchmaking mechanism E. Imamagic, B. Radic, D. Dobrenic University Computing Centre, University of Zagreb, Croatia {emir.imamagic, branimir.radic, dobrisa.dobrenic}@srce.hr

More information

DFSgc. Distributed File System for Multipurpose Grid Applications and Cloud Computing

DFSgc. Distributed File System for Multipurpose Grid Applications and Cloud Computing DFSgc Distributed File System for Multipurpose Grid Applications and Cloud Computing Introduction to DFSgc. Motivation: Grid Computing currently needs support for managing huge quantities of storage. Lacks

More information

Evolution of Database Replication Technologies for WLCG

Evolution of Database Replication Technologies for WLCG Home Search Collections Journals About Contact us My IOPscience Evolution of Database Replication Technologies for WLCG This content has been downloaded from IOPscience. Please scroll down to see the full

More information

New method for data replication in distributed heterogeneous database systems

New method for data replication in distributed heterogeneous database systems New method for data replication in distributed heterogeneous database systems Miroslaw Kasper Department of Computer Science AGH University of Science and Technology Supervisor: Grzegorz Dobrowolski Krakow,

More information

Distributed Systems LEEC (2005/06 2º Sem.)

Distributed Systems LEEC (2005/06 2º Sem.) Distributed Systems LEEC (2005/06 2º Sem.) Introduction João Paulo Carvalho Universidade Técnica de Lisboa / Instituto Superior Técnico Outline Definition of a Distributed System Goals Connecting Users

More information

In-Memory BigData. Summer 2012, Technology Overview

In-Memory BigData. Summer 2012, Technology Overview In-Memory BigData Summer 2012, Technology Overview Company Vision In-Memory Data Processing Leader: > 5 years in production > 100s of customers > Starts every 10 secs worldwide > Over 10,000,000 starts

More information

Ant Colony Optimization for Data Grid Replication Services Technical Report RR-06-08. DIIS. UNIZAR.

Ant Colony Optimization for Data Grid Replication Services Technical Report RR-06-08. DIIS. UNIZAR. Ant Colony Optimization for Data Grid Replication Services Technical Report RR-06-08. DIIS. UNIZAR. Víctor Méndez Muñoz 1 and Felix García Carballeira 2 1 Universidad de Zaragoza, CPS, Edificio Ada Byron,

More information

Database Replication Techniques: a Three Parameter Classification

Database Replication Techniques: a Three Parameter Classification Database Replication Techniques: a Three Parameter Classification Matthias Wiesmann Fernando Pedone André Schiper Bettina Kemme Gustavo Alonso Département de Systèmes de Communication Swiss Federal Institute

More information

DataMover: Robust Terabyte-Scale Multi-file Replication over Wide-Area Networks

DataMover: Robust Terabyte-Scale Multi-file Replication over Wide-Area Networks DataMover: Robust Terabyte-Scale Multi-file Replication over Wide-Area Networks Alex Sim, Junmin Gu, Arie Shoshani, Vijaya Natarajan Lawrence Berkeley National Laboratory (asim, jgu, shoshani, vnatarajan)@lbl.gov

More information

A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System

A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System Mohammad Ghulam Ali Academic Post Graduate Studies and Research Indian Institute of Technology, Kharagpur Kharagpur,

More information

Michał Jankowski Maciej Brzeźniak PSNC

Michał Jankowski Maciej Brzeźniak PSNC National Data Storage - architecture and mechanisms Michał Jankowski Maciej Brzeźniak PSNC Introduction Assumptions Architecture Main components Deployment Use case Agenda Data storage: The problem needs

More information

Distributed Systems and Recent Innovations: Challenges and Benefits

Distributed Systems and Recent Innovations: Challenges and Benefits Distributed Systems and Recent Innovations: Challenges and Benefits 1. Introduction Krishna Nadiminti, Marcos Dias de Assunção, and Rajkumar Buyya Grid Computing and Distributed Systems Laboratory Department

More information

Multi-Channel Clustered Web Application Servers

Multi-Channel Clustered Web Application Servers THE AMERICAN UNIVERSITY IN CAIRO SCHOOL OF SCIENCES AND ENGINEERING Multi-Channel Clustered Web Application Servers A Masters Thesis Department of Computer Science and Engineering Status Report Seminar

More information

Distributed Database Access in the LHC Computing Grid with CORAL

Distributed Database Access in the LHC Computing Grid with CORAL Distributed Database Access in the LHC Computing Grid with CORAL Dirk Duellmann, CERN IT on behalf of the CORAL team (R. Chytracek, D. Duellmann, G. Govi, I. Papadopoulos, Z. Xie) http://pool.cern.ch &

More information

Locality Based Protocol for MultiWriter Replication systems

Locality Based Protocol for MultiWriter Replication systems Locality Based Protocol for MultiWriter Replication systems Lei Gao Department of Computer Science The University of Texas at Austin lgao@cs.utexas.edu One of the challenging problems in building replication

More information

IV Distributed Databases - Motivation & Introduction -

IV Distributed Databases - Motivation & Introduction - IV Distributed Databases - Motivation & Introduction - I OODBS II XML DB III Inf Retr DModel Motivation Expected Benefits Technical issues Types of distributed DBS 12 Rules of C. Date Parallel vs Distributed

More information

Chapter 18: Database System Architectures. Centralized Systems

Chapter 18: Database System Architectures. Centralized Systems Chapter 18: Database System Architectures! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types 18.1 Centralized Systems! Run on a single computer system and

More information

Database Replication

Database Replication Database Systems Journal vol. I, no. 2/2010 33 Database Replication Marius Cristian MAZILU Academy of Economic Studies, Bucharest, Romania mariuscristian.mazilu@gmail.com, mazilix@yahoo.com For someone

More information

Classic Grid Architecture

Classic Grid Architecture Peer-to to-peer Grids Classic Grid Architecture Resources Database Database Netsolve Collaboration Composition Content Access Computing Security Middle Tier Brokers Service Providers Middle Tier becomes

More information

Database Replication: A Survey of Open Source and Commercial Tools

Database Replication: A Survey of Open Source and Commercial Tools Database Replication: A Survey of Open Source and Commercial Tools Salman Abdul Moiz Research Scientist Centre for Development of Advanced Computing, Bangalore. Sailaja P. Senior Staff Scientist Centre

More information

In-memory databases and innovations in Business Intelligence

In-memory databases and innovations in Business Intelligence Database Systems Journal vol. VI, no. 1/2015 59 In-memory databases and innovations in Business Intelligence Ruxandra BĂBEANU, Marian CIOBANU University of Economic Studies, Bucharest, Romania babeanu.ruxandra@gmail.com,

More information

Designing an Object Relational Data Warehousing System: Project ORDAWA * (Extended Abstract)

Designing an Object Relational Data Warehousing System: Project ORDAWA * (Extended Abstract) Designing an Object Relational Data Warehousing System: Project ORDAWA * (Extended Abstract) Johann Eder 1, Heinz Frank 1, Tadeusz Morzy 2, Robert Wrembel 2, Maciej Zakrzewicz 2 1 Institut für Informatik

More information

Bryan Tuft Sr. Sales Consultant Global Embedded Business Unit bryan.tuft@oracle.com

Bryan Tuft Sr. Sales Consultant Global Embedded Business Unit bryan.tuft@oracle.com Bryan Tuft Sr. Sales Consultant Global Embedded Business Unit bryan.tuft@oracle.com Agenda Oracle Approach Embedded Databases TimesTen In-Memory Database Snapshots Q&A Real-Time Infrastructure Challenges

More information

Database replication for commodity database services

Database replication for commodity database services Database replication for commodity database services Gustavo Alonso Department of Computer Science ETH Zürich alonso@inf.ethz.ch http://www.iks.ethz.ch Replication as a problem Gustavo Alonso. ETH Zürich.

More information

A Grid Architecture for Manufacturing Database System

A Grid Architecture for Manufacturing Database System Database Systems Journal vol. II, no. 2/2011 23 A Grid Architecture for Manufacturing Database System Laurentiu CIOVICĂ, Constantin Daniel AVRAM Economic Informatics Department, Academy of Economic Studies

More information

Experiences with the GLUE information schema in the LCG/EGEE production Grid

Experiences with the GLUE information schema in the LCG/EGEE production Grid Experiences with the GLUE information schema in the LCG/EGEE production Grid Stephen Burke, Sergio Andreozzi and Laurence Field CHEP07, Victoria, Canada www.eu-egee.org EGEE and glite are registered trademarks

More information

Towards Distributed Service Platform for Extending Enterprise Applications to Mobile Computing Domain

Towards Distributed Service Platform for Extending Enterprise Applications to Mobile Computing Domain Towards Distributed Service Platform for Extending Enterprise Applications to Mobile Computing Domain Pakkala D., Sihvonen M., and Latvakoski J. VTT Technical Research Centre of Finland, Kaitoväylä 1,

More information

Run Control and Monitor System for the CMS Experiment

Run Control and Monitor System for the CMS Experiment Computing in High Energy and Nuclear Physics, La Jolla CA, March 24-28, 2003 1 Run Control and Monitor System for the CMS Experiment V. Brigljevic, G. Bruno, E. Cano, S. Cittolin, A. Csilling, D. Gigi,

More information

IBM WebSphere Distributed Caching Products

IBM WebSphere Distributed Caching Products extreme Scale, DataPower XC10 IBM Distributed Caching Products IBM extreme Scale v 7.1 and DataPower XC10 Appliance Highlights A powerful, scalable, elastic inmemory grid for your business-critical applications

More information

CHAPTER 2 MODELLING FOR DISTRIBUTED NETWORK SYSTEMS: THE CLIENT- SERVER MODEL

CHAPTER 2 MODELLING FOR DISTRIBUTED NETWORK SYSTEMS: THE CLIENT- SERVER MODEL CHAPTER 2 MODELLING FOR DISTRIBUTED NETWORK SYSTEMS: THE CLIENT- SERVER MODEL This chapter is to introduce the client-server model and its role in the development of distributed network systems. The chapter

More information

Performance Analysis of Cloud-Based Applications

Performance Analysis of Cloud-Based Applications Performance Analysis of Cloud-Based Applications Peter Budai and Balazs Goldschmidt Budapest University of Technology and Economics, Department of Control Engineering and Informatics, Budapest, Hungary

More information

Data Management in the Cloud

Data Management in the Cloud Data Management in the Cloud Ryan Stern stern@cs.colostate.edu : Advanced Topics in Distributed Systems Department of Computer Science Colorado State University Outline Today Microsoft Cloud SQL Server

More information

Overview of Scalable Distributed Database System SD-SQL Server

Overview of Scalable Distributed Database System SD-SQL Server Overview of Scalable Distributed Database System Server Witold Litwin 1, Soror Sahri 2, Thomas Schwarz 3 CERIA, Paris-Dauphine University 75016 Paris, France Abstract. We present a scalable distributed

More information

SOFT 437. Software Performance Analysis. Ch 5:Web Applications and Other Distributed Systems

SOFT 437. Software Performance Analysis. Ch 5:Web Applications and Other Distributed Systems SOFT 437 Software Performance Analysis Ch 5:Web Applications and Other Distributed Systems Outline Overview of Web applications, distributed object technologies, and the important considerations for SPE

More information

The Sierra Clustered Database Engine, the technology at the heart of

The Sierra Clustered Database Engine, the technology at the heart of A New Approach: Clustrix Sierra Database Engine The Sierra Clustered Database Engine, the technology at the heart of the Clustrix solution, is a shared-nothing environment that includes the Sierra Parallel

More information

Implementing New Approach for Enhancing Performance and Throughput in a Distributed Database

Implementing New Approach for Enhancing Performance and Throughput in a Distributed Database 290 The International Arab Journal of Information Technology, Vol. 10, No. 3, May 2013 Implementing New Approach for Enhancing Performance and in a Distributed Database Khaled Maabreh 1 and Alaa Al-Hamami

More information

Plateforme de Calcul pour les Sciences du Vivant. SRB & glite. V. Breton. http://clrpcsv.in2p3.fr

Plateforme de Calcul pour les Sciences du Vivant. SRB & glite. V. Breton. http://clrpcsv.in2p3.fr SRB & glite V. Breton http://clrpcsv.in2p3.fr Introduction Goal: evaluation of existing technologies for data and tools integration and deployment Data and tools integration should be addressed using web

More information

CFS: A New Dynamic Replication Strategy for Data Grids

CFS: A New Dynamic Replication Strategy for Data Grids 94 The International Arab Journal of Information Technology, Vol. 9, No. 1, January 2012 CFS: A New Dynamic Replication Strategy for Data Grids Feras Hanandeh 1, Mutaz Khazaaleh 2, Hamidah Ibrahim 3, and

More information

Resource Monitoring in GRID computing

Resource Monitoring in GRID computing Seminar May 16, 2003 Resource Monitoring in GRID computing Augusto Ciuffoletti Dipartimento di Informatica - Univ. di Pisa next: Network Monitoring Architecture Network Monitoring Architecture controls

More information

SCC717 Recent Developments in Information Technology

SCC717 Recent Developments in Information Technology SCC717 Recent Developments in Information Technology Lecturer: Wanlei ZHOU, Ph.D., DSc., Professor, Acting Head School of Information Technology Deakin University, Melbourne Campus Office: T205.2, Ext.

More information

Data Management and Network Marketing Model

Data Management and Network Marketing Model Distributed Data Management Services for Dynamic Data Grids Houda Lamehamedi, Boleslaw K. Szymanski, Brenden Conte Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY 12180 {lamehh,

More information