XII. Distributed Systems and Middleware. Laurea Triennale in Informatica Corso di Ingegneria del Software I A.A. 2006/2007 Andrea Polini



Similar documents
Middleware: Past and Present a Comparison

Middleware Lou Somers

System types. Distributed systems

Distributed Systems. Outline. What is a Distributed System?

Distributed Systems Architectures

SOFT 437. Software Performance Analysis. Ch 5:Web Applications and Other Distributed Systems

Chapter 6. CORBA-based Architecture. 6.1 Introduction to CORBA 6.2 CORBA-IDL 6.3 Designing CORBA Systems 6.4 Implementing CORBA Applications

Web Services. Copyright 2011 Srdjan Komazec

Software Engineering and Middleware: A Roadmap

Tier Architectures. Kathleen Durant CS 3200

Chapter 2: Enterprise Applications from a Middleware Perspective

Client-Server Applications

Distributed Objects and Components

Software Engineering and Middleware: A Roadmap

Introduction to CORBA. 1. Introduction 2. Distributed Systems: Notions 3. Middleware 4. CORBA Architecture

Infrastructure that supports (distributed) componentbased application development

What is Middleware? Software that functions as a conversion or translation layer. It is also a consolidator and integrator.

Middleware and Distributed Systems. Introduction. Dr. Martin v. Löwis

Client/Server Computing Distributed Processing, Client/Server, and Clusters

Chapter Outline. Chapter 2 Distributed Information Systems Architecture. Middleware for Heterogeneous and Distributed Information Systems

CHAPTER 2 MODELLING FOR DISTRIBUTED NETWORK SYSTEMS: THE CLIENT- SERVER MODEL

How To Understand The Concept Of A Distributed System

Ingegneria del Software II academic year: Course Web-site: [

Motivation Definitions EAI Architectures Elements Integration Technologies. Part I. EAI: Foundations, Concepts, and Architectures

The Service Availability Forum Specification for High Availability Middleware

DISTRIBUTED AND PARALLELL DATABASE

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme. Middleware. Chapter 8: Middleware

Introduction CORBA Distributed COM. Sections 9.1 & 9.2. Corba & DCOM. John P. Daigle. Department of Computer Science Georgia State University

3-Tier Architecture. 3-Tier Architecture. Prepared By. Channu Kambalyal. Page 1 of 19

25 May Code 3C3 Peeling the Layers of the 'Performance Onion John Murphy, Andrew Lee and Liam Murphy

A distributed system is defined as

Chapter 16 Distributed Processing, Client/Server, and Clusters

Chapter 2: Remote Procedure Call (RPC)

Virtual machine interface. Operating system. Physical machine interface

Event-based middleware services

Module 17. Client-Server Software Development. Version 2 CSE IIT, Kharagpur

CHAPTER 1: OPERATING SYSTEM FUNDAMENTALS

Service-Oriented Architecture and Software Engineering

Chapter 18: Database System Architectures. Centralized Systems

Distributed Systems. Distributed Systems

Distributed Systems. Security concepts; Cryptographic algorithms; Digital signatures; Authentication; Secure Sockets

Distributed Systems. REK s adaptation of Prof. Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 1

Enterprise Application Integration

MIDDLEWARE 1. Figure 1: Middleware Layer in Context

Distributed Systems Lecture 1 1

Principles and characteristics of distributed systems and environments

1 File Processing Systems

Chapter 3. Database Environment - Objectives. Multi-user DBMS Architectures. Teleprocessing. File-Server

Distributed systems. Distributed Systems Architectures

JOURNAL OF OBJECT TECHNOLOGY

C/S Basic Concepts. The Gartner Model. Gartner Group Model. GM: distributed presentation. GM: distributed logic. GM: remote presentation

Centralized Systems. A Centralized Computer System. Chapter 18: Database System Architectures

The EMSX Platform. A Modular, Scalable, Efficient, Adaptable Platform to Manage Multi-technology Networks. A White Paper.

Lecture 7: Java RMI. CS178: Programming Parallel and Distributed Systems. February 14, 2001 Steven P. Reiss

Objectives. Distributed Databases and Client/Server Architecture. Distributed Database. Data Fragmentation

Service Oriented Architectures

CS550. Distributed Operating Systems (Advanced Operating Systems) Instructor: Xian-He Sun

Distributed System: Definition

Implementing Java Distributed Objects with JDBC

System Models for Distributed and Cloud Computing

COMP5426 Parallel and Distributed Computing. Distributed Systems: Client/Server and Clusters

3F6 - Software Engineering and Design. Handout 10 Distributed Systems I With Markup. Steve Young

Overview of CORBA 11.1 I NTRODUCTION TO CORBA Object services 11.5 New features in CORBA Summary

How To Write A Network Operating System For A Network (Networking) System (Netware)

2.1 What are distributed systems? What are systems? Different kind of systems How to distribute systems? 2.2 Communication concepts

Client/Server and Distributed Computing

CORBAservices. Naming. Part of the CORBA Naming Service Interface in IDL. CORBA Naming Service

Network/Socket Programming in Java. Rajkumar Buyya

Chapter 1: Distributed Systems: What is a distributed system? Fall 2008 Jussi Kangasharju

Distributed Systems LEEC (2005/06 2º Sem.)

A Framework for ADS Application Software Development Based on CORBA

Designing for Maintainability

COMP9243 Week 11 (15s1) Ihor Kuz, Manuel M. T. Chakravarty

Architecture of the CORBA Component Model CORBA 3.0

SCALABILITY AND AVAILABILITY

Distributed Systems and Recent Innovations: Challenges and Benefits

Stock Trader System. Architecture Description

Scientific versus Business Workflows

A Comparison of Distributed Systems: ChorusOS and Amoeba

Proactive, Resource-Aware, Tunable Real-time Fault-tolerant Middleware

What Is the Java TM 2 Platform, Enterprise Edition?

Chapter 4. Architecture. Table of Contents. J2EE Technology Application Servers. Application Models

Distributed System Principles

Software Life-Cycle Management

Division of Informatics, University of Edinburgh

What is a database? COSC 304 Introduction to Database Systems. Database Introduction. Example Problem. Databases in the Real-World

Resource Utilization of Middleware Components in Embedded Systems

Elements of Advanced Java Programming

Service Mediation. The Role of an Enterprise Service Bus in an SOA

Middleware. Peter Marwedel TU Dortmund, Informatik 12 Germany. technische universität dortmund. fakultät für informatik informatik 12

Remote Method Invocation in JAVA

EJB & J2EE. Component Technology with thanks to Jim Dowling. Components. Problems with Previous Paradigms. What EJB Accomplishes

OMG/CORBA: An Object-Oriented Middleware

ICS 434 Advanced Database Systems

A Java-based system support for distributed applications on the Internet

S y s t e m A r c h i t e c t u r e

CORBA and object oriented middleware. Introduction

Client/server is a network architecture that divides functions into client and server

Unification of Transactions and Replication in Three-Tier Architectures Based on CORBA

Transcription:

XII. Distributed Systems and Middleware Laurea Triennale in Informatica Corso di

Outline Distributed Systems Basics Middleware generalities Middleware for Distributed Objects Distributed Computing Models 2

Distributed systems: a definition A distributed system is a collection of processors that do not share memory or a clock. Instead, each processor has its own local memory, and the processors communicate with each other through various communication lines. The processors in a distributed system vary in size and function. They may include small microprocessor, workstations, minicomputers, and large general purpose computer systems. A distributed system must provide various mechanisms for process synchronization and communication, for dealing with the deadlock problem, and for dealing with a variety of failures that are not encountered in a centralized system. (Silberschatz & Galvin, Operating System Concepts) 3

Highlights No shared memory and neither shared time Heterogeneity Typical problems of concurrency and necessity of suitable means to deal with them New and more points of failures Security and Integrity issues CONCLUSION Building a distributed system is really more difficult and expensive than building a centralized one. Hence the choice must be pondered. In general it is better not to build a distributed system if it can be avoided 4

Why building a distributed system? However some non-functional requirements cannot be achieved by a centralized system, and this lead to the construction of Distributed System, in particular: Scalability Resource Sharing Heterogeneity Fault-Tolerance Openness 5

Scalability: Non-functional requirements The system must be capable of accommodating growing load in the future Distributed system can be easily scalable adding new computers in the original configuration Resource Access and Sharing: Often it is necessary to share hardware, software and data Resource manager and security issues 6

Heterogeneity: Non-functional requirements Use of different technology for the implementation of services and legacy components Differences in: programming languages, operating systems, hardware platforms, network protocols Fault Tolerance: Operations that continue also in presence of faults (many components higher probability of having faults) Generally obtained by means of redundant components 7

Openness: Non-functional requirements System can be easily extended and modified with new functionalities It is necessary to establish well-documented and well-defined interfaces What about Performance? Distributed system can certainly improve performance (real parallelism available) However performance should not be the main motivation Communications really expensive Consider instead Multiprocessor and Massively Parallel System 8

Outline Distributed Systems Basics Middleware generalities Middleware for Distributed Objects Distributed Computing Models 9

Middleware: a definition Middleware is a set of common business-unaware services that enable applications and end users to interact with each other across a network. In essence, middleware is the software that resides above the network and below the business-aware application software. (Umar, Object-Oriented Client/Server Internet environments) Application Middleware Network Services Local Services Operating System and Computing hardware 10

Middleware: a glance 11

Why do we need middleware? Programming in heterogeneous distributed environment is made difficult by several factors: It is often necessary to exchange complex data Different encoding of data types References to other distributed components as return values Distributed Components activation and deactivation Synchronization and Real Parallelism Need for atomic sequence operations... 12

13

Trasparency Trasparency complexity introduced by distribution (mobility...) should disappear The system should appear, to the final user, as an integrated computing facility Certainly useful to hide the distribution as much as possible to the application engineer Several dimension of transparency (not orthogonals) can be identified 14

Trasparency Relations Part of the International Standard on Open Distributed Processing (ODP) Scalability Transparency Performance Transparency Failure Transparency Migration Transparency Replication Transparency Concurrency Transparency Access Transparency Location Transparency (W.Emmerich Engineering Distributed Objects John Wiley and Sons) 15

Access Transparency: Transparency Dimensions interface to a service do not depends from the location of the components that use it. Without it is not an easy task to move the service to a different host. Location Transparency: a request for a service can be made without knowing the physical location of the components that provide the service. Without it moving components becomes almost impossible Migration Transparency: Components can be migrated to different host without that the user are aware of that, and that the developer of clients components take a special consideration 16

Transparency Dimensions Replication Transparency: The user of a service and the application programmer are not aware that a service they are using are provided by a replica Concurrency Transparency: Several components may concurrently request services from a shared component while the shared component s integrity is preserved and neither users nor application engineers have to see how concurrency is controlled Scalability Transparency: To the users and designers is transparent how the system scales to accommodate a growing load 17

Transparency Dimensions Performance Transparency: The users and the application programmers are not aware of how the system performance is actually achieved (really difficult to obtain given the general unpredictability of load) Failure Transparency: the user and the application programmer are unaware of how the system hides the failure. In some way they should believe that the service cannot fail 18

Outline Distributed Systems Basics Middleware generalities Middleware for Distributed Objects Distributed Computing Models 19

Types of middleware Transaction Oriented Middleware: often used when the distributed components are database applications. It use the two-phase commit protocol to implement distributed transactions. (IBM CICS, BEA Tuxedo, Transarc Encina) Message-Oriented Middleware: supports the communication by message exchange. Components use messages to require services and can wait for a response. It simplifies decoupling of clients and servers (then scalability) and supports multi-cast in a transparent way. (IBM MQSeries, Sun ToolTalk, NCR TopEnd) 20

Types of middleware Remote Procedure Calls: operation that can be invoked remotely across different hardware and operating systems platforms. The intent was to provide a mean to invoke remote procedures as local procedures. Interface Definition Language (IDL) Client and Server stubs RPC is not reflexive 21

Object-Oriented middleware Objective in the developing of OO Middleware is to export the paradigm of OO into the distributed world The development of a OO distributed application must be similar to the development of a normal OO application Distribution should not completely disappear!!! 22

OO Middleware Main elements of a OO middleware are: Object Request Broker Interface Definition Language Three main examples of OO middleware: CORBA (OMG) DCOM/.Net (Microsoft) JavaRMI (Sun) 23

Just a glance on remote invocations Based on the same schema of RPC: 24

Just a glance on remote invocations A distributed AddressBook: public interface AddressBookI { public String[] getaddress(string name) throws Trowable; public String[] gettown(string name) throws Trowable; public String gettelnumber(string name) throws Trowable; } public clas s AddressBookServer implements AddressBookI { private AddressBookTable AddrBook; public String addentry(string name, String street, String town, String tel) { AddressBook.addEntry(name,street,town,tel); } public String[] getstreet(string name) { return AddrBook.getStreet(name); } public String[] gettown(string name) { return AddrBook.getTown(name); } public String[] gettelnumber(string name) { return AddrBook.getTel(name); } } public clas s AddressRemoteClient { public s tatic void main(string[] args) { try { AddressBookI ab = new AddressBookServerStub(); System.out.println( Via: +ab.getstreet(args[1])+ Comune di: +gettown(args[1])+ Tel: + gettelnumber(args[1]);} } catch (Throwable t) {t.printstacktrace();} } } Seminario al RETIS Lab Scuola Sant'Anna March 23, 2006 Machine X Machine Y 25

Class Stub import java.io.objectoutputstream; import java.io.objectinputstream; import java.net.socket; public clas s AddressStub implements AddressBookI { Socket socket; public AddressStub throws Trowable { socket = new Socket( localhost, 9000 ); } public String getstreet(name) throws Throwable { ObjectOutputStream outstream = new ObjectOutputStream(socket.getOutputStream()); outstream.writeobject( Street ); outstream.flush(); outstream.writeobject(name); outstream.flush(); ObjectInputStream instream = new ObjectInputStream(socket.getInputStream()); return (String)inStream.readObject(); } public String gettown() throws Throwable {... } } 26

Class Skeleton import java.io.objectoutputstream; import java.io.objectinputstream; import java.net.socket; import java.net.serversocket; public clas s AddressSkeleton extends Thread { AddressServer server; public AddressSkeleton (Address server) { this.server = server; } public void run() { try { ServerSocket serversocket = new ServerSocket(9000); Socket socket = serversocket.accept(); while (socket!= null) { ObjectInputStream instream = new ObjectInputStream(socket.getInputStream()); String method = (String)inStream.readObject(); String name = (String)inStream.readObject(); if (method.equals( Street )) { ObjectOutputStream OutStream = new ObjectOutputStream(socket.getOutputStream()); outstream.writeobject(server.getstreet(name)); outstream.flush(); } els e if (method.equals( Town )) {... } } } } catch (Trowable t) {t.printstacktrace(); System.exit(0);} } public s tatic void main(string[] args) { AddressServer server = new AddressServer( Via Salaria, Roma ); AddressSkeleton addressskel = new AddressSkeleton(server); addressskel.start(); } } 27

Distribution can t disappear!!! Differences between local and distributed objects, and that must be considered by the application designer, concern: Life Cycle Object References Request Latency Activation Parallelism Communications Failures Security 28

Life cycle Creation: in traditional OO programming creation via constructor (compiler solve the problem). In distributed OO programming created object do not necessarily reside in the same process space, therefore is not possible to directly invoke constructors. Migration: after the creation an object could migrate to other hosts; when necessary the object should have been designed to permit migration Deletion: difficulties in implementing garbage collections algorithms referential integrity is rather expensive, it is difficult to know all the existing references to an object then it is necessary to deal with server objects no more available 29

Object references Parameter-passing Remote references are quite big data structure In complex middleware the necessity of space for a reference can be 100 times bigger than in normal OO programming Applications cannot maintain large numbers of object references Designing distributed object-based applications we have to minimize the number of objects 30

Request latency Local request call requires, in modern workstations, 250 nanoseconds (4ya) Remote request could require between 0.1 and 10 milliseconds (4ya) A remote request is about 400-4000 times more expensive than a local one It is really important try to obtain locality. Objects that communicate a lot between them, should reside on the same host!! 31

Activation/Deactivation More new problems in distributed OO programming: machines, hosting server objects, could be restarted Resource required by all the server objects on a host may be greater than the resources available Server object could be idle for long time between two invocations than could be opportune do not waste resources It is necessary to introduce other two operation to objects life cycle: Activation and Deactivation 32

Activation/Deactivation This operation can increase latency if the requested service is provided by a deactivated object Obviously activation and deactivation must be transparent to the client It is necessary to implement a persistence service to handle stateful objects 33

Parallelism Certainly we have to manage real parallelism, independently from the use of threads Access to server objects must be controlled 34

Communications The numerous cause of delay impose the possibility of using non-blocking calls invoking a service Often is also useful to use a form of multi-cast Request Synchronization (Synchronous, One way, extended rendez-vous, asynchronous) Request Multiplicity (Unicast request, Group request, Multiple request) 35

Failures Distributed objects have to deal with major probability of failures Partial failure (a new kind of failure) Different reliabilities available for distributed objects Unicast request exactly-once, atomic, at-least-once, at-most-once, maybe Group request and Multiple request k-reliability, totally ordered, best effort 36

Security Distributed objects use the network for communications!!! Centralized applications trust that the user will not make the session available to unauthorized users In distributed applications each request might be authenticated 37

Common problems Recurring problems not strictly related to the application logic Middleware can provide solutions known as services. We will see in brief four of them : Location Life Cycle management Object persistence Transactions 38

Object Naming Location service a sequence of identifiers is bound to an object reference Hierarchical structuring of name spaces (as DNS) The object must be registered within a naming server that then can provide the reference to clients (CORBA Naming Service, COM Monikers, JavaRMI Registry) Object Trading three main actors: Trader, Exporter and Importer reference retrieved on the base of properties concerning provided functionalities and quality 39

CREATION: Life cycle service The new operator is not available for remote construction Client need reference to Factory objects (objects capable of create other objects) Administrator influences distribution placing factories MIGRATION: Moving or Copying (using a factory) an object from its current location moved objects retain the same reference more factories on heterogeneous platforms to solve heterogeneity machine code DELETION Composite Objects Life Cycle Management 40

Persistence service Three reason that can cause the necessity of storing object state information: deactivation hosts have to be restarted memory lacking Java Serialization, CORBA Externalization, COM Structured Storage 41

Transaction service Several object requests into a coarse grained one with ACID properties Middleware provide mechanisms for the implementation of the 2PC CORBA Transaction Service, Microsoft Transaction Server (MTS), Java Transaction API (JTA) 42

CORBA Short Overview Common Object Request Broker Architecture (CORBA) Open Standard defined by the Object Management Group (OMG) Last Formal Spec March 2004 (1152 Pages) Main Elements: Object Model ORB CORBA IDL CORBAServices, CORBAFacilities, CORBA Domain Interfaces http://www.corba.org/vc.htm (~220 companies committed) http://www.omg.org/technology/corba/corbadownloads.htm (~15 free CORBA implementation) 43

CORBA Architecture 44

CORBAservices Additional Structuring Mechanisms for the OTS Notification Service Collection Service Persistent State Service Concurrency Service Property Service Enhanced View of Time Query Service Event Service Lightweight Service Management of Event Domain Relationship Service Externalization Service Security Service Naming Service Time Service Licensing Service Trading Object Service Life Cycle Service Transaction Service Notification/JMS Interworking Telecoms Log Service 45

Outline Distributed Systems Basics Middleware generalities Middleware for Distributed Objects Distributed Computing Models 46

File Transfer: Distributed computing models this was one of the first model trying to exploit the resources distribution. Following this paradigm the applications, that need data residing in another machine, make the log-on on it and then transfer the data. Therefore offline the foreseen computations are performed The distribution regards only the data Applicable only with low load and low concurrency e.g. e-mail 47

Client/Server: Distributed computing models Client/server model is a concept for describing communications between computing processes that are classified as service consumers (clients) and service providers (servers) Today the most used paradigm, therefore we will see major details in the following In this context C/S is a software architecture not hardware!! 48

Distributed computing models Peer-to-Peer (P2P): following this paradigm more processes, located on different machines, cooperate to reach the solution of the problem. In different moments the process perform both server and client duties Also consequence of the great underutilization of many resources linked to the net. 49

Client/Server model Client: process that requires services Server: process that provides services Three different logic levels of computation concerning: Graphical interface Business logic Data Management 50

Two-tier architecture: Client/Server model the business logic is divided between the two elements. We can have: fat client and thin server fat server and thin client Three-tier and n-tier architecture: There is a tier for each of the logic levels It is also possible to split the business logic in more than one tier (e.g. web applications) 51

The J2EE example: Client/Server model 52