Network-Based Cluster Computing

Similar documents
Distributed RAID Architectures for Cluster I/O Computing. Kai Hwang

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

Simplest Scalable Architecture

Client/Server Computing Distributed Processing, Client/Server, and Clusters

Virtual machine interface. Operating system. Physical machine interface

Architecture. System Software. Applications. Architecture. System Software. Applications. Commercialization Commodity. Research and Development

Microsoft Exchange Server 2003 Deployment Considerations

Linux clustering. Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University

COM 444 Cloud Computing

Symmetric Multiprocessing

FROM RELATIONAL TO OBJECT DATABASE MANAGEMENT SYSTEMS

OVERVIEW. CEP Cluster Server is Ideal For: First-time users who want to make applications highly available

Principles and characteristics of distributed systems and environments

TotalStorage Network Attached Storage 300G Cost effective integration of NAS and LAN solutions

HPC Cloud. Focus on your research. Floris Sluiter Project leader SARA

DAS, NAS or SAN: Choosing the Right Storage Technology for Your Organization

How To Build A Clustered Storage Area Network (Csan) From Power All Networks

NEXTGEN v5.8 HARDWARE VERIFICATION GUIDE CLIENT HOSTED OR THIRD PARTY SERVERS

Highly Available Mobile Services Infrastructure Using Oracle Berkeley DB

MOSIX: High performance Linux farm

Enterprise Edition. Hardware Requirements

Network Attached Storage. Jinfeng Yang Oct/19/2015

Using an MPI Cluster in the Control of a Mobile Robots System

Computational infrastructure for NGS data analysis. José Carbonell Caballero Pablo Escobar

Scalable Cloud Computing Solutions for Next Generation Sequencing Data

Priority Pro v17: Hardware and Supporting Systems

Distributed Systems. REK s adaptation of Prof. Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 1

NETOP SUITE NETOP POLICY MANAGER (PM)

Chapter 1: Introduction. What is an Operating System?

How To Create A Multi Disk Raid

Hardware Requirements

Windows Compute Cluster Server Miron Krokhmal CTO

Centralized Systems. A Centralized Computer System. Chapter 18: Database System Architectures

Professional and Enterprise Edition. Hardware Requirements

An Oracle White Paper July Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide

SINGLE SYSTEM IMAGE (SSI)

Chapter 16 Distributed Processing, Client/Server, and Clusters

M.Sc. IT Semester III VIRTUALIZATION QUESTION BANK Unit 1 1. What is virtualization? Explain the five stage virtualization process. 2.

MedInformatix System Requirements

Distributed (Operating) Systems. Introduction

Cluster Implementation and Management; Scheduling

High Performance Computing. Course Notes HPC Fundamentals

Terminal Server Software and Hardware Requirements. Terminal Server. Software and Hardware Requirements. Datacolor Match Pigment Datacolor Tools

Handling Multimedia Under Desktop Virtualization for Knowledge Workers

Delphi 2015 SP1-AP1 System Requirements

Local Area Networks: Software

Distributed Systems LEEC (2005/06 2º Sem.)

OAISYS and ShoreTel: Call Recording Solution Configuration. An OAISYS White Paper

High Performance Computing. Course Notes High Performance Storage

Fido: Fast Inter-Virtual-Machine Communication for Enterprise Appliances

High Availability Databases based on Oracle 10g RAC on Linux

Distributed and Cloud Computing

Distributed Operating Systems

Types Of Operating Systems

Content Distribution Management

CS550. Distributed Operating Systems (Advanced Operating Systems) Instructor: Xian-He Sun

SMB Direct for SQL Server and Private Cloud

Sabita Kundu. Keywords: R&D libraries, Cloud Computing, Operational Expenditure, SaaS, IaaS, PaaS.

Fibre Channel Overview of the Technology. Early History and Fibre Channel Standards Development

IBM TotalStorage Network Attached Storage 300G

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

Chapter 18: Database System Architectures. Centralized Systems

IT service for life science

AuditMatic Enterprise Edition Installation Specifications

HC Emission Protected Security Workstation

A Study on the Scalability of Hybrid LS-DYNA on Multicore Architectures

Flow 3 Media Asset Management incorporating AirFlow and Automation

Single System Image with Virtualization Technology for Cluster Computing Environment

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

QUADRICS IN LINUX CLUSTERS

Scala Storage Scale-Out Clustered Storage White Paper

Cluster Grid Interconects. Tony Kay Chief Architect Enterprise Grid and Networking

COMP5426 Parallel and Distributed Computing. Distributed Systems: Client/Server and Clusters

Using Linux Clusters as VoD Servers

A Comparison of Distributed Systems: ChorusOS and Amoeba

Revit products will use multiple cores for many tasks, using up to 16 cores for nearphotorealistic

A Comparison of VMware and {Virtual Server}

Cray XT3 Supercomputer Scalable by Design CRAY XT3 DATASHEET

How To Virtualize A Storage Area Network (San) With Virtualization

SRNWP Workshop. HP Solutions and Activities in Climate & Weather Research. Michael Riedmann European Performance Center

Scheduling and Resource Management in Computational Mini-Grids

Oracle Applications Release 10.7 NCA Network Performance for the Enterprise. An Oracle White Paper January 1998

(Scale Out NAS System)

Parallel Computing. Benson Muite. benson.

Fault Tolerance & Reliability CDA Chapter 3 RAID & Sample Commercial FT Systems

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Petascale Software Challenges. Piyush Chaudhary High Performance Computing

Hitachi File Services Manager Release Notes

Client Server Computing

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

NEWSTAR ENTERPRISE and NEWSTAR Sales System Requirements

KFUPM Enterprise Network. Sadiq M. Sait

Virtualization across the organization

ICE. Client Guidelines. January 4, 2012

Hardware RAID vs. Software RAID: Which Implementation is Best for my Application?

Using VMware VMotion with Oracle Database and EMC CLARiiON Storage Systems

2. COMPUTER SYSTEM. 2.1 Introduction

lesson 1 An Overview of the Computer System

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

NETWORK MANAGEMENT SOLUTIONS

Transcription:

Network-Based Cluster Computing Professor Kai Hwang University of Southern California kaihwang@ceng.usc.edu Presentation Outline: h Cutting-Edge Technologies h Network-based Cluster Computing h Distributed Multi-Agent Computing h Innovative Cluster Applications 1

Information Technology: Today and The Future g What are hot today in 2000? 1GHzmicroprocessors, 256 Mb RAM, Gigabit Ethernet, ATM switches, Unix/SMP, Windows NT, Java, Internet, digital TV, multimedia commnications, etc. g How about beyond year 2000? 2 GHz microprocessors, 1-4 Gb RAM, NVSM (non-volatile semiconductor memory), 100 Gbps ~ 1 Tbps LANs, Satellite-based WWW, 3-30 Teraflops computers, cluster OS, Internet-based metacomputing, etc. 9/5/2000, Kai Hwang, USC 2

Competing Computer Architectures Node Complexity Distributed Systems Clusters MPPs SMPs cc-numa Single System Image 9/5/2000, Kai Hwang, USC 3

Comparison of Cluster with Other Scalable Parallel Architectures Characteristic MPP SMP & CC-NUMA Number of Nodes Node Complexity Internode Communication Job Scheduling O(100)- O(1000) Fine or medium grain Message passing or DSM Single run queue at host O(10) O(100) Medium or Coarse grain Centralized or distributed shared memory Single run queue mostly Cluster O(100) or less Medium grain Message Passing Multiple queues coordinated Distributed System O(10)- O(1000) Wide range Shared files, RPC, message passing Independent multiple Queues 9/5/2000, Kai Hwang, USC 4

Comparison of Cluster and Other Scalable Systems (cont d) System Characteristic SSI support Node & Host OS Address Space Internode Security MPP Partially N microkernels 1 monolithic OS at host Multiple (Single for DSM) SMP & CC-NUMA Always for SMP and some for NUMA One monolithic in SMP and multiple in CC- NUMA Cluster Desired N (Homogeneous or microkernel) Distributed System No N (Heterogeno us) Single Multiple Multiple Unnecessary Unnecessary Required if exposed Required 9/5/2000, Kai Hwang, USC 5

Clustering of Multiple Computers for Collective Applications Nov. 13, 1998, K. Hwang, p. 4

What is a Cluster of Computers? 2 A collection of computer nodes, interconnected by a LAN and/or a high-speed switching network, all nodes can be used individually or collectively as a cluster. 2 All computers in the cluster are glued together with middleware support for collective usage as a single computing resource, in addition to the traditional usage as individual computers. 9/5/2000, Kai Hwang, USC 7

An SSI Cluster with Hardware, Software, and Middleware Support Programming Environment (Java, C, Fortran, MPI, PVM) OS OS Web Windows User Interface Availability Infrastructure Other Subsystems (Database, OLTP, etc.) Single System Image Infrastructure OS Node Node Node Gigabit Network Interconnect 9/5/2000, Kai Hwang, USC 8

Major Cluster Design Issues g Size Scalability g Enhanced Availability g Single System Image g System Manageability g Fast Communication g Security and Encryption g Distributed Environment 9/5/2000, Kai Hwang, USC 9

Performance vs. Availability Scalable Performance MPP cc-numa SMP UP Cluster Fault-tolerant System System Availability 9/5/2000, Kai Hwang, USC 10

A PC/workstation Cluster with Distributed Disks, Network-Attached RAIDs, and I/O High-Speed Switching Network (SAN or LAN) Workstations or PCs... RAID NASD RAID Local disks forming an RADD NAP I/O Devices (Mirrored Architecture) (a) Unidirectional ring mirroring Nodes (b) Bidirectional ring mirroring (c) Bidirectional mirroring on a 3-D torus 9/5/2000, Kai Hwang, USC 11

Existing Middleware Packages Supporting SSI Cluster Computing SSI Features GLUnix TreadMarks DQS/CODI LSF NE Single control point Yes no yes yes Single entry point Yes no no no Single file hierarchy Yes yes yes yes Single memory space No yes no no Single process space Yes no no no Single I/O space No no no no Single networking No no no no Batch support Yes no yes yes Interactive support Yes yes yes yes Parallel support Yes yes yes yes Load balancing Yes no yes yes Process migration Yes no no no Job monitoring Yes no yes yes Suspend/Resume No no yes yes Dynamic resource Yes no yes yes User interface cmd-line cmd-line GUI/cmd-line GUI Fault tolerance Yes no yes yes 8

Relationships among supporting middleware modules Management level Single File Hierarchy (SFH) (NFS,AFS,xFS, Proxy) User Applications Job Management System (JMS) (GLUnix, LSF, CODINE) Distributed Shared Memory (DSM) (TreadMark, Wind Tunnel, etc) Programming level Single Process Space (SPS) Checkpointing /Process Migration (CPM) Implementation level Single I/O Space SIOS) ClusterHardwareandOSPlatform 14

The Financial Digital Library (FDL) Project at HKU g An Java-coded, Internet-based, multi-agent system monitoring more than 30 economic indicators in Hong Kong g An electronic disclosure system to improve the transparency and efficiency of financial information retrieval and processing 9/5/2000, Kai Hwang, USC 14

Information Technology for Business Globalization! Business is managed globally, overcoming the barriers in time, distance, culture, and language! Global supply chain management for multinational companies with headquarter, assembly plants, and distributors scattered all over the world 25

Distributed Multi-Agent Economic Analysis on a Cluster Platform Web Browser Agent Internet Oracle Database Database Server Agent Agent Name Server Agent Agent Agent 26

Distributed Multimedia Processing MM Servers MM Storage Server MM Transaction/ Mgmt. Services MM Multipoint Comm Server MM Enabling Networks MM Network Services MM User Platforms PC MAC Workstation Desktops (Business) Handhelds (Mobile) Video Playback Video Conf Audio Conf Media Acceleration Tech Distributed Software Infrastructure TV Set-tops (Home) 9/5/2000, Kai Hwang, USC 17

Bioinformatics : Computational Molecular Biology for Health Care! Human genome research for heath care, disease control, and new drug development! Parallel processing in the search, sorting, and alignment of DNA/protein sequences! Cluster of PCs/workstations for parallel and distributed genome research Nov. 13, 1998, K. Hwang, p. 27

Concluding Remarks: 1. Future clusters will be more scalable and providing many more SSI services. 28

Increasing demand of cluster, Java, and Internet-based applications:! Distributed multimedia processing in electronic commerce, digital entertainment, and urban management.! Dedicated digital Libraries for distance education, bioinformatics for health-care, and economic crisis management! Innovative applications in remote services, telemedicine, collaborative designs, and environmental protection. Feb.11,1999, K. Hwang p. 29