QosCosGrid Grid Technologies and Complex System Modelling

Similar documents
Cellular Computing on a Linux Cluster

Cluster, Grid, Cloud Concepts

BSC vision on Big Data and extreme scale computing

Grid Scheduling Architectures with Globus GridWay and Sun Grid Engine

A Service for Data-Intensive Computations on Virtual Clusters

MEng, BSc Applied Computer Science

Mr. Apichon Witayangkurn Department of Civil Engineering The University of Tokyo

Building Platform as a Service for Scientific Applications

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

Grid based Integration of Real-Time Value-at-Risk (VaR) Services. Abstract

Middleware and Distributed Systems. Introduction. Dr. Martin v. Löwis

Sector vs. Hadoop. A Brief Comparison Between the Two Systems

:Introducing Star-P. The Open Platform for Parallel Application Development. Yoel Jacobsen E&M Computing LTD

Remote Graphical Visualization of Large Interactive Spatial Data

Clouds vs Grids KHALID ELGAZZAR GOODWIN 531

Audio networking. François Déchelle Patrice Tisserand Simon Schampijer

Science Gateways and scalable application tools with QosCosGrid (QCG) for large communities in Grids and Clouds

Introduction to Cluster Computing

MEng, BSc Computer Science with Artificial Intelligence

Hadoop and Map-Reduce. Swati Gore

SLA BASED SERVICE BROKERING IN INTERCLOUD ENVIRONMENTS

Challenges for cloud software engineering

Scalability and Classifications

A SIMULATOR FOR LOAD BALANCING ANALYSIS IN DISTRIBUTED SYSTEMS

Seeking Opportunities for Hardware Acceleration in Big Data Analytics

Cloud Computing Simulation Using CloudSim

Analytic Modeling in Python

Digital libraries of the future and the role of libraries

Leittechnik für Bahnsysteme mit Eclipse

Objectives. Distributed Databases and Client/Server Architecture. Distributed Database. Data Fragmentation

WORKFLOW ENGINE FOR CLOUDS

CHAPTER 6 MAJOR RESULTS AND CONCLUSIONS

Speeding up MATLAB and Simulink Applications

CLOUD COMPUTING. When It's smarter to rent than to buy

Mizan: A System for Dynamic Load Balancing in Large-scale Graph Processing

Cloud Platforms, Challenges & Hadoop. Aditee Rele Karpagam Venkataraman Janani Ravi

A Flexible Cluster Infrastructure for Systems Research and Software Development

MIKE by DHI 2014 e sviluppi futuri

Key Research Challenges in Cloud Computing

Cloud Federations in Contrail

CHAPTER 1: OPERATING SYSTEM FUNDAMENTALS

IDL. Get the answers you need from your data. IDL

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

Visualisation in the Google Cloud

A Case Study - Scaling Legacy Code on Next Generation Platforms

Service Oriented Architectures

GPUs for Scientific Computing

Sensing, monitoring and actuating on the UNderwater world through a federated Research InfraStructure Extending the Future Internet SUNRISE

Scientific Computing Programming with Parallel Objects

Operating System Support for Multiprocessor Systems-on-Chip

Stream Processing on GPUs Using Distributed Multimedia Middleware

Data-Aware Service Choreographies through Transparent Data Exchange

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware

The Fastest Way to Parallel Programming for Multicore, Clusters, Supercomputers and the Cloud.

Lightpath Planning and Monitoring

An approach to grid scheduling by using Condor-G Matchmaking mechanism

Multi-GPU Load Balancing for Simulation and Rendering

NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist

Status and Integration of AP2 Monitoring and Online Steering

Introduction to Cloud Computing

Load Balancing on a Non-dedicated Heterogeneous Network of Workstations

Introduction to grid technologies, parallel and cloud computing. Alaa Osama Allam Saida Saad Mohamed Mohamed Ibrahim Gaber

Using WestGrid. Patrick Mann, Manager, Technical Operations Jan.15, 2014

Chapter 1 - Web Server Management and Cluster Topology

ANALYSIS OF GRID COMPUTING AS IT APPLIES TO HIGH VOLUME DOCUMENT PROCESSING AND OCR

The Virtual Grid Application Development Software (VGrADS) Project

A Survey Study on Monitoring Service for Grid

Automated deployment of virtualization-based research models of distributed computer systems

Environments, Services and Network Management for Green Clouds

A Performance Evaluation of Open Source Graph Databases. Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader

Develop a process for applying updates to systems, including verifying properties of the update. Create File Systems

Transcription:

QosCosGrid Grid Technologies and Complex System Modelling Pamela Burrage Krzysztof Kurowski Institute for Molecular Bioscience, University of Queensland, Australia

Vision, objectives Complex systems (motivations) Very broad application class with widely varying requirements QosCosGrid grid technologies (motivations) requirements, advanced capabilities, prototype Examples: complex systems from molecular bioscience - use cases 3 and 5 Overview and main characteristics Protein interactions, lipid rafts Metabolic pathways Design and development Overview

EU 6th Framework Programme STREP Project 2.5 years, ends in 03/2009 2 800 000 Euro Strong QCG Consortium: 11 partners (2 private companies) from 10 countries Technical Manager DIISR Australian funding, 2 years, ends 06/2009 QCG in numbers

Gap and Vision Gap = Vision Reality Still wide range of demanding applications and complex systems run only on supercomputers and/or local clusters QosCosGrid vision To address & (make first step towards closing) this gap by developing a quasi-opportunistic supercomputer based on advanced grid middleware and new programming and execution environments Quasi-opportunistic supercomputer (QoS) Quasi-opportunistic = not really opportunistic Qos uses grid technologies to deliver supercomputer-like performance Qos facilitates execution of demanding parallel and distributed applications in grids through key technologies bridging the visionreality gap of the grid

QCG Objectives 1. To develop tools for end users and complex system developers Fault-tolerant cluster-to-cluster message passing libraries based on Open MPI (C/C++/Python) and ProActive (Java) Remote complex system steering and control capabilities User and admin easy-to-use web interfaces based on GridSphere/Vine toolkits 2. To develop advanced grid middleware - Dynamic resource brokering (for complex systems simulations) Reservation and orchestration of resources, communication, synchronization and routing as known from massively parallel processors computers 3. Integrate and evaluate QCG concept with various types of complex systems (9 example use cases) and running simulations on a real prototype testbed

Complex Systems gridification Developers/Users 1. Real problem 2. Problem decomposition (including algorithm and communication structure design) 3. Agglomeration 4. Mapping 5. Execution and Control QCG grid middleware

Complex Systems categorization EGEE or TeraGrid middleware T0: No communications T1: Explicitly defined, static comm. graph T2: Explicitly defined, dynamic comm. graph T3: Cellular automata T4: Distance-dependent communication T5: Unknown (random) communication QCG grid middleware

RECV Does it matter how it goes? It took around 0.3 sec and the average data transfer was 0.1 Mb/s GEANT2 SEND AARnet

it is important not for use case developers but for the QCG grid middleware, (fully transparent for CSS developers, using same well known APIs based on MPI or ProActive/RPC) * Use case users using the QCG grid middleware have to provide only a list of requirements for their CSS (number of processes, network topologies, networks speed, hardware architectures, stage-in/out data, etc.) and then the QCG grid middleware will take care of: security (sensitive data, identity/authentication, authorization and accounting with different administrative domains) monitoring of computing, storage and network resources in our international testbed load balancing, advanced reservation and co-allocation of computing resources required for multi domain experiments parallel and distributed application control and steering * It is partially true

QosCosGrid features usability (e.g. user interface) web based interfaces for scientific users AND administrators command line tools also provided for advanced users and administrators a set of tutorials, guides, template solutions and best practices available for application developers security and trust: improved authentication and single sign-on mechanisms via web for end users improved authorization, policy control and enforcement mechanisms via web for administrators Performance, deployment Australians have been already added to QCG!

Use Case 5 (Barcelona) goals Simulates a genetic regulatory network. Involves sets of highly-coupled DEs. Need to find globally optimal sets of parameters for a given model. ByoDyn* is a computational package aimed at integrating different types of DEs, through its interface with several publicly available packages Python, BLAS, LAPACK, gnuplot, Uses QosCosGrid environment for optimization of parameters through the use of different techniques. More research in this area is conducted in the BioBridge project: http://www.biobridge.eu * ByoDyn is using Python

Use Case 3 (UQ) goals Prototype lattice of size 250 x 378 voxels: 1µm 1.5µm 1 time step ( 4 µs ) allows each molecule to move. Simulation for T = 1000 is only 0.004s real time. 600nm x 600nm, 25% rafts, fences, 2000 proteins, some obstacles: 2 mins compute-time, 0.004s real time. 600nm x 600nm, 50% rafts, 20000 proteins, FRAP: approx. 2 weeks on a PC, for 4s real time. To model a membrane 100 times as large, up to several real time seconds.

Parallel Implementation Master-slave implementation. Split membrane into vertical strips, one per slave. Track proteins as they move between slaves (unique ID). Provide report data and visualisation capability via the master.

Implementation contd. Visualisation capability Slaves send data to master (one large file) Front-end security via certificate signing Animation or snapshots at certain times

Implementation Issues Message passing via master more robust than slave-slave. Master outputs all files. Each slave has LH and RH overlap of neighbour s membrane. How much overlap? How often communicate? System integrity maintaining system dynamics.

Performance Compare timings for multi-processor computer for local clusters for remote clusters Speed-up vs frequency of communications vs volume of communication (size of membrane overlap)

Timings Examples

Technical Support Krzysztof Kurowski Original Development of Model Dan Nicolau (UQ, Oxford) John Hancock (UQ, Texas) Kevin Burrage (UQ, Oxford) Project Support Prof. Mark Ragan Other Programming Support Martin Swain (Ulster) Michal Lorenc (Hamburg) Use Case Discussions Jordi Villa i Freixa (Barcelona) George Kampis (Budapest) Acknowledgements

QCG Testbed prototype