New scheduling problems coming from grid computing

Size: px
Start display at page:

Download "New scheduling problems coming from grid computing"

Transcription

1 Electronic Notes in Discrete Mathematics 36 (2010) New scheduling problems coming from grid computing Alexandre Lissy 1 Laboratoire d Informatique Université François-Rabelais Tours, France Patrick Martineau 2 Laboratoire d Informatique Université François-Rabelais Tours, France Abstract In this article, we expose how the availability of an increasing computational power affects scheduling problems that uses grid computing to utilize this power. We first present how scheduling problems are currently handled, and the so-called nextgeneration clusters that will have different needs. Starting from those, we discuss real-life issues that will affect the scheduling and its resolutions and we expose how it affects scheduling and assignment problems. Then, we conclude with discussing some emerging methods to increase scheduling performances. Keywords: scheduling, clusters, grid, single system image, failure, checkpoint, hotswap, green it 1 alexandre.lissy@etu.univ-tours.fr 2 patrick.martineau@univ-tours.fr /$ see front matter 2010 Elsevier B.V. All rights reserved. doi: /j.endm

2 1034 A. Lissy, P. Martineau / Electronic Notes in Discrete Mathematics 36 (2010) Introduction We are interested in scheduling problems for grid computing. As per Moore s law, stating that each year, computation power doubles for the same price, one can notice that computing power available is becoming more and more important. But while this might be a good news, it also has consequences on scheduling problems. In the first section, we will present what methods are currently used for handling real clusters and how this relates to scheduling. Then, we will describe what we call next-generation clusters, and resume with what seems to be the current industrial way of evolution. Before we conclude, we will expose new problems coming from this evolution and the way they impact scheduling. 2 Currently in-use scheduling methods for clusters in production When talking about clusters in production, it either targets enormous supercomputer such as those from the Top500 list, grid-based such as Grid5000 project, or small-scale computing cluster that are dozen of computers. 2.1 Running and proved solutions All super-computer and grid shares one thing, that is critical for them to be usable, i.e. submission mecanishm. The technical details are irrelevant, what is important is the idea: a tool used to schedule jobs. Such tools are called batch schedulers and they can be as primitive as the classical cron tool, or as complex as OAR. All of these batch schedulers are very efficient scheduling software, given that they are provided with correct data as input (which is classical off-line scheduling data, such as job s availability date, duration, ressources needed (power, memory)). Issue are kind of obvious: no one is able to precisely estimate the needs. Worst may happen as batch scheduling softwares may kill tasks that are still running after their deadline, forcing users to cheat, resulting in a poor scheduling. 2.2 Next-generation solutions For the past decade, the size of clusters have been following some what like Moore s law, resulting in those machines to have more and more CPUs. Ac-

3 A. Lissy, P. Martineau / Electronic Notes in Discrete Mathematics 36 (2010) cording to Top500 3, more than the half of today s super-computers registered on the list have between 4000 and 8000 CPUs, by june One decade ago, in june 1999, there were between 65 to 128 CPUs for half of all the registered systems. It should be obvious from these figures that parallelism is heavily increasing: it has been multiplied bu more than sixty. Another recent development is aimed at reducing the level of use for supercomputing, by suppressing the batch scheduler. It is the Single System Image way. Main idea is to create a cluster of machines running on a network, and aggregate them so it looks like a single massively parallel machine. Everything is common and shared: filesystem, process space, I/O space, IPC space,... This allows to create low-grade yet powerful and easily usables clusters. Several projects 4 5 are working in this way, such as DragonFly BSD, Kerrighed, LinuxPMI, MOSIX, OpenSSI,... Those clusters have some nice features such as distant process fork, or process migration, which are scheduling problems ; deciding on which node fork a new process, or where to migrate which process. But the context for those scheduling problems is different than for classical batch scheduler: it is impossible to ask for anything such as process duration, ressources needed, etc., leading to new needs for an efficient scheduling around the cluster. 3 Impacting scheduling problems We will now present several issue linked to clusters, not yet real-life for now, but which will be as soon as those developments are used in production. For each one, we will expose how it impacts scheduling. We will first present the problems linked to cluster s life cycle. Then, we will talk about virtualization and its role. After this, we will talk about failure handling, before continuing with checkpointing. We will then talk about energetic footprint of clusters. We will finish with some considerations about communications in massively parallelized software. 3.1 Cluster s life cycle The main idea behind this title is that we see emerging techniques linked with the use of wide clusters: those clusters have hotswap (adding and/or removal of hardware) capabilities. One or several nodes might appear or disappear

4 1036 A. Lissy, P. Martineau / Electronic Notes in Discrete Mathematics 36 (2010) in the cluster. In the past, scheduling problems for clusters were much more static. But with hotswap capabilities being mainstream (and needed), it is not enough. This kind of features is not only limited to compute-only clusters, but we might also find them for any kind of massively parallel architecture, used to handle cheap but powerful storage. We will talk about the node removal later, while exposing failure handling. Impact of cluster s life cycle on scheduling problem can be resumed to: robustness. It is important to be aware that we will have some nodes unavailable when using the cluster, and the proposed solution should tolerate this and not degrade the objective. 3.2 Virtualization is getting cheaper Virtualization is a rather old (IBM introduced it with CP/CMS a couple of decades ago) yet useful tool. For the past few years, it became mainstream tool (VMware, VirtualBox, Xen, KVM). Main reasons why it might be interesting for scheduling is that it allows for hardware-independency, abstraction for running jobs and this without an important overhead. Last, many virtualization tools also provide a way to migrate virtual machines between physical hosts. With good standard hardware, it might be very fast (a couple of seconds), and has no impact: for code running, it might just look like the box has been freezed. Plus, every mainstream processor has virtualisation instructions. Virtualization can be used to enable high availability and redundancy of services. Scheduling problems might take advantage of this kind of functionnality to duplicate some critical jobs and thus being able to ensure that they will be terminated. Current literature treats of how we can use virtualization as a piece of solution for scheduling problem [8]. 3.3 The checkpointing issue The main idea behind this concept is to freeze the process, and save it. It can be useful to address several flaws, such as: a node running a process crashed, and your cluster doesn t handle this. So you have to restart your job. You can restart your process to what it was when checkpointed, and avoid loosing too many results. Another use might be for handling bugs in your program: the process crashed after several weeks of computation. Yet, efficient checkpointing is a non-trivial problem and has its roots in scheduling mainly. There are several sides of the problem to be solved, from

5 A. Lissy, P. Martineau / Electronic Notes in Discrete Mathematics 36 (2010) what and how to checkpointing, to when checkpointing. The second part of those questions, the when, is the one that will have impact on scheduling problems. Also, scheduling algorithm have to consider the time needed to make the checkpoint of the process when assigning jobs to node, and computing a solution. It is also linked to nodes failure handling: in order to checkpoint as later as possible before the node is fails. Current literature on checkpointing can be found [9]. 3.4 Failure handling This is a major topic, which several roots. First of all, hardware will fail. On small-scale system, it might be rather rare, but when you start to have enough hardware, you will have to face it. So, software (here, scheduling), needs to be aware. Failure is one component of cluster s life cycle. Even so it is the bad side of life, death, cluster should handle failures of one or several nodes in a way it doesn t affect, or at least as minimum as possible, the remaining nodes, in a way that preserves running jobs. When we talked about loosing one node, it is in a general sense, it does not specifically implies that hardware is broken. But such a node cannot be used for current scheduling in the cluster. It is derivated from cluster s life cycle, but it is a bit different. We already talked about the impact of appearance and disappearance of nodes at runtime. But there are other questions directly related specifically to the failure of a node. Scheduling should be able to try to prevent node failure and effectively handle those failures in an efficient way. Handling of failure is concomittant with tools like virtualization, checkpoint/restart and migration, as they can be used to address the problem So the impact for scheduling is major. There are several ways to avoid failure, or at least its consequences, but they all have a cost that need to be taken into account at scheduling level: it is possible to run several instances of the same process on several nodes but it will lower efficiency. This can be circumvented by enabling checkpoint feature, but then we should consider the checkpointing cost not to mention the question of checkpointing itself as shown above ; or you can use migration to continue execution on another node prior to failure. This can rely on checkpointing, or failure prediction, wether you want some proactive or reactive behavior. Once again, it has a cost. Failure prediction/prevention and proactive scheduling are becoming more and more popular as subjects, current literature is available [4,1]. Financial sector works on similar issues [6]. Literature also contains analysis of past

6 1038 A. Lissy, P. Martineau / Electronic Notes in Discrete Mathematics 36 (2010) failure [5]. 3.5 Dealing with energetic footprint This topic might seem rather strange, but energetic consumption will become more and more critical. It is both ecologically, economically and intellectually pertinent to achieve a better use of power. There are hardware-related advancements needed to achieve this, but software and scheduling is also a major piece of the puzzle. A cluster that schedule in a green-way will allow for more power to be available at the same ecological price. There are hundreds of ways to consider this side, but we can already see some companies results. Some huge companies might schedule jobs based on weather and time, in order to use datacenters that are needing the less cooling for this workload (think: desert and night). Combined with virtualization, some are migrating their workload towards the night, reducing the cooling needed. Some company is starting to use a freshly built datacenter with green it in mind. An example of this is their dynamic cooling infrastructure, allowing the power to only those coolers that are connected to racks that are effectively used, even modulating cooling power according to consumed power by those racks. Solutions to lower energy consumption will have both hardware and software side, and scheduling will be a major component. Scheduling algorithm for clusters should consider the energetic cost of their decisions. Minimizing this can be achieved using some of the tools presented above, such as checkpointing and virtualization. With all those, and some eco-friendly assignment scheduler in a cluster, we might be able to only power what is needed of the cluster, and thus limit the use of cooling devices, maximize electricity usage to lower electricity conversion-related losses. Scheduling energyaware can be found in [12,13]. 3.6 Communications are growing Another point that might affect scheduling community is the growing of parallelism. Even five years ago, we only had one-core based mainstream computers. Now, mainstream is reaching, on a mid-term scale, quad-core. And six-core is tomorrow. This is an illustration of how massively parallel architecture are going to be more and more used. A 1680-cores cluster is cheap, now. It is just 70 computers. That makes a large computation power available, and using it as best as possible when needed to have massively parallelized software. One thing might alter the performance at this point is the communication, which

7 A. Lissy, P. Martineau / Electronic Notes in Discrete Mathematics 36 (2010) won t stop. There are hardware methods to solve this, but a good assignment will avoid the need for expensive hardware solution. The more the tasks that needs to communicate are close, the better. Massively-parallel tasks will still need to communicate, and this will lead to an increase in communications. We stated that this is generally the most important part of negative speedup in parallel applications. So, if we want still to have massively parallel code being able to scale, it will be needed to schedule them in a way that reduces the performances drawback caused by communication, by trying to put the communication-intensive tasks together. The literature contains references about scheduling considering communications, [2]. 4 Conclusion Today s hardware is powerful, and there is still progress to be done to efficiently use it. For example, many of the in-production scheduling systems for clusters (of any kind, it can either be a compute-cluster targetted at running intensive and parallels tasks or a pool of webservers), are using naive and basic scheduling algorithms, that doesn t consider topics we covered. We can cite for example [10]. Other works in the literature that covers scheduling in grid are a bit more complex [7]. We can also find references of efficiency-oriented scheduling, for example [11], inspired by economy. Bi-criteria scheduling is covered, too [3]. Most of them will have an impact on energy consumption. The Green IT shouldn t only be seen as a marketing green-washing by some vendors, but as the sign of a need that has to be addressed. Another important point is the maturity of some Single-System Image solutions. Those clusters are easier to use than supercomputers, and improving their scheduling tools will help providing better performances at a lower cost. References [1] Benjamin,. T., Khoo and B. Veeravalli, Pro-active failure handling mechanisms for scheduling in grid computing environments, Journal of Parallel and Distributed Computing (2009). [2] Dodonov, E. and R. F. de Mello, A novel approach for distributed application scheduling based on prediction of communication events, Future Generation Computer Systems (2009).

8 1040 A. Lissy, P. Martineau / Electronic Notes in Discrete Mathematics 36 (2010) [3] Dutot, P.-F., L. Eyraud, G. Mounié and D. Trystram, Bi-criteria algorithm for scheduling jobs on cluster platforms, ACM Symposium on Parallel Algorithms and Architectures (2004) (2005). [4] Fu, S., Failure-aware construction and reconfiguration of distributed virtual machines for high availability computing, in: CCGRID 09: Proceedings of the th IEEE/ACM International Symposium on Cluster Computing and the Grid (2009), pp [5] Fu, S. and C.-Z. Xu, Exploring event correlation for failure prediction in coalitions of clusters, in: SC 07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing (2007), pp [6] Huysmans, J., B. Baesens, J. Vanthienen and T. V. Gestel, Failure prediction with self organizing maps, Expert Syst. Appl. 30 (2006), pp [7] Lin, G. and R. Rajaraman, Approximation algorithms for multiprocessor scheduling under uncertainty (2007). [8] Pérotin, M., Calcul Haute Performance sur Matériel Générique, Ph.D. thesis, Laboratoire d Informatique de l École Polytechnique de l Université de Tours (2008). [9] Rough, J. T. and A. M. Goscinski, The development of an efficient checkpointing facility exploiting operating systems services of the genesis cluster operating system, Future Gener. Comput. Syst. 20 (2004), pp [10] Rudolph, L., M. Silvkin-Allalouf and E. Upfal, A simple load balancing scheme for task allocation in parallel machines, ACM Symposium on Parallel Algorithms and Architectures (1991), pp [11] Sherwani, J., N. Ali, N. Lotia, Z. Hayat and R. Buyya, Libra: An economy driven job scheduling system for clusters, Technical Report Technical Report, July 2002, Dept. of Computer Science and Software Engineering, The University of Melbourne (2002). [12] Subrata, R., A. Y. Zomaya and B. Landfeldt, Cooperative power-aware scheduling in grid computing environments, Journal of Parallel and Distributed Computing 70 (2010), pp [13] Tsao, S.-L. and Y.-L. Chen, Energy-efficient packet scheduling algorithms for real-time communications in a mobile wimax system, Computer Communications 31 (2008), pp

Dynamic resource management for energy saving in the cloud computing environment

Dynamic resource management for energy saving in the cloud computing environment Dynamic resource management for energy saving in the cloud computing environment Liang-Teh Lee, Kang-Yuan Liu, and Hui-Yang Huang Department of Computer Science and Engineering, Tatung University, Taiwan

More information

Kerrighed: use cases. Cyril Brulebois. Kerrighed. Kerlabs

Kerrighed: use cases. Cyril Brulebois. Kerrighed. Kerlabs Kerrighed: use cases Cyril Brulebois cyril.brulebois@kerlabs.com Kerrighed http://www.kerrighed.org/ Kerlabs http://www.kerlabs.com/ 1 / 23 Introducing Kerrighed What s Kerrighed? Single-System Image (SSI)

More information

Setting deadlines and priorities to the tasks to improve energy efficiency in cloud computing

Setting deadlines and priorities to the tasks to improve energy efficiency in cloud computing Setting deadlines and priorities to the tasks to improve energy efficiency in cloud computing Problem description Cloud computing is a technology used more and more every day, requiring an important amount

More information

Energy Conscious Virtual Machine Migration by Job Shop Scheduling Algorithm

Energy Conscious Virtual Machine Migration by Job Shop Scheduling Algorithm Energy Conscious Virtual Machine Migration by Job Shop Scheduling Algorithm Shanthipriya.M 1, S.T.Munusamy 2 ProfSrinivasan. R 3 M.Tech (IT) Student, Department of IT, PSV College of Engg & Tech, Krishnagiri,

More information

Google File System. Web and scalability

Google File System. Web and scalability Google File System Web and scalability The web: - How big is the Web right now? No one knows. - Number of pages that are crawled: o 100,000 pages in 1994 o 8 million pages in 2005 - Crawlable pages might

More information

Energy Constrained Resource Scheduling for Cloud Environment

Energy Constrained Resource Scheduling for Cloud Environment Energy Constrained Resource Scheduling for Cloud Environment 1 R.Selvi, 2 S.Russia, 3 V.K.Anitha 1 2 nd Year M.E.(Software Engineering), 2 Assistant Professor Department of IT KSR Institute for Engineering

More information

Installing & Using KVM with Virtual Machine Manager COSC 495

Installing & Using KVM with Virtual Machine Manager COSC 495 Installing & Using KVM with Virtual Machine Manager COSC 495 1 Abstract:. There are many different hypervisors and virtualization software available for use. One commonly use hypervisor in the Linux system

More information

A Dynamic Resource Management with Energy Saving Mechanism for Supporting Cloud Computing

A Dynamic Resource Management with Energy Saving Mechanism for Supporting Cloud Computing A Dynamic Resource Management with Energy Saving Mechanism for Supporting Cloud Computing Liang-Teh Lee, Kang-Yuan Liu, Hui-Yang Huang and Chia-Ying Tseng Department of Computer Science and Engineering,

More information

A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems

A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems Anton Beloglazov, Rajkumar Buyya, Young Choon Lee, and Albert Zomaya Present by Leping Wang 1/25/2012 Outline Background

More information

This is an author-deposited version published in : http://oatao.univ-toulouse.fr/ Eprints ID : 12902

This is an author-deposited version published in : http://oatao.univ-toulouse.fr/ Eprints ID : 12902 Open Archive TOULOUSE Archive Ouverte (OATAO) OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible. This is an author-deposited

More information

Provisioning and Resource Management at Large Scale (Kadeploy and OAR)

Provisioning and Resource Management at Large Scale (Kadeploy and OAR) Provisioning and Resource Management at Large Scale (Kadeploy and OAR) Olivier Richard Laboratoire d Informatique de Grenoble (LIG) Projet INRIA Mescal 31 octobre 2007 Olivier Richard ( Laboratoire d Informatique

More information

VMware Server 2.0 Essentials. Virtualization Deployment and Management

VMware Server 2.0 Essentials. Virtualization Deployment and Management VMware Server 2.0 Essentials Virtualization Deployment and Management . This PDF is provided for personal use only. Unauthorized use, reproduction and/or distribution strictly prohibited. All rights reserved.

More information

Real Time Network Server Monitoring using Smartphone with Dynamic Load Balancing

Real Time Network Server Monitoring using Smartphone with Dynamic Load Balancing www.ijcsi.org 227 Real Time Network Server Monitoring using Smartphone with Dynamic Load Balancing Dhuha Basheer Abdullah 1, Zeena Abdulgafar Thanoon 2, 1 Computer Science Department, Mosul University,

More information

Chapter 7: Distributed Systems: Warehouse-Scale Computing. Fall 2011 Jussi Kangasharju

Chapter 7: Distributed Systems: Warehouse-Scale Computing. Fall 2011 Jussi Kangasharju Chapter 7: Distributed Systems: Warehouse-Scale Computing Fall 2011 Jussi Kangasharju Chapter Outline Warehouse-scale computing overview Workloads and software infrastructure Failures and repairs Note:

More information

Migrate workloads back and forth across diverse physical and virtual platforms using the same interfaces, policies, and performance analytics

Migrate workloads back and forth across diverse physical and virtual platforms using the same interfaces, policies, and performance analytics WHITE PAPER Novell Holds Down Data Center Costs with PlateSpin Workload Management Solutions Sponsored by: Novell Mary Johnston Turner June 2009 IDC OPINION Global Headquarters: 5 Speen Street Framingham,

More information

Analysis of Issues with Load Balancing Algorithms in Hosted (Cloud) Environments

Analysis of Issues with Load Balancing Algorithms in Hosted (Cloud) Environments Analysis of Issues with Load Balancing Algorithms in Hosted (Cloud) Environments Branko Radojević *, Mario Žagar ** * Croatian Academic and Research Network (CARNet), Zagreb, Croatia ** Faculty of Electrical

More information

Parallels Virtuozzo Containers

Parallels Virtuozzo Containers Parallels Virtuozzo Containers White Paper Greener Virtualization www.parallels.com Version 1.0 Greener Virtualization Operating system virtualization by Parallels Virtuozzo Containers from Parallels is

More information

FAULT TOLERANCE FOR MULTIPROCESSOR SYSTEMS VIA TIME REDUNDANT TASK SCHEDULING

FAULT TOLERANCE FOR MULTIPROCESSOR SYSTEMS VIA TIME REDUNDANT TASK SCHEDULING FAULT TOLERANCE FOR MULTIPROCESSOR SYSTEMS VIA TIME REDUNDANT TASK SCHEDULING Hussain Al-Asaad and Alireza Sarvi Department of Electrical & Computer Engineering University of California Davis, CA, U.S.A.

More information

Grid Scheduling Dictionary of Terms and Keywords

Grid Scheduling Dictionary of Terms and Keywords Grid Scheduling Dictionary Working Group M. Roehrig, Sandia National Laboratories W. Ziegler, Fraunhofer-Institute for Algorithms and Scientific Computing Document: Category: Informational June 2002 Status

More information

ENERGY EFFICIENT VIRTUAL MACHINE ASSIGNMENT BASED ON ENERGY CONSUMPTION AND RESOURCE UTILIZATION IN CLOUD NETWORK

ENERGY EFFICIENT VIRTUAL MACHINE ASSIGNMENT BASED ON ENERGY CONSUMPTION AND RESOURCE UTILIZATION IN CLOUD NETWORK International Journal of Computer Engineering & Technology (IJCET) Volume 7, Issue 1, Jan-Feb 2016, pp. 45-53, Article ID: IJCET_07_01_006 Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=7&itype=1

More information

Moving Virtual Storage to the Cloud. Guidelines for Hosters Who Want to Enhance Their Cloud Offerings with Cloud Storage

Moving Virtual Storage to the Cloud. Guidelines for Hosters Who Want to Enhance Their Cloud Offerings with Cloud Storage Moving Virtual Storage to the Cloud Guidelines for Hosters Who Want to Enhance Their Cloud Offerings with Cloud Storage Table of Contents Overview... 1 Understanding the Storage Problem... 1 What Makes

More information

A Distributed Render Farm System for Animation Production

A Distributed Render Farm System for Animation Production A Distributed Render Farm System for Animation Production Jiali Yao, Zhigeng Pan *, Hongxin Zhang State Key Lab of CAD&CG, Zhejiang University, Hangzhou, 310058, China {yaojiali, zgpan, zhx}@cad.zju.edu.cn

More information

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, sborkar95@gmail.com Assistant Professor, Information

More information

Distributed Operating Systems. Cluster Systems

Distributed Operating Systems. Cluster Systems Distributed Operating Systems Cluster Systems Ewa Niewiadomska-Szynkiewicz ens@ia.pw.edu.pl Institute of Control and Computation Engineering Warsaw University of Technology E&IT Department, WUT 1 1. Cluster

More information

Analysis and Research of Cloud Computing System to Comparison of Several Cloud Computing Platforms

Analysis and Research of Cloud Computing System to Comparison of Several Cloud Computing Platforms Volume 1, Issue 1 ISSN: 2320-5288 International Journal of Engineering Technology & Management Research Journal homepage: www.ijetmr.org Analysis and Research of Cloud Computing System to Comparison of

More information

Introduction to Cloud Computing

Introduction to Cloud Computing Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015 RESEARCH ARTICLE OPEN ACCESS Ensuring Reliability and High Availability in Cloud by Employing a Fault Tolerance Enabled Load Balancing Algorithm G.Gayathri [1], N.Prabakaran [2] Department of Computer

More information

A Survey on Virtual Machine Security

A Survey on Virtual Machine Security A Survey on Virtual Machine Security Jenni Susan Reuben Helsinki University of Technology jreubens@cc.hut.fi Abstract Virtualization plays a major role in helping the organizations to reduce the operational

More information

A Multi-criteria Class-based Job Scheduler for Large Computing Farms

A Multi-criteria Class-based Job Scheduler for Large Computing Farms A Multi-criteria Class-based Job Scheduler for Large Computing Farms R. Baraglia 1, P. Dazzi 1, and R. Ferrini 1 1 ISTI A. Faedo, CNR, Pisa, Italy Abstract In this paper we propose a new multi-criteria

More information

Moving Virtual Storage to the Cloud

Moving Virtual Storage to the Cloud Moving Virtual Storage to the Cloud White Paper Guidelines for Hosters Who Want to Enhance Their Cloud Offerings with Cloud Storage www.parallels.com Table of Contents Overview... 3 Understanding the Storage

More information

Joe Young, Senior Windows Administrator, Hostway

Joe Young, Senior Windows Administrator, Hostway Many of our enterprise customers wanted dedicated virtual servers that offered a much higher degree of isolation... we needed to step up our virtualization efforts to stay competitive." Joe Young, Senior

More information

Parallel Computing. Benson Muite. benson.muite@ut.ee http://math.ut.ee/ benson. https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage

Parallel Computing. Benson Muite. benson.muite@ut.ee http://math.ut.ee/ benson. https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage Parallel Computing Benson Muite benson.muite@ut.ee http://math.ut.ee/ benson https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage 3 November 2014 Hadoop, Review Hadoop Hadoop History Hadoop Framework

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Background The command over cloud computing infrastructure is increasing with the growing demands of IT infrastructure during the changed business scenario of the 21 st Century.

More information

Enhancing the Scalability of Virtual Machines in Cloud

Enhancing the Scalability of Virtual Machines in Cloud Enhancing the Scalability of Virtual Machines in Cloud Chippy.A #1, Ashok Kumar.P #2, Deepak.S #3, Ananthi.S #4 # Department of Computer Science and Engineering, SNS College of Technology Coimbatore, Tamil

More information

Router Architectures

Router Architectures Router Architectures An overview of router architectures. Introduction What is a Packet Switch? Basic Architectural Components Some Example Packet Switches The Evolution of IP Routers 2 1 Router Components

More information

Big Data and Cloud Computing for GHRSST

Big Data and Cloud Computing for GHRSST Big Data and Cloud Computing for GHRSST Jean-Francois Piollé (jfpiolle@ifremer.fr) Frédéric Paul, Olivier Archer CERSAT / Institut Français de Recherche pour l Exploitation de la Mer Facing data deluge

More information

Best Practices for Managing Virtualized Environments

Best Practices for Managing Virtualized Environments WHITE PAPER Introduction... 2 Reduce Tool and Process Sprawl... 2 Control Virtual Server Sprawl... 3 Effectively Manage Network Stress... 4 Reliably Deliver Application Services... 5 Comprehensively Manage

More information

Energy Efficient MapReduce

Energy Efficient MapReduce Energy Efficient MapReduce Motivation: Energy consumption is an important aspect of datacenters efficiency, the total power consumption in the united states has doubled from 2000 to 2005, representing

More information

Grid Computing Vs. Cloud Computing

Grid Computing Vs. Cloud Computing International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 577-582 International Research Publications House http://www. irphouse.com /ijict.htm Grid

More information

Advanced Core Operating System (ACOS): Experience the Performance

Advanced Core Operating System (ACOS): Experience the Performance WHITE PAPER Advanced Core Operating System (ACOS): Experience the Performance Table of Contents Trends Affecting Application Networking...3 The Era of Multicore...3 Multicore System Design Challenges...3

More information

There are a number of factors that increase the risk of performance problems in complex computer and software systems, such as e-commerce systems.

There are a number of factors that increase the risk of performance problems in complex computer and software systems, such as e-commerce systems. ASSURING PERFORMANCE IN E-COMMERCE SYSTEMS Dr. John Murphy Abstract Performance Assurance is a methodology that, when applied during the design and development cycle, will greatly increase the chances

More information

x64 Servers: Do you want 64 or 32 bit apps with that server?

x64 Servers: Do you want 64 or 32 bit apps with that server? TMurgent Technologies x64 Servers: Do you want 64 or 32 bit apps with that server? White Paper by Tim Mangan TMurgent Technologies February, 2006 Introduction New servers based on what is generally called

More information

Load Balancing in the Cloud Computing Using Virtual Machine Migration: A Review

Load Balancing in the Cloud Computing Using Virtual Machine Migration: A Review Load Balancing in the Cloud Computing Using Virtual Machine Migration: A Review 1 Rukman Palta, 2 Rubal Jeet 1,2 Indo Global College Of Engineering, Abhipur, Punjab Technical University, jalandhar,india

More information

Power Consumption Based Cloud Scheduler

Power Consumption Based Cloud Scheduler Power Consumption Based Cloud Scheduler Wu Li * School of Software, Shanghai Jiaotong University Shanghai, 200240, China. * Corresponding author. Tel.: 18621114210; email: defaultuser@sjtu.edu.cn Manuscript

More information

3 - Introduction to Operating Systems

3 - Introduction to Operating Systems 3 - Introduction to Operating Systems Mark Handley What is an Operating System? An OS is a program that: manages the computer hardware. provides the basis on which application programs can be built and

More information

Design and Implementation of the Heterogeneous Multikernel Operating System

Design and Implementation of the Heterogeneous Multikernel Operating System 223 Design and Implementation of the Heterogeneous Multikernel Operating System Yauhen KLIMIANKOU Department of Computer Systems and Networks, Belarusian State University of Informatics and Radioelectronics,

More information

A Robust Dynamic Load-balancing Scheme for Data Parallel Application on Message Passing Architecture

A Robust Dynamic Load-balancing Scheme for Data Parallel Application on Message Passing Architecture A Robust Dynamic Load-balancing Scheme for Data Parallel Application on Message Passing Architecture Yangsuk Kee Department of Computer Engineering Seoul National University Seoul, 151-742, Korea Soonhoi

More information

ENERGY-EFFICIENT TASK SCHEDULING ALGORITHMS FOR CLOUD DATA CENTERS

ENERGY-EFFICIENT TASK SCHEDULING ALGORITHMS FOR CLOUD DATA CENTERS ENERGY-EFFICIENT TASK SCHEDULING ALGORITHMS FOR CLOUD DATA CENTERS T. Jenifer Nirubah 1, Rose Rani John 2 1 Post-Graduate Student, Department of Computer Science and Engineering, Karunya University, Tamil

More information

EFFICIENT SCHEDULING STRATEGY USING COMMUNICATION AWARE SCHEDULING FOR PARALLEL JOBS IN CLUSTERS

EFFICIENT SCHEDULING STRATEGY USING COMMUNICATION AWARE SCHEDULING FOR PARALLEL JOBS IN CLUSTERS EFFICIENT SCHEDULING STRATEGY USING COMMUNICATION AWARE SCHEDULING FOR PARALLEL JOBS IN CLUSTERS A.Neela madheswari 1 and R.S.D.Wahida Banu 2 1 Department of Information Technology, KMEA Engineering College,

More information

Virtualization Technology (or how my Windows computer gave birth to a bunch of Linux computers)

Virtualization Technology (or how my Windows computer gave birth to a bunch of Linux computers) Virtualization Technology (or how my Windows computer gave birth to a bunch of Linux computers) The purpose of this document is to walk you through creating a Linux Virtual Machine (a guest Operating System)

More information

IBM System x reference architecture solutions for big data

IBM System x reference architecture solutions for big data IBM System x reference architecture solutions for big data Easy-to-implement hardware, software and services for analyzing data at rest and data in motion Highlights Accelerates time-to-value with scalable,

More information

The Ultimate in Scale-Out Storage for HPC and Big Data

The Ultimate in Scale-Out Storage for HPC and Big Data Node Inventory Health and Active Filesystem Throughput Monitoring Asset Utilization and Capacity Statistics Manager brings to life powerful, intuitive, context-aware real-time monitoring and proactive

More information

The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang

The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang Nanjing Communications

More information

- Behind The Cloud -

- Behind The Cloud - - Behind The Cloud - Infrastructure and Technologies used for Cloud Computing Alexander Huemer, 0025380 Johann Taferl, 0320039 Florian Landolt, 0420673 Seminar aus Informatik, University of Salzburg Overview

More information

INCREASING SERVER UTILIZATION AND ACHIEVING GREEN COMPUTING IN CLOUD

INCREASING SERVER UTILIZATION AND ACHIEVING GREEN COMPUTING IN CLOUD INCREASING SERVER UTILIZATION AND ACHIEVING GREEN COMPUTING IN CLOUD M.Rajeswari 1, M.Savuri Raja 2, M.Suganthy 3 1 Master of Technology, Department of Computer Science & Engineering, Dr. S.J.S Paul Memorial

More information

Cloud Management: Knowing is Half The Battle

Cloud Management: Knowing is Half The Battle Cloud Management: Knowing is Half The Battle Raouf BOUTABA David R. Cheriton School of Computer Science University of Waterloo Joint work with Qi Zhang, Faten Zhani (University of Waterloo) and Joseph

More information

Virtualization. Dr. Yingwu Zhu

Virtualization. Dr. Yingwu Zhu Virtualization Dr. Yingwu Zhu What is virtualization? Virtualization allows one computer to do the job of multiple computers. Virtual environments let one computer host multiple operating systems at the

More information

Experience with Server Self Service Center (S3C)

Experience with Server Self Service Center (S3C) Experience with Server Self Service Center (S3C) Juraj Sucik, Sebastian Bukowiec IT Department, CERN, CH-1211 Genève 23, Switzerland E-mail: juraj.sucik@cern.ch, sebastian.bukowiec@cern.ch Abstract. CERN

More information

Infrastructure as a Service (IaaS)

Infrastructure as a Service (IaaS) Infrastructure as a Service (IaaS) (ENCS 691K Chapter 4) Roch Glitho, PhD Associate Professor and Canada Research Chair My URL - http://users.encs.concordia.ca/~glitho/ References 1. R. Moreno et al.,

More information

Measuring Energy Efficiency in a Data Center

Measuring Energy Efficiency in a Data Center The goals of the greening of the Data Center are to minimize energy consumption and reduce the emission of green house gases (carbon footprint) while maximizing IT performance. Energy efficiency metrics

More information

Operating System for the K computer

Operating System for the K computer Operating System for the K computer Jun Moroo Masahiko Yamada Takeharu Kato For the K computer to achieve the world s highest performance, Fujitsu has worked on the following three performance improvements

More information

TOP TEN CONSIDERATIONS

TOP TEN CONSIDERATIONS White Paper TOP TEN CONSIDERATIONS FOR CHOOSING A SERVER VIRTUALIZATION TECHNOLOGY Learn more at www.swsoft.com/virtuozzo Published: July 2006 Revised: July 2006 Table of Contents Introduction... 3 Technology

More information

A Novel Switch Mechanism for Load Balancing in Public Cloud

A Novel Switch Mechanism for Load Balancing in Public Cloud International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) A Novel Switch Mechanism for Load Balancing in Public Cloud Kalathoti Rambabu 1, M. Chandra Sekhar 2 1 M. Tech (CSE), MVR College

More information

Heterogeneous Workload Consolidation for Efficient Management of Data Centers in Cloud Computing

Heterogeneous Workload Consolidation for Efficient Management of Data Centers in Cloud Computing Heterogeneous Workload Consolidation for Efficient Management of Data Centers in Cloud Computing Deep Mann ME (Software Engineering) Computer Science and Engineering Department Thapar University Patiala-147004

More information

A Survey on Resource Provisioning in Cloud

A Survey on Resource Provisioning in Cloud RESEARCH ARTICLE OPEN ACCESS A Survey on Resource in Cloud M.Uthaya Banu*, M.Subha** *,**(Department of Computer Science and Engineering, Regional Centre of Anna University, Tirunelveli) ABSTRACT Cloud

More information

Challenges and Importance of Green Data Center on Virtualization Environment

Challenges and Importance of Green Data Center on Virtualization Environment Challenges and Importance of Green Data Center on Virtualization Environment Abhishek Singh Department of Information Technology Amity University, Noida, Uttar Pradesh, India Priyanka Upadhyay Department

More information

Multifaceted Resource Management for Dealing with Heterogeneous Workloads in Virtualized Data Centers

Multifaceted Resource Management for Dealing with Heterogeneous Workloads in Virtualized Data Centers Multifaceted Resource Management for Dealing with Heterogeneous Workloads in Virtualized Data Centers Íñigo Goiri, J. Oriol Fitó, Ferran Julià, Ramón Nou, Josep Ll. Berral, Jordi Guitart and Jordi Torres

More information

A Bi-Objective Approach for Cloud Computing Systems

A Bi-Objective Approach for Cloud Computing Systems A Bi-Objective Approach for Cloud Computing Systems N.Geethanjali 1, M.Ramya 2 Assistant Professor, Department of Computer Science, Christ The King Engineering College 1, 2 ABSTRACT: There are Various

More information

Auto-Scaling Model for Cloud Computing System

Auto-Scaling Model for Cloud Computing System Auto-Scaling Model for Cloud Computing System Che-Lun Hung 1*, Yu-Chen Hu 2 and Kuan-Ching Li 3 1 Dept. of Computer Science & Communication Engineering, Providence University 2 Dept. of Computer Science

More information

Recommended hardware system configurations for ANSYS users

Recommended hardware system configurations for ANSYS users Recommended hardware system configurations for ANSYS users The purpose of this document is to recommend system configurations that will deliver high performance for ANSYS users across the entire range

More information

Figure 1. The cloud scales: Amazon EC2 growth [2].

Figure 1. The cloud scales: Amazon EC2 growth [2]. - Chung-Cheng Li and Kuochen Wang Department of Computer Science National Chiao Tung University Hsinchu, Taiwan 300 shinji10343@hotmail.com, kwang@cs.nctu.edu.tw Abstract One of the most important issues

More information

Chapter 1 Computer System Overview

Chapter 1 Computer System Overview Operating Systems: Internals and Design Principles Chapter 1 Computer System Overview Eighth Edition By William Stallings Operating System Exploits the hardware resources of one or more processors Provides

More information

A Study on the Application of Existing Load Balancing Algorithms for Large, Dynamic, Heterogeneous Distributed Systems

A Study on the Application of Existing Load Balancing Algorithms for Large, Dynamic, Heterogeneous Distributed Systems A Study on the Application of Existing Load Balancing Algorithms for Large, Dynamic, Heterogeneous Distributed Systems RUPAM MUKHOPADHYAY, DIBYAJYOTI GHOSH AND NANDINI MUKHERJEE Department of Computer

More information

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform A B M Moniruzzaman 1, Kawser Wazed Nafi 2, Prof. Syed Akhter Hossain 1 and Prof. M. M. A. Hashem 1 Department

More information

High performance ETL Benchmark

High performance ETL Benchmark High performance ETL Benchmark Author: Dhananjay Patil Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 07/02/04 Email: erg@evaltech.com Abstract: The IBM server iseries

More information

Manufacturer Saves $1.5 Million, Caps IT Costs by Using Hyper-V Technology

Manufacturer Saves $1.5 Million, Caps IT Costs by Using Hyper-V Technology Microsoft Virtualization: Data Center to Desktop Customer Solution Case Study Manufacturer Saves $1.5 Million, Caps IT Costs by Using Hyper-V Technology Overview Country or Region: Slovenia Industry: Manufacturing

More information

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Introduction

More information

How To Use A Cloud For A Local Cluster

How To Use A Cloud For A Local Cluster Marcos Dias de Assunção 1,2, Alexandre di Costanzo 1 and Rajkumar Buyya 1 1 Department of Computer Science and Software Engineering 2 National ICT Australia (NICTA) Victoria Research Laboratory The University

More information

The Fastest Way to Parallel Programming for Multicore, Clusters, Supercomputers and the Cloud.

The Fastest Way to Parallel Programming for Multicore, Clusters, Supercomputers and the Cloud. White Paper 021313-3 Page 1 : A Software Framework for Parallel Programming* The Fastest Way to Parallel Programming for Multicore, Clusters, Supercomputers and the Cloud. ABSTRACT Programming for Multicore,

More information

An Implementation of Active Data Technology

An Implementation of Active Data Technology White Paper by: Mario Morfin, PhD Terri Chu, MEng Stephen Chen, PhD Robby Burko, PhD Riad Hartani, PhD An Implementation of Active Data Technology October 2015 In this paper, we build the rationale for

More information

solution brief September 2011 Can You Effectively Plan For The Migration And Management of Systems And Applications on Vblock Platforms?

solution brief September 2011 Can You Effectively Plan For The Migration And Management of Systems And Applications on Vblock Platforms? solution brief September 2011 Can You Effectively Plan For The Migration And Management of Systems And Applications on Vblock Platforms? CA Capacity Management and Reporting Suite for Vblock Platforms

More information

Axceleon s CloudFuzion Turbocharges 3D Rendering On Amazon s EC2

Axceleon s CloudFuzion Turbocharges 3D Rendering On Amazon s EC2 Axceleon s CloudFuzion Turbocharges 3D Rendering On Amazon s EC2 In the movie making, visual effects and 3D animation industrues meeting project and timing deadlines is critical to success. Poor quality

More information

Efficient Data Management Support for Virtualized Service Providers

Efficient Data Management Support for Virtualized Service Providers Efficient Data Management Support for Virtualized Service Providers Íñigo Goiri, Ferran Julià and Jordi Guitart Barcelona Supercomputing Center - Technical University of Catalonia Jordi Girona 31, 834

More information

Cloud Based Distributed Databases: The Future Ahead

Cloud Based Distributed Databases: The Future Ahead Cloud Based Distributed Databases: The Future Ahead Arpita Mathur Mridul Mathur Pallavi Upadhyay Abstract Fault tolerant systems are necessary to be there for distributed databases for data centers or

More information

BridgeWays Management Pack for VMware ESX

BridgeWays Management Pack for VMware ESX Bridgeways White Paper: Management Pack for VMware ESX BridgeWays Management Pack for VMware ESX Ensuring smooth virtual operations while maximizing your ROI. Published: July 2009 For the latest information,

More information

Hyperconverged Infrastructure: Improve business value while decreasing TCO White Paper

Hyperconverged Infrastructure: Improve business value while decreasing TCO White Paper Hyperconverged Infrastructure: Improve business value while decreasing TCO White Paper Author: Benjamin Woo Neuralytix, Inc. Doc#: 194322 Published: [Publish Date] Last Update: Page 2 of 19 Neuraspective

More information

A Middleware Strategy to Survive Compute Peak Loads in Cloud

A Middleware Strategy to Survive Compute Peak Loads in Cloud A Middleware Strategy to Survive Compute Peak Loads in Cloud Sasko Ristov Ss. Cyril and Methodius University Faculty of Information Sciences and Computer Engineering Skopje, Macedonia Email: sashko.ristov@finki.ukim.mk

More information

Efficient Scheduling Of On-line Services in Cloud Computing Based on Task Migration

Efficient Scheduling Of On-line Services in Cloud Computing Based on Task Migration Efficient Scheduling Of On-line Services in Cloud Computing Based on Task Migration 1 Harish H G, 2 Dr. R Girisha 1 PG Student, 2 Professor, Department of CSE, PESCE Mandya (An Autonomous Institution under

More information

Cost-effective Strategies for Building the Next-generation Data Center

Cost-effective Strategies for Building the Next-generation Data Center White Paper Cost-effective Strategies for Building the Next-generation Data Center Custom-made servers bearing energy-efficient processors are key to today s cloud computing-inspired architectures. Tom

More information

Steps to Migrating to a Private Cloud

Steps to Migrating to a Private Cloud Deploying and Managing Private Clouds The Essentials Series Steps to Migrating to a Private Cloud sponsored by Introduction to Realtime Publishers by Don Jones, Series Editor For several years now, Realtime

More information

Kerrighed / XtreemOS cluster flavour

Kerrighed / XtreemOS cluster flavour Kerrighed / XtreemOS cluster flavour Jean Parpaillon Reisensburg Castle Günzburg, Germany July 5-9, 2010 July 6th, 2010 Kerrighed - XtreemOS cluster flavour 1 Summary Kerlabs Context Kerrighed Project

More information

Green HPC - Dynamic Power Management in HPC

Green HPC - Dynamic Power Management in HPC Gr eenhpc Dynami cpower Management i nhpc AT ECHNOL OGYWHI T EP APER Green HPC Dynamic Power Management in HPC 2 Green HPC - Dynamic Power Management in HPC Introduction... 3 Green Strategies... 4 Implementation...

More information

Scheduling and Resource Management in Computational Mini-Grids

Scheduling and Resource Management in Computational Mini-Grids Scheduling and Resource Management in Computational Mini-Grids July 1, 2002 Project Description The concept of grid computing is becoming a more and more important one in the high performance computing

More information

Keywords: Cloudsim, MIPS, Gridlet, Virtual machine, Data center, Simulation, SaaS, PaaS, IaaS, VM. Introduction

Keywords: Cloudsim, MIPS, Gridlet, Virtual machine, Data center, Simulation, SaaS, PaaS, IaaS, VM. Introduction Vol. 3 Issue 1, January-2014, pp: (1-5), Impact Factor: 1.252, Available online at: www.erpublications.com Performance evaluation of cloud application with constant data center configuration and variable

More information

can you effectively plan for the migration and management of systems and applications on Vblock Platforms?

can you effectively plan for the migration and management of systems and applications on Vblock Platforms? SOLUTION BRIEF CA Capacity Management and Reporting Suite for Vblock Platforms can you effectively plan for the migration and management of systems and applications on Vblock Platforms? agility made possible

More information

Load Balancing on a Grid Using Data Characteristics

Load Balancing on a Grid Using Data Characteristics Load Balancing on a Grid Using Data Characteristics Jonathan White and Dale R. Thompson Computer Science and Computer Engineering Department University of Arkansas Fayetteville, AR 72701, USA {jlw09, drt}@uark.edu

More information

Virtual Machine Monitors. Dr. Marc E. Fiuczynski Research Scholar Princeton University

Virtual Machine Monitors. Dr. Marc E. Fiuczynski Research Scholar Princeton University Virtual Machine Monitors Dr. Marc E. Fiuczynski Research Scholar Princeton University Introduction Have been around since 1960 s on mainframes used for multitasking Good example VM/370 Have resurfaced

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION 1 CHAPTER 1 INTRODUCTION 1.1 MOTIVATION OF RESEARCH Multicore processors have two or more execution cores (processors) implemented on a single chip having their own set of execution and architectural recourses.

More information

Unit two is about the components for cloud computing.

Unit two is about the components for cloud computing. Unit two is about the components for cloud computing. Copyright IBM Corporation 2012 1 Please study this units learning objectives. Copyright IBM Corporation 2015 2 The diagram illustrates the virtual

More information