Objective Criteria of Job Scheduling Problems. Uwe Schwiegelshohn, Robotics Research Lab, TU Dortmund University



Similar documents
Online Scheduling for Cloud Computing and Different Service Levels

Factors to Describe Job Shop Scheduling Problem

Efficient and Robust Allocation Algorithms in Clouds under Memory Constraints

Joint Optimization of Overlapping Phases in MapReduce

Parallel Scalable Algorithms- Performance Parameters

A Comparison of General Approaches to Multiprocessor Scheduling

Load Balancing on a Non-dedicated Heterogeneous Network of Workstations

Rackspace Cloud Databases and Container-based Virtualization

ENERGY EFFICIENT CONTROL OF VIRTUAL MACHINE CONSOLIDATION UNDER UNCERTAIN INPUT PARAMETERS FOR THE CLOUD

CSE 4351/5351 Notes 7: Task Scheduling & Load Balancing

Scheduling Shop Scheduling. Tim Nieberg

Complexity Theory. IE 661: Scheduling Theory Fall 2003 Satyaki Ghosh Dastidar

Green Cloud Computing 班 級 : 資 管 碩 一 組 員 : 黃 宗 緯 朱 雅 甜

Online Scheduling with Bounded Migration

Introduction to production scheduling. Industrial Management Group School of Engineering University of Seville

DECENTRALIZED LOAD BALANCING IN HETEROGENEOUS SYSTEMS USING DIFFUSION APPROACH

Control 2004, University of Bath, UK, September 2004

Classification - Examples

A Service Revenue-oriented Task Scheduling Model of Cloud Computing

THROUGHPUT OPTIMIZATION IN ROBOTIC CELLS

Small Maximal Independent Sets and Faster Exact Graph Coloring

Clustering & Visualization

Dimensioning an inbound call center using constraint programming

A Comparison of Dynamic Load Balancing Algorithms

Characterizing the Performance of Dynamic Distribution and Load-Balancing Techniques for Adaptive Grid Hierarchies

High-Mix Low-Volume Flow Shop Manufacturing System Scheduling

Introduction to Scheduling Theory

CPU SCHEDULING (CONT D) NESTED SCHEDULING FUNCTIONS

ESQUIVEL S.C., GATICA C. R., GALLARD R.H.

Aperiodic Task Scheduling

International Journal of Advance Research in Computer Science and Management Studies

Lecture Outline Overview of real-time scheduling algorithms Outline relative strengths, weaknesses

Cellular Computing on a Linux Cluster

Single machine models: Maximum Lateness -12- Approximation ratio for EDD for problem 1 r j,d j < 0 L max. structure of a schedule Q...

Load Balancing. Load Balancing 1 / 24

PARALLELS CLOUD STORAGE

CHAPTER 6 MAJOR RESULTS AND CONCLUSIONS

Load Balancing to Save Energy in Cloud Computing

Operatin g Systems: Internals and Design Principle s. Chapter 10 Multiprocessor and Real-Time Scheduling Seventh Edition By William Stallings

Assembly line balancing to minimize balancing loss and system loss. D. Roy 1 ; D. Khan 2

An Empirical Study and Analysis of the Dynamic Load Balancing Techniques Used in Parallel Computing Systems

Commonly Used Approaches to Real-Time Scheduling

CHAPTER 1 INTRODUCTION

Scheduling Single Machine Scheduling. Tim Nieberg

An Approach to Load Balancing In Cloud Computing

Load Balancing in cloud computing

TU e. Advanced Algorithms: experimentation project. The problem: load balancing with bounded look-ahead. Input: integer m 2: number of machines

A STUDY OF TASK SCHEDULING IN MULTIPROCESSOR ENVIROMENT Ranjit Rajak 1, C.P.Katti 2, Nidhi Rajak 3

Real-Time Scheduling (Part 1) (Working Draft) Real-Time System Example

How to assess the risk of a large portfolio? How to estimate a large covariance matrix?

The International Journal Of Science & Technoledge (ISSN X)

Parametric Analysis of Mobile Cloud Computing using Simulation Modeling

A SIMULATOR FOR LOAD BALANCING ANALYSIS IN DISTRIBUTED SYSTEMS

Multi-service Load Balancing in a Heterogeneous Network with Vertical Handover

Characterizing Task Usage Shapes in Google s Compute Clusters

Ronald Graham: Laying the Foundations of Online Optimization

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Expedia

An Efficient load balancing using Genetic algorithm in Hierarchical structured distributed system

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Resource Models: Batch Scheduling

CHAPTER 4: SOFTWARE PART OF RTOS, THE SCHEDULER

MIP-Based Approaches for Solving Scheduling Problems with Batch Processing Machines

Performance White Paper

Dynamic request management algorithms for Web-based services in Cloud computing

HYBRID GENETIC ALGORITHMS FOR SCHEDULING ADVERTISEMENTS ON A WEB PAGE

Efficient Parallel Processing on Public Cloud Servers Using Load Balancing

How To Compare Load Sharing And Job Scheduling In A Network Of Workstations

Chapter 1 Computer System Overview

Real-time scheduling algorithms, task visualization

Power Management in Cloud Computing using Green Algorithm. -Kushal Mehta COP 6087 University of Central Florida

ANALYSIS OF WORKFLOW SCHEDULING PROCESS USING ENHANCED SUPERIOR ELEMENT MULTITUDE OPTIMIZATION IN CLOUD

Routing in packet-switching networks

A* Algorithm Based Optimization for Cloud Storage

Chapter Load Balancing. Approximation Algorithms. Load Balancing. Load Balancing on 2 Machines. Load Balancing: Greedy Scheduling

Transcription:

Objective Criteria of Job Scheduling Problems Uwe Schwiegelshohn, Robotics Research Lab, TU Dortmund University 1

Jobs and Users in Job Scheduling Problems Independent users No or unknown precedence constraints between different jobs Online scheduling Jobs are unknown until they are submitted (r j,online -condition). Nonclairvoyant scheduling The processing time of a job is unknown until its completion (ncvcondition). Coarse granular scheduling Fine granular scheduling is responsibility of the user or the OS of the user (virtualization). A job is a single entity or consists of few stages. Resource (pre-)selection by the user A job requires more than one machine in parallel (size j -condition). 2

Machines in Job Scheduling Problems Cloud federation or computational grid (often GP m -model) Job allocation to a group of machines (central allocation or bids of different owners) A group of machines belongs to a single owner (often P m -model). System centric primary objective Secondary objectives may consider user interests (service level agreements). Heterogeneity within a group of machines is usually invisible to the user. Storage system, processors and cores. Consideration of the dominant resource (virtual machine) Virtually exclusive access to machines Machine sharing (fine grain preemption) is invisible to the user. 3

What is the Purpose of an Objective Criterion? Primary properties Appropriate representation of system goals Quantitative evaluation of the schedule Secondary characteristics Easy evaluation Little volatility important for online scheduling Robustness regarding different scenarios The overhead to determine general results and/or develop frameworks must pay off. Extensibility to include more complex criteria Inclusion of secondary user criteria 4

Analysis of Job Scheduling Problems Evaluation on a real system Real systems of sufficient size are rarely available for experiments. Simulation experiments Sampling of the solution space Selection of input data to generate a good cover of the solution space. Random data often do not represent real problems. Only few real workload data are available. Theoretical evaluation Stochastic scheduling Real workload cannot be modeled by simple distributions Competitive analysis 5

Competitive Analysis Worst case analysis Information about stability of the approach Possibly little indication about applicability in practice Similarity to approximation algorithms Determination of a competitive factor Methodology For all problem instances, we determine an upper bound for the ratio between the objective value of the schedule generated by the algorithm to the objective value of the optimal schedule for this instance. Example for makespan C max (S)<c C max (OPT) for all instances with c being the competitive factor 6

A Common Objective: Makespan Makespan corresponds to machine utilization. It is easy to determine the makespan of a schedule. Schedules with an optimal makespan may not be good schedules. It is difficult to incorporate secondary objectives. time The objective may be highly volatile machines in an online scenario. 7

Another Common Objective: Total Completion Time The total completion time objective considers all jobs. Little volatility ΣC j = 43 14 4 There may be different completion time results even in optimally utilized schedules. no idle machines 3 1 time Bias towards certain schedules machines Extension with job weights Who selects the weights? 8

Total Completion Time with Resource Weights w j =p j (sequential jobs) w j =p j size j (parallel jobs) Some known analysis results (see Queyranne and Kawaguchi and Kyan) Unbiased if the resource occupation remains unchanged vertically dividing a parallel job does not change the objective. horizontally dividing a long running job is invariant of the schedule. machines time same result as a single parallel job 9

Selection of a Reference Value Utilization must particularly address time periods of high demand (and their neighboring time periods). Makespan: Start time (time 0) of the schedule In an online scenario, the makespan is lower bounded by the last release date plus the corresponding processing time. Simple transformation of offline results into online results (see Shmoys et al.) Completion time (C j ) or flow time (C j -r j )? Same optimal schedule Significant differences in competitive factors (see Kellerer et al. and Becchetti and Leonardi) System representation: completion time with an appropriate start time (see makespan) User representation: flow time 10

P m r j,online, ncv * Comparison using the competitive factor for list scheduling Makespan (*= C max ) 2-1/m (tight bound, see Graham) Utilization until the actual time (*= occupied machine time / total machine time = actual time number of machines) 1.333 (tight bound, see Hussein et al.) Resource weight metric (*= p j C j ) 1.25 (small gap, 1.207 is a lower bound, see Kawaguchi and Kyan) 11

P m r j,online, ncv p j C j jobs released at time r The analysis helps to determine an appropriate machine overprovisioning in the system. Induction by the number of different release dates Use of the utilization result At most 25% of the resources in an interval are left idle by list scheduling and are used in the optimal schedule. r time idle machines machines jobs released before time r 12

P m size j,ncv * Makespan (*= C max ) 2-1/m (tight bound, see Graham) Utilization 1.333 until C max (OPT) 2k Resource weight metric (*= p j C j ) 2 (jobs are scheduled in decreasing degree of parallelism) 1 time machines 13

P m size j, ncv p j size j C j Vertical splitting of parallel jobs Horizontal splitting of some long jobs no reduction of the competitive factor Combination of the remaining long jobs neutral to the objective value t s (S ) t s (S) time Determination of the competitive factor by numerical optimization (2 variables) 14

P m r j,online, size j,ncv * Makespan (*= C max ) 2-1/m (tight bound, see Naroska et al.) Utilization until the actual time m Ω(m) 2m optimal schedule m time machines 15

P m r j,online, size j,ncv * Makespan (*= C max ) 2-1/m (tight bound, see Naroska et al.) Utilization until the actual time Ω(m) m 0.5 +1 optimal schedule Resource weight metric (*= p j C j ) 2 if size j <m/2 for all jobs (see Turek et al.) Ω(m 0.5 ) in the general case 3.562 with fine granular preemption (see Schwiegelshohn and Yahyapour) 1 machines 16 time

Challenges and Status Discussion of several metrics for online nonclairvoyant job scheduling problems Comparison of the metrics based on competitive analysis Testing of the metrics for real multiprocessor scheduling Real workloads Heuristic algorithms done mostly done mostly open Extension of the metrics Different job classes with additional weight factors Consideration of service level agreements open 17