Deadline and Cost based MapReduce Job Scheduling in Heterogeneous Cloud using Dynamic Pricing
|
|
|
- Darrell Melvin Mills
- 10 years ago
- Views:
Transcription
1 Journal homepage: INTERNATIONAL JOURNAL OF ADVANCED RESEARCH RESEARCH ARTICLE Deadline and Cost based MapReduce Job Scheduling in Heterogeneous Cloud using Dynamic Pricing Ms. Harsha Daryani 1, Mr. Sanjay B. Thakare 2 1. M. E. Second Year Student Department of Computer Engineering JSPM s Rajarshi Shahu College of Engineering, Tathawade Savitribai phule Pune University, India 2. Associate Professor Department of Computer Engineering JSPM s Rajarshi Shahu College of Engineering, Tathawade Savitribai phule Pune University, Pune Maharashtra, India Manuscript Info Manuscript History: Received: 15 July 2015 Final Accepted: 16 August 2015 Published Online: September 2015 Key words: Budget constraints; cloud service providers; dynamic pricing; heterogeneous cloud; task scheduling. *Corresponding Author Ms. Harsha Daryani Abstract Cloud Computing has emerged as an eminent technology in internet world. Job scheduling is one of the most challenging issues in the cloud computing area. This paper is centered on scheduling algorithms for MapReduce jobs with respect to deadline and cost constraints on a range of provisioned heterogeneous machines in the cloud. Each job is partitioned into multiple tasks. In addition, the time for executing a task on clouds resources and price for using that resource is determined. We propose a dynamic pricing algorithm for dynamically changing the price required for task execution. It helps the cloud service provider(csp) to recover its machine s cost and various other cost required for task execution based on the scenario when job is submitted. This algorithm will benefit CSP to maximizes its revenues. In addition to that, it provides inexpensive services to users as the cost of running jobs in the cloud is reduced by dynamic pricing. The Budget Distribution algorithm takes input the dynamic time-price table and distributes total budget among all the tasks, and remaining budget to slowest task to increase overall execution time. In this approach, we use Multiple Choice Knapsack (MCKS) deadline constrained algorithm that takes the dynamic pricing table as input and optimal cost efficient machine based on execution time required on that machine. An evaluation has been carried out which verifies the efficiency of proposed algorithm and also illustrates that the technique is cost-effective to both CSP and user. Copy Right, IJAR, 2015,. All rights reserved INTRODUCTION Cloud Computing is emerging as a new technique for large scale distributed computing driven by budget, in which a set of highly scalable, virtualized, heterogeneous, and configurable and reconfigurable computing resources can be rapidly acquired and released with minimum management effort. Users of cloud computing does not need to set up any kind of software and can acquire their data worldwide from any device (laptops, tablets, mobile phones, desktop PC) till an internet connection is available [10][11]. MapReduce has emerged as a parallel programming paradigm for processing data in large data centers. It can efficiently handle large compute cluster. Hadoop is an open source implementation of MapReduce parallel 56
2 programming framework and Google File System. It allows for distributed processing of large data sets across clusters of computers, each offering local computation and storage [13]. Pricing is an important factor for organizations providing services or products. The way price is set effects customer behavior, commitment to a provider, and the organizations success. Hence, building an appropriate pricing model will help attain higher revenues. Many pricing techniques are utilized [6][7]. For example, a typical pricing technique is to pay once for unlimited usage. On the other hand, this approach is inflexible and does not take into account many other factors that effects pricing, such as maintenance costs for resource, and price fairness. Many cloud computing providers (e.g., Google App Engine and Amazon Web Services) provides pay-peruse fixed pricing which charges end users as per their overall resource usage. Pay for resources is another approach, in which users are charged based on the storage or bandwidth size allowed. Subscription is another pricing approach, in which the user subscribes with a certain CSP for a static price per unit for long periods of time. Dynamic pricing is another technique, in which the price charged to user for resource consumption changes dynamically. It directly affects both a provider s revenue and price charged to customer for resource usage. The following are the factors that affect pricing in cloud environment: 1. Initial Costs: This is quantity of money that the service providers spend to purchase resources. 2. Cost of maintenance: This is the quantity of money that service providers spends to maintain cloud resources. 3. Demand: This is the demand from users for utilizing cloud resources. It is usually dynamic and unpredictable. 4. Supply: This indicates the available resources with cloud to fulfill demand. [5] The task-level scheduling is performed for two criteria: budget and deadline [8][9][13]. The goal of the algorithms, Budget Distribution algorithm and Multiple Choice Knapsack (MCKS) deadline constrained algorithm are: I. For a given budget B, how to effectively select a machine from the candidate set of machines for each task so that the total time for executing the workflow comes out to be minimum without breaking the budget. II. For a given deadline D, how to effectively select a machine from the candidate set of machines for each task so that the total cost for executing the workflow is comes out to be without breaking the deadline. The scheduler will ensure that all tasks have adequate budget to finish execution. If there is any remaining budget of workflow, then iteratively this budget is distributed to a task whose current execution time ascertains to be the slowest. The process continues until no budget is left. The heterogeneous nature of cloud is depicted by having distinct machines that may have distinct configuration or performance parameters, and thus may have distinct service charges. Motivation: The cloud resources are often provisioned on-demand with a billing model, cloud based application are generally budget driven. Hence, the efficient use of resources to meet relevant performance requirements within budget is always a practical concern for Cloud Service Providers. A Cloud Service Providers (CSP) typical aim is to maximize its revenues with its pricing scheme, although its user s main aim is to acquire required service obtainable for a reasonable price. Hence, satisfying both parties requires an optimal pricing technique. The price charged is one of the most significant metrics that a CSP can control to promote the usage of its services. Challenges: 1. Determining cost and deadline for each task. 2. Dynamically determining price charged to user. 3. Prices are charged per instance per time unit. This paper is further organized as follows: in Section II we mention emergence of job scheduling and related work in job scheduling recently developed. Section III presents implementation details in which problem definition, 57
3 problem formulation, system architecture, algorithms, mathematical model, modules are explained. We follow with the results in section IV and conclude the paper in section V. Related work Following outlines the scheduling mechanisms developed in recent years. Default FIFO Scheduler, uses a FIFO queue for functioning. After partitioning a job into multiple tasks, it is entered into a queue and assigned to TaskTracker nodes as they become available. Typically each job uses entire cluster, so jobs had to wait for their turn to arrive. Problem arises of sharing resources among users. Fair Scheduler [1] was developed by Facebook to handle access to their cluster. It overcomes the challenge of FIFO Scheduler by providing a fair share of the cluster capacity. In addition to that, users may load their jobs to pools, with each pool having a minimum number of TaskTracker slots. Thus cluster capacity is shared among jobs. Capacity Scheduler [2] was developed by Yahoo considers scenario of users where number of users is large. It performs fair allocation of computation resources among users rather than jobs. It does not consider resource availability. Mark Yong, Nitin Garegrat, Shiwali Mohan [3], proposed two mechanisms for resource aware scheduling. First, Dynamic Free Slot Advertisement computes the available TaskTracker nodes dynamically by using resource metrics retrieved from each node. Second, Free Slot Priorities/Filtering configures maximum number of computation slots for each node at configuration time. Thomas Sandholm and Kevin Lai [4], this paper supports capacity distribution between concurrent users considering priorities of the users. It proposed the idea of sharing TaskTracker slots proportionally per unit time. It does not handle deadline requirements. Xiaocheng Liu et al.[12], is centered on parallel processing capabilities of cloud. This work is intended to attain specific level of utilization of its nodes as well as responsiveness of parallel jobs. Priority based have been proposed with two VM- tiers (low CPU priority and high CPU priority) to improve responsiveness of jobs. Dzmitry Kliazovich1 et al.[13], this work emphases the role of communication fabric and presents a scheduling solution, named e-stab, which takes into account traffic requirements of cloud applications providing energy efficient job allocation and traffic load balancing in data center networks. Abhishek Verma, Ludmila Cherkasova, and Roy H. Camp- bell [14], goal of this paper is to minimize the completion time of a set of jobs. It also focuses on improving cluster utilization. It handles a set of production workload that consists of jobs with no dependencies. Z. Guo and G. Fox.[15], proposed resource stealing to utilize idle resources. The paper focuses improving resource utilization and decreasing job execution time. It also explains the concepts of MapReduce in detail. Yang Wang and Wei Shi [16], proposed task level scheduling algorithms with constraints of budget and deadline. The proposed mechanism uses static pricing which could not be fair to every user because every user does not have the same needs. Implementation Details A. Problem Definition We design a system for scheduling of task within budget and deadline constraints using dynamic pricing for determining the time required for execution of task on the best machine from candidate set and price for using that machine. The scheduling algorithm for task level with regard to budget and deadline constraints for a batch of jobs on a set of provisioned heterogeneous machines in cloud environment where price remains unaltered after it has been determined (static pricing). It introduces the problem of being impartial to users and with service providers. As the service providers might reserve the resources for longer time than the customers has utilized. On the user side, they may overpay for the resources reserved if they does not use them extensively. To address this problem, we are proposing a mechanism for dynamically changing the price and time. Time parameter indicates the time to run a task on a particular machine within the given deadline of the heterogeneous cloud. Price indicates the cost of using that machine which should be within allotted budget. According to the current scenario of machines in the cloud, task level scheduling will be performed. Problem Formulation A batch of jobs is modeled as a multi-stage fork and join workflow which consists of x stages, each stage s will be having a collection of independent tasks, denoted as J s = {J 0, J 1, J 2, J 3,, J n }, where n+1 is the size of stage 58
4 s. In the cloud, each task can be related with a set of machines. For task J i, 0 s x, 0 i n the time to run task on a machine and its corresponding prices are shown in Time-Price Matrix where t m, 1 m r gives time to run a task J i on machine M m where p m gives the corresponding price for using that machine, and r is the total number of machines that can execute task J i. We assume that the prices have been sorted in decreasing order and times in increasing order. TableI indicates the conventions used. For each task, the time for scheduling it is given by TaskTime and price for using virtual machine is given by TaskPrice. TaskPrice = Total Cost / ( 100 * Task Size ) (1) TaskTime = Processing Speed / Task Size (2) Time-Price Matrix for task J i t 1 t 2 t r p 1 p 2 p r TABLE I. Memorization Parameter Table Notation x J s J i n W t m p m r B i B s B w D s D w T i (B i ) M m Meaning Number of stages Task set in stage s Task i in stage J Number of tasks in stage s Total number of tasks in the workflow Time to run task J i on machine M m Cost for using machine M m Total number of machines that can run JBudget i used by task J i Budget for stage s Total budget for workflow Deadline for stage s Total deadline for workflow Shortest time to finish J i for given B i Assigned machine Budget Constraints Given budget B i for task J i, shortest time to finish it is denoted as T i (B i ), defined as : T i (B i ) = t m, p m+1 < B i < p m-1 (3) The time to finish a stage s with budget B s denoted as T s (B s ) and is defined as time taken to finish last task in the stage within given budget: T s (B s ) = max i ϵ [0,n] T i (B i ) (4) Since workflow is fork and join, a stage cannot start until its previous stage had finished. Hence, the total makespan within budget B w to finish the workflow is defined as the sum of all stages times. We need to minimize this time within budget B w : T w (B w ) = min s ϵ [0,x] T i (B i ) (5) Deadline Constraints The minimum cost to finish stage s, given deadline D j denoted as C s (D s ): 59
5 C s (D s ) = i ϵ [0,n] C i (D s ) (6) where C i (D s ) is the minimum cost to finish task J i in stage s within such that t 1 D s t r The total cost for the workflow within deadline D w : C w (D w ) = min s ϵ [0,x] C s (D s ) (7) System Architecture Fig. 1 shows the system architecture. Users access the cloud through web browser and submit their jobs into cloud. Cloud administrator estimates the time and price required for task execution and dynamically maintain it. Budget and deadline distribution solver finds appropriate node for executing task. If the task can be completed within budget or deadline it is scheduled, else it is not. It estimates the scheduling length and cost for complete workflow. Algorithms Fig. 1. System Architecture Assumption: Communication between map/reduce tasks is manipulated by the MapReduce framework via the underlying network file systems. System Constraints: For time-price table, time should be sorted in ascending order whereas price should be sorted in descending order. Algorithm 1: Budget Distribution Algorithm Input: JobSet; total Budget; initial execution time, budget distribution, assigned machine Output: Minimum time for executing workflow 1: Maintain idle list of machines 2: for task stage s do 3: Find best machine from candidate machines which is idle. 4: end for 5: Calculate remaining budget for workflow 6: Initialize profile variables (T i, B i, M m ) 7: Identify slowest and second slowest task 60
6 8: Assign these pairs to a set L 9: Determine which task in L should be allocated budget Algorithm 2: MCKS Deadline Constrained Algorithm Input: JobSet; time-price table Output: Minimum cost for executing workflow 1: for stage s workflow w do 2: Construct class s consists of tuple (D si, C si ) 1 i r ϵ [1,n] p r 3: end for 4: for each task J i ϵ stage s do 5: Assign execution time on the candidate machine into a Set S 6: end for 7: Sort S in increasing order 8: for each element t i ϵ S do 9: D si t i 10: end for 11: Compute C si (D si ) for each task 12: Choose exactly one tuple from each class to minimize the total cost. Algorithm 3: Dynamic Pricing Algorithm Input: Job, MachineSet(Initial_Cost, Maintenance_Cost, Power_Cost, Processing_Speed), Task_Size t p = Time for planning and allocating machine Output: PriceSet and TimeSet for task 1: Estimate price-demand function Z(m, tp, w) 2: if (Capacity > Z) then 3: TaskPrice = TaskPrice - ( Capacity / Z ) 4: Allot capacity to incoming demand 5: else 6: Select max p Pw,m p where p = TaskPrice, P w,m = PriceSet for tasks which are demanding resource 7: TaskPrice = TaskPrice + ( CapacityLeft / Z ) 8: end if 9: Estimate revenue of CSP = max p Pw,m p. Z(m, tp, w) 10: Estimate user gain. Modules The system is divided into multiple sub-systems which performs its functions as mentioned below. The first module of the system is User side. User submits their jobs to cloud environment. The complexity of cloud system is abstracted from user. The second module is dynamic pricing module which makes the time-price table dynamic by updating its values considering different parameters of the machine on which the task can be scheduled. The third module of the system is Budget and Deadline Distribution Solver (BDS). The BDS takes as an input a batch of jobs that are arranged as a multi-stage fork and join workflow by the scheduler at execution time. Every task of the job is associated with a time-price table As a result; the BDS can be configured with various parameters, including time-price tables, the job set and the total number of jobs in the workflow. The goal is to make the total scheduling length of the workflow minimum without breaking the budget and the total monetary cost of the 61
7 workflow minimum without missing the deadline. The result of these sub-systems is aggregated to obtain the final result i.e. total makespan and total cost. Experimental Results The system is being developed using eclipse with MapReduce for Hadoop operations. PHP/MySQL with Apache for load injection in the cloud. The system does not require any specific hardware to run. The experiments have been conducted on machine having following configurations: Intel i5 quadcore processor with 2.67GHz CPU, 4GB RAM. We have taken google4me cloud dataset consisting of a Job dataset, Machine Set. Job Set is obtained from user which is split into Task Set in the cloud. Task Set is maintained by cloud administrator which contains multiple attributes. Machine Set contains various details about machines in the cloud such as costs, configuration etc. TABLE II : Sample DataSet for jobs and tasks entered into the cloud. Job ID Job Name Task ID Task Name Task Size (in kb) Time Static Price 1 Read Files 1 Read PDF Read TXT Url Search Text Browse Web Search Images Dynamic Price Fig 3: Comparison between Budget-driven, Deadline-driven and Dynamic Pricing Scheduling technique in terms of Cost. 62
8 Fig 4: Scheduling Length of the algorithms vs. No. of tasks Figure 3 shows the comparison between state-of-the-art algorithm and proposed algorithm. The plots are the scheduling cost verses no. of tasks. This graph supports the claim that dynamic pricing algorithm, the proposed algorithm provides gains to user as well as CSP in terms of cost. Figure 4 shows the scheduling length for various algorithms. Scheduling Length gives the total time required for scheduling the task. This time is the estimated minimum time. The plots are the scheduling length verses the no. of tasks. The blue line is Budget-driven algorithm and the red line is Deadline-driven algorithm and green line is Dynamic Pricing algorithm. The scheduling results obtained by the deadline and cost constrained scheduling technique, describes the relation between the tasks and their scheduling length and scheduling cost. This results shows that dynamic pricing provides the real time scenario of market conditions. Lower price is set to attract customers, whereas higher price is set to attract more revenue The success definition of this scheduling technique lies in dynamically controlling the unpredictable demand of customers. Conclusion and future scope Large number of jobs arrive into the cloud, to handle these jobs, scheduling provides a smarter way of placing these jobs on cloud resources. In this paper, we came up with a novel approach for dynamically changing the price charged for user s job execution which is beneficial to both user and CSP, after that applying cost and deadline constrained scheduling. Microsoft Azure views dynamic pricing as a feasible option that will be highly adopted in addition to static pricing. Google AdWorks and Facebook Ads uses bidding/auctions strategy which is efficient but doesnot considers market conditions (demand and supply). This implicates the necessity of dynamic pricing. The advanced features in the scheduling (speculative task scheduling, redundant computing for fault tolerance) with respect to the budget constraints can be considered in future work. Enforcing the full system in a real cloud computing environment (Eg. Amazon) can also be tackled as future work. The market competition between clouds can also be taken as future enhancement. 63
9 References [1] Hadoop s Fair Scheduler, [2] Hadoop s Capacity Scheduler, [3] Mark Yong, Nitin Garegrat, Shiwali Mohan: Towards a Resource Aware Scheduler in Hadoop in Proc. ICWS, 2009, pp: [4] Thomas Sandholm and Kevin Lai, Dynamic Proportional Share Scheduling in Hadoop, E. Frachtenberg and U. Schwiegelshohn (Eds.): JSSPP 2010, LNCS 6253, pp , c_springer-verlag Berlin Heidelberg [5] Arun Anandasivam, Marc Premm, BID PRICE CONTROL AND DYNAMIC PRICING IN CLOUDS, Association for Information Systems AIS Electronic Library (AISeL) ECIS 2009 Proceedings European Conference on Information Systems (ECIS). [6] Saravanan. K and Sri Vigna Hema. V, Dynamic Pricing Model for a Cloud Cache Environment, International Journal of Engineering Trends and Technology (IJETT) - Volume4Issue4- April [7] May Al-Roomi, Shaikha Al-Ebrahim, Sabika Buqrais and Imtiaz Ahmad, Cloud Computing Pricing Models: A Survey, International Journal of Grid and Distributed Computing Vol.6, No.5 (2013), pp [8] J. Yu and R. Buyya, Scheduling scientific workflow applications with deadline and budget constraints using genetic algorithms, Sci. Program., vol. 14, no. 3-4, pp , Dec [9] Dr. Amit Agarwal, Saloni Jain, Efficient Optimal Algorithm of Task Scheduling in Cloud Computing Environment, International Journal of Computer Trends and Technology (IJCTT) volume 9 number 7 Mar [10] Anthony T. Velte, Toby J. Velte, Ph.D., Robert Elsenpeter, Cloud Computing: A Practical Approach, 2010 The McGraw-Hill Companies. [11] Peter Mell, Timothy Grance,The NIST Definition of Cloud Computing, National Institute of Standard and Technology U.S. Dept. of Commerce. [12] Xiaocheng Liu, Chen Wang, Bing Bing Zhou, Junliang Chen, Ting Yang, and Albert Y. Zomaya, Priority-Based Consolidation of Parallel Workloads in the Cloud, IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 24, NO. 9, SEPTEMBER [13] Dzmitry Kliazovich, Sisay T. Arzo, Fabrizio Granelli, Pascal Bouvry and Samee Ullah Khan, e-stab: Energy-Efficient Scheduling for Cloud Computing Applications with Traffic Load Balancing, 2013 IEEE International Conference on Green Computing and Communi- cations and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing. [14] Abhishek Verma, Ludmila Cherkasova, and Roy H. Campbell, Orchestrating an Ensemble of MapReduce Jobs for Minimizing Their Makespan, IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, VOL. 10, NO. 5, SEPTEMBER/OCTOBER [15] Yang Wang and Wei Shi, Budget-Driven Scheduling Algorithms for Batches of MapReduce Jobs in Heterogeneous Clouds, IEEE TRANSACTIONS ON CLOUD COMPUTING, VOL. 2, NO. 3, JULY- SEPTEMBER
SCHEDULING IN CLOUD COMPUTING
SCHEDULING IN CLOUD COMPUTING Lipsa Tripathy, Rasmi Ranjan Patra CSA,CPGS,OUAT,Bhubaneswar,Odisha Abstract Cloud computing is an emerging technology. It process huge amount of data so scheduling mechanism
Survey on Scheduling Algorithm in MapReduce Framework
Survey on Scheduling Algorithm in MapReduce Framework Pravin P. Nimbalkar 1, Devendra P.Gadekar 2 1,2 Department of Computer Engineering, JSPM s Imperial College of Engineering and Research, Pune, India
IMPROVED FAIR SCHEDULING ALGORITHM FOR TASKTRACKER IN HADOOP MAP-REDUCE
IMPROVED FAIR SCHEDULING ALGORITHM FOR TASKTRACKER IN HADOOP MAP-REDUCE Mr. Santhosh S 1, Mr. Hemanth Kumar G 2 1 PG Scholor, 2 Asst. Professor, Dept. Of Computer Science & Engg, NMAMIT, (India) ABSTRACT
A SURVEY ON MAPREDUCE IN CLOUD COMPUTING
A SURVEY ON MAPREDUCE IN CLOUD COMPUTING Dr.M.Newlin Rajkumar 1, S.Balachandar 2, Dr.V.Venkatesakumar 3, T.Mahadevan 4 1 Asst. Prof, Dept. of CSE,Anna University Regional Centre, Coimbatore, [email protected]
Task Scheduling in Hadoop
Task Scheduling in Hadoop Sagar Mamdapure Munira Ginwala Neha Papat SAE,Kondhwa SAE,Kondhwa SAE,Kondhwa Abstract Hadoop is widely used for storing large datasets and processing them efficiently under distributed
A REAL TIME MEMORY SLOT UTILIZATION DESIGN FOR MAPREDUCE MEMORY CLUSTERS
A REAL TIME MEMORY SLOT UTILIZATION DESIGN FOR MAPREDUCE MEMORY CLUSTERS Suma R 1, Vinay T R 2, Byre Gowda B K 3 1 Post graduate Student, CSE, SVCE, Bangalore 2 Assistant Professor, CSE, SVCE, Bangalore
Hadoop Scheduler w i t h Deadline Constraint
Hadoop Scheduler w i t h Deadline Constraint Geetha J 1, N UdayBhaskar 2, P ChennaReddy 3,Neha Sniha 4 1,4 Department of Computer Science and Engineering, M S Ramaiah Institute of Technology, Bangalore,
Reallocation and Allocation of Virtual Machines in Cloud Computing Manan D. Shah a, *, Harshad B. Prajapati b
Proceedings of International Conference on Emerging Research in Computing, Information, Communication and Applications (ERCICA-14) Reallocation and Allocation of Virtual Machines in Cloud Computing Manan
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE Sudha M 1, Harish G M 2, Nandan A 3, Usha J 4 1 Department of MCA, R V College of Engineering, Bangalore : 560059, India [email protected] 2 Department
AN EFFICIENT LOAD BALANCING ALGORITHM FOR CLOUD ENVIRONMENT
AN EFFICIENT LOAD BALANCING ALGORITHM FOR CLOUD ENVIRONMENT V.Bharath 1, D. Vijayakumar 2, R. Sabarimuthukumar 3 1,2,3 Department of CSE PG, National Engineering College Kovilpatti, Tamilnadu, (India)
Profit-driven Cloud Service Request Scheduling Under SLA Constraints
Journal of Information & Computational Science 9: 14 (2012) 4065 4073 Available at http://www.joics.com Profit-driven Cloud Service Request Scheduling Under SLA Constraints Zhipiao Liu, Qibo Sun, Shangguang
Efficient Data Replication Scheme based on Hadoop Distributed File System
, pp. 177-186 http://dx.doi.org/10.14257/ijseia.2015.9.12.16 Efficient Data Replication Scheme based on Hadoop Distributed File System Jungha Lee 1, Jaehwa Chung 2 and Daewon Lee 3* 1 Division of Supercomputing,
Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad
Cloud Computing: Computing as a Service Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad Abstract: Computing as a utility. is a dream that dates from the beginning from the computer
Optimal Service Pricing for a Cloud Cache
Optimal Service Pricing for a Cloud Cache K.SRAVANTHI Department of Computer Science & Engineering (M.Tech.) Sindura College of Engineering and Technology Ramagundam,Telangana G.LAKSHMI Asst. Professor,
Dynamic Load Balancing of Virtual Machines using QEMU-KVM
Dynamic Load Balancing of Virtual Machines using QEMU-KVM Akshay Chandak Krishnakant Jaju Technology, College of Engineering, Pune. Maharashtra, India. Akshay Kanfade Pushkar Lohiya Technology, College
VIRTUAL RESOURCE MANAGEMENT FOR DATA INTENSIVE APPLICATIONS IN CLOUD INFRASTRUCTURES
U.P.B. Sci. Bull., Series C, Vol. 76, Iss. 2, 2014 ISSN 2286-3540 VIRTUAL RESOURCE MANAGEMENT FOR DATA INTENSIVE APPLICATIONS IN CLOUD INFRASTRUCTURES Elena Apostol 1, Valentin Cristea 2 Cloud computing
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, [email protected] Assistant Professor, Information
PERFORMANCE ANALYSIS OF PaaS CLOUD COMPUTING SYSTEM
PERFORMANCE ANALYSIS OF PaaS CLOUD COMPUTING SYSTEM Akmal Basha 1 Krishna Sagar 2 1 PG Student,Department of Computer Science and Engineering, Madanapalle Institute of Technology & Science, India. 2 Associate
Energy Constrained Resource Scheduling for Cloud Environment
Energy Constrained Resource Scheduling for Cloud Environment 1 R.Selvi, 2 S.Russia, 3 V.K.Anitha 1 2 nd Year M.E.(Software Engineering), 2 Assistant Professor Department of IT KSR Institute for Engineering
Big Data Analysis and Its Scheduling Policy Hadoop
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 1, Ver. IV (Jan Feb. 2015), PP 36-40 www.iosrjournals.org Big Data Analysis and Its Scheduling Policy
Efficient Scheduling Of On-line Services in Cloud Computing Based on Task Migration
Efficient Scheduling Of On-line Services in Cloud Computing Based on Task Migration 1 Harish H G, 2 Dr. R Girisha 1 PG Student, 2 Professor, Department of CSE, PESCE Mandya (An Autonomous Institution under
Towards a Resource Aware Scheduler in Hadoop
Towards a Resource Aware Scheduler in Hadoop Mark Yong, Nitin Garegrat, Shiwali Mohan Computer Science and Engineering, University of Michigan, Ann Arbor December 21, 2009 Abstract Hadoop-MapReduce is
Efficient Service Broker Policy For Large-Scale Cloud Environments
www.ijcsi.org 85 Efficient Service Broker Policy For Large-Scale Cloud Environments Mohammed Radi Computer Science Department, Faculty of Applied Science Alaqsa University, Gaza Palestine Abstract Algorithms,
Exploiting Cloud Heterogeneity for Optimized Cost/Performance MapReduce Processing
Exploiting Cloud Heterogeneity for Optimized Cost/Performance MapReduce Processing Zhuoyao Zhang University of Pennsylvania, USA [email protected] Ludmila Cherkasova Hewlett-Packard Labs, USA [email protected]
Chapter 7. Using Hadoop Cluster and MapReduce
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
Enhancing MapReduce Functionality for Optimizing Workloads on Data Centers
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 10, October 2013,
Resource Allocation Avoiding SLA Violations in Cloud Framework for SaaS
Resource Allocation Avoiding SLA Violations in Cloud Framework for SaaS Shantanu Sasane Abhilash Bari Kaustubh Memane Aniket Pathak Prof. A. A.Deshmukh University of Pune University of Pune University
Keywords Distributed Computing, On Demand Resources, Cloud Computing, Virtualization, Server Consolidation, Load Balancing
Volume 5, Issue 1, January 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Survey on Load
Affinity Aware VM Colocation Mechanism for Cloud
Affinity Aware VM Colocation Mechanism for Cloud Nilesh Pachorkar 1* and Rajesh Ingle 2 Received: 24-December-2014; Revised: 12-January-2015; Accepted: 12-January-2015 2014 ACCENTS Abstract The most of
MINIMIZING STORAGE COST IN CLOUD COMPUTING ENVIRONMENT
MINIMIZING STORAGE COST IN CLOUD COMPUTING ENVIRONMENT 1 SARIKA K B, 2 S SUBASREE 1 Department of Computer Science, Nehru College of Engineering and Research Centre, Thrissur, Kerala 2 Professor and Head,
The Comprehensive Performance Rating for Hadoop Clusters on Cloud Computing Platform
The Comprehensive Performance Rating for Hadoop Clusters on Cloud Computing Platform Fong-Hao Liu, Ya-Ruei Liou, Hsiang-Fu Lo, Ko-Chin Chang, and Wei-Tsong Lee Abstract Virtualization platform solutions
CURTAIL THE EXPENDITURE OF BIG DATA PROCESSING USING MIXED INTEGER NON-LINEAR PROGRAMMING
Journal homepage: http://www.journalijar.com INTERNATIONAL JOURNAL OF ADVANCED RESEARCH RESEARCH ARTICLE CURTAIL THE EXPENDITURE OF BIG DATA PROCESSING USING MIXED INTEGER NON-LINEAR PROGRAMMING R.Kohila
An Energy Aware Cloud Load Balancing Technique using Dynamic Placement of Virtualized Resources
pp 81 86 Krishi Sanskriti Publications http://www.krishisanskriti.org/acsit.html An Energy Aware Cloud Load Balancing Technique using Dynamic Placement of Virtualized Resources Sumita Bose 1, Jitender
A Novel Approach for Efficient Load Balancing in Cloud Computing Environment by Using Partitioning
A Novel Approach for Efficient Load Balancing in Cloud Computing Environment by Using Partitioning 1 P. Vijay Kumar, 2 R. Suresh 1 M.Tech 2 nd Year, Department of CSE, CREC Tirupati, AP, India 2 Professor
Research on Job Scheduling Algorithm in Hadoop
Journal of Computational Information Systems 7: 6 () 5769-5775 Available at http://www.jofcis.com Research on Job Scheduling Algorithm in Hadoop Yang XIA, Lei WANG, Qiang ZHAO, Gongxuan ZHANG School of
LOAD BALANCING STRATEGY BASED ON CLOUD PARTITIONING CONCEPT
Journal homepage: www.mjret.in ISSN:2348-6953 LOAD BALANCING STRATEGY BASED ON CLOUD PARTITIONING CONCEPT Ms. Shilpa D.More 1, Prof. Arti Mohanpurkar 2 1,2 Department of computer Engineering DYPSOET, Pune,India
Scheduling Algorithms in MapReduce Distributed Mind
Scheduling Algorithms in MapReduce Distributed Mind Karthik Kotian, Jason A Smith, Ye Zhang Schedule Overview of topic (review) Hypothesis Research paper 1 Research paper 2 Research paper 3 Project software
A Comparative Study of Load Balancing Algorithms in Cloud Computing
A Comparative Study of Load Balancing Algorithms in Cloud Computing Reena Panwar M.Tech CSE Scholar Department of CSE, Galgotias College of Engineering and Technology, Greater Noida, India Bhawna Mallick,
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop Kanchan A. Khedikar Department of Computer Science & Engineering Walchand Institute of Technoloy, Solapur, Maharashtra,
ADAPTIVE LOAD BALANCING ALGORITHM USING MODIFIED RESOURCE ALLOCATION STRATEGIES ON INFRASTRUCTURE AS A SERVICE CLOUD SYSTEMS
ADAPTIVE LOAD BALANCING ALGORITHM USING MODIFIED RESOURCE ALLOCATION STRATEGIES ON INFRASTRUCTURE AS A SERVICE CLOUD SYSTEMS Lavanya M., Sahana V., Swathi Rekha K. and Vaithiyanathan V. School of Computing,
CSE-E5430 Scalable Cloud Computing Lecture 2
CSE-E5430 Scalable Cloud Computing Lecture 2 Keijo Heljanko Department of Computer Science School of Science Aalto University [email protected] 14.9-2015 1/36 Google MapReduce A scalable batch processing
Computing Load Aware and Long-View Load Balancing for Cluster Storage Systems
215 IEEE International Conference on Big Data (Big Data) Computing Load Aware and Long-View Load Balancing for Cluster Storage Systems Guoxin Liu and Haiying Shen and Haoyu Wang Department of Electrical
Finding Insights & Hadoop Cluster Performance Analysis over Census Dataset Using Big-Data Analytics
Finding Insights & Hadoop Cluster Performance Analysis over Census Dataset Using Big-Data Analytics Dharmendra Agawane 1, Rohit Pawar 2, Pavankumar Purohit 3, Gangadhar Agre 4 Guide: Prof. P B Jawade 2
Sla Aware Load Balancing Algorithm Using Join-Idle Queue for Virtual Machines in Cloud Computing
Sla Aware Load Balancing Using Join-Idle Queue for Virtual Machines in Cloud Computing Mehak Choudhary M.Tech Student [CSE], Dept. of CSE, SKIET, Kurukshetra University, Haryana, India ABSTRACT: Cloud
Figure 1. The cloud scales: Amazon EC2 growth [2].
- Chung-Cheng Li and Kuochen Wang Department of Computer Science National Chiao Tung University Hsinchu, Taiwan 300 [email protected], [email protected] Abstract One of the most important issues
Scalable Cloud Computing Solutions for Next Generation Sequencing Data
Scalable Cloud Computing Solutions for Next Generation Sequencing Data Matti Niemenmaa 1, Aleksi Kallio 2, André Schumacher 1, Petri Klemelä 2, Eija Korpelainen 2, and Keijo Heljanko 1 1 Department of
Apache Hadoop. Alexandru Costan
1 Apache Hadoop Alexandru Costan Big Data Landscape No one-size-fits-all solution: SQL, NoSQL, MapReduce, No standard, except Hadoop 2 Outline What is Hadoop? Who uses it? Architecture HDFS MapReduce Open
A PERFORMANCE ANALYSIS of HADOOP CLUSTERS in OPENSTACK CLOUD and in REAL SYSTEM
A PERFORMANCE ANALYSIS of HADOOP CLUSTERS in OPENSTACK CLOUD and in REAL SYSTEM Ramesh Maharjan and Manoj Shakya Department of Computer Science and Engineering Dhulikhel, Kavre, Nepal [email protected],
Telecom Data processing and analysis based on Hadoop
COMPUTER MODELLING & NEW TECHNOLOGIES 214 18(12B) 658-664 Abstract Telecom Data processing and analysis based on Hadoop Guofan Lu, Qingnian Zhang *, Zhao Chen Wuhan University of Technology, Wuhan 4363,China
High Throughput Computing on P2P Networks. Carlos Pérez Miguel [email protected]
High Throughput Computing on P2P Networks Carlos Pérez Miguel [email protected] Overview High Throughput Computing Motivation All things distributed: Peer-to-peer Non structured overlays Structured
How To Understand Cloud Computing
Overview of Cloud Computing (ENCS 691K Chapter 1) Roch Glitho, PhD Associate Professor and Canada Research Chair My URL - http://users.encs.concordia.ca/~glitho/ Overview of Cloud Computing Towards a definition
A Survey on Load Balancing Algorithms in Cloud Environment
A Survey on Load s in Cloud Environment M.Aruna Assistant Professor (Sr.G)/CSE Erode Sengunthar Engineering College, Thudupathi, Erode, India D.Bhanu, Ph.D Associate Professor Sri Krishna College of Engineering
Big Data and Apache Hadoop s MapReduce
Big Data and Apache Hadoop s MapReduce Michael Hahsler Computer Science and Engineering Southern Methodist University January 23, 2012 Michael Hahsler (SMU/CSE) Hadoop/MapReduce January 23, 2012 1 / 23
Energy Efficient MapReduce
Energy Efficient MapReduce Motivation: Energy consumption is an important aspect of datacenters efficiency, the total power consumption in the united states has doubled from 2000 to 2005, representing
Payment minimization and Error-tolerant Resource Allocation for Cloud System Using equally spread current execution load
Payment minimization and Error-tolerant Resource Allocation for Cloud System Using equally spread current execution load Pooja.B. Jewargi Prof. Jyoti.Patil Department of computer science and engineering,
A Proposed Framework for Ranking and Reservation of Cloud Services Based on Quality of Service
II,III A Proposed Framework for Ranking and Reservation of Cloud Services Based on Quality of Service I Samir.m.zaid, II Hazem.m.elbakry, III Islam.m.abdelhady I Dept. of Geology, Faculty of Sciences,
Task Placement in a Cloud with Case-based Reasoning
Task Placement in a Cloud with Case-based Reasoning Eric Schulte-Zurhausen and Mirjam Minor Institute of Informatik, Goethe University, Robert-Mayer-Str.10, Frankfurt am Main, Germany {eschulte, minor}@informatik.uni-frankfurt.de
HYBRID ACO-IWD OPTIMIZATION ALGORITHM FOR MINIMIZING WEIGHTED FLOWTIME IN CLOUD-BASED PARAMETER SWEEP EXPERIMENTS
HYBRID ACO-IWD OPTIMIZATION ALGORITHM FOR MINIMIZING WEIGHTED FLOWTIME IN CLOUD-BASED PARAMETER SWEEP EXPERIMENTS R. Angel Preethima 1, Margret Johnson 2 1 Student, Computer Science and Engineering, Karunya
An Experimental Approach Towards Big Data for Analyzing Memory Utilization on a Hadoop cluster using HDFS and MapReduce.
An Experimental Approach Towards Big Data for Analyzing Memory Utilization on a Hadoop cluster using HDFS and MapReduce. Amrit Pal Stdt, Dept of Computer Engineering and Application, National Institute
SCORE BASED DEADLINE CONSTRAINED WORKFLOW SCHEDULING ALGORITHM FOR CLOUD SYSTEMS
SCORE BASED DEADLINE CONSTRAINED WORKFLOW SCHEDULING ALGORITHM FOR CLOUD SYSTEMS Ranjit Singh and Sarbjeet Singh Computer Science and Engineering, Panjab University, Chandigarh, India ABSTRACT Cloud Computing
An improved task assignment scheme for Hadoop running in the clouds
Dai and Bassiouni Journal of Cloud Computing: Advances, Systems and Applications 2013, 2:23 RESEARCH An improved task assignment scheme for Hadoop running in the clouds Wei Dai * and Mostafa Bassiouni
Introduction to Apache YARN Schedulers & Queues
Introduction to Apache YARN Schedulers & Queues In a nutshell, YARN was designed to address the many limitations (performance/scalability) embedded into Hadoop version 1 (MapReduce & HDFS). Some of the
How To Understand Cloud Computing
Cloud Computing: a Perspective Study Lizhe WANG, Gregor von LASZEWSKI, Younge ANDREW, Xi HE Service Oriented Cyberinfrastruture Lab, Rochester Inst. of Tech. Abstract The Cloud computing emerges as a new
marlabs driving digital agility WHITEPAPER Big Data and Hadoop
marlabs driving digital agility WHITEPAPER Big Data and Hadoop Abstract This paper explains the significance of Hadoop, an emerging yet rapidly growing technology. The prime goal of this paper is to unveil
IMCM: A Flexible Fine-Grained Adaptive Framework for Parallel Mobile Hybrid Cloud Applications
Open System Laboratory of University of Illinois at Urbana Champaign presents: Outline: IMCM: A Flexible Fine-Grained Adaptive Framework for Parallel Mobile Hybrid Cloud Applications A Fine-Grained Adaptive
Keywords: PDAs, VM. 2015, IJARCSSE All Rights Reserved Page 365
Volume 5, Issue 7, July 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Energy Adaptive
Fair Scheduling Algorithm with Dynamic Load Balancing Using In Grid Computing
Research Inventy: International Journal Of Engineering And Science Vol.2, Issue 10 (April 2013), Pp 53-57 Issn(e): 2278-4721, Issn(p):2319-6483, Www.Researchinventy.Com Fair Scheduling Algorithm with Dynamic
Real Time Network Server Monitoring using Smartphone with Dynamic Load Balancing
www.ijcsi.org 227 Real Time Network Server Monitoring using Smartphone with Dynamic Load Balancing Dhuha Basheer Abdullah 1, Zeena Abdulgafar Thanoon 2, 1 Computer Science Department, Mosul University,
Secured Storage of Outsourced Data in Cloud Computing
Secured Storage of Outsourced Data in Cloud Computing Chiranjeevi Kasukurthy 1, Ch. Ramesh Kumar 2 1 M.Tech(CSE), Nalanda Institute of Engineering & Technology,Siddharth Nagar, Sattenapalli, Guntur Affiliated
CDBMS Physical Layer issue: Load Balancing
CDBMS Physical Layer issue: Load Balancing Shweta Mongia CSE, School of Engineering G D Goenka University, Sohna [email protected] Shipra Kataria CSE, School of Engineering G D Goenka University,
A Comparative Performance Analysis of Load Balancing Algorithms in Distributed System using Qualitative Parameters
A Comparative Performance Analysis of Load Balancing Algorithms in Distributed System using Qualitative Parameters Abhijit A. Rajguru, S.S. Apte Abstract - A distributed system can be viewed as a collection
R.K.Uskenbayeva 1, А.А. Kuandykov 2, Zh.B.Kalpeyeva 3, D.K.Kozhamzharova 4, N.K.Mukhazhanov 5
Distributed data processing in heterogeneous cloud environments R.K.Uskenbayeva 1, А.А. Kuandykov 2, Zh.B.Kalpeyeva 3, D.K.Kozhamzharova 4, N.K.Mukhazhanov 5 1 [email protected], 2 [email protected],
Introduction to Cloud Computing
Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services
Group Based Load Balancing Algorithm in Cloud Computing Virtualization
Group Based Load Balancing Algorithm in Cloud Computing Virtualization Rishi Bhardwaj, 2 Sangeeta Mittal, Student, 2 Assistant Professor, Department of Computer Science, Jaypee Institute of Information
A Review on Load Balancing In Cloud Computing 1
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue 6 June 2015, Page No. 12333-12339 A Review on Load Balancing In Cloud Computing 1 Peenaz Pathak, 2 Er.Kamna
International Journal of Advance Research in Computer Science and Management Studies
Volume 3, Issue 6, June 2015 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
Survey on Job Schedulers in Hadoop Cluster
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 1 (Sep. - Oct. 2013), PP 46-50 Bincy P Andrews 1, Binu A 2 1 (Rajagiri School of Engineering and Technology,
CiteSeer x in the Cloud
Published in the 2nd USENIX Workshop on Hot Topics in Cloud Computing 2010 CiteSeer x in the Cloud Pradeep B. Teregowda Pennsylvania State University C. Lee Giles Pennsylvania State University Bhuvan Urgaonkar
Cloud Partitioning of Load Balancing Using Round Robin Model
Cloud Partitioning of Load Balancing Using Round Robin Model 1 M.V.L.SOWJANYA, 2 D.RAVIKIRAN 1 M.Tech Research Scholar, Priyadarshini Institute of Technology and Science for Women 2 Professor, Priyadarshini
An Adaptive Scheduling Algorithm for Dynamic Heterogeneous Hadoop Systems
An Adaptive Scheduling Algorithm for Dynamic Heterogeneous Hadoop Systems Aysan Rasooli, Douglas G. Down Department of Computing and Software McMaster University {rasooa, downd}@mcmaster.ca Abstract The
A Locality Enhanced Scheduling Method for Multiple MapReduce Jobs In a Workflow Application
2012 International Conference on Information and Computer Applications (ICICA 2012) IPCSIT vol. 24 (2012) (2012) IACSIT Press, Singapore A Locality Enhanced Scheduling Method for Multiple MapReduce Jobs
GraySort on Apache Spark by Databricks
GraySort on Apache Spark by Databricks Reynold Xin, Parviz Deyhim, Ali Ghodsi, Xiangrui Meng, Matei Zaharia Databricks Inc. Apache Spark Sorting in Spark Overview Sorting Within a Partition Range Partitioner
A Load Balancing Model Based on Cloud Partitioning for the Public Cloud
IEEE TRANSACTIONS ON CLOUD COMPUTING YEAR 2013 A Load Balancing Model Based on Cloud Partitioning for the Public Cloud Gaochao Xu, Junjie Pang, and Xiaodong Fu Abstract: Load balancing in the cloud computing
Multilevel Communication Aware Approach for Load Balancing
Multilevel Communication Aware Approach for Load Balancing 1 Dipti Patel, 2 Ashil Patel Department of Information Technology, L.D. College of Engineering, Gujarat Technological University, Ahmedabad 1
Big Data Storage Architecture Design in Cloud Computing
Big Data Storage Architecture Design in Cloud Computing Xuebin Chen 1, Shi Wang 1( ), Yanyan Dong 1, and Xu Wang 2 1 College of Science, North China University of Science and Technology, Tangshan, Hebei,
IMPROVEMENT OF RESPONSE TIME OF LOAD BALANCING ALGORITHM IN CLOUD ENVIROMENT
IMPROVEMENT OF RESPONSE TIME OF LOAD BALANCING ALGORITHM IN CLOUD ENVIROMENT Muhammad Muhammad Bala 1, Miss Preety Kaushik 2, Mr Vivec Demri 3 1, 2, 3 Department of Engineering and Computer Science, Sharda
Task Scheduling for Efficient Resource Utilization in Cloud
Summer 2014 Task Scheduling for Efficient Resource Utilization in Cloud A Project Report for course COEN 241 Under the guidance of, Dr.Ming Hwa Wang Submitted by : Najuka Sankhe Nikitha Karkala Nimisha
A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster
, pp.11-20 http://dx.doi.org/10.14257/ ijgdc.2014.7.2.02 A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster Kehe Wu 1, Long Chen 2, Shichao Ye 2 and Yi Li 2 1 Beijing
Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk.
Load Rebalancing for Distributed File Systems in Clouds. Smita Salunkhe, S. S. Sannakki Department of Computer Science and Engineering KLS Gogte Institute of Technology, Belgaum, Karnataka, India Affiliated
