Quantitative Analysis of Cloud-based Streaming Services

Similar documents

Load Balancing and Switch Scheduling

CHAPTER 5 WLDMA: A NEW LOAD BALANCING STRATEGY FOR WAN ENVIRONMENT

UNIT 2 QUEUING THEORY

Chapter 1. Introduction

CHAPTER 3 CALL CENTER QUEUING MODEL WITH LOGNORMAL SERVICE TIME DISTRIBUTION

Queuing Theory. Long Term Averages. Assumptions. Interesting Values. Queuing Model

Can We Beat DDoS Attacks in Clouds?

Performance Analysis of a Telephone System with both Patient and Impatient Customers

ECE 333: Introduction to Communication Networks Fall 2002

Process simulation. Enn Õunapuu

Basic Queuing Relationships

How To Balance In A Distributed System

1 st year / / Principles of Industrial Eng. Chapter -3 -/ Dr. May G. Kassir. Chapter Three

Analysis of a Production/Inventory System with Multiple Retailers

Lecture Outline Overview of real-time scheduling algorithms Outline relative strengths, weaknesses

The International Journal Of Science & Technoledge (ISSN X)

Efficient Service Broker Policy For Large-Scale Cloud Environments

Chapter 13 Waiting Lines and Queuing Theory Models - Dr. Samir Safi

Deployment of express checkout lines at supermarkets

CALL CENTER PERFORMANCE EVALUATION USING QUEUEING NETWORK AND SIMULATION

Smart Queue Scheduling for QoS Spring 2001 Final Report

Efficient Parallel Processing on Public Cloud Servers Using Load Balancing

M/M/1 and M/M/m Queueing Systems

Minimize Wait Time and Improve the Waiting Experience

A Proposed Service Broker Strategy in CloudAnalyst for Cost-Effective Data Center Selection

Keywords Load balancing, Dispatcher, Distributed Cluster Server, Static Load balancing, Dynamic Load balancing.

Effective Utilization of Mobile Call Center Using Queuing Models

Periodic Capacity Management under a Lead Time Performance Constraint

Simulation Software 1

Operating Systems. III. Scheduling.

Design and Modeling of Internet Protocols. Dmitri Loguinov March 1, 2005

PERFORMANCE ANALYSIS OF PaaS CLOUD COMPUTING SYSTEM

Waiting Times Chapter 7

Pull versus Push Mechanism in Large Distributed Networks: Closed Form Results

Web Server Software Architectures

Supplement to Call Centers with Delay Information: Models and Insights

A QUEUEING-INVENTORY SYSTEM WITH DEFECTIVE ITEMS AND POISSON DEMAND.

1: B asic S imu lati on Modeling

A Review on Load Balancing In Cloud Computing 1

Gopinath Panda Veena Goswami A D Banik

Multi-service Load Balancing in a Heterogeneous Network with Vertical Handover

CoMPACT-Monitor: Change-of-Measure based Passive/Active Monitoring Weighted Active Sampling Scheme to Infer QoS

Chapter 10: Scalability

Profit Maximization and Power Management of Green Data Centers Supporting Multiple SLAs

Discrete-Event Simulation

Frame Burst Adjusting for Transmitting Video Conference in Gigabit Ethernet

Operating Systems Concepts: Chapter 7: Scheduling Strategies

On the Feasibility of Prefetching and Caching for Online TV Services: A Measurement Study on Hulu

An Approach to Load Balancing In Cloud Computing

The Data Replication Bottleneck: Overcoming Out of Order and Lost Packets across the WAN

1. Implementation of a testbed for testing Energy Efficiency by server consolidation using Vmware

Network Design Performance Evaluation, and Simulation #6

Secure SCTP against DoS Attacks in Wireless Internet

Cloud Enabled Emergency Navigation Using Faster-than-real-time Simulation

4 The M/M/1 queue. 4.1 Time-dependent behaviour

Public Cloud Partition Balancing and the Game Theory

Road Map. Scheduling. Types of Scheduling. Scheduling. CPU Scheduling. Job Scheduling. Dickinson College Computer Science 354 Spring 2010.

Giving life to today s media distribution services

BRAESS-LIKE PARADOXES FOR NON-COOPERATIVE DYNAMIC LOAD BALANCING IN DISTRIBUTED COMPUTER SYSTEMS

LECTURE 16. Readings: Section 5.1. Lecture outline. Random processes Definition of the Bernoulli process Basic properties of the Bernoulli process

Sla Aware Load Balancing Algorithm Using Join-Idle Queue for Virtual Machines in Cloud Computing

Optimal Hiring of Cloud Servers A. Stephen McGough, Isi Mitrani. EPEW 2014, Florence

A Task-Based Adaptive-TTL approach for Web Server Load Balancing *

Simple Queuing Theory Tools You Can Use in Healthcare

The Exponential Distribution

Deciding which process to run. (Deciding which thread to run) Deciding how long the chosen process can run

How To Model A System

Radio Resource Allocation in GSM/GPRS Networks

International Journal of Scientific & Engineering Research, Volume 6, Issue 3, March ISSN

CPU Scheduling. Core Definitions

Internet Video Streaming and Cloud-based Multimedia Applications. Outline

Chapter 2. Simulation Examples 2.1. Prof. Dr. Mesut Güneş Ch. 2 Simulation Examples

International Journal of Scientific & Engineering Research, Volume 6, Issue 4, April ISSN

OPTIMIZED PERFORMANCE EVALUATIONS OF CLOUD COMPUTING SERVERS

AN EFFICIENT DISTRIBUTED CONTROL LAW FOR LOAD BALANCING IN CONTENT DELIVERY NETWORKS

Queueing Systems. Ivo Adan and Jacques Resing

Keywords: Dynamic Load Balancing, Process Migration, Load Indices, Threshold Level, Response Time, Process Age.

Hydrodynamic Limits of Randomized Load Balancing Networks

How To Manage A Call Center

Internet Traffic Variability (Long Range Dependency Effects) Dheeraj Reddy CS8803 Fall 2003

Modeling and simulation modeling

Simulating a File-Sharing P2P Network

Optimal Dynamic Resource Allocation in Multi-Class Queueing Networks

Figure 1. The cloud scales: Amazon EC2 growth [2].

INTEGRATED OPTIMIZATION OF SAFETY STOCK

EXAM IN COURSE [EKSAMEN I EMNE] TTM4110 Dependability and Performance with Discrete event Simulation [Pålitelighet og ytelse med simulering]

Transcription:

of Cloud-based Streaming Services Fang Yu 1, Yat-Wah Wan 2 and Rua-Huan Tsaih 1 1. Department of Management Information Systems National Chengchi University, Taipei, Taiwan 2. Graduate Institute of Logistics Management National Dong Hwa University, Hualien, Taiwan June 14, 2013 1 / 32

IEEE SCC 2013 Overview Motivation This work has been accepted to be published in: 10th IEEE International Conference on Service Computing Santa Clara, June 2013. 2 / 32

Cloud-based Services Motivation Services deployed on Clouds are getting more and more popular Commerce: online banking, online shopping, etc. Entertainment: online music and videos, gaming, etc. Interaction: social networks, blogger, etc. Advantages Service Scalability and Availability Dynamic Resource Allocation 3 / 32

Servie Quality Overview Motivation Managing high-standard service quality becomes a critical issue for online business success As for the National Palace Museum project, we are particularly interested in online cloud-based streaming services. 4 / 32

Theoretical Gap Overview Motivation Intuitively, the service quality is related with: the processing ability on the server side, the bandwidth allocated for the customer, the traffic condition of the Internet, and the processing ability of the end device on the client side. There is a lack of formal quantitative analysis, nor theoretical exploration, of the relationship between the service quality and these four factors, particularly for cloud-based services. 5 / 32

Objectives Overview Motivation This study addresses this theoretical gap with: formalizing the service framework for cloud-based streaming services with queuing models proposing macro models for deriving operations characteristics of systems with closed form expressions (from the system perspective) proposing micro models with simulation procedures for observing service dynamics and indicators (from the customer perspective) 6 / 32

Overview Framework Queuing Models Cloud-based Service System Using ipalace Video Channel as an example: 7 / 32

Framework Queuing Models The Two-phase Service Framework PHASE I: 1 The arrival comes to the Dispatcher. 2 The Dispatcher responds with the corresponding worker. PHASE II: 1 The arrival is redirected to the corresponding worker. 2 The corresponding worker provides the subsequent video service (similar to a television program) via broadcasting the selected video streaming that interweaves with the advertisements until the arrival reneges. 8 / 32

The Queuing Model of Phase I Framework Queuing Models The service system regarding the phase I is a single-server service system that every arrival is served by the Dispatcher. Phase I is an M/D/1 : / /FCFS queuing system: 1 Arrivals are served on a FCFS (first come, first serve) basis. 2 Arrivals are independent of preceding arrivals, but the arrival rate λ does not change over time. 3 Arrivals are described by a Poisson probability distribution and come from a very large population. 4 Service times for all arrivals are fixed and known (a constant denoted as d). 5 d is negligible, compared with the value of 1 λ. 9 / 32

The Queuing Model of Phase I Framework Queuing Models In this M/D/1 : / /FCFS queuing system, the service time is fixed as d (to have a stable system, λd needs to be smaller than 1). the stationary-state probability of zero arrivals in the system P 0 equals 1 λd the average number of customers in the service system L equals 2λd λ2 d 2 2(1 λd), the average number of arrivals waiting in queues L q equals λ 2 d 2 2(1 λd), the average time spent in the service system W equals 2d λd 2 2(1 λd), and the average time spent waiting in queues W q equals λd 2 2(1 λd). 10 / 32

The Queuing Model of Phase II Framework Queuing Models Arrival: With the assumptions of the Phase I system, the departure process of Phase I is the same as the arrival process of Phase II. Thus the arrival process of Phase II is a Poisson process with mean rate λ. Departure: It is the reneging behavior of the customer that defines the service time of the customer with the corresponding worker. Such service times are assumed to be independent and identically distributed across customers with a known distribution. 11 / 32

The Queuing Model of Phase II Framework Queuing Models The service system in phase II is an M/M/1 : /BOUND/round-robin queuing system. Arrivals are served on a round-robin basis. Arrivals follow the Poisson process of rate λ, independent of the service process. The time WaVidtime for a customer spent on watching video are i.i.d. negative exponential probability distribution of rate µ, a known positive constant. Thus, P(WaVidtime x) = 1 - e µx for x 0. 12 / 32

The Queuing Model of Phase II Framework Queuing Models Assuming the initial setup and communication time are relative small and negligible, it follows that the service time for a worker spent on an arrival (i.e., the duration from the epoch that the worker starts to generate video packets for the customer to the epoch that the worker stops to generate video packets for the customer) are also i.i.d. negative exponential probability distribution of rate µ. Each worker serves at most the BOUND amount of arrivals. Once the value of LENGTH hits the BOUND, no new customer will be assigned to this worker, which will retire from service after clearing the customers on hand. 13 / 32

Round-Robin Schedule Framework Queuing Models The worker adopts the round-robin algorithm (Silberschatz et al. 2004) to handle tasks of all arrivals and spends QUANTUM in each arrival for generating the packages of video, while the remaining tasks of the other arrivals are waiting in queue without being executed. 14 / 32

Macroscopic View Microscopic View The key questions for the manager of online streaming services to answer are: the operations characteristics of the system, which can be measured by indicators such as the average number of customers in system and the average number of VMs used, and the service quality for customers such as experienced lag time and interruptions 15 / 32

Macroscopic View Microscopic View Macroscopic Model: Model for the Number of Customers in System Let s = (n; n 1,..., n BOUND ) be the state of the system, where n is the number of customers at the corresponding worker, n BOUND 1. n j is the number of retiring workers with j customer at the machine, j {1..BOUND}. Total rate out: q s = [ λ + ( n + BOUND j=1 j n j ) µ Effect of an arrival: q s,s+ = λ, for n BOUND-2; q s,r = λ, for n = BOUND-1; Effect of a departure: q s,s = n µ q s,sj = n j µ j, for n j 1, j {1..BOUND}. ] ; 16 / 32

Macroscopic View Microscopic View Stationary Distribution of the State of the Corresponding Worker We derive the stationary distribution of the state of the corresponding worker when a new arrival comes in. The state is defined as the number of customers served by the corresponding worker (before the new arrival). 17 / 32

Stationary AnalysisI Macroscopic View Microscopic View When stable, we have: λ p BOUND 1 + µ p 1 = λ p 0, λ p i 1 + (i + 1) µ p i+1 = (λ + i µ) p i, i {1..BOUND 2}, λ p BOUND 2 = (λ + µ (BOUND 1)) p BOUND 1. (1) 18 / 32

Stationary Analysis Macroscopic View Microscopic View Applying the normalization equation to Eqt. (??), we have, for κ {1..BOUND}, the closed form solution: p BOUND κ = ( κ κ 1 ) m=1 j=m (BOUND j) θ k m 1 + [ BOUND k k=2 m=1 ( k 1 j=m (BOUND j) ) θ k m ]. (2) 19 / 32

Macroscopic View Microscopic View Average Number of Customers (Seen by a New Arrival) L = BOUND 1 k=0 p k k, (3) L 0 5 10 15 20 25 30 35 Bound = 5 Bound = 10 Bound = 20 Bound = 50 0.0 0.2 0.4 0.6 0.8 1.0 theta 20 / 32

Macroscopic View Microscopic View The Mean Time for a Virtual Machine from Activated to Idle We deduce the mean time taken for a worker to go through the process of being activated, followed by retirement, and then eventually becomes unemployed (to the idle pool). 21 / 32

Macroscopic View Microscopic View The Mean Time for a Virtual Machine from Activated to Idle Within one round of duty, on average a worker works for E(T 0,0r ) = E(T 0,BOUNDr ) + E(T BOUNDr,0 r ) units of time, and on average accepts λ E(T 0,BOUNDr ) arrivals. Because the rate of customers handled by one worker is on average there are workers employed by the system. λ E(T 0,BOUNDr ) E(T 0,BOUNDr ) + E(T BOUNDr,0 r ), 1 + E(T BOUND r,0 r ) E(T 0,BOUNDr ) 22 / 32

Macroscopic View Microscopic View Microscopic Model: Model for the Service Experience of Customers Given the time-sharing dynamics at QUANTUM level of workers, the traffic condition of WWW, and the properties of the end device of the customer, the microscopic model discusses procedures that eventually give the (expected) amount of lag time and the number of interruptions experienced by a customer. 23 / 32

Macroscopic View Microscopic View Arrival, Effect and Lag Time of the i th Packets ArrivalTime(i) = InitServer + i 1 (LENGTH j QUANTUM) + QUANTUM + j=1 NormalTransTime(i) + DelayTransTime(i), i 1; (4) EffectTime(i) =max{effecttime(i 1)+ SessionTime,ArrivalTime(i)}, i 2. (5) LagTime(i) = [ArrivalTime(i) (EffectTime(i 1)+ SessionTime)] +, i 2. (6) 24 / 32

Simulation Overview Macroscopic View Microscopic View Simulation procedure for the End Device Queue for a Customer Arriving at state k with k < BOUND with given QUANTUM, SessionTime, NormalTransTime(), DelayTransTime(). parameter NI LT NR p u I k L k s rb definition the cumulative number of interruptions within the WaVidTime of a customer the corresponding cumulative lag time the number of replications in the simulation procedure the probability of the state going up the estimate of expected number of interruptions in WaVidTime for a customer who on arrival finds k customers with the corresponding worker the estimate of the expected duration of lag time for a customer who on arrival finds k customers the current number of customers i a flag for being in the retiring stage 25 / 32

Simulation Overview Macroscopic View Microscopic View We simulate NR times of a sample term. Each sample term simulates the process of a customer from its arrival to its departure, collecting accumulated lag time and interruptions that the customer experiences. 1 Set NI = 0, LT = 0, t 0 = 0, m = 1. Select a value for NR. 2 While (m NR) simulate each term 3 For a customer arriving at state k, I k = NI /NR, and L k = LT /NR. 26 / 32

Simulation Overview Macroscopic View Microscopic View 1 Set n=1, s=k+1, rb=false; 2 If rb=true or s = BOUND, set η = s µ, p u = 0, and rb=true; else η = λ + s µ and p u = λ λ+s µ. 3 Draw a random variate t n -t n 1 from exp(η). The worker stays at state s in (t n 1, t n ). 4 Simulate the end device queue according to Eqt. (4), (5) and (6) with LENGTH = s. Set NI = NI +1 whenever an interruption occurs, and accumulate the corresponding lag time of the customer in (t n 1, t n ) to LT. 5 At t n, set δ = 1 with probability p u, and δ = -1 with probability (1-p u ). If δ = -1, go to 2.6 with probability 1 s else set n = n+1, s = s+δ, and go to 2.2. 6 Set m = m+1. 27 / 32

Lagtime Overview Macroscopic View Microscopic View Lagtime = BOUND 1 k=0 p k L k, (7) LagTime 0 100 200 300 400 500 600 700 Bound = 5 Bound = 10 Bound = 20 Bound = 50 0.0 0.2 0.4 0.6 0.8 1.0 theta 28 / 32

Interrupt Overview Macroscopic View Microscopic View Interruption = BOUND 1 k=0 p k I k, (8) Interrupt 0 5000 10000 15000 20000 Bound = 5 Bound = 10 Bound = 20 Bound = 50 0.0 0.2 0.4 0.6 0.8 1.0 theta 29 / 32

As online streaming services significantly increase in recent years, it becomes a critical issue to offer quality service via systems that benefit from cloud computing developments. We pioneer the study on quantitative analysis to estimate and further improve the quality of cloud-based streaming services, deriving theoretical results on operations characteristics of queuing models with mild assumptions. By simulating the continuous-time Markov chain according to the adopted operations rules for VMs, we also get performance indicators such as the average lag time and interruptions that a customer may experience under different environment settings. 30 / 32

Ongoing work The presented approach provides managers of online streaming services a formal and systematic way to evaluate service quality before launch. One of our ongoing work is applying the presented approach to analyze ipalace Channel (Tsaih et al. 2012), a digital exhibition channel of National Palace Museum (NPM) in Taiwan. 31 / 32

Any suggestion? Thank you for your attention. 32 / 32