004.738.5:378.091.214.18 ADJUSTING THE MASSIVELY OPEN ONLINE COURSES IN CLOUD COMPUTING ENVIRONMENT 9



Similar documents
CloudAnalyst: A CloudSim-based Visual Modeller for Analysing Cloud Computing Environments and Applications

Round Robin with Server Affinity: A VM Load Balancing Algorithm for Cloud Based Infrastructure

A Proposed Service Broker Strategy in CloudAnalyst for Cost-Effective Data Center Selection

CloudAnalyst: A CloudSim-based Tool for Modelling and Analysis of Large Scale Cloud Computing Environments

Efficient Service Broker Policy For Large-Scale Cloud Environments

Keywords: Cloudsim, MIPS, Gridlet, Virtual machine, Data center, Simulation, SaaS, PaaS, IaaS, VM. Introduction

Service Broker Algorithm for Cloud-Analyst

A NOVEL LOAD BALANCING STRATEGY FOR EFFECTIVE UTILIZATION OF VIRTUAL MACHINES IN CLOUD

CDBMS Physical Layer issue: Load Balancing

WEIGHTED ROUND ROBIN POLICY FOR SERVICE BROKERS IN A CLOUD ENVIRONMENT

Cloud Computing Simulation Using CloudSim

A Proposed Service Broker Policy for Data Center Selection in Cloud Environment with Implementation

High performance computing network for cloud environment using simulators

Analysis of Service Broker Policies in Cloud Analyst Framework

Performance Evaluation of Round Robin Algorithm in Cloud Environment

A Comparative Study of Load Balancing Algorithms in Cloud Computing

LOAD BALANCING OF USER PROCESSES AMONG VIRTUAL MACHINES IN CLOUD COMPUTING ENVIRONMENT

Hierarchical Trust Model to Rate Cloud Service Providers based on Infrastructure as a Service

An Efficient Cloud Service Broker Algorithm

Throtelled: An Efficient Load Balancing Policy across Virtual Machines within a Single Data Center

EFFICIENT VM LOAD BALANCING ALGORITHM FOR A CLOUD COMPUTING ENVIRONMENT

Cloud Analyst: An Insight of Service Broker Policy

CloudSim: A Toolkit for Modeling and Simulation of Cloud Computing Environments and Evaluation of Resource Provisioning Algorithms

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 4, July-Aug 2014

Dr. Ravi Rastogi Associate Professor Sharda University, Greater Noida, India

Estimating Trust Value for Cloud Service Providers using Fuzzy Logic

Tableau Server 7.0 scalability

An Efficient Hybrid P2P MMOG Cloud Architecture for Dynamic Load Management. Ginhung Wang, Kuochen Wang

IMPROVEMENT OF RESPONSE TIME OF LOAD BALANCING ALGORITHM IN CLOUD ENVIROMENT

Load Balancing in Cloud Computing using Stochastic Hill Climbing-A Soft Computing Approach

An Implementation of Load Balancing Policy for Virtual Machines Associated With a Data Center

Dr. J. W. Bakal Principal S. S. JONDHALE College of Engg., Dombivli, India

Comparison of Dynamic Load Balancing Policies in Data Centers

SERVICE BROKER ROUTING POLICES IN CLOUD ENVIRONMENT: A SURVEY

Simulation-based Evaluation of an Intercloud Service Broker

An efficient VM load balancer for Cloud

Tableau Server Scalability Explained

Load Testing on Web Application using Automated Testing Tool: Load Complete

A Scalable Network Monitoring and Bandwidth Throttling System for Cloud Computing

Utilizing Round Robin Concept for Load Balancing Algorithm at Virtual Machine Level in Cloud Environment

Microsoft Dynamics NAV 2013 R2 Sizing Guidelines for On-Premises Single Tenant Deployments

German Open Course Review - Future of Learning

ENERGY EFFICIENT VIRTUAL MACHINE ASSIGNMENT BASED ON ENERGY CONSUMPTION AND RESOURCE UTILIZATION IN CLOUD NETWORK

Load Balancing Algorithm Based on Estimating Finish Time of Services in Cloud Computing

Importance of online learning platforms. Nurzhan Bakibayev, Advisor

Newsletter 4/2013 Oktober

Cloud Computing through Virtualization and HPC technologies

An Efficient Adaptive Load Balancing Algorithm for Cloud Computing Under Bursty Workloads

Technical Investigation of Computational Resource Interdependencies

* A Practical Response to. Massive Open Online Courses (MOOCs)

Performance Analysis of VM Scheduling Algorithm of CloudSim in Cloud Computing

2) Xen Hypervisor 3) UEC

Multilevel Communication Aware Approach for Load Balancing

Real-Time Analysis of CDN in an Academic Institute: A Simulation Study

Performance analysis and comparison of virtualization protocols, RDP and PCoIP

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

A Dynamic Resource Management with Energy Saving Mechanism for Supporting Cloud Computing

Nutan. N PG student. Girish. L Assistant professor Dept of CSE, CIT GubbiTumkur

The International Research Foundation for English Language Education

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 4, July-Aug 2014

MOOCaaS: MOOCs as a Service

Reallocation and Allocation of Virtual Machines in Cloud Computing Manan D. Shah a, *, Harshad B. Prajapati b

Cloud Performance and Load Balancing Algorithm

CLOUD COMPUTING PERFORMANCE EVALUATION: ISSUES AND CHALLENGES

Innovative forms of delivery in an online university : moocs. José Antonio Díaz (UNED-Spain) jdiaz@poli.uned.es

Auto-Scaling Model for Cloud Computing System

Red Hat Network Satellite Management and automation of your Red Hat Enterprise Linux environment

PERFORMANCE ANALYSIS OF PaaS CLOUD COMPUTING SYSTEM

Jahresakademie KAAD Bonn, A very brief history of Massive Open Online Courses. 2 Leuphana Digital School s Mentored Open Online Courses

Response Time Minimization of Different Load Balancing Algorithms in Cloud Computing Environment

Dynamically optimized cost based task scheduling in Cloud Computing

Scheduling Virtual Machines for Load balancing in Cloud Computing Platform

Dynamic Round Robin for Load Balancing in a Cloud Computing

FOR SERVERS 2.2: FEATURE matrix

Chapter 19 Cloud Computing for Multimedia Services

How To Test For Performance And Scalability On A Server With A Multi-Core Computer (For A Large Server)

Software-Defined Networks Powered by VellOS

PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE

CloudAnalyzer: A cloud based deployment framework for Service broker and VM load balancing policies

A Comparison of Four Popular Heuristics for Load Balancing of Virtual Machines in Cloud Computing

Efficient and Enhanced Load Balancing Algorithms in Cloud Computing

Role of Cloud Computing to Overcome the Issues and Challenges in E-learning

ISBN: SDIWC 1

Model-driven Performance Estimation, Deployment, and Resource Management for Cloud-hosted Services

MOOCs: The Good, the Bad, and the Potential. Ann H. Taylor Director, Dutton e-education Institute Penn State University

Intelligent Content Delivery Network (CDN) The New Generation of High-Quality Network

Red Hat Satellite Management and automation of your Red Hat Enterprise Linux environment

HKUST-MIT Research Alliance Consortium. Call for Proposal. Lead Universities. Participating Universities

Energy Constrained Resource Scheduling for Cloud Environment

HALF THE PRICE. HALF THE POWER. HALF THE SPACE.

Scala Storage Scale-Out Clustered Storage White Paper

Technology Partners. Acceleratio Ltd. is a software development company based in Zagreb, Croatia, founded in 2009.

Design of Simulator for Cloud Computing Infrastructure and Service

Payment minimization and Error-tolerant Resource Allocation for Cloud System Using equally spread current execution load

Shoal: IaaS Cloud Cache Publisher

Group Based Load Balancing Algorithm in Cloud Computing Virtualization

DELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering

Sla Aware Load Balancing Algorithm Using Join-Idle Queue for Virtual Machines in Cloud Computing

Dynamic Resource allocation in Cloud

Design of Cloud Services for Cloud Based IT Education

Transcription:

004.738.5:378.091.214.18 ADJUSTING THE MASSIVELY OPEN ONLINE COURSES IN CLOUD COMPUTING ENVIRONMENT 9 Aleksandar Karadimce, MSc University of information science and technology St. Paul the Apostle Ohrid, Macedonia aleksandar.karadimce@uist.edu.mk, akaradimce@ieee.org Abstract Massively Open Online Courses (MOOC) is new trending model for delivering learning content online to any person with no limit on attendance for everyone that wishes to attend to a course. Following the MOOC trend there is an expectation that till 2015 there will be more than 6 million unique students worldwide. In order to provide appropriate quality learning user experience for the students we propose how to optimize the cloud computing resources, such as Virtual Machine (VM) distribution per contents to support the MOOC. For this purpose we have used a simulation tool, called CloudAnalyst, which supports visual modelling and simulation of large-scale applications that are deployed on cloud infrastructures. The results will provide information for the geographic distribution of traffic and location of data centres and number of resources for each data centre. Key words: Massively Open Online Courses, Coursera, Cloud Computing, CloudAnalyst. INTRODUCTION University study provides students with the opportunity to learn by working in groups during class time, presenting their work in class, and even collaborating together on a project, which as we all know is happening in real-time. Recently, there is a need to shift from a monolithic learning environment in which everything must be controlled and predictable to a more pluralistic learning ecology in which both prescriptive and emergent IT application domains and models of learning have their place [1]. Another research, had asked participants in online learning environment to describe their learning experience in the context of multiple communities and found that there s an apparent importance of expressing new perspectives across 9 original scientific paper 73

multiple communities that spring from online courses [2]. Using this approach, students are expanding the ability for creating, sharing and collaborating through emerging technologies such as blogs, wikis, podcasts, and social networks. In the same time professors and teaching assistants will spent less time on lecturing the students, and they can be more effective in scaffolding and tutoring the students to improve their skills. Recent research has proposed cloud computing (CC) to be used as approach for online user-assessment and classification of learning materials, in order to provide personalized resource recommendation [3]. The last two years there is a trend called massively open online course (MOOC), which have generated wide attention due to initiatives made by high-profile institutions such as Stanford and MIT, which have formed start-ups such as Coursera, Udacity, and edx [4]. Distance learning and the MOOC have a great potential for informal and lifelong learning. The MOOC presents important challenge related to student engagement and evaluation, in order to ensure an exceptional student experience and to avoid high dropout rates researchers are using pedagogic approach by experiential learning and collaborative learning theories [4]. The MOOC allows hosting university to open its curriculum to a wider audience. In order to be able to support simultaneously large number of users from all around the world there needs to be optimal distribution of CC resources. Considering that nowadays the trend of MOOC reputation is to be able to deliver real-time uninterrupted delivery of multimedia contents promptly to all students [5]. The main contribution in this paper is to provide appropriate optimal configuration of CC environment for the providers of MOOC to deliver high-quality learning material. To be able to provide finest configuration of CC environment we have used simulation tool, called CloudAnalyst, to plan the resources needed for MOOC to be accessible and reliable worldwide. This paper is organized as follows: Firstly we present the concept of the cloud-based massively open online courses. Next it is described the simulation of MOOC use case and comparison of two simulations with CloudAnalyst. At the end is provided discussion for the results and conclusion of the paper. CLOUD-BASED MASSIVELY OPEN ONLINE COURSES The MOOC networks can be used for creating, sharing, and refining the knowledge between the participants. Learners are encouraged to represent their knowledge or questions through diversity of platforms and to connect to each other around issues and topics of shared interest. That way, 74

MOOC builds a model of collaborative networks of unprecedented rapidly expanding educational resources. The MOOC environment is a self-guided course that could have thousands of participants. It requires that a participant is willing to engage, and has some confidence and comfort with the discourses of an autodidact and a self-starter [5]. MOOC offers opportunities for interaction of student with the material as well as student with their peers. The open online course can be used by potentially large number of students from all over the world. The real potential of MOOC is that CC based support for the emergence of learning networks among participants in a many-to-many relationship, rather than the traditional one-to-many model of interactions between a teacher and his or her students [6]. Common for all MOOC providers is that all of them have been established during 2012 by leading world universities and they all have large number of participants. Another common feature for MOOCs is that they are cloud-based learning platforms. The Google Engine platform is the base for Udacity and Khan Academy, whereas, the Amazon Web Service Infrastructure is support for Coursera, Instructure, LoudCloud Systems and Lore [7]. Considering the significant large number of students enrolled in Coursera we have decided to conduct further and deeper research for this MOOC. The popularity of Coursera in world scale, broken down by country [8], is given in Figure 1 (left). Another research made by Outsell [9] have estimated that by the end of 2012, about 3.17 million unique students worldwide were enrolled in courses from the major MOOC providers, see Figure 1 (right). Following the MOOC trend and popularity they are expecting that till 2015 there will be more than 3 million unique students only from North and South America, and the rest of 3 million unique students will be from other worldwide continents [9]. MOOCs allow learning to happen across space and time due to its mainly asynchronous and online architecture. The MOOC is a gathering of people with generally good internet connection, it has a unique social advantage that relates to a more open and connected way of thinking. Considering that nowadays the trend of MOOC reputation the providers should be able to deliver real-time uninterrupted delivery of multimedia contents promptly to all students. In order to provide appropriate and optimal configuration of CC environment for the MOOC providers we will use the CloudAnalyst simulator. 75

Figure 1. Country distribution [8] (left), Global MOOC audience [9] (right) In this research we have used CloudAnalyst [10] to model a simulated environment to study the behavior of a MOOC platform in CC environment. It provides graphical output in the form of tables and charts is highly desirable to summarize the potentially large amount of statistics that is collected during the simulation. The simulator is developed on Java platform, using Java SE 1.6, and the GUI component is built using Swing components [10]. CloudAnalyst, built on top of CloudSim, allows description of application workloads, including information of geographic location of users generating traffic and location of data centers, number of users and data centers, and number of resources in each data center [10]. A User Base models a group of users that is considered as a single unit in the simulation and its main responsibility is to generate traffic for the simulation. We have used Throttled Load Balancer [10] that ensures only a pre-defined number of Internet Cloudlets are allocated to a single VM at any given time. With this balancer if more request groups are present than the number of available VM s at a data center, some of the requests will have to be queued until the next VM becomes available. Modeling bandwidth for a complex network like the MOOC is an extremely difficult task. In the current version of CloudAnalyst it is used a hypothetical parameter, available bandwidth, which is assumed to be the quota of Internet bandwidth available for the application being simulated, ignoring other external factors [10]. For that purpose we model the behavior of MOOC platform for the use case Coursera and using CloudAnalyst we will evaluate time response of data centers and performances related to use of CC to host MOOCs courses worldwide. 76

A CASE STUDY: SIMULATION OF A MOOC IN CLOUDANALYST Streaming video requires reasonable quality of bandwidth, and a reasonably new computer with good quality video/graphics card. Most of the MOOC participants from North America in rural and remote communities may face bandwidth challenges similar to their African peers, but even when they do not, challenges still arise with respect to such things as the possession and use of microphones, web cams, and headsets [5]. Also, time zones can also be concerns in MOOCs, especially if regular live sessions are planned. Therefore, the use of synchronous tools should be carefully planned in a MOOC, in order to overcome the issue with different time zones [6]. The CloudAnalyst allows clear overview of analysis for different time zone intervals in different regions of the World. The massive size of the participating group maximizes the possibility for participants to find peers who share complementary interests and skills, and with whom they can collaborate to achieve mutually defined goals. The CloudAnalyst simulator environment, identified by CloudAnalyst Region IDs, is divided into 6 Regions that correspond with the 6 main continents in the World. Considering the Coursera students distribution by country, given in Figure 1, we have created the approximate percentage distribution of the Coursera user base across the world is given in Table 1. We have defined 6 user bases representing the above 6 regions corresponding to the CloudAnalyst Region IDs with the following parameters are also given in Table 1. User base TABLE I. Region ID and continent PERCENTAGE DISTRIBUTION AND USER BASES Simultaneous Users Peak Hours Online Users in % (GMT) During Peak Hrs Simultaneous Online Users During Offpeak Hrs UB1 0- North America 43,8 02:00-09:00 430,000 43,000 UB2 1- South America 8,9 06:00-13:00 89,000 8,900 UB3 2- Europe 32,9 10:00-17:00 329,000 32,900 UB4 3- Asia 11,9 15:00-22:00 119,000 11,900 UB5 4- Africa 0,9 11:00-18:00 9,000 900 UB6 5- Australia & Oceania 1,6 20:00-03:00 16,000 1,600 For the sake of simplicity in the simulation configuration each user base is contained within a single time zone and it is assumed that most users use the MOOC platform for about 7 hours, per day. Also, it is also assumed that only (1/10) one tenth of that number of users is on line during the off- 77

peak hours. The cost for hosting Coursera platform in a Cloud is closely following this pricing plan: Cost per VM per hour $0.10; Cost per 1GB of data transfer (from/to Internet): $0.10 [17]. Size of virtual machines (VM) in data centers used to host Coursera in the experiment is 10,000MB. The virtual machines have 1GB of RAM memory and have 1000MB of available bandwidth. Simulated hosts have x86 architecture, virtual machine monitor Xen and Linux operating system. Data center nodes have six core L5640 CPU processors 24GB of RAM and 60x600GB Dual channel SAS disks of storage. Each machine has 2 CPUs, and each CPU has a capacity power of 37006 MIPS. A time-shared policy is used to schedule resources to VMs. Users are grouped by a factor of 1000, and requests are grouped by a factor of 100. Each user request requires 4096 instructions to be executed. a) Scenario 1 Coursera hosted on a single data center Considering the centralized approach for Coursera we assume initially the platform is deployed in a single location, in Region 0 (North America). The simulated data center hosts 50 virtual machines dedicated to Coursera, each one with 1024MB of memory in each VM running on physical processors capable of speeds of 37006 MIPS. After running the simulation in CloudAnalyst for the first scenario with single data center we have received the following simulation output, see Table 2. The response times experienced by each user base for the first scenario are shown graphically in Figure 2. TABLE II. SCENARIO 1: OVERALL RESPONSE AND PROCESSING TIME Avg (ms) Min (ms) Max (ms) Overall response time: 1021 91 3018 Data center processing time: 74 0.15 522 78

Figure 2. Scenario 1: Response time across regions with the single data center The spikes in response times can be seen clearly during the seven hour peak period, and it can be observed how the peak loads of one user base could affect other user bases as well. The UB1 has the peak between 02:00-09:00 GMT, similar UB3 has significantly increased peak time between 10:00-17:00 GMT and all other user bases have small spikes during this same period. Overall, during the 1 day simulation we can see different distribution depending of the user base region and different peak hours per regions. During the high load of the UB1 peak from Region 0 (North America) the time for processing has been highest compared to the processing times of the other regions. b) Scenario 2 Coursera hosted on a two data centers When Coursera grow in popularity on the Internet the most common approach to improve service quality is to deploy the application in several locations around the globe. Consequently for the second scenario in CloudAnalyst, we kept the user bases the same and we add one more data center, in region 2 (Europe). In order to preserve the cost the same 50 virtual machines are spitted on half for each data center. After running the simulation from second scenario with two data centers we have received the following response time and request processing time, see Table 3. 79

TABLE III. SCENARIO 2: OVERALL RESPONSE AND PROCESSING TIME Avg (ms) Min (ms) Max (ms) Overall response time: 825 75 2535 Data center processing time: 414 0.19 1618 Overall, response time in second scenario is significantly lower than the corresponding response time in first scenario with single data center. Contrary, we have increased data center processing time in the second scenario with two data centers, this is because we are using half of the number of VMs in each data center compared to the first scenario. Substantial improvement in the second scenario is evident for the response time by region where the average response time has balanced distribution, see the second column Avg (ms) from Table 4. TABLE IV. SCENARIO 2: RESPONSE TIME BY REGION User Base Avg (ms) Min (ms) Max (ms) UB1 994,941 92,687 2.535,985 UB2 545,046 189,154 1.738,521 UB3 719,439 75,094 1.817,187 UB4 749,981 280,650 1.335,600 UB5 346,825 259,448 876,667 UB6 208,679 166,418 370,501 Significant difference is noticeable in the data center processing times and response times shown in Figure 3. The average processing time of both data centers is increased because of the decreased number of VMs. In the same time we can notice decreased loading traffic for the first data center (DC1), now the peak is 6M requests per hour. Reducing the number of virtual machines by half during each of these peak loads effectively lengthens the time taken to process by about two times in each data center. 80

Figure 3. Scenario 2: Data center processing and loading times Considering the MOOC trend there is an expectation that till 2015 there will be more than 6 million unique students worldwide. In this research we were able to conduct excellent analysis using the simulator CloudAnalyst and we received substantial improvement for the response time in the 6 regions and lowered loading traffic for the first data center (DC1). CONCLUSION We have researched the massive online teaching infrastructures on a worldwide scale because it offers remarkable collaborative and conversational opportunities for students to gather and discuss the course content. We demonstrated how CloudAnalyst can be used to model and evaluate a real world MOOC through a case study of a Coursera deployed on the cloud. We have illustrated how the simulator can be used to effectively identify overall usage patterns and how such usage patterns affect data centers hosting the application. This paper provides appropriate configuration for distribution of the resources by geographic location depending on the workload of CC environment for the providers of MOOC to deliver high-quality learning material. We have proposed to use environment with two data centers, one in North America and the other in Europe, in order to bringing the service closer to the users. This improves the response time in the 6 regions and has shown how to improve the Coursera load balancing at the application level across data centers. 81

REFERENCES 1. R. Williams, R. Karousou, and J. Mackness, Emergent learning and learning ecologies in web 2.0, International Review of Research in Open and Distance Learning 12(3), pp. 1-21, 2011. 2. T. Ozturk, Hayriye, and H. Ozcinar, Learning in multiple communities from the perspective of knowledge capital, The International Review of Research in Open and Distance Learning 14.1 (2013): pp. 205-221. March 2013. 3. R. Boyatt and J. Sinclair, Meeting learners' needs inside the educational cloud, Published by International Journal of Learning Technology, Vol.8, No.1, pp. 61 85, 2013. DOI: 10.1504/IJLT.2013.052827. 4. K. Gerd, B. Arosha, S. Neil, R. Michael, and P. Marian, Educating the Internet-of-Things generation, Published by Computer, 46(2), pp.53 61, 2013. ISSN:0018-9162 5. A. Mc Auley, B. Stewart, G. Siemens, and D. Cormier, The MOOC model for digital practice, 2010. Online: http://www.elearnspace.org/articles/mooc_final.pdf. 6. A. Fini, The technological dimension of a massive open online course: the case of the CCK08 course tools, The International Review of Research in Open and Distance Learning, North America, 10, nov. 2009. 7. Google Introduces Course Builder, an Open Source Project Targeted at MOOCs. Online: http://mfeldstein.com/google-introduces-coursebuilder-an-open-source-project-targeted-at-moocs-but-the-realcompetitor-might-be-amazon/ <accessed 9 December 2013> 8. Coursera, Coursera hits 1 million students across 196 countries, Online http://blog.coursera.org/post/29062736760/coursera-hits-1- million-students-across-196-countries Published on 9 August 2012. 9. K. Worlock, MOOCs: cutting through the hype, Published by Outsell on 27 March 2013. Online: http://www.smarthighered.com/wp- content/uploads/2013/04/moocs-cutting-through-the- HYPE.pdf <accessed 19 December 2013> 10. B. Wickremasinghe, R. N. Calheiros, and R. Buyya, CloudAnalyst: a CloudSim-based visual modeller for analysing cloud computing environments and applications, Proceedings of the 2010 24th IEEE International Conference on Advanced Information Networking and Applications. 2010. pp. 446-452. ISBN: 978-0-7695-4018-4. 82