R.Tamilarasi #1, G.Kesavaraj *2



Similar documents
LONG TERM EVOLUTION WITH 5G USING MAPREDUCING TASK FOR DISTRIBUTED FILE SYSTEMS IN CLOUD

SOLVING LOAD REBALANCING FOR DISTRIBUTED FILE SYSTEM IN CLOUD

Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk.

Distributed file system in cloud based on load rebalancing algorithm

IMPACT OF DISTRIBUTED SYSTEMS IN MANAGING CLOUD APPLICATION

Load Rebalancing for File System in Public Cloud Roopa R.L 1, Jyothi Patil 2

Survey on Load Rebalancing for Distributed File System in Cloud

Load Re-Balancing for Distributed File. System with Replication Strategies in Cloud

Water-Filling: A Novel Approach of Load Rebalancing for File Systems in Cloud

International journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online

Design and Implementation of Performance Guaranteed Symmetric Load Balancing Algorithm

Enhance Load Rebalance Algorithm for Distributed File Systems in Clouds

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

Simple Load Rebalancing For Distributed Hash Tables In Cloud

Balancing the Load to Reduce Network Traffic in Private Cloud

BALANCING BLOCKS FOR DISTRIBUTED FILE SYSTEMS IN CLOUD

How To Balance In Cloud Computing

Load Balancing in Structured Overlay Networks. Tallat M. Shafaat

Secured Load Rebalancing for Distributed Files System in Cloud

A Novel Switch Mechanism for Load Balancing in Public Cloud

Varalakshmi.T #1, Arul Murugan.R #2 # Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam

Dynamo: Amazon s Highly Available Key-value Store

Load Balancing Algorithms in Cloud Environment

Two Level Hierarchical Model of Load Balancing in Cloud

Locality-Aware Randomized Load Balancing Algorithms for DHT Networks

LOAD BALANCING FOR OPTIMAL SHARING OF NETWORK BANDWIDTH

@IJMTER-2015, All rights Reserved 355

LOAD BALANCING ALGORITHM REVIEW s IN CLOUD ENVIRONMENT

An Approach to Load Balancing In Cloud Computing

Load Balancing in cloud computing

International Journal of Scientific & Engineering Research, Volume 6, Issue 4, April ISSN

Scalable Multiple NameNodes Hadoop Cloud Storage System

A Survey on Load Balancing Algorithms in Cloud Environment

LOAD BALANCING STRATEGY BASED ON CLOUD PARTITIONING CONCEPT

CURTAIL THE EXPENDITURE OF BIG DATA PROCESSING USING MIXED INTEGER NON-LINEAR PROGRAMMING

An Efficient Distributed Load Balancing For DHT-Based P2P Systems

MANAGEMENT OF DATA REPLICATION FOR PC CLUSTER BASED CLOUD STORAGE SYSTEM

Availability and Load Balancing in Cloud Computing

A REVIEW ON EFFICIENT DATA ANALYSIS FRAMEWORK FOR INCREASING THROUGHPUT IN BIG DATA. Technology, Coimbatore. Engineering and Technology, Coimbatore.

LOAD BALANCING WITH PARTIAL KNOWLEDGE OF SYSTEM

Cloud Partitioning of Load Balancing Using Round Robin Model

AN EFFECTIVE PROPOSAL FOR SHARING OF DATA SERVICES FOR NETWORK APPLICATIONS

ADAPTIVE LOAD BALANCING ALGORITHM USING MODIFIED RESOURCE ALLOCATION STRATEGIES ON INFRASTRUCTURE AS A SERVICE CLOUD SYSTEMS

The International Journal Of Science & Technoledge (ISSN X)

Efficient Load Balancing Algorithm in Cloud Environment

A STUDY ON HADOOP ARCHITECTURE FOR BIG DATA ANALYTICS

Load Balancing in Distributed Systems: A survey

A PROXIMITY-AWARE INTEREST-CLUSTERED P2P FILE SHARING SYSTEM

A Game Theory Modal Based On Cloud Computing For Public Cloud

Using In-Memory Computing to Simplify Big Data Analytics

Multilevel Communication Aware Approach for Load Balancing

CDBMS Physical Layer issue: Load Balancing

Object Request Reduction in Home Nodes and Load Balancing of Object Request in Hybrid Decentralized Web Caching

Load Balancing in Structured Peer to Peer Systems

The Performance Characteristics of MapReduce Applications on Scalable Clusters

Load Balancing in Structured Peer to Peer Systems

marlabs driving digital agility WHITEPAPER Big Data and Hadoop

Introduction to Hadoop

Load Balancing in Dynamic Structured P2P Systems

CSE-E5430 Scalable Cloud Computing Lecture 2

Research on P2P-SIP based VoIP system enhanced by UPnP technology

RESEARCH PAPER International Journal of Recent Trends in Engineering, Vol 1, No. 1, May 2009

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

High Performance Cluster Support for NLB on Window

The Load Balancing Strategy to Improve the Efficiency in the Public Cloud Environment

Load Balancing in Structured P2P Systems

IMPROVEMENT OF RESPONSE TIME OF LOAD BALANCING ALGORITHM IN CLOUD ENVIROMENT

Redistribution of Load in Cloud Using Improved Distributed Load Balancing Algorithm with Security

Payment minimization and Error-tolerant Resource Allocation for Cloud System Using equally spread current execution load

Efficient Parallel Processing on Public Cloud Servers Using Load Balancing

Secure and Privacy-Preserving Distributed File Systems on Load Rebalancing in Cloud Computing

A Brief Analysis on Architecture and Reliability of Cloud Based Data Storage

QOS Differentiation of Various Cloud Computing Load Balancing Techniques

Models of Load Balancing Algorithm in Cloud Computing

[Laddhad, 4(8): August, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785

2. Research and Development on the Autonomic Operation. Control Infrastructure Technologies in the Cloud Computing Environment

Comparision of k-means and k-medoids Clustering Algorithms for Big Data Using MapReduce Techniques

A Load Balancing Model Based on Cloud Partitioning for the Public Cloud

A Load Balancing Model Using a New Algorithm in Cloud Computing For Distributed File with Hybrid Security

Evaluating HDFS I/O Performance on Virtualized Systems

AN EFFICIENT LOAD BALANCING APPROACH IN CLOUD SERVER USING ANT COLONY OPTIMIZATION

How To Handle Big Data With A Data Scientist

Improved Dynamic Load Balance Model on Gametheory for the Public Cloud

IMPROVED FAIR SCHEDULING ALGORITHM FOR TASKTRACKER IN HADOOP MAP-REDUCE

Cassandra A Decentralized, Structured Storage System

Roulette Wheel Selection Model based on Virtual Machine Weight for Load Balancing in Cloud Computing

Big Data Racing and parallel Database Technology

A Game Theoretic Approach for Cloud Computing Infrastructure to Improve the Performance

A Content-Based Load Balancing Algorithm for Metadata Servers in Cluster File Systems*

Storage Systems Autumn Chapter 6: Distributed Hash Tables and their Applications André Brinkmann

International Journal of Advanced Research in Computer Science and Software Engineering

IMPROVED PROXIMITY AWARE LOAD BALANCING FOR HETEROGENEOUS NODES

White Paper. Big Data and Hadoop. Abhishek S, Java COE. Cloud Computing Mobile DW-BI-Analytics Microsoft Oracle ERP Java SAP ERP

Cloud Based Adaptive Overlapped Data Chained Declustering

PEER-TO-PEER (P2P) systems have emerged as an appealing

Group Based Load Balancing Algorithm in Cloud Computing Virtualization

A Survey Of Various Load Balancing Algorithms In Cloud Computing

A Comparative Study of Load Balancing Algorithms in Cloud Computing

A SURVEY ON LOAD BALANCING ALGORITHMS FOR CLOUD COMPUTING

Transcription:

ENHANCING SECURE MULTI USER ACCESS IN CLOUD ENVIRONMENT BY LOAD BALANCING RTamilarasi #1, GKesavaraj *2 #1 Mphil, Research Scholar, Vivekananda Arts and Science College for women *2 Assistant professor,department of Computer Science And Application,Vivekananda Arts and Science College for women Abstract--Cloud computing is an important technology widely used by many of the large industries, small industries and more importantly effectively used over the internet However the uses and application of the cloud is improved improving continuously the secure and performance are the issues faced By providing on demand access to a shared pool of computing resources in a selfservice, dynamically scaled and metered manner, cloud computing offers compelling advantages in speed, agility and efficiency Cloud computing encompasses any subscription-based or pay-per-use service that, in real time over the Internet, extends IT's existing capabilities On the positive side, cloud computing has the potential to both increase end-user productivity and reduce infrastructure costs In the existing system, multi user access is not effectively obtained and in addition it leads to increase in the cost of the access by individual users To overcome these issues, this paper proposes a Multiple Virtual Machine (MVM) The proposed technique effectively response the multiple user by scheduling the available resource and provide an effective service to the user by using load balancing technique In addition it uses Meta data technique to protect the storage space effectively To ensure the secure data retrieval and access of the data from the cloud storage Third party security is implemented The experimental results show that the proposed technique obtains an efficient result when comparing to existing system Key words:cloud computing, MVM, load balancing, security and Meta data I INTRODUCTION During the last several decades, dramatic advances in computing power, storage, and networking technology have allowed the human race to generate, process, and share increasing amounts of information in dramatically new ways As new applications of computing technology are developed and introduced, these applications are often used in ways that their designers never envisioned New applications, in turn, lead to new demands for even more powerful computing infrastructure It is now possible to assemble very large, powerful systems consisting of many small, inexpensive commodity components because computers have become smaller and less expensive, disk drive capacity continues to increase, and networks have gotten faster Such systems tend to be much less costly than a single, faster machine with comparable capabilities Software challenges also arise in this environment because writing software that can take full advantage of the aggregate computing power of many machines is far more difficult than writing software for a single, faster machine Regardless of the exact definition used, numerous companies and research organizations are applying cloud-computing concepts to their business or research problems including Google, Amazon, Yahoo, and numerous universities Cloud computing encompasses any subscription-based or payper-use service that, in real time over the Internet, extends IT's existing capabilities On the positive side, cloud computing has the potential to both increase end-user productivity and reduce infrastructure costs The proposed system explains the MVM technique, this implemented in order to response effectively for multiusers simultaneously at a time The proposed system works by scheduling the available resources in the cloud and allocate it to the entire available user as well as for the new users To avoid the unavailability of resources to the new users load balancing is used to handle those kinds of situations To ensure the secure retrieval of data from the cloud the third party security is implemented which works by generating individual key to individual users for each and every time of access by the user from the cloud storage A Cloud computing is emerging as a new paradigm of large scale distributed computing It has moved computing and data away from desktop and portable PCs, into large data centre s [1] It provides the scalable IT resources such as applications and services, as well as the infrastructure on which they operate, over the Internet, on pay-per-use basis to adjust the capacity quickly and easily It helps to accommodate changes in demand and helps any organization in avoiding the capital costs of software and hardware [2] [3] Thus, Cloud Computing is a framework for enabling a suitable, on-demand network access to a shared pool of computing resources (eg networks, servers, storage, applications, and services)these resources can be provisioned and de-provisioned quickly with minimal management effort or service provider interaction This further helps in promoting availability [4] Due to the exponential growth of 8

cloud computing, it has been widely adopted by the industry and there is a rapid expansion in data centres Load balancing in computer networks is a technique used to spread workload across multiple network links of computers [2] It facilitates networks and resources by providing a maximum throughput with minimum time, thus it helps to improve performance by optimally using available resources and helps in minimizing latency and response time Load balancing is achieved by using multiple resources that is, multiple servers that are able to fulfill a request or by having multiple paths to a resource Load balancing helps to achieve a high user satisfaction and resource utilization When one or more components of any service fail, load balancing facilitates continuation of the service by implementing fair-over, that is, it helps in provisioning and deprovisioning of instances of applications without fail It also ensures that every computing resource is distributed efficiently and fairly [5] chunks Additionally, we aim to reduce network traffic (movement cost) reduce network traffic (or movement cost) caused by rebalancing the loads of nodes as much as possible to maximize the network bandwidth available to normal application Applications need information on both when and how to rebalance; The three load balancing steps are: 1 Evaluate the imbalance; 2 Decide how to balance if needed; 3 Redistribute work to correct the imbalance We address the first two requirements and derive complete information on how to perform the third; the application must be able to redistribute its work units as instructed by our framework (a requirement also imposed by partitioners Our load model couples abstract application information with scalable load measurements We derive actionable load metrics to evaluate the accuracy of the information Our load model evaluates the cost of correcting load imbalance with specific load balancing algorithms We use it to select the method that most efficiently balances a particular scenario We demonstrate this methodology on two large-scale production applications that simulate molecular dynamics and dislocation dynamics Overall, we make the following contributions Fig1 Cloud Computing Consumption of resources and conservation of energy is not always a prime focus of discussion in cloud computing However, resource consumption can be kept to minimum with proper load balancing which not only helps in reducing costs but making enterprise greener Scalability, one of the very important features of cloud computing, is also enabled by load balancing Hence, improving resource utility and the performance of a distributed system in such a way will reduce the energy consumption and carbon footprints to achieve Green computing [1] The objective and motivation of this survey is to provide a analytic survey of existing load balancing techniques in cloud computing In this paper, we are interested in studying the load rebalancing problem in distributed file systems specialized for large-scale, dynamic and data-intensive clouds (The terms rebalance and balance are interchangeable in this paper) Such a largescale cloud has hundreds or thousands of nodes (and may reach tens of thousands in the future) Our objective is to allocate the chunks of files as uniformly as possible among the nodes such that no node manages an excessive number of Load rebalances in the distributed file system carried out using the map reducing task in cloud which helps in arranging files in nodes of every chunk ie stores the files in related nodes of the chunks Objective of this project is to allocate the chunks of files as uniformly as possible among the nodes such that no node manages an excessive number of chunks and also to reduce network traffic and maximize the network bandwidth available to normal applications Using the distributed file system, arranging the file system in a cloud: that is the file chunks are no distributed uniformly as possible among the nodes because the load is put under workload that is linearly scaled with the system and to increase the performance of the transformation of the file This performance of the proposal is implemented to be used in the clustered environment II SYSTEM OVERVIEW The load rebalancing problem in distributed file systems specialized for large-scale, dynamic and dataintensive clouds Suggest offloading the load rebalancing task to storage nodes by having the storage nodes balance their loads spontaneously The storage nodes are structured as a network based on distributed hash tables (DHTs) discovering a file chunk can simply refer to rapid key lookup in DHTs, given that a unique handle is assigned to each file DHTs enable nodes to self-organize and -repair while constantly offering lookup functionality in node dynamism, simplifying the system provision and management Present a load rebalancing algorithm for distributing file chunks as uniformly 9

as possible and minimizing the movement cost as much as possible Nodes perform their load-balancing tasks independently without synchronization or global knowledge regarding the system This project not only takes advantage of physical network locality in the reallocation of file chunks to reduce the movement Cost but also exploits capable nodes to improve the overall system performance Algorithm reduces overhead introduced to the DHTs as much as possible Additionally, our loadbalancing algorithm exhibits a fast convergence rate The Architecture can be shown in figure 2 A Storage Node Creation In cloud server simultaneously create node, serve computing and storage functions; a file is partitioned into a number of chunks allocated in distinct nodes In this module, a cloud partitions the file into a large number of disjointed and fixed-size pieces (or file chunks) and assigns them to different cloud storage nodes (ie, chunk servers) Each storage node then calculates the frequency of each unique word by scanning and parsing its local file chunks User creates a storage node after successful register our account User Search File Fig3 Identify Storage Node Chunk Update DHT SN1 SN2 SNn Lookup DHT DHT Fig4 Distributed Hash Table Fig2 Storage Node Creation B Distributed Hash Table DHTs guarantee that if a node leaves, then its locally hosted chunks are reliably migrated to its successor; if a node joins, then it allocates the chunks whose IDs immediately precede the joining node from its successor to manage Our proposal heavily depends on the node arrival and departure operations to migrate file chunks among nodes Interested readers are referred to for the details of the self-management technique in DHTs While lookups take a modest delay by visiting n nodes in a typical DHT, the lookup latency can be reduced because discovering the l chunks of a file can be performed in parallel Proposal is independent of the DHT protocols To further reduce the lookup latency, can adopt state-of-the-art DHTs such as Amazon s Dynamo in that offer one-hop lookup delay Fig5 Overall Data Flow Diagram C Distributed Load Balancing A large-scale distributed file system is in a loadbalanced state if each chunk server hosts no more than A chunks In our proposed algorithm, each chunk server node i first estimate whether it is under loaded (light) or overloaded (heavy) without global knowledge A node is light if the 10

number of chunks it hosts is smaller than the threshold In enable nodes to self-organize and repair while constantly contrast, a heavy node manages the number of chunks In the offering lookup functionality in node dynamism, simplifying following discussion, if a node i departs and rejoins as a the system provision and management For each entity, it successor of another node j, then represent node i as node j+ 1, provides many web pages File chunks size can be taken as node j s original successor as node j + 2, the successor of node example j s original successor as node j + 3, and so on For each node I V, if node i is light, then it seeks a heavy node and takes B File Chunks over at most A chunks from the heavy node The distribution of chunks after performing the HDFS loads balancer File chunks =500 Data nodes =20 =500/20=250 C Map reducing Task The map reducing task is separated for all data s This result is shown in figure7all chunks are stored in goggle apps engine This paper using map reducing task evaualate the balance and redistribute the balancing nodes are solving Table 1 Comparison of file chunk size D File Distribution Fig6 Load Rebalancing Architecture Diagram A DHT node is an overlay on the application level The logical proximity abstraction derived from the DHT does not necessarily match the physical proximity information in reality That means a message traveling between two neighbors in a DHT overlay may travel a long physical distance through several physical network links In the loadbalancing algorithm, a light node i may rejoin as a successor of a remote heavy node j Then, the requested chunks migrated from j to i need to traverse several physical network links, thus generating considerable network traffic and consuming significant network resources (ie, the buffers in the switches on a Communication path for transmitting a file chunk from a source node to a destination node) and distribute the files for requesting users efficiently and effectively III PRELIMINARY RESULTS Some preliminary results on load rebalancing are presented In the following subsections contains DHT formulation, File chunks, and then map reducing task A DHT formulation Distributed hash table is given unique identity of each every file So files are stored in one hash table The hash table performed given results The storage nodes are structured as a network based on distributed hash tables (DHTs), DHTs File Chunks Size Data nodes 250 25 300 30 400 40 450 20 500 20 IV RESULTS The entire system is implemented in Net using eclipse Platform In computer programming, eclipse is an integrated development environment comprising a base work phase and an extensible plug in system for customizing the environment It is written mostly in java It can be used to develop application in java, and by means of various plug-ins other programming languages including Ada, C, C++, COBOL, FORTRAN, Haskell, JavaScript, lasso, Perl and ErlangIt can also be used to develop packages for the software mathematics Development environment includes the eclipse data development tools for java and scala Eclipse CDT for C/C++ and Eclipse PDT for PHP among others The initial codebase originated from IBM VisualAgeThe Eclipse software development tool is mean for java developers User can extend its abilities by installing plug-ins written for the Eclipse Platform Such as development toolkit for other programming languages, other programming languages, and can write contribute their own plug-in modules Java contains many JAR file JAR files help to extract the plain text from the web pages The result is shown in fig 7 11

branching factor is used by each chunk to construct its routing table To provide consistency with previous work, reconsider Chord as a tree-based routing DHT It is straightforward to show that Chord finger tables are constructed like tree-based routing tables REFERENCES Fig 7 Chunks Stored in Google AppsEngine V CONCLUSION A load-balancing algorithm to deal with the load rebalancing problem in large-scale, dynamic, and distributed file systems in clouds has been presented in this project Proposal strives to balance the loads of nodes and reduce the demanded movement cost as much as possible, while taking advantage of physical network locality and node heterogeneity In the absence of representative real workloads (ie, the distributions of file chunks in a large-scale storage system) in the public domain, To have investigated the performance of our proposal and compared it against competing algorithms through synthesized probabilistic distributions of file chunks The synthesis workloads stress test the load-balancing algorithms by creating a few storage nodes that are heavily loaded Proposal is comparable to the centralized algorithm in the Hadoop HDFS production system and dramatically outperforms the competing distributed algorithm in terms of load imbalance factor, movement cost, and algorithmic overhead Particularly, our load-balancing algorithm exhibits a fast convergence rate The efficiency and effectiveness of our design are further validated by analytical models and a real implementation with a small-scale cluster environment Consider a DHT with an ordered id space I with size N = j and a branching factor B such that log N is integral The [1] J Dean and S Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, Proc Sixth Symp Operating System Design and Implementation (OSDI 04), pp 137-150, Dec 2004 [2] AW M Smeulders, M Worring, S Santini, A Gupta, and R Jain, Content-based image retrieval at the end of the early years, IEEE Transaction Pattern Analysis Machine Intelligence, Antony Rowstron and Peter Druschel, Pastry: Scalable, Distributed Object Location and Routing for Large-scale Peer-to-Peer Systems, in Proc Middleware, 2001 [3] John Byers, Jeffrey Considine, and Michael Mitzenmacher, [4] Simple Load Balancing for Distributed Hash Tables, in Proc IPTPS, Feb 2003 [5] David Karger and Matthias Ruhl, New Algorithms for Load [6] Balancing in Peer-to-Peer Systems, Tech Rep MIT-LCS-TR- 911, MIT LCS, July 2003 [7] J Westbrook, Load balancing for response time, in EuropeanSymposium on Algorithms, 1995, pp 355 368 [8] Micah Adler, Eran Halperin, Richard M Karp,and Vijay V Vazirani A Stochastic Process on the Hypercube with Applications to Peer-to-Peer Networks In Proceedings STOC, pages 575 584, 2003 [9] Tanveer Ahmed, Yogendra Singh, Analytic study of load balancing techniques using tool cloud analyst [10] Zenon Chaczko, Venkatesh Mahadevan, Shahrzad Aslanzadeh and Christopher Mcdermid, Availabilty and load balancing in cloud computing, 2011 InternationalConference on Computer and Software Modeling, IPCSIT vol14 (2011) ACSIT Press, Singapore [11] Giuseppe Valetto, Paul Snyder, Daniel J Dubois, Elisabetta DiNitto and Nicolo M Calcavecchia, A self-organized load balancing algorithm for overlay based decentralized service networks [12] Nidhi Jain Kansal, Inderveer Chana, Cloud Load balancing techniques: A step towards green computing, IJCSI International Journal of Computer Science Issues, Vol 9, Issue 1, No 1, January 2012 [13] G DeCandia, D Hastorun, M Jampani, G Kakulapati, A Lakshman, A Pilchin, S Sivasubramanian, P Vosshall, and W Vogels, Dynamo: Amazon s Highly Available Key-value Store, in Proc 21st ACM Symp [14] Hadoop Distributed File System, Rebalancing Blocks, http://developeryahoocom/hadoop/tutorial/module2html#rebalan cing [15] HDFSFederation,http://hadoopapacheorg/common/docs/r0230/h adoop-yarn/hadoop-yarn-site/federationhtm [16] D Karger and M Ruhl, Simple Efficient Load Balancing Algorithms for Peer-to-Peer Systems, in Proc 16th ACM Symp Parallel Algorithms and Architectures (SPAA 04), June 2004, pp 36 43 12