Research Statement for Henri Casanova
|
|
- Della Bryan
- 8 years ago
- Views:
Transcription
1 Research Statement for Henri Casanova Advances in networking technology have made it possible to deploy distributed scientific applications on platforms that aggregate large numbers of diverse and distant resources. This distributed computing vision, recently popularized as grid computing, holds the promise of application executions at unprecedented scale, capacity, and performance. My research interests spans several theoretical and practical aspects of parallel and distributed computing, with an emphasis on computing on large-scale platforms. Large-scale distributed computing platforms have been in production for various scientific applications and with various modes of operation, going from high-end systems like the TeraGrid to volunteer computing systems like While sound engineering has enabled these various flavors of grid computing, many challenges are still to be addressed for grid computing to become widely available to a large range of applications and users, affording them both efficiency and ease-of-use. My recent research has focused on three fundamental such challenges in this context: (i) designing efficient and practical application scheduling and resource allocation strategies; (ii) measuring, understanding, modeling, and simulating platforms and applications; (iii) developing software methodologies and tools that make application deployment both straightforward and efficient. In my Ph.D. thesis I explored issues of performance prediction and scheduling. I developed stochastic models for capturing application behavior in scenarios that exhibit uncertainty in both data transfer times and computation times [1], which is common in large-scale platforms. During my Ph.D. I also developed NetSolve [2, 3], a software infrastructure for deploying applications on what was to become known as grid platforms. This work led me to co-author a chapter of the first Grid book [4]. I joined UCSD in 1999 and in 2001 I created the Grid Research And Innovation Laboratory (GRAIL), which has counted around 10 members, mostly graduate students and postdoctoral researchers. During the last four years at UCSD we have been able to explore the three aforementioned challenges as part of several NSF-funded projects, some of which are highlighted in what follows. Scheduling and Deploying Large-Scale Parameter Sweep Applications This work started upon my arrival at UCSD. At the time, grid computing was in its infancy and a critical goal was to enable the first generation of applications. In this view, I first focused on the simple yet popular parameter sweep application model: large numbers of independent computations that can be performed in a simple master-worker fashion. This seemingly straightforward model poses logistics of deployment challenges, which I had partially addressed with my work on NetSolve. I proceeded to extend this work to handle data locality, which is often a critical issue for application performance, in addition to the load balancing issue. I first developed novel scheduling heuristics for deploying parameter sweep applications that have complex data-sharing and data-locality requirements onto systems that consists of multiple compute sites, where each site contains one or more clusters with shared data storage. One of these heuristics, XSufferage, was compared in simulation to heuristics from the literature and showed significant performance improvements [5]. In addition, I explored how this heuristic could be made adaptive, periodically refining its 1
2 scheduling decisions to adapt to fluctuations in delivered platform performance. Simulation as well as real-world experiments showed that, thanks to adaptation, XSufferage indeed outperforms competing algorithms even in the presence of performance fluctuations [6]. This work has led to a software tool, APST [7, 8], which utilizes existing services to deploy applications and automatically schedules them with XSufferage. This software is currently in production for several scientific applications. Divisible Load Scheduling The above work was in the context of applications that have non-trivial data sharing and locality requirements. Equally important, and yet simpler, applications are ones that consist of a large number of independent, roughly identical tasks that all use distinct input, thus posing no data locality concerns. A recent theoretical framework has been proposed that approximates such low-granularity applications as a load that is continuously divisible. The Divisible Load Scheduling (DLS) problem consists in orchestrating communications and computations in a way that minimizes application execution time, with a key concern for pipelining of communication and computation (in addition to the usual load balancing issue). Effective pipelining can be achieved with scheduling algorithms that use multiple rounds. Together with a Ph.D. student, we first attacked the multi-round DLS problem from the theoretical perspective and have made four clear contribution over the state of the art. First, we have improved on the only previously proposed multi-round algorithm by extending it to account for network latencies. Second, we have developed the first multi-round algorithm that is applicable to heterogeneous platforms. Third, this algorithm automatically computes an optimal number of rounds, which was not done in previous approaches. Fourth, we have extended our approach to tolerate uncertainties in data transfer and computation times. This work resulted in a number of publications [9, 10, 11, 12], and provided the theoretical foundations for practical DLS in distributed and heterogeneous platforms. We then implemented our algorithms as part of the aforementioned APST software, which we extended to support divisible load. We have validated this practical implementation for several applications (e.g., MPEG-4 encoding) on a wide-area testbed [13]. Desktop Grids The previous two research projects target platforms that, while subject to some fluctuations in the performance delivered by resources, are relatively stable in terms of resource availability (i.e., infrequent downtimes). At the other extreme are so-called Desktop Grid platforms that aggregate the idle cycles of large numbers of individually owned desktop PCs. These cost-effective platforms have been popularized by projects such as SETI@home, and several software infrastructures are available today. The main challenge here is the volatility of the resources, which can be reclaimed by their owners without notice. Most applications successfully deployed on these platforms to date consist of large numbers of independent tasks and the performance metric is the task completion rate over long periods of time, which is ideally suited to volatile resources. In this project we explored the feasibility of running applications that consist of moderate numbers of tasks (e.g., comparable to the number of available desktop resources) with the objective of minimizing application execution time. Enabling such applications on desktop grids will dramatically increase their utility to a wider range of users, but requires techniques for intelligent resource selection 2
3 and computation redundancy, so as to mask volatility. First, we conducted measurements of availability on a real desktop grid to (i) analyze and understand the temporal structure and statistical properties of resource availability and (ii) obtain trace data that can be used for simulation. We measured host and CPU availability of a deployment of the Entropia desktop grid software at SDSC, obtaining the first high-quality measurement dataset of the effective power delivered by an enterprise desktop grid [14]. With this trace data as a basis for simulation we then studied several resource selection strategies that perform resource ranking, resource exclusion, and task replication. We designed these strategies guided by our analysis of our availability measurements (e.g., our dataset shows that CPU clock rate is a reasonable predictor of host performance in a typical desktop grid in spite of host volatility). Overall, we found that a heuristic that performs intelligent resource exclusion, based on clock rates, and does moderate task replication, based on time-outs, achieves by far the best performance, within a factor 1.7 of optimal in practice [15]. Measurement, Modeling, and Simulation A constant theme in my research is the need to understand large-scale, distributed computing platforms, which requires measurement data obtained on real platforms. While measurement methodologies and datasets are common place in areas such as network research, they are scarce in the grid computing area. One contribution of my work is the aforementioned measurement dataset collected for a desktop grid [14]. Another is the development a benchmark probes that exercise basic grid computing functionality [16, 17], which we have integrated as part of a grid monitoring system and have used to collect periodic measurements on a production platform [18]. Measurement datasets can be analyze to extract fundamental properties of the underlying platform, which I have used to reason about application deployment and performance [19]. Further, a collection of such datasets combined with technology surveys, can be used to develop generators of realistic synthetic platform configurations, which are fundamental for enabling research (ie.g., as seen in the internetworking area). We have developed a generator for grid platforms in [20]. With realistic platform models it then becomes possible to instantiate realistic simulations, which are necessary for conducting controlled and repeatable experiments. I have developed the SimGrid [21] simulator, which has gained popularity in the scheduling community, has been used by over 30 other researchers, most of whom have published their results, and whose second version has recently been released [22]. Our work on platform measurement, on generation of realistic synthetic platforms, and on the development of a simulation framework provides the necessary foundations for allowing the scientific study of large-scale computing platforms and applications. Future Directions I hereafter describe three broad research directions that I would like to follow in the upcoming years. Note that my work on platform measurement, characterization, modeling, and simulation will support this future research. Workflow Applications My recent work in the area of scheduling research and of development has been for applications that consist of independent tasks. Another class of applications that has gained increased popularity in the last few years is that of scientific workflows, which correspond to multiple components logically interconnected with 3
4 dependencies, branching, and iterations. While our APST software supports execution of workflows, and is used by several workflow applications, we have not conducted the necessary scheduling research to achieve high performance. In spite of the large number of efforts focusing on software support for workflows, the area of workflow scheduling on large-scale platforms remains relatively unexplored. A few results in the scheduling literature can be used as a basis for workflow scheduling (e.g., DAG scheduling algorithms), and I will focus on the many additional research advances needed to apply such results in practice. Job Scheduling In application scheduling, the goal is to optimize the performance metric of one application. By contrast, job scheduling consists in optimizing some aggregate metric across applications belonging to different users. Job scheduling is traditionally done on single systems by batch schedulers, which are sophisticated but often optimize metrics that are not user-centric (e.g., resource utilization). As we build large-scale, shared systems, there is an opportunity for taking a fresh look at job scheduling. I see three particularly interesting challenges. First is the definition of an aggregate, user-centric performance metrics that quantify both performance and fairness, and that can be optimized in a tractable way. We have recently made a preliminary contribution to answer this question by proposing a metric that can be optimized for the restricted case in which all applications are divisible loads [23]. Second is the question of resource sharing in in the presence of dynamic resource availability, as expected on any large-scale system. This question is largely unexplored for grid computing and I plan to use our desktop grid research on application scheduling as a basis for job scheduling strategies. Third, is the issue of decentralized job scheduling for better scalability and resilience. Several researchers have proposed just coupling multiple legacy batch-schedulers, but I believe that there is a great opportunity for a more forwardlooking approach that develops more fundamental systems principles. Very Large-Scale Computing The grid community has yet to address the challenges posed by very large-scale platforms. Virtual Organizations with tens of thousands of individual resources no longer seem in the distant future, and even larger systems are emerging with the advent of mobile technology and sensor networks. The traditional scheduling approach that consists in examining the whole universe of resources, although feasible today on most platforms, will not scale. Consequently, new ways for applications to express their resource requirements as well as new techniques for scoping the resource universe in a view to conducting approximate but scalable resource selection are needed. Together with several collaborators we have recently initiated a 5-year project, funded by NSF/ITR, that explores these issues, and I am particularly focusing on resource selection and scheduling. The first step is to design a simple resource requirement description language that is amenable to efficient searches in large-scale environments. An interesting question is then to identify the best design point between scalability and quality of the resource selection. While such issues have been partially explored in the peer-to-peer community for file sharing applications, many new challenges arise for supporting broader applications that have a computational component (e.g., complex application structure, data/computation locality, time sequencing, stability and predictability of performance). This work will build on but also challenge the current state-of-the-art of grid resource discovery and selection in a view to enable truly large-scale platforms in the 5-10 year range. 4
5 References [1] H. Casanova, M. Thomason, and J. Dongarra. Stochastic Performance Prediction for Iterative Algorithms in Distributed Environments. Journal of Parallel and Distributed Computing, 58(1):68 91, [2] H. Casanova and J. Dongarra. NetSolve: A Network-Enabled Server for Solving Computational Science Problems. In Proceedings of Supercomputing 1996 (SC 96), Nov [3] H. Casanova and J. Dongarra. Using Agent-Based Software for Scientific Computing in the NetSolve System. Parallel Computing, 24: , [4] J. Dongarra, H. Casanova, C. Johnson, and M. Miller. Application-Specific Tools. In I. Foster and C. Kesselman, editors, Computational Grids: Blueprint for a New Computing Infrastructure. M. Kaufmann Publishers, Inc., [5] H. Casanova, A. Legrand, D. Zagorodnov, and F. Berman. Heuristics for Scheduling Parameter Sweep Applications in Grid environments. In Proceedings of the 9th Heterogeneous Computing Workshop (HCW 00), pages , May [6] H. Casanova, G. Obertelli, F. Berman, and R. Wolski. The AppLeS Parameter Sweep Template: User-Level Middleware for the Grid. Scientific Programming, 8(3): , Extended version of a paper in Proceedings of Super Computing 2000 (SC 00). [7] H. Bal, H. Casanova, J. Dongarra, and S. Matsuoka. Application-Level Tools. In I. Foster and C. Kesselman, editors, Grid 2: Blueprint for a New Computing Infrastructure. M. Kaufmann Publishers, Inc., 2nd edition, [8] H. Casanova and F. Berman. Parameter Sweeps on the Grid with APST. In F. Berman, G. Fox, and T. Hey, editors, Grid Computing: Making the Global Infrastructure a Reality. Wiley Publishers, Inc., [9] Y. Yang and H. Casanova. UMR: a Multi-Round Algorithm for Scheduling Divisible Workloads. In Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS 2003), April [10] O. Beaumont, H. Casanova, A. Legrand, Y. Robert, and Y. Yang. Scheduling Divisible Loads on Star and Tree Networks: Results and Open Problems. IEEE Transactions on Parallel and Distributed Systems (TPDS), to appear. [11] Y. Yang and H. Casanova. Multi-Round algorithms for Scheduling Divisible Workloads. IEEE Transactions on Parallel and Distributed Systems (TPDS), to appear. [12] Y. Yang and H. Casanova. RUMR: Robust Scheduling for Divisible Workloads. In Proceedings of the 12th IEEE Symposium on High-Performance Distributed Computing (HPDC-12), pages , June
6 [13] K. van der Raadt, Y. Yang, and H. Casanova. Practical Divisible Load Scheduling on Grid Platforms with APST-DV. submitted to the International Parallel and Distributed Processing Symposium (IPDPS 05), [14] D. Kondo, M. Taufer, C. L. Brooks, H. Casanova, and A. Chien. Characterizing and Evaluating Desktop Grids: An Empirical Study. In Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS 04), April [15] D. Kondo, A. Chien, and H. Casanova. Resource Management for Short-Lived Applications on Enterprise Desktop Grids. In Proceedings of SC 04, November [16] A. Snavely, G. Chun, H. Casanova, R. Van der Wijngaart, and M. Frumkin. Benchmarks for Grid Computing: A Review of Ongoing Efforts and Future Directions. SIG- METRICS Performance Evaluation Review, 30(4):27 32, [17] G. Chun, H. Dail, H. Casanova, and A. Snavely. Benchmark Probes for Grid Assessment. In Proceedings of the High-Performance Grid Computing Workshop, April [18] S. Smallen, M. Murray, C. Mills Olschanonowsky, A. Snavely, and H. Casanova. Benchmarking and Measuring Grid Platforms: Software Tools and Results on the TeraGrid. Poster at SC 04, [19] H. Casanova. Network Modeling Issues for Grid Application Scheduling. International Journal of Foundations of Computer Science (IJFCS), to appear. [20] Y.-S. Kee, H. Casanova, and A. Chien. Realistic Modeling and Synthesis of Resources for Computational Grids. In Proceedings of SC 04, November [21] H. Casanova. SimGrid: A Toolkit for the Simulation of Application Scheduling. In Proceedings of the 1st IEEE International Symposium on Cluster Computing and the Grid (CCGrid 01), pages , May [22] A. Legrand, L. Marchal, and H. Casanova. Scheduling Distributed Applications: The SimGrid Simulation Framework. In Proceedings of the Third IEEE International Symposium on Cluster Computing and the Grid (CCGrid 03), May [23] L. Marchal, Y. Yang, H. Casanova, and Y. Robert. A Realistic Network/Application Model for Scheduling Divisible Loads on Large-Scale Platforms. submitted to the International Parallel and Distributed Processing Symposium (IPDPS 05),
An Evaluation of Economy-based Resource Trading and Scheduling on Computational Power Grids for Parameter Sweep Applications
An Evaluation of Economy-based Resource Trading and Scheduling on Computational Power Grids for Parameter Sweep Applications Rajkumar Buyya, Jonathan Giddy, and David Abramson School of Computer Science
More informationTechniques to Produce Good Web Service Compositions in The Semantic Grid
Techniques to Produce Good Web Service Compositions in The Semantic Grid Eduardo Blanco Universidad Simón Bolívar, Departamento de Computación y Tecnología de la Información, Apartado 89000, Caracas 1080-A,
More informationThe Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland
The Lattice Project: A Multi-Model Grid Computing System Center for Bioinformatics and Computational Biology University of Maryland Parallel Computing PARALLEL COMPUTING a form of computation in which
More informationDistributed Systems and Recent Innovations: Challenges and Benefits
Distributed Systems and Recent Innovations: Challenges and Benefits 1. Introduction Krishna Nadiminti, Marcos Dias de Assunção, and Rajkumar Buyya Grid Computing and Distributed Systems Laboratory Department
More informationAn Evaluation of Job Scheduling Strategies for Divisible Loads on Grid Platforms
An Evaluation of Job Scheduling Strategies for Divisible Loads on Grid Platforms Yudith Cardinale Dpto. de Computación y Tecnología de la Información Universidad Simón Bolívar Apartado 89, Caracas 8-A,
More informationA General Distributed Scalable Peer to Peer Scheduler for Mixed Tasks in Grids
A General Distributed Scalable Peer to Peer Scheduler for Mixed Tasks in Grids Cong Liu, Sanjeev Baskiyar *, and Shuang Li Dept. of Computer Science and Software Engineering Auburn University, Auburn,
More informationUse of Agent-Based Service Discovery for Resource Management in Metacomputing Environment
In Proceedings of 7 th International Euro-Par Conference, Manchester, UK, Lecture Notes in Computer Science 2150, Springer Verlag, August 2001, pp. 882-886. Use of Agent-Based Service Discovery for Resource
More informationMULTIDIMENSIONAL QOS ORIENTED TASK SCHEDULING IN GRID ENVIRONMENTS
MULTIDIMENSIONAL QOS ORIENTED TASK SCHEDULING IN GRID ENVIRONMENTS Amit Agarwal and Padam Kumar Department of Electronics & Computer Engineering, Indian Institute of Technology Roorkee, Roorkee, India
More informationGraduated Student: José O. Nogueras Colón Adviser: Yahya M. Masalmah, Ph.D.
Graduated Student: José O. Nogueras Colón Adviser: Yahya M. Masalmah, Ph.D. Introduction Problem Statement Objectives Hyperspectral Imagery Background Grid Computing Desktop Grids DG Advantages Green Desktop
More informationClassic Grid Architecture
Peer-to to-peer Grids Classic Grid Architecture Resources Database Database Netsolve Collaboration Composition Content Access Computing Security Middle Tier Brokers Service Providers Middle Tier becomes
More informationGrid Computing. Making the Global Infrastructure a Reality. Wiley Series on Communications Networking & Distributed Systems
Brochure More information from http://www.researchandmarkets.com/reports/2172168/ Grid Computing. Making the Global Infrastructure a Reality. Wiley Series on Communications Networking & Distributed Systems
More informationGrid Computing Vs. Cloud Computing
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 577-582 International Research Publications House http://www. irphouse.com /ijict.htm Grid
More informationQoS Guided Min-Min Heuristic for Grid Task Scheduling *
QoS Guided Min-Min Heuristic for Grid Task Scheduling * HE Xiaoshan ( 何 晓 珊 ) 1 Xian-He Sun ( 孙 贤 和 ) 1 Gregor von Laszewski 2 1 Department of Computer Science, Illinois Institute of Technology, IL, USA
More informationExploring Inter-Cloud Load Balancing by Utilizing Historical Service Submission Records
72 International Journal of Distributed Systems and Technologies, 3(3), 72-81, July-September 2012 Exploring Inter-Cloud Load Balancing by Utilizing Historical Service Submission Records Stelios Sotiriadis,
More informationDecentralized Load Balancing for Heterogeneous Grids
Decentralized Load Balancing for Heterogeneous Grids Issam Al-Azzoni and Douglas G. Down Department of Computing and Software McMaster University Hamilton, Ontario, Canada alazzoi@mcmaster.ca, downd@mcmaster.ca
More informationKeywords: Cloudsim, MIPS, Gridlet, Virtual machine, Data center, Simulation, SaaS, PaaS, IaaS, VM. Introduction
Vol. 3 Issue 1, January-2014, pp: (1-5), Impact Factor: 1.252, Available online at: www.erpublications.com Performance evaluation of cloud application with constant data center configuration and variable
More informationRisk and Security Assessment. Zbigniew Kalbarczyk
Risk and Security Assessment Zbigniew Kalbarczyk 1 TCIPG Cluster Arrangement Communication and Data Delivery for Wide-Area Monitoring and Control Trustworthy cyber infrastructure and technologies for wide-area
More informationAn Empirical Study and Analysis of the Dynamic Load Balancing Techniques Used in Parallel Computing Systems
An Empirical Study and Analysis of the Dynamic Load Balancing Techniques Used in Parallel Computing Systems Ardhendu Mandal and Subhas Chandra Pal Department of Computer Science and Application, University
More informationPerformance Prediction, Sizing and Capacity Planning for Distributed E-Commerce Applications
Performance Prediction, Sizing and Capacity Planning for Distributed E-Commerce Applications by Samuel D. Kounev (skounev@ito.tu-darmstadt.de) Information Technology Transfer Office Abstract Modern e-commerce
More informationReSIST NoE Resilience for Survivability in IST. Introduction
ReSIST NoE Resilience for Survivability in IST Modelling and Evaluation of Largeness in Evolving Systems Andrea Bondavalli University of Firenze (here PISA) Introduction Systems complexity has always been
More informationAn Economy Driven Resource Management Architecture for Global Computational Power Grids
An Economy Driven Resource Management Architecture for Global Computational Power Grids Rajkumar Buyya, David Abramson, and Jonathan Giddy School of Computer Science and Software Engineering Monash University
More informationMethods for Assessing Vulnerability of Critical Infrastructure
March 2010 Methods for Assessing Vulnerability of Critical Infrastructure Project Leads Eric Solano, PhD, PE, RTI International Statement of Problem Several events in the recent past, including the attacks
More informationStudy and Comparison of CloudSim Simulators in the Cloud Computing
Study and Comparison of CloudSim Simulators in the Cloud Computing Dr. Rahul Malhotra* & Prince Jain** *Director-Principal, Adesh Institute of Technology, Ghauran, Mohali, Punjab, INDIA. E-Mail: blessurahul@gmail.com
More informationDeveloping a Computer Based Grid infrastructure
Computational Grids: Current Trends in Performance-oriented Distributed Computing Rich Wolski Computer Science Department University of California, Santa Barbara Introduction While the rapid evolution
More informationMassive Cloud Auditing using Data Mining on Hadoop
Massive Cloud Auditing using Data Mining on Hadoop Prof. Sachin Shetty CyberBAT Team, AFRL/RIGD AFRL VFRP Tennessee State University Outline Massive Cloud Auditing Traffic Characterization Distributed
More informationCloudAnalyst: A CloudSim-based Visual Modeller for Analysing Cloud Computing Environments and Applications
CloudAnalyst: A CloudSim-based Visual Modeller for Analysing Cloud Computing Environments and Applications Bhathiya Wickremasinghe 1, Rodrigo N. Calheiros 2, and Rajkumar Buyya 1 1 The Cloud Computing
More informationSan Diego Supercomputer Center, UCSD. Institute for Digital Research and Education, UCLA
Facilitate Parallel Computation Using Kepler Workflow System on Virtual Resources Jianwu Wang 1, Prakashan Korambath 2, Ilkay Altintas 1 1 San Diego Supercomputer Center, UCSD 2 Institute for Digital Research
More informationHeterogeneous Workload Consolidation for Efficient Management of Data Centers in Cloud Computing
Heterogeneous Workload Consolidation for Efficient Management of Data Centers in Cloud Computing Deep Mann ME (Software Engineering) Computer Science and Engineering Department Thapar University Patiala-147004
More informationANALYSIS OF WORKFLOW SCHEDULING PROCESS USING ENHANCED SUPERIOR ELEMENT MULTITUDE OPTIMIZATION IN CLOUD
ANALYSIS OF WORKFLOW SCHEDULING PROCESS USING ENHANCED SUPERIOR ELEMENT MULTITUDE OPTIMIZATION IN CLOUD Mrs. D.PONNISELVI, M.Sc., M.Phil., 1 E.SEETHA, 2 ASSISTANT PROFESSOR, M.PHIL FULL-TIME RESEARCH SCHOLAR,
More informationSource Code Transformations Strategies to Load-balance Grid Applications
Source Code Transformations Strategies to Load-balance Grid Applications Romaric David, Stéphane Genaud, Arnaud Giersch, Benjamin Schwarz, and Éric Violard LSIIT-ICPS, Université Louis Pasteur, Bd S. Brant,
More informationAdaptive Processor Allocation for Moldable Jobs in Computational Grid
10 International Journal of Grid and High Performance Computing, 1(1), 10-21, January-March 2009 Adaptive Processor Allocation for Moldable Jobs in Computational Grid Kuo-Chan Huang, National Taichung
More informationWeb Service Based Data Management for Grid Applications
Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications
More informationWeb Email DNS Peer-to-peer systems (file sharing, CDNs, cycle sharing)
1 1 Distributed Systems What are distributed systems? How would you characterize them? Components of the system are located at networked computers Cooperate to provide some service No shared memory Communication
More informationCharles Koelbel Rice University chk@cs.rice.edu http://vgrads.rice.edu/
Virtual Grid Application Development Software (VGrADS), Electron Microscopy Analysis (EMAN), and Other Acronyms Charles Koelbel Rice University chk@cs.rice.edu http://vgrads.rice.edu/ Virtual Grid Application
More informationEverything you need to know about flash storage performance
Everything you need to know about flash storage performance The unique characteristics of flash make performance validation testing immensely challenging and critically important; follow these best practices
More informationClustering and scheduling maintenance tasks over time
Clustering and scheduling maintenance tasks over time Per Kreuger 2008-04-29 SICS Technical Report T2008:09 Abstract We report results on a maintenance scheduling problem. The problem consists of allocating
More informationEnterprise Desktop Grids
Enterprise Desktop Grids Evgeny Ivashko Institute of Applied Mathematical Research, Karelian Research Centre of Russian Academy of Sciences, Petrozavodsk, Russia, ivashko@krc.karelia.ru WWW home page:
More informationAn Opportunistic Algorithm for Scheduling Workflows on Grids
An Opportunistic Algorithm for Scheduling Workflows on Grids Luiz Meyer 1, Doug Scheftner 2, Jens Voeckler 2, Marta Mattoso 1, Mike Wilde 3, Ian Foster 2, 3 1 Federal University of Rio de Janeiro - COPPE,
More informationCollaborative & Integrated Network & Systems Management: Management Using Grid Technologies
2011 International Conference on Computer Communication and Management Proc.of CSIT vol.5 (2011) (2011) IACSIT Press, Singapore Collaborative & Integrated Network & Systems Management: Management Using
More informationA SIMULATOR FOR LOAD BALANCING ANALYSIS IN DISTRIBUTED SYSTEMS
Mihai Horia Zaharia, Florin Leon, Dan Galea (3) A Simulator for Load Balancing Analysis in Distributed Systems in A. Valachi, D. Galea, A. M. Florea, M. Craus (eds.) - Tehnologii informationale, Editura
More informationXMPP A Perfect Protocol for the New Era of Volunteer Cloud Computing
International Journal of Computational Engineering Research Vol, 03 Issue, 10 XMPP A Perfect Protocol for the New Era of Volunteer Cloud Computing Kamlesh Lakhwani 1, Ruchika Saini 1 1 (Dept. of Computer
More informationData-Aware Service Choreographies through Transparent Data Exchange
Institute of Architecture of Application Systems Data-Aware Service Choreographies through Transparent Data Exchange Michael Hahn, Dimka Karastoyanova, and Frank Leymann Institute of Architecture of Application
More informationCharacterizing Task Usage Shapes in Google s Compute Clusters
Characterizing Task Usage Shapes in Google s Compute Clusters Qi Zhang University of Waterloo qzhang@uwaterloo.ca Joseph L. Hellerstein Google Inc. jlh@google.com Raouf Boutaba University of Waterloo rboutaba@uwaterloo.ca
More informationThe Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and
More informationSystem Models for Distributed and Cloud Computing
System Models for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Classification of Distributed Computing Systems
More information3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India
3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India Call for Papers Cloud computing has emerged as a de facto computing
More informationCHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful
More informationUnderstanding the Benefits of IBM SPSS Statistics Server
IBM SPSS Statistics Server Understanding the Benefits of IBM SPSS Statistics Server Contents: 1 Introduction 2 Performance 101: Understanding the drivers of better performance 3 Why performance is faster
More informationPerformance Workload Design
Performance Workload Design The goal of this paper is to show the basic principles involved in designing a workload for performance and scalability testing. We will understand how to achieve these principles
More informationPreview of Award 1320357 Annual Project Report Cover Accomplishments Products Participants/Organizations Impacts Changes/Problems
Preview of Award 1320357 Annual Project Report Cover Accomplishments Products Participants/Organizations Impacts Changes/Problems Cover Federal Agency and Organization Element to Which Report is Submitted:
More informationPowerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches
Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches Introduction For companies that want to quickly gain insights into or opportunities from big data - the dramatic volume growth in corporate
More informationBibliography. University of Applied Sciences Fulda, Prof. Dr. S. Groß
Slide III Bibliography 1) Abbas, A.: Grid Computing - A Practical Guide to Technology and Applications. Charles River Media, 2004. http://www.charlesriver.com/titles/gridcomputing.html 2) Berman, F.; et
More information1.1 Difficulty in Fault Localization in Large-Scale Computing Systems
Chapter 1 Introduction System failures have been one of the biggest obstacles in operating today s largescale computing systems. Fault localization, i.e., identifying direct or indirect causes of failures,
More informationHadoop in the Hybrid Cloud
Presented by Hortonworks and Microsoft Introduction An increasing number of enterprises are either currently using or are planning to use cloud deployment models to expand their IT infrastructure. Big
More informationLoad Balancing on a Non-dedicated Heterogeneous Network of Workstations
Load Balancing on a Non-dedicated Heterogeneous Network of Workstations Dr. Maurice Eggen Nathan Franklin Department of Computer Science Trinity University San Antonio, Texas 78212 Dr. Roger Eggen Department
More informationA Data Locality Aware Online Scheduling Approach for I/O-Intensive Jobs with File Sharing
A Data Locality Aware Online Scheduling Approach for I/O-Intensive Jobs with File Sharing Gaurav Khanna 1, Umit Catalyurek 2, Tahsin Kurc 2, P. Sadayappan 1, Joel Saltz 2 1 Dept. of Computer Science and
More informationEmergency Alert System using Android Text Message Service ABSTRACT:
Emergency Alert System using Android Text Message Service ABSTRACT: Cellular text messaging services are increasingly being relied upon to disseminate critical information during emergencies. Accordingly,
More informationDATA MANAGEMENT, CODE DEPLOYMENT, AND SCIENTIFIC VISUALLIZATION TO ENHANCE SCIENTIFIC DISCOVERY IN FUSION RESEARCH THROUGH ADVANCED COMPUTING
DATA MANAGEMENT, CODE DEPLOYMENT, AND SCIENTIFIC VISUALLIZATION TO ENHANCE SCIENTIFIC DISCOVERY IN FUSION RESEARCH THROUGH ADVANCED COMPUTING D.P. Schissel, 1 A. Finkelstein, 2 I.T. Foster, 3 T.W. Fredian,
More informationSolution White Paper Connect Hadoop to the Enterprise
Solution White Paper Connect Hadoop to the Enterprise Streamline workflow automation with BMC Control-M Application Integrator Table of Contents 1 EXECUTIVE SUMMARY 2 INTRODUCTION THE UNDERLYING CONCEPT
More informationPART II THE ROAD AHEAD: IMPLEMENTING KNOWLEDGE MANAGEMENT
PART II THE ROAD AHEAD: IMPLEMENTING KNOWLEDGE MANAGEMENT CHAPTER 4 THE 10-STEP KNOWLEDGE MANAGEMENT ROAD MAP IN THIS CHAPTER Understand the 10-step KM road map and how it applies to your company. Understand
More informationmultiple geographically apart resource aggregation collaboration between remote working groups and developing teams
47 GRID architecture 2.3 Properties of the Grid What is GRID useful for? Resource sharing simplified access to remote resources multiple geographically apart resource aggregation access to remote databases
More informationAn Enhanced Application Model for Scheduling in Grid Environments
An Enhanced Application Model for Scheduling in Grid Environments Christoph Ruffner, Pedro José Marrón, Kurt Rothermel University of Stuttgart Institute of Parallel and Distributed Systems (IPVS) Universitaetsstr.
More informationThe Probabilistic Model of Cloud Computing
A probabilistic multi-tenant model for virtual machine mapping in cloud systems Zhuoyao Wang, Majeed M. Hayat, Nasir Ghani, and Khaled B. Shaban Department of Electrical and Computer Engineering, University
More informationThe Virtual Grid Application Development Software (VGrADS) Project
The Virtual Grid Application Development Software (VGrADS) Project VGrADS: Enabling e-science Workflows on Grids and Clouds with Fault Tolerance http://vgrads.rice.edu/ VGrADS Goal: Distributed Problem
More informationConcept and Project Objectives
3.1 Publishable summary Concept and Project Objectives Proactive and dynamic QoS management, network intrusion detection and early detection of network congestion problems among other applications in the
More informationA Cloud Data Center Optimization Approach Using Dynamic Data Interchanges
A Cloud Data Center Optimization Approach Using Dynamic Data Interchanges Efstratios Rappos Institute for Information and Communication Technologies, Haute Ecole d Ingénierie et de Geston du Canton de
More informationGridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources
GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources Jack Dongarra University of Tennessee and Oak Ridge National Laboratory 2/25/2006 1 Overview Grid/NetSolve
More information1 st Symposium on Colossal Data and Networking (CDAN-2016) March 18-19, 2016 Medicaps Group of Institutions, Indore, India
1 st Symposium on Colossal Data and Networking (CDAN-2016) March 18-19, 2016 Medicaps Group of Institutions, Indore, India Call for Papers Colossal Data Analysis and Networking has emerged as a de facto
More informationAn Grid Service Module for Natural Resource Managers
An Grid Service Module for Natural Resource Managers Dali Wang 1, Eric Carr 1, Mark Palmer 1, Michael W. Berry 2 and Louis J. Gross 1 1 The Institute for Environmental Modeling 569 Dabney Hall, University
More informationGrid Computing Making the Global Infrastructure a Reality Teena Vyas March 11, 2004
Chapter 32 - Grid Resource Allocation and Control using computational economies Grid Computing Making the Global Infrastructure a Reality Teena Vyas March 11, 2004 Introduction Basic strategies used for
More informationDynamic Load Balancing for a Grid Application
Dynamic Load Balancing for a Grid Application Menno Dobber, Ger Koole, and Rob van der Mei Vrije Universiteit, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands {amdobber, koole, mei}@few.vu.nl http://www.cs.vu.nl/~amdobber
More informationENOVIA V6 Architecture Performance Capability Scalability
ENOVIA V6 Technical Advantages Whitepaper ENOVIA V6 Architecture Performance Capability Scalability a Product Lifecycle Management Whitepaper Prepared by ENOVIA, a Dassault Systèmes Brand Executive Summary
More informationThe Big Data methodology in computer vision systems
The Big Data methodology in computer vision systems Popov S.B. Samara State Aerospace University, Image Processing Systems Institute, Russian Academy of Sciences Abstract. I consider the advantages of
More informationPerformance Analysis of Cloud-Based Applications
Performance Analysis of Cloud-Based Applications Peter Budai and Balazs Goldschmidt Budapest University of Technology and Economics, Department of Control Engineering and Informatics, Budapest, Hungary
More informationElectric Energy Systems
Electric Energy Systems Electric Energy Systems seeks to explore methods at the frontier of understanding of the future electric power and energy systems worldwide. The track will focus on the electric
More informationGame Theory Based Load Balanced Job Allocation in Distributed Systems
in Distributed Systems Anthony T. Chronopoulos Department of Computer Science University of Texas at San Antonio San Antonio, TX, USA atc@cs.utsa.edu Load balancing: problem formulation Load balancing
More informationGridsofagentsforcomputer and telecommunication network management
CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. 2003; 00:1 12 [Version: 2002/09/19 v2.02] Gridsofagentsforcomputer and telecommunication network management M.
More informationHadoop Cluster Applications
Hadoop Overview Data analytics has become a key element of the business decision process over the last decade. Classic reporting on a dataset stored in a database was sufficient until recently, but yesterday
More informationDynamic Resource Pricing on Federated Clouds
Dynamic Resource Pricing on Federated Clouds Marian Mihailescu and Yong Meng Teo Department of Computer Science National University of Singapore Computing 1, 13 Computing Drive, Singapore 117417 Email:
More informationPACE Predictive Analytics Center of Excellence @ San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.
PACE Predictive Analytics Center of Excellence @ San Diego Supercomputer Center, UCSD Natasha Balac, Ph.D. Brief History of SDSC 1985-1997: NSF national supercomputer center; managed by General Atomics
More informationcan you effectively plan for the migration and management of systems and applications on Vblock Platforms?
SOLUTION BRIEF CA Capacity Management and Reporting Suite for Vblock Platforms can you effectively plan for the migration and management of systems and applications on Vblock Platforms? agility made possible
More informationDistribution transparency. Degree of transparency. Openness of distributed systems
Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer Science steen@cs.vu.nl Chapter 01: Version: August 27, 2012 1 / 28 Distributed System: Definition A distributed
More informationC-Meter: A Framework for Performance Analysis of Computing Clouds
C-Meter: A Framework for Performance Analysis of Computing Clouds Nezih Yigitbasi, Alexandru Iosup, and Dick Epema {M.N.Yigitbasi, D.H.J.Epema, A.Iosup}@tudelft.nl Delft University of Technology Simon
More informationAn Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services
An Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services Abstract In this paper review the notion of the use of mobile device in grid computing environment, We describe
More informationIBM Deep Computing Visualization Offering
P - 271 IBM Deep Computing Visualization Offering Parijat Sharma, Infrastructure Solution Architect, IBM India Pvt Ltd. email: parijatsharma@in.ibm.com Summary Deep Computing Visualization in Oil & Gas
More informationClouds vs Grids KHALID ELGAZZAR GOODWIN 531 ELGAZZAR@CS.QUEENSU.CA
Clouds vs Grids KHALID ELGAZZAR GOODWIN 531 ELGAZZAR@CS.QUEENSU.CA [REF] I Foster, Y Zhao, I Raicu, S Lu, Cloud computing and grid computing 360-degree compared Grid Computing Environments Workshop, 2008.
More informationMinimal Cost Data Sets Storage in the Cloud
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 5, May 2014, pg.1091
More informationSimulating a File-Sharing P2P Network
Simulating a File-Sharing P2P Network Mario T. Schlosser, Tyson E. Condie, and Sepandar D. Kamvar Department of Computer Science Stanford University, Stanford, CA 94305, USA Abstract. Assessing the performance
More informationSynthetic Grid Workloads with Ibis, KOALA, and GrenchMark
Synthetic Grid Workloads with Ibis, KOALA, and GrenchMark Alexandru Iosup 1, Jason Maassen 2, Rob van Nieuwpoort 2, and Dick H.J. Epema 1 1 Faculty of Electrical Engineering, Mathematics, and Computer
More informationCYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)
CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research
More informationMetrics for Effective Resource Management in Global Computing Environments
Metrics for Effective Resource Management in Global Computing Environments Michela Taufer 1, Patricia J. Teller 1, David P. Anderson 2, and Charles L. Brooks, III 3 1 College of Engineering University
More informationNimrod/G: An Architecture for a Resource Management and Scheduling System in a Global Computational Grid
Nimrod/G: An Architecture for a Resource Management and Scheduling System in a Global Computational Grid Rajkumar Buyya, David Abramson, and Jonathan Giddy School of Computer Science and CRC for Enterprise
More informationInternational Journal of Scientific & Engineering Research, Volume 6, Issue 4, April-2015 36 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 6, Issue 4, April-2015 36 An Efficient Approach for Load Balancing in Cloud Environment Balasundaram Ananthakrishnan Abstract Cloud computing
More informationC-Meter: A Framework for Performance Analysis of Computing Clouds
9th IEEE/ACM International Symposium on Cluster Computing and the Grid C-Meter: A Framework for Performance Analysis of Computing Clouds Nezih Yigitbasi, Alexandru Iosup, and Dick Epema Delft University
More informationNetworkCloudSim: Modelling Parallel Applications in Cloud Simulations
2011 Fourth IEEE International Conference on Utility and Cloud Computing NetworkCloudSim: Modelling Parallel Applications in Cloud Simulations Saurabh Kumar Garg and Rajkumar Buyya Cloud Computing and
More informationVanguard Knowledge Automation System
KNOWLEDGE AUTOMATION SYSTEM: OVERVIEW Vanguard Knowledge Automation System Turn routine processes into easy-to-use Web Apps Vanguard Knowledge Automation System lets you capture routine business processes
More informationSCC717 Recent Developments in Information Technology
SCC717 Recent Developments in Information Technology Lecturer: Wanlei ZHOU, Ph.D., DSc., Professor, Acting Head School of Information Technology Deakin University, Melbourne Campus Office: T205.2, Ext.
More information16th International Conference on Control Systems and Computer Science (CSCS16 07)
16th International Conference on Control Systems and Computer Science (CSCS16 07) TOWARDS AN IO INTENSIVE GRID APPLICATION INSTRUMENTATION IN MEDIOGRID Dacian Tudor 1, Florin Pop 2, Valentin Cristea 2,
More informationA Log Analysis System with REST Web Services for Desktop Grids and its Application to Resource Group-based Task Scheduling
Journal of Information Processing Systems, Vol.7, No.4, December 2011 http://dx.doi.org/10.3745/jips.2011.7.4.707 A Log Analysis System with REST Web Services for Desktop Grids and its Application to Resource
More informationIntegration of the OCM-G Monitoring System into the MonALISA Infrastructure
Integration of the OCM-G Monitoring System into the MonALISA Infrastructure W lodzimierz Funika, Bartosz Jakubowski, and Jakub Jaroszewski Institute of Computer Science, AGH, al. Mickiewicza 30, 30-059,
More information