Tackling BigData: Strategies for Parallelizing and Porting Geoprocessing Algorithms to High-Performance Computational Environments

Size: px
Start display at page:

Download "Tackling BigData: Strategies for Parallelizing and Porting Geoprocessing Algorithms to High-Performance Computational Environments"

Transcription

1 Tackling BigData: Strategies for Parallelizing and Porting Geoprocessing Algorithms to High-Performance Computational Environments A. Sorokine 1, A. Myers 1, C. Liu 1, P. Coleman 1, E. Bright 1, A. Rose 1, P. Nugent 1, B. Bhaduri 1 1 OAK RIDGE NATIONAL LABORATORY Oak Ridge, Tennessee {sorokina,myersat,liuc,colemanpr,brightea,rosean,nugentpj,bhaduribl}@ornl.gov 1. Introduction In a world of increasing volume of earth observation and simulation data, the success of scientific advancement and discovery is strongly connected to our capabilities in storing, analyzing, and creating meaningful information from the enormous databases within a potentially useful timeframe. Although the progress of individual processor speed, cache performance, and graphics capability has been impressive, it has not been adequate to match the growth of available spatial data. Consequently, a number of new high-performance computing technologies have been developed and made available to GIScientists in the recent decade. New computational technologies have the potential to improve geoprocessing software performance and reduce computation time by orders of magnitude. Such improvements open new opportunities for GIScientists as the same algorithms and models can be used for larger datasets and applied at a larger scale or for larger number of geographic regions. Most new computational technologies rely on the decomposition of programming tasks into a number of smaller simultaneously executed units. However, most of the existing geoprocessing algorithms have been developed for single-threaded computing environments which in turn imposes rather low limits on the volumes of data that can be effectively processed. Conversion of the existing algorithms to parallel data-intensive architectures presents major challenges in terms of programming effort and testing. This paper describes the preliminary result of an ongoing study aiming to evaluate and analyze strategies for porting legacy geoprocessing algorithms to parallel high-performance and high data volume computational environments. 2. The Challenges of Parallel Geoprocessing and Earlier Work Parallel or distributed computing can be an effective solution to a number of spatial data integration, analysis, and visualization problems (Healy et al., 1998). Creating a geoprocessing application for parallel computational environments is a challenging problem. The complexity of the implementation of a parallel geoprocessing algorithm is significantly higher than development in the traditional serial software paradigm (McKenney, 2011). A number of both technical and non-technical issues specific to parallel processing have to be addressed. Plausible computing solutions include segmenting computational tasks across processors in a multiprocessor machine or across a cluster to alleviate the bottleneck associated with computational processes (Armstrong et al., 1994a, 1994b; Armstrong and Marciano, 1995; Rokos and Armstrong, 1995). This has been well demonstrated in the field of satellite remote sensing. Because of the raster nature of the data and ease of geometric partitioning (single program multiple data sets), several efforts have attempted to facilitate remote sensing algorithms using parallel and distributed computing (Hawick and James, 1997 and 1998; Yang, 2004). Parallel algorithm designers also should account for other issues such as synchronization among the threads accessing the same memory regions, sharing of I/O, and

2 the need to minimize interaction between nodes and threads (Hunt and John, 2011, Quinn, 2003). As a result, implementations of the same algorithm for parallel and serial environments are typically rather different as the peculiarities of parallel platforms have a significant impact upon the overall algorithm design. In addition, in data-intensive applications, input/output operations have to be given a special consideration (Bryant, 2007). In practice, the decision to pursue parallel development has to be based on a large number of factors, with expected improvements in computational efficiency being only one of them. The cost of developing or porting a geoprocessing application to a parallel environment is one of the most important factors that have to be balanced with expected improvement in computational efficiency. Other factors include the availability of the expertise for a specific computational platform, viability of a parallel computational platform for which development is performed in the long term, and the cost of acquiring hardware and maintaining it in the long term. The goal of this study is to demonstrate quantitative comparison of different strategies for porting an existing geoprocessing algorithm to parallel computing environments and to develop a methodology for balancing gains in computational efficiency with development effort. 3. Materials and Methods For the purposes of this study we have examined several re-implementations of a population disaggregation algorithm. This algorithm is used as one of the components of LandScan Global and LandScan USA, two high-resolution population distribution models (Bhaduri et. al., 2002, 2007, Dobson et. al., 2000, Sabesan et. al., 2007). LandScan data are used by tens of thousands of users throughout government, academic, and humanitarian organizations for consequence assessments. Applications using the LandScan data range from plume dispersion models, emergency evacuation planning and support, and natural hazard assessment. Users often require rapid updates of population distributions to account for rapidly changing population mobility such as refugee and other displaced population movements. The need for improving computational performance of the algorithm was initiated by the increased spatial and temporal resolution of the production database for LandScan and LandScan USA. The original version of the algorithm was developed in the Avenue programming language for ArcView 3.2 and has been used for LandScan releases since mid-1990s. The systematic study of the algorithm performance was devised after porting the algorithm to ArcGIS 9.x did not result in the expected performance improvements. Selection of the porting strategy was conditioned on a number of requirements. First, we expected significant improvement of the computational performance that would allow analysts to run the algorithm multiple times to test various combinations of the parameters and data sources. Second, the re-implemented algorithm should fit into the existing geoprocessing infrastructure but at the same time should be usable on highperformance systems without major redesign. Last but not least, the high-performance version of the algorithm should be viable and maintainable in the long term. The goals of this algorithm are to produce the initial LandScan population grid from the provided coefficient grid generated from the LandScan model and the census boundaries in grid format, and normalize the population within each boundary to the census values. First, the total population for each census area is divided by the sum of the coefficients within each of the respective census areas. Next, the coefficient grid is used to generate the initial population grid that will be normalized against the census population. The sum of the resulting grid within each census area is compared with the population values provided; the difference is stored for each zone. The raster cells are converted to points for the normalization process. From here, the population is either added to or subtracted from each point within a census zone until the population difference is zero. Population can be removed

3 from any point within a zone regardless of it coefficient, and the distribution of points are randomized to provide a better population distribution. However, the coefficient does limit the minimum value needed to add one person to a cell. The normalized population points are then converted back into ESRI GRID format and saved for review by an analyst (Bhaduri et. al., 2007). Multiple implementations of the algorithm were created with the purposes of understanding performance improvements that result from various software design decisions. The first reimplementation was done in C# in the ArcGIS 9.x framework. The advantage of using C# in ArcGIS is that it fits well into the ESRI products-based production framework. However, this implementation did not result in any significant performance advantages. Two other implementations were built using open-source tools and frameworks: a serial C++ implementation (compiled with GCC 4.6 and linked to standard library) and a multi-threaded program in Java 6 (IcedTea 1.10). Both implementations use gdal ( for reading and writing GIS formats. C++ is a language of choice for many of high-performance applications and has extensive support in term of libraries and development tools on all high-performance platforms. However, C++ programming generally is regarded as more difficult and requiring more programming skill. On the other hand, Java offers a shorter development cycle and easier availability of the programming expertise in the geoprocessing area. At the same time, some advantages of Java such as late binding and garbage collection come at the cost of performance penalties and lesser adoption and support of Java in high-performance applications. Additional versions of the implementations were created to test the performance of various data structures and memory management approaches. The C++ implementation was tested using either a hash array or indirect index for accessing a table of population counts in memory. For Java, two approaches were tested: storing the intermediate computation result in one array of objects, or storing the results in the arrays of primitive types. All versions of the program were verified to produce identical results with randomization disabled. 4. Preliminary Results and Discussion All implementations were benchmarked using representative sub-datasets of LandScan Global and LandScan USA: C# and Avenue versions were tested on a 39 million pixel subset of LandScan Global (~1 km spatial resolution), and C++ and Java versions were run with 1.3 billion pixel subset of LandScan USA (~90 m spatial resolution). On the Linux systems, execution time was measured using the time system command and rounded to the nearest minute. The Java and C++ versions of the program were executed multiple times to prefill the file cache. The systems were monitored for activity of other programs and services to exclude system-level factors affecting performance. Results of the performance tests are shown in Table 1 and mean execution times of the Java implementation are shown on Fig Time (sec) number of threads Figure 1. Mean execution time of the Java implementation on an 8-core PC.

4 Table 1. Benchmarking Results. Implementation Computing threads Operating System CPU, RAM size, disk Avenue for 1 Windows Intel Core GHz ArcView 3.2 XP 4GB RAM C# for ArcGIS 1 Windows dual 6-core AMD Opteron 9.3 Server GHz 8GB RAM C++ with hash 1 Ubuntu dual 4-core Xeon 2.4GHz, tables C++ with arrays 1 Ubuntu dual 4-core Xeon 2.4GHz, Java with arrays 1 Ubuntu dual 4-core Xeon 2.4GHz, of objects Java with arrays of primitives Java with arrays of primitives 1 Ubuntu Linux 12 Ubuntu Linux dual 4-core Xeon 2.4GHz, 32GB RAM, RAID5 dual 4-core Xeon 2.4GHz, 32GB RAM, RAID5 Dataset size 5,400x 7,200 5,400x 7,200 Execution Time 70 minutes 80 minutes minutes below 2 minutes did not finish because of memory limit 5-6 minutes 2-3 minutes Table 1 shows drastic performance improvements with the transition to implementation in C++ and Java. Performance of the programs written in C++ and Java are rather similar, however, it strongly depends upon the programming techniques used. C++ program execution time was reduced 7-fold by switching from using C++ hash maps to arrays. Two implementations of the algorithm in Java were tested. In the first implementation the data in RAM was organized as a single array of Java objects. However, this implementation had poor performance and excessive memory requirements due to overhead related to creation and destruction of Java objects. In the second Java implementation we relied on the arrays of primitive types and achieved performance that was much closer to that of C++. Further performance improvement was achieved by using multiple threads for computation executed on multiple CPU cores (Fig. 1). The computation was decomposed along the census area boundaries. 5. Conclusions and Current Work In this on-going study we have systematically evaluated strategies for improving performance of a geoprocessing algorithm. We were able to reduce computation time from hours or days to minutes allowing researchers to process much larger datasets and test a larger number of scenarios and model parameters. Such improvements are critically important for bringing GIScience research into the domain of BigData computations. The same algorithm was reimplemented several times using various software platforms and languages. Implementing the algorithm as a standalone program in C++ and Java resulted in the largest performance improvement. However, the performance difference between Java and C++ was not as big as expected. Overall performance of the program depends more on the programming techniques and data structures used; for example, the efficiency of hash table lookup functions in the C++ standard library or speed of allocation of various types of arrays in Java. Improvements from a parallel version of the algorithm were relatively small as the disk system performance was the major bottleneck. Current research is addressing the reimplementation of the same algorithm for high-performance distributed memory system and a version for mixed architecture with GPGPU is under consideration.

5 Acknowledgements This manuscript has been authored by employees of UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the U.S. Department of Energy. Accordingly, the United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The authors would like to acknowledge the financial support for this research from the US Government for the development of LandScan USA model and database. References Armstrong, M. P., Pavlik, C. E., & Marciano, R., 1994a. Parallel processing of spatial statistics. Computers & Geosciences, 20 (2), Armstrong, M. P., Pavlik, C. E., & Marciano, R., 1994b. Experiments in the measurement of spatial association using a parallel supercomputer. Geographical Systems, 1 (4), Armstrong, M. P. & Marciano, R., Massively parallel processing of spatial statistics. International Journal of Geographical Information Systems, 9 (2), Bhaduri, B., E. Bright, P. Coleman, and J. Dobson. LandScan: Locating People Is What Matters. Geoinformatics 5, no. 2 (2002): Bhaduri, B, E. Bright, P. Coleman, and M. Urban. LandScan USA: a High-resolution Geospatial and Temporal Modeling Approach for Population Distribution and Dynamics. GeoJournal 69, no. 1 (2007): Bryant, R. (2007). Data-Intensive Supercomputing: The Case for DISC. Computer Science Department. Retrieved from Dobson, J. E, E. A Bright, P. R Coleman, R. C Durfee, and B. A Worley. LandScan: a Global Population Database for Estimating Populations at Risk. Photogrammetric Engineering and Remote Sensing 66, no. 7 (2000): Hawick, K.A., James, H.A., Distributed high performance computing for remote sensing, Proceedings of Supercomputing 97 Pacific Rim. Hawick, K.A., James, H.A., A Web-based interface for on-demand processing of satellite imagery archives. Proceedings of the Australian Computer Science Conference (ACSC) 98, Perth, Australia. Healy, R., Dowers, S., Gittings, B., & Mineter, M., Parallel processing algorithms for GIS. Taylor & Francis, London. 460 pp. Hunt, C., & John, B. (2011). Java Performance (1st ed.). Prentice Hall. Rokos, D. & Armstrong, M.P., Using Linda to compute spatial autocorrelation in parallel. Computers & Geosciences, 22 (5), McKenney, Paul E. Is Parallel Programming Hard, And, If So, What Can You Do About It? Corvallis, OR, USA: kernel.org, Quinn, M. (2003). Parallel Programming in C with MPI and OpenMP (1st ed.). McGraw-Hill Science/Engineering/Math. Sabesan, A., K. Abercrombie, A. Ganguly, B. Bhaduri, Eddie Bright, and Phillip Coleman. Metrics for the Comparative Analysis of Geospatial Datasets with Applications to High-resolution Grid-based Population Data. GeoJournal 69, no. 1 (2007): Yang, X., Z. Chang, H. Zhou, X. Qu, and C.-J. Li. Services for Parallel Remote-Sensing Image Processing Based on Computational Grid. In Grid and Cooperative Computing - GCC 2004 Workshops, edited by H. Jin, Y. Pan, N. Xiao, and J. Sun, 3252: Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2004.

Parallel Visualization for GIS Applications

Parallel Visualization for GIS Applications Parallel Visualization for GIS Applications Alexandre Sorokine, Jamison Daniel, Cheng Liu Oak Ridge National Laboratory, Geographic Information Science & Technology, PO Box 2008 MS 6017, Oak Ridge National

More information

High-Performance Visualization of Geographic Data

High-Performance Visualization of Geographic Data High-Performance Visualization of Geographic Data Presented by Budhendra Bhaduri Alexandre Sorokine Geographic Information Science and Technology Computational Sciences and Engineering Managed by UT-Battelle

More information

A Theory of the Spatial Computational Domain

A Theory of the Spatial Computational Domain A Theory of the Spatial Computational Domain Shaowen Wang 1 and Marc P. Armstrong 2 1 Academic Technologies Research Services and Department of Geography, The University of Iowa Iowa City, IA 52242 Tel:

More information

Can social media play a role in developing building occupancy curves for small area estimation?

Can social media play a role in developing building occupancy curves for small area estimation? Can social media play a role in developing building occupancy curves for small area estimation? Robert Stewart *, Jesse Piburn, Eric Weber, Marie Urban, April Morton, Gautam Thakur, Budhendra Bhaduri Oak

More information

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP) GIS Initiative: Developing an atmospheric data model for GIS Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP) Unidata seminar August 30, 2004 Presentation Outline Overview

More information

Muse Server Sizing. 18 June 2012. Document Version 0.0.1.9 Muse 2.7.0.0

Muse Server Sizing. 18 June 2012. Document Version 0.0.1.9 Muse 2.7.0.0 Muse Server Sizing 18 June 2012 Document Version 0.0.1.9 Muse 2.7.0.0 Notice No part of this publication may be reproduced stored in a retrieval system, or transmitted, in any form or by any means, without

More information

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect a.jackson@epcc.ed.ac.uk

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect a.jackson@epcc.ed.ac.uk HPC and Big Data EPCC The University of Edinburgh Adrian Jackson Technical Architect a.jackson@epcc.ed.ac.uk EPCC Facilities Technology Transfer European Projects HPC Research Visitor Programmes Training

More information

RevoScaleR Speed and Scalability

RevoScaleR Speed and Scalability EXECUTIVE WHITE PAPER RevoScaleR Speed and Scalability By Lee Edlefsen Ph.D., Chief Scientist, Revolution Analytics Abstract RevoScaleR, the Big Data predictive analytics library included with Revolution

More information

Benchmarking Large Scale Cloud Computing in Asia Pacific

Benchmarking Large Scale Cloud Computing in Asia Pacific 2013 19th IEEE International Conference on Parallel and Distributed Systems ing Large Scale Cloud Computing in Asia Pacific Amalina Mohamad Sabri 1, Suresh Reuben Balakrishnan 1, Sun Veer Moolye 1, Chung

More information

PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE

PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE Sudha M 1, Harish G M 2, Nandan A 3, Usha J 4 1 Department of MCA, R V College of Engineering, Bangalore : 560059, India sudha.mooki@gmail.com 2 Department

More information

High Performance Computing in CST STUDIO SUITE

High Performance Computing in CST STUDIO SUITE High Performance Computing in CST STUDIO SUITE Felix Wolfheimer GPU Computing Performance Speedup 18 16 14 12 10 8 6 4 2 0 Promo offer for EUC participants: 25% discount for K40 cards Speedup of Solver

More information

How To Use Hadoop For Gis

How To Use Hadoop For Gis 2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Big Data: Using ArcGIS with Apache Hadoop David Kaiser Erik Hoel Offering 1330 Esri UC2013. Technical Workshop.

More information

Clusters: Mainstream Technology for CAE

Clusters: Mainstream Technology for CAE Clusters: Mainstream Technology for CAE Alanna Dwyer HPC Division, HP Linux and Clusters Sparked a Revolution in High Performance Computing! Supercomputing performance now affordable and accessible Linux

More information

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN 1 PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN Introduction What is cluster computing? Classification of Cluster Computing Technologies: Beowulf cluster Construction

More information

Virtualization of ArcGIS Pro. An Esri White Paper December 2015

Virtualization of ArcGIS Pro. An Esri White Paper December 2015 An Esri White Paper December 2015 Copyright 2015 Esri All rights reserved. Printed in the United States of America. The information contained in this document is the exclusive property of Esri. This work

More information

Distributed Framework for Data Mining As a Service on Private Cloud

Distributed Framework for Data Mining As a Service on Private Cloud RESEARCH ARTICLE OPEN ACCESS Distributed Framework for Data Mining As a Service on Private Cloud Shraddha Masih *, Sanjay Tanwani** *Research Scholar & Associate Professor, School of Computer Science &

More information

Parallel Programming Survey

Parallel Programming Survey Christian Terboven 02.09.2014 / Aachen, Germany Stand: 26.08.2014 Version 2.3 IT Center der RWTH Aachen University Agenda Overview: Processor Microarchitecture Shared-Memory

More information

Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU

Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Heshan Li, Shaopeng Wang The Johns Hopkins University 3400 N. Charles Street Baltimore, Maryland 21218 {heshanli, shaopeng}@cs.jhu.edu 1 Overview

More information

On-Demand Supercomputing Multiplies the Possibilities

On-Demand Supercomputing Multiplies the Possibilities Microsoft Windows Compute Cluster Server 2003 Partner Solution Brief Image courtesy of Wolfram Research, Inc. On-Demand Supercomputing Multiplies the Possibilities Microsoft Windows Compute Cluster Server

More information

Understanding the Benefits of IBM SPSS Statistics Server

Understanding the Benefits of IBM SPSS Statistics Server IBM SPSS Statistics Server Understanding the Benefits of IBM SPSS Statistics Server Contents: 1 Introduction 2 Performance 101: Understanding the drivers of better performance 3 Why performance is faster

More information

Upgrading Small Business Client and Server Infrastructure E-LEET Solutions. E-LEET Solutions is an information technology consulting firm

Upgrading Small Business Client and Server Infrastructure E-LEET Solutions. E-LEET Solutions is an information technology consulting firm Thank you for considering E-LEET Solutions! E-LEET Solutions is an information technology consulting firm that specializes in low-cost high-performance computing solutions. This document was written as

More information

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software WHITEPAPER Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software SanDisk ZetaScale software unlocks the full benefits of flash for In-Memory Compute and NoSQL applications

More information

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT UNPRECEDENTED OBSERVABILITY, COST-SAVING PERFORMANCE ACCELERATION, AND SUPERIOR DATA PROTECTION KEY FEATURES Unprecedented observability

More information

An Oracle Benchmarking Study February 2011. Oracle Insurance Insbridge Enterprise Rating: Performance Assessment

An Oracle Benchmarking Study February 2011. Oracle Insurance Insbridge Enterprise Rating: Performance Assessment An Oracle Benchmarking Study February 2011 Oracle Insurance Insbridge Enterprise Rating: Performance Assessment Executive Overview... 1 RateManager Testing... 2 Test Environment... 2 Test Scenarios...

More information

Application of GIS Analyzes with Cloud Computing

Application of GIS Analyzes with Cloud Computing ion of GIS Analyzes with Cloud Computing Sevket BEDIROGLU, Volkan YILDIRIM and Selcuk ERBAS (Turkey) Key words: GIS, Cloud Computing, Spatial Cloud Computing SUMMARY Usage of Cloud Computing (CC) has been

More information

Graph Analytics in Big Data. John Feo Pacific Northwest National Laboratory

Graph Analytics in Big Data. John Feo Pacific Northwest National Laboratory Graph Analytics in Big Data John Feo Pacific Northwest National Laboratory 1 A changing World The breadth of problems requiring graph analytics is growing rapidly Large Network Systems Social Networks

More information

PRIMERGY server-based High Performance Computing solutions

PRIMERGY server-based High Performance Computing solutions PRIMERGY server-based High Performance Computing solutions PreSales - May 2010 - HPC Revenue OS & Processor Type Increasing standardization with shift in HPC to x86 with 70% in 2008.. HPC revenue by operating

More information

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC HPC Architecture End to End Alexandre Chauvin Agenda HPC Software Stack Visualization National Scientific Center 2 Agenda HPC Software Stack Alexandre Chauvin Typical HPC Software Stack Externes LAN Typical

More information

Radar Image Processing with Clusters of Computers

Radar Image Processing with Clusters of Computers Radar Image Processing with Clusters of Computers Alois Goller Chalmers University Franz Leberl Institute for Computer Graphics and Vision ABSTRACT Some radar image processing algorithms such as shape-from-shading

More information

benchmarking Amazon EC2 for high-performance scientific computing

benchmarking Amazon EC2 for high-performance scientific computing Edward Walker benchmarking Amazon EC2 for high-performance scientific computing Edward Walker is a Research Scientist with the Texas Advanced Computing Center at the University of Texas at Austin. He received

More information

Big Data and Analytics: A Conceptual Overview. Mike Park Erik Hoel

Big Data and Analytics: A Conceptual Overview. Mike Park Erik Hoel Big Data and Analytics: A Conceptual Overview Mike Park Erik Hoel In this technical workshop This presentation is for anyone that uses ArcGIS and is interested in analyzing large amounts of data We will

More information

Speed Performance Improvement of Vehicle Blob Tracking System

Speed Performance Improvement of Vehicle Blob Tracking System Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA sungchun@usc.edu, nevatia@usc.edu Abstract. A speed

More information

THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING. José Daniel García Sánchez ARCOS Group University Carlos III of Madrid

THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING. José Daniel García Sánchez ARCOS Group University Carlos III of Madrid THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING José Daniel García Sánchez ARCOS Group University Carlos III of Madrid Contents 2 The ARCOS Group. Expand motivation. Expand

More information

White Paper. Recording Server Virtualization

White Paper. Recording Server Virtualization White Paper Recording Server Virtualization Prepared by: Mike Sherwood, Senior Solutions Engineer Milestone Systems 23 March 2011 Table of Contents Introduction... 3 Target audience and white paper purpose...

More information

Virtual Machines. www.viplavkambli.com

Virtual Machines. www.viplavkambli.com 1 Virtual Machines A virtual machine (VM) is a "completely isolated guest operating system installation within a normal host operating system". Modern virtual machines are implemented with either software

More information

Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011

Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011 Scalable Data Analysis in R Lee E. Edlefsen Chief Scientist UserR! 2011 1 Introduction Our ability to collect and store data has rapidly been outpacing our ability to analyze it We need scalable data analysis

More information

TRENDS IN HARDWARE FOR GEOGRAPHIC INFORMATION SYSTEMS

TRENDS IN HARDWARE FOR GEOGRAPHIC INFORMATION SYSTEMS TRENDS IN HARDWARE FOR GEOGRAPHIC INFORMATION SYSTEMS Jack Dangermond Scott Morehouse Environmental Systems Research Institute 380 New York Street Redlands,CA 92373 ABSTRACT This paper presents a description

More information

64-Bit versus 32-Bit CPUs in Scientific Computing

64-Bit versus 32-Bit CPUs in Scientific Computing 64-Bit versus 32-Bit CPUs in Scientific Computing Axel Kohlmeyer Lehrstuhl für Theoretische Chemie Ruhr-Universität Bochum March 2004 1/25 Outline 64-Bit and 32-Bit CPU Examples

More information

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer Stan Posey, MSc and Bill Loewe, PhD Panasas Inc., Fremont, CA, USA Paul Calleja, PhD University of Cambridge,

More information

Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach

Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach S. M. Ashraful Kadir 1 and Tazrian Khan 2 1 Scientific Computing, Royal Institute of Technology (KTH), Stockholm, Sweden smakadir@csc.kth.se,

More information

1 Bull, 2011 Bull Extreme Computing

1 Bull, 2011 Bull Extreme Computing 1 Bull, 2011 Bull Extreme Computing Table of Contents HPC Overview. Cluster Overview. FLOPS. 2 Bull, 2011 Bull Extreme Computing HPC Overview Ares, Gerardo, HPC Team HPC concepts HPC: High Performance

More information

Big Data and Cloud Computing for GHRSST

Big Data and Cloud Computing for GHRSST Big Data and Cloud Computing for GHRSST Jean-Francois Piollé (jfpiolle@ifremer.fr) Frédéric Paul, Olivier Archer CERSAT / Institut Français de Recherche pour l Exploitation de la Mer Facing data deluge

More information

An Grid Service Module for Natural Resource Managers

An Grid Service Module for Natural Resource Managers An Grid Service Module for Natural Resource Managers Dali Wang 1, Eric Carr 1, Mark Palmer 1, Michael W. Berry 2 and Louis J. Gross 1 1 The Institute for Environmental Modeling 569 Dabney Hall, University

More information

Chapter 7. Using Hadoop Cluster and MapReduce

Chapter 7. Using Hadoop Cluster and MapReduce Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in

More information

Introduction to GPU hardware and to CUDA

Introduction to GPU hardware and to CUDA Introduction to GPU hardware and to CUDA Philip Blakely Laboratory for Scientific Computing, University of Cambridge Philip Blakely (LSC) GPU introduction 1 / 37 Course outline Introduction to GPU hardware

More information

A Comparative Study on Vega-HTTP & Popular Open-source Web-servers

A Comparative Study on Vega-HTTP & Popular Open-source Web-servers A Comparative Study on Vega-HTTP & Popular Open-source Web-servers Happiest People. Happiest Customers Contents Abstract... 3 Introduction... 3 Performance Comparison... 4 Architecture... 5 Diagram...

More information

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage White Paper Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage A Benchmark Report August 211 Background Objectivity/DB uses a powerful distributed processing architecture to manage

More information

Parallel Analysis and Visualization on Cray Compute Node Linux

Parallel Analysis and Visualization on Cray Compute Node Linux Parallel Analysis and Visualization on Cray Compute Node Linux David Pugmire, Oak Ridge National Laboratory and Hank Childs, Lawrence Livermore National Laboratory and Sean Ahern, Oak Ridge National Laboratory

More information

NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist

NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist NVIDIA CUDA Software and GPU Parallel Computing Architecture David B. Kirk, Chief Scientist Outline Applications of GPU Computing CUDA Programming Model Overview Programming in CUDA The Basics How to Get

More information

Online Remote Data Backup for iscsi-based Storage Systems

Online Remote Data Backup for iscsi-based Storage Systems Online Remote Data Backup for iscsi-based Storage Systems Dan Zhou, Li Ou, Xubin (Ben) He Department of Electrical and Computer Engineering Tennessee Technological University Cookeville, TN 38505, USA

More information

Accelerating Microsoft Exchange Servers with I/O Caching

Accelerating Microsoft Exchange Servers with I/O Caching Accelerating Microsoft Exchange Servers with I/O Caching QLogic FabricCache Caching Technology Designed for High-Performance Microsoft Exchange Servers Key Findings The QLogic FabricCache 10000 Series

More information

DELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering

DELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering DELL Virtual Desktop Infrastructure Study END-TO-END COMPUTING Dell Enterprise Solutions Engineering 1 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL

More information

Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age

Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age Xuan Shi GRA: Bowei Xue University of Arkansas Spatiotemporal Modeling of Human Dynamics

More information

CMS Tier-3 cluster at NISER. Dr. Tania Moulik

CMS Tier-3 cluster at NISER. Dr. Tania Moulik CMS Tier-3 cluster at NISER Dr. Tania Moulik What and why? Grid computing is a term referring to the combination of computer resources from multiple administrative domains to reach common goal. Grids tend

More information

Multicore Parallel Computing with OpenMP

Multicore Parallel Computing with OpenMP Multicore Parallel Computing with OpenMP Tan Chee Chiang (SVU/Academic Computing, Computer Centre) 1. OpenMP Programming The death of OpenMP was anticipated when cluster systems rapidly replaced large

More information

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data Amanda O Connor, Bryan Justice, and A. Thomas Harris IN52A. Big Data in the Geosciences:

More information

Performance Engineering of the Community Atmosphere Model

Performance Engineering of the Community Atmosphere Model Performance Engineering of the Community Atmosphere Model Patrick H. Worley Oak Ridge National Laboratory Arthur A. Mirin Lawrence Livermore National Laboratory 11th Annual CCSM Workshop June 20-22, 2006

More information

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel Big Data and Analytics: Getting Started with ArcGIS Mike Park Erik Hoel Agenda Overview of big data Distributed computation User experience Data management Big data What is it? Big Data is a loosely defined

More information

Load Balancing on a Non-dedicated Heterogeneous Network of Workstations

Load Balancing on a Non-dedicated Heterogeneous Network of Workstations Load Balancing on a Non-dedicated Heterogeneous Network of Workstations Dr. Maurice Eggen Nathan Franklin Department of Computer Science Trinity University San Antonio, Texas 78212 Dr. Roger Eggen Department

More information

Client/Server Computing Distributed Processing, Client/Server, and Clusters

Client/Server Computing Distributed Processing, Client/Server, and Clusters Client/Server Computing Distributed Processing, Client/Server, and Clusters Chapter 13 Client machines are generally single-user PCs or workstations that provide a highly userfriendly interface to the

More information

Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies

Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies Kurt Klemperer, Principal System Performance Engineer kklemperer@blackboard.com Agenda Session Length:

More information

LLamasoft K2 Enterprise 8.1 System Requirements

LLamasoft K2 Enterprise 8.1 System Requirements Overview... 3 RAM... 3 Cores and CPU Speed... 3 Local System for Operating Supply Chain Guru... 4 Applying Supply Chain Guru Hardware in K2 Enterprise... 5 Example... 6 Determining the Correct Number of

More information

Numerical Calculation of Laminar Flame Propagation with Parallelism Assignment ZERO, CS 267, UC Berkeley, Spring 2015

Numerical Calculation of Laminar Flame Propagation with Parallelism Assignment ZERO, CS 267, UC Berkeley, Spring 2015 Numerical Calculation of Laminar Flame Propagation with Parallelism Assignment ZERO, CS 267, UC Berkeley, Spring 2015 Xian Shi 1 bio I am a second-year Ph.D. student from Combustion Analysis/Modeling Lab,

More information

Information Processing, Big Data, and the Cloud

Information Processing, Big Data, and the Cloud Information Processing, Big Data, and the Cloud James Horey Computational Sciences & Engineering Oak Ridge National Laboratory Fall Creek Falls 2010 Information Processing Systems Model Parameters Data-intensive

More information

Cluster Computing at HRI

Cluster Computing at HRI Cluster Computing at HRI J.S.Bagla Harish-Chandra Research Institute, Chhatnag Road, Jhunsi, Allahabad 211019. E-mail: jasjeet@mri.ernet.in 1 Introduction and some local history High performance computing

More information

Tableau Server 7.0 scalability

Tableau Server 7.0 scalability Tableau Server 7.0 scalability February 2012 p2 Executive summary In January 2012, we performed scalability tests on Tableau Server to help our customers plan for large deployments. We tested three different

More information

Kashif Iqbal - PhD Kashif.iqbal@ichec.ie

Kashif Iqbal - PhD Kashif.iqbal@ichec.ie HPC/HTC vs. Cloud Benchmarking An empirical evalua.on of the performance and cost implica.ons Kashif Iqbal - PhD Kashif.iqbal@ichec.ie ICHEC, NUI Galway, Ireland With acknowledgment to Michele MicheloDo

More information

Enabling Technologies for Distributed Computing

Enabling Technologies for Distributed Computing Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing, UNF Multi-core CPUs and Multithreading Technologies

More information

Amazon Hosted ESRI GeoPortal Server. GeoCloud Project Report

Amazon Hosted ESRI GeoPortal Server. GeoCloud Project Report Amazon Hosted ESRI GeoPortal Server GeoCloud Project Report Description of Application Operating Organization The USDA participated in the FY 2011 Federal Geographic Data Committee (FGDC) GeoCloud Sandbox

More information

Flood Modelling for Cities using Cloud Computing FINAL REPORT. Vassilis Glenis, Vedrana Kutija, Stephen McGough, Simon Woodman, Chris Kilsby

Flood Modelling for Cities using Cloud Computing FINAL REPORT. Vassilis Glenis, Vedrana Kutija, Stephen McGough, Simon Woodman, Chris Kilsby Summary Flood Modelling for Cities using Cloud Computing FINAL REPORT Vassilis Glenis, Vedrana Kutija, Stephen McGough, Simon Woodman, Chris Kilsby Assessment of pluvial flood risk is particularly difficult

More information

Part I Courses Syllabus

Part I Courses Syllabus Part I Courses Syllabus This document provides detailed information about the basic courses of the MHPC first part activities. The list of courses is the following 1.1 Scientific Programming Environment

More information

Terminal Server Software and Hardware Requirements. Terminal Server. Software and Hardware Requirements. Datacolor Match Pigment Datacolor Tools

Terminal Server Software and Hardware Requirements. Terminal Server. Software and Hardware Requirements. Datacolor Match Pigment Datacolor Tools Terminal Server Software and Hardware Requirements Datacolor Match Pigment Datacolor Tools January 21, 2011 Page 1 of 8 Introduction This document will provide preliminary information about the both the

More information

SURVEY ON SCIENTIFIC DATA MANAGEMENT USING HADOOP MAPREDUCE IN THE KEPLER SCIENTIFIC WORKFLOW SYSTEM

SURVEY ON SCIENTIFIC DATA MANAGEMENT USING HADOOP MAPREDUCE IN THE KEPLER SCIENTIFIC WORKFLOW SYSTEM SURVEY ON SCIENTIFIC DATA MANAGEMENT USING HADOOP MAPREDUCE IN THE KEPLER SCIENTIFIC WORKFLOW SYSTEM 1 KONG XIANGSHENG 1 Department of Computer & Information, Xinxiang University, Xinxiang, China E-mail:

More information

System Requirements Table of contents

System Requirements Table of contents Table of contents 1 Introduction... 2 2 Knoa Agent... 2 2.1 System Requirements...2 2.2 Environment Requirements...4 3 Knoa Server Architecture...4 3.1 Knoa Server Components... 4 3.2 Server Hardware Setup...5

More information

Hadoop Cluster Applications

Hadoop Cluster Applications Hadoop Overview Data analytics has become a key element of the business decision process over the last decade. Classic reporting on a dataset stored in a database was sufficient until recently, but yesterday

More information

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,

More information

System requirements for A+

System requirements for A+ System requirements for A+ Anywhere Learning System: System Requirements Customer-hosted Browser Version Web-based ALS (WBA) Delivery Network Requirements In order to configure WBA+ to properly answer

More information

~ Greetings from WSU CAPPLab ~

~ Greetings from WSU CAPPLab ~ ~ Greetings from WSU CAPPLab ~ Multicore with SMT/GPGPU provides the ultimate performance; at WSU CAPPLab, we can help! Dr. Abu Asaduzzaman, Assistant Professor and Director Wichita State University (WSU)

More information

HPC Wales Skills Academy Course Catalogue 2015

HPC Wales Skills Academy Course Catalogue 2015 HPC Wales Skills Academy Course Catalogue 2015 Overview The HPC Wales Skills Academy provides a variety of courses and workshops aimed at building skills in High Performance Computing (HPC). Our courses

More information

GeoImaging Accelerator Pansharp Test Results

GeoImaging Accelerator Pansharp Test Results GeoImaging Accelerator Pansharp Test Results Executive Summary After demonstrating the exceptional performance improvement in the orthorectification module (approximately fourteen-fold see GXL Ortho Performance

More information

Mobile Storage and Search Engine of Information Oriented to Food Cloud

Mobile Storage and Search Engine of Information Oriented to Food Cloud Advance Journal of Food Science and Technology 5(10): 1331-1336, 2013 ISSN: 2042-4868; e-issn: 2042-4876 Maxwell Scientific Organization, 2013 Submitted: May 29, 2013 Accepted: July 04, 2013 Published:

More information

Processing Large Amounts of Images on Hadoop with OpenCV

Processing Large Amounts of Images on Hadoop with OpenCV Processing Large Amounts of Images on Hadoop with OpenCV Timofei Epanchintsev 1,2 and Andrey Sozykin 1,2 1 IMM UB RAS, Yekaterinburg, Russia, 2 Ural Federal University, Yekaterinburg, Russia {eti,avs}@imm.uran.ru

More information

CLOUD BENCHMARK ROUND 1

CLOUD BENCHMARK ROUND 1 REPORT CLOUD BENCHMARK ROUND 1 MAY 27, 2014 Disclaimer Techila Technologies Ltd. disclaims any and all warranties, express, implied or statutory regarding this document or the use of thereof by you to

More information

OPERATING SYSTEMS Internais and Design Principles

OPERATING SYSTEMS Internais and Design Principles OPERATING SYSTEMS Internais and Design Principles FOURTH EDITION William Stallings, Ph.D. Prentice Hall Upper Saddle River, New Jersey 07458 CONTENTS Web Site for Operating Systems: Internais and Design

More information

Studying Code Development for High Performance Computing: The HPCS Program

Studying Code Development for High Performance Computing: The HPCS Program Studying Code Development for High Performance Computing: The HPCS Program Jeff Carver 1, Sima Asgari 1, Victor Basili 1,2, Lorin Hochstein 1, Jeffrey K. Hollingsworth 1, Forrest Shull 2, Marv Zelkowitz

More information

Evaluation of Enterprise Data Protection using SEP Software

Evaluation of Enterprise Data Protection using SEP Software Test Validation Test Validation - SEP sesam Enterprise Backup Software Evaluation of Enterprise Data Protection using SEP Software Author:... Enabling you to make the best technology decisions Backup &

More information

IT Infrastructure and Platforms

IT Infrastructure and Platforms Chapter 6 IT Infrastructure and Platforms 6.1 2006 by Prentice Hall OBJECTIVES Define IT infrastructure and describe the components and levels of IT infrastructure Identify and describe the stages of IT

More information

OpenMP Programming on ScaleMP

OpenMP Programming on ScaleMP OpenMP Programming on ScaleMP Dirk Schmidl schmidl@rz.rwth-aachen.de Rechen- und Kommunikationszentrum (RZ) MPI vs. OpenMP MPI distributed address space explicit message passing typically code redesign

More information

White Paper: Performance of Host-based Media Processing

White Paper: Performance of Host-based Media Processing White Paper: Performance of Host-based Media Processing The viability of host-based software in providing media processing capabilities continues to expand the possibilities for voice application development.

More information

Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing

Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing James D. Jackson Philip J. Hatcher Department of Computer Science Kingsbury Hall University of New Hampshire Durham,

More information

Big Graph Processing: Some Background

Big Graph Processing: Some Background Big Graph Processing: Some Background Bo Wu Colorado School of Mines Part of slides from: Paul Burkhardt (National Security Agency) and Carlos Guestrin (Washington University) Mines CSCI-580, Bo Wu Graphs

More information

SUN ORACLE EXADATA STORAGE SERVER

SUN ORACLE EXADATA STORAGE SERVER SUN ORACLE EXADATA STORAGE SERVER KEY FEATURES AND BENEFITS FEATURES 12 x 3.5 inch SAS or SATA disks 384 GB of Exadata Smart Flash Cache 2 Intel 2.53 Ghz quad-core processors 24 GB memory Dual InfiniBand

More information

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/2015. 2015 CAE Associates

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/2015. 2015 CAE Associates High Performance Computing (HPC) CAEA elearning Series Jonathan G. Dudley, Ph.D. 06/09/2015 2015 CAE Associates Agenda Introduction HPC Background Why HPC SMP vs. DMP Licensing HPC Terminology Types of

More information

Very Large Enterprise Network, Deployment, 25000+ Users

Very Large Enterprise Network, Deployment, 25000+ Users Very Large Enterprise Network, Deployment, 25000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering

More information

CS 3530 Operating Systems. L02 OS Intro Part 1 Dr. Ken Hoganson

CS 3530 Operating Systems. L02 OS Intro Part 1 Dr. Ken Hoganson CS 3530 Operating Systems L02 OS Intro Part 1 Dr. Ken Hoganson Chapter 1 Basic Concepts of Operating Systems Computer Systems A computer system consists of two basic types of components: Hardware components,

More information

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays Red Hat Performance Engineering Version 1.0 August 2013 1801 Varsity Drive Raleigh NC

More information

Scalable Cloud Computing Solutions for Next Generation Sequencing Data

Scalable Cloud Computing Solutions for Next Generation Sequencing Data Scalable Cloud Computing Solutions for Next Generation Sequencing Data Matti Niemenmaa 1, Aleksi Kallio 2, André Schumacher 1, Petri Klemelä 2, Eija Korpelainen 2, and Keijo Heljanko 1 1 Department of

More information

Oracle Big Data Spatial and Graph

Oracle Big Data Spatial and Graph Oracle Big Data Spatial and Graph Oracle Big Data Spatial and Graph offers a set of analytic services and data models that support Big Data workloads on Apache Hadoop and NoSQL database technologies. For

More information

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform A B M Moniruzzaman 1, Kawser Wazed Nafi 2, Prof. Syed Akhter Hossain 1 and Prof. M. M. A. Hashem 1 Department

More information