OUT OF ORDER RENDERING ON VISUALIZATION CLUSTERS

Size: px
Start display at page:

Download "OUT OF ORDER RENDERING ON VISUALIZATION CLUSTERS"

Transcription

1 OUT OF ORDER RENDERING ON VISUALIZATION CLUSTERS Karl Rasche Department of Computer Science Clemson University Clemson, South Carolina, USA Robert Geist Department of Computer Science Clemson University Clemson, South Carolina, USA James Westall Department of Computer Science Clemson University Clemson, South Carolina, USA ABSTRACT A technique for workload balancing on distributed visualization clusters is suggested. The round-robin scheduling of partial object geometry chunks avoids render node starvation and balances the data backlog in the send buffers of the geometry nodes. Because frame synchronization requires that all send buffers drain, balancing these send buffer backlogs is seen to substantially improve frame rate performance for dynamic displays. KEY WORDS Chromium, distributed rendering, tiled displays, workload balancing processing including reading back the frame buffers of the rendering nodes for visualization on special display nodes. Most rendering applications make use of a standard graphics library such as OpenGL [4]. Chromium [5] is a generic software framework for distributing and processing streams of OpenGL commands. The rendering application context is described by a directed graph of stream processing units, or SPUs. Calls by the application to the OpenGL library are intercepted by Chromium and re-routed through the SPU graph. For a sort-first system, one uses a Tile sort SPU to sort and distribute geometry to a set of Render SPUs or Readback SPUs. The latter type renders and makes results available for downstream processing. 1 Introduction In a sort-first parallel rendering system the workload is divided based on a tiling of the two-dimensional display, i.e., a tiling of image space [1]. Image tiles are assigned to rendering tasks. Geometry is sent to a rendering task if that geometry would fall within the tile assigned to that renderer. The geometry must be projected from (3D) world coordinates into (2D) image space before this distribution can be made, as the tile assignment is not known a priori. To avoid transforming all geometry, bounding boxes for sections of geometry are computed in world coordinates. The bounding boxes are then projected into image space, and the corresponding geometry is sent to the appropriate rendering task as determined by the bounds. For a fixed image space resolution, increasing the number of image tiles decreases the average tile size, and thus more tiles are likely to be covered by any given bounding box. Since all geometry in a box must be sent to all rendering tasks controlling all tiles mapped by the box, this can lead to increased network transmission. This overlap problem is a classical hindrance to the scalability of sort-first systems. Recently, there has been an increased interest in parallel, real-time rendering using clusters of commodity, offthe-shelf components [2, 3]. In a sort-first configuration, one set of nodes (PCs) is usually assigned to transforming and sorting geometry. This set is connected, in turn, to other sets of nodes with graphics hardware that perform the rendering. Several options are available for downstream 2 Motivation In May of this year, we installed a 265-node distributed rendering system in Clemson s W. M. Keck Visualization Laboratory. Each node has a 1.6GHz Pentium IV CPU, 58GB disk, and dual Ethernet NICs. The 240 geometry/rendering nodes each have 512MB main memory, an Nvidia GeForce4 TI 4400 graphics card, and dual 100Mb NICs. The 24 display nodes, which drive a 6x4 projector array, each have 1GB main memory, a Matrox G450 graphics card, a Gigabit NIC, and a 100Mb NIC. All nodes are connected via a dedicated Gigabit Ethernet switch. We are interested in examining the network performance of the connections between the geometry sorting nodes and the rendering nodes. For dynamic displays, the overlap of the geometry and the image tiling is constantly changing, and it is important to understand this rich and dynamic behavior, as this is usually the limiting factor in the performance of visualization clusters [6]. The communication between the sorting nodes and the rendering nodes must be synchronous, at least at an image frame level. After sorting and sending all of the geometry for one frame, the sorting node must wait for all of the data to be received by the rendering node(s) before transmitting the next frame of geometry. Failure to synchronize can cause large backlogs of geometry to be queued on the rendering nodes. For interactive applications, this can yield large latencies when responding to user requests. An un-

2 synchronized connection could suffice for non-interactive, non-dynamic applications. We suggest that the order with which geometry is drawn is important for the performance of the sort-torender network. Consider the case in which two objects of complex geometry are each mapped to a separate tile with no overlap. From an application-level standpoint, the natural approach to drawing any scene, including this one, has an object level granularity. That is, the first object is drawn completely, and then the second object is drawn. This can yield an unfortunate interleaving of the connections to which geometry is sent. In the case at hand, a large amount of data is sent to the first rendering node while the second rendering node sits idle. When the sorting of the geometry in the first object completes, the situation reverses and the first rendering node is idled while the second rendering node is swamped. Geometry interleaving has two important and related effects on network performance. First, the rendering nodes should always have data queued in their receive buffers when they are ready to render. Starving a rendering node delays the end of the frame synchronization, which in turn decreases the performance of the system. Second, a poor interleaving can cause large backlogs of data in the send queues on the sorting node(s). This backlog must be fully drained prior to the synchronization step. The larger the backlog, the longer the synchronization will take to complete. To see the effect that geometry interleaving has on performance in an asymmetric network, we conducted a simple experiment using Chromium on nodes in the Keck system. Image space was divided into 6 tiles, each a horizontal strip running the width of the image. Geometry consisted of a set of points in the plane. The points were divided into a 10 x 10 array of bins, where the axes of the array were aligned with the image. We then walked through the array of bins, sending all of the geometry for each bin and flushing the sorting node after each bin. In the first test, we walked through the array in column-major order, and in the second, row-major order. Figure 1 illustrates the tile and geometry layout. The sorting node had a 1000 Mbps TCP/IP connection, and the rendering nodes each had 100 Mbps TCP/IP connections. Figure 2 shows the data received by one of the rendering nodes as a function of elapsed time. Similar results were found for the other rendering nodes in the system. Note the stair-step effect that occurs in the row-major walk. This is a result of biasing the distribution of data to one tile at a time. The column-major walk is much closer to an equal distribution. Note that it starts by sending the first bin of data to the first tile, then it sends a second bin of data that overlaps the first two tiles, then it sends a bin of data to the second tile, and so on. This approach keeps all the send queues for all connections relatively balanced and avoids the starvation that is seen in the row-major walk. Note that the stalling in the row-major Figure 1. Tiling setup for the first experiments walk is not due to processing by the sorting node, as this would yield large stalls in the column-major walk as well. The round-robin (Reordered) scheduling shown in the figure will be explained in section 3. We also recorded the total bytes in the send queues at the time of the post-frame synchronization under each transmission scheme. These are shown in Figure 3, where we can see that a large backlog occurs in the row-major walk, but not in the column major walk. The round-robin (Reordered) scheduling shown in the figure will be explained in section 3. 3 Out of Order Rendering From this experiment, we see that it is advantageous to sort geometry so that sequential sections of geometry are sent to different tiles. To achieve this advantage, the user application could be altered so that it draws small portions of objects in an interleaved fashion. This would solve the problem if the mapping of the geometry to the tiles were known to the application. For the general application, this is impossible. Simply interleaving sections of the objects will not guarantee a balanced interleaving over the tiles. We suggest that a two part solution to this problem will be required. The first part is a decomposition of the objects in the scene into small sections or chunks. The desired granularity of the chunks will depend upon the scene and system. Recent, distributed OpenGL systems [5, 6] have operated by intercepting calls with a fake dynamic library and re-routing them as the setup demands. Such an approach would be very difficult to use with this decomposition, as it would require retaining a considerable

3 Figure 2. Rendering node throughput (static points) Figure 3. Send queue data at synchronization (static points)

4 amount of state data prior to performing the decomposition. Samanta et al [7] propose using such a retained mode decomposition, but such is too restrictive for our purposes. We need a view-independent decomposition that can be performed as a preprocessing step for static geometry. We are investigating automated decomposition, but for now we perform this step on an application-specific basis. The second part of the solution requires scheduling the order in which the chunks should be sorted and transmitted for rendering. If we assume a fair amount of temporal coherence in our dynamic displays, we can use the overlap information from the previous frame to schedule the chunks for the next frame. 3.1 Tracking In order for the system to keep track of the chunks of geometry, an identifier is needed for each as well as a method of querying the scheduler for the next chunk of geometry to be scheduled. We propose two new OpenGL functions to fill this void. The first, glgentokens(), defines a set of identifiers. The second, glnexttoken(), returns the identifier of the next chunk to be drawn. An explicit scheduling call is not necessary, as it can be done at the time of Swap- Buffers(). Below is a sample rendering loop. /* generate a set of 100 chunk ids */ glgentokens(1,100); /* schedule and render */ while(1){ int next; glnexttoken(1,&next); if(next == -1) break; render_chunk(next); } SwapBuffers(); The function render chunk() is responsible for issuing the commands to draw a given chunk of the scene geometry. 3.2 Scheduling The goal of the scheduler is to provide a good, i.e., temporally balanced, interleaving of geometry to the various rendering nodes. Our scheduler operates in a round-robin manner. We maintain N + 1 lists of identifiers for a system with N tiles. The extra list is used for geometry that falls outside the viewport. In the case where geometry overlaps multiple tiles, a record is stored in each of the (covered) tile lists. The scheduler removes a chunk identifier from the first list and removes any instances of the same identifier from other lists. The chosen geometry chunk identifier is the next value returned by glnexttoken(). This procedure continues in a round-robin fashion until all lists are exhausted. This provides an interleaving but not a perfect balance. It ignores cases where a geometry chunk overlaps multiple tiles. If an identifier is at the head of one list and also appears in the neighboring list, the neighboring list, which is likely to receive the overlapping identifier when this chunk is rendered, probably should be skipped in this scheduling pass. We also note that our testing procedure assumes a uniform granularity of geometry sections. If this is not the case, the lists can be sorted by size before scheduling. 4 Measurement We repeated the experiment of Figure 1 with the roundrobin reordering system, and the resulting throughput is shown as the Reordered line in Figure 2. Note that the throughput is approximately the same as that for the column-major walk. This is to be expected, as the roundrobin scheduling should produce an order similar to the column-major walk. Near the beginning of the transfer, the reordered walk appears to behave similarly to the rowmajor walk, but, as time progresses, the behavior aligns itself with that of the column-major walk. At this stage, it would appear that there is no benefit in reordering. Similar results were found when measuring the amount of data queued in send buffers on the sorting node, as seen in Figure 3. The round-robin scheduling yielded the minimum amount of data queued, but the advantage over the columnmajor walk was not large. In real visualization applications, it is likely that the geometry will be moving with respect to the image plane. To account for this in our test program, we performed a second experiment in which the points were rotated a few degrees every frame. The results are shown in Figure 4. In this case, the row-major and column-major walks oscillate as the bins line up with the image tiles. The round-robin reordered walk displays little of this behavior and maintains a fairly constant, higher throughput. Looking at the send queue data at synchronization instants, Figure 5, we see similar results. There are occasional steps in the roundrobin walk, likely due to multiple overlaps, but it is clear that reordering is advantageous. For a less contrived test, we used our reordering technique on a cloud of 70,000 points scanned from the recently salvaged civil-war submarine, the H. L. Hunley [8]. The complete Hunley scan is shown in Figure 6. This data set has a significantly different spatial distribution from the data set used in the previous tests. Figure 7 shows the data queued at synchronization instants as the point cloud slowly rotates. We can see that while the difference is somewhat less than in the previous experiments, the reordering remains beneficial.

5 Figure 4. Render node throughput with a rotating set of points Figure 5. Data in send queues at synchronization for a rotating set of points

6 5 Discussion While reordering data has benefits, it is not without potential drawbacks. The first difficulty is the code structure demand placed on the user application. It is not easy to take existing OpenGL code and reorder the geometry without inducing unwanted modifications in the resulting image(s). Automated decomposition of the geometry using spatially non-uniform octrees (KD trees) should be a productive avenue for exploration. It may be possible to take a sequential geometry drawing loop and preprocess it into sections, perhaps with the aid of a profiler. Another potential problem is thrashing. There may be large changes in (OpenGL) state that must take place before a section can be sorted and transmitted for rendering. Perhaps even more serious is texture thrashing. Reordering may destroy texture coherence, requiring frequent uploading of textures to the graphics hardware. This is part of the motivation for dividing geometry sections into sets. To attempt to minimize this thrashing, sets with like state and textures can be grouped together. Depending on the spatial locality of the sections, this may increase rendering speed, at some expense in throughput. Unfortunately, since network throughput is generally the bottleneck [6], this may not cause any improvement in overall performance. In our scheduling scheme, there are no provisions for multiple tiles that map to the same rendering node. This problem is easily solved by concatenating hit lists for tiles for which this occurs. Transparency is also troublesome. Traditionally, rendering transparent objects requires control over the order with which objects are drawn [4]. One potential solution would be to allow only part of the scene to be reordered by inserting sequential drawing instructions between sets of reorderable geometry. Another potential solution would be to use a different approach to rendering transparent objects. Everitt describes an order-independent method for rendering transparent objects [9]. This would allow for all objects in the scene to be reorderable, but this solution resembles a sort-last parallel system more than a sort-first system. Finally, while it is relatively straightforward to decompose static geometry into chunks, it may be more difficult to maintain a good decomposition for dynamic data. 6 Conclusions and Future Work We have suggested a technique for workload balancing on distributed visualization clusters. The round-robin scheduling of partial object geometry chunks avoids rendering node starvation and balances the data backlog in the send buffers of the geometry nodes. Because frame synchronization requires that all send buffers drain, balancing these send buffer backlogs is important to frame rate perfor- Figure 6. Complete scan data from H. L. Hunley mance for dynamic displays. An automated geometry decomposition system could be quite useful in conjunction with our scheduler. Decomposition levels might be controlled by feedback from previous frame rendering performance. A scheduling algorithm that takes into account the interleaving of geometry chunks that overlap tiles may provide further improved throughput. This may become more apparent when using scenes with high overlap. Nevertheless, load balancing through re-tiling of the image strives to decrease overlap [3], and, if overlap is kept to a minimum, improving our scheduler to better handle these cases may yield little performance improvement. In some cases, an intercept-and-stream model of distribution [5, 6] may not be optimal. Take, for example, the case of normal vectors. With many surface representations, normals can be recovered from the local surface properties. We could therefore reduce the bandwidth necessary to distribute the surface if the normal computation were deferred until after transmission. In a stream processing environment such as Chromium, this would be possible by inserting a processing element into the processing graph after the transmission stage. Unfortunately, as geometry is distributed, topological information is generally lost. While some of this information can be recovered with additional processing, this may introduce unwanted latency or retransmission. A system with partial data replication on the rendering nodes may offer substantial improvements. Each chunk identifier could be associated with a set of memory pages. From these pages, the stream of GPU commands could be generated. If a chunk sent to a rendering node causes a fault on its required set of pages, the rendering node could simply enqueue the identifier and proceed with execution

7 Figure 7. Data in send queues at synchronization instants while rotating the H. L. Hunley data set of the next ready chunk. A similar enqueue-and-move-on strategy could be employed in a distributed texture system, where the tracking of chunks could be used for preloading of the textures. 7 Acknowledgments This work was supported in part by the ERC Program of the U.S. National Science Foundation under award EEC , the ITR Program of the National Science Foundation under award ACI , and a grant from the W. M. Keck Foundation. References [1] C. Mueller. The Sort-First Rendering Architecture for High-Performance Graphics. In Proc. ACM SIG- GRAPH Symposium on Interactive 3-D Graphics, pages 75 84, [2] G. Humphreys and P. Hanrahan. A Distributed Graphics System for Large Tiled Displays. In Proc. IEEE Visualization, pages , [3] R. Samanta, J. Zheng, T. Funkhouser, K. Li, and J. P. Singh. Load Balancing for Multi-Projector Rendering Systems. In Proc. SIGGRAPH/Eurographics Workshop on Graphics Hardware, pages , [4] M. Woo, J. Neider, T. Davis, and D. Shreiner. OpenGL Programming Guide. Addison-Wesley, 3rd edition, [5] G. Humphreys, M. Houston, R. Ng, R. Frank, S. Ahern, P. Kirchner, and J. T. Klosowski. Chromium: A Stream Processing Framework for Interactive Rendering on Clusters of Workstations. ACM Trans. on Graphics (Proc. SIGGRAPH 2002), 21(3): , July [6] G. Humphreys, M. Eldridge, I. Buck, G. Stoll, M. Everett, and P. Hanrahan. WireGL: A Scalable Graphics System for Clusters. In Computer Graphics (SIG- GRAPH 2001 Proceedings), pages , [7] R. Samanta, T. Funkhouser, K. Li, and J. P. Singh. Hybrid Sort-First and Sort-Last Parallel Rendering with a Cluster of PCs. In Proc. SIGGRAPH/Eurographics Workshop on Graphics Hardware, pages , [8] G. Oeland and I. Block. The H.L. Hunley. National Geographic, pages , July [9] C. Everitt. Interactive order-independent transparency. Technical report, Nvidia, developer.nvidia.com.

How To Share Rendering Load In A Computer Graphics System

How To Share Rendering Load In A Computer Graphics System Bottlenecks in Distributed Real-Time Visualization of Huge Data on Heterogeneous Systems Gökçe Yıldırım Kalkan Simsoft Bilg. Tekn. Ltd. Şti. Ankara, Turkey Email: gokce@simsoft.com.tr Veysi İşler Dept.

More information

Stream Processing on GPUs Using Distributed Multimedia Middleware

Stream Processing on GPUs Using Distributed Multimedia Middleware Stream Processing on GPUs Using Distributed Multimedia Middleware Michael Repplinger 1,2, and Philipp Slusallek 1,2 1 Computer Graphics Lab, Saarland University, Saarbrücken, Germany 2 German Research

More information

Journal of Chemical and Pharmaceutical Research, 2013, 5(12):118-122. Research Article. An independence display platform using multiple media streams

Journal of Chemical and Pharmaceutical Research, 2013, 5(12):118-122. Research Article. An independence display platform using multiple media streams Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2013, 5(12):118-122 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 An independence display platform using multiple

More information

OctaVis: A Simple and Efficient Multi-View Rendering System

OctaVis: A Simple and Efficient Multi-View Rendering System OctaVis: A Simple and Efficient Multi-View Rendering System Eugen Dyck, Holger Schmidt, Mario Botsch Computer Graphics & Geometry Processing Bielefeld University Abstract: We present a simple, low-cost,

More information

A Chromium Based Viewer for CUMULVS

A Chromium Based Viewer for CUMULVS A Chromium Based Viewer for CUMULVS Submitted to PDPTA 06 Dan Bennett Corresponding Author Department of Mathematics and Computer Science Edinboro University of PA Edinboro, Pennsylvania 16444 Phone: (814)

More information

REMOTE RENDERING OF COMPUTER GAMES

REMOTE RENDERING OF COMPUTER GAMES REMOTE RENDERING OF COMPUTER GAMES Peter Eisert, Philipp Fechteler Fraunhofer Institute for Telecommunications, Einsteinufer 37, D-10587 Berlin, Germany eisert@hhi.fraunhofer.de, philipp.fechteler@hhi.fraunhofer.de

More information

Parallel Visualization for GIS Applications

Parallel Visualization for GIS Applications Parallel Visualization for GIS Applications Alexandre Sorokine, Jamison Daniel, Cheng Liu Oak Ridge National Laboratory, Geographic Information Science & Technology, PO Box 2008 MS 6017, Oak Ridge National

More information

Quiz for Chapter 6 Storage and Other I/O Topics 3.10

Quiz for Chapter 6 Storage and Other I/O Topics 3.10 Date: 3.10 Not all questions are of equal difficulty. Please review the entire quiz first and then budget your time carefully. Name: Course: Solutions in Red 1. [6 points] Give a concise answer to each

More information

NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect

NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect SIGGRAPH 2013 Shaping the Future of Visual Computing NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect NVIDIA

More information

Equalizer. Parallel OpenGL Application Framework. Stefan Eilemann, Eyescale Software GmbH

Equalizer. Parallel OpenGL Application Framework. Stefan Eilemann, Eyescale Software GmbH Equalizer Parallel OpenGL Application Framework Stefan Eilemann, Eyescale Software GmbH Outline Overview High-Performance Visualization Equalizer Competitive Environment Equalizer Features Scalability

More information

EMC Business Continuity for Microsoft SQL Server Enabled by SQL DB Mirroring Celerra Unified Storage Platforms Using iscsi

EMC Business Continuity for Microsoft SQL Server Enabled by SQL DB Mirroring Celerra Unified Storage Platforms Using iscsi EMC Business Continuity for Microsoft SQL Server Enabled by SQL DB Mirroring Applied Technology Abstract Microsoft SQL Server includes a powerful capability to protect active databases by using either

More information

Load Balancing in Fault Tolerant Video Server

Load Balancing in Fault Tolerant Video Server Load Balancing in Fault Tolerant Video Server # D. N. Sujatha*, Girish K*, Rashmi B*, Venugopal K. R*, L. M. Patnaik** *Department of Computer Science and Engineering University Visvesvaraya College of

More information

Binary search tree with SIMD bandwidth optimization using SSE

Binary search tree with SIMD bandwidth optimization using SSE Binary search tree with SIMD bandwidth optimization using SSE Bowen Zhang, Xinwei Li 1.ABSTRACT In-memory tree structured index search is a fundamental database operation. Modern processors provide tremendous

More information

SFWR 4C03: Computer Networks & Computer Security Jan 3-7, 2005. Lecturer: Kartik Krishnan Lecture 1-3

SFWR 4C03: Computer Networks & Computer Security Jan 3-7, 2005. Lecturer: Kartik Krishnan Lecture 1-3 SFWR 4C03: Computer Networks & Computer Security Jan 3-7, 2005 Lecturer: Kartik Krishnan Lecture 1-3 Communications and Computer Networks The fundamental purpose of a communication network is the exchange

More information

PARALLELS CLOUD STORAGE

PARALLELS CLOUD STORAGE PARALLELS CLOUD STORAGE Performance Benchmark Results 1 Table of Contents Executive Summary... Error! Bookmark not defined. Architecture Overview... 3 Key Features... 5 No Special Hardware Requirements...

More information

Using Fuzzy Logic Control to Provide Intelligent Traffic Management Service for High-Speed Networks ABSTRACT:

Using Fuzzy Logic Control to Provide Intelligent Traffic Management Service for High-Speed Networks ABSTRACT: Using Fuzzy Logic Control to Provide Intelligent Traffic Management Service for High-Speed Networks ABSTRACT: In view of the fast-growing Internet traffic, this paper propose a distributed traffic management

More information

Scalability and Classifications

Scalability and Classifications Scalability and Classifications 1 Types of Parallel Computers MIMD and SIMD classifications shared and distributed memory multicomputers distributed shared memory computers 2 Network Topologies static

More information

In-Memory Databases Algorithms and Data Structures on Modern Hardware. Martin Faust David Schwalb Jens Krüger Jürgen Müller

In-Memory Databases Algorithms and Data Structures on Modern Hardware. Martin Faust David Schwalb Jens Krüger Jürgen Müller In-Memory Databases Algorithms and Data Structures on Modern Hardware Martin Faust David Schwalb Jens Krüger Jürgen Müller The Free Lunch Is Over 2 Number of transistors per CPU increases Clock frequency

More information

MEASURING WIRELESS NETWORK CONNECTION QUALITY

MEASURING WIRELESS NETWORK CONNECTION QUALITY Technical Disclosure Commons Defensive Publications Series January 27, 2016 MEASURING WIRELESS NETWORK CONNECTION QUALITY Mike Mu Avery Pennarun Follow this and additional works at: http://www.tdcommons.org/dpubs_series

More information

Large Scale Data Visualization and Rendering: Scalable Rendering

Large Scale Data Visualization and Rendering: Scalable Rendering Large Scale Data Visualization and Rendering: Scalable Rendering Randall Frank Lawrence Livermore National Laboratory UCRL-PRES PRES-145218 This work was performed under the auspices of the U.S. Department

More information

Introduction Disks RAID Tertiary storage. Mass Storage. CMSC 412, University of Maryland. Guest lecturer: David Hovemeyer.

Introduction Disks RAID Tertiary storage. Mass Storage. CMSC 412, University of Maryland. Guest lecturer: David Hovemeyer. Guest lecturer: David Hovemeyer November 15, 2004 The memory hierarchy Red = Level Access time Capacity Features Registers nanoseconds 100s of bytes fixed Cache nanoseconds 1-2 MB fixed RAM nanoseconds

More information

Benchmarking Hadoop & HBase on Violin

Benchmarking Hadoop & HBase on Violin Technical White Paper Report Technical Report Benchmarking Hadoop & HBase on Violin Harnessing Big Data Analytics at the Speed of Memory Version 1.0 Abstract The purpose of benchmarking is to show advantages

More information

The IntelliMagic White Paper on: Storage Performance Analysis for an IBM San Volume Controller (SVC) (IBM V7000)

The IntelliMagic White Paper on: Storage Performance Analysis for an IBM San Volume Controller (SVC) (IBM V7000) The IntelliMagic White Paper on: Storage Performance Analysis for an IBM San Volume Controller (SVC) (IBM V7000) IntelliMagic, Inc. 558 Silicon Drive Ste 101 Southlake, Texas 76092 USA Tel: 214-432-7920

More information

A Distributed Render Farm System for Animation Production

A Distributed Render Farm System for Animation Production A Distributed Render Farm System for Animation Production Jiali Yao, Zhigeng Pan *, Hongxin Zhang State Key Lab of CAD&CG, Zhejiang University, Hangzhou, 310058, China {yaojiali, zgpan, zhx}@cad.zju.edu.cn

More information

Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6

Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6 Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6 Winter Term 2008 / 2009 Jun.-Prof. Dr. André Brinkmann Andre.Brinkmann@uni-paderborn.de Universität Paderborn PC² Agenda Multiprocessor and

More information

VMWARE WHITE PAPER 1

VMWARE WHITE PAPER 1 1 VMWARE WHITE PAPER Introduction This paper outlines the considerations that affect network throughput. The paper examines the applications deployed on top of a virtual infrastructure and discusses the

More information

Highly Available Mobile Services Infrastructure Using Oracle Berkeley DB

Highly Available Mobile Services Infrastructure Using Oracle Berkeley DB Highly Available Mobile Services Infrastructure Using Oracle Berkeley DB Executive Summary Oracle Berkeley DB is used in a wide variety of carrier-grade mobile infrastructure systems. Berkeley DB provides

More information

Operating Systems, 6 th ed. Test Bank Chapter 7

Operating Systems, 6 th ed. Test Bank Chapter 7 True / False Questions: Chapter 7 Memory Management 1. T / F In a multiprogramming system, main memory is divided into multiple sections: one for the operating system (resident monitor, kernel) and one

More information

Task Scheduling in Hadoop

Task Scheduling in Hadoop Task Scheduling in Hadoop Sagar Mamdapure Munira Ginwala Neha Papat SAE,Kondhwa SAE,Kondhwa SAE,Kondhwa Abstract Hadoop is widely used for storing large datasets and processing them efficiently under distributed

More information

Data Visualization in Parallel Environment Based on the OpenGL Standard

Data Visualization in Parallel Environment Based on the OpenGL Standard NO HEADER, NO FOOTER 5 th Slovakian-Hungarian Joint Symposium on Applied Machine Intelligence and Informatics January 25-26, 2007 Poprad, Slovakia Data Visualization in Parallel Environment Based on the

More information

How To Monitor And Test An Ethernet Network On A Computer Or Network Card

How To Monitor And Test An Ethernet Network On A Computer Or Network Card 3. MONITORING AND TESTING THE ETHERNET NETWORK 3.1 Introduction The following parameters are covered by the Ethernet performance metrics: Latency (delay) the amount of time required for a frame to travel

More information

Windows Server Performance Monitoring

Windows Server Performance Monitoring Spot server problems before they are noticed The system s really slow today! How often have you heard that? Finding the solution isn t so easy. The obvious questions to ask are why is it running slowly

More information

Scheduling and Resource Management in Computational Mini-Grids

Scheduling and Resource Management in Computational Mini-Grids Scheduling and Resource Management in Computational Mini-Grids July 1, 2002 Project Description The concept of grid computing is becoming a more and more important one in the high performance computing

More information

InfiniBand Clustering

InfiniBand Clustering White Paper InfiniBand Clustering Delivering Better Price/Performance than Ethernet 1.0 Introduction High performance computing clusters typically utilize Clos networks, more commonly known as Fat Tree

More information

Introduction to Cloud Computing

Introduction to Cloud Computing Introduction to Cloud Computing Parallel Processing I 15 319, spring 2010 7 th Lecture, Feb 2 nd Majd F. Sakr Lecture Motivation Concurrency and why? Different flavors of parallel computing Get the basic

More information

low-level storage structures e.g. partitions underpinning the warehouse logical table structures

low-level storage structures e.g. partitions underpinning the warehouse logical table structures DATA WAREHOUSE PHYSICAL DESIGN The physical design of a data warehouse specifies the: low-level storage structures e.g. partitions underpinning the warehouse logical table structures low-level structures

More information

High Performance Cluster Support for NLB on Window

High Performance Cluster Support for NLB on Window High Performance Cluster Support for NLB on Window [1]Arvind Rathi, [2] Kirti, [3] Neelam [1]M.Tech Student, Department of CSE, GITM, Gurgaon Haryana (India) arvindrathi88@gmail.com [2]Asst. Professor,

More information

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System CS341: Operating System Lect 36: 1 st Nov 2014 Dr. A. Sahu Dept of Comp. Sc. & Engg. Indian Institute of Technology Guwahati File System & Device Drive Mass Storage Disk Structure Disk Arm Scheduling RAID

More information

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays Database Solutions Engineering By Murali Krishnan.K Dell Product Group October 2009

More information

- An Essential Building Block for Stable and Reliable Compute Clusters

- An Essential Building Block for Stable and Reliable Compute Clusters Ferdinand Geier ParTec Cluster Competence Center GmbH, V. 1.4, March 2005 Cluster Middleware - An Essential Building Block for Stable and Reliable Compute Clusters Contents: Compute Clusters a Real Alternative

More information

18-742 Lecture 4. Parallel Programming II. Homework & Reading. Page 1. Projects handout On Friday Form teams, groups of two

18-742 Lecture 4. Parallel Programming II. Homework & Reading. Page 1. Projects handout On Friday Form teams, groups of two age 1 18-742 Lecture 4 arallel rogramming II Spring 2005 rof. Babak Falsafi http://www.ece.cmu.edu/~ece742 write X Memory send X Memory read X Memory Slides developed in part by rofs. Adve, Falsafi, Hill,

More information

A Pattern-Based Approach to. Automated Application Performance Analysis

A Pattern-Based Approach to. Automated Application Performance Analysis A Pattern-Based Approach to Automated Application Performance Analysis Nikhil Bhatia, Shirley Moore, Felix Wolf, and Jack Dongarra Innovative Computing Laboratory University of Tennessee (bhatia, shirley,

More information

An Interactive Dynamic Tiled Display System

An Interactive Dynamic Tiled Display System An Interactive Dynamic Tiled Display System Juliano Franz, Gelson Reinaldo, Anderson Maciel and Luciana Nedel Instituto de Informática Universidade Federal do Rio Grande do Sul Porto Alegre, Brazil {jmfranz@inf.ufrgs.br,

More information

Cloud Based Distributed Databases: The Future Ahead

Cloud Based Distributed Databases: The Future Ahead Cloud Based Distributed Databases: The Future Ahead Arpita Mathur Mridul Mathur Pallavi Upadhyay Abstract Fault tolerant systems are necessary to be there for distributed databases for data centers or

More information

Parallel Simplification of Large Meshes on PC Clusters

Parallel Simplification of Large Meshes on PC Clusters Parallel Simplification of Large Meshes on PC Clusters Hua Xiong, Xiaohong Jiang, Yaping Zhang, Jiaoying Shi State Key Lab of CAD&CG, College of Computer Science Zhejiang University Hangzhou, China April

More information

PART III. OPS-based wide area networks

PART III. OPS-based wide area networks PART III OPS-based wide area networks Chapter 7 Introduction to the OPS-based wide area network 7.1 State-of-the-art In this thesis, we consider the general switch architecture with full connectivity

More information

Web Server Software Architectures

Web Server Software Architectures Web Server Software Architectures Author: Daniel A. Menascé Presenter: Noshaba Bakht Web Site performance and scalability 1.workload characteristics. 2.security mechanisms. 3. Web cluster architectures.

More information

PERFORMANCE TUNING ORACLE RAC ON LINUX

PERFORMANCE TUNING ORACLE RAC ON LINUX PERFORMANCE TUNING ORACLE RAC ON LINUX By: Edward Whalen Performance Tuning Corporation INTRODUCTION Performance tuning is an integral part of the maintenance and administration of the Oracle database

More information

The Sierra Clustered Database Engine, the technology at the heart of

The Sierra Clustered Database Engine, the technology at the heart of A New Approach: Clustrix Sierra Database Engine The Sierra Clustered Database Engine, the technology at the heart of the Clustrix solution, is a shared-nothing environment that includes the Sierra Parallel

More information

Faculty of Computer Science Computer Graphics Group. Final Diploma Examination

Faculty of Computer Science Computer Graphics Group. Final Diploma Examination Faculty of Computer Science Computer Graphics Group Final Diploma Examination Communication Mechanisms for Parallel, Adaptive Level-of-Detail in VR Simulations Author: Tino Schwarze Advisors: Prof. Dr.

More information

An Extremely Inexpensive Multisampling Scheme

An Extremely Inexpensive Multisampling Scheme An Extremely Inexpensive Multisampling Scheme Tomas Akenine-Möller Ericsson Mobile Platforms AB Chalmers University of Technology Technical Report No. 03-14 Note that this technical report will be extended

More information

A NEW METHOD OF STORAGE AND VISUALIZATION FOR MASSIVE POINT CLOUD DATASET

A NEW METHOD OF STORAGE AND VISUALIZATION FOR MASSIVE POINT CLOUD DATASET 22nd CIPA Symposium, October 11-15, 2009, Kyoto, Japan A NEW METHOD OF STORAGE AND VISUALIZATION FOR MASSIVE POINT CLOUD DATASET Zhiqiang Du*, Qiaoxiong Li State Key Laboratory of Information Engineering

More information

Intel 965 Express Chipset Family Memory Technology and Configuration Guide

Intel 965 Express Chipset Family Memory Technology and Configuration Guide Intel 965 Express Chipset Family Memory Technology and Configuration Guide White Paper - For the Intel 82Q965, 82Q963, 82G965 Graphics and Memory Controller Hub (GMCH) and Intel 82P965 Memory Controller

More information

The International Journal Of Science & Technoledge (ISSN 2321 919X) www.theijst.com

The International Journal Of Science & Technoledge (ISSN 2321 919X) www.theijst.com THE INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLEDGE Efficient Parallel Processing on Public Cloud Servers using Load Balancing Manjunath K. C. M.Tech IV Sem, Department of CSE, SEA College of Engineering

More information

POWER ALL GLOBAL FILE SYSTEM (PGFS)

POWER ALL GLOBAL FILE SYSTEM (PGFS) POWER ALL GLOBAL FILE SYSTEM (PGFS) Defining next generation of global storage grid Power All Networks Ltd. Technical Whitepaper April 2008, version 1.01 Table of Content 1. Introduction.. 3 2. Paradigm

More information

Performance Analysis of IPv4 v/s IPv6 in Virtual Environment Using UBUNTU

Performance Analysis of IPv4 v/s IPv6 in Virtual Environment Using UBUNTU Performance Analysis of IPv4 v/s IPv6 in Virtual Environment Using UBUNTU Savita Shiwani Computer Science,Gyan Vihar University, Rajasthan, India G.N. Purohit AIM & ACT, Banasthali University, Banasthali,

More information

The functionality and advantages of a high-availability file server system

The functionality and advantages of a high-availability file server system The functionality and advantages of a high-availability file server system This paper discusses the benefits of deploying a JMR SHARE High-Availability File Server System. Hardware and performance considerations

More information

Multiple Public IPs (virtual service IPs) are supported either to cover multiple network segments or to increase network performance.

Multiple Public IPs (virtual service IPs) are supported either to cover multiple network segments or to increase network performance. EliteNAS Cluster Mirroring Option - Introduction Real Time NAS-to-NAS Mirroring & Auto-Failover Cluster Mirroring High-Availability & Data Redundancy Option for Business Continueity Typical Cluster Mirroring

More information

System-Level Display Power Reduction Technologies for Portable Computing and Communications Devices

System-Level Display Power Reduction Technologies for Portable Computing and Communications Devices System-Level Display Power Reduction Technologies for Portable Computing and Communications Devices Achintya K. Bhowmik and Robert J. Brennan Intel Corporation 2200 Mission College Blvd. Santa Clara, CA

More information

Remote Graphical Visualization of Large Interactive Spatial Data

Remote Graphical Visualization of Large Interactive Spatial Data Remote Graphical Visualization of Large Interactive Spatial Data ComplexHPC Spring School 2011 International ComplexHPC Challenge Cristinel Mihai Mocan Computer Science Department Technical University

More information

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance. Agenda Enterprise Performance Factors Overall Enterprise Performance Factors Best Practice for generic Enterprise Best Practice for 3-tiers Enterprise Hardware Load Balancer Basic Unix Tuning Performance

More information

Tranzeo s EnRoute500 Performance Analysis and Prediction

Tranzeo s EnRoute500 Performance Analysis and Prediction Tranzeo s EnRoute500 Performance Analysis and Prediction Introduction Tranzeo has developed the EnRoute500 product family to provide an optimum balance between price and performance for wireless broadband

More information

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms Distributed File System 1 How do we get data to the workers? NAS Compute Nodes SAN 2 Distributed File System Don t move data to workers move workers to the data! Store data on the local disks of nodes

More information

Load Balancing In Concurrent Parallel Applications

Load Balancing In Concurrent Parallel Applications Load Balancing In Concurrent Parallel Applications Jeff Figler Rochester Institute of Technology Computer Engineering Department Rochester, New York 14623 May 1999 Abstract A parallel concurrent application

More information

RAID technology and IBM TotalStorage NAS products

RAID technology and IBM TotalStorage NAS products IBM TotalStorage Network Attached Storage October 2001 RAID technology and IBM TotalStorage NAS products By Janet Anglin and Chris Durham Storage Networking Architecture, SSG Page No.1 Contents 2 RAID

More information

Performance of networks containing both MaxNet and SumNet links

Performance of networks containing both MaxNet and SumNet links Performance of networks containing both MaxNet and SumNet links Lachlan L. H. Andrew and Bartek P. Wydrowski Abstract Both MaxNet and SumNet are distributed congestion control architectures suitable for

More information

Recommended hardware system configurations for ANSYS users

Recommended hardware system configurations for ANSYS users Recommended hardware system configurations for ANSYS users The purpose of this document is to recommend system configurations that will deliver high performance for ANSYS users across the entire range

More information

Load Balancing in the Cloud Computing Using Virtual Machine Migration: A Review

Load Balancing in the Cloud Computing Using Virtual Machine Migration: A Review Load Balancing in the Cloud Computing Using Virtual Machine Migration: A Review 1 Rukman Palta, 2 Rubal Jeet 1,2 Indo Global College Of Engineering, Abhipur, Punjab Technical University, jalandhar,india

More information

An Approach to High-Performance Scalable Temporal Object Storage

An Approach to High-Performance Scalable Temporal Object Storage An Approach to High-Performance Scalable Temporal Object Storage Kjetil Nørvåg Department of Computer and Information Science Norwegian University of Science and Technology 791 Trondheim, Norway email:

More information

Intel DPDK Boosts Server Appliance Performance White Paper

Intel DPDK Boosts Server Appliance Performance White Paper Intel DPDK Boosts Server Appliance Performance Intel DPDK Boosts Server Appliance Performance Introduction As network speeds increase to 40G and above, both in the enterprise and data center, the bottlenecks

More information

4.2: Multimedia File Systems Traditional File Systems. Multimedia File Systems. Multimedia File Systems. Disk Scheduling

4.2: Multimedia File Systems Traditional File Systems. Multimedia File Systems. Multimedia File Systems. Disk Scheduling Chapter 2: Representation of Multimedia Data Chapter 3: Multimedia Systems Communication Aspects and Services Chapter 4: Multimedia Systems Storage Aspects Optical Storage Media Multimedia File Systems

More information

Chapter 11 I/O Management and Disk Scheduling

Chapter 11 I/O Management and Disk Scheduling Operating Systems: Internals and Design Principles, 6/E William Stallings Chapter 11 I/O Management and Disk Scheduling Dave Bremer Otago Polytechnic, NZ 2008, Prentice Hall I/O Devices Roadmap Organization

More information

A Review of Customized Dynamic Load Balancing for a Network of Workstations

A Review of Customized Dynamic Load Balancing for a Network of Workstations A Review of Customized Dynamic Load Balancing for a Network of Workstations Taken from work done by: Mohammed Javeed Zaki, Wei Li, Srinivasan Parthasarathy Computer Science Department, University of Rochester

More information

Final for ECE374 05/06/13 Solution!!

Final for ECE374 05/06/13 Solution!! 1 Final for ECE374 05/06/13 Solution!! Instructions: Put your name and student number on each sheet of paper! The exam is closed book. You have 90 minutes to complete the exam. Be a smart exam taker -

More information

<Insert Picture Here> Oracle In-Memory Database Cache Overview

<Insert Picture Here> Oracle In-Memory Database Cache Overview Oracle In-Memory Database Cache Overview Simon Law Product Manager The following is intended to outline our general product direction. It is intended for information purposes only,

More information

Distributed Dynamic Load Balancing for Iterative-Stencil Applications

Distributed Dynamic Load Balancing for Iterative-Stencil Applications Distributed Dynamic Load Balancing for Iterative-Stencil Applications G. Dethier 1, P. Marchot 2 and P.A. de Marneffe 1 1 EECS Department, University of Liege, Belgium 2 Chemical Engineering Department,

More information

Interactive Level-Set Deformation On the GPU

Interactive Level-Set Deformation On the GPU Interactive Level-Set Deformation On the GPU Institute for Data Analysis and Visualization University of California, Davis Problem Statement Goal Interactive system for deformable surface manipulation

More information

How To Understand The Concept Of A Distributed System

How To Understand The Concept Of A Distributed System Distributed Operating Systems Introduction Ewa Niewiadomska-Szynkiewicz and Adam Kozakiewicz ens@ia.pw.edu.pl, akozakie@ia.pw.edu.pl Institute of Control and Computation Engineering Warsaw University of

More information

Quantcast Petabyte Storage at Half Price with QFS!

Quantcast Petabyte Storage at Half Price with QFS! 9-131 Quantcast Petabyte Storage at Half Price with QFS Presented by Silvius Rus, Director, Big Data Platforms September 2013 Quantcast File System (QFS) A high performance alternative to the Hadoop Distributed

More information

CATIA V5 Surface Design

CATIA V5 Surface Design CATIA V5 Training Foils CATIA V5 Surface Design Version 5 Release 19 August 2008 EDU_CAT_EN_V5S_FI_V5R19 1 Lesson 1: Introduction to Generative Shape Design About this Course Introduction CATIA is a robust

More information

Overlapping Data Transfer With Application Execution on Clusters

Overlapping Data Transfer With Application Execution on Clusters Overlapping Data Transfer With Application Execution on Clusters Karen L. Reid and Michael Stumm reid@cs.toronto.edu stumm@eecg.toronto.edu Department of Computer Science Department of Electrical and Computer

More information

STANDPOINT FOR QUALITY-OF-SERVICE MEASUREMENT

STANDPOINT FOR QUALITY-OF-SERVICE MEASUREMENT STANDPOINT FOR QUALITY-OF-SERVICE MEASUREMENT 1. TIMING ACCURACY The accurate multi-point measurements require accurate synchronization of clocks of the measurement devices. If for example time stamps

More information

D1.2 Network Load Balancing

D1.2 Network Load Balancing D1. Network Load Balancing Ronald van der Pol, Freek Dijkstra, Igor Idziejczak, and Mark Meijerink SARA Computing and Networking Services, Science Park 11, 9 XG Amsterdam, The Netherlands June ronald.vanderpol@sara.nl,freek.dijkstra@sara.nl,

More information

Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU

Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Heshan Li, Shaopeng Wang The Johns Hopkins University 3400 N. Charles Street Baltimore, Maryland 21218 {heshanli, shaopeng}@cs.jhu.edu 1 Overview

More information

The changes in each specification and how they compare is shown in the table below. Following the table is a discussion of each of these changes.

The changes in each specification and how they compare is shown in the table below. Following the table is a discussion of each of these changes. Introduction There are many interconnect technologies connect components in a system and an embedded designer is faced with an array of standards and technologies choose from. This paper explores the latest

More information

Efficient Parallel Processing on Public Cloud Servers Using Load Balancing

Efficient Parallel Processing on Public Cloud Servers Using Load Balancing Efficient Parallel Processing on Public Cloud Servers Using Load Balancing Valluripalli Srinath 1, Sudheer Shetty 2 1 M.Tech IV Sem CSE, Sahyadri College of Engineering & Management, Mangalore. 2 Asso.

More information

L20: GPU Architecture and Models

L20: GPU Architecture and Models L20: GPU Architecture and Models scribe(s): Abdul Khalifa 20.1 Overview GPUs (Graphics Processing Units) are large parallel structure of processing cores capable of rendering graphics efficiently on displays.

More information

GPU Architecture. Michael Doggett ATI

GPU Architecture. Michael Doggett ATI GPU Architecture Michael Doggett ATI GPU Architecture RADEON X1800/X1900 Microsoft s XBOX360 Xenos GPU GPU research areas ATI - Driving the Visual Experience Everywhere Products from cell phones to super

More information

A Review of Load Balancing Algorithms for Cloud Computing

A Review of Load Balancing Algorithms for Cloud Computing www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume - 3 Issue -9 September, 2014 Page No. 8297-8302 A Review of Load Balancing Algorithms for Cloud Computing Dr.G.N.K.Sureshbabu

More information

Keywords Load balancing, Dispatcher, Distributed Cluster Server, Static Load balancing, Dynamic Load balancing.

Keywords Load balancing, Dispatcher, Distributed Cluster Server, Static Load balancing, Dynamic Load balancing. Volume 5, Issue 7, July 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Hybrid Algorithm

More information

System Interconnect Architectures. Goals and Analysis. Network Properties and Routing. Terminology - 2. Terminology - 1

System Interconnect Architectures. Goals and Analysis. Network Properties and Routing. Terminology - 2. Terminology - 1 System Interconnect Architectures CSCI 8150 Advanced Computer Architecture Hwang, Chapter 2 Program and Network Properties 2.4 System Interconnect Architectures Direct networks for static connections Indirect

More information

Module 5. Broadcast Communication Networks. Version 2 CSE IIT, Kharagpur

Module 5. Broadcast Communication Networks. Version 2 CSE IIT, Kharagpur Module 5 Broadcast Communication Networks Lesson 1 Network Topology Specific Instructional Objectives At the end of this lesson, the students will be able to: Specify what is meant by network topology

More information

N5 NETWORKING BEST PRACTICES

N5 NETWORKING BEST PRACTICES N5 NETWORKING BEST PRACTICES Table of Contents Nexgen N5 Networking... 2 Overview of Storage Networking Best Practices... 2 Recommended Switch features for an iscsi Network... 2 Setting up the iscsi Network

More information

A Performance Study of Load Balancing Strategies for Approximate String Matching on an MPI Heterogeneous System Environment

A Performance Study of Load Balancing Strategies for Approximate String Matching on an MPI Heterogeneous System Environment A Performance Study of Load Balancing Strategies for Approximate String Matching on an MPI Heterogeneous System Environment Panagiotis D. Michailidis and Konstantinos G. Margaritis Parallel and Distributed

More information

Cellular Computing on a Linux Cluster

Cellular Computing on a Linux Cluster Cellular Computing on a Linux Cluster Alexei Agueev, Bernd Däne, Wolfgang Fengler TU Ilmenau, Department of Computer Architecture Topics 1. Cellular Computing 2. The Experiment 3. Experimental Results

More information

Evaluating HDFS I/O Performance on Virtualized Systems

Evaluating HDFS I/O Performance on Virtualized Systems Evaluating HDFS I/O Performance on Virtualized Systems Xin Tang xtang@cs.wisc.edu University of Wisconsin-Madison Department of Computer Sciences Abstract Hadoop as a Service (HaaS) has received increasing

More information

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB Planet Size Data!? Gartner s 10 key IT trends for 2012 unstructured data will grow some 80% over the course of the next

More information

Name: 1. CS372H: Spring 2009 Final Exam

Name: 1. CS372H: Spring 2009 Final Exam Name: 1 Instructions CS372H: Spring 2009 Final Exam This exam is closed book and notes with one exception: you may bring and refer to a 1-sided 8.5x11- inch piece of paper printed with a 10-point or larger

More information

Survey on Job Schedulers in Hadoop Cluster

Survey on Job Schedulers in Hadoop Cluster IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 1 (Sep. - Oct. 2013), PP 46-50 Bincy P Andrews 1, Binu A 2 1 (Rajagiri School of Engineering and Technology,

More information

Integrated Application and Data Protection. NEC ExpressCluster White Paper

Integrated Application and Data Protection. NEC ExpressCluster White Paper Integrated Application and Data Protection NEC ExpressCluster White Paper Introduction Critical business processes and operations depend on real-time access to IT systems that consist of applications and

More information