During the fifteen years that have elapsed

Size: px
Start display at page:

Download "During the fifteen years that have elapsed"

Transcription

1 Using a Computational Grid for Geographic Information Analysis: A Reconnaissance Marc P. Armstrong, Mary Kathryn Cowles, and Shaowen Wang The University of Iowa High performance computing has undergone a radical transformation during the past decade. Though monolithic supercomputers continue to be built with significantly increased computing power, geographically distributed computing resources are now routinely linked using high-speed networks to address a broad range of computationally complex problems. These confederated resources are referred to collectively as a computational Grid. Many geographical problems exhibit characteristics that make them candidates for this new model of computing. As an illustration, we describe a spatial statistics problem and demonstrate how it can be addressed using Grid computing strategies. A key element of this application is the development of middleware that handles domain decomposition and coordinates computational functions. We also discuss the development of Grid portals that are designed to help researchers and decision makers access and use geographic information analysis tools. Key Words: Grid computing, parallel computing, spatial statistics, middleware, Grid portals. Introduction During the fifteen years that have elapsed since Griffith (1990) described an approach to geographic information analysis that required the use of then-current supercomputers, the characteristics of many geographic problems have evolved in two important ways. First, data sets have continued to increase in size as a consequence of improved data collection technologies and practices (Armstrong 2000). Gigabyte-sized datasets, almost unheard of a decade and a half ago, are now being eclipsed by terabyte and petabyte scale databases 1 (e.g., Thakar et al. 2003). Secondly, computationally intensive methods of analysis that were just becoming known in the early nineties are now gaining wider acceptance. Either of these changes could lead to computational intractability, but when taken together and when computationally intensive methods are applied to enormous disaggregated datasets, the burdens imposed can effectively preclude analyses, unless one is willing to wait months, or even years, for a result (Armstrong, Xiao, and Bennett 2003; Cowles 2003). Geography, of course, is not alone in facing such dilemmas and, fortunately, computationally based geographic research can reap significant benefits by exploiting advances now being put in place by researchers in other disciplines such as computer science and engineering. During the past few years, these advances have begun to transform the landscape of highperformance computing. In particular, an emerging model of computation, called the Grid (Foster and Kesselman 1999), uses high-speed networks to link available, trusted computer systems and form configurable, distributed virtual computing resources. The Grid represents a change in modality in the use of computer technology in which a certain amount of control over local resources is relinquished for use by other trusted applications that are used by a much wider audience. Though the focus of this article is on computation, these shared resources can take other forms, such as distributed data storage and collaborative visualization (Gao et al. 2002; Rajasekar, Wan, and Moore 2002). It is also important to note that these changes are not confined to hardware and software issues alone but also include a set of behavioral changes that enable the shared use of the emerging cyberinfrastructure to support collaborative research (Laforenza 2002) that is being conducted in cognate disciplines such as earth science (e.g., GEON) and ecology (e.g., SEEK EcoGrid). This has required the definition and promulgation of new communication protocols, software that coordinates use (middle- The Professional Geographer, 57(3) 2005, pages r Copyright 2005 by Association of American Geographers. Initial submission, October 2003; revised submission, July 2004; final acceptance, August Published by Blackwell Publishing, 350 Main Street, Malden, MA 02148, and 9600 Garsington Road, Oxford OX4 2DQ, U.K.

2 366 Volume 57, Number 3, August 2005 ware), and the development of sociocomputational mores that enable sharing of distributed resources. The purpose of this article is to sketch out the state of the practice in Grid computing and to present some results from a project that illustrates how this new approach to computation can be used to address a spatial problem. We conclude with some comments about our current research on the development of Grid portals. Such portals are designed to provide researchers with access to collections of easyto-use distributed computing tools that can be configured into a domain-specific, problemsolving environment ( PSE; Gallopoulos, Houstis, and Rice 1994). Computational Grids The idea that gave rise to Grid computing is superficially simple: turn computation into a service delivered over a widely available infrastructure (e.g., cyberinfrastructure). Metaphors such as electrical power transmission and water distribution systems are typically invoked, though the problem of a computational Grid is somewhat different (see, e.g., Licklider and Taylor 1968; Smarr 1999). The concept of Grid computing was developed by computer scientists and engineers after a realistic appraisal of trends in computer technology. The wellknown and oft-cited Moore s Law asserted that chip density (number of transistors per integrated circuit) would double every eighteen months (Moore 1965; Borkar 2003). This observation has recently been extended to include processing speed and other aspects of computer performance (see Bell and Gray 1997, 13). In particular, when trends in storage and networks are considered, the doubling time of capacity and speed turns out to be approximately one year and nine months, respectively (Foster 2002, 42). There is, as might be expected, pessimism in some circles about the likely longterm duration of such trends, but few researchers dispute that they will persist well into the coming decade. Moore, for example, has subsequently predicted that his law will hold until approximately 2017 when other basic physical laws will take precedence. Computer scientists and engineers are well aware of these trends and have acted accordingly. The expenses associated with manufacturing high-density microcircuitry are astronomical. As a consequence, high sales volumes (or prohibitively high unit costs) are needed to maintain the economic viability of companies involved in the design and creation of new generations of computer processors. This has led to significant economic problems for most computer equipment manufacturers. Though vector supercomputers, such as those manufactured by Cray, represented the high end of scientific computing in the late 1980s and early 1990s, they were expensive to develop, acquire, and maintain and were typically tightly controlled by a centralized administrative structure. This is the type of computer environment described by Griffith (1990), for example, and with only one notable exception ( Japan s Earth Simulator), vector supercomputers have receded from view. Around the same time, more explicitly parallel systems were being developed and used (see, e.g., Armstrong, Pavlik, and Marciano 1994; Cramer and Armstrong 1999), but since these systems were also expensive and difficult to maintain, many using custom processors and interconnects, most of them became extinct during the mid-1990s. The availability of highspeed Internet, however, has enabled the high level of connectivity that would support the decentralized computation that has emerged to represent the current state of the practice; this trend is likely to persist well into the future, given the gathering force of recent nationallevel, high-performance network initiatives such as Internet2 ( and LambdaRail ( Since network speed and storage capacities are increasing at such a rapid pace, and since there are commensurate efforts to expand the reach of high-performance networks, a next logical step is to link large numbers of existing distributed computational resources together (now often sitting idle) so that they can be used to address computationally intensive problems. Most of the original work in this area has taken place using hardware that is under the specific control of a local administrator (e.g., a laboratory or university facility). Clusters such as those using the Beowulf architecture (Sterling et al. 1999) have become widespread; they exist on many campuses and are used in a variety of business applications. The cluster-computing approach does have two significant limitations, however. One pundit has compared this ap-

3 Using a Computational Grid for Geographic Information Analysis: A Reconnaissance 367 proach to amassing a large number of rowboats that, collectively, can never hope to fulfill the functions of a cruise ship. The argument here is that while large numbers of commodity-class processors can crunch numbers, they are inadequate for memory-intensive problems that require high performance I-O and memory subsystems. The ability to link heterogeneous collections of processors that may vary, for example, in clock frequency, RAM, disk capacity and architecture, is precisely where Grid concepts become relevant. A second problem is that the purchase of needed resources can become quite expensive. Total costs are reduced if resources are shared cooperatively, and, again, this is a key element of Grid computing. At the present time, a pattern in distributed computing resource use is beginning to emerge: problems that are relatively small can be addressed locally, using a Beowulf cluster for example, while large problems that require a considerable expenditure of processing time are shifting to national (or international) Grid constellations, such as the TeraGrid (Reed 2003). The use of distributed, shared resources has required the establishment of protocols that govern their use. In addition, these protocols are designed to support secure movement of information and processing capabilities. Grid computing has encouraged the establishment of flexibly defined virtual organizations (VO) that exist to support interdisciplinary and collaborative research. Such problems, involving, for example, linked human-environment interaction and models in sustainability science, might require expertise in climatology, hydrology, GIScience, computer science, economics, sociology, agriculture, and industrial ecology. Though sufficiently high levels of expertise in each of these areas might be co-located in a single place at a few locations in the world, it is far more likely that scientific expertise will be geographically distributed, perhaps on different continents. Each VO creates and enforces rules that define group membership and rights of access to Grid resources (Foster et al. 2002). Based on these rules, VO members are allowed to access domain-specific, problem-solving environments. In a later section of this article, we describe such an environment that is designed to support the analysis of point-based geographic information using the Grid. The architecture of the Grid is typically conceptualized as a sequence of layers. At the lowest level, the fabric layer specifies protocols and software interfaces that enable access to required hardware resources. Above this layer, middleware consists of specialized software that handles tasks such as user authentication, authorization, encryption, and secure execution; in many ways, this middleware layer holds the Grid together since it also links high-level applications software that performs needed analytical functions (e.g., spatial statistics software) to the low-level disk access and network protocols ( fabric). For example, VO members, such as a spatial statistician and epidemiologist, could access programs and data that are then divided into tasks that are allocated to execute on distributed computing resources. Middleware handles the decomposition of data and allocates tasks based on an assessment of resource availability and capacity, as well as the requirements of each application. Results are then collected (also using middleware) and visualized, and, based on the domain knowledge of users, parameters for otherwise prohibitively timeconsuming analyses can be modified in an exploratory fashion. Implementing a Grid Application Grid applications access distributed resources that may be linked by commodity-grade Internet connections with unknown and highly variable latencies. Consequently, they are normally based on a programming model that supports coarse-grained communication and problem decomposition. Two approaches can be used to accomplish such tasks (Laforenza 2002): The shared data approach can be implemented through the use of distributed virtual shared memory as exemplified by Linda (see Carriero and Gelernter 1990; Rokos and Armstrong 1996). The efficiency of this approach, however, is affected by communication latency, and, consequently, it is not widely adopted. The shared nothing approach, in contrast, has gained widespread acceptance and is used in the example application described in this article. This second approach relies on message passing to coordinate the execution of a balanced set of tasks that have

4 368 Volume 57, Number 3, August 2005 been decomposed for execution on distributed processors. Grid users do not need to know where a program is currently being executed or where intermediate results are being stored. The management of these tasks is handled by middleware, which sits between traditional operating systems (e.g., Linux or Windows) and Grid applications and acts as a metacomputing operating system (Smarr and Catlett 1992). It manages the complexity of securely using geographically distributed computing resources and hides this complexity from application developers and Grid end users. Indeed, middleware is sometimes referred to as plumbing since it connects applications and passes data between them (Laforenza 2002, 1739). Two Grid middleware architectures have been adopted: object oriented and bag of services. The object-oriented architecture takes advantage of object-oriented modeling capabilities such as polymorphism, encapsulation, overloading, and inheritance to address the complexity of designing and implementing Grid middleware. The bag-of-service model, on the other hand, aims to provide configurable middleware components that can be assembled to meet customizable needs from Grid users. Legion ( legion.virginia.edu/) is an object-oriented implementation of Grid middleware, while the Globus Toolkit ( is an exemplar of the bag-of-service approach. We have adopted Globus Toolkit for our research. Globus Toolkit The Globus Toolkit, a part of the National Science Foundation Middleware Initiative ( is becoming a de facto standard of Grid middleware. Globus adopts the hourglass model (National Research Council 1994) to construct interfaces between different layers of Grid middleware components (Foster, Kesselman, and Tuecke 2001) (Figure 1). The hourglass neck consists of the resource and connectivity layers shown in Figure 1. In these two layers, the Globus Toolkit integrates the following four protocols: GSI (Grid Security Infrastructure), based on the Public-Key Infrastructure ( html), is used for authentication, community protection, and authorization. GRAM (Globus Resource Access and Management) is used to interact with local job schedulers such as PBS ( and Condor ( that allocate computational resources for applications; it is also used to monitor and control computing processes. GridFTP extends the Internet File Transfer Protocol to support and manage secure data transfers in Grid environments. GRIP (Grid Resource Information Protocol) is used to define and represent Grid resource information. Higher-level Grid services are built upon these protocols to support application development. For example, DUROC (Dynamically Updated Request Online Co-allocator) is based on GRAM and is used to allocate a set of resources simultaneously. Application Development Application development requires domain-specific contextualization of the components in the top two (collective and application) Grid layers (Figure 1). In the collective layer, middleware development usually involves implementing resource discovery and broker utilities that are tailored to specific applications. Resource discovery utilities are constructed using monitoring and discovery services that are built using GRIP. Resource broker utilities include task scheduling as a major component, and their output normally takes the form of RSL (Resource Specification Language) scripts. These scripts describe resources and computing tasks, and specify how to match them with each other. For example, the following is an RSL script fragment: & (resourcemanagercontact ¼ rtgrid1.its. uiowa.edu/jobmanager-condor ) (maxtime ¼ 2) (count ¼ 1) (memory4 ¼ 256) (memoryo ¼ 512) (executable ¼./GridGiStar ) (arguments ¼ uniform4000.txt ) (directory ¼ /home/swang/gridgistar/ ) This RSL fragment states that the executable GridGiStar located in the /home/swang/ GridGiStar/ directory should be executed

5 Using a Computational Grid for Geographic Information Analysis: A Reconnaissance 369 Figure 1 Grid architecture, after Foster, Kesselman, and Tuecke (2001). once within two minutes on the resource rtgrid1.its.uiowa.edu with a memory size between 256MB and 512MB and that Condor should be used as the local job manager. The arguments statement specifies the particular dataset to be processed as well as the parameters that will control the execution of the distributed program. Additional functions, such as multicast I-O to distribute data across distributed systems and information collection about available resources, are also typically invoked using RSL scripts or other middleware components. Grid Testbed We have implemented a campus Grid testbed to support the development of prototype Gridbased geographic analysis applications and to conduct computational experiments. The testbed is comprised of three Linux clusters that are located in different campus buildings; their individual hardware configurations are shown in Table 1. The three clusters and a Grid client desktop system are connected through the optical fiber infrastructure of The University of Iowa. A snapshot of peer-to-peer network bandwidth information is provided in Figure 2. Globus Toolkit, Condor, and PBS are used as core Grid middleware components (Figure 3). In the next section we describe a spatial statistics problem that was addressed using this Grid testbed. An Illustrative Example:G n i (d) Spatial data typically consist of measurements of a variable of primary interest (here termed the response variable) at sites identified by spatial coordinates. A common research goal in analyzing such data is to identify clusters of points (sometimes called hot spots ) at which values of the response variable are unusually high or unusually low compared to the rest of the region. A related, but broader, goal is to identify pockets of spatial dependence. Getis and Ord (1992) introduced a statistic, G i *(d), that may be used for these purposes. Here we focus on the simplest version of the standardized form of this statistic that was further elucidated by Ord and Getis (1995). Let x i, i ¼ 1,...n, denote measurements of the response variable of interest taken at n distinct locations with known coordinates. One or more distances d of interest are chosen such that most locations in the dataset have at least one neighbor point within distance d. Then a spatial weight matrix {w ij (d)} is defined such that w ij (d) ¼ 1 if location i is within distance d of location j, and w ij (d) ¼ 0 otherwise; note that in the computation of G i *(d), w ii (d) ¼ 1. If we let Table 1 Resource Configurations of a Grid Testbed Name (Department) Processors Total Memory Total Local Disk Storage cluster0.its.uiowa.edu (ITS) 32 1 GHz 16 GB 640 GB 160 GB rtgrid1.its.uiowa.edu (ITS) 6 Athlon GHz 4 GB 240 GB 0.5 TB beowulf.stat.uiowa.edu (Statistics) GHz 14GB GB N/A

6 370 Volume 57, Number 3, August 2005 Figure 2 Network bandwidth of the Grid testbed. Wi ¼ P j w ijðdþ, S1i ¼ P j w2 ijðdþ, and x and s2 denote the standard sample mean and sample variance, then the value of G i *(d) is calculated as P j w ijðdþx j Wi x G i ðdþ ¼ ð1þ sf½ðns1i Þ W 2 i =ðn 1Þg 1=2 In preparation for calculating G i *(d), all the pairwise distances between measurement locations must be computed and the {w ij (d)} derived. Then the calculation in Equation 1 must be performed for each point i in the dataset. The worst-case time complexity of computing G i *(d) sequentially is O(n 3 ) because traversing a twodimensional distance matrix has O(n 2 ) complexity and computations involving the d parameter contribute an additional O(n) complexity. Experimental dataset A synthetic dataset of points (x, y, z) was generated in which each (x, y) pair represents the Euclidian coordinate of a point (i.e., location) and z represents a measured value of a variable at that point. The dataset has 24,000 measured points contained in the range [0 100, 0 100] that are distributed into twelve clusters (Figure 4). Several of these clusters overlap, but in each Figure 3 Software configuration of the Grid testbed.

7 Using a Computational Grid for Geographic Information Analysis: A Reconnaissance 371 Figure 4 Experimental dataset. one of them, points are distributed using a twodimensional normal probability density function, the standard deviation of which is two. The twelve clusters were created based on two density functions. One density function takes 4,000 points as input and uniformly distributes them into six clusters. The other density function takes the remaining 20,000 points as input and uniformly distributes them among the other six clusters. Application-specific Grid middleware for computing G i *(d) Specific components in the application and collective layers (Figure 3) must be developed to support the computation of G i *(d) using Grid resources. These application-specific Grid middleware components must accomplish a set of required tasks with an overarching goal of optimal performance and efficient use of Grid resources. Various issues such as load balancing, speed-up, and scalability are considered to achieve this goal. In our application, a domain decomposition algorithm was implemented that is based on a regular quadtree (Samet 1990) and a space-filling curve that uses a Morton ordering (Morton 1965; Tomlinson 1970, 75). A moving-window technique is applied to each quad cell so that distance computations are localized by the boundaries of the quad cell and its (8-connected) neighbors (cf. Goodchild and Mark 1987). The Morton ordering of cells is used to organize them into a number of local groups such that each group represents a task that is submitted to a Grid resource. The advantage of using a spacefilling curve is that it maintains data locality. This locality property can be used to reduce search and helps to define the set of tasks that are distributed to Grid resources. Task scheduling also takes account of the characteristics of the network (Figure 2) and Grid middleware (Figure 3) for load-balancing purposes. Performance A set of computational experiments was designed to evaluate the application-specific Grid middleware that is used to compute G i *(d) for the experimental dataset (Figure 4). These experiments emulate a situation in which the number of d values considered is incremented to meet data exploration needs. For example, in Table 2, each column represents one independent exploratory scenario. In order to create a basis for comparison, a sequential algorithm for computing G i *(d) was implemented first, and its running time for each independent scenario is shown in Table 2. For the same exploratory scenarios, the running time of the parallel algorithm based on the application-specific Grid middleware is shown in Table 3.

8 372 Volume 57, Number 3, August 2005 Table 2 Computing Time Required to Run a Sequential G i *(d) Algorithm on a Single Processor of the Head Node of rtgrid.its.uiowa.edu Cluster (the Best Computing Resource in the Grid Testbed) Using Several d Values as Input d values used to compute G i *(d) 2 2, 3 2, 3, 4 2, 3, 4, 5 2, 3, 4, 5, 6 Computing Time (seconds) Table 3 also shows the number of tasks computed by individual Grid resources as well as the speed-up of the Grid-based algorithm over the sequential algorithm. The number of tasks allocated to the three individual resources in our Grid testbed is determined not only by the resource hardware configurations (Table 1) but is also inversely proportional to the Grid middleware overhead that is added when remote computing and data transfers are involved. For example, although the resource beowulf.stat. uiowa.edu has better CPU performance than the resource cluster0.its.uiowa.edu, a larger number of tasks are allocated to cluster0 than to beowulf. This is because the PBS middleware used on cluster0 contributes less overhead than the Condor middleware used on the beowulf Cluster. At the same time, the overhead difference from the Grid middleware is more significant than the gain from the hardware configurations in our computational experiments. It is evident that for the more computationally intensive exploratory scenarios (i.e., as more d values are explored), greater speed-ups are achieved. As exploratory analyses become more computationally intensive, the overhead from Grid middleware components becomes proportionally less compared to the performance gain afforded by parallelism. Future Work: Creating Problem-Solving Environments Using Grid Portals During the mid-1990s, Gallopoulos, Houstis, and Rice (1994) developed the concept of a problem-solving environment ( PSE): a computational system that provides a set of high-level tools for solving problems. In particular, a PSE should provide transparent access to heterogeneous distributed computing resources (Laforenza 2002) and allow users to define and modify problems, choose solution strategies, visualize and analyze results, and record and coordinate extended problem-solving tasks. Though this view resembles many gardenvariety GIS and statistical software environments, we are further concerned with geographical problems that require significant computing, high-end, cartographic visualization tools and interaction among groups of individuals (Armstrong 1994; Jankowski and Nyerges 2001; MacEachren and Brewer 2004). This general class of problems is ideally suited for Grid environments. Ramakrishnan et al. (2002), for example, describe five PSE examples, and two of them have significant geographical components that involve computationally intensive models, mapped output, and input from domain experts in several fields. The development of domain-specific PSEs is regarded as an important focus of Grid computing research and development activity. For example, supported by the National Science Foundation Middleware Initiative, the OGCE (Open Grid Computing Environments) project was established in 2003 to foster collaborations and promote the development of sharable and reusable components within the Grid portal development community (Gannon et al. 2004). The OGCE focuses on establishing a repository of portal service components that can be used to develop domain-specific PSEs. Table 3 Computing Time and Speedup of the G i *(d) Algorithm Based on Application-Specific Grid Middleware Grid testbed resources Number of tasks scheduled on each Grid resource d ¼ 2 d ¼ 2, 3 d ¼ 2, 3, 4 d ¼ 2, 3, 4, 5 d ¼ 2, 3, 4, 5, 6 cluster0.its.uiowa.edu beowulf.stat.uiowa.edu rtgrid1.its.uiowa.edu Computing Time (seconds) Speedup

9 Using a Computational Grid for Geographic Information Analysis: A Reconnaissance 373 Figure 5A GeoPSE (geographical problem-solving environment) computational service selection interface. Increasingly, Grid applications software is accessed using high-level Grid portals that employ GUIs and Web service protocols to support software component compositions. Portals hide implementation-level details so that they might be used to concentrate on problemsolving tasks. Most computer users routinely access Web portals; for example, several news, shopping, and search portals are in wide use (e.g., Google) and, in the spatial domain, the Geodata portal ( is designed to serve as a data clearinghouse to help users find data resources with a minimum of effort. In fact, a design goal of some portals is that a user should be two clicks from entering the portal to accessing needed information. In this particular case, the general set of capabilities for a geographical problem-solving environment (GeoPSE) can be implemented using a Web portal. In the case of a GeoPSE, however, additional access to a set of specialized tools and data will be required. At the outset, a GeoPSE portal would need to be tuned narrowly to one or more applications, though, as time progresses, we envision generic GeoPSE portals that will provide access to a broad class of geographic information services. A pointbased GeoPSE will employ generic domain decomposition middleware similar to the quadtree approach described by Wang and Armstrong (2003). Localized tasks will be processed using distributed resources using application-specific Figure 5B Selection of domain decomposition and task scheduling services.

10 374 Volume 57, Number 3, August 2005 software that is organized into classes, such as spatial statistics and interpolation (see Figure 5). Each class is further refined to the point of a user selecting parameters such as the d used in our illustrative example, or the k parameter when k- closest neighbors are used by interpolation algorithms to create surfaces. We are currently developing a portal for a GeoPSE that is designed not only to support spatial statistics but also other types of computationally intensive analyses such as interpolation and Markovchain, Monte Carlo methods. Concluding Discussion Geographers and others involved in the use of computer technology to address complex environmental, social, and economic problems, must be knowledgeable about changes in the state of the practice in computing. At present, a sea change is taking place in the way that access to computational resources is occurring. Grid computing, which is enabled by a class of software called middleware, is markedly different from the high-performance computing environments of the previous decade. Though middleware that is tuned to a specific class of applications in geography remains to be fully developed, our initial attempts demonstrate that Grid technologies can be employed successfully to address a spatial statistics problem that shares several basic characteristics with other point-based analyses such as interpolation and optimization. We expect that middleware can be designed and implemented to handle flexible allocations of point data to distributed processors in the near term since our initial experience with quadtree-based decomposition appears to show considerable promise (Wang and Armstrong 2003). Though a considerable amount of work remains to be completed before a fully realized generic GeoPSE is available to support the analysis of point data, we are optimistic about its prognosis. Note 1 A terabyte is a measure of computer storage capacity and is 2 40 bytes or approximately a thousand billion bytes (a thousand gigabytes). A petabyte is 2 50 bytes or, approximately a thousand terabytes. Literature Cited Armstrong, Marc P Requirements for the development of GIS-based group decision support systems. Journal of the American Society for Information Science 45 (9): Geography and computational science. Annals of the Association of American Geographers 90: Armstrong, Marc P., Claire E. Pavlik, and Richard Marciano Experiments in the measurement of spatial association using a parallel supercomputer. Geographical Systems 1 (4): Armstrong, Marc P., Ningchuan Xiao, and David Bennett Using genetic algorithms to create multicriteria class intervals for choropleth maps. Annals of the Association of American Geographers 93: Bell, Gordon, and James Gray The revolution yet to happen. In Beyond calculation: The next fifty years of computing, ed. Peter Denning and Robert Metcalfe, New York: Springer-Verlag. Borkar, Shekhar Getting Gigascale chips: Challenges and opportunities in continuing Moore s law. ACM Queue 1 (7): Carriero, Nicholas, and David Gelernter How to write parallel programs: A first course. Cambridge, MA: MIT Press. Cowles, M. K Efficient model-fitting and model-comparison for high-dimensional Bayesian geostatistical models. Journal of Statistical Planning and Inference 112: Cramer, Barton E., and Marc P. Armstrong An evaluation of domain decomposition strategies for parallel spatial interpolation of surfaces. Geographical Analysis 31 (2): Foster, Ian The Grid: A new infrastructure for 21st century science. Physics Today, February: Foster, Ian, and Carl Kesselman The Grid: Blueprint for a new computing infrastructure. San Francisco: Morgan Kaufmann Publishers. Foster, Ian, Carl Kesselman, Jeffrey Nick, and Steve Tuecke Grid services for distributed system integration. IEEE Computer 35 (6): Foster, Ian, Carl Kesselman, and Steve Tuecke The anatomy of the Grid: Enabling scalable virtual organization. International Journal of High-Performance Computing Applications 15 (3): Gallopoulos, Efstratios, Elias Houstis, and John R. Rice Computer as thinker/doer: Problem solving environments for computational science. IEEE Computational Science and Engineering 2: Gannon, Dennis, Geoffrey Fox, Marlon Pierce, Beth Plale, Gregor Von Laszewski, Charles Severance, Joseph Hardin, Jay Alameda, Mary Thomas, and John Boisseau Grid portals: A scientist s access point for Grid services (draft 1). Global Grid

11 Using a Computational Grid for Geographic Information Analysis: A Reconnaissance 375 Forum technical document, available at: extreme.indiana.edu/gannon/ggf-portals-draft. pdf (last accessed 16 August 2004). Gao, Guang R., Kevin B. Theobald, Ziang Hu, Haiping Wu, Jizhu Lu, Keshav Pingali, Paul Stodghill, Thomas L. Sterling, Rick Stevens, and Mark Hereld Next generation system software for future high-end computing systems. International Parallel and Distributed Processing Symposium, Fort Lauderdale, FL. Getis, Arthur, and J. K. Ord The analysis of spatial association by distance statistics. Geographical Analysis 24: Goodchild, Michael, and David Mark The fractal nature of geographic phenomena. Annals of the Association of American Geographers 77: Griffith, Daniel A Supercomputing and spatial statistics: a reconnaissance. The Professional Geographer 42: Jankowski, Piotr, and Timothy Nyerges Geographic Information Systems for group decision making. London: Taylor & Francis. Laforenza, Domenico Grid programming: some indications where we are headed. Parallel Computing 28: Licklider, Joseph C. R. Lick, and Robert Taylor The computer as a communication device, available at: (last accessed 16 August 2004). MacEachren, Alan, and Isaac Brewer Developing a conceptual framework for visually-enabled geocollaboration. International Journal of Geographical Information Science 18 (1): Moore, Gordon E Cramming more components onto integrated circuits. Electronics 38 (8): Morton, Guy M A computer oriented geodetic data base and a new technique in file sequencing. Ottawa, Canada: IBM. National Research Council Realizing the information future: The Internet and beyond. National Academy Press, available at: readingroom/books/rtif/(last accessed 16 August 2004). Ord, J. Keith, and Arthur Getis Local spatial autocorrelation statistics: distributional issues and an application. Geographical Analysis 27: Rajasekar, Arcot, Michael Wan, and Reagan Moore MySRB & SRB Components of a data Grid. The 11th International Symposium on High Performance Distributed Computing (HPDC-11), July 24 26, Edinburgh, U.K. Ramakrishnan, Naren, Layne Watson, Dennis Kafura, Calvin Ribbens, and Clifford Shaffer Programming environments for multidisciplinary Grid communities. Concurrency and Computation: Practice and Experience 14: Reed, Daniel A Grids, the Teragrid, and beyond. IEEE Computer 36 (1): Rokos, Demitrius, and Marc P. Armstrong Using Linda to compute spatial autocorrelation in parallel. Computers & Geosciences 22 (5): Samet, Hanan Applications of spatial data structures. Reading, MA: Addison Wesley. Smarr, Larry Grids in context. In The Grid: Blueprint for a new computing infrastructure, ed. Ian Foster and Carl Kesselman, San Francisco: Morgan Kaufman. Smarr, Larry, and Charles Catlett Metacomputing. Communications of the Association for Computing Machinery 35 (6): Sterling, Thomas, John Salmon, Donald Becker, and Daniel Savarese How to build a Beowulf. Cambridge, MA: MIT Press. Thakar, Ani, Alexander Szalay, Peter Kunszt, and James Gray Migrating a multiterabyte archive from object to relational databases. Computing in Science and Engineering 5 (5): Tomlinson, Roger F Environment Information Systems. The Proceedings of the UNESCO/IGU First Symposium on Geographical Information Systems, Ottawa, September, 1970, a publication of the International Geographical Union Commission on Geographical Data Sensing and Processing. Wang, Shaowen, and Marc P. Armstrong A quadtree approach to domain decomposition for spatial interpolation in grid computing environments. Parallel Computing 29 (10): MARC P. ARMSTRONG is a Professor in the Department of Geography and the Program in Applied Mathematical and Computational Science at The University of Iowa, Iowa City, IA marc-armstrong@uiowa.edu. His research interests include computational geography, cartography, mobile computing, and spatial decision support systems. MARY KATHRYN COWLES is an Associate Professor in the Department of Statistics and Actuarial Science and the Department of Biostatistics at The University of Iowa, Iowa City, IA katecowles@uiowa.edu. Her research interests include Bayesian methods in biostatistics and environmental science, Markov chain Monte Carlo algorithms and convergence assessment, and statistical computing. SHAOWEN WANG is an Associate Research Scientist in the Academic Technologies Research Services Division of Information Technology Services and an adjunct Assistant Professor in the Department of Geography at The University of Iowa, Iowa City, IA shaowen-wang@uiowa.edu. His research interests include Grid computing, parallel and distributed computing, computationally intensive geographic analysis, and problem-solving environments.

A Theory of the Spatial Computational Domain

A Theory of the Spatial Computational Domain A Theory of the Spatial Computational Domain Shaowen Wang 1 and Marc P. Armstrong 2 1 Academic Technologies Research Services and Department of Geography, The University of Iowa Iowa City, IA 52242 Tel:

More information

Collaborative & Integrated Network & Systems Management: Management Using Grid Technologies

Collaborative & Integrated Network & Systems Management: Management Using Grid Technologies 2011 International Conference on Computer Communication and Management Proc.of CSIT vol.5 (2011) (2011) IACSIT Press, Singapore Collaborative & Integrated Network & Systems Management: Management Using

More information

Grid Scheduling Dictionary of Terms and Keywords

Grid Scheduling Dictionary of Terms and Keywords Grid Scheduling Dictionary Working Group M. Roehrig, Sandia National Laboratories W. Ziegler, Fraunhofer-Institute for Algorithms and Scientific Computing Document: Category: Informational June 2002 Status

More information

An Evaluation of Economy-based Resource Trading and Scheduling on Computational Power Grids for Parameter Sweep Applications

An Evaluation of Economy-based Resource Trading and Scheduling on Computational Power Grids for Parameter Sweep Applications An Evaluation of Economy-based Resource Trading and Scheduling on Computational Power Grids for Parameter Sweep Applications Rajkumar Buyya, Jonathan Giddy, and David Abramson School of Computer Science

More information

Web Service Based Data Management for Grid Applications

Web Service Based Data Management for Grid Applications Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications

More information

Resource Management on Computational Grids

Resource Management on Computational Grids Univeristà Ca Foscari, Venezia http://www.dsi.unive.it Resource Management on Computational Grids Paolo Palmerini Dottorato di ricerca di Informatica (anno I, ciclo II) email: palmeri@dsi.unive.it 1/29

More information

An approach to grid scheduling by using Condor-G Matchmaking mechanism

An approach to grid scheduling by using Condor-G Matchmaking mechanism An approach to grid scheduling by using Condor-G Matchmaking mechanism E. Imamagic, B. Radic, D. Dobrenic University Computing Centre, University of Zagreb, Croatia {emir.imamagic, branimir.radic, dobrisa.dobrenic}@srce.hr

More information

KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery

KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery Mario Cannataro 1 and Domenico Talia 2 1 ICAR-CNR 2 DEIS Via P. Bucci, Cubo 41-C University of Calabria 87036 Rende (CS) Via P. Bucci,

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

Cluster, Grid, Cloud Concepts

Cluster, Grid, Cloud Concepts Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of

More information

An Grid Service Module for Natural Resource Managers

An Grid Service Module for Natural Resource Managers An Grid Service Module for Natural Resource Managers Dali Wang 1, Eric Carr 1, Mark Palmer 1, Michael W. Berry 2 and Louis J. Gross 1 1 The Institute for Environmental Modeling 569 Dabney Hall, University

More information

THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING. José Daniel García Sánchez ARCOS Group University Carlos III of Madrid

THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING. José Daniel García Sánchez ARCOS Group University Carlos III of Madrid THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING José Daniel García Sánchez ARCOS Group University Carlos III of Madrid Contents 2 The ARCOS Group. Expand motivation. Expand

More information

A High-Performance Virtual Storage System for Taiwan UniGrid

A High-Performance Virtual Storage System for Taiwan UniGrid Journal of Information Technology and Applications Vol. 1 No. 4 March, 2007, pp. 231-238 A High-Performance Virtual Storage System for Taiwan UniGrid Chien-Min Wang; Chun-Chen Hsu and Jan-Jan Wu Institute

More information

Load Balancing on a Non-dedicated Heterogeneous Network of Workstations

Load Balancing on a Non-dedicated Heterogeneous Network of Workstations Load Balancing on a Non-dedicated Heterogeneous Network of Workstations Dr. Maurice Eggen Nathan Franklin Department of Computer Science Trinity University San Antonio, Texas 78212 Dr. Roger Eggen Department

More information

Scheduling and Resource Management in Computational Mini-Grids

Scheduling and Resource Management in Computational Mini-Grids Scheduling and Resource Management in Computational Mini-Grids July 1, 2002 Project Description The concept of grid computing is becoming a more and more important one in the high performance computing

More information

GridFTP: A Data Transfer Protocol for the Grid

GridFTP: A Data Transfer Protocol for the Grid GridFTP: A Data Transfer Protocol for the Grid Grid Forum Data Working Group on GridFTP Bill Allcock, Lee Liming, Steven Tuecke ANL Ann Chervenak USC/ISI Introduction In Grid environments,

More information

THE CCLRC DATA PORTAL

THE CCLRC DATA PORTAL THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims

More information

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago Globus Striped GridFTP Framework and Server Raj Kettimuthu, ANL and U. Chicago Outline Introduction Features Motivation Architecture Globus XIO Experimental Results 3 August 2005 The Ohio State University

More information

22S:295 Seminar in Applied Statistics High Performance Computing in Statistics

22S:295 Seminar in Applied Statistics High Performance Computing in Statistics 22S:295 Seminar in Applied Statistics High Performance Computing in Statistics Luke Tierney Department of Statistics & Actuarial Science University of Iowa August 30, 2007 Luke Tierney (U. of Iowa) HPC

More information

Service Oriented Distributed Manager for Grid System

Service Oriented Distributed Manager for Grid System Service Oriented Distributed Manager for Grid System Entisar S. Alkayal Faculty of Computing and Information Technology King Abdul Aziz University Jeddah, Saudi Arabia entisar_alkayal@hotmail.com Abstract

More information

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4 Concepts and Architecture of the Grid Summary of Grid 2, Chapter 4 Concepts of Grid Mantra: Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations Allows

More information

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland The Lattice Project: A Multi-Model Grid Computing System Center for Bioinformatics and Computational Biology University of Maryland Parallel Computing PARALLEL COMPUTING a form of computation in which

More information

Understanding the Benefits of IBM SPSS Statistics Server

Understanding the Benefits of IBM SPSS Statistics Server IBM SPSS Statistics Server Understanding the Benefits of IBM SPSS Statistics Server Contents: 1 Introduction 2 Performance 101: Understanding the drivers of better performance 3 Why performance is faster

More information

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007 Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the

More information

Overlapping Data Transfer With Application Execution on Clusters

Overlapping Data Transfer With Application Execution on Clusters Overlapping Data Transfer With Application Execution on Clusters Karen L. Reid and Michael Stumm reid@cs.toronto.edu stumm@eecg.toronto.edu Department of Computer Science Department of Electrical and Computer

More information

SYLLABUS. 1 seminar/laboratory 3.4 Total hours in the curriculum 42 Of which: 3.5 course 28 3.6

SYLLABUS. 1 seminar/laboratory 3.4 Total hours in the curriculum 42 Of which: 3.5 course 28 3.6 SYLLABUS 1. Information regarding the programme 1.1 Higher education Babeş-Bolyai University institution 1.2 Faculty Faculty of Mathematics and Computer Science 1.3 Department Department of Computer Science

More information

Grid Computing: A Ten Years Look Back. María S. Pérez Facultad de Informática Universidad Politécnica de Madrid mperez@fi.upm.es

Grid Computing: A Ten Years Look Back. María S. Pérez Facultad de Informática Universidad Politécnica de Madrid mperez@fi.upm.es Grid Computing: A Ten Years Look Back María S. Pérez Facultad de Informática Universidad Politécnica de Madrid mperez@fi.upm.es Outline Challenges not yet solved in computing The parents of grid Computing

More information

A Survey Study on Monitoring Service for Grid

A Survey Study on Monitoring Service for Grid A Survey Study on Monitoring Service for Grid Erkang You erkyou@indiana.edu ABSTRACT Grid is a distributed system that integrates heterogeneous systems into a single transparent computer, aiming to provide

More information

DiPerF: automated DIstributed PERformance testing Framework

DiPerF: automated DIstributed PERformance testing Framework DiPerF: automated DIstributed PERformance testing Framework Ioan Raicu, Catalin Dumitrescu, Matei Ripeanu Distributed Systems Laboratory Computer Science Department University of Chicago Ian Foster Mathematics

More information

Performance of the NAS Parallel Benchmarks on Grid Enabled Clusters

Performance of the NAS Parallel Benchmarks on Grid Enabled Clusters Performance of the NAS Parallel Benchmarks on Grid Enabled Clusters Philip J. Sokolowski Dept. of Electrical and Computer Engineering Wayne State University 55 Anthony Wayne Dr., Detroit, MI 4822 phil@wayne.edu

More information

The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets Ann Chervenak Ian Foster $+ Carl Kesselman Charles Salisbury $ Steven Tuecke $ Information

More information

Grid Technology and Information Management for Command and Control

Grid Technology and Information Management for Command and Control Grid Technology and Information Management for Command and Control Dr. Scott E. Spetka Dr. George O. Ramseyer* Dr. Richard W. Linderman* ITT Industries Advanced Engineering and Sciences SUNY Institute

More information

Concepts and Architecture of Grid Computing. Advanced Topics Spring 2008 Prof. Robert van Engelen

Concepts and Architecture of Grid Computing. Advanced Topics Spring 2008 Prof. Robert van Engelen Concepts and Architecture of Grid Computing Advanced Topics Spring 2008 Prof. Robert van Engelen Overview Grid users: who are they? Concept of the Grid Challenges for the Grid Evolution of Grid systems

More information

Distributed Systems and Recent Innovations: Challenges and Benefits

Distributed Systems and Recent Innovations: Challenges and Benefits Distributed Systems and Recent Innovations: Challenges and Benefits 1. Introduction Krishna Nadiminti, Marcos Dias de Assunção, and Rajkumar Buyya Grid Computing and Distributed Systems Laboratory Department

More information

Grid Computing vs Cloud

Grid Computing vs Cloud Chapter 3 Grid Computing vs Cloud Computing 3.1 Grid Computing Grid computing [8, 23, 25] is based on the philosophy of sharing information and power, which gives us access to another type of heterogeneous

More information

Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis. Opportunities in Power Distribution Network System Optimization (from EDA Perspective)

Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis. Opportunities in Power Distribution Network System Optimization (from EDA Perspective) Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis Opportunities in Power Distribution Network System Optimization (from EDA Perspective) Outline! SmartGrid: What it is! Power Distribution Network

More information

Optimizing the Virtual Data Center

Optimizing the Virtual Data Center Optimizing the Virtual Center The ideal virtual data center dynamically balances workloads across a computing cluster and redistributes hardware resources among clusters in response to changing needs.

More information

Hadoop Cluster Applications

Hadoop Cluster Applications Hadoop Overview Data analytics has become a key element of the business decision process over the last decade. Classic reporting on a dataset stored in a database was sufficient until recently, but yesterday

More information

Deploying a distributed data storage system on the UK National Grid Service using federated SRB

Deploying a distributed data storage system on the UK National Grid Service using federated SRB Deploying a distributed data storage system on the UK National Grid Service using federated SRB Manandhar A.S., Kleese K., Berrisford P., Brown G.D. CCLRC e-science Center Abstract As Grid enabled applications

More information

Grid Computing. Making the Global Infrastructure a Reality. Wiley Series on Communications Networking & Distributed Systems

Grid Computing. Making the Global Infrastructure a Reality. Wiley Series on Communications Networking & Distributed Systems Brochure More information from http://www.researchandmarkets.com/reports/2172168/ Grid Computing. Making the Global Infrastructure a Reality. Wiley Series on Communications Networking & Distributed Systems

More information

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage White Paper Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage A Benchmark Report August 211 Background Objectivity/DB uses a powerful distributed processing architecture to manage

More information

Grid based Integration of Real-Time Value-at-Risk (VaR) Services. Abstract

Grid based Integration of Real-Time Value-at-Risk (VaR) Services. Abstract Grid based Integration of Real-Time Value-at-Risk (VaR) s Paul Donachy Daniel Stødle Terrence J harmer Ron H Perrott Belfast e-science Centre www.qub.ac.uk/escience Brian Conlon Gavan Corr First Derivatives

More information

CHAPTER 5 IMPLEMENTATION OF THE PROPOSED GRID NETWORK MONITORING SYSTEM IN CRB

CHAPTER 5 IMPLEMENTATION OF THE PROPOSED GRID NETWORK MONITORING SYSTEM IN CRB 60 CHAPTER 5 IMPLEMENTATION OF THE PROPOSED GRID NETWORK MONITORING SYSTEM IN CRB This chapter discusses the implementation details of the proposed grid network monitoring system, and its integration with

More information

Cray: Enabling Real-Time Discovery in Big Data

Cray: Enabling Real-Time Discovery in Big Data Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects

More information

A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment

A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment Arshad Ali 3, Ashiq Anjum 3, Atif Mehmood 3, Richard McClatchey 2, Ian Willers 2, Julian Bunn

More information

DATA MANAGEMENT, CODE DEPLOYMENT, AND SCIENTIFIC VISUALLIZATION TO ENHANCE SCIENTIFIC DISCOVERY IN FUSION RESEARCH THROUGH ADVANCED COMPUTING

DATA MANAGEMENT, CODE DEPLOYMENT, AND SCIENTIFIC VISUALLIZATION TO ENHANCE SCIENTIFIC DISCOVERY IN FUSION RESEARCH THROUGH ADVANCED COMPUTING DATA MANAGEMENT, CODE DEPLOYMENT, AND SCIENTIFIC VISUALLIZATION TO ENHANCE SCIENTIFIC DISCOVERY IN FUSION RESEARCH THROUGH ADVANCED COMPUTING D.P. Schissel, 1 A. Finkelstein, 2 I.T. Foster, 3 T.W. Fredian,

More information

An Analysis of Quality of Service Metrics and Frameworks in a Grid Computing Environment

An Analysis of Quality of Service Metrics and Frameworks in a Grid Computing Environment An Analysis of Quality of Service Metrics and Frameworks in a Grid Computing Environment Russ Wakefield Colorado State University Ft. Collins, Colorado May 4 th, 2007 Abstract One of the fastest emerging

More information

A Flexible Cluster Infrastructure for Systems Research and Software Development

A Flexible Cluster Infrastructure for Systems Research and Software Development Award Number: CNS-551555 Title: CRI: Acquisition of an InfiniBand Cluster with SMP Nodes Institution: Florida State University PIs: Xin Yuan, Robert van Engelen, Kartik Gopalan A Flexible Cluster Infrastructure

More information

Dynamic allocation of servers to jobs in a grid hosting environment

Dynamic allocation of servers to jobs in a grid hosting environment Dynamic allocation of s to in a grid hosting environment C Kubicek, M Fisher, P McKee and R Smith As computational resources become available for use over the Internet, a requirement has emerged to reconfigure

More information

Article on Grid Computing Architecture and Benefits

Article on Grid Computing Architecture and Benefits Article on Grid Computing Architecture and Benefits Ms. K. Devika Rani Dhivya 1, Mrs. C. Sunitha 2 1 Assistant Professor, Dept. of BCA & MSc.SS, Sri Krishna Arts and Science College, CBE, Tamil Nadu, India.

More information

Parallel Visualization for GIS Applications

Parallel Visualization for GIS Applications Parallel Visualization for GIS Applications Alexandre Sorokine, Jamison Daniel, Cheng Liu Oak Ridge National Laboratory, Geographic Information Science & Technology, PO Box 2008 MS 6017, Oak Ridge National

More information

A Chromium Based Viewer for CUMULVS

A Chromium Based Viewer for CUMULVS A Chromium Based Viewer for CUMULVS Submitted to PDPTA 06 Dan Bennett Corresponding Author Department of Mathematics and Computer Science Edinboro University of PA Edinboro, Pennsylvania 16444 Phone: (814)

More information

Distance Visualization: Data Exploration on the Grid

Distance Visualization: Data Exploration on the Grid Distance Visualization: Data Exploration on the Grid Ian Foster *+, Joseph Insley *, Carl Kesselman @, Gregor von Laszewski *, Marcus Thiebaux @ * Mathematics and Computer Science Division, Argonne National

More information

CHAPTER 5 WLDMA: A NEW LOAD BALANCING STRATEGY FOR WAN ENVIRONMENT

CHAPTER 5 WLDMA: A NEW LOAD BALANCING STRATEGY FOR WAN ENVIRONMENT 81 CHAPTER 5 WLDMA: A NEW LOAD BALANCING STRATEGY FOR WAN ENVIRONMENT 5.1 INTRODUCTION Distributed Web servers on the Internet require high scalability and availability to provide efficient services to

More information

Star System. 2004 Deitel & Associates, Inc. All rights reserved.

Star System. 2004 Deitel & Associates, Inc. All rights reserved. Star System Apple Macintosh 1984 First commercial OS GUI Chapter 1 Introduction to Operating Systems Outline 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 1.10 1.11 1.12 Introduction What Is an Operating System?

More information

Grid Computing With FreeBSD

Grid Computing With FreeBSD Grid Computing With FreeBSD USENIX ATC '04: UseBSD SIG Boston, MA, June 29 th 2004 Brooks Davis, Craig Lee The Aerospace Corporation El Segundo, CA {brooks,lee}aero.org http://people.freebsd.org/~brooks/papers/usebsd2004/

More information

COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1)

COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1) COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1) Mary Thomas Department of Computer Science Computational Science Research Center (CSRC) San Diego State University

More information

Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace

Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace Beth Plale Indiana University plale@cs.indiana.edu LEAD TR 001, V3.0 V3.0 dated January 24, 2007 V2.0 dated August

More information

Big Data Challenges in Bioinformatics

Big Data Challenges in Bioinformatics Big Data Challenges in Bioinformatics BARCELONA SUPERCOMPUTING CENTER COMPUTER SCIENCE DEPARTMENT Autonomic Systems and ebusiness Pla?orms Jordi Torres Jordi.Torres@bsc.es Talk outline! We talk about Petabyte?

More information

IBM Global Technology Services September 2007. NAS systems scale out to meet growing storage demand.

IBM Global Technology Services September 2007. NAS systems scale out to meet growing storage demand. IBM Global Technology Services September 2007 NAS systems scale out to meet Page 2 Contents 2 Introduction 2 Understanding the traditional NAS role 3 Gaining NAS benefits 4 NAS shortcomings in enterprise

More information

Web Service Robust GridFTP

Web Service Robust GridFTP Web Service Robust GridFTP Sang Lim, Geoffrey Fox, Shrideep Pallickara and Marlon Pierce Community Grid Labs, Indiana University 501 N. Morton St. Suite 224 Bloomington, IN 47404 {sblim, gcf, spallick,

More information

An Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services

An Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services An Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services Abstract In this paper review the notion of the use of mobile device in grid computing environment, We describe

More information

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging In some markets and scenarios where competitive advantage is all about speed, speed is measured in micro- and even nano-seconds.

More information

MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper

MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper Migrating Desktop and Roaming Access Whitepaper Poznan Supercomputing and Networking Center Noskowskiego 12/14 61-704 Poznan, POLAND 2004, April white-paper-md-ras.doc 1/11 1 Product overview In this whitepaper

More information

COMPUTER SCIENCE AND ENGINEERING - Microprocessor Systems - Mitchell Aaron Thornton

COMPUTER SCIENCE AND ENGINEERING - Microprocessor Systems - Mitchell Aaron Thornton MICROPROCESSOR SYSTEMS Mitchell Aaron Thornton, Department of Electrical and Computer Engineering, Mississippi State University, PO Box 9571, Mississippi State, MS, 39762-9571, United States. Keywords:

More information

Grid Portal Development for Sensing Data Retrieval and Processing

Grid Portal Development for Sensing Data Retrieval and Processing Grid Portal Development for Sensing Data Retrieval and Processing 1 Diego Arias, Mariana Mendoza, Fernando Cintron, Kennie Cruz, and Wilson Rivera Parallel and Distributed Computing Laboratory University

More information

Digital libraries of the future and the role of libraries

Digital libraries of the future and the role of libraries Digital libraries of the future and the role of libraries Donatella Castelli ISTI-CNR, Pisa, Italy Abstract Purpose: To introduce the digital libraries of the future, their enabling technologies and their

More information

Workflow Automation and Management Services in Web 2.0: An Object-Based Approach to Distributed Workflow Enactment

Workflow Automation and Management Services in Web 2.0: An Object-Based Approach to Distributed Workflow Enactment Workflow Automation and Management Services in Web 2.0: An Object-Based Approach to Distributed Workflow Enactment Peter Y. Wu wu@rmu.edu Department of Computer & Information Systems Robert Morris University

More information

WATER INTERACTIONS WITH ENERGY, ENVIRONMENT AND FOOD & AGRICULTURE Vol. II Spatial Data Handling and GIS - Atkinson, P.M.

WATER INTERACTIONS WITH ENERGY, ENVIRONMENT AND FOOD & AGRICULTURE Vol. II Spatial Data Handling and GIS - Atkinson, P.M. SPATIAL DATA HANDLING AND GIS Atkinson, P.M. School of Geography, University of Southampton, UK Keywords: data models, data transformation, GIS cycle, sampling, GIS functionality Contents 1. Background

More information

Ubiquitous Analytics: Interacting with Big Data Anywhere, Anytime

Ubiquitous Analytics: Interacting with Big Data Anywhere, Anytime Ubiquitous Analytics: Interacting with Big Data Anywhere, Anytime Niklas Elmqvist (Purdue University) and Pourang Irani (University of Manitoba) Ubiquitous analytics amplifies human cognition by embedding

More information

Improving Performance in Load Balancing Problem on the Grid Computing System

Improving Performance in Load Balancing Problem on the Grid Computing System Improving Performance in Problem on the Grid Computing System Prabhat Kr.Srivastava IIMT College of Engineering Greater Noida, India Sonu Gupta IIMT College of Engineering Greater Noida, India Dheerendra

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research

More information

The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage

The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage sponsored by Dan Sullivan Chapter 1: Advantages of Hybrid Storage... 1 Overview of Flash Deployment in Hybrid Storage Systems...

More information

16th International Conference on Control Systems and Computer Science (CSCS16 07)

16th International Conference on Control Systems and Computer Science (CSCS16 07) 16th International Conference on Control Systems and Computer Science (CSCS16 07) TOWARDS AN IO INTENSIVE GRID APPLICATION INSTRUMENTATION IN MEDIOGRID Dacian Tudor 1, Florin Pop 2, Valentin Cristea 2,

More information

How To Build A Cloud Computer

How To Build A Cloud Computer Introducing the Singlechip Cloud Computer Exploring the Future of Many-core Processors White Paper Intel Labs Jim Held Intel Fellow, Intel Labs Director, Tera-scale Computing Research Sean Koehl Technology

More information

CHAPTER 2 MODELLING FOR DISTRIBUTED NETWORK SYSTEMS: THE CLIENT- SERVER MODEL

CHAPTER 2 MODELLING FOR DISTRIBUTED NETWORK SYSTEMS: THE CLIENT- SERVER MODEL CHAPTER 2 MODELLING FOR DISTRIBUTED NETWORK SYSTEMS: THE CLIENT- SERVER MODEL This chapter is to introduce the client-server model and its role in the development of distributed network systems. The chapter

More information

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router HyperQ Hybrid Flash Storage Made Easy White Paper Parsec Labs, LLC. 7101 Northland Circle North, Suite 105 Brooklyn Park, MN 55428 USA 1-763-219-8811 www.parseclabs.com info@parseclabs.com sales@parseclabs.com

More information

2 Transport-level and Message-level Security

2 Transport-level and Message-level Security Globus Toolkit Version 4 Grid Security Infrastructure: A Standards Perspective The Globus Security Team 1 Version 4 updated September 12, 2005 Abstract This document provides an overview of the Grid Security

More information

Relational Join Queries Over Grid-aware Architectures

Relational Join Queries Over Grid-aware Architectures University of Crete Computer Science Department Master Thesis A Framework for High-Performance Relational Join Queries Scheduling in Distributed Database Systems over Grid-aware Architectures Author Supervisor

More information

The Accounting Information Sharing Model for ShanghaiGrid 1

The Accounting Information Sharing Model for ShanghaiGrid 1 The Accounting Information Sharing Model for ShanghaiGrid 1 Jiadi Yu, Minglu Li, Ying Li, Feng Hong Department of Computer Science and Engineering,Shanghai Jiao Tong University, Shanghai 200030, P.R.China

More information

Introduction. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Introduction. MCSN N. Tonellotto Complements of Distributed Enabling Platforms Introduction 1 Distributed relating to a computer network in which at least some of the processing is done by the individual computers and information is shared by and often stored at the computers Enabling

More information

A novel load balancing algorithm for computational grid

A novel load balancing algorithm for computational grid International Journal of Computational Intelligence Techniques, ISSN: 0976 0466 & E-ISSN: 0976 0474 Volume 1, Issue 1, 2010, PP-20-26 A novel load balancing algorithm for computational grid Saravanakumar

More information

An objective comparison test of workload management systems

An objective comparison test of workload management systems An objective comparison test of workload management systems Igor Sfiligoi 1 and Burt Holzman 1 1 Fermi National Accelerator Laboratory, Batavia, IL 60510, USA E-mail: sfiligoi@fnal.gov Abstract. The Grid

More information

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect a.jackson@epcc.ed.ac.uk

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect a.jackson@epcc.ed.ac.uk HPC and Big Data EPCC The University of Edinburgh Adrian Jackson Technical Architect a.jackson@epcc.ed.ac.uk EPCC Facilities Technology Transfer European Projects HPC Research Visitor Programmes Training

More information

Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand

Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand P. Balaji, K. Vaidyanathan, S. Narravula, K. Savitha, H. W. Jin D. K. Panda Network Based

More information

A Review of Customized Dynamic Load Balancing for a Network of Workstations

A Review of Customized Dynamic Load Balancing for a Network of Workstations A Review of Customized Dynamic Load Balancing for a Network of Workstations Taken from work done by: Mohammed Javeed Zaki, Wei Li, Srinivasan Parthasarathy Computer Science Department, University of Rochester

More information

4-3 Grid Communication Library Allowing for Dynamic Firewall Control

4-3 Grid Communication Library Allowing for Dynamic Firewall Control 4-3 Grid Communication Library Allowing for Dynamic Firewall Control HASEGAWA Ichiro, BABA Ken-ichi, and SHIMOJO Shinji Current Grid technologies tend not to consider firewall system properly, and as a

More information

Client/Server Computing Distributed Processing, Client/Server, and Clusters

Client/Server Computing Distributed Processing, Client/Server, and Clusters Client/Server Computing Distributed Processing, Client/Server, and Clusters Chapter 13 Client machines are generally single-user PCs or workstations that provide a highly userfriendly interface to the

More information

COMPUTER GRAPHICS AND INTERACTIVE SYSTEMS LABORATORY

COMPUTER GRAPHICS AND INTERACTIVE SYSTEMS LABORATORY COMPUTER GRAPHICS AND INTERACTIVE SYSTEMS LABORATORY Contact details Name Acronym Logo Computer Graphics and Interactive Systems Laboratory CGIS Site Address Faculty Department http://cgis.utcluj.ro Telephone

More information

RevoScaleR Speed and Scalability

RevoScaleR Speed and Scalability EXECUTIVE WHITE PAPER RevoScaleR Speed and Scalability By Lee Edlefsen Ph.D., Chief Scientist, Revolution Analytics Abstract RevoScaleR, the Big Data predictive analytics library included with Revolution

More information

Software Development around a Millisecond

Software Development around a Millisecond Introduction Software Development around a Millisecond Geoffrey Fox In this column we consider software development methodologies with some emphasis on those relevant for large scale scientific computing.

More information

IMPLEMENTING GREEN IT

IMPLEMENTING GREEN IT Saint Petersburg State University of Information Technologies, Mechanics and Optics Department of Telecommunication Systems IMPLEMENTING GREEN IT APPROACH FOR TRANSFERRING BIG DATA OVER PARALLEL DATA LINK

More information

A High Performance Computing Scheduling and Resource Management Primer

A High Performance Computing Scheduling and Resource Management Primer LLNL-TR-652476 A High Performance Computing Scheduling and Resource Management Primer D. H. Ahn, J. E. Garlick, M. A. Grondona, D. A. Lipari, R. R. Springmeyer March 31, 2014 Disclaimer This document was

More information

Data Mining for Data Cloud and Compute Cloud

Data Mining for Data Cloud and Compute Cloud Data Mining for Data Cloud and Compute Cloud Prof. Uzma Ali 1, Prof. Punam Khandar 2 Assistant Professor, Dept. Of Computer Application, SRCOEM, Nagpur, India 1 Assistant Professor, Dept. Of Computer Application,

More information

Symmetric Multiprocessing

Symmetric Multiprocessing Multicore Computing A multi-core processor is a processing system composed of two or more independent cores. One can describe it as an integrated circuit to which two or more individual processors (called

More information

CloudAnalyst: A CloudSim-based Tool for Modelling and Analysis of Large Scale Cloud Computing Environments

CloudAnalyst: A CloudSim-based Tool for Modelling and Analysis of Large Scale Cloud Computing Environments 433-659 DISTRIBUTED COMPUTING PROJECT, CSSE DEPT., UNIVERSITY OF MELBOURNE CloudAnalyst: A CloudSim-based Tool for Modelling and Analysis of Large Scale Cloud Computing Environments MEDC Project Report

More information

Computer Network. Interconnected collection of autonomous computers that are able to exchange information

Computer Network. Interconnected collection of autonomous computers that are able to exchange information Introduction Computer Network. Interconnected collection of autonomous computers that are able to exchange information No master/slave relationship between the computers in the network Data Communications.

More information

On-Demand Supercomputing Multiplies the Possibilities

On-Demand Supercomputing Multiplies the Possibilities Microsoft Windows Compute Cluster Server 2003 Partner Solution Brief Image courtesy of Wolfram Research, Inc. On-Demand Supercomputing Multiplies the Possibilities Microsoft Windows Compute Cluster Server

More information

Using In-Memory Computing to Simplify Big Data Analytics

Using In-Memory Computing to Simplify Big Data Analytics SCALEOUT SOFTWARE Using In-Memory Computing to Simplify Big Data Analytics by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T he big data revolution is upon us, fed

More information

What is Analytic Infrastructure and Why Should You Care?

What is Analytic Infrastructure and Why Should You Care? What is Analytic Infrastructure and Why Should You Care? Robert L Grossman University of Illinois at Chicago and Open Data Group grossman@uic.edu ABSTRACT We define analytic infrastructure to be the services,

More information