A Grid-enabled Science Portal for Collaborative. Coastal Modeling

Size: px
Start display at page:

Download "A Grid-enabled Science Portal for Collaborative. Coastal Modeling"

Transcription

1

2 A Grid-enabled Science Portal for Collaborative Coastal Modeling Master of Science, Systems Science Project Report submitted to Department of Computer Science, Louisiana State University Chongjie Zhang March 28, 2006 Department of Computer Science, Louisiana State University, Baton Rouge, LA

3 Abstract The Southeastern United States is regularly impacted by severe ocean-driven events such as hurricanes that affect the lives of hundreds of thousands of citizens. It is urgent to improve our ability to predict critical coastal phenomena within the region. The SCOOP program is intended to create an open access, distributed laboratory for scientific research and coastal operations by integrating the coastal data, computing resources, and research efforts from its partners. A Grid Computing infrastructure is deployed to support researchers work by allow sharing expertise, software, and data across multiple institutions. This report presents the SCOOP portal, a Grid-enabled science portal, developed for the coastal research community to reduce the complexity of high performance computing for end users and enable advanced application scenarios. Since Grid portal toolkits are increasing being adopted as a means to speed up the Grid portal development, we extensively investigate and compare two major Grid portal toolkits, OGCE and GridPortlets. Then details about the SCOOP portal development are described, including use cases, design, and implementation. The SCOOP portal, built with the GridSphere Framework, currently integrates customized Grid portlet components for data access, job submission, resource management and notification. i

4 Acknowledgements I express my sincere gratitude to everyone in any way connected to the work presented in this report. First and foremost I would like to thank Gabrielle Allen for her constant support, guidance, encouragement and invaluable time that she put in to help me make progress towards completion of this project. I also would like to thank the rest of my program committee, Jianhua Chen and Bert R. Boyce, for their helpful comments and guidance. My work benefits greatly from a friendly and productive cooperation with other members in the SCOOP team and Grid research group at the Center for Computation and Technology. My report makes use of two papers for which I am the primary author, but which contain constructive comments and suggestions from other authors. I am specially grateful to to Jon MacLaren for his effective management on SCOOP project and work on archive services; to Ian Kelley for his comments and suggestions on Grid portal development; and to Chirag Dekate for his ideas and work on coastal modeling scenarios. I would like to sincerely thank Michael Russell, Jason Novotny, and Oliver Wehrens for their support on GridSphere and GridPortlets during the project. Finally, I would like to thank Xiaoxi Xu for keeping me happy and sane throughout the process, my father and my mother for teaching me early on to love learning, and my brother, whose enthusiasm and support helped me to reach this point. ii

5 Acknowledgements iii This project was carried out as a component of the SURA Coastal Ocean Observing and Prediction (SCOOP) Program, an initiative of the Southeastern Universities Research Association (SURA). Funding support for SCOOP has been provided by the Office of Naval Research, Award #N and by the National Oceanic and Atmospheric Administration s NOAA Ocean Service, Award #NA04NOS Additional support was provided by the Center for Computation & Technology at Louisiana State University.

6 Table of Contents 1 Introduction Major Contributions Outline Grid Portal Development Grid Computing Grid Portal Toolkits GridPortlets OGCE Comparison between GridPortlets and OGCE Collaborative Coastal Modeling Goals Science Scenarios CCT Role Infrastructure SCOOP Portal Requirements Choice of Grid Portal Toolkits Design Implementation Archive SCOOP Job Management Conclusion 36 iv

7 List of Figures 2.1 GridPortlet Architecture OGCE 2 Architecture SCOOP Grid Infrastructure SCOOP Portal architecture based on GridSphere Archive portlet using metadata for querying and retrieving SCOOP data files from storage Copy retrieved data files to a remote machine Manage files through physical file management portlet Job submission portlet for running ensembles of simulation models on different types of data Job submission portlet for running ensembles of simulation models on different types of data Show output of each sub-job run v

8 List of Tables 2.1 Comparison between GridPortlets and OGCE vi

9 Chapter 1 Introduction The Southeastern United States hosts roughly 100 million citizens and supports five naval bases, over a dozen major ports, essential commercial shipping and fishing enterprises, major oil and natural gas reserves, and thriving tourist enterprises. It is also a region that frequently suffers from severe ocean-driven events such as hurricanes that affect the lives of hundreds of thousands of citizens yearly. The recent devastation to coastal Louisiana by Hurricanes Katrina and Rita cost the lives of over 1000 people and severely damaged the economy and the environment. Hence there is an urgent need for accurate models of hurricanes and other severe weather events within the region. Such accurate models are needed to predict the path and effect of impending hurricanes for evacuation and preparation, to design better coastal defense systems, and to understand the physics and trends of hurricanes. In an effort to improve model fidelity, the Southeastern Universities Research Association, or SURA, has initiated and funded the SURA Coastal Ocean Observing Program (SCOOP) (1), partnering with ten institutions near the Gulf and Atlantic coasts, including Louisiana State University (LSU). SCOOP is representative of a growing class of geograph- 1

10 1.1. Major Contributions 2 ically distributed collaborators who have realized the need for new infrastructures, such as Grid computing (2), to support their research work in today s world of complex applications which require sharing expertise, software, and data across multiple institutions. Building the necessary infrastructures for collaborative projects such as SCOOP involves integrating multiple Grid middleware packages to provide a holistic approach to collaborative problem solving. Portals have become a popular way to integrate applications and content, providing groups of users (or virtual organizations) with a single entry point to interact with their applications, data, colleagues and services, all the while maintaining a consistent and uniform interface. As new applications and technologies, such as Grid computing, become increasing complex and difficult to configure and use, Grid portals have come to be recognized as a useful tool to enable the work of scientists and engineers without burdening them with the low-level details of underlying technologies. 1.1 Major Contributions The Coastal Studies Institute and the Center for Computation & Technology (CCT) are representing LSU in SCOOP program. As a research assistant at CCT, I worked closely with other members of the SCOOP team at LSU. I designed and built a Grid-enabled science portal for the coastal research community to reduce the complexity of grid computing for end users and enable advanced application scenarios. The resulting SCOOP portal uses new collaborative tools to better access ocean data and computational resources. The development and deployment of the SCOOP portal also illustrate how portal technologies can complement Grid middleware to provide a community with an easy-to-use collaborative infrastructure that is tailored to their particular needs and has the ability to incrementally

11 1.2. Outline 3 introduce and test new capabilities and services. My paper (3) based on this work gained the best paper award in GCE05: Workshop on Grid Computing Portals at Supercomputing Conference Before designing the SCOOP portal, I performed an extensive investigation of the current state of integration of Grid technologies with portal frameworks, cumulating in a review paper (4) which discusses two of the major Grid portal solutions, the Open Grid Computing Environments Collaboratory (OGCE) (5) and GridPortlets (6). That paper investigates and compares what each of these packages provides, discusses their advantages and disadvantages, and identifies missing features vital for Grid portal development. Our main purpose is to identify what current toolkits provide, reveal some of their limitations, and provide motivation for the evolution of Grid portal solutions. That paper can also be used as a reference for application groups to choose a Grid portal toolkit that fulfills the needs of their application. 1.2 Outline The rest of this report is organized as follows. Chapter 2 introduces Grid computing technologies and Grid portal toolkits. Chapter 3 lists several requirements from coastal modeling researchers and describes the Grid infrastructure currently deployed to meet these needs. Section 4 elaborates use cases from the coastal community, discusses the choice of Grid portal toolkits and the design and architecture of the SCOOP portal, provides implementation details and information about the different services the portal provides, and also looks to future development. Finally, Section 5 presents the conclusions of this work.

12 Chapter 2 Grid Portal Development The term the Grid was coined in the mid-1990s to denote a proposed distributed computing infrastructure to enable resource sharing within scientific collaborations. Much progress has since been made on the construction of such an infrastructure and its applications to both scientific and industrial computing problems. However, current Grids are generally difficult to use, so the development and deployment of Grid portals has been a popular way to simplify the usage of Grid services. This chapter will introduce basic concepts and technologies of Grid computing and toolkits to develop Grid portals. 2.1 Grid Computing Grid computing is a form of distributed computing that addresses a need for coordinated resource sharing and problem solving across multiple dynamic and geographically dispersed organizations. Resources include not only data, computer cycles, applications, and networks, but also specialized scientific instruments, such as telescopes, ocean sensors, and earthquake shake tables. The sharing is highly controlled, with resource providers 4

13 2.1. Grid Computing 5 defining clearly and carefully what resources are shared, who can assess those resources, and how those resources are used. Generally, Grids contains heterogeneous administrative domains with different operating systems and hardware architectures. One of its distinguishing points from traditional distributed systems is that Grids aim to use standard, open, general-purpose protocols and interfaces. The technologies used to construct Grids has evolved over time. The Globus Alliance (7) released the Globus Toolkit 2 (GT2) in The Globus Toolkits thereafter became the de facto standard for Grid computing. As the rapidly increasing uptake of Grid technologies, the first Global Grid Forum (GGF) (8) was held as a formal standards body in March Since then, GGF has produced numerous standards and specifications documents, including the Open Grid Services Architecture (OGSA) (9). OGSA is based on web services and provides a well-defined suite of standard interfaces and behaviours that serve as a common framework for all Grid-enabled systems and applications. The Globus Toolkit 4 (GT4) (10) has implemented OGSA and other GGF-defined protocols to provide functionalities of resource management, information services, security services, and data management. In addition, a number of tools function along with Globus Toolkit to make Grids a more robust platform, such as Gridbus (11), Grid Portal toolkits, Condor (12; 13), MPICH-G2 (14), etc. Grid technologies have been applied on a range of problems from both academia and industry. These applications cover compute-intensive, data-intensive, and equipment-centered scenarios. Locally, we can these applications from projects in the Center for Computation & Technology at Louisiana State University. The UCoMS (15) project is using Grid technologies to advance reservoir simulation and drilling analysis studies, coordinating the use of large scale compute and data resources at LSU and University of Louisiana at Lafayette. The GridChem (16) project is building a production system based on Grid infrastructures

14 2.2. Grid Portal Toolkits 6 for Chemists to launch and monitor computational chemistry calculations on CCG supercomputers from remote sites. To support Grid applications in astrophysics, coastal modeling, and petroleum engineering, the Enlightened (17) project focuses on developing dynamic, adaptive, coordinated and optimized use of networks connecting geographically distributed high-end computing resources and specific scientific instrumentation. 2.2 Grid Portal Toolkits The Web has proven to be an effective way of integrating and delivering content and applications. Web portals, like Yahoo or MSN, offer users a single point to access various information, services, and applications. The Grid is still a complex distributed system that has, at its roots, many differing software packages that form underlying Grid-middleware and infrastructure. Grid portals are gaining momentum in the scientific research community as a possible way to expose Grid services and functionality in a friendly, easy-to-use, and collaborative interface. When designing a Grid portal for a particular application domain, developers often have the choice of developing a new solution from the ground-up or leveraging an existing toolkit. Although building a solution from scratch may prove successful in the short-term, as technologies evolve and the demands on portal applications become more intense, more sophisticated solutions are needed. Grid portal toolkits are increasingly being adopted as a means to speed application development since they can provide much of the high-level functionality that is needed to manage the multi-institutional and multi-virtual organization issues that arise when designing and maintaining a production portal. A defining moment in the evolution of portal development toolkits occurred in October 2003 with the introduction of JSR-168 (18), an industry led effort to specify how

15 2.2. Grid Portal Toolkits 7 components within Java-based portals, called portlets, should interact with their hosting environment, or container. By clearly defining the methods a portlet must implement to be standards compliant, JSR-168 is the first step towards allowing true interoperability between portlets developed in different portlet containers. A true JSR-168-compliant portlet that does not have any container-related dependencies will be able to run, without any modification, in any number of portal servers including IBM WebSphere, BEA WebLogic, uportal, and GridSphere (19). In such a case, the container becomes irrelevant in choosing a Grid portal toolkit, so long as it is portable. There are a number of Grid portal toolkits including GridPortlets, GridPort Toolkit (20), and OGCE. Since the GridPort Toolkit, initially started in the TACC, is now gradually integrated into the OGCE, I will mainly introduce GridPortlets and OGCE in the following subsections GridPortlets GridPortlets is an open-source toolkit that was developed at the Albert Einstein Institute as part of the European GridLab (21) project s GridSphere workpackage. GridPortlets provides a high-level interface for portal developers to access a range of Grid services including resource information, credential management, job submission, and file browsing. Figure 2.1 shows the general architecture of GridPortlets, where a common and clearly defined API abstracts developers from underlying services and middleware. For example, to launch an executable using GridPortlets, a simple execute task is constructed that does not require any details about the underlying implementation, which may be the Globus Resource Allocation Manager (GRAM) (22) via the Java Commodity Grid Toolkit (Java CoG) (23) or an advanced resource brokering system (24) such as the GridLab Resource

16 2.2. Grid Portal Toolkits 8 Figure 2.1: GridPortlet Architecture. Management System (GRMS) (25). In addition to providing a high-level API for Grid operations, GridPortlets contains many reusable User Interface (UI) components that can easily be exploited to develop other portlet-based applications. These UI components allow developers to customize the generic JSP pages used by GridPortlets and incorporate them into their applications. GridPortlets itself reuses many of these components for its various portlets, including the file browsing dialog and resource information interfaces. Portal services in GridPortlets are managed through a Service Registry that allows developers to plug in new implementations to the existing framework. Both services and portlets in GridPortlets can leverage other services to build more sophisticated applications and share data. For example the Job Submission Portlet accesses the MyProxy (26) service to determine if a user has a valid credential, and if not, refers the user to the MyProxy portlet to retrieve a valid proxy.

17 2.2. Grid Portal Toolkits 9 GridPortlets is packaged with five generic JSR-168 compliant portlets that can be used without any additional development: Resource Registry Portlet - Administrators can manage the set of resources their Grid portal makes available to its users. Resource Browser Portlet - Users can browse the resources to which they have access, including the services, job queues, and accounts that are available on remote computers. Credential Retrieval Portlet - Users can retrieve credentials from a MyProxy server and gain single sign-on access to both the portal and Grid resources. Job Submission Portlet - Users can submit and manage jobs on their resources using Globus. Resource brokering systems are also supported through the same API, although currently only GRMS has been implemented. File Browser Portlet - Users can browse and manage physical files on Grid resources. Logical files registered with a logical file service can be accessed if a service is available OGCE The OGCE project was established in Fall 2003 with funding from the National Science Foundation Middleware Initiative. OGCE is a open-source collaborative project that leverages Grid portal research and development from the University of Chicago, Argonne National Laboratory, Indiana University, the University of Michigan, the National Center for Supercomputing Applications, the San Diego State University, and the Texas Advanced Computing Center.

18 2.2. Grid Portal Toolkits 10 Figure 2.2: OGCE 2 Architecture The basis of the OGCE architecture, as shown in Figure 2.2, is pluggable components in the form of services and portlets. OGCE uses the Java CoG as its main service API for accessing Grid resources. GridPort s Grid Portal Information Repository (GPIR) is used to retrieve local resource information, such as machine loads and queue status. OGCE comes packaged with services to support Grid operations such as workflow and queue management with the Open GCE Runtime Engine (OGRE) (27) and the Berkeley- Illinois-Maryland Association (BIMA) QueueManager. OGCE provides container-independent mechanisms that allow portlets to share data. For example, the MyProxy Manager allows other portlets accessing Grid resources to use credentials retrieved by the MyProxy portlet. The OGCE team delivered OGCE release 2 (OGCE 2), whose portlets are compliant to JSR-168 and can currently be deployed into either GridSphere or uportal. The following are the core Grid portlets in OGCE 2: Proxy Management Portlet - Enables users to retrieve credential proxies from a MyProxy server specified by the user and allows them to remove retrieved credential

19 2.2. Grid Portal Toolkits 11 proxies. Job Submission Portlet - Provides a simple user interface for submitting and tracking jobs on a Globus-based Grid. File Transfer Portlet - Uses GridFTP (28) for managing files among Grid machines, supporting both uploading and downloading files. GPIR Browser Portlet - Allows users to browse Grid and portal-related resource information that has been aggregated and cached by the GPIR web-service. Resource data is categorized into compute resources, storage resources, and visualization resources. OGRE Events Viewer Portlet - Allows users to monitor OGRE events on a specified server. Using OGRE enables users to write workflow scripts to execute a flow of controlled tasks. BIMA Queue Viewer Portlet - Allows users to monitor queue status and status of jobs in queues on a specified server using BIMA. The BIMA QueueManager is a Java application that supports the execution of multi-stage jobs, where individual jobs may be dispatched to a Globus-based Grid through the Java CoG. Viscosity Portlet - Allows users to access the central data repository developed by the Network for Earthquake Engineering Simulation (NEES) to store or retrieve files and associated metadata Comparison between GridPortlets and OGCE Grid-middleware and portal technologies are constantly evolving as new technologies are developed and more stakeholders become involved. It is only natural in this evolution that

20 2.2. Grid Portal Toolkits 12 Table 2.1: Comparison between GridPortlets and OGCE 2 Features GridPortlets OGCE 2 Service API a uniform and consistent highlevel portal service API a heterogeneous set of service APIs Grid Middleware GT2, GT3, GT4, and GridLab GT2, GT3, and GT4 middleware Persistent Using hibernate to persisting information Not known Layer about resources and jobs Presentation JSP and a UI component model JSP and Velocity Layer Core Grid Portlets Portability Credential Management, Resource Information Provider, Job Management (Support GRAM and GRMS), File Management Compliant to JSR 168, but only ready for GridSphere Credential Management, Resource Information Provider, Job Management (Support GRAM and Condor), File Management Compliant to JSR 168 and ready for GridSphere and uportal the packages supporting the Grid also will be rapidly changing. Therefore, I focuses on the functionality provided by GridPortlets v1.0 and OGCE 2 RC 4. This section gives a relatively brief comparison between them and further details can be referred to our review paper. Table summarizes the comparison of main features of Grid portal toolkits. Both GridPortlets and OGCE provide information services, job and file management services, and authentication services for portal developers. While overlapping in the basic functionalities of Grid services, GridPortlets and OGCE differ greatly in the service APIs they provide for developers. One key feature of GridPortlets is that it defines a single and consistent high-level API between portlets and underlying Grid services. This uniform API abstracts developers from underlying Grid technologies and infrastructure while supporting multiple implementations. The API can be implemented with Grid programming

21 2.2. Grid Portal Toolkits 13 frameworks, such as Java CoG and Java Grid Application Toolkit (GAT) (29), or with Grid middleware toolkits, such as Globus Toolkit and GridLab middleware services. Developers choose the implementations of particular services during portal configuration or in some cases users are given the choice of which service to use. The current implementation of GridPortlets API support GT2, GT3, and GT4 through Java CoG 1.1, and the Grid middleware services developed in the GridLab project. It uses the GridSphere persistence layer, an object/relational API that is implemented with Hibernate, to store job and resource information. By contrast, OGCE provides a heterogeneous set of service APIs, aggregated mainly from other projects, such as the Java CoG and GridPort. For job submission and physical file management, Java CoG provides a uniform API that abstracts developers from underlying implementations, including Condor, SSH, and different versions of the Globus Toolkit. GridPortlets and OGCE both use JavaServer Pages (JSP) for generating web presentation. Additionally, GridPortlets UI component model provides a number of reusable JSP components, including file browsing and job submission components. These reusable UI components facilitate developers in building interactive, friendly web interfaces. Although OGCE does not supply reusable JSP UI components, it provides tools to support for Velocity-based portlets, which may help developers to port Jetspeed-based portlets into JSR-168 containers. GridPortlets and OGCE both offer Grid-related portlets that can be integrated into Grid portals without additional programming, including credential management, resource information browser, job management, and file management. Their functionalities are similar and main differences are on user interface and some minor features. Since GridPorlets services are based on the service framework of GridSphere, it is not completely independent and, as a result, portlets based on GridPortlets, even though

22 2.2. Grid Portal Toolkits 14 technically compliant with JSR-168, can not easily be deployed into portlet containers other than GridSphere. By contrast, OGCE provides container-independent services and its portlets can be deployed in either GridSphere or uportal. In addition, OGCE includes a suite of HttpUnit tests for its Grid-related portlets, which can be used to verify the OGCE installation in a portlet container.

23 Chapter 3 Collaborative Coastal Modeling 3.1 Goals SCOOP is working towards a integrated and coordinated coastal ocean observation and prediction system, leveraging emerging regional efforts and cutting edge information technologies. It aims to integrate distributed real-time ocean observing stations and regional coastal modeling entities, to run ensembles of numerical hydrodynamic models for the prediction, verification and visualization of critical storm surge and wave behaviour during severe storms and hurricanes. SCOOP is addressing several needs in reaching their goals including: Ubiquitous and easy access to data of all types, including sensor, satellite, model and visualization data. Automated deployment of models across heterogeneous resources, including complex workflows and ensembles. Creation of data standards and interoperability of model codes. 15

24 3.2. Science Scenarios 16 Capabilities for coupled and multi-scale models. Operational procedures which can provide GIS visualization and notification to emergency management personnel. Building an infrastructure to meet these needs and supply timely information about severe events requires attention to reliability, fault tolerance, scheduling, as well as end user presentation and interaction. 3.2 Science Scenarios The current SCOOP members is a combination of research institutions, university programs and national agencies, including Louisiana State University, University of Alabama at Huntsville, Texas A&M University, GoMOOS, University of Florida, University of North Carolina at Chapel Hill, the National Oceanic and Atmospheric Administration (NOAA), University of Maryland, University of Miami, and Virginia Institute of Marine Science. This myriad collaboration engages researchers with diverse skill sets and varying degrees of technical expertise. One motivating collaborative scenario (3) for SCOOP is the following: An evolving hurricane in the tracking region begins the complex process of predicting and validating a hurricane path and its corresponding hydrodynamic impacts. Feeds from the National Hurricane Center trigger analytical models at the University of Florida which generate appropriate wind fields. Additionally atmospheric models such as No- GAPS, COAMPS, MM5, NCEP-NAM among others, are pushed by diverse modeling entities in a non-deterministic manner. A brief description of these models can be found at the National Hurricane Center page (30). These winds are pushed into a SCOOP Storage

25 3.2. Science Scenarios 17 archive using automated mechanisms as they become available. Researchers from across the Nation, alerted to the impending event by notifications sent via , SMS and IM, authenticate to the SCOOP portal from wherever they are located. From an Ensemble Model interface, they query for all possible atmospheric models available for the corresponding temporal specifications, in addition to the analytical winds. Based on the query results a matrix of possible hydrodynamic models vs. the available atmospheric datasets is generated. For datasets from atmospheric models that have yet to arrive in the archive watchdogs are deployed tagged with the associated coupled ocean modeling workflow. The portal interface provides researchers with the ability to configure models as needed, and prioritize the order in which they should be run. The matrix of models and currently available datasets is then used to automatically stage data and schedule the ensemble of hydrodynamic models on the SCOOP Grid, comprised of machines across the region. As results become available, notifications are dispatched to collaborators and output data are pushed back into the archive. GIS driven visualization services (31) allow the modelers and end users to analyze results using interfaces that provide an overlay of results obtained from all the models. Interfaces pinpoint the location of available sensors allowing modelers to compare the model ensemble with realtime data from Sensor Stations. Such correlated results comparison which allows for model data validation and verification results in improved forecasts of storm surge and inundation displayed using context relevant maps eg. street overlays or artificial levee diversion structures.

26 3.3. CCT Role CCT Role The SCOOP group at CCT is contributing to the deployment of a cyber infrastructure for SCOOP. Our role includes: 1. Surveying the coastal research community to identify hardware and software requirements 2. Providing a data archive (32) for storing observational data from satellite and buoy sources and results from coastal and atmospheric model simulations. 3. Designing a Grid testbed, the SCOOP Grid, for model deployment and data archive. 4. Building a Grid-enabled science portal to provide the coastal research community with easy access to data, models, and resources in the SCOOP Grid. 5. Providing a command-line toolkit (33) for accessing the data archive 6. Showing prototype examples of ensemble scenarios where multiple wind input data can be automatically located and fed into multiple wave or surge models and the resulting data will be staged to the data archive. The CCT SCOOP team includes Gabrielle Allen (Lead), Jon MacLaren (Manager and Data Archive), Andrei Hutanu (Visualization), Ian Kelley (GridSphere), Chirag Dekate (Models and SCOOP Grid), Chongjie Zhang (SCOOP Portal), Dayong Huang (Data clients), Zhou Lei (Grid Application Toolkit), Archit Kulshrestha (Condor), Sasanka Madiraju (SCOOP Grid) and Edward Jerome Tate (Visualization).

27 3.4. Infrastructure Infrastructure Currently, coastal researchers typically access data from multiple sources (e.g. wind fields from NCEP or USGODAE, hurricane tracks from NHC, observation data from coastal observatories like WAVCIS or SEACOOS) using HTTP, FTP or more recently the LDM (34) protocols. Operational workflows are deployed using cron type scripts, which are hard to adapt to address unreliable file arrival or fault tolerance. Usually the models involved in SCOOP (e.g. ADCIRC, WWIII, SWAN, WAM, CH3D, ELCIRC) are run only at local sites, and may require many different configuration and input files. The various institutions deploy their own web servers, delivering results at different times in varying data formats and data descriptions. Activities in SCOOP and other projects are addressing the complexity of dealing with different data sources and formats. A prime need is to develop data standards to facilitate sharing and collaboration. In lieu of a formal standard, SCOOP has developed a file-naming convention throughout the project that encodes enough information to serve as primary metadata. LSU have established an advanced Grid-enabled data storage archive service, providing essential features such as digestion and retrieval of data via multiple protocols (including GridFTP and HTTP), a logical file catalog, and general event-based notification. Compute resources for researchers are available in the form of the SCOOP Grid which is comprised of resources distributed across multiple institutions (Louisiana State University, University of North Carolina, MCNC, University of Florida, Virginia Institute of Marine Sciences). Basic Grid middleware such as Globus Toolkit 3 (GT3) is deployed across the resources. Condor is deployed across the LSU-SCOOP Grid for prototyping scheduling and job management scenarios. Figure 3.1 shows the current Grid infrastructure we deployed for SCOOP. An ongoing goal is to be able to coordinate the deployment

28 3.4. Infrastructure 20 Figure 3.1: SCOOP Grid Infrastructure and scheduling of operational and research simulations across the SCOOP Grid.

29 Chapter 4 SCOOP Portal The SCOOP Portal provides the SCOOP user community with a centralized gateway mechanism to submit and manage coastal model simulations and keep track of a large amount of data files. To better understand portal requirements from the community, we worked closely with coastal researchers and developed use-case scenarios which have driven the design of the portal. 4.1 Requirements After consulting and discussing with the SCOOP coastal researchers and modelers, we identified different requirements for the SCOOP portal development. One notable requirement was that although the scientists wanted to restrict access to data to those in the collaboration (for one reason, to address potential problems with casual interpretation of severe storm data), no finer grained access control was required. Additionally, all machines in the SCOOP Grid are shared resources among scientists, simplifying authorization needs considerably. In implementing this first version of the SCOOP portal, we concentrated on 21

30 4.1. Requirements 22 the following two user scenarios. Archive Access The SCOOP project has set up an archive service to store the source atmospheric data, the wave/surge data generated by model simulations, and also other observed data that might be used to verify the model accuracy. The SCOOP Portal is required to provide functionality to facilitate modelers and researchers in querying and retrieving datasets. The steps for accessing data files are as follows: 1. A user selects a class of data and specifies corresponding metadata to query a metadata catalog service to discover datafiles of interest, e.g. Output datafiles from AD- CIRC model simulations performed at Louisiana State University for the Gulf of Mexico region, during August A list of Logical File Names (LFNs) are returned from the query to the user. 2. The user can select one or more LFNs of interest, and then the portal contacts the archive s logical file service to return the physical file locations to the user. 3. The users can choose either to download the data file to the local machine or to perform a third-party transfer via a range of protocols. Model Simulations One of scientific objectives of SCOOP is to run an ensemble of hydrodynamic models driven by input conditions from a range of different atmospheric models. The steps for running hydrodynamic model simulations are as follows: 1. A user is required to retrieve a proxy credential to authenticate to Grid resources.

31 4.2. Choice of Grid Portal Toolkits The user specifies metadata describing atmospheric data and a hydrodynamic model. The SCOOP Portal contacts a metadata catalog service with specified metadata and the archive s logical file service to get atmospheric data files of interest. Based on the data files, the SCOOP Portal constructs a list of possible simulations, depending on the available input files in the archive. Each of these simulations will then be submitted to a job scheduler. 3. The user then can track the progress of each simulations via the SCOOP Portal or use the portal s notification services, which include AIM and Upon successful completion of each simulation the results are pushed into the archive for dissemination and further processing. 4.2 Choice of Grid Portal Toolkits The requirements from the SCOOP community are still evolving. The use of a mature portal framework and a well-designed Grid portal toolkit was necessary to be able to focus on the business logic of the SCOOP use-case scenarios and to allow for extensibility to future requirements. Based on our comparative analysis of GridPortlets and OGCE, we chose GridSphere and GridPortlets as our main toolkits to speed up the process of developing and deploying an application portal for SCOOP modelers and researchers. GridSphere is a free, open-source portal framework developed by the European Grid- Lab project, which focused on developing Grid application tools and middleware. Grid- Sphere provides a well documented set of functionality, including portlet management, user management, layout management, and role-based access control. Its portlet-based architecture offers flexibility and extensibility for portal development and facilitates soft-

32 4.3. Design 24 ware component sharing and code reuse. GridSphere is compliant with the JSR-168 portlet specification which allows portlets to be developed independently of a specific portal framework. GridSphere s portlet service model provides developers with a way to encapsulate reusable business logic into services that may be shared between many portlets. The advantages of using GridSphere come not only from its core functionalities, but also from its accompanying Grid portal toolkit, GridPortlets. GridPortlets abstracts the details of underlying Grid technologies and offers a consistent and uniform high-level service API, enabling developers to easily create custom Grid portal web-applications. The Grid- Portlets services provide functionalities for managing proxy credentials, resources, jobs, and remote files, and supports persisting information about credentials, resources, and jobs submitted by users. The GridPortlets service API currently supports both GT2 and GT3. In addition, GridPortlets delivers five well-designed, easy-to-use portlets, which include: resource registry, resource browser, credential management, job submission, and file management. 4.3 Design The architecture of the SCOOP Portal is based on the GridSphere framework. Figure 4.1 shows the SCOOP Portal software components and their interactions and relationships. From the simplified diagram, it can seen that SCOOP portlets use SCOOP services for application-specific functionality and business logic. The SCOOP Portal services themselves use or extend services built into the GridSphere framework, the GridPortlets package, as well as some third-party packages. For example, the SCOOP portal services mainly use the GridPortlet service API to interact with Grid resources, such as submitting jobs and moving remote files. Most portlets are independent from one-another, however, they

33 4.4. Implementation 25 End User 1 End User 2... End User n SCOOP Portal (based on GridSphere) Model Portlet Archive Portlet Request Portlet... Credential Portlet Resource Browser Portlet SCOOP Portal Services GridPortlets Services Archive SCOOP Job Management Download Tracking Request Tracking Credential Management Resource Browser Job Management File Management Grid Middleware GT2 Java CoG API GT3 GridLab Service API (igrid, GRMS, File Service,...) GSI, GRAM, GridFTP, MDS GSI, RLS, GridFTP Computing Resources Archival Storage Figure 4.1: SCOOP Portal architecture based on GridSphere. can communicate with each other via the service layer. For example, the credential portlet calls the credential service to retrieve proxy credentials from a MyProxy server, and later a job submission portlet can get the retrieved credentials to authenticate with Grid resources. This portlet-based, service-oriented, architecture greatly speeds up portal development and exhibits high extensibility. 4.4 Implementation The SCOOP Portal is implemented with GridSphere version and GridPortlet version 1.0, which together provide a coherent core set of relevant functionality. When evaluating the SCOOP community requirements, we found several common functionalities had already been implemented in other Grid portal projects. To avoid reinventing these, the fully

34 4.4. Implementation 26 deployed SCOOP Portal contains not only portlets developed specifically for the SCOOP, but also shared portlets from GridSphere, GridPortlets, and GridLab Testbed (35). The following list illustrates the functionalities of portlets and services that are specific to the SCOOP project: Archive enables users to retrieve, either by HTTP to the local machine or GridFTP to a remote machine, SCOOP data files from archive storage. The interfaces provides queries using metadata, and custom interfaces to specific data formats such as OpenDAP. SCOOP Job Submission provides custom interfaces and options for users to launch coastal models. The interface matches models to available data files in the archive. Simulation Tracking and Notification allows users to track the progress of active simulations via the SCOOP Portal and to receive notification via or AIM. Request Tracking coordinates team work by allowing users to manage and track the status of tasks and defects in various SCOOP sub-projects. Download Tracking tracks downloads of software tools distributed through the portal. The following list illustrates the Grid-related functionalities of portlets and services that are deployed to the SCOOP Portal but developed by other Grid projects. All other portlets are from the GridPortlets distribution except the Grid Resource Status Monitoring portlet that is developed by the GridLab project. Credential Management enables users to retrieve, renew, and delete proxy credentials from a MyProxy server.

35 4.4. Implementation 27 Resource Registry enable portal administrators to register or unregister Grid resources, such as computing resources or services. Resouce Browser enable users to view available Grid resources, including information about hardware configuration, services, job queues, and accounts on remote machines. Grid Resource Status Monitoring enable users to view the information about whether particular services and software components are installed and available on each machine, and the possible reasons for the services that are not available. Physical File Management enable users to browse and manage files on remote machines Archive The current archive storage contains three classes of data files: source atmospheric data, simulated wave/surge data, and other observed data to verify the model accuracy. The three class of data are associated with class specific metadata attributes and query interfaces. Figure 4.2 shows the archive portlet for querying for simulated wave/surge data files. The archive portlet gathers metadata information from user requests and retrieves a list of logical file names which have matching metadata from the archive portal service. Currently, SCOOP does not have an appropriate metadata service, the logical file name used in queries is generated from metadata information contained in the SCOOP file-naming convention. The logical file name may contain wildcard characters to accommodate unknown metadata or to select a range of files. The mappings of physical file names and logical file names can be obtained by querying the archive s logical file services, currently provided

36 4.4. Implementation 28 Figure 4.2: Archive portlet using metadata for querying and retrieving SCOOP data files from storage.

37 4.4. Implementation 29 Figure 4.3: Copy retrieved data files to a remote machine.

38 4.4. Implementation 30 Figure 4.4: Manage files through physical file management portlet by an instance of Globus Replica Location Service (RLS) version 2.2 (36). To provide performance scalability, the query results are shown with dynamically paging techniques, because the RLS API does not support merely returning the size of matched results. Users can retrieve SCOOP data files via HTTPS to their local machine, or perform a third-party transfer via GridFTP. The transfer service is built on the GridPortlet file services and resource services for directory selection and file copy. Figure 4.3 shows the user interface to allow users to specify a location on a remote machine when copying retrieved data files. Users can manage copied files through physical file management portlet, as shown in Figure 4.4. Currently the logical file entries in the RLS point to the physical file entries available locally. Efforts are underway to incorporate distributed storage resources including SCOOP data store instances at TAMU. The LSU SCOOP archive is also being expanded to leverage

39 4.4. Implementation 31 SRB based terabyte scale storage at SDSC. Evolving versions of the SCOOP archive access portlet will address issues of accessing federated data stores SCOOP Job Management From the user scenarios, SCOOP models can run on different input data types (e.g. using wind data generated either by analytic means or by other models). As shown in Figure 4.5, the SCOOP job submission portlet allows users to select multiple different types of wind data for a particular model. Using specified metadata, the job submission portlet queries the logical file service and generates a file list for each selected data type, and constructs a list of such tasks. The SCOOP job service submits each task to Condor via Globus GRAM to run the model on each file list. Hence one SCOOP simulation job may contain several sub-job runs. The SCOOP job submission is mainly built on the GridPortlet service API. To store custom job information and the parent-child job hierarchy, we provide a persistent layer for SCOOP job management using Hibernate. Later, we can track the job information via this portlet, as shown in Figure 4.6. Active proxy credential is required for job submission. Users need to delegate their Grid credentials to the MyProxy server. The SCOOP Portal allows users to retrieve credentials from a MyProxy server via GridPortlets credential management portlet. The job submission service will automatically use retrieved credentials for authenticating with Grid resources. The SCOOP job portlet allows users to view the status and output of each sub-job run, as shown in Figure 4.7. Notification of the job status, currently by AIM or , is implemented by another service: Simulation Tracking and Notification. Each sub-job registers itself with and continuously sends updated information to the notification service

40 4.4. Implementation 32 Figure 4.5: Job submission portlet for running ensembles of simulation models on different types of data.

41 4.4. Implementation 33 Figure 4.6: Job submission portlet for running ensembles of simulation models on different types of data.

42 4.4. Implementation 34 Figure 4.7: Show output of each sub-job run.

43 4.4. Implementation 35 via XML-RPC. The notification service collects and sends out this information via or AIM, depending on the user preference. Currently, the functionality of the SCOOP job management interfaces is limited by the lack of interoperability of the underlying models and data sources. As ongoing work is completed, more complex workflows and ensemble runs will be implemented.

44 Chapter 5 Conclusion This report has investigated state-of-the-art Grid portal toolkits represented by GridPortlets and OGCE, and described the background, design and implementation of a Grid-enabled science portal for the SCOOP coastal ocean observing and modeling community. The SCOOP portal, built with the GridSphere Framework, currently integrates customized Grid portlet components for data access, job submission, resource management and notification, and provides researchers and modelers with easy access to integrated data and compute resources. While the portal interfaces have thus far been well received by the SCOOP community, the challenge now is to make the portal an essential part of the scientists usual working environment. This requires adding new scenario and usecase driven features to the portal. These enhancements will include: advanced ensemble simulation interfaces to allow modelers to run a spectrum of hydrodynamic models each with different input conditions integration of GIS technologies into the portal to provide geo-referenced interactive 36

A Collaborative Science Portal for High. Performance Computing

A Collaborative Science Portal for High. Performance Computing A Collaborative Science Portal for High Performance Computing Master of Science, Systems Science Project Report submitted to Department of Computer Science, Louisiana State University Xiaoxi Xu April 09,

More information

A Workflow Approach to Designed Reservoir Study

A Workflow Approach to Designed Reservoir Study A Workflow Approach to Designed Reservoir Study Gabrielle Allen, Promita Chakraborty, Dayong Huang, Zhou Lei, John Lewis, Xin Li, Christopher D White, Xiaoxi Xu, Chongjie Zhang Center for Computation &

More information

Large Scale Coastal Modeling on the Grid

Large Scale Coastal Modeling on the Grid Large Scale Coastal Modeling on the Grid Lavanya Ramakrishnan lavanya@renci.org Renaissance Computing Institute Duke University North Carolina State University University of North Carolina - Chapel Hill

More information

SCOOP Data Management: A Standards-based Distributed System for Coastal Data and Modeling

SCOOP Data Management: A Standards-based Distributed System for Coastal Data and Modeling SCOOP Data Management: A Standards-based Distributed System for Coastal Data and Modeling Helen Conover, Bruce Beaumont, Marilyn Drewry, Sara Graves, Ken Keiser, Manil Maskey, Matt Smith The University

More information

Application Frameworks for High Performance and Grid Computing

Application Frameworks for High Performance and Grid Computing Application Frameworks for High Performance and Grid Computing Gabrielle Allen Assistant Director for Computing Applications, Center for Computation & Technology Associate Professor, Department of Computer

More information

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4 Concepts and Architecture of the Grid Summary of Grid 2, Chapter 4 Concepts of Grid Mantra: Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations Allows

More information

THE CCLRC DATA PORTAL

THE CCLRC DATA PORTAL THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

An approach to grid scheduling by using Condor-G Matchmaking mechanism

An approach to grid scheduling by using Condor-G Matchmaking mechanism An approach to grid scheduling by using Condor-G Matchmaking mechanism E. Imamagic, B. Radic, D. Dobrenic University Computing Centre, University of Zagreb, Croatia {emir.imamagic, branimir.radic, dobrisa.dobrenic}@srce.hr

More information

ENHANCING THE COLLABORATIVE USE OF GRID COMPUTING APPLICATIONS WITH A USER-ORIENTED PORTAL DASHBOARD

ENHANCING THE COLLABORATIVE USE OF GRID COMPUTING APPLICATIONS WITH A USER-ORIENTED PORTAL DASHBOARD ENHANCING THE COLLABORATIVE USE OF GRID COMPUTING APPLICATIONS WITH A USER-ORIENTED PORTAL DASHBOARD Dr. Thomas Choong CEO, elipva Ltd tchoong@elipva.com Chang Sau Sheong Director, Software Development

More information

An Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services

An Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services An Experience in Accessing Grid Computing Power from Mobile Device with GridLab Mobile Services Abstract In this paper review the notion of the use of mobile device in grid computing environment, We describe

More information

SOA REFERENCE ARCHITECTURE: WEB TIER

SOA REFERENCE ARCHITECTURE: WEB TIER SOA REFERENCE ARCHITECTURE: WEB TIER SOA Blueprint A structured blog by Yogish Pai Web Application Tier The primary requirement for this tier is that all the business systems and solutions be accessible

More information

Concepts and Architecture of Grid Computing. Advanced Topics Spring 2008 Prof. Robert van Engelen

Concepts and Architecture of Grid Computing. Advanced Topics Spring 2008 Prof. Robert van Engelen Concepts and Architecture of Grid Computing Advanced Topics Spring 2008 Prof. Robert van Engelen Overview Grid users: who are they? Concept of the Grid Challenges for the Grid Evolution of Grid systems

More information

A Survey Study on Monitoring Service for Grid

A Survey Study on Monitoring Service for Grid A Survey Study on Monitoring Service for Grid Erkang You erkyou@indiana.edu ABSTRACT Grid is a distributed system that integrates heterogeneous systems into a single transparent computer, aiming to provide

More information

IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand

IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand PartnerWorld Developers IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand 2 Introducing the IBM Solutions Grid

More information

Data Grids. Lidan Wang April 5, 2007

Data Grids. Lidan Wang April 5, 2007 Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural

More information

Using Resource Virtualization Techniques to Grid-enable Coupled Coastal Ocean Models

Using Resource Virtualization Techniques to Grid-enable Coupled Coastal Ocean Models Using Resource Virtualization Techniques to Grid-enable Coupled Coastal Ocean Models Renato Figueiredo Arijit Ganguly Advanced Computing and Information Systems Lab Peter Sheng, Justin Davis, Vladimir

More information

e-science Technologies in Synchrotron Radiation Beamline - Remote Access and Automation (A Case Study for High Throughput Protein Crystallography)

e-science Technologies in Synchrotron Radiation Beamline - Remote Access and Automation (A Case Study for High Throughput Protein Crystallography) Macromolecular Research, Vol. 14, No. 2, pp 140-145 (2006) e-science Technologies in Synchrotron Radiation Beamline - Remote Access and Automation (A Case Study for High Throughput Protein Crystallography)

More information

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware R. Goranova University of Sofia St. Kliment Ohridski,

More information

PROGRESS Portal Access Whitepaper

PROGRESS Portal Access Whitepaper PROGRESS Portal Access Whitepaper Maciej Bogdanski, Michał Kosiedowski, Cezary Mazurek, Marzena Rabiega, Malgorzata Wolniewicz Poznan Supercomputing and Networking Center April 15, 2004 1 Introduction

More information

The THREDDS Data Repository: for Long Term Data Storage and Access

The THREDDS Data Repository: for Long Term Data Storage and Access 8B.7 The THREDDS Data Repository: for Long Term Data Storage and Access Anne Wilson, Thomas Baltzer, John Caron Unidata Program Center, UCAR, Boulder, CO 1 INTRODUCTION In order to better manage ever increasing

More information

MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper

MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper Migrating Desktop and Roaming Access Whitepaper Poznan Supercomputing and Networking Center Noskowskiego 12/14 61-704 Poznan, POLAND 2004, April white-paper-md-ras.doc 1/11 1 Product overview In this whitepaper

More information

Abstract. 1. Introduction. Ohio State University Columbus, OH 43210 {langella,oster,hastings,kurc,saltz}@bmi.osu.edu

Abstract. 1. Introduction. Ohio State University Columbus, OH 43210 {langella,oster,hastings,kurc,saltz}@bmi.osu.edu Dorian: Grid Service Infrastructure for Identity Management and Federation Stephen Langella 1, Scott Oster 1, Shannon Hastings 1, Frank Siebenlist 2, Tahsin Kurc 1, Joel Saltz 1 1 Department of Biomedical

More information

BUILDING APPLICATIONS FROM A WEB SERVICE BASED COMPONENT ARCHITECTURE. D. Gannon, S. Krishnan, L. Fang, G. Kandaswamy, Y. Simmhan, A.

BUILDING APPLICATIONS FROM A WEB SERVICE BASED COMPONENT ARCHITECTURE. D. Gannon, S. Krishnan, L. Fang, G. Kandaswamy, Y. Simmhan, A. BUILDING APPLICATIONS FROM A WEB SERVICE BASED COMPONENT ARCHITECTURE D. Gannon, S. Krishnan, L. Fang, G. Kandaswamy, Y. Simmhan, A. Slominski What this talk is about How to build secure, reliable applications

More information

Middleware- Driven Mobile Applications

Middleware- Driven Mobile Applications Middleware- Driven Mobile Applications A motwin White Paper When Launching New Mobile Services, Middleware Offers the Fastest, Most Flexible Development Path for Sophisticated Apps 1 Executive Summary

More information

Cluster, Grid, Cloud Concepts

Cluster, Grid, Cloud Concepts Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of

More information

GridSpeed: A Web-based Grid Portal Generation Server

GridSpeed: A Web-based Grid Portal Generation Server GridSpeed: A Web-based Grid Portal Generation Server Toyotaro Suzumura Tokyo Institute of Technology, and Japan Society for the Promotion of Science suzumura@is.titech.ac.jp Satoshi Matsuoka Tokyo Institute

More information

Integrating SharePoint Sites within WebSphere Portal

Integrating SharePoint Sites within WebSphere Portal Integrating SharePoint Sites within WebSphere Portal November 2007 Contents Executive Summary 2 Proliferation of SharePoint Sites 2 Silos of Information 2 Security and Compliance 3 Overview: Mainsoft SharePoint

More information

Cloud Computing. Lecture 5 Grid Case Studies 2014-2015

Cloud Computing. Lecture 5 Grid Case Studies 2014-2015 Cloud Computing Lecture 5 Grid Case Studies 2014-2015 Up until now Introduction. Definition of Cloud Computing. Grid Computing: Schedulers Globus Toolkit Summary Grid Case Studies: Monitoring: TeraGRID

More information

Coastal Research Proposal Abstracts

Coastal Research Proposal Abstracts Coastal Research Proposal Abstracts An Automated Data Analysis, Processing, and Management System Proposal to NASA with SAIC Total Budget $15K Submitted 10/31/2008 The goal of this study is to design,

More information

SAS Information Delivery Portal

SAS Information Delivery Portal SAS Information Delivery Portal Table of Contents Introduction...1 The State of Enterprise Information...1 Information Supply Chain Technologies...2 Making Informed Business Decisions...3 Gathering Business

More information

XSEDE Service Provider Software and Services Baseline. September 24, 2015 Version 1.2

XSEDE Service Provider Software and Services Baseline. September 24, 2015 Version 1.2 XSEDE Service Provider Software and Services Baseline September 24, 2015 Version 1.2 i TABLE OF CONTENTS XSEDE Production Baseline: Service Provider Software and Services... i A. Document History... A-

More information

The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA

The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA Dr. Conor Delaney 9 April 2014 GeoMaritime, London

More information

GridFTP: A Data Transfer Protocol for the Grid

GridFTP: A Data Transfer Protocol for the Grid GridFTP: A Data Transfer Protocol for the Grid Grid Forum Data Working Group on GridFTP Bill Allcock, Lee Liming, Steven Tuecke ANL Ann Chervenak USC/ISI Introduction In Grid environments,

More information

NASA s Big Data Challenges in Climate Science

NASA s Big Data Challenges in Climate Science NASA s Big Data Challenges in Climate Science Tsengdar Lee, Ph.D. High-end Computing Program Manager NASA Headquarters Presented at IEEE Big Data 2014 Workshop October 29, 2014 1 2 7-km GEOS-5 Nature Run

More information

A High-Performance Virtual Storage System for Taiwan UniGrid

A High-Performance Virtual Storage System for Taiwan UniGrid Journal of Information Technology and Applications Vol. 1 No. 4 March, 2007, pp. 231-238 A High-Performance Virtual Storage System for Taiwan UniGrid Chien-Min Wang; Chun-Chen Hsu and Jan-Jan Wu Institute

More information

Web Service Based Data Management for Grid Applications

Web Service Based Data Management for Grid Applications Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications

More information

Monitoring Clusters and Grids

Monitoring Clusters and Grids JENNIFER M. SCHOPF AND BEN CLIFFORD Monitoring Clusters and Grids One of the first questions anyone asks when setting up a cluster or a Grid is, How is it running? is inquiry is usually followed by the

More information

Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform

Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform Communiqué 4 Standardized Communiqué 4 - fully implementing the JCR (JSR 170) Content Repository Standard, managing digital business information, applications and processes through the web. Communiqué

More information

Deploying a distributed data storage system on the UK National Grid Service using federated SRB

Deploying a distributed data storage system on the UK National Grid Service using federated SRB Deploying a distributed data storage system on the UK National Grid Service using federated SRB Manandhar A.S., Kleese K., Berrisford P., Brown G.D. CCLRC e-science Center Abstract As Grid enabled applications

More information

IBM Rational Rapid Developer Components & Web Services

IBM Rational Rapid Developer Components & Web Services A Technical How-to Guide for Creating Components and Web Services in Rational Rapid Developer June, 2003 Rev. 1.00 IBM Rational Rapid Developer Glenn A. Webster Staff Technical Writer Executive Summary

More information

Data Management System for grid and portal services

Data Management System for grid and portal services Data Management System for grid and portal services Piotr Grzybowski 1, Cezary Mazurek 1, Paweł Spychała 1, Marcin Wolski 1 1 Poznan Supercomputing and Networking Center, ul. Noskowskiego 10, 61-704 Poznan,

More information

DATABASES AND THE GRID

DATABASES AND THE GRID DATABASES AND THE GRID Paul Watson Department of Computing Science, University of Newcastle, Newcastle-upon-Tyne, NE1 7RU, UK e-mail: Paul.Watson@newcastle.ac.uk Telephone: +44 191 222 7653 Fax: +44 191

More information

API Architecture. for the Data Interoperability at OSU initiative

API Architecture. for the Data Interoperability at OSU initiative API Architecture for the Data Interoperability at OSU initiative Introduction Principles and Standards OSU s current approach to data interoperability consists of low level access and custom data models

More information

GRID COMPUTING Techniques and Applications BARRY WILKINSON

GRID COMPUTING Techniques and Applications BARRY WILKINSON GRID COMPUTING Techniques and Applications BARRY WILKINSON Contents Preface About the Author CHAPTER 1 INTRODUCTION TO GRID COMPUTING 1 1.1 Grid Computing Concept 1 1.2 History of Distributed Computing

More information

Red Hat Enterprise Portal Server: Architecture and Features

Red Hat Enterprise Portal Server: Architecture and Features Red Hat Enterprise Portal Server: Architecture and Features By: Richard Li and Jim Parsons March 2003 Abstract This whitepaper provides an architectural overview of the open source Red Hat Enterprise Portal

More information

StormSurgeViz: A Visualization and Analysis Application for Distributed ADCIRC-based Coastal Storm Surge, Inundation, and Wave Modeling

StormSurgeViz: A Visualization and Analysis Application for Distributed ADCIRC-based Coastal Storm Surge, Inundation, and Wave Modeling : A Visualization and Analysis Application for Distributed ADCIRC-based Coastal Storm Surge, Inundation, and Wave Modeling Brian Blanton Renaissance Computing Institute University of North Carolina at

More information

Secure Federated Light-weight Web Portals for FusionGrid

Secure Federated Light-weight Web Portals for FusionGrid Secure Federated Light-weight Web Portals for FusionGrid By: D. Aswath, M. Thompson, M. Goode, X. Lee, N. Y. Kim Presented by: Dipti Aswath GCE Workshop 2006 Second International Workshop on Grid Computing

More information

The glite File Transfer Service

The glite File Transfer Service The glite File Transfer Service Peter Kunszt Paolo Badino Ricardo Brito da Rocha James Casey Ákos Frohner Gavin McCance CERN, IT Department 1211 Geneva 23, Switzerland Abstract Transferring data reliably

More information

What You Need to Know About Transitioning to SOA

What You Need to Know About Transitioning to SOA What You Need to Know About Transitioning to SOA written by: David A. Kelly, ebizq Analyst What You Need to Know About Transitioning to SOA Organizations are increasingly turning to service-oriented architectures

More information

Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform

Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform May 2015 Contents 1. Introduction... 3 2. What is BIM... 3 2.1. History of BIM... 3 2.2. Why Implement BIM... 4 2.3.

More information

2012 LABVANTAGE Solutions, Inc. All Rights Reserved.

2012 LABVANTAGE Solutions, Inc. All Rights Reserved. LABVANTAGE Architecture 2012 LABVANTAGE Solutions, Inc. All Rights Reserved. DOCUMENT PURPOSE AND SCOPE This document provides an overview of the LABVANTAGE hardware and software architecture. It is written

More information

Using DeployR to Solve the R Integration Problem

Using DeployR to Solve the R Integration Problem DEPLOYR WHITE PAPER Using DeployR to olve the R Integration Problem By the Revolution Analytics DeployR Team March 2015 Introduction Organizations use analytics to empower decision making, often in real

More information

Using Message Brokering and Data Mediation to use Distributed Data Networks of Earth Science Data to Enhance Global Maritime Situational Awareness.

Using Message Brokering and Data Mediation to use Distributed Data Networks of Earth Science Data to Enhance Global Maritime Situational Awareness. Using Message Brokering and Data Mediation to use Distributed Data Networks of Earth Science Data to Enhance Global Maritime Situational Awareness. Conor Delaney, Alfredo Alessandrini & Harm Greidanus

More information

An IDL for Web Services

An IDL for Web Services An IDL for Web Services Interface definitions are needed to allow clients to communicate with web services Interface definitions need to be provided as part of a more general web service description Web

More information

Data Collection and Analysis: Get End-to-End Security with Cisco Connected Analytics for Network Deployment

Data Collection and Analysis: Get End-to-End Security with Cisco Connected Analytics for Network Deployment White Paper Data Collection and Analysis: Get End-to-End Security with Cisco Connected Analytics for Network Deployment Cisco Connected Analytics for Network Deployment (CAND) is Cisco hosted, subscription-based

More information

Cyberinfrastructure Education and Hands-on Training Using the CH3D-GTM Virtual Appliance on SURAGrid

Cyberinfrastructure Education and Hands-on Training Using the CH3D-GTM Virtual Appliance on SURAGrid Cyberinfrastructure Education and Hands-on Training Using the CH3D-GTM Virtual Appliance on SURAGrid Renato Figueiredo http://grid-appliance.org J. Davis, J. Fortes, P. Sheng, V. Paramygin, B. Tutak, D.

More information

The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team

The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team Photo courtesy Andrew Mahoney NSF Vision What is AON? a

More information

Grid Computing With FreeBSD

Grid Computing With FreeBSD Grid Computing With FreeBSD USENIX ATC '04: UseBSD SIG Boston, MA, June 29 th 2004 Brooks Davis, Craig Lee The Aerospace Corporation El Segundo, CA {brooks,lee}aero.org http://people.freebsd.org/~brooks/papers/usebsd2004/

More information

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data David Minor 1, Reagan Moore 2, Bing Zhu, Charles Cowart 4 1. (88)4-104 minor@sdsc.edu San Diego Supercomputer Center

More information

Globus Toolkit: Authentication and Credential Translation

Globus Toolkit: Authentication and Credential Translation Globus Toolkit: Authentication and Credential Translation JET Workshop, April 14, 2004 Frank Siebenlist franks@mcs.anl.gov http://www.globus.org/ Copyright (c) 2002 University of Chicago and The University

More information

Web Service Robust GridFTP

Web Service Robust GridFTP Web Service Robust GridFTP Sang Lim, Geoffrey Fox, Shrideep Pallickara and Marlon Pierce Community Grid Labs, Indiana University 501 N. Morton St. Suite 224 Bloomington, IN 47404 {sblim, gcf, spallick,

More information

U.S. Department of Health and Human Services (HHS) The Office of the National Coordinator for Health Information Technology (ONC)

U.S. Department of Health and Human Services (HHS) The Office of the National Coordinator for Health Information Technology (ONC) U.S. Department of Health and Human Services (HHS) The Office of the National Coordinator for Health Information Technology (ONC) econsent Trial Project Architectural Analysis & Technical Standards Produced

More information

HPC and Grid Concepts

HPC and Grid Concepts HPC and Grid Concepts Divya MG (divyam@cdac.in) CDAC Knowledge Park, Bangalore 16 th Feb 2012 GBC@PRL Ahmedabad 1 Presentation Overview What is HPC Need for HPC HPC Tools Grid Concepts GARUDA Overview

More information

Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace

Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace Beth Plale Indiana University plale@cs.indiana.edu LEAD TR 001, V3.0 V3.0 dated January 24, 2007 V2.0 dated August

More information

Web Application Hosting Cloud Architecture

Web Application Hosting Cloud Architecture Web Application Hosting Cloud Architecture Executive Overview This paper describes vendor neutral best practices for hosting web applications using cloud computing. The architectural elements described

More information

TOG & JOSH: Grid scheduling with Grid Engine & Globus

TOG & JOSH: Grid scheduling with Grid Engine & Globus TOG & JOSH: Grid scheduling with Grid Engine & Globus G. Cawood, T. Seed, R. Abrol, T. Sloan EPCC, The University of Edinburgh, James Clerk Maxwell Building, Mayfield Road, Edinburgh, EH9 3JZ, UK Abstract

More information

SOA management challenges. After completing this topic, you should be able to: Explain the challenges of managing an SOA environment

SOA management challenges. After completing this topic, you should be able to: Explain the challenges of managing an SOA environment Managing SOA Course materials may not be reproduced in whole or in part without the prior written permission of IBM. 4.0.3 4.0.3 Unit objectives After completing this unit, you should be able to: Explain

More information

On Enabling Hydrodynamics Data Analysis of Analytical Ultracentrifugation Experiments

On Enabling Hydrodynamics Data Analysis of Analytical Ultracentrifugation Experiments On Enabling Hydrodynamics Data Analysis of Analytical Ultracentrifugation Experiments 18. June 2013 Morris Reidel, Shahbaz Memon, et al. Outline Background Ultrascan Application Ultrascan Software Components

More information

BEA AquaLogic Integrator Agile integration for the Enterprise Build, Connect, Re-use

BEA AquaLogic Integrator Agile integration for the Enterprise Build, Connect, Re-use Product Data Sheet BEA AquaLogic Integrator Agile integration for the Enterprise Build, Connect, Re-use BEA AquaLogic Integrator delivers the best way for IT to integrate, deploy, connect and manage process-driven

More information

Orbiter Series Service Oriented Architecture Applications

Orbiter Series Service Oriented Architecture Applications Workshop on Science Agency Uses of Clouds and Grids Orbiter Series Service Oriented Architecture Applications Orbiter Project Overview Mark L. Green mlgreen@txcorp.com Tech-X Corporation, Buffalo Office

More information

André Karpištšenko, Co-Founder & Chief Scientist, Marinexplore Strata, 2014.02.11

André Karpištšenko, Co-Founder & Chief Scientist, Marinexplore Strata, 2014.02.11 marineos André Karpištšenko, Co-Founder & Chief Scientist, Marinexplore Strata, 2014.02.11 The Ocean's Big Data Platform marineos: a platform for organizing, analyzing and distributing machine data marineos

More information

IBM Rational Web Developer for WebSphere Software Version 6.0

IBM Rational Web Developer for WebSphere Software Version 6.0 Rapidly build, test and deploy Web, Web services and Java applications with an IDE that is easy to learn and use IBM Rational Web Developer for WebSphere Software Version 6.0 Highlights Accelerate Web,

More information

Resource Management on Computational Grids

Resource Management on Computational Grids Univeristà Ca Foscari, Venezia http://www.dsi.unive.it Resource Management on Computational Grids Paolo Palmerini Dottorato di ricerca di Informatica (anno I, ciclo II) email: palmeri@dsi.unive.it 1/29

More information

Status and Integration of AP2 Monitoring and Online Steering

Status and Integration of AP2 Monitoring and Online Steering Status and Integration of AP2 Monitoring and Online Steering Daniel Lorenz - University of Siegen Stefan Borovac, Markus Mechtel - University of Wuppertal Ralph Müller-Pfefferkorn Technische Universität

More information

Integration strategy

Integration strategy C3-INAD and ESGF: Integration strategy C3-INAD Middleware Team: Stephan Kindermann, Carsten Ehbrecht [DKRZ] Bernadette Fritzsch [AWI] Maik Jorra, Florian Schintke, Stefan Plantikov [ZUSE Institute] Markus

More information

How To Create A Visual Analytics Tool

How To Create A Visual Analytics Tool W H I T E P A P E R Visual Analytics for the Masses 1 State of Visual Analytics Visual analytics, in the field of business intelligence, is the integration of data visualization and interactive visual

More information

Execution Management: Key Concepts

Execution Management: Key Concepts Execution Management: Key Concepts Execution Management: Key Concepts Overview The Globus Toolkit provides a service to submit, monitor, and cancel jobs on Grid computing resources called GRAM. In GRAM,

More information

HEP Data-Intensive Distributed Cloud Computing System Requirements Specification Document

HEP Data-Intensive Distributed Cloud Computing System Requirements Specification Document HEP Data-Intensive Distributed Cloud Computing System Requirements Specification Document CANARIE NEP-101 Project University of Victoria HEP Computing Group December 18, 2013 Version 1.0 1 Revision History

More information

XSEDE Overview John Towns

XSEDE Overview John Towns April 15, 2011 XSEDE Overview John Towns XD Solicitation/XD Program extreme Digital Resources for Science and Engineering (NSF 08 571) Extremely Complicated High Performance Computing and Storage Services

More information

GRIP:Creating Interoperability between Grids

GRIP:Creating Interoperability between Grids GRIP:Creating Interoperability between Grids Philipp Wieder, Dietmar Erwin, Roger Menday Research Centre Jülich EuroGrid Workshop Cracow, October 29, 2003 Contents Motivation Software Base at a Glance

More information

The GENIUS Grid Portal

The GENIUS Grid Portal The GENIUS Grid Portal (*) work in collaboration with A. Falzone and A. Rodolico EGEE NA4 Workshop, Paris, 18.12.2003 CHEP 2000, 10.02.2000 Outline Introduction Grid portal architecture and requirements

More information

TUTORIAL. Rebecca Breu, Bastian Demuth, André Giesler, Bastian Tweddell (FZ Jülich) {r.breu, b.demuth, a.giesler, b.tweddell}@fz-juelich.

TUTORIAL. Rebecca Breu, Bastian Demuth, André Giesler, Bastian Tweddell (FZ Jülich) {r.breu, b.demuth, a.giesler, b.tweddell}@fz-juelich. TUTORIAL Rebecca Breu, Bastian Demuth, André Giesler, Bastian Tweddell (FZ Jülich) {r.breu, b.demuth, a.giesler, b.tweddell}@fz-juelich.de September 2006 Outline Motivation & History Production UNICORE

More information

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland The Lattice Project: A Multi-Model Grid Computing System Center for Bioinformatics and Computational Biology University of Maryland Parallel Computing PARALLEL COMPUTING a form of computation in which

More information

Service-Oriented Architecture and Software Engineering

Service-Oriented Architecture and Software Engineering -Oriented Architecture and Software Engineering T-86.5165 Seminar on Enterprise Information Systems (2008) 1.4.2008 Characteristics of SOA The software resources in a SOA are represented as services based

More information

How To Create A C++ Web Service

How To Create A C++ Web Service A Guide to Creating C++ Web Services WHITE PAPER Abstract This whitepaper provides an introduction to creating C++ Web services and focuses on:» Challenges involved in integrating C++ applications with

More information

NASCIO EA Development Tool-Kit Solution Architecture. Version 3.0

NASCIO EA Development Tool-Kit Solution Architecture. Version 3.0 NASCIO EA Development Tool-Kit Solution Architecture Version 3.0 October 2004 TABLE OF CONTENTS SOLUTION ARCHITECTURE...1 Introduction...1 Benefits...3 Link to Implementation Planning...4 Definitions...5

More information

A Web Services Data Analysis Grid *

A Web Services Data Analysis Grid * A Web Services Data Analysis Grid * William A. Watson III, Ian Bird, Jie Chen, Bryan Hess, Andy Kowalski, Ying Chen Thomas Jefferson National Accelerator Facility 12000 Jefferson Av, Newport News, VA 23606,

More information

This research note is restricted to the personal use of christine_tolman@byu.edu

This research note is restricted to the personal use of christine_tolman@byu.edu Burton IT1 Research G00234483 Identity Management Published: 9 July 2012 Analyst(s): Ian Glazer, Bob Blakley Identity management (IdM) has become a distinct aggregation of functions for the maintenance

More information

Performance Management Platform

Performance Management Platform Open EMS Suite by Nokia Performance Management Platform Functional Overview Version 1.4 Nokia Siemens Networks 1 (16) Performance Management Platform The information in this document is subject to change

More information

Technical. Overview. ~ a ~ irods version 4.x

Technical. Overview. ~ a ~ irods version 4.x Technical Overview ~ a ~ irods version 4.x The integrated Ru e-oriented DATA System irods is open-source, data management software that lets users: access, manage, and share data across any type or number

More information

Introduction to Service Oriented Architectures (SOA)

Introduction to Service Oriented Architectures (SOA) Introduction to Service Oriented Architectures (SOA) Responsible Institutions: ETHZ (Concept) ETHZ (Overall) ETHZ (Revision) http://www.eu-orchestra.org - Version from: 26.10.2007 1 Content 1. Introduction

More information

WebSphere Portal Server and Web Services Whitepaper

WebSphere Portal Server and Web Services Whitepaper WebSphere Server and s Whitepaper Thomas Schaeck (schaeck@de.ibm.com) IBM Software Group Abstract As web services will become the predominant method for making information and applications available programmatically

More information

Machine-to-Machine Exchange of Cyber Threat Information: a Key to Mature Cyber Defense

Machine-to-Machine Exchange of Cyber Threat Information: a Key to Mature Cyber Defense Machine-to-Machine Exchange of Cyber Threat Information: a Key to Mature Cyber Defense By: Daniel Harkness, Chris Strasburg, and Scott Pinkerton The Challenge The Internet is an integral part of daily

More information

HP Systinet. Software Version: 10.01 Windows and Linux Operating Systems. Concepts Guide

HP Systinet. Software Version: 10.01 Windows and Linux Operating Systems. Concepts Guide HP Systinet Software Version: 10.01 Windows and Linux Operating Systems Concepts Guide Document Release Date: June 2015 Software Release Date: June 2015 Legal Notices Warranty The only warranties for HP

More information

Designing an Enterprise Application Framework for Service-Oriented Architecture 1

Designing an Enterprise Application Framework for Service-Oriented Architecture 1 Designing an Enterprise Application Framework for Service-Oriented Architecture 1 Shyam Kumar Doddavula, Sandeep Karamongikar Abstract This article is an attempt to present an approach for transforming

More information

Business Process Management with @enterprise

Business Process Management with @enterprise Business Process Management with @enterprise March 2014 Groiss Informatics GmbH 1 Introduction Process orientation enables modern organizations to focus on the valueadding core processes and increase

More information

Business Process Management Tampereen Teknillinen Yliopisto

Business Process Management Tampereen Teknillinen Yliopisto Business Process Management Tampereen Teknillinen Yliopisto 31.10.2007 Kimmo Kaskikallio IT Architect IBM Software Group IBM SOA 25.10.2007 Kimmo Kaskikallio IT Architect IBM Software Group Service Oriented

More information

Users. Extending Visibility Across Your TIBCO Infrastructure and Beyond

Users. Extending Visibility Across Your TIBCO Infrastructure and Beyond White Paper Application Performance Management (APM) for TIBCO Users Extending Visibility Across Your TIBCO Infrastructure and Beyond Publication Date: October 2008 Abstract: TIBCO users may wish to expand

More information

NASA's Strategy and Activities in Server Side Analytics

NASA's Strategy and Activities in Server Side Analytics NASA's Strategy and Activities in Server Side Analytics Tsengdar Lee, Ph.D. High-end Computing Program Manager NASA Headquarters Presented at the ESGF/UVCDAT Conference Lawrence Livermore National Laboratory

More information