TopoMon: A Monitoring Tool for Grid Network Topology
|
|
|
- Hollie Marsh
- 10 years ago
- Views:
Transcription
1 TopoMon: A Monitoring Tool for Grid Network Topology Mathijs den Burger, Thilo Kielmann, and Henri E. Bal Division of Mathematics and Computer Science, Vrije Universiteit, De Boelelaan 1081a, 1081HV Amsterdam, The Netherlands {mathijs,kielmann,bal}@cs.vu.nl Abstract. In Grid environments, high-performance applications have to take into account the available network performance between the individual sites. Existing monitoring tools like the Network Weather Service (NWS) measure bandwidth and latency of end-to-end network paths. This information is necessary but not sufficient. With more than two participating sites, simultaneous transmissions may collide with each other on shared links of the wide-area network. If this occurs, applications may obtain lower network performance than predicted by NWS. In this paper, we describe TopoMon, a monitoring tool for Grid networks that augments NWS with additional sensors for the routes between the sites of a Grid environment. Our tool conforms to the Grid Monitoring Architecture (GMA) defined by the Global Grid Forum. It unites NWS performance and topology discovery in a single monitoring architecture. Our topology consumer process collects route information between the sites of a Grid environment and derives the overall topology for utilization by application programs and communication libraries. The topology can also be visualized for Grid application developers. 1 Introduction A difficult problem in designing efficient applications for computational Grids is that the wide-area interconnection network is highly complex, dynamic, and heterogeneous. A more traditional computing platform such as a cluster or supercomputer typically has a highly regular network with constant, guaranteed performance. With a Grid, however, the network has an irregular and asymmetric topology, where different links have different speeds, which even change over time. The Network Weather Service (NWS) [23] is a tool that addresses this problem by providing dynamic bandwidth and latency measurements and predictions. Unfortunately, the information provided by NWS is insufficient for applications with communication patterns where multiple sites compete for the same links. For example, if two pairs of communicating sites are used, NWS will predict performance for each pair separately. If the two communication streams have some links in common, however, these predictions will clearly be too optimistic, as the bandwidth for these links has to be shared by the two streams. The
2 current NWS system cannot solve this problem, because it lacks topology information and thus cannot know which links are shared. This problem is especially important for applications that use collective communication, where many sites communicate with each other simultaneously. The work on MPICH-G (the MPI implementation of Globus [5]) has shown that our wide-area collective commnication library (MagPIe, [10]) has exactly this problem for shared links [8]. In this paper, we present a new tool, TopoMon, which augments NWS with topology information about the wide-area network. The added value of integrating both performance and topology information is that applications or communication libraries can not only predict their communication performance, but can also avoid congestion on those Internet links that are shared by multiple, simultaneous data streams. To foster interoperability with other Grid services, our tool conforms to the Grid Monitoring Architecture (GMA) defined by the Global Grid Forum, recommending a structure of separate processes with sensor, producer, consumer, and directory-service functionality. TopoMon can analyze which paths between Grid sites overlap with each other, possibly resulting in performance (congestion) problems. The topology is determined by sensors that internally use traceroute and is output in the form of XML event descriptions. The NWS measurements and predictions are likewise wrapped using XML. The data from both NWS and topology sensors is sent to a consumer process, which combines the data, for example to compute an optimal spanning tree or to export it for visualization. The paper is structured as follows. In Section 2, we present TopoMon s design and implementation. Section 3 illustrates the added value of integrating performance and topology data within TopoMon s topology consumer process. Section 4 discusses related work, Section 5 concludes. 2 Tool Design and Implementation In this section, we describe TopoMon. We present its architecture, the topology sensors, and the performance and topology information transferred to interested consumer processes. 2.1 The Topology Monitoring Architecture Our goal is to build a Grid network monitoring system that integrates performance data with information about the Grid network topology. The system is designed to run on various different platforms that are used by the sites of a Grid environment. Therefore, TopoMon s overall architecture (as shown in Figure 1) follows the proposed Grid Monitoring Architecture (GMA) [3]. For portability, we implemented all processes as Java programs that communicate with each other using TCP sockets. (The platform-specific sensors are written in C.) According to the GMA, different functionality is implemented in specialized processes that interoperate using specific network protocols. We distinguish between sensors (performing the actual measurements), producers (providing mon-
3 Routers NWS Sensor TopoMon Sensor NWS Sensor TopoMon Sensor Application Topo Producer XML events Consumer NWS Producer register search Directory Service Fig. 1. The TopoMon architecture itoring data from one or more sensors to interested consumers), and a directory service that allows producers and consumers to contact each other. The Network Weather Service (NWS) [23] already provides excellent performance measurements (and forecasts) for end-to-end network bandwidth and latency. Its architecture also conforms with the ideas of GMA. We therefore decided to augment the existing NWS with network topology information. However, the NWS processes communicate with proprietary protocols using binary representations for the data items exchanged. For interoperability, Grid monitoring data should be exchanged as XML descriptions of monitoring events [20]. TopoMon s implementation builds a wrapper layer around the NWS. The TopoMon performance ( NWS producer ) process presents its performance
4 data as XML descriptions to its consumer processes. It directly wraps the NWS producer processes (called memory and forecaster), and translates NWS data to XML-event descriptions. As with the original NWS software, the NWS memory processes directly communicate with the NWS network sensors, located at the participating Grid sites. Furthermore, TopoMon also directly uses the NWS directory service, called name server. In addition to wrapping NWS, TopoMon implements an infrastructure to generate network topology information. At each participating Grid site, a TopoMon sensor determines the network paths to all other sites. The Topo Producer process retrieves the topology information from the sensors in their respective data format, and presents it as XML event descriptions. We also implemented a consumer process that retrieves both the topology and NWS performance information, and presents it to application programs. We now describe the topology-related parts of TopoMon in detail. 2.2 Topology Sensors TopoMon s topology sensors explore the structure of the network connecting the sites of a Grid environment. For high-performance applications, we are interested in identifying the end-to-end paths between all sites of a Grid. The performance (bandwidth and latency) of those paths is monitored by the NWS sensors. In addition, we need to identify which parts of the end-to-end paths overlap with each other, as such overlap may cause congestion and performance decrease, if the links are used simultaneously. As Grid networks are part of the global Internet, identifying end-to-end paths corresponds to observing the respective IP routing information, as used by the Border Gateway Protocol (BGP) [16]. However, the routing data available from BGP is restricted to routing between Autonomous Systems (AS), which are the organizational units of the Internet, such as ISPs and their customers. In order to maintain global scalability of routing information, BGP data is restricted to entry and exit routers for each AS; routing inside each AS is performed locally. As a consequence, end-to-end paths and their possible intersections, as needed by TopoMon, can not be derived from BGP data. Alternatively, path information could be taken from Internet topology repositories, like the ones maintained by CAIDA [19]. Unfortunately, those repositories only provide information about the sites included in their own monitoring efforts, a consequence of the intractable size of the Internet as a whole. As TopoMon s purpose is to explore the topology exactly between the sites of a Grid environment, general Internet topology repositories do not provide the required information about the specific sites in use. As a consequence, TopoMon has to perform its own, active topology discovery. The only feasible approach is to perform probes as done by traceroute, or to run the tool directly between the sites of a Grid environment [18]. traceroute is directly based on the ubiquitous IP and ICMP protocols. It discovers the routers on a path from one site in the Internet to another one, without relying on any additional network management protocol like SNMP, which is only available
5 within local network installations. We implemented the TopoMon sensors as wrapper processes around the locally available traceroute programs. 2.3 Performance and Topology Event Data The XML producer-consumer protocol, as proposed by the GGF performance working group [20], describes all performance data as timestamped events of name-value pairs. The respective GGF document prescribes the protocol between producers and consumers, covering both subscription to services and the exchange of event data itself. The messages of this protocol, including the event data, are formatted as XML documents. <NWSBandwidthTcp xmlns=" <SourceHostName>das0fs.cs.vu.nl</SourceHostName> <DestinationHostName>das2fs.wins.uva.nl</DestinationHostName> <NWSForecast> <Value unit="mb/s">2.31</value> <Error> </Error> </NWSForecast> <TimeStamp> T Z</TimeStamp> </NWSBandwidthTcp> <NWSLatencyTcp xmlns=" <SourceHostName>das0fs.cs.vu.nl</SourceHostName> <DestinationHostName>huron.cs.unh.edu</DestinationHostName> <NWSForecast> <Value unit="ms">12.3</value> <Error> </Error> </NWSForecast> <TimeStamp> T Z</TimeStamp> </NWSLatencyTcp> Fig. 2. XML events for NWS bandwidth and latency data The GGF document also provides simple XML schemas for performance data like network roundtrip times. We use similar XML schemas for representing the data generated by NWS. They are shown using example events in Figure 2. For simplicity of presentation, we present example events rather than the formal schema descriptions themselves. The latter are only needed by the producer and consumer processes implementing the protocols. The figure shows both a bandwidth and a latency data event. Both event schemas are easily human-readable, which is one of the advantages of using XML [20]. The NWS performance events start with a tag indicating their type, followed by information about source and destination host, identifying the pair of NWS sensors that generated this event. The events are completed with the predicted data value, its predicted error, and the event timestamp. It should be noted that the bandwidth and latency event descriptions are specific to NWS, the network monitoring tool that generates the events. Unfortunately, network performance data is strongly dependent on the measurement method used for generating the data. For example, message roundtrip times measured by NWS and by ping can be very different because of the different protocols involved (TCP vs. ICMP). For later assessment of the data, information about
6 its measurement method has to be added. We are involved in the newly-founded Network Measurements Working Group (NMWG) of the GGF. The NMWG aims to develop standards to ensure the compatibility of metrics across measurement systems. Until such standards exist, however, tools like TopoMon have to define their own metrics. <Traceroute xmlns=" maxhops="30" probesize="38"> <SourceHost> <Name>das0fs.cs.vu.nl</Name> <IP4Address> </IP4Address> </SourceHost> <DestinationHost> <Name>das2fs.wins.uva.nl</Name> <IP4Address> </IP4Address> </DestinationHost> <Hop index="1"> <Host> <Name>brandaris</Name> <IP4Address> </IP4Address> </Host> <RTT>1.068</RTT> <RTT>1.032</RTT> <RTT>1.017</RTT> </Hop> <Hop index="2"> <Host> <Name>wnxswitch</Name> <IP4Address> </IP4Address> </Host> <RTT>0.551</RTT> <RTT>0.406</RTT> <RTT>0.395</RTT> </Hop>... <Hop index="8"> <Host> <Name>das2fs.wins.uva.nl</Name> <IP4Address> </IP4Address> </Host> <RTT>1.584</RTT> <RTT>2.140</RTT> <RTT>2.226</RTT> </Hop> <TimeStamp> T Z</TimeStamp> </Traceroute> Fig. 3. XML event for traceroute information Figure 3 shows an example event for the traceroute information, as gathered by the topology producers. The traceroute event mainly consists of source and destination host, and of a list of hops, for which the name, IP address, and the measured round-trip times are reported. The XML representation for traceroute data also is easy to understand by humans. Translating the already textual representation of the raw data produced by the traceroute sensors into another textual representation has two advantages. First, the very regular XML description is easier to parse by a consumer process. Second, the XML description removes error messages and the (slight) differences in the data formats produced by the traceroute implementations on the various platforms. 3 Topology Consumer Functionality We have implemented a topology consumer process that queries the producer processes and provides the information in preprocessed form to Grid application programs. The consumer thus takes as input the network topology descriptions and the NWS measurement data (see Figure 1), both in XML form. Our consumer process implementation illustrates the capabilities of a Grid monitoring
7 architecture that integrates both performance and topology information. It supports the development of collective communication libraries like MagPIe [10]. Other application domains may develop consumers tailored for their specific needs. The consumer computes the overall topology of a Grid by combining the topology (traceroute) results between all respective sites, and transitively removing those intermediate nodes with exactly two neighbors that do not contribute to the determination of overlapping paths. In order to cover the potential path asymmetry between pairs of Grid sites, all links in this topology graph are unidirectional. All links of the precomputed topology are then attributed with latency and bandwidth values. Latency values are taken from the traceroute data. Bandwidth values are taken from the NWS measurements; for each link l, the maximum of the end-to-end bandwidth values of all connections using l is used as the link s estimated bandwidth. This computation of individual link bandwidth is a conservative estimate, as Internet backbone links may provide higher accumulated bandwidth to multiple, simultaneous transmissions. As NWS currently minimizes the intrusiveness of its measurements, concurrent measurements across shared links are not provided. We are currently developing a scheme to augment NWS by additionally performing such measurements, where the number of additional probes can be limited to those paths where links are actually shared, according to the traceroute results. The consumer process reports its information to application programs and communication libraries like MagPIe. It can provide end-to-end latency and bandwidth, the path between two endpoints, and the shared parts of two paths between two pairs of endpoints. The consumer can also export the graph connecting the endpoints to the GraphViz graph visualization package [7], the latter to support Grid application developers. Figure 4 shows a topology graph between four sites. The endpoints are denoted by shaded boxes. For clarity of presentation, only bandwidth numbers (in Mbit/s) are shown in the figure. Most importantly, the consumer can also compute (multicast) spanning trees from one endpoint to all other endpoints. As we are interested in application-level communication, only endpoints (Grid sites) can actively forward messages. The consumer process can compute the spanning trees either with minimal latency, or with maximal bandwidth. The former case minimizes the completion time for multicasting a short message to all endpoints, which is mostly independent of the available bandwidth. The latter case optimizes multicast for long messages, which is mostly independent of latency. Computing these spanning trees generalizes our work on MPI-style collective communication [9] from homogeneous to heterogeneous Grid networks. Spanning trees with minimal latency can be computed based on end-to-end latency values only. For short messages, link bandwidth and thus overlapping paths can be ignored. However, for computing spanning trees with maximal bandwidth, the TopoMon consumer combines all NWS predictions and traceroute results. All links in the graph (see for example Figure 4) are annotated with
8 their estimated bandwidth. Based on these annotations, the maximum overall bandwidth from one endpoint to the others, can be computed by applying Dijkstra s algorithm to compute a tree of shortest paths, using the reciprocals of the bandwidth values as distance metrics [1]. das0fs.cs.vu.nl 6.0 das3fs.tn.tudelft.nl 6.5 BR1.Delft.surf.net PO10-0.CR1.Amsterdam2.surf.net AR5.Amsterdam1.surf.net PO1-0.CR2.Amsterdam2.surf.net ipls-surfnet.abilene.ucaid.edu clev-ipls.abilene.ucaid.edu huron.cs.unh.edu charcoal.cs.ucsb.edu ipls-kscy.abilene.ucaid.edu 1.4 ipls-clev.abilene.ucaid.edu 1.4 AT2-0-4.BR1.Chicago1.surf.net kscy-ipls.abilene.ucaid.edu PO5-0.CR1.Amsterdam2.surf.net vu-router.customer.surf.net tudelft-router.customer.surf.net Fig. 4. Topology and link bandwidth (in Mbit/s) derived from TopoMon data 4 Related Work Many researchers have found that high-performance Grid applications need to be aware of the interconnection network in use [4, 8, 9, 15, 17]. Performance monitoring systems thus are becoming integral parts of Grid computing platforms.
9 The Global Grid Forum has proposed the GMA architecture [3] that describes the components of monitoring systems and their interactions. In an accompanying document [20], an XML-based producer-consumer protocol has been proposed for the GMA. TopoMon follows the GMA structure, and also uses the XML-based protocol for portability reasons. For TopoMon, we had to define additional XML schemas for expressing performance and topology information. There exist several monitoring systems for the Internet that either measure network performance [2, 13, 14] or explore network topology [6, 19]. However, their usefulness for Grid applications is limited as they only provide data for the nodes involved in their own monitoring efforts. For our purposes, we need information (topology and performance) about exactly those parts of the global Internet that connect the sites of a Grid environment. While Internet-related tools provide useful monitoring techniques, Grid environments have to apply them to their own sites. TopoMon integrates performance and topology information for given Grid platforms. Some systems explore the topology of LAN installations in great detail. Most prominently, Remos [4] uses SNMP to collect the necessary information. However, SNMP is not available for general Internet connections. The BGP [16] routing protocol provides global topology information, however only with the granularity of Autonomous Systems, which is insufficient for our purposes. The most advanced performance monitoring system for Grids is the Network Weather Service (NWS) [23]. However, NWS only provides performance data between pairs of Grid sites. TopoMon augments this data with information about the topology of the interconnecting networks. Only the combination of both kinds of data allows Grid applications to both predict communication performance and avoid contention between multiple of its own data streams. Instead of NWS and traceroute, other network sensor tools could be used as well [11, 12, 21]. However, our choice of sensors was determined by low intrusiveness, widespread availability, and for NWS also by application-level measurement using TCP streams. Other sensors might easily be integrated into TopoMon simply by adapting the respective producer process. 5 Conclusions and Future Work We have presented TopoMon, a Grid network monitoring tool that integrates both performance and topology information. TopoMon has been designed according to the GGF monitoring architecture (GMA) to foster interoperability with other Grid services. The added value of integrating both performance and topology information is that applications or communication libraries can not only predict their communication performance, but can also avoid congestion on those Internet links that are shared by multiple, simultaneous data streams. TopoMon s consumer process computes spanning trees between the sites of a Grid environment that minimize either overall latency (for multicasting short messages) or overall bandwidth (for long messages). This process has been specifically designed to support collective
10 communication as in our MagPIe library [10]. Other application domains, such as distributed simulations on large data sets, can easily integrate their own consumer functionality, to optimally use the existing network resources. TopoMon currently conservatively underestimates the bandwidth of network links, neglecting the possibly higher backbone bandwidth that might be observed by multiple, simultaneous transmissions. This is a direct consequence of using NWS for network measurements. NWS carefully avoids simultaneous measurements both for minimizing intrusiveness and for accuracy [22]. We are currently working on a scheme to augment NWS to additionally perform concurrent measurements that both explores the potential of the Internet backbone and minimizes the necessary, additional measurements. Exploiting the topology of the given network connections is key to this scheme. We are also working on a better graphical representation of the network topology to visualize conflicting data streams. Finally, we are augmenting our MagPIe library with collective communication algorithms that exploit the information derived by TopoMon. Acknowledgements We would like to thank Rich Wolski and Martin Swany for the lively exchange of ideas about network monitoring. Jason Maassen, Rob van Nieuwpoort, and Guilleaume Pierre made valuable comments on drafts of this work. The following people kindly granted us access to their systems for performing topology experimentation: Luc Bougé (ENS Cachan), Edgar Gabriel (HLRS), Sergei Gorlatch (TU Berlin), Phil Hatcher (Univ. of New Hampshire), Thomas Kunz (Carleton Univ.), Ludek Matyska (Masaryk Univ.), Jarek Nabrzysky (PNSC), Steven Newhouse (Imperial College), Guillaume Pierre (INRIA Rocquencourt), Satoshi Sekiguchi (ETL), Rich Wolski (UCSB). References 1. R. K. Ahuja, T. L. Magnanti, and J. B. Orlin. Network Flows. Prentice Hall, Active Measurement Project (AMP) R. Aydt, D. Gunter, W. Smith, M. Swany, V. Taylor, B. Tierney, and R. Wolski. A Grid Monitoring Architecture. Global Grid Forum, Performance Working Group, Grid Working Document GWD-Perf-16-1, P. Dinda, T. Gross, R. Karrer, B. Lowekamp, N. Miller, P. Steenkiste, and D. Sutherland. The Architecture of the Remos System. In IEEE Symposium on High Performance Distributed Computing (HPDC10), San Francisco, CA, I. Foster and C. Kesselman. Globus: A Metacomputing Infrastructure Toolkit. Int. Journal of Supercomputer Applications, 11(2): , P. Francis, S. Jamin, C. Jin, Y. Jin, D. Raz, Y. Shavitt, and L. Zhang. IDMaps: A Global Internet Host Distance Estimation Service. IEEE/ACM Transactions on Networking, E. R. Gansner, E. Koutsofios, S. C. North, and K.-P. Vo. A Technique for Drawing Directed Graphs. IEEE Trans. of Software Engineering, 19(3): , 1993.
11 8. N. T. Karonis, B. R. de Supinski, I. Foster, W. Gropp, E. Lusk, and J. Bresnahan. Exploiting Hierarchy in Parallel Computer Networks to Optimize Collective Operation Performance. In International Parallel and Distributed Processing Symposium (IPDPS 2000), pages , Cancun, Mexico, May IEEE. 9. T. Kielmann, H. E. Bal, S. Gorlatch, K. Verstoep, and R. F. H. Hofman. Network Performance-aware Collective Communication for Clustered Wide Area Systems. Parallel Computing, 27(11): , T. Kielmann, R. F. H. Hofman, H. E. Bal, A. Plaat, and R. A. F. Bhoedjang. Mag- PIe: MPI s Collective Communication Operations for Clustered Wide Area Systems. In Symposium on Principles and Practice of Parallel Programming (PPoPP), pages , Atlanta, GA, May B. A. Mah. pchar Pathrate V. Paxson, J. Mahdavi, A. Adams, and M. Mathis. An Architecture for Large-scale Internet Measurement. IEEE Communications, PingER A. Plaat, H. E. Bal, R. F. Hofman, and T. Kielmann. Sensitivity of Parallel Applications to Large Differences in Bandwidth and Latency in Two-Layer Interconnects. Future Generation Computer Systems, 17(6): , Y. Rekhter and T. Li. A Border Gateway Protocol 4 (BGP-4). IETF Network Working Group, RFC 1771, G. Shao, F. Berman, and R. Wolski. Using Effective Network Views to Promote Distributed Application Performance. In Parallel and Distributed Processing Techniques and Applications (PDPTA), R. Siamwalla, R. Sharma, and S. Keshav. Discovering Internet Topology. In IEEE INFOCOM, Skitter W. Smith, D. Gunter, and D. Quesnel. A Simple XML Producer-Consumer Protocol. Global Grid Forum, Performance Working Group, Grid Working Document GWD-Perf-8-2, Sprobe M. Swany and R. Wolski. Topology Discovery for the Network Weather Service. Submitted for publication. 23. R. Wolski, N. Spring, and J. Hayes. The Network Weather Service: A Distributed Resource Performance Forecasting Service for Metacomputing. Future Generation Computing Systems, 15(5 6): , 1999.
MapCenter: An Open Grid Status Visualization Tool
MapCenter: An Open Grid Status Visualization Tool Franck Bonnassieux Robert Harakaly Pascale Primet UREC CNRS UREC CNRS RESO INRIA ENS Lyon, France ENS Lyon, France ENS Lyon, France [email protected]
Dynamic Load Balancing for a Grid Application
Dynamic Load Balancing for a Grid Application Menno Dobber, Ger Koole, and Rob van der Mei Vrije Universiteit, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands {amdobber, koole, mei}@few.vu.nl http://www.cs.vu.nl/~amdobber
QUALITY OF SERVICE METRICS FOR DATA TRANSMISSION IN MESH TOPOLOGIES
QUALITY OF SERVICE METRICS FOR DATA TRANSMISSION IN MESH TOPOLOGIES SWATHI NANDURI * ZAHOOR-UL-HUQ * Master of Technology, Associate Professor, G. Pulla Reddy Engineering College, G. Pulla Reddy Engineering
Argonne National Laboratory, Argonne, IL USA 60439
LEGS: A WSRF Service to Estimate Latency between Arbitrary Hosts on the Internet R Vijayprasanth 1, R Kavithaa 2,3, and Rajkumar Kettimuthu 2,3 1 Department of Information Technology Coimbatore Institute
A Hierarchy of Network Performance Characteristics for Grid Applications and Services
GWD-C Bruce Lowekamp, College of William and Mary Category: Community Practice Brian Tierney, Lawrence Berkeley National Lab Network Measurements Working Group Les Cottrell, Stanford Linear Accelerator
Internet Firewall CSIS 4222. Packet Filtering. Internet Firewall. Examples. Spring 2011 CSIS 4222. net15 1. Routers can implement packet filtering
Internet Firewall CSIS 4222 A combination of hardware and software that isolates an organization s internal network from the Internet at large Ch 27: Internet Routing Ch 30: Packet filtering & firewalls
DiPerF: automated DIstributed PERformance testing Framework
DiPerF: automated DIstributed PERformance testing Framework Ioan Raicu, Catalin Dumitrescu, Matei Ripeanu Distributed Systems Laboratory Computer Science Department University of Chicago Ian Foster Mathematics
A Survey Study on Monitoring Service for Grid
A Survey Study on Monitoring Service for Grid Erkang You [email protected] ABSTRACT Grid is a distributed system that integrates heterogeneous systems into a single transparent computer, aiming to provide
Resource Management on Computational Grids
Univeristà Ca Foscari, Venezia http://www.dsi.unive.it Resource Management on Computational Grids Paolo Palmerini Dottorato di ricerca di Informatica (anno I, ciclo II) email: [email protected] 1/29
How To Monitor A Grid System
1. Issues of Grid monitoring Monitoring Grid Services 1.1 What the goals of Grid monitoring Propagate errors to users/management Performance monitoring to - tune the application - use the Grid more efficiently
Quality of Service Routing Network and Performance Evaluation*
Quality of Service Routing Network and Performance Evaluation* Shen Lin, Cui Yong, Xu Ming-wei, and Xu Ke Department of Computer Science, Tsinghua University, Beijing, P.R.China, 100084 {shenlin, cy, xmw,
GridFTP: A Data Transfer Protocol for the Grid
GridFTP: A Data Transfer Protocol for the Grid Grid Forum Data Working Group on GridFTP Bill Allcock, Lee Liming, Steven Tuecke ANL Ann Chervenak USC/ISI Introduction In Grid environments,
How To Make A Network Plan Based On Bg, Qos, And Autonomous System (As)
Policy Based QoS support using BGP Routing Priyadarsi Nanda and Andrew James Simmonds Department of Computer Systems Faculty of Information Technology University of Technology, Sydney Broadway, NSW Australia
Faculty of Engineering Computer Engineering Department Islamic University of Gaza 2012. Network Chapter# 19 INTERNETWORK OPERATION
Faculty of Engineering Computer Engineering Department Islamic University of Gaza 2012 Network Chapter# 19 INTERNETWORK OPERATION Review Questions ٢ Network Chapter# 19 INTERNETWORK OPERATION 19.1 List
Internet Infrastructure Measurement: Challenges and Tools
Internet Infrastructure Measurement: Challenges and Tools Internet Infrastructure Measurement: Challenges and Tools Outline Motivation Challenges Tools Conclusion Why Measure? Why Measure? Internet, with
SCALEA-G: a Unified Monitoring and Performance Analysis System for the Grid
SCALEA-G: a Unified Monitoring and Performance Analysis System for the Grid Hong-Linh Truong½and Thomas Fahringer¾ ½Institute for Software Science, University of Vienna [email protected] ¾Institute
Using IPM to Measure Network Performance
CHAPTER 3 Using IPM to Measure Network Performance This chapter provides details on using IPM to measure latency, jitter, availability, packet loss, and errors. It includes the following sections: Measuring
Traceroute-Based Topology Inference without Network Coordinate Estimation
Traceroute-Based Topology Inference without Network Coordinate Estimation Xing Jin, Wanqing Tu Department of Computer Science and Engineering The Hong Kong University of Science and Technology Clear Water
Web Service Based Data Management for Grid Applications
Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications
Monitoring Clusters and Grids
JENNIFER M. SCHOPF AND BEN CLIFFORD Monitoring Clusters and Grids One of the first questions anyone asks when setting up a cluster or a Grid is, How is it running? is inquiry is usually followed by the
Lightpath Planning and Monitoring
Lightpath Planning and Monitoring Ronald van der Pol 1, Andree Toonk 2 1 SARA, Kruislaan 415, Amsterdam, 1098 SJ, The Netherlands Tel: +31205928000, Fax: +31206683167, Email: [email protected] 2 SARA, Kruislaan
EFFICIENT DETECTION IN DDOS ATTACK FOR TOPOLOGY GRAPH DEPENDENT PERFORMANCE IN PPM LARGE SCALE IPTRACEBACK
EFFICIENT DETECTION IN DDOS ATTACK FOR TOPOLOGY GRAPH DEPENDENT PERFORMANCE IN PPM LARGE SCALE IPTRACEBACK S.Abarna 1, R.Padmapriya 2 1 Mphil Scholar, 2 Assistant Professor, Department of Computer Science,
TOPOLOGIES NETWORK SECURITY SERVICES
TOPOLOGIES NETWORK SECURITY SERVICES 1 R.DEEPA 1 Assitant Professor, Dept.of.Computer science, Raja s college of Tamil Studies & Sanskrit,Thiruvaiyaru ABSTRACT--In the paper propose about topology security
CHAPTER 2 GRID MONITORING ARCHITECTURE AND TOOLS USED FOR GRID MONITORING
10 CHAPTER 2 GRID MONITORING ARCHITECTURE AND TOOLS USED FOR GRID MONITORING This section presents literature survey about Grid computing, Grid standards, Globus Toolkit architecture, Grid monitoring process,
Title: Scalable, Extensible, and Safe Monitoring of GENI Clusters Project Number: 1723
Title: Scalable, Extensible, and Safe Monitoring of GENI Clusters Project Number: 1723 Authors: Sonia Fahmy, Department of Computer Science, Purdue University Address: 305 N. University St., West Lafayette,
A Catechistic Method for Traffic Pattern Discovery in MANET
A Catechistic Method for Traffic Pattern Discovery in MANET R. Saranya 1, R. Santhosh 2 1 PG Scholar, Computer Science and Engineering, Karpagam University, Coimbatore. 2 Assistant Professor, Computer
Agenda. Distributed System Structures. Why Distributed Systems? Motivation
Agenda Distributed System Structures CSCI 444/544 Operating Systems Fall 2008 Motivation Network structure Fundamental network services Sockets and ports Client/server model Remote Procedure Call (RPC)
Datagram-based network layer: forwarding; routing. Additional function of VCbased network layer: call setup.
CEN 007C Computer Networks Fundamentals Instructor: Prof. A. Helmy Homework : Network Layer Assigned: Nov. 28 th, 2011. Due Date: Dec 8 th, 2011 (to the TA) 1. ( points) What are the 2 most important network-layer
Transport and Network Layer
Transport and Network Layer 1 Introduction Responsible for moving messages from end-to-end in a network Closely tied together TCP/IP: most commonly used protocol o Used in Internet o Compatible with a
Avaya ExpertNet Lite Assessment Tool
IP Telephony Contact Centers Mobility Services WHITE PAPER Avaya ExpertNet Lite Assessment Tool April 2005 avaya.com Table of Contents Overview... 1 Network Impact... 2 Network Paths... 2 Path Generation...
Facility Usage Scenarios
Facility Usage Scenarios GDD-06-41 GENI: Global Environment for Network Innovations December 22, 2006 Status: Draft (Version 0.1) Note to the reader: this document is a work in progress and continues to
PANDORA FMS NETWORK DEVICE MONITORING
NETWORK DEVICE MONITORING pag. 2 INTRODUCTION This document aims to explain how Pandora FMS is able to monitor all network devices available on the marke such as Routers, Switches, Modems, Access points,
A Fast Path Recovery Mechanism for MPLS Networks
A Fast Path Recovery Mechanism for MPLS Networks Jenhui Chen, Chung-Ching Chiou, and Shih-Lin Wu Department of Computer Science and Information Engineering Chang Gung University, Taoyuan, Taiwan, R.O.C.
Figure 1. perfsonar architecture. 1 This work was supported by the EC IST-EMANICS Network of Excellence (#26854).
1 perfsonar tools evaluation 1 The goal of this PSNC activity was to evaluate perfsonar NetFlow tools for flow collection solution and assess its applicability to easily subscribe and request different
MONITORING AVAILABLE BANDWIDTH OF UNDERLYING GRID NETWORKS
MONITORING AVAILABLE BANDWIDTH OF UNDERLYING GRID NETWORKS Marcia Zangrilli Bruce B. Lowekamp, advisor Department of Computer Science College of William and Mary Abstract Harnessing the complete power
pt360 FREE Tool Suite Networks are complicated. Network management doesn t have to be.
pt360 FREE Tool Suite Networks are complicated. Network management doesn t have to be. pt360 FREE Tool Suite - At a Glance PacketTrap Networks November, 2009 PacketTrap's pt360 FREE Tool Suite consolidates
Lesson 5-3: Border Gateway Protocol
Unit 5: Intradomain and Interdomain Protocols Lesson 5-3: Gateway Protocol At a Glance The Gateway Protocol (BGP) is an interdomain routing protocol used in TCP/IP internetworks. BGP was created to allow
Development of Monitoring Tools for Measuring Network Performances: A Passive Approach
IJCST Vo l. 6, Is s u e 4, Oc t - De c 2015 ISSN : 0976-8491 (Online) ISSN : 2229-4333 (Print) Development of Monitoring Tools for Measuring Network Performances: A Passive Approach 1 Abdullah Al Mamun,
A System for Monitoring and Management of Computational Grids
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center [email protected] Abstract As organizations begin to deploy large computational
A REPORT ON ANALYSIS OF OSPF ROUTING PROTOCOL NORTH CAROLINA STATE UNIVERSITY
A REPORT ON ANALYSIS OF OSPF ROUTING PROTOCOL Using OPNET 14.5 Modeler NORTH CAROLINA STATE UNIVERSITY SUBMITTED BY: SHOBHANK SHARMA [email protected] Page 1 ANALYSIS OF OSPF ROUTING PROTOCOL A. Introduction
P ERFORMANCE M ONITORING AND A NALYSIS S ERVICES - S TABLE S OFTWARE
P ERFORMANCE M ONITORING AND A NALYSIS S ERVICES - S TABLE S OFTWARE WP3 Document Filename: Work package: Partner(s): Lead Partner: v1.0-.doc WP3 UIBK, CYFRONET, FIRST UIBK Document classification: PUBLIC
Issues about the Integration of Passive and Active Monitoring for Grid Networks
Issues about the Integration of Passive and Active Monitoring for Grid Networks S. Andreozzi 2, D. Antoniades 1, A. Ciuffoletti 2, A. Ghiselli 2, E.P. Markatos 1, M. Polychronakis 1, P. Trimintzios 1 1
Performance of the NAS Parallel Benchmarks on Grid Enabled Clusters
Performance of the NAS Parallel Benchmarks on Grid Enabled Clusters Philip J. Sokolowski Dept. of Electrical and Computer Engineering Wayne State University 55 Anthony Wayne Dr., Detroit, MI 4822 [email protected]
Troubleshooting Network Performance with Alpine
Troubleshooting Network Performance with Alpine Jeffrey Papen As a Network Engineer, I am often annoyed by slow Internet performance caused by network issues like congestion, fiber cuts, and packet loss.
Network (Tree) Topology Inference Based on Prüfer Sequence
Network (Tree) Topology Inference Based on Prüfer Sequence C. Vanniarajan and Kamala Krithivasan Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai 600036 [email protected],
PANDORA FMS NETWORK DEVICES MONITORING
NETWORK DEVICES MONITORING pag. 2 INTRODUCTION This document aims to explain how Pandora FMS can monitor all the network devices available in the market, like Routers, Switches, Modems, Access points,
How To Provide Qos Based Routing In The Internet
CHAPTER 2 QoS ROUTING AND ITS ROLE IN QOS PARADIGM 22 QoS ROUTING AND ITS ROLE IN QOS PARADIGM 2.1 INTRODUCTION As the main emphasis of the present research work is on achieving QoS in routing, hence this
An Overview of Solutions to Avoid Persistent BGP Divergence
An Overview of Solutions to Avoid Persistent BGP Divergence Ravi Musunuri Jorge A. Cobb Department of Computer Science The University of Texas at Dallas Email: musunuri, cobb @utdallas.edu Abstract The
Mapping Tool for Networks Using OSPF and LLTD Protocols
Mapping Tool for Networks Using OSPF and LLTD Protocols PAVEL KRIZ, FILIP MALY Department of Informatics and Quantitative Methods University of Hradec Kralove Rokitanskeho 62, Hradec Kralove CZECH REPUBLIC
Web Service Robust GridFTP
Web Service Robust GridFTP Sang Lim, Geoffrey Fox, Shrideep Pallickara and Marlon Pierce Community Grid Labs, Indiana University 501 N. Morton St. Suite 224 Bloomington, IN 47404 {sblim, gcf, spallick,
Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks
Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, [email protected] Chapter 08: Computer networks Version: March 3, 2011 2 / 53 Contents
Multicast vs. P2P for content distribution
Multicast vs. P2P for content distribution Abstract Many different service architectures, ranging from centralized client-server to fully distributed are available in today s world for Content Distribution
Computer Networks 1 (Mạng Máy Tính 1) Lectured by: Dr. Phạm Trần Vũ
Computer Networks 1 (Mạng Máy Tính 1) Lectured by: Dr. Phạm Trần Vũ 1 Lecture 7: Network Layer in the Internet Reference: Chapter 5 - Computer Networks, Andrew S. Tanenbaum, 4th Edition, Prentice Hall,
Grid Scheduling Dictionary of Terms and Keywords
Grid Scheduling Dictionary Working Group M. Roehrig, Sandia National Laboratories W. Ziegler, Fraunhofer-Institute for Algorithms and Scientific Computing Document: Category: Informational June 2002 Status
Dynamic Routing Protocols II OSPF. Distance Vector vs. Link State Routing
Dynamic Routing Protocols II OSPF Relates to Lab 4. This module covers link state routing and the Open Shortest Path First (OSPF) routing protocol. 1 Distance Vector vs. Link State Routing With distance
An approach to grid scheduling by using Condor-G Matchmaking mechanism
An approach to grid scheduling by using Condor-G Matchmaking mechanism E. Imamagic, B. Radic, D. Dobrenic University Computing Centre, University of Zagreb, Croatia {emir.imamagic, branimir.radic, dobrisa.dobrenic}@srce.hr
Introduction to IP v6
IP v 1-3: defined and replaced Introduction to IP v6 IP v4 - current version; 20 years old IP v5 - streams protocol IP v6 - replacement for IP v4 During developments it was called IPng - Next Generation
Building Secure Network Infrastructure For LANs
Building Secure Network Infrastructure For LANs Yeung, K., Hau; and Leung, T., Chuen Abstract This paper discusses the building of secure network infrastructure for local area networks. It first gives
Experimentation driven traffic monitoring and engineering research
Experimentation driven traffic monitoring and engineering research Amir KRIFA ([email protected]) 11/20/09 ECODE FP7 Project 1 Outline i. Future directions of Internet traffic monitoring and engineering
SmartSockets: Solving the Connectivity Problems in Grid Computing
SmartSockets: Solving the Connectivity Problems in Grid Computing Jason Maassen and Henri E. Bal Dept. of Computer Science, Vrije Universiteit Amsterdam, The Netherlands [email protected], [email protected] ABSTRACT
Integration of the OCM-G Monitoring System into the MonALISA Infrastructure
Integration of the OCM-G Monitoring System into the MonALISA Infrastructure W lodzimierz Funika, Bartosz Jakubowski, and Jakub Jaroszewski Institute of Computer Science, AGH, al. Mickiewicza 30, 30-059,
Inter-domain Routing Basics. Border Gateway Protocol. Inter-domain Routing Basics. Inter-domain Routing Basics. Exterior routing protocols created to:
Border Gateway Protocol Exterior routing protocols created to: control the expansion of routing tables provide a structured view of the Internet by segregating routing domains into separate administrations
Introduction to LAN/WAN. Network Layer
Introduction to LAN/WAN Network Layer Topics Introduction (5-5.1) Routing (5.2) (The core) Internetworking (5.5) Congestion Control (5.3) Network Layer Design Isues Store-and-Forward Packet Switching Services
Analysis of Internet Topologies
Analysis of Internet Topologies Ljiljana Trajković [email protected] Communication Networks Laboratory http://www.ensc.sfu.ca/cnl School of Engineering Science Simon Fraser University, Vancouver, British
Computer Networking Networks
Page 1 of 8 Computer Networking Networks 9.1 Local area network A local area network (LAN) is a network that connects computers and devices in a limited geographical area such as a home, school, office
A Network Management Software Based on Secure Shell (SSH) Channels. and Java Universal Network Graph (JUNG)
A Network Management Software Based on Secure Shell (SSH) Channels and Java Universal Network Graph (JUNG) Students: Simran Khalsa, Gerald Castaneda, Ruben Rivera. Mentor: Dr. Jorge Crichigno Department
Path Selection Methods for Localized Quality of Service Routing
Path Selection Methods for Localized Quality of Service Routing Xin Yuan and Arif Saifee Department of Computer Science, Florida State University, Tallahassee, FL Abstract Localized Quality of Service
How to Keep Track of Your Network Configuration
The following paper was originally presented at the Seventh System Administration Conference (LISA 93) Monterey, California, November, 1993 How to Keep Track of Your Network Configuration J. Schonwalder
Quality of Service Routing in MPLS Networks Using Delay and Bandwidth Constraints
Quality of Service Routing in MPLS Networks Using Delay and Bandwidth Constraints Mohammad HossienYaghmae Computer Department, Faculty of Engineering, Ferdowsi University of Mashad, Mashhad, Iran [email protected]
Stability of QOS. Avinash Varadarajan, Subhransu Maji {avinash,smaji}@cs.berkeley.edu
Stability of QOS Avinash Varadarajan, Subhransu Maji {avinash,smaji}@cs.berkeley.edu Abstract Given a choice between two services, rest of the things being equal, it is natural to prefer the one with more
A Study of Internet Packet Reordering
A Study of Internet Packet Reordering Yi Wang 1, Guohan Lu 2, Xing Li 3 1 Department of Electronic Engineering Tsinghua University, Beijing, P. R. China, 100084 [email protected] 2 China Education
Classic Grid Architecture
Peer-to to-peer Grids Classic Grid Architecture Resources Database Database Netsolve Collaboration Composition Content Access Computing Security Middle Tier Brokers Service Providers Middle Tier becomes
Scaling 10Gb/s Clustering at Wire-Speed
Scaling 10Gb/s Clustering at Wire-Speed InfiniBand offers cost-effective wire-speed scaling with deterministic performance Mellanox Technologies Inc. 2900 Stender Way, Santa Clara, CA 95054 Tel: 408-970-3400
Research on Errors of Utilized Bandwidth Measured by NetFlow
Research on s of Utilized Bandwidth Measured by NetFlow Haiting Zhu 1, Xiaoguo Zhang 1,2, Wei Ding 1 1 School of Computer Science and Engineering, Southeast University, Nanjing 211189, China 2 Electronic
Definition. A Historical Example
Overlay Networks This lecture contains slides created by Ion Stoica (UC Berkeley). Slides used with permission from author. All rights remain with author. Definition Network defines addressing, routing,
