Deploying distributed network monitoring mesh



Similar documents
Tier3 Network Issues. Richard Carlson May 19, 2009

Network Monitoring with the perfsonar Dashboard

LHCONE Site Connections

Voice Quality Measurement in perfsonar

US CMS Tier1 Facility Network at Fermilab

Network monitoring with perfsonar. Duncan Rand Imperial College London

perfsonar MDM updates for LHCONE: VRF monitoring, updated web UI, VM images

ESnet Support for WAN Data Movement

Software Defined Networking for big-data science

Introduction to perfsonar

Software Defined Networking for big-data science

perfsonar: End-to-End Network Performance Verification

Overview of Network Measurement Tools

Network performance monitoring Insight into perfsonar

ENOS: a Network Opera/ng System for ESnet Testbed

TCP Labs. WACREN Network Monitoring and Measurement Workshop Antoine Delvaux perfsonar developer

Flexible SDN Transport Networks With Optical Circuit Switching

Throughput Issues for High-Speed Wide-Area Networks

ANI Network Testbed Update

End-to-End Network/Application Performance Troubleshooting Methodology

Infrastructure for active and passive measurements at 10Gbps and beyond

Hands on Workshop. Network Performance Monitoring and Multicast Routing. Yasuichi Kitamura NICT Jin Tanaka KDDI/NICT APAN-JP NOC

Network Architecture and Topology

OnTimeDetect: Offline and Online Network Anomaly Notification Tool

IP Telephony Management

Top-Down Network Design

(Possible) HEP Use Case for NDN. Phil DeMar; Wenji Wu NDNComm (UCLA) Sept. 28, 2015

ESnet On-demand Secure Circuits and Advance Reservation System (OSCARS)

EVALUATING NETWORK BUFFER SIZE REQUIREMENTS

Implementing MPLS VPN in Provider's IP Backbone Luyuan Fang AT&T

Towards a virtualized Internet for computer networking assignments

A Coordinated. Enterprise Networks Software Defined. and Application Fluent Programmable Networks

Achieving Performance Isolation with Lightweight Co-Kernels

Figure 1. perfsonar architecture. 1 This work was supported by the EC IST-EMANICS Network of Excellence (#26854).

WHITEPAPER. VPLS for Any-to-Any Ethernet Connectivity: When Simplicity & Control Matter

Maintaining Non-Stop Services with Multi Layer Monitoring

Data Center Virtualization and Cloud QA Expertise

Network Technologies for Next-generation Data Centers

Transform Your Business and Protect Your Cisco Nexus Investment While Adopting Cisco Application Centric Infrastructure

Network System Design Lesson Objectives

Internetworking Microsoft TCP/IP on Microsoft Windows NT 4.0

High Availability Databases based on Oracle 10g RAC on Linux

Net Optics Learning Center Presents The Fundamentals of Passive Monitoring Access

WAN Optimization. Riverbed Steelhead Appliances

Testing Challenges for Modern Networks Built Using SDN and OpenFlow

Open Networking User Group SD-WAN Requirements Demonstration Talari Test Results

Network Terminology Review

Network Probe. Figure 1.1 Cacti Utilization Graph

Chapter 18. Network Management Basics

SMB Direct for SQL Server and Private Cloud

How To Set Up Foglight Nms For A Proof Of Concept

TÓPICOS AVANÇADOS EM REDES ADVANCED TOPICS IN NETWORKS

Lecture 02b Cloud Computing II

Layer 3 Network + Dedicated Internet Connectivity

Brocade Solution for EMC VSPEX Server Virtualization

Transcription:

Deploying distributed network monitoring mesh for LHC Tier-1 and Tier-2 sites Phil DeMar, Maxim Grigoriev Fermilab Joe Metzger, Brian Tierney ESnet Martin Swany University of Delaware Jeff Boote, Eric Boyd, Aaron Brown, Matt Zekauskas, Jason Zurawski Internet2 Presented at CHEP2009 Prague, Czech Republic

Outline Challenges of Wide Area Networking From centralized network monitoring model to distributed ib t d mesh of monitoring i services perfsonar-ps collection of webservices Deployment at LHC Tier-1 and Tier-2 centers

Overview Everyone know how to ping but how many know how to share results of it? Centralized monitoring models failed to deliver scalable robust network monitoring solutions Everything is a service, I mean everything Network Computational facility Storage... Let s think about network monitoring as Service Oriented Architecture

Fermilab s WAN connectivity Year 2004 Year 2009

Just Numbers 4x10Gbps ESnet Science Data Network channels with dynamic circuit reservation system 2x10Gbps routed channels It s very easy to saturate 10Gbps ( March 2009 ) CMS Tier-1 Weekly Utilization CMS Tier-1 Daily Utilization

perfsonar Collection of interoperable webservices New set of XML schema and protocols Every network monitoring tool as a service Mesh of deployed monitoring services as Network Monitoring Service perfsonar PS is perfsonar services perfsonar-ps is perfsonar services implemented in perl

perfsonar-ps services PingER based on ping, very lightweight g SNMP used for interface utilization/errors, possible to extend for any MIBs perfsonar-buoy active measurements BWCTL iperf on demand, scheduling, AA OWAMP one way delay, scheduling, on demand Information Service - services discovery, two-tiered Lookup Service Topology Service

Current state of perfsonar-ps about 100 services are running ESnet US Energy Science network is covered Internet2 largest R&D network in US is covered Tier-1 sites in US BNL and FNAL are running LHCOPN Layer2 monitoring, LHC monitoring nodes plan to deploy ~ 200 services on 30 networks by the end of Year 2009

NPToolkit Based on Knoppix Live Linux CD disk Web100 kernel perfsonar-ps services + NPAD and NDT Packaged Apache webserver, MySQL DB, Oracle XML DB Cacti, RRDtools, Cricket Zero Configuration, Out of Box Service

LHC network monitoring node Network Monitoring appliance Based on NPToolkit Modest hardware configuration ~ 600USD a box Easy updates just insert CD with updated package Two boxes required - one for latency tests, another for throughput tests Each box is dual homed - one NIC for production network, another for high impact circuit(s)

Deployment for LHC

ESnet PerfSONAR Locations PNNL LBNL FNAL StarLight BNL MAN LAN (32 A of A) SLAC LLNL LANL ANL ORNL GA IP router SDN router IP SDN MAN Optical node Lab There are 2 perfsonar hosts (1 for bandwidth services, and 1 for latency services) at each SDN router location, and at most DOE labs

Requirements for setting up LHC Network Monitoring Node LHC Tier-1/2/3 center 1 Gbps connectivity Thats it!

Why do you need it? Network issues troubleshooting Applying Network performance troubleshooting methodology Isolation of the network segments End-system vs networking problem Setting up expectations Network capacity planning Networking resources allocation Dynamic circuits reservation

Information Service (IS) Global Lookup (gls) + Topology Service (TS) Network Topology Information Services discovery Services registration End-to-end performance troubleshooting with gls

PingER data UI URL of the remote PingER MA

Sample Test results This plot shows both ping and iperf results for an 8 hour window on the network path from FNAL to UMich. Note the latency spikes around 11:30 that t are clearly l related to the traffic spike on the UMich router during that same time.

Future Deployment plans Every Tier-2 in US, full interoperability with European perfsonar MDM deployments All federated networks involved with LHC computing Orchestration level for the monitoring services, higher level data fusion and analysis Advance visualization layer Network issues tracking service

Useful links perfsonar-ps project - http://code.google.com/p/perfsonar-ps/ NPToolkit http://code.google.com/p/perfsonar- ps/wiki/nptoolkit perfsonar - http://www.perfsonar.net t Fermilab Wide Area Networking Group - Fermilab Wide Area Networking Group https://plone3.fnal.gov/p0/wan/

Questions?