Network Operations Centre (NOC) Infrastructure Services Monitoring and Alerting




About SCU
- Regional university located on the north-east coast of NSW.
- Four major campus sites stretching from Sydney to the Gold Coast (Airport); the primary campus is in Lismore, NSW.
- A new campus, Beachside, is being constructed adjacent to Coolangatta Airport, SE QLD. It consists of a 4-storey and a 10-storey building, with a further 7-storey building set to commence by the end of 2013.
- An engineering school was recently added to the profile of courses offered.
- Students: 16,000. Staff: 911.
- Approximately 50% of students study via distance education.

Infrastructure Overview
- Campuses interlinked by 1 Gbps AARNet circuits.
- Data centres located at the Lismore and Gold Coast campuses, interlinked via a 10 Gbps AARNet circuit.
- Primary Lismore and Beachside campuses provisioned with physically diverse tails, with AARNet circuit redundancy for Internet and WAN connectivity.
- Networking platform: Cisco.
- Storage and server platform: HP.
- Virtualisation platform: VMware ESX 5.1.
- Deploying next-generation HP P7400 (3PAR) storage arrays coupled with C7000 FlexFabric blades and Cisco Nexus 5K ToR.
- Storage replication between DC sites.
- Active Directory for file/print/DHCP/authentication/internal DNS.
- Databases: MSSQL and Oracle RAC farms.
- Websites: Red Hat/Apache and Windows/IIS, load sharing via F5.
- Messaging platform: Microsoft O365 for students and staff.
- Number of wireless access points: 540.
- Number of physical hosts: approx. 100.
- Number of virtual guests: approx. 515.
- Virtualisation fraction: 83.7%.
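The virtualisation fraction can be checked from the approximate host counts. The short calculation below is only a sanity check, and it assumes the fraction is defined as virtual guests divided by all hosts (physical plus virtual); the slides do not state the definition.

```python
# Approximate counts quoted in the infrastructure overview.
physical_hosts = 100
virtual_guests = 515

# Assumed definition: share of all hosts that are virtual guests.
fraction = virtual_guests / (physical_hosts + virtual_guests)
print(f"{fraction:.1%}")
```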

Before NOC
- Multiple monitoring platforms (Nagios, Cacti, MRTG, Statscount, CISCO Works, HP Insight, VCops, HP Capacity Advisor).
- No aggregate view of performance across infrastructure platforms.
- Lack of transparency on infrastructure performance incidents outside of technical teams, leading to inefficient resolution practices.
- Competing work priorities prevent teams from sustaining an ongoing focus on developing and maintaining a cohesive monitoring service.
- No consistent approach to developing monitors across teams (templating).
- Differing levels of expertise and insight on how best to develop a cohesive monitoring capability.

Objectives
- Provide a 24x7 performance monitoring, alerting and escalation capability.
- Aggregate infrastructure monitoring, alerting and reporting under a common interface/platform.
- Achieve greater transparency over infrastructure performance.
- Provide a template-driven service to ensure consistency of monitors and alerts across the infrastructure.
- Activate alert triggers based on sustained performance levels, to smooth out performance spikes and reduce false positives.
- Gain access to expertise to readily develop new monitoring and alerting capabilities.
- Provide an interface/process that allows new monitors to be requested readily and obsolete monitors disabled.
- Do it cost-effectively.
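The objective of triggering on sustained performance levels can be illustrated with a small sketch. It is not the NOC's actual configuration: the class name, threshold and sample values are hypothetical, and the rule used here (alert only after a fixed number of consecutive breaches) is one simple way to ignore short spikes. Nagios offers a comparable effect through check retries (`max_check_attempts`).

```python
from collections import deque


class SustainedAlert:
    """Raise an alert only after `required` consecutive samples breach `threshold`."""

    def __init__(self, threshold: float, required: int) -> None:
        self.threshold = threshold
        self.required = required
        # Keep only the most recent `required` breach flags.
        self.recent = deque(maxlen=required)

    def observe(self, value: float) -> bool:
        """Record one sample; return True while the breach is sustained."""
        self.recent.append(value > self.threshold)
        return len(self.recent) == self.required and all(self.recent)


# Hypothetical CPU samples: two isolated spikes, then a sustained breach.
cpu = SustainedAlert(threshold=90.0, required=3)
samples = [50, 95, 60, 92, 96, 99, 40]
states = [cpu.observe(s) for s in samples]
print(states)
```

Only the sixth sample trips the alert, because it is the first point at which three consecutive samples exceed the threshold; the isolated spike at the second sample is ignored.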

After NOC
- Common platform for monitoring all infrastructure.
- Access to expertise for developing new monitors/tests and reports.
- Independent and transparent provider of monitoring services.
- Leverage of a 24x7 helpdesk and call escalation service.
- Comprehensive set of SMS alerts.
- Interface for requesting and disabling alerts.
- All key production services and network interfaces monitored.
- Monitoring data backed up for future analysis.

NOC Processes
- AARNet NOC Services - SCU Handover.pdf
- SCUMonitoringChangeRequest-v1.0.pdf

SMS Alerts
CRITICAL (PROBLEM) 'Internal DNS (ADS) - Staff Domain controllers' 'CRITICAL: physical memory: Total: 4G - Used: 3.94G (98%) - Free: 59.7M (2%) critical' 23 July
CRITICAL (RECOVERY) 'Internal DNS (ADS) - Staff Domain controllers' 'OK: physical memory: Total: 4G - Used: 781M (19%) - Free: 3.24G (81%)' 23 Jun 2013 19:44:24
CRITICAL (ACKNOWLEDGEMENT) 'Test Linux Platform' 'PING CRITICAL - Packet loss = 100%' 25 Jun 2013 17:31:45 +1000 ldora1.scu.int
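The memory lines in the sample alerts follow a regular shape, so they can be turned into structured data for reporting. The sketch below is illustrative only: the function name and regular expression are assumptions based on the two sample messages above, not part of the NOC tooling, and other checks will produce differently shaped text.

```python
import re

# Pattern derived from the two sample memory alerts (assumed stable format).
MEMORY_RE = re.compile(
    r"^(?P<state>OK|WARNING|CRITICAL): physical memory: "
    r"Total: (?P<total>\S+) - Used: (?P<used>\S+) \((?P<used_pct>\d+)%\) - "
    r"Free: (?P<free>\S+) \((?P<free_pct>\d+)%\)"
)


def parse_memory_alert(text: str) -> dict:
    """Return the state and usage figures from a memory check line."""
    match = MEMORY_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised alert text: {text!r}")
    fields = match.groupdict()
    # Percentages are more useful as integers than as strings.
    fields["used_pct"] = int(fields["used_pct"])
    fields["free_pct"] = int(fields["free_pct"])
    return fields


problem = parse_memory_alert(
    "CRITICAL: physical memory: Total: 4G - Used: 3.94G (98%) - Free: 59.7M (2%) critical"
)
recovery = parse_memory_alert(
    "OK: physical memory: Total: 4G - Used: 781M (19%) - Free: 3.24G (81%)"
)
print(problem["state"], problem["used_pct"], recovery["state"], recovery["used_pct"])
```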

Future Plans
- Continue the rollout of monitors across the infrastructure.
- Work with AARNet on developing the site's displays and reporting capabilities, e.g. a geographical view with drill-down capabilities.
- Develop more sophisticated end-user, service performance and benchmarking tests.
- Refine existing monitors.

NOC Demonstration
https://cpe-scu-noc1.aarnet.net.au/

Thanks for listening

AARNet NOC Services: SCU Deployment
Network Operations
Mike Groeneweg
Questnet 2013, 3 July 2013

24x7 NOC Project Phases
- Phase 1: Move to 24x7 Operations, 25 June 2010
- Phase 2: Optical Monitoring, September 2010
- Phase 3: NOC Services, February 2011
  - Transitioned trial customer ACU
  - Added CDU in 2011
  - Started SCU in late 2011 (mid December!)

SCU Requirements
Single monitoring service:
- Windows servers
- Linux servers
- Databases
- Network
- Data centres

SCU Requirements
Customised contact & escalation system:
- Per-group contacts
- Specific priority escalations
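A per-group, per-priority escalation scheme of this kind can be sketched as a lookup table. Everything in the example is hypothetical (group names, priorities, contact names and delays); the slides do not describe how the actual escalation data is stored.

```python
# Hypothetical escalation table: (group, priority) -> ordered (delay in minutes, contact).
ESCALATIONS = {
    ("network", "critical"): [(0, "network-oncall"), (15, "network-team-lead"), (60, "it-manager")],
    ("network", "warning"): [(0, "network-oncall")],
    ("windows", "critical"): [(0, "windows-oncall"), (30, "windows-team-lead")],
}


def contacts_to_notify(group: str, priority: str, minutes_unacknowledged: int) -> list[str]:
    """Return every contact whose escalation delay has elapsed for this group and priority."""
    steps = ESCALATIONS.get((group, priority), [])
    return [name for delay, name in steps if minutes_unacknowledged >= delay]


# A critical network alert that has gone unacknowledged for 20 minutes.
notified = contacts_to_notify("network", "critical", 20)
print(notified)
```

Keeping the table as data rather than code means contacts can be changed per group without touching the notification logic.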

SCU Deployment - Hardware
- 2 x servers (Acer with RHEL 6)
- 2 x VPN devices (Mikrotik)
- 4 x probes (Mikrotik)
- 2 x SMS modems (Falcomm)
Deployment locations:
- Lismore (primary)
- Riverside (Coolangatta, secondary)

SCU Deployment Locations

SCU Deployment - Software
- Red Hat Enterprise Linux 6 platform OS
- Nagios 3.4.4
- PNP4Nagios 0.6.16

SCU Deployment - Nagios Modules
- All standard 3.x modules (check_ping, check_http, check_snmp et al.)
- Custom-built modules:
  - F5 load balancer check
  - Interface check (with error counts, discards)
  - MS-SQL
  - Oracle
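For readers unfamiliar with how a custom Nagios module reports its result, the sketch below shows the general plugin convention: a status code (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) plus one line of text, with optional performance data after a `|`. It is not AARNet's interface check; the function name, thresholds and counter values are hypothetical, and a real plugin would obtain the counters from the device (for example over SNMP) and exit with the returned code.

```python
# Standard Nagios plugin status codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_interface(errors: int, discards: int, warn: int, crit: int) -> tuple[int, str]:
    """Map error and discard counts for one polling interval to a status and message."""
    worst = max(errors, discards)
    if worst >= crit:
        code, label = CRITICAL, "CRITICAL"
    elif worst >= warn:
        code, label = WARNING, "WARNING"
    else:
        code, label = OK, "OK"
    # Text after "|" is performance data in the form label=value;warn;crit.
    perfdata = f"errors={errors};{warn};{crit} discards={discards};{warn};{crit}"
    return code, f"INTERFACE {label} - errors={errors} discards={discards} | {perfdata}"


# Hypothetical counter deltas for one interval.
code, message = check_interface(errors=12, discards=0, warn=10, crit=100)
print(code, message)
```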

SCU Deployment - Nagios Modules: Windows Monitoring
- Active NRPE-based module
- NSCP (0.4.0), http://www.nsclient.org/nscp/
  - check_drive
  - check_file
  - check_process
  - check_memory
  - check_cpu
  - check_registry
  - check_uptime

SCU Deployment - DHCP Check
- check_dhcp (with DHCP helper/relay)
- Mikrotik VPNs back to master VPN and trunks
- Mikrotik remote probes as DHCP clients

SCU Performance Monitoring Phase II

Check our network
- Iperf
- Weathermap

NOC Process Improvements

SCU Tickets & E-Mail
- Change management between SCU and AARNet: deployed a ticket system for SCU to lodge and view tickets.
- Added a dedicated NOC correspondence e-mail address for SCU.
- Ensures SCU-related information is collated and answered in context.

Future changes/additions
- SeleniumRC
- SNMP traps
- Nagios 4.x or Icinga or ???
- Database configuration for rapid change
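One possible reading of "database configuration for rapid change" is generating Nagios object definitions from stored records instead of editing configuration files by hand. The sketch below illustrates that idea only: the record layout, host names and the `generic-service` template name are assumptions, and the records are an in-memory list standing in for database rows.

```python
# Hypothetical monitor records, as they might come back from a configuration database.
MONITORS = [
    {"host": "web01", "description": "HTTP", "command": "check_http", "enabled": True},
    {"host": "db01", "description": "PING", "command": "check_ping!100.0,20%!500.0,60%", "enabled": False},
]


def render_service(row: dict, template: str = "generic-service") -> str:
    """Render one Nagios service object definition from a monitor record."""
    return (
        "define service {\n"
        f"    use                  {template}\n"
        f"    host_name            {row['host']}\n"
        f"    service_description  {row['description']}\n"
        f"    check_command        {row['command']}\n"
        "}\n"
    )


# Disabled monitors are simply left out of the generated configuration.
config = "\n".join(render_service(row) for row in MONITORS if row["enabled"])
print(config)
```

Because every definition inherits from one template, this approach also keeps monitors consistent across teams, and disabling a monitor becomes a single field change.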

The Results

Thanks
SCU:
- Luke Walford, Tim Lane, Matthew Smith, Howard Jeffery
- Unix: Robin Garner
- Windows: Graham Oliver, Shane Morris, Simon Nimmo
- Exchange: Tony Hill
- Networks: Michael Lymbery, Mark Angel
- Data Centre: Bernard Thomas
AARNet:
- Doug Farmer
- Adam Binneweg
- SysAdmin Team: David Jericho, Rob Kearey, George Coburn

Thank you for listening. Questions?
1300 APL NOC
noc@aarnet.edu.au
mike.groeneweg@aarnet.edu.au

Monitoring BOF 5pm