Assessing Fault Management in Network Resource Intensive and Protocol Rich Environments

Presented by Tom Hempler
September 11, 2014
Image: NASA/JPL

Fault Management Verification
- Ensure network infrastructure functions correctly and supports current and future space network needs.
- Ensure architecture malfunctions and environmental conditions do not impact functional capabilities of the system or its reliability.
  - A satellite reporting lightning strikes should not mistake them for meteor showers.
  - Communication dropouts due to solar storms and communication delays due to spatial distance should not be mistaken for disconnects or access faults.
- Predictable errant behavior is challenging.
- Unknown, unexpected, or unpredictable events are more challenging.
- A parked motor vehicle should not start moving when the engine is started, but it did.
- Commanding satellite switchover from solar to battery power source prior to eclipse is predictable and should not be delayed or interrupted, but it can be.
- Fault management design must support reliable and durable network processing and communications.
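The point about not mistaking delays for disconnects can be illustrated with a small sketch. It is not taken from the presentation: the class name, the verdict names, the margin parameter, and the 225 million km Earth-Mars distance are all illustrative assumptions. The idea is only that a silent link should be compared against the round-trip light time before it is flagged as a fault, and that a known solar storm should be reported as an outage, not as a disconnect.

```java
import java.time.Duration;

/** Hypothetical sketch: classify link silence before raising a disconnect fault. */
final class LinkSilenceClassifier {
    enum Verdict { EXPECTED_DELAY, KNOWN_OUTAGE, SUSPECTED_FAULT }

    static final double SPEED_OF_LIGHT_KM_PER_S = 299_792.458;

    /** Round-trip light time for a link with the given one-way distance. */
    static Duration roundTripLightTime(double distanceKm) {
        double seconds = 2.0 * distanceKm / SPEED_OF_LIGHT_KM_PER_S;
        return Duration.ofMillis(Math.round(seconds * 1000.0));
    }

    /**
     * Silence within light time plus margin is expected delay; longer silence
     * during a known solar storm is a known outage; anything else is suspect.
     */
    static Verdict classify(double distanceKm, Duration silence,
                            Duration margin, boolean solarStormActive) {
        Duration expected = roundTripLightTime(distanceKm).plus(margin);
        if (silence.compareTo(expected) <= 0) {
            return Verdict.EXPECTED_DELAY;
        }
        return solarStormActive ? Verdict.KNOWN_OUTAGE : Verdict.SUSPECTED_FAULT;
    }

    public static void main(String[] args) {
        double marsKm = 225_000_000.0; // assumed, illustrative distance only
        Duration margin = Duration.ofSeconds(30); // assumed processing margin
        System.out.println(roundTripLightTime(marsKm));
        System.out.println(classify(marsKm, Duration.ofMinutes(20), margin, false));
        System.out.println(classify(marsKm, Duration.ofMinutes(40), margin, true));
        System.out.println(classify(marsKm, Duration.ofMinutes(40), margin, false));
    }
}
```

A real system would take the distance from ephemeris data and the storm flag from space weather alerts; both are reduced to plain parameters here.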

Network Architectures and Fault Management
Typical fault management in network resource intensive and protocol rich environments

Network Resources and Protocol Reliability
Fault management analysis ensures:
- Architecture and services reliability is met in requirements, design, and implementation
- SOA messaging reliability and durability is maximized for high availability
Assess compliance of infrastructure with protocol standards:
- Measure correct, consistent, predictable network behavior
- Measure degree of compatibility among interface technologies (precision, range deltas)
- Enumerate known vulnerabilities of protocols being implemented
Assess fault management quality with reliability measures in four dimensions, in proven practices, and statistically significant values:
- Measure 1) Availability, 2) Reliability, 3) Safety, and 4) Security
- On a very large architecture and infrastructure, measurable quality attributes include fault management coverage, message durability and reliability, resources and services availability, and security accountability
- Evaluate services reliability: examine network reliability and messaging durability
  http://blogs.msdn.com/b/nickmalik/archive/2007/05/27/reliability-in-soa-is-huge.aspx
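The slide names availability and fault management coverage as measurable attributes but gives no formulas. The sketch below uses two common textbook definitions as assumptions: steady-state availability as MTBF / (MTBF + MTTR), and coverage as the fraction of injected faults that were detected. The class and method names are hypothetical.

```java
/** Hypothetical sketch of two commonly used fault management measures. */
final class AvailabilityMetrics {

    /** Steady-state availability = MTBF / (MTBF + MTTR); an assumed definition. */
    static double steadyStateAvailability(double mtbfHours, double mttrHours) {
        if (mtbfHours <= 0 || mttrHours < 0) {
            throw new IllegalArgumentException("MTBF must be > 0 and MTTR >= 0");
        }
        return mtbfHours / (mtbfHours + mttrHours);
    }

    /** Expected downtime over a 365-day year (8760 hours). */
    static double expectedDowntimeHoursPerYear(double availability) {
        return (1.0 - availability) * 8760.0;
    }

    /** Coverage = detected faults / injected faults; an assumed definition. */
    static double faultCoverage(int detected, int injected) {
        if (injected <= 0 || detected < 0 || detected > injected) {
            throw new IllegalArgumentException("need 0 <= detected <= injected, injected > 0");
        }
        return (double) detected / injected;
    }

    public static void main(String[] args) {
        double a = steadyStateAvailability(999.0, 1.0);
        System.out.println("availability = " + a);
        System.out.println("downtime h/yr = " + expectedDowntimeHoursPerYear(a));
        System.out.println("coverage = " + faultCoverage(45, 50));
    }
}
```

The input figures in `main` are made up for illustration; an assessment would use observed failure and repair data.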

Network Resources and Protocol Dependability
System dependability is strengthened through:
- Error handling that is detectable and recoverable at all levels of the network topology, protocols, applications, and resources
- Fault management that is measured at all levels of the network topology, protocols, and applications through exception handling frameworks among the technologies
- Recovery management is only as effective as error and fault detection
Factors stressing dependability are:
- Network latencies
- Data bandwidth
- Communication link distances
- Network security threats
Verification processes quantify error handling and fault management coverage and capability:
- Identify risk

Measuring Network Management Functions: FCAPS Model
- The Fault, Configuration, Accounting, Performance, and Security (FCAPS) model is the industry standard for network management
- Managing FCAPS is a key management feature in current technology for meeting goals
  - Fault management enables fault detection, correction, and network recovery
  - Network protocol and resource exception handling facilitates fault management
  - Tradeoffs in performance and FCAPS implementation drive risk
Fault management throughout system development to deployment:
- Requirements: specify functions for operations, interoperability, and failure mitigation
- Design: build in alarming, isolation, diagnostics, error handling, alerting, and error logging
- Implementation: build in events, operator and message validation, status, and exception handling
- Integration and Test: unit test for compile time errors and subsystem tests for exceptions
Fault management during operations:
- Collecting and analyzing logs, correlating events, identification and authentication failures
- Protect safety critical software from emergent events
- Data storage, archiving, and playback to facilitate forensics
- Protect or back up control layer and data layer, and provide multiple path decision making processes

Protocols and Frameworks
- SNMP
- ICMP
- Java Exception Handling Framework
- Availability Management Framework SAI-AIS-AMF-B.04.01
  http://www.saforum.org/hoa/assn16627/images/sai-ais-amf-b.04.01.al.pdf
- Java Community Process JSR-000319, Availability Management for Java
  http://download.oracle.com/otndocs/jcp/availability_management-1.0-fr-oth-jspec/

Protocols: Lightweight Directory Access Protocol (LDAP) Error Codes
- LDAP error codes facilitate exception handling for directory services
- Provides a fault management capability in network infrastructure
- LDAP defines a set of status codes that are returned with LDAP responses sent by the LDAP server (see RFC 2251, since obsoleted by RFC 4511).
- LDAP error codes map to JNDI exceptions. In the JNDI, error conditions are indicated as checked exceptions that are subclasses of NamingException.
- The LDAP service provider translates the LDAP status code it receives from the LDAP server to the appropriate subclass of NamingException.
- LDAP status codes are useful for troubleshooting and managing network directory services, and for detecting operations, protocol, and authentication errors, to name a few.
- IV&V analyzes fault management quality and coverage of LDAP authentication and directory services
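A minimal sketch of how application code might act on the NamingException subclasses described above. The exception classes are standard javax.naming types; the FaultClass categories and the classifier itself are hypothetical. The LDAP result codes in the comments follow the commonly documented JNDI mapping and should be verified against the service provider in use.

```java
import javax.naming.AuthenticationException;
import javax.naming.CommunicationException;
import javax.naming.NameNotFoundException;
import javax.naming.NamingException;
import javax.naming.NoPermissionException;
import javax.naming.ServiceUnavailableException;
import javax.naming.TimeLimitExceededException;

/** Hypothetical sketch: group JNDI exceptions into fault management categories. */
final class LdapFaultClassifier {
    enum FaultClass { AUTHENTICATION, AUTHORIZATION, MISSING_ENTRY, TRANSIENT, OTHER }

    static FaultClass classify(NamingException e) {
        if (e instanceof AuthenticationException) {
            return FaultClass.AUTHENTICATION;      // e.g. result code 49, invalid credentials
        }
        if (e instanceof NoPermissionException) {
            return FaultClass.AUTHORIZATION;       // e.g. result code 50, insufficient access
        }
        if (e instanceof NameNotFoundException) {
            return FaultClass.MISSING_ENTRY;       // e.g. result code 32, no such object
        }
        if (e instanceof CommunicationException
                || e instanceof ServiceUnavailableException
                || e instanceof TimeLimitExceededException) {
            return FaultClass.TRANSIENT;           // candidates for retry or failover
        }
        return FaultClass.OTHER;                   // log and escalate
    }

    public static void main(String[] args) {
        System.out.println(classify(new AuthenticationException("bind failed")));
        System.out.println(classify(new ServiceUnavailableException("server busy")));
        System.out.println(classify(new NamingException("unclassified")));
    }
}
```

In practice the classifier would be called from the catch block around a directory lookup or bind, so that each category can drive a different recovery action.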

Protocols: Simple Object Access Protocol (SOAP) Faults
- SOAP faults facilitate exception handling for web service messaging
- Provides a fault management capability over network infrastructure (e.g., distributed databases)
- The SOAP Fault element in a SOAP message (declared as <wsdl:fault> in the WSDL) contains:
  - <Code> <Reason> <Node> <Role> <Detail>
- Analogous to an application exception, generated by message receivers to report business logic errors or unexpected conditions.
- Faults are returned to the sender only if request/response messaging is in use. If a Web service operation is configured as one-way, the SOAP fault is not returned to the sender, but stored for further processing.
SOAP Fault and Java Exception Mapping
- Modeled: maps to an exception that is thrown explicitly from the business logic of the Java code and mapped to wsdl:fault definitions in the WSDL file when the Web service is deployed. In this case, the SOAP faults are predefined.
- Unmodeled: maps to an exception (for example, java.lang.RuntimeException) that is generated at run time when no business logic fault is defined in the WSDL. In this case, Java exceptions are represented as generic SOAP fault exceptions.
- IV&V analyzes fault management quality and coverage of SOAP messaging
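The modeled and unmodeled cases, and the one-way rule, can be sketched in plain Java without any web service stack. Everything here is a hypothetical stand-in: CommandRejectedException plays the role of an exception declared as a wsdl:fault, Fault holds only a code and a reason, and the choice of "Sender" and "Receiver" as codes is an assumption made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

/** Hypothetical sketch of modeled vs. unmodeled fault mapping; not a JAX-WS API. */
final class SoapFaultMapping {

    /** Stand-in for a business exception declared as a wsdl:fault (modeled). */
    static final class CommandRejectedException extends Exception {
        CommandRejectedException(String message) { super(message); }
    }

    /** Minimal stand-in for a fault carrying only Code and Reason. */
    static final class Fault {
        final String code;
        final String reason;
        final boolean modeled;
        Fault(String code, String reason, boolean modeled) {
            this.code = code;
            this.reason = reason;
            this.modeled = modeled;
        }
    }

    /** Faults from one-way operations are stored here, not returned. */
    final List<Fault> deferred = new ArrayList<>();

    static Fault toFault(Throwable t) {
        if (t instanceof CommandRejectedException) {
            return new Fault("Sender", t.getMessage(), true);    // predefined fault
        }
        return new Fault("Receiver", String.valueOf(t), false);  // generic fault
    }

    /** Returns the fault to the sender only for request/response messaging. */
    Optional<Fault> reply(Throwable t, boolean oneWay) {
        Fault fault = toFault(t);
        if (oneWay) {
            deferred.add(fault);
            return Optional.empty();
        }
        return Optional.of(fault);
    }

    public static void main(String[] args) {
        SoapFaultMapping mapping = new SoapFaultMapping();
        Optional<Fault> f = mapping.reply(new CommandRejectedException("out of range"), false);
        System.out.println(f.get().code + ": " + f.get().reason);
        mapping.reply(new IllegalStateException("unexpected"), true);
        System.out.println("deferred faults: " + mapping.deferred.size());
    }
}
```

A verification activity could use a table like `toFault` to check that every exception the business logic can throw has either a predefined fault or a deliberate generic fallback.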

Protocols: IP and Delay Tolerant Network / Disruption Tolerant Network
- Variable delay and disruption due to planetary distance and rotation
- TCP/IP protocols were designed for limited router memory; packets are discarded due to memory constraints
- Enter bundle protocols such as CCSDS and DTN, using packet-switching store-and-forward techniques
- Bundle Security Protocol: RFC 6257
- In 1973 Bob Kahn and Vint Cerf developed the internet protocol suite (TCP/IP)
- In 1997 Vint Cerf began working on interplanetary communications
- Cerf, working with NASA and JPL, helped develop a new set of protocols that can withstand the unique space environment for an interplanetary internet
- Planetary rotation, time synchronization, domain naming and addressing, routing, and security are among the challenges for IP
- Delay Tolerant Network (DTN) is under active development by DTNRG
- DTN experiments on ISS and Mars
- The Bundle Protocol (BP) allows for disconnections, glitches, and other problems commonly found in deep space
http://www.wired.com/2013/05/vint-cerf-interplanetary-internet/
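The store-and-forward idea can be sketched as a node that holds bundles while the link is down instead of discarding them, and drops them only when their lifetime runs out. This is a simplified illustration, not the Bundle Protocol itself: the class names, the lifetime field, and the contact callback are assumptions, and features such as custody transfer and routing are omitted.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.Iterator;
import java.util.List;

/** Hypothetical sketch of store-and-forward with bundle lifetimes. */
final class StoreAndForwardNode {

    static final class Bundle {
        final String payload;
        final long createdAtSec;
        final long lifetimeSec;
        Bundle(String payload, long createdAtSec, long lifetimeSec) {
            this.payload = payload;
            this.createdAtSec = createdAtSec;
            this.lifetimeSec = lifetimeSec;
        }
        boolean expiredAt(long nowSec) {
            return nowSec - createdAtSec >= lifetimeSec;
        }
    }

    private final Deque<Bundle> stored = new ArrayDeque<>();

    /** Store an incoming bundle regardless of link state. */
    void accept(Bundle bundle) {
        stored.addLast(bundle);
    }

    /** Drop expired bundles; forward the rest only if the link is up. */
    List<Bundle> onContact(long nowSec, boolean linkUp) {
        List<Bundle> forwarded = new ArrayList<>();
        Iterator<Bundle> it = stored.iterator();
        while (it.hasNext()) {
            Bundle bundle = it.next();
            if (bundle.expiredAt(nowSec)) {
                it.remove();            // lifetime exceeded: discard
            } else if (linkUp) {
                forwarded.add(bundle);  // hand to the next hop
                it.remove();
            }                           // link down: keep holding the bundle
        }
        return forwarded;
    }

    int storedCount() {
        return stored.size();
    }

    public static void main(String[] args) {
        StoreAndForwardNode node = new StoreAndForwardNode();
        node.accept(new Bundle("telemetry", 0, 3600));
        node.accept(new Bundle("short-lived", 0, 60));
        System.out.println("sent while link down: " + node.onContact(30, false).size());
        System.out.println("sent when link up: " + node.onContact(120, true).size());
    }
}
```

The contrast with the router behavior described above is that a disruption costs storage and time, not data, until the bundle's own lifetime expires.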

White Papers
- ManageEngine: COTS product that provides FCAPS for virtual environments
  http://www.manageengine.com/products/white-papers.html