OpenSM Log Management



Similar documents
Know your Cluster Bottlenecks and Maximize Performance

Introduction to Infiniband. Hussein N. Harake, Performance U! Winter School

Enterprise Content Management System Monitor. Server Debugging Guide CENIT AG Bettighofer, Stefan

Mellanox Academy Online Training (E-learning)

Troubleshooting for Yamaha router

1 Basic Configuration of Cisco 2600 Router. Basic Configuration Cisco 2600 Router

Mellanox Academy Course Catalog. Empower your organization with a new world of educational possibilities

How To Monitor Infiniband Network Data From A Network On A Leaf Switch (Wired) On A Microsoft Powerbook (Wired Or Microsoft) On An Ipa (Wired/Wired) Or Ipa V2 (Wired V2)

Determine the process of extracting monitoring information in Sun ONE Application Server

Configuring Static and Dynamic NAT Simultaneously

Database Administration

Release Notes LS Retail Data Director August 2011

Mellanox Technical Training and Certification Program

Knowledge Base Articles

QLogic SRP Module on Linux for OpenFabrics and InfiniPath Version Table of Contents

Enterprise Manager Performance Tips

EVENT LOG MANAGEMENT...

RSA Security Analytics Netflow Collection Configuration Guide

RSA Security Analytics Netflow Collection Configuration Guide

Logging in Java Applications

Grid Engine 6. Troubleshooting. BioTeam Inc.

Welcome to MaxMobile. Introduction. System Requirements

ModScan A SCADA MODBUS Network Scanner. Mark Bristow mark.bristow@gmail.com

JUROPA Linux Cluster An Overview. 19 May 2014 Ulrich Detert

RSA Security Analytics

COMMANDS 1 Overview... 1 Default Commands... 2 Creating a Script from a Command Document Revision History... 10

Installation Manual v2.0.0

Deploying the BIG-IP LTM with IBM QRadar Logging

CYAN SECURE WEB APPLIANCE. User interface manual

CAIL Security Facility NSK Host to Host FTP Encryption

HIPAA Compliance Use Case

Mediatrix 4404 Step by Step Configuration Guide June 22, 2011

GDC Data Transfer Tool User s Guide. NCI Genomic Data Commons (GDC)

Using the VCDS Application Monitoring Tool

Router Lab Reference Guide

VoIPvoice MAC Integration User Guide. VoIPvoice Skype Integration for MAC. User Guide. Last Updated 02 December Page 1 of 11

Tivoli Endpoint Manager BigFix Dashboard

MIS Export via the FEM transfer software

How To Analyze Logs On Aloha On A Pcode On A Linux Server On A Microsoft Powerbook (For Acedo) On A Macbook Or Ipad (For An Ubuntu) On An Ubode (For Macrocess

Oracle Data Integrator for Big Data. Alex Kotopoulis Senior Principal Product Manager

Visualizing gem5 via ARM DS-5 Streamline. Dam Sunwoo ARM R&D December 2012

Datagram. Datagram SyslogAgent manual. Version 3.6

(Discovery 2) Credit value: 10 Guided learning hours: 80. Aim and purpose. Unit introduction. Learning outcomes

TEST AUTOMATION FRAMEWORK

RIOT-Lab. How to use RIOT in the IoT-Lab. Oliver "Oleg" Hahm. November 7, 2014 INRIA. O. Hahm (INRIA) RIOT-Lab November 7, / 29

Firewall Troubleshooting

BRIDGING EMC ISILON NAS ON IP TO INFINIBAND NETWORKS WITH MELLANOX SWITCHX

MuL SDN Controller HOWTO for pre-packaged VM

Troubleshooting Tools to Diagnose or Report a Problem February 23, 2012

Active-Active ImageNow Server

rev. 31 (applies to CyBro OPC Server v3.0.4 and later)

Job Scheduler Daemon Configuration Guide

VM-Series Firewall Deployment Tech Note PAN-OS 5.0

SonicOS 5.9 / / 6.2 Log Events Reference Guide with Enhanced Logging

Welcome to MaxMobile. Introduction. System Requirements. MaxMobile 10.5 for Windows Mobile Pocket PC

RadBlue Load Tester Version 6. [Released: 09 DEC 2009]

From the command line- a user can begin Asterisk (v1.6), and access the Asterisk CLI by typing:

Overview. NetBorder Express Loggers Configuration Guide

Bulk Downloader. Call Recording: Bulk Downloader

SANbox Manager Release Notes Version Rev A

Framework 8.1. External Authentication. Reference Manual

PT Activity: Configure Cisco Routers for Syslog, NTP, and SSH Operations

Configuring System Message Logging

Remote Annex. Quick Start for Windows. Read before installing and using Remote Annex Software Release 4.2

vcenter Support Assistant User's Guide

System Administration Training Guide. S100 Installation and Site Management

Introduction. Created by Richard Bell 10/29/2014

ASA 9.x EIGRP Configuration Example

ZyXEL MGS-3712F V3.90(BBB.4)C0 Release Note/Manual Supplement

Technical Note P/N REV A05 September 20, 2010

SOA Software: Troubleshooting Guide for Agents

Vantage Report. Quick Start Guide

Release Notes OPC-Server V3 Alarm Event for High Availability

VirtualCenter 2 Diagnostics. Eric Gray Product Support Engineer

ReadyNAS Remote. Add-on Manual. 350 East Plumeria Drive San Jose, CA USA. May

What is new in Switch 12

Web-Based Configuration Manual System Report. Table of Contents

Using the Packet Capture Tool (PCAP) Ethernet Routing Switch 8600 Software Release 4.1

Orbix Release Notes

Monitor Print Popup for Mac. Product Manual.

DiskPulse DISK CHANGE MONITOR

PageR Enterprise Monitored Objects - AS/400-5

Using RADIUS Agent for Transparent User Identification

Performance Analysis and Optimization Tool

Configuring Network Load Balancing for vethernet

Phone Network Gateway Android Application. User Guide

Port Trunking. Contents

Open Mic on IBM Notes Traveler Best Practices. Date: 11 July, 2013

CTI Concerto Predictive Dialer Setup Instructions. Version 4

GDB Tutorial. A Walkthrough with Examples. CMSC Spring Last modified March 22, GDB Tutorial

OfficeServ Link. User Guide. Version 2.1 June 2005

Yamaha Audio Network Monitor User Guide

Configuring NetFlow Secure Event Logging (NSEL)

WEBAPP PATTERN FOR APACHE TOMCAT - USER GUIDE

Setting Up an AudioCodes MP-114

CLC Server Command Line Tools USER MANUAL

HP LeftHand SAN Solutions

Moving Exchange Message Stores and Transaction Logs to an Alternate Drive

Transcription:

OpenSM Log Management OpenFabrics Software User Group Workshop Hal Rosenstock Mellanox Technologies 1

Agenda OpenSM Update Per Module Logging Feature 2

OpenSM Releases Releases nominally every 6-9 months Independent of OFED FDR and FDR-10 support (OpenSM 3.3.11 Aug 2011) FDR (and EDR) are IBTA standards FDR-10 is MLNX proprietary SRIOV support (OpenSM 3.3.14 May 2012) Additional GUIDs for virtual machines Bug fixes beyond 3.3.14 3

Quick OpenSM Update OpenSM 3.3.17 Feb 2014 Also included in OFED 3.12 OpenSM 3.3.18 July 2014 Latest release is OpenSM 3.3.19 December 2014 Also included in OFED 3.18 which is in process now ~25 commits past 3.3.19 right now Nightly regression tests against latest master More than 400 test cases used 4

Additional MLNX OpenSM Features Event reporting scalability Bad nodes/hardware Heldback switches Multicast improvements Routing chains Credit-loop free UC and MC routing for updn/ftree/minhop Multithreaded updn/minhop/dor PQFT routing engine For quasi and parallel ports generalized fat tree topologies Fast multicast routing Enhanced EDR support Connect-IB SRIOV support 5

OpenSM Logging Related Command Line Options -f, --log_file <file name> This option defines the log to be the given file. By default, the log goes to /var/log/opensm.log. For the log to go to standard output use -f stdout. -L, --log_limit <size in MB> This option defines maximal log file size in MB. When specified the log file will be truncated upon reaching this limit. -e, --erase_log_file This option will cause deletion of the log file (if it previously exists). By default, the log file is accumulative. 6

OpenSM Logging Related Config File Options # Force flush of the log file after each log message force_log_flush FALSE # Log file to be used log_file /var/log/opensm.log # Limit the size of the log file in MB. If overrun, log is restarted log_max_size 0 # If TRUE will accumulate the log over multiple OpenSM sessions accum_log_file TRUE # Per module logging configuration file # Each line in config file contains <module_name><separator><log_flags> # where module_name is file name including.c # separator is either =, space, or tab # log_flags is the same flags as used in the coarse/overall logging per_module_logging_file /usr/local/etc/opensm/per-module-logging.conf 7

OpenSM Log Levels Overall log verbosity level log_flags config file option Related command line options -D <value> -V -v, --verbose -d, --debug <value> 8

-D <value> This option sets the log verbosity level. A flags field must follow the -D option. A bit set/clear in the flags enables/disables a specific log level as follows: BIT 0x01 0x02 0x04 0x08 0x10 0x20 0x40 0x80 ERROR (error messages) INFO (basic messages, low volume) LOG LEVEL ENABLED VERBOSE (interesting stuff, moderate volume) DEBUG (diagnostic, high volume) FUNCS (function entry/exit, very high volume) FRAMES (dumps all SMP and GMP frames) ROUTING (dump FDB routing information) SYS (syslog at LOG_INFO level in addition to OpenSM logging) 9

-D <value> Without -D, OpenSM defaults to ERROR + INFO (0x3). Specifying -D 0 disables all messages. Specifying -D 0xFF enables all messages (see -V). High verbosity levels may require increasing the transaction timeout with the -t option. #2015UG 10

Other Related Log Level Command Line Options -v, --verbose This option increases the log verbosity level. The -v option may be specified multiple times to further increase the verbosity level. See the -D option for more information about log verbosity. -V This option sets the maximum verbosity level and forces log flushing. The -V option is equivalent to - D 0xFF -d 2. See the -D option for more information about log verbosity. -d, --debug <value> -d2 - Force log flushing after each log message #2015UG 11

Issues with OpenSM Logging Coarseness of log level One level for all of OpenSM Too many log messages as increase verbosity/log level Somewhat cryptic nature of messages logged #2015UG 12

Per Module Logging (PML) Log level per module Module is a source code file Introduced so can keep overall level low but dial up level in specific modules/files Need to have idea of which modules/files to dial up PML can change on the fly with SIGHUP Added to upstream master git tree in June/July 2012 Part of OpenSM 3.3.15 and beyond 13

Per Module Logging (PML) Enable via per_module_logging_file option in options file set to PML config file name Disable by setting per_module_logging_file to (null) in options file Per module logging config file format Set of lines with module name and logging level as follows: <module name><separator><logging level> where: <module name> is the file name including.c <separator> is either =, space, or tab <logging level> is the same levels as used in the coarse/overall logging Module names may vary between releases 3.3.16 and beyond have all modules listed 3.3.15 has one less module (no osm_congestion_control.c) 14

Per Module Logging (PML) Module Names Based on Latest Upstream Master From opensm/osm_subnet.c: static const char *module_name_str[] = { "main.c", "osm_console.c", "osm_console_io.c", "osm_db_files.c", "osm_db_pack.c", "osm_drop_mgr.c", "osm_dump.c", "osm_event_plugin.c", "osm_guid_info_rcv.c", "osm_guid_mgr.c", "osm_helper.c", "osm_inform.c", "osm_lid_mgr.c", "osm_lin_fwd_rcv.c", "osm_link_mgr.c", "osm_log.c", "osm_mad_pool.c", "osm_mcast_fwd_rcv.c", "osm_mcast_mgr.c", "osm_mcast_tbl.c", "osm_mcm_port.c", 15

Per Module Logging (PML) Module Names Based on Latest Upstream Master "osm_mesh.c", "osm_mlnx_ext_port_info_rcv.c", "osm_mtree.c", "osm_multicast.c", "osm_node.c", "osm_node_desc_rcv.c", "osm_node_info_rcv.c", "osm_opensm.c", "osm_perfmgr.c", "osm_perfmgr_db.c", "osm_pkey.c", "osm_pkey_mgr.c", "osm_pkey_rcv.c", "osm_port.c", "osm_port_info_rcv.c", "osm_prtn.c", "osm_prtn_config.c", "osm_qos.c", "osm_qos_parser_l.c", "osm_qos_parser_y.c", "osm_qos_policy.c", "osm_remote_sm.c", "osm_req.c", "osm_resp.c", "osm_router.c", 16

Per Module Logging (PML) Module Names Based on Latest Upstream Master "osm_sa.c", "osm_sa_class_port_info.c", "osm_sa_guidinfo_record.c", "osm_sa_informinfo.c", "osm_sa_lft_record.c", "osm_sa_link_record.c", "osm_sa_mad_ctrl.c", "osm_sa_mcmember_record.c", "osm_sa_mft_record.c", "osm_sa_multipath_record.c", "osm_sa_node_record.c", "osm_sa_path_record.c", "osm_sa_pkey_record.c", "osm_sa_portinfo_record.c", "osm_sa_service_record.c", "osm_sa_slvl_record.c", "osm_sa_sminfo_record.c", "osm_sa_sw_info_record.c", "osm_sa_vlarb_record.c", "osm_service.c", "osm_slvl_map_rcv.c", "osm_sm.c", "osm_sminfo_rcv.c", "osm_sm_mad_ctrl.c", "osm_sm_state_mgr.c", "osm_state_mgr.c", 17

Per Module Logging (PML) Module Names Based on Latest Upstream Master }; "osm_subnet.c", "osm_sw_info_rcv.c", "osm_switch.c", "osm_torus.c", "osm_trap_rcv.c", "osm_ucast_cache.c", "osm_ucast_dnup.c", "osm_ucast_file.c", "osm_ucast_ftree.c", "osm_ucast_lash.c", "osm_ucast_mgr.c", "osm_ucast_updn.c", "osm_vendor_ibumad.c", "osm_vl15intf.c", "osm_vl_arb_rcv.c", "st.c", "osm_ucast_dfsssp.c", "osm_congestion_control.c", /* Add new module names here... */ /* FILE_ID define in those modules must be identical to index here */ /* last FILE_ID is currently 89 */ 18

OpenSM Log Messages Overview Format: date time [thread ID] log level Feb 19 12:40:45 897693 [91A48700] 0x01 -> ERR number if in message is unique Having OpenSM sources helps Tracking error number in source module shows where generated and can read code and comments IBA spec knowledge is helpful Primarily volume 1 IB management related chapters 19

Cryptic Log Messages Always looking to improve wording of log messages Suggestions are welcome! Most common laments about messages are related to SA multicast and SM MAD timeouts/rejections 20

Thank You OpenFabrics Software User Group Workshop #OFSUserGroup 21