StruxureWare Data Center Operation Troubleshooting Guide Versions 7.4 and 7.4.5
1. Power Tool Tip Troubleshooting
2. Troubleshooting why Capture Index Values do not Display
3. Generating Reports on Linux
4. Troubleshooting Connection and Synchronization Problems
5. Troubleshooting operational problems
6. Troubleshooting Logon Problems
7. Troubleshooting Scan Problems
8. Troubleshooting virtualization issues
9. Troubleshooting Error Messages
10. Troubleshooting Performance Issues

01/27/2015

Power Tool Tip Troubleshooting

If there is an issue with your power configuration, the system warns you by highlighting the problems in the tool tip. In this example, there is an issue with the potential failover load. There might be a problem even if power still appears to be available. Follow the instructions below to troubleshoot.

1. Enable the advanced power tool tip to see details about the phases and feeds.
   a. Select Tools>Preferences to access the Preferences dialog box.
   b. Select Advanced Power.
2. Hover over the rack again and inspect the tool tip.

The tool tip reveals why the configuration does not provide enough power in case of a failover. This is a 3-phase configuration, and phase L1 on the A-Feed UPS would be overloaded in case of a failover. This causes an issue regardless of the available power on the other phases. If the estimated load were also highlighted in red, there would be a similar issue with this value, and the solution would be the same: reconfigure the phase configuration.

See also: What does the Power Tool Tip Tell Me?

Troubleshooting why Capture Index Values do not Display

The cooling simulation updates based on the equipment placement. If your configuration is invalid, the tool tip informs you of the problem. If the cooling calculations do not run, verify that your configuration is not invalid for one of the following reasons:

- The distance between the racks is insufficient (it must be at least 2 feet). Hot air blows from one rack into the front of another rack. To make the cooling configuration valid, use the room layout icons and drag the row border to make the aisle wider.
- The row layout is not well designed. The equipment is placed in pairs directly across from each other but is misaligned by 10 cm or more. To make the cooling configuration valid, drag the equipment closer to each other.
- There is a piece of equipment placed in the aisle between the rows. To make the cooling configuration valid, move the equipment out of the aisle.

See also: Working with Capture Index

Generating Reports on Linux

On some Linux distributions, generating reports might fail due to missing libraries or default settings. If the client crashes during report generation or if this error message (see image) appears, the following steps should fix the issue:

1. Open the StruxureWare Data Center Operation.ini file. The default location of the .ini file is C:\Program Files\StruxureWare Data Center Operation 7.4.5\application.
2. Add the following lines:
   -Dorg.eclipse.swt.browser.DefaultType=mozilla
   -Dorg.eclipse.swt.browser.XULRunnerPath="/path/to/<xulrunner>"

Note: Replace "/path/to/<xulrunner>" with the real path to XULRunner on your machine. If you do not have XULRunner installed, download it first. Version 1.9.x should work in most cases. Otherwise, check which version is supported for your version of Firefox.

See also: StruxureWare Data Center Operation Compatibility with Linux-based Operating Systems; Working with Reports

Troubleshooting Connection and Synchronization Problems

Correcting connection or synchronization problems that may occur when using the mobile device.

Symptom: Error message: Could not connect to server at specified address. Ensure the specified server is running.
Explanation: One or more of the following conditions might exist:
  The specified server is not responding because it is not running.
  The specified server refused the connection request because it is not a Data Center Operation server.
Solution: Contact your Data Center Operation server administrator.

Symptom: Error condition: The mobile device application cannot connect to or synchronize with the server.
Explanation: If the mobile device is on a wireless network, that network may be experiencing problems, or the server may be offline.
Solution: Ensure that the wireless network is enabled. Use the ping command from a desktop computer or through the network connection software on the mobile device to determine whether the server is operating correctly.
Explanation: If the mobile device is in its cradle, one or more physical connections may be loose. Microsoft ActiveSync or Windows Mobile Device Center (WMDC) for Windows Vista might be unavailable or not running.
Solution: Ensure that the cable from the cradle is securely attached and that the mobile device is firmly in the cradle. Ensure that ActiveSync or WMDC is available and running on the computer.

See also: Local and Server Mode

Troubleshooting operational problems

How to correct problems that may occur when performing StruxureWare Data Center Operation: Mobile functions.

Symptom: You experience frequent error messages, or your mobile device interface or the StruxureWare Data Center Operation: Mobile application is frequently unresponsive.
Explanation: You are working in server mode, and the network connection is unstable.
Solution: Check your wireless network connection, or connect through the cradle and switch to local mode to work in a mode that does not require a network connection. (Remember to connect and synchronize regularly.)
Explanation: StruxureWare Data Center Operation: Mobile can run with a variety of other applications, but if other applications are too numerous or too large, insufficient memory can cause errors.
Solution: See your mobile device documentation for instructions on checking memory usage. If memory is insufficient, uninstall unnecessary applications.

See also: StruxureWare Data Center Operation - Mobile

Troubleshooting Logon Problems

Correcting problems that may occur during logon.

Symptom: Any of these error messages:
  You must enter a user name.
  You must enter a password.
  You must enter a server name or IP address.
  You must enter a port number.
Explanation: One of the fields of the StruxureWare Data Center Operation: Mobile logon screen was blank.
Solution: Enter the missing data in the correct field.

Symptom: Either of these error messages:
  The supplied user name/password is not valid.
  The specified user does not have permission to log on to this application.
Explanation: You must have a user account at the Data Center Operation server that includes your user name, your password, and permissions for the user role of StruxureWare Data Center Operation Admin.
Solution: Try to log on again to ensure that you entered your password correctly. If the error message occurs again at your second attempt, contact your StruxureWare Data Center Operation server administrator.

Symptom: Error message: Could not resolve specified hostname.
Explanation: DNS could not resolve the name of the host computer.
Solution: Contact your Data Center Operation server administrator.

Symptom: Error message: You must accept the SSL certificate in order to log on.
Explanation: You selected Use SSL but declined to accept the displayed certificate.
Solution: Try again, and do one of the following as you log on:
  For a high-security logon session, select Use SSL and accept the certificate.
  For a logon session protected only by an unencrypted username and password, deselect Use SSL.

Symptom: Error message: The client and server version do not match. It is advised that you download the client from the specified server.
Explanation: The StruxureWare Data Center Operation client version has been updated to a later version, and you should update the StruxureWare Data Center Operation: Mobile client.
Solution: In the Web browser on the mobile device, specify the URL of the StruxureWare Data Center Operation server and add /frontpage to the address. On the Data Center Operation server's download page, click Download in the Mobile box to download the latest version to the mobile device. Follow the on-screen instructions to install it.

See also: Logging on to StruxureWare Data Center Operation - Mobile

Troubleshooting Scan Problems

How to correct problems that may occur when scanning bar codes.

Symptom: No response to pressing the button to activate the mobile device's scanner.
Explanation: The scanning function of StruxureWare Data Center Operation: Mobile may be in conflict with the scanning function of another application.
Solution: Restart the mobile device and reattempt the scan, or close the other application that is using the scanning function.

See also: Asset Management

Troubleshooting virtualization issues

You can troubleshoot any StruxureWare Data Center Operation: PRO Pack issues in the Event Viewer Application log.

Types of events that you might see:
- Discovery
- Monitoring
- Recovery

Why don't I see all my virtual machine hosts in the list in Microsoft System Center Operations Manager?

The list of virtual machine hosts is not a complete list of all hosts monitored in Microsoft System Center Operations Manager. It is a list of those that have been associated with modeled objects in StruxureWare Data Center Operation. StruxureWare Data Center Operation: PRO Pack retrieves data from StruxureWare Data Center Operation at a specified polling interval. If you do not see all the virtual machine hosts that you expect to see, wait for the specified period of time, and the view will be updated with the latest data:
- Discovery: 4 hours
- Monitoring: 30 seconds

I get many reports of impacted virtual machine hosts. What could be the cause of this?

Ensure the configuration represents the real-world physical environment as accurately as possible. For example, the system reflects the real world more accurately if you configure power connections all the way down to the server level than if you stop at the rack PDU level. If the system is missing the server connection data, it uses the information available. It reports more impacts because it does not know which servers are connected to which rack PDUs. If a rack PDU in a rack is critically impacted by an alarm, the system assumes that all servers connected to that rack PDU are impacted by the alarm. If there are two rack PDUs in a rack, the system assumes that all servers with redundancy connected to these two rack PDUs are impacted by the alarm.

Virtual Machine Manager does not migrate virtual machines to a host. What could be the cause of this?

If a host outside a cluster was previously impacted, StruxureWare Data Center Operation: PRO Pack changed the status of this host from available for placement to unavailable.
Once you have resolved the issue and the host is healthy, you must make it available again: In Virtual Machine Manager, right-click the host in Virtual Machines > Host Groups, select Properties > Status, and select This host is available for placement.

Note: If you are integrating with Virtual Machine Manager 2012, your setup may look a little different from the instructions and images in this user assistance, since these were created for integration with Virtual Machine Manager 2008 R2.

See also: StruxureWare Data Center Operation - PRO Pack

Troubleshooting Error Messages

What do I do if I get a critical error message?

Search for the error message in the Documentation and Community. If you do not find instructions for solving this particular issue, the log files can be helpful when troubleshooting why an error occurred. Follow the steps below or watch this short video tutorial (2:26).

1. Take a screenshot of the application and error message.
2. Write down the installation details (version, build, and serial number). You can access this information in the application under Help>About StruxureWare Data Center Operation.
3. Write down (in steps) what you were doing before the message appeared.
4. Select Help>Download Log Files and save the server and client log files to a location of your choice. Log files are also available from the Webmin server management interface (StruxureWare Data Center Operation>Download server log files).
5. Send this information to technical support:
   a. Error description
   b. Description of what you were doing when the message appeared
   c. Log files (A log folder is created at the location you specified in step 4. It is named with the timestamp, e.g. 2011-09-23--12-57-50.)
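
When several log folders have been downloaded to the same location, it can help to identify the most recent one before sending it to technical support. The following is a minimal sketch only: it assumes that every log folder name follows the pattern of the single example given above (2011-09-23--12-57-50), and the function names are hypothetical helpers, not part of the product.

```python
from datetime import datetime

# Assumption: the folder-name pattern is inferred from the one documented
# example, "2011-09-23--12-57-50" (year-month-day--hour-minute-second).
LOG_FOLDER_FORMAT = "%Y-%m-%d--%H-%M-%S"


def parse_log_folder_name(name):
    """Convert a log folder name to a datetime; raises ValueError on mismatch."""
    return datetime.strptime(name, LOG_FOLDER_FORMAT)


def newest_log_folder(names):
    """Return the most recent matching folder name, or None if none match."""
    dated = []
    for name in names:
        try:
            dated.append((parse_log_folder_name(name), name))
        except ValueError:
            # Skip anything that is not named like a log folder.
            continue
    if not dated:
        return None
    return max(dated)[1]


if __name__ == "__main__":
    folders = ["2011-09-23--12-57-50", "notes", "2011-09-24--08-00-00"]
    print(newest_log_folder(folders))
```

Names that do not match the assumed pattern are ignored rather than treated as errors, so the helper can be pointed at a folder that also contains unrelated files.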

Note: In a clustered environment, you must log on to each node and get the log files.

See also: Troubleshooting

Troubleshooting Performance Issues

On this page:
- Slow server/client connection
- Scaling client memory allocation for large solutions
- 3D showing only white surface
- Advanced server debugging

Slow server/client connection

If the connection between the StruxureWare Data Center Operation server and client is slow, add an exception in the connection options for the StruxureWare Data Center Operation IP address. The StruxureWare Data Center Operation client uses the proxy settings from the operating system. If you are running on Windows, you can configure the settings through the Internet Explorer connection options.

1. In Internet Explorer, select Tools>Internet Options.
2. Select the Connections tab.
3. Click LAN settings.
4. In Proxy server, select Use a proxy server for your LAN.
5. Click Advanced.
6. In Exceptions, add your exceptions, separated by semicolons.

Scaling client memory allocation for large solutions

When you work with large solutions or rooms with many racks or assets, the client might run out of memory (Java heap space error). To increase the number of assets that can be loaded at the same time in the client, you can modify the configuration ini file.

1. Browse to C:\Program Files (x86)\<operation version, e.g. StruxureWare Data Center Operation 7.3>\application
2. Open the StruxureWare Data Center Operation.ini file in a text editor, such as Notepad++.
3. Change the line -Xmx768M, which corresponds to an average of 1200 racks, to -Xmx1500M, which should roughly double the memory (2400 racks).

There is an upper limit to how much memory can be allocated with the -Xmx parameter, and in some cases you will not be able to launch the client. In that case, decrease the value again.

3D showing only white surface

If switching to 3D mode only shows a white surface and the client log file shows the following error:

2014-12-11 10:06:32,190 ERROR [com.apc.threedengine.eclipse.internal.lwjgl.lwjglawtcanvasex] (AWT-EventQueue-0) Unhandled exception occurred, skipping paint()
org.lwjgl.lwjglexception: Failed to find ARB pixel format 1 0
    at org.lwjgl.opengl.windowspeerinfo.nchoosepixelformat(native Method)
    at org.lwjgl.opengl.windowspeerinfo.choosepixelformat(windowspeerinfo.java:52)
    at org.lwjgl.opengl.windowsawtglcanvaspeerinfo.dolockandinithandle(windowsawtglcanvaspeerinfo.java:61)
    at org.lwjgl.opengl.peerinfo.lockandgethandle(peerinfo.java:85)
    at org.lwjgl.opengl.awtglcanvas.paint(awtglcanvas.java:320)
    at sun.awt.repaintarea.paintcomponent(unknown Source)
    at sun.awt.repaintarea.paint(unknown Source)
    at sun.awt.windows.wcomponentpeer.handleevent(unknown Source)
    at java.awt.component.dispatcheventimpl(unknown Source)
    at java.awt.component.dispatchevent(unknown Source)
    at java.awt.eventqueue.dispatcheventimpl(unknown Source)
    at java.awt.eventqueue.access$400(unknown Source)
    at java.awt.eventqueue$2.run(unknown Source)
    at java.awt.eventqueue$2.run(unknown Source)
    at java.security.accesscontroller.doprivileged(native Method)
    at java.security.accesscontrolcontext$1.dointersectionprivilege(unknown Source)
    at java.security.accesscontrolcontext$1.dointersectionprivilege(unknown Source)
    at java.awt.eventqueue$3.run(unknown Source)
    at java.awt.eventqueue$3.run(unknown Source)
    at java.security.accesscontroller.doprivileged(native Method)
    at java.security.accesscontrolcontext$1.dointersectionprivilege(unknown Source)
    at java.awt.eventqueue.dispatchevent(unknown Source)
    at java.awt.eventdispatchthread.pumponeeventforfilters(unknown Source)
    at java.awt.eventdispatchthread.pumpeventsforfilter(unknown Source)
    at java.awt.eventdispatchthread.pumpeventsforhierarchy(unknown Source)
    at java.awt.eventdispatchthread.pumpevents(unknown Source)
    at java.awt.eventdispatchthread.pumpevents(unknown Source)
    at java.awt.eventdispatchthread.run(unknown Source)

You can try to adjust your Windows client PC
settings according to the screenshot, using the Control Panel.

Advanced server debugging

If you experience serious performance issues on the Data Center Operation server, you may be asked to open the server debugging page in Webmin. This page includes advanced information about the health of the server to help troubleshoot the issues.

When you use the StruxureWare DC Operation>Download Log Files option, the system includes the data from the Debug page in the log. But first, you must open the Debug page.

1. Log on to Webmin using the user credentials created during the installation.
2. In the left menu, select StruxureWare DC Operation>Debug.

If the server is running out of memory, you may also be asked to start the JVM logger (option at the top of the Debug page) while you are experiencing issues. The logger slows down the system, so make sure to stop it again after troubleshooting. The data is stored on the server in /var/log/jvmstat.log.

See also: System Requirements
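
The -Xmx change described under "Scaling client memory allocation for large solutions" can also be scripted. The following is a rough sketch under stated assumptions: that the .ini file holds the heap option on a line of its own (as in -Xmx768M), and that the other lines in the sample are illustrative only. The function name set_max_heap is hypothetical. The sketch works on the file's text and leaves reading and writing the actual file, and backing it up first, to the caller.

```python
import re

# Matches a line that consists only of a JVM max-heap option, e.g. "-Xmx768M".
XMX_LINE = re.compile(r"^-Xmx\d+[kKmMgG]?$")


def set_max_heap(ini_text, new_value):
    """Return ini_text with every -Xmx line replaced by -Xmx<new_value>.

    Raises ValueError if no -Xmx line is present, so that a file with an
    unexpected layout is never changed silently.
    """
    lines = ini_text.splitlines()
    replaced = False
    for i, line in enumerate(lines):
        if XMX_LINE.match(line.strip()):
            lines[i] = "-Xmx" + new_value
            replaced = True
    if not replaced:
        raise ValueError("no -Xmx line found")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical file content, for illustration only.
    sample = "-vmargs\n-Xms128M\n-Xmx768M\n"
    print(set_max_heap(sample, "1500M"))
```

If the client no longer launches after the change, the same helper can be used to lower the value again, as the guide advises.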