Troubleshooting Guide for BIOS POST on 13 th Generation of Dell PowerEdge Servers

Similar documents
Lifecycle Controller Platform Update/Firmware Update in Dell PowerEdge 12th Generation Servers

UEFI on Dell BizClient Platforms

DELL. Unified Server Configurator: IT and Systems Management Overview. A Dell Technical White Paper

DELL. Unified Server Configurator Security Overview. A Dell Technical White Paper. By Raja Tamilarasan, Wayne Liles, Marshal Savage and Weijia Zhang

Remote Installation of VMware ESX Server Software Using Dell Remote Access Controller

Manage Dell Hardware in a Virtual Environment Using OpenManage Integration for VMware vcenter

A White Paper By: Dr. Gaurav Banga SVP, Engineering & CTO, Phoenix Technologies. Bridging BIOS to UEFI

Building Microsoft Windows Server 2012 Clusters on the Dell PowerEdge VRTX

Dell EqualLogic Red Hat Enterprise Linux 6.2 Boot from SAN

Dell PowerEdge T130 Owner's Manual

Agent-free Inventory and Monitoring for Storage and Network Devices in Dell PowerEdge 12 th Generation Servers

Computer Setup User Guide

Dell idrac7 with Lifecycle Controller

Dell PowerEdge T110 II Systems. Owner s Manual. Regulatory Model E11S Series Regulatory Type E11S002

Intel RAID Controller Troubleshooting Guide

Enhanced Diagnostics Improve Performance, Configurability, and Usability

User Guide for Windows 10

Monthly Specification Update

UEFI Driver Development Guide for All Hardware Device Classes

System Area Manager. Remote Management

Dual-Core Processors on Dell-Supported Operating Systems

Intel Rapid Storage Technology

Using Red Hat Network Satellite Server to Manage Dell PowerEdge Servers

Universal Serial Bus Implementers Forum EHCI and xhci High-speed Electrical Test Tool Setup Instruction

Dell ControlPoint Security Manager

Type Message Description Probable Cause Suggested Action. Fan in the system is not functioning or room temperature

High Performance Tier Implementation Guideline

Intel Server Board Platform Confidence Test Installation and Operating Instructions

Exploring the Remote Access Configuration Utility

How to Setup and Configure ESXi 5.0 and ESXi 5.1 for OpenManage Essentials

Configuring ThinkServer RAID 100 on the Lenovo TS430

Monthly Specification Update

Intel Rapid Start Technology (FFS) Guide

EUCIP IT Administrator - Module 2 Operating Systems Syllabus Version 3.0

Using Dell Server Diagnostic Tools

BIOS Update Release Notes

Systems Management Tools And Documentation Version 8.1 Installation Guide

Getting Started. rp5800, rp5700 and rp3000 Models

Intel Server Board S3420GPV

Dell Lifecycle Controller Graphical User Interface Version For 13 th Generation Dell PowerEdge Servers User's Guide

User Guide Software Version 2.1.0

Intel RAID Software v6.x (and newer) Upgrade/Installation Procedures

Remote Supervisor Adapter II. User s Guide

ThinkServer RD550 and RD650 Operating System Installation Guide

Intel Entry Storage System SS4000-E

HP BUSINESS NOTEBOOK PC F10 SETUP OVERVIEW

Guide to Installing BBL Crystal MIND on Windows 7

Getting Started. HP Retail Systems

Foglight Experience Monitor and Foglight Experience Viewer

Intel Matrix Storage Console

Proactively Managing Servers with Dell KACE and Open Manage Essentials

BIOS Update Release Notes

ThinkServer RS140 Operating System Installation Guide

Intel Server Control User s Guide

Dell PowerEdge Blades Outperform Cisco UCS in East-West Network Performance

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

Data Migration: Moving from Dell PowerVault MD3000i/MD3000 to MD3200i/MD3220i and MD3600i/MD3620i Series Storage Arrays

Specification Update. January 2014

ThinkServer RD540 and RD640 Operating System Installation Guide

Dell PowerEdge R730 Owner's Manual

A6210 WiFi USB Adapter ac USB 3.0 Dual Band User Manual

Using GIGABYTE Notebook for the First Time

BIOS Update Release Notes

Configuring and Using AMT on TS140 and TS440

System i and System p. Customer service, support, and troubleshooting

Dell Compellent Storage Center

Reference Architecture for Dell VIS Self-Service Creator and VMware vsphere 4

Dell PowerEdge R610 Systems Hardware Owner s Manual

ThinkServer RD350 and RD450 Operating System Installation Guide

Cisco FlexFlash: Use and Manage Cisco Flexible Flash Internal SD Card for Cisco UCS C-Series Standalone Rack Servers

Dell FlexAddress for PowerEdge M-Series Blades

Configuring Dell OpenManage IT Assistant 8.0 to Monitor SNMP Traps Generated by VMware ESX Server

The SATA II PCIe RAID is an ultra high-speed two channel Serial ATA Generation II RAID controller for use in PCI Express enabled systems.

How To Compare Two Servers For A Test On A Poweredge R710 And Poweredge G5P (Poweredge) (Power Edge) (Dell) Poweredge Poweredge And Powerpowerpoweredge (Powerpower) G5I (

Reboot the ExtraHop System and Test Hardware with the Rescue USB Flash Drive

Intel System Event Log (SEL) Viewer Utility. User Guide SELViewer Version 10.0 /11.0 December 2012 Document number: G

Gigabyte Management Console User s Guide (For ASPEED AST 2400 Chipset)

System Release Notes Express5800/320LB System Release Notes

Feature Comparison: idrac 7 & 8 and idrac8 License Chart

Intel Active Management Technology with System Defense Feature Quick Start Guide

Dell PowerEdge C System Management

DESKTOP. Internal Drive Installation Guide

Intel Embedded Virtualization Manager

Phoenix SecureCore TM Setup Utility

VMware ESX 2.5 Server Software Backup and Restore Guide on Dell PowerEdge Servers and PowerVault Storage

Dell Server Management Pack Suite Version 6.0 for Microsoft System Center Operations Manager User's Guide

Processor Reorder Buffer (ROB) Timeout

Intel Server Board S3420GPRX Intel Server System SR1630GPRX Intel Server System SR1630HGPRX

Intel 810 and 815 Chipset Family Dynamic Video Memory Technology

SATA II PCIe RAID Quick Installation Guide

Dell PowerEdge R630 Owner's Manual

Dell DR4000 Disk Backup System. Introduction to the Dell DR4000 Restore Manager A primer for creating and using a Restore Manager USB flash drive

or later or later or later or later

Intel Server Board S5000PALR Intel Server System SR1500ALR

Installing the Operating System or Hypervisor

Configuring ThinkServer RAID 100 on the TS140 and TS440

Computer Setup (F10) Utility Guide HP Compaq dx2200 Microtower Business PC

Advanced VMware Training

Transcription:

Troubleshooting Guide for BIOS POST on 13 th Generation of Dell PowerEdge Servers Wei Liu Dell Server BIOS Development September 2014 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Revisions Date August 2014 Description Initial draft THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED AS IS, WITHOUT EXPRESS OR IMPLIED WARRANTIES OF ANY KIND. 2014 Dell Inc. All rights reserved. Reproduction of this material in any manner whatsoever without the express written permission of Dell Inc. is strictly forbidden. For more information, contact Dell. Dell, the DELL logo, and the DELL badge are trademarks of Dell Inc. Intel, the Intel Logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Microsoft, Windows, and Windows Server are registered trademarks of Microsoft Corporation in the United States and/or other countries. Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Dell disclaims any proprietary interest in the marks and names of others. 2 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Table of contents Revisions... 2 Executive summary... 4 1. BIOS Splash Screen Display... 4 2. POST Error and Warning Messages... 6 3. Post Code in idrac Web GUI... 9 4. Driver Health Status Report... 10 5. Dell Diagnostics (epsa)... 12 6. Red Screen of Death (RSOD)... 14 7. Yellow Screen of Death (YSOD)... 16 3 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Executive summary The Unified Extensible Firmware Interface (UEFI) is a set of industry-standard firmware interfaces that is designed to replace the legacy BIOS to support modern operating systems and hardware architectures. Dell has been shipping UEFI support in the BIOS since the 11 th generation of PowerEdge servers through a UEFI-over-Legacy model, where it is the legacy BIOS that initializes the whole system and loads the UEFI layer at the end of Power-On Self-Test (POST) if needed. The Dell Lifecycle Controller technology is built upon UEFI as well. The BIOS on the 13 th generation of Dell PowerEdge servers is now a native UEFI implementation, with a Compatibility Support Module (CSM) to provide legacy BIOS interfaces to support operating systems that are not UEFI-aware. The look and feel of the boot process is dramatically different from the previous generations. This guide provides troubleshooting solution for possible issues that may arise during POST and pre-boot environment on the 13 th generation of PowerEdge servers. 1. BIOS Splash Screen Display After the system is powered on, the Dell server BIOS may get to video display almost instantly. Fig. 1 is a sample snapshot of the POST splash screen. The text next to the progress bar on the bottom of the screen indicates various phases of POST. The text can aid in troubleshooting issues that happen during the system boot process. The following table lists the currently supported progress texts in the BIOS: Text Display Phase of the Boot Process Initializing Intel QuickPath Interconnect... BIOS performs an early initialization of the chipset, processors, and QPI interfaces. Configuring Memory BIOS initializes the system memory. Loading BIOS Drivers BIOS starts the Driver Execution Environment (DXE) phase, loads and executes DXE drivers to perform additional chipset, processor and hardware initializations. Initializing idrac BIOS waits for idrac to become ready. This phase may take more than a few seconds on the first AC power on of the system. Initializing idrac Done idrac initialization has completed. Initializing PCIe, USB and Video Start of PCI enumeration and detection of USB keyboard devices. Initializing PCIe, USB and Video Done PCI and USB enumeration has completed. Legacy PCI option ROM initialization (BIOS boot mode only) Applies to the BIOS boot mode only. The onscreen display varies, depending on the type of PCIe cards that are installed in the system. Testing Memory (X% Complete) Software-based memory test phase. A percent progress. Note: The memory test is disabled in the BIOS setup by default. 4 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Testing Memory Done [No Errors] Testing Memory Done [Errors Encountered] Testing Memory Aborted Loading Lifecycle Controller Drivers Loading Lifecycle Controller Drivers Done Initializing Firmware Interfaces Running In-System Characterization... Connecting iscsi device(s) Enumerating Boot options Enumerating Boot options Done Entering Lifecycle Controller Lifecycle Controller: Applying Updates or Setting System Configuration Lifecycle Controller: Collecting System Inventory Lifecycle Controller: Done Booting Memory test completed without any issue. Memory test has found error(s). Memory test was aborted by pressing <ESC> or spacebar. BIOS loads the Lifecycle Controller drivers. BIOS has finished loading the Lifecycle Controller drivers. BIOS connects the UEFI drivers to the device handles. The UEFI drivers from add-in PCIe cards are expected to be installed in this phase. In-System Characterization (ISC) is in progress. the UEFI iscsi device drivers are connected. This display applies to UEFI boot mode only. It gets displayed when an iscsi boot device(s) has been configured. BIOS starts to enumerate Boot Options in the system. The enumeration of Boot Options has completed. The system is booting into the Lifecycle Controller. An Automated Task Application is being scheduled in the Lifecycle Controller. Lifecycle Controller is collecting system inventory for this boot. Lifecycle Controller has finished execution. BIOS has finished POST and is giving control to the operating system. 5 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 1 POST splash screen and progress bar 2. POST Error and Warning Messages The BIOS on the 13 th generation of PowerEdge servers can display informational, warning and error messages during POST to help you troubleshoot various issues. If the error occurs early in POST, such as during memory initialization, then a pop-up message box with a detailed description of the issue (e.g. Fig. 2) may be displayed on the screen. 6 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 2 An error message box in early POST If the issue is detected at a later time in POST, corresponding error and warning messages aredisplay ed on the screen with a UEFIxxxx prefix. An event entry is logged in the Lifecycle Controller log (LC log) as well. Depending on the severity of the error/warning, the system may proceed with continuing boot, or prompt with F1/F2/F10/F11 for user input, or reset, or halt. The message comprisesof two parts, the error/warning message itself, and a recommended response action. You can follow the corresponding recommended response action to address the issue. For a complete list of POST error and warning messages, see the Event and Error Message Reference Guide for 13 th Generation Dell PowerEdge Servers. In the following example, the UEFI driver for the Integrated Network card is not signed. The user has just turned on Secure Boot in BIOS setup utility. In the next boot, a few error messages are displayed on the screen during POST. - The first error message (UEFI0072) displays that the UEFI driver from the Integrated NIC 1 Port 1 Partition 1 was not loaded because it failed the Secure Boot authentication. You may address this issue by updating the NIC firmware to a version that supports the UEFI driver signing. 7 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

- The second error message (UEFI0071) displays that the previously configured UEFI network boot interface is no longer available. This is a result of the corresponding UEFI driver not being loaded. - The third warning message (UEFI0074) displays that the Secure Boot policy has been modified since the last time the system was booted. In this particular example, the user enabled Secure Boot on purpose, so no action needs to be taken. Fig. 3 An example of POST error messages Corresponding logs for the error and warning messages will be recorded in the Lifecycle Log (Fig. 4). 8 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 4 Screen shot of the Lifecycle Log 3. Post Code in idrac Web GUI In case you cannot get to the screen display, the Post Code feature available in the idrac web GUI may come handy. This page displays the last system POST code with a descriptive text. POST code helps to detect pre-video hangs, report fatal errors, and analyze system failures during POST. 9 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 5 An example of the Post Code in the idrac Web GUI 4. Driver Health Status Report The UEFI specification defines a Driver Health Protocol (DHP). The DHP provides services allowing a UEFI driver to express health status of a controller, return status messages associated with the health status, perform repair operations if necessary and request configuration changes to place the controller back in a usable state. Dell server BIOS checks the driver health status of each UEFI driver in the system, and displays the status messages. The BIOS may invoke the repair and configuration utility if a repair or reconfiguration operation is required. In most cases, you can follow the instructions on the screen to proceed. Fig. 6 is an example display where the BIOS halts on some errors returned from DHP. In this particular example, the idrac DHP detected that the backplane 2 power cable has been disconnected; The LSI SAS controller requires configuration changes, possibly due to a catastrophic issue. 10 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 6 Example of errors detected by UEFI Driver Health Protocol The following (Fig. 7) is a snapshot of the Driver Health Manager in the case when a driver requires configuration change. The Driver Health Manager lists all the device instances that require reconfiguration. You can select each one of them and follow the instructions on the screen to configure the devices. 11 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 7 Driver Health Manager 5. Dell Diagnostics (epsa) Dell Enhanced Pre-Boot System Diagnostics (epsa) are diagnostics tests that are embedded in the system (Fig. 8). These tests allow you to check the hardware health status outside the operating system environment. The findings of this diagnostics can assist you in troubleshooting the fault and working toward a resolution to the issue. The epsa can be launched from the Boot Manager-> System Utilities-> Launch Diagnostics (Fig. 9). 12 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 8 Sample screen shot of epsa 13 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 9 Launching diagnostics from Boot Manager 6. Red Screen of Death (RSOD) The Dell server BIOS implements an enhanced CPU exception handler (RSOD) which aids the user and tech support to analyze the software exception when the system crashes in the pre-boot UEFI environment. The debug information is displayed on the screen and additional information and stack traces can be retrieved through the serial port (if available). You can save the dump and use it for debugging offline. 14 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

A sample RSOD display is depicted in Fig. 10. Fig. 10 An example of the RSOD screen shot When an exception is raised by the processor the BIOS displays the RSOD screen with the following information related to the exception. The exception type, such as Page Fault, General Protection Fault, Divide by Zero, Breakpoint, and so on. A Dell-defined error value, pre-fixed with UEFIxxxx. Note a corresponding error will be logged to the LC log as well. Partial register set (x86 64bit). Last-Branch records and associated module names if available. Current RIP and Faulting driver module name Stack trace back from faulted module. Additional information is available from the serial port dump. To retrieve the serial dump, you can connect the server to a client system with a null modem cable and use any terminal program (for example, Putty or HyperTerminal) with the baud rate set to 115200 bps, then press <ENTER>. The serial dump can be retrieved from Serial over LAN (SOL) method as well. 15 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Note: The RSOD serial dump can be obtained at the point of failure. The serial session does not have to be started prior to the RSOD. RSOD are usually caused by software issues, and may be resolved by updating the BIOS, Lifecycle Controller, or the UEFI firmware for PCIe cards. You may send the screen shot and serial dump to Dell support for further analysis, should you encounter a RSOD even after all the firmware updates. 7. Yellow Screen of Death (YSOD) When a hardware error occurs during UEFI pre-boot environment (excluding CSM phase in BIOS boot mode), the Dell server BIOS may display a Yellow Screen of Death (YSOD) with some of the software contexts at the time when the issue is detected. The hardware errors include Nonmaskable Interrupt (NMI) and Machine Check Errors (MCE). You should check the System Event Log (SEL) to identify the source and type of the error. Update the corresponding device firmware if the error is originated from a PCIe device. Note: The stack trace displayed on the YSOD screen only provides some context information before the failure, and not the source of the problem. A sample YSOD is depicted in Fig. 11. 16 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers

Fig. 11 An example of the YSOD screen shot 17 Troubleshooting Guide for BIOS POST on Dell 13 th Generation of PowerEdge Servers