Navigating the Rescue Mode for Linux




SUPPORT GUIDE: DEDICATED SERVERS

ABOUT THIS GUIDE

This document takes you through the process of booting your Linux server into rescue mode to identify and fix the problem(s) that may be causing it to be unresponsive. This guide will instruct you on how to:

- Log into rescue mode
- Identify disk partitions
- Detect physical disk problems
- Detect and fix file system errors
- Access and recover data

Logging into rescue mode

If your Linux dedicated server is unresponsive and fails to come online after a reboot, you can boot the server into rescue mode from the Tagadab control panel to identify and fix the problem.

1. Once rescue mode has been started on your dedicated server, log in to the system via SSH using your server's usual IP address and the root password that was set when the system was first installed (you can find this in your Tagadab control panel). You can also access the server in graphical mode using VNC if you have a VNC client installed. Please be aware that the rescue mode system will have a different SSH host key to your normal server. If you are using PuTTY you will see a warning like Screen 1:

Screen 1

2. Accept the warning by clicking the 'Yes' button, then log in. If you are using SSH from a Linux or Mac shell, you may need to remove the old SSH host key from your known_hosts file before logging in (for example, with 'ssh-keygen -R' followed by the server's IP address). Once you have finished with rescue mode and booted your server normally, it will return to using its usual SSH host key and you will see a similar warning again. You should see a window similar to Screen 2 once you are logged in:

Screen 2

Identifying your disk partitions

1. Identify your disk partitions before recovering your system. Get a list of all of the disks connected to the system and their partitions by running the command 'fdisk -l' as shown in Screen 3:

Screen 3

2. The exact output will vary depending on the number of disks in your server, the number of partitions on each disk, and whether or not your system uses software RAID. Screen 3 shows one disk (/dev/sda) that contains four partitions (numbered 1, 2, 5 and 6). The first partition (/dev/sda1) is marked as bootable, so this would be the partition mounted under /boot. The second partition (/dev/sda2) is an extended partition and is only used as a container for the other two partitions. It is not mountable. The third partition (/dev/sda5) is the swap space, and the fourth partition (/dev/sda6) is the root partition, normally mounted as /. If your server has two disks, the output will look something like Screen 4:

Screen 4

If your system uses software RAID, it will look something like Screen 5:

Screen 5

3. If your system uses software RAID, there are additional steps you will need to take before attempting to fix disk issues or access your data. Please refer to the separate RAID instructions in the following sections. If no disks are displayed (or an incorrect number of disks is displayed), the disk(s) may have already suffered a catastrophic failure. In that event, you will need to ask Tagadab Support to arrange for a replacement disk or server and then restore any backups.

Detecting physical disk problems

1. Your disk(s) may have physical errors that cannot be corrected and would require a disk replacement. You can use the smartctl program to test the disk to see if this is the case. First, check that the disk has its SMART capability enabled with the command 'smartctl -i /dev/diskname', swapping diskname for the correct device as shown in Screen 6. This command should be successful as all Tagadab disks have SMART enabled. If this command does not successfully return information for the disk(s), a catastrophic failure may have occurred and the disk(s) will need to be replaced.

Screen 6

2. Run a test on the disk using 'smartctl -t short /dev/diskname'. Further options are available (use 'man smartctl' to see them). You will see a message that the test will take around one minute to complete, as shown in Screen 7:

Screen 7

3. After waiting a minute, use 'smartctl -a /dev/diskname' to see the results, displayed as a table with the number of errors of each type that have occurred over the disk's lifetime. The example in Screen 8 does not show any major errors:

Screen 8

4. Look out for a high error count next to any of the errors with the type 'Pre-fail', as these may be an indication that the disk is going to fail soon. If any of your disks have this type of error, please contact Tagadab Support.

5. Smartctl can be used on systems with multiple disks by running the above sequence of commands for each disk (not each partition).

RAID Instructions

There are no separate instructions required for this section.

Detecting and fixing file system errors

1. Your server may fail to boot if there are errors with the file system. You can identify and correct these errors using the fsck tool. For example, if you have seen errors in the system logs indicating problems on the root partition (/dev/sda6 as shown in Screen 9), you can try to correct this by running the command 'fsck /dev/sda6'. This must be done before the partition has been mounted.

Screen 9

2. In Screen 9, there are a few minor errors that fsck has fixed. For more severe errors, fsck may prompt you to confirm each fix. To avoid being prompted and simply accept the default options, run the fsck command with the -a flag. Further details are available from the fsck manual (type 'man fsck').

3. If you fix any disk errors, exit rescue mode and attempt to boot the system normally. If the system still fails to boot, or fsck can't fix the disk errors, you may need to recover any data that you did not back up (see the section on recovering data).

RAID Instructions

Perform fsck on the RAID device rather than on the member partitions to check the file system on both disks simultaneously. The RAID device will likely be either /dev/md0 or /dev/md1, whichever is the larger (the smaller RAID device will be swap space). In Screen 10, minor errors have been corrected.

Screen 10

Accessing your data

1. If your disks did not show any errors, or you know your system failed to boot for reasons unrelated to the disks (e.g., an incorrectly enabled firewall, an incorrectly modified grub configuration, etc.), you will need to access your disk(s) to either correct the problem or recover the data before reimaging. To do this, the disk(s) need to be mounted.

2. From earlier steps, you should have already established the root partition. In our one-disk example shown in Screen 11, it is /dev/sda6, and in our RAID example in Screen 12 it is /dev/md0. For servers with multiple disks, you may want to access the partition on the second disk, although problems that prevent a server booting will normally be on the partition mounted at /.

3. To access the data on the root partition, create a mount point for the partition. For our one-disk system, it will be created at /mnt/sda6. We then mount the partition on this mount point, and cd into the directory to view our system as shown below in Screen 11:

Screen 11

RAID Instructions

Create a mount point at /mnt/md0, mount the RAID device there and cd into the directory as shown in Screen 12. You can now view and edit your files using standard Linux tools (such as less, cat, vi, nano).

Screen 12

Chroot

1. Use the chroot command to change the root of the rescue system to the root on the disk. This is needed if, for example, you want to use the 'passwd' program to reset one of your system passwords.

2. Use 'chroot mountpoint' to change the root to the partition you have mounted. In Screen 13 we used 'chroot /mnt/sda6' or 'chroot /mnt/md0'. You may see an error such as:

chroot: failed to run command `/bin/zsh': No such file or directory

This indicates that the zsh shell used by the rescue system is not installed on your dedicated server. In this case, modify the command to run the bash shell: 'chroot mountpoint bash'

3. Finally, run any remaining commands (such as passwd), and use exit to come out of the chroot.
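The mount and chroot steps above can be sketched as a small shell function. This is only an illustration under stated assumptions: the function name enter_system is hypothetical (it is not part of the rescue system), the mount point is derived from the device name as in the examples above (/dev/sda6 becomes /mnt/sda6), and bash is requested explicitly to avoid the zsh error described in step 2.

```shell
# Hypothetical helper (an illustration, not a tool shipped with rescue mode):
# mount a root partition and open a bash shell inside it.
enter_system() {
    # Device to mount, e.g. /dev/sda6 or /dev/md0
    device="$1"
    # Derive the mount point from the device name: /dev/sda6 -> /mnt/sda6
    mountpoint="/mnt/$(basename "$device")"
    # Create the mount point, mount the partition, then change root.
    # Each step runs only if the previous one succeeded.
    mkdir -p "$mountpoint" &&
        mount "$device" "$mountpoint" &&
        chroot "$mountpoint" bash
}
```

Run as root in rescue mode, 'enter_system /dev/sda6' (or 'enter_system /dev/md0' on a software RAID system) should leave you in a shell on your own system, where commands such as passwd can be run. Type exit to return to the rescue system.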

Screen 13

Recovering your data

If you are unable to fix your server, you will need to copy any data that is not backed up before requesting a reimage from the Tagadab control panel. If you have access to another server that runs FTP or SSH, use the command line FTP or SCP tools to upload your data to that server. Otherwise, you can connect an SCP client (such as WinSCP for Windows) to the rescue mode server, navigate to the point where you mounted the disk and download the data to your local system.

Screen 14
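Uploading data to another server over SSH might look like the following sketch. The function name upload_data, the account user@backup.example.com and the destination path /backups/ are placeholders chosen for illustration, not values provided by Tagadab. Substitute the details of a server you control.

```shell
# Hypothetical helper: copy a file or directory from the mounted disk to
# another server with scp. -r copies directories recursively; -p preserves
# modification times and permissions.
upload_data() {
    src="$1"    # what to copy, e.g. /mnt/sda6/home
    dest="$2"   # where to copy it, e.g. user@backup.example.com:/backups/
    # Stop early if the source is missing, which usually means the
    # partition has not been mounted yet.
    if [ ! -e "$src" ]; then
        echo "upload_data: $src not found (is the disk mounted?)" >&2
        return 1
    fi
    scp -r -p "$src" "$dest"
}
```

For example, 'upload_data /mnt/sda6/home user@backup.example.com:/backups/' would copy the home directories from the mounted root partition. Run this from the rescue system itself rather than from inside the chroot, so that the /mnt/sda6 path is valid.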

Unmount

When you have finished making changes, unmount the disk, and end rescue mode by rebooting the server from the control panel as shown in Screen 15. If necessary, reimage the server via the Tagadab control panel.

Screen 15

Need Further Assistance?

Please e-mail us at support@tagadab.com or call us on 020 3697 5537, Monday through Friday from 8am to 6pm GMT.