Lenovo ThinkServer Solution For Apache Hadoop: Cloudera Installation Guide
First Edition (January 2015)
Copyright Lenovo 2015.
LIMITED AND RESTRICTED RIGHTS NOTICE: If data or software is delivered pursuant to a General Services Administration (GSA) contract, use, reproduction, or disclosure is subject to restrictions set forth in Contract No. GS-35F-05925.
Contents

Chapter 1. Product overview
    Product components
    Operating system and software
Chapter 2. Connecting cables
    Connecting Ethernet cables
    Connecting power cables
Chapter 3. Deploying the product
    Installing hardware
    Configuring RAID
    Installing the operating system
    Installing the management software
    Viewing the configuration information
Chapter 4. Getting information, help, and service
Chapter 5. Documentation
Appendix A. Trademarks
Chapter 1. Product overview

This chapter provides information about the features, software programs, and component locations. This chapter contains the following items:
    Product components
    Operating system and software

Product components

Your product contains the following hardware and software components:

Hardware components

Hardware component                                                         Quantity  Required or optional
Lenovo ThinkServer RD550 server (management node)                          3         Required
Lenovo ThinkServer RD650 server (data node)                                8         Required
Lenovo System Networking RackSwitch G8264T (data network)                  1         Required
Lenovo System Networking RackSwitch G7028 (management network)             1         Required
25U S2 Dynamic Rack Cabinet                                                1         Optional
Enterprise C13 Power Distribution Unit (PDU) with North America Line Cord  1         Optional
Lenovo Universal Rack PDU                                                  2         Optional
RT5kVA 3U Rack or Tower Uninterruptible Power Supply (UPS)
  (200V-240VAC) with North America Line Cord                               1         Optional
Local 1X8 Console Manager (LCM8)                                           1         Optional
USB Four Pack of USB KVM Cables                                            3         Optional
1U 18.5-inch Standard Console                                              1         Optional

To purchase the optional components, go to:
http://www-03.ibm.com/systems/x/hardware/index.html

You can use cable management arms to manage the cables. To purchase the cable management arm, go to:
http://lenovoquickpick.com/usa/home/thinkserver/rack-and-tower-server

Software components

Software component                          Required or recommended
Red Hat Enterprise Linux 6.5                Required
Cloudera Distribution for Hadoop (CDH) 5.2  Required

Operating system and software

The following operating system and software are required in your product:
Operating system: Red Hat Enterprise Linux 6.5
    To purchase and use the Red Hat Enterprise Linux 6.5 operating system, go to http://www.redhat.com/products/enterprise-linux/server/ and follow the instructions on the Web page.

Software: Cloudera Distribution for Hadoop (CDH) 5.2
    You can install the Cloudera Distribution for Hadoop (CDH) 5.2 program on the management nodes and data nodes. To purchase and use Cloudera Distribution for Hadoop (CDH) 5.2, go to http://www.cloudera.com/content/cloudera/en/home.html and follow the instructions on the Web page.

Cloudera Manager CDH 5.2
    The Cloudera Manager CDH 5.2 program is management software. You can use Cloudera Manager CDH 5.2 to maintain, manage, and monitor the product. To install Cloudera Manager CDH 5.2, see "Installing the management software" in Chapter 3.
Chapter 2. Connecting cables

This chapter contains the following information:
    Connecting Ethernet cables
    Connecting power cables

Connecting Ethernet cables

To connect the product to the network, do the following:

1. Connect the servers to the Lenovo System Networking RackSwitch G7028 (24-port) switch to enable the management network. Connect the Ethernet connector for system management (RJ-45) on the rear of the server to the Ethernet connector on the switch.

   Figure 1. Connecting servers to the management network
2. Connect the servers to the Lenovo System Networking RackSwitch G8264T (48-port) switch to enable the data network. Connect the Ethernet connectors on the installed AnyFabric adapter (also called mezzanine adapter) to the Ethernet connectors on the switch.

   Figure 2. Connecting servers to the data network
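
As a quick sanity check on the cabling described above, the following shell sketch counts the switch ports that the 3 management nodes and 8 data nodes need. The node counts and switch sizes come from this guide. The number of data-network connectors cabled per server (DATA_PORTS_PER_SERVER) is an assumption, because the guide does not state it; adjust it to match the installed AnyFabric adapter. The function names are illustrative only.

```shell
#!/bin/sh
# Node counts and switch sizes as listed in this guide.
MGMT_NODES=3       # ThinkServer RD550 management nodes
DATA_NODES=8       # ThinkServer RD650 data nodes
G7028_PORTS=24     # management-network switch
G8264T_PORTS=48    # data-network switch

# ASSUMPTION: data-network connectors cabled per server.
# The guide does not give this number; change it to match your adapter.
DATA_PORTS_PER_SERVER=2

# ports_needed SERVERS PORTS_PER_SERVER -> prints the total ports required
ports_needed() {
    echo $(( $1 * $2 ))
}

# fits NEEDED AVAILABLE -> succeeds if the switch has enough ports
fits() {
    [ "$1" -le "$2" ]
}

# report NAME NEEDED AVAILABLE -> prints a one-line summary for one network
report() {
    if fits "$2" "$3"; then
        echo "$1 network: $2 of $3 switch ports used"
    else
        echo "$1 network: $2 ports needed, but only $3 available"
    fi
}

servers=$(( MGMT_NODES + DATA_NODES ))
# Each server has one system-management (RJ-45) connector.
mgmt_needed=$(ports_needed "$servers" 1)
data_needed=$(ports_needed "$servers" "$DATA_PORTS_PER_SERVER")

report management "$mgmt_needed" "$G7028_PORTS"
report data "$data_needed" "$G8264T_PORTS"
```

With the values shown, the management network uses 11 of the 24 ports and the data network, under the two-connector assumption, uses 22 of the 48 ports.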
Connecting power cables

The following illustration provides information about connecting power cables to the components.

Figure 3. Connecting power cables
(Line cords shown in the figure: C13 to C14 line cord, C19 to C20 Y line cord, C13 to C14 Y line cord)
1  Lenovo System Networking RackSwitch G7028
2  Lenovo System Networking RackSwitch G8264T
3  Lenovo ThinkServer RD550 servers
4  Lenovo ThinkServer RD650 server
5  Enterprise C13 Power Distribution Unit (PDU) with North America Line Cord (optional)
6  Local 1X8 Console Manager (LCM8) (optional)
7  1U 18.5-inch Standard Console (optional)
8  RT5kVA 3U Rack or Tower Uninterruptible Power Supply (UPS) (200V-240VAC) with North America Line Cord (optional)
9  Lenovo Universal Rack PDU (optional)

To purchase the optional components, go to:
http://www-03.ibm.com/systems/x/hardware/index.html
Chapter 3. Deploying the product

This chapter provides information that helps you deploy your product. The chapter contains the following information:
    Installing hardware
    Configuring RAID
    Installing the operating system
    Installing the management software
    Viewing the configuration information

Installing hardware

To install hardware in ThinkServer RD550 and ThinkServer RD650 servers, refer to the User Guide and Hardware Maintenance Manual. You can obtain the most up-to-date User Guide and Hardware Maintenance Manual from the Lenovo Web site at:
http://www.lenovo.com/usermanuals

Configuring RAID

This topic provides information about configuring RAID for ThinkServer RD550 and RD650 servers.

Configuring RAID for ThinkServer RD550 servers (management nodes)

Configure hard disk drives 0 and 1 as RAID 1. These two hard disk drives are used for installing the operating system. Hard disk drives 2 and 3 support JBOD (Just a Bunch of Disks) mode for Cloudera Roles and Services.

Hard disk drive ID  Configuration  Function
Hard disk drive 0   RAID 1         Installing the operating system
Hard disk drive 1   RAID 1         Installing the operating system
Hard disk drive 2   JBOD mode      Cloudera Roles and Services
Hard disk drive 3   JBOD mode      Cloudera Roles and Services

Configuring RAID for ThinkServer RD650 servers (data nodes)

Configure hard disk drives 0 and 1 as RAID 1. These two hard disk drives are used for installing the operating system. Hard disk drives 2 to 11 support JBOD mode for Cloudera Data.

Hard disk drive ID  Configuration  Function
Hard disk drive 0   RAID 1         Installing the operating system
Hard disk drive 1   RAID 1         Installing the operating system
Hard disk drive 2   JBOD mode      Cloudera Data
Hard disk drive 3   JBOD mode      Cloudera Data
Hard disk drive 4   JBOD mode      Cloudera Data
Hard disk drive 5   JBOD mode      Cloudera Data
Hard disk drive 6   JBOD mode      Cloudera Data
Hard disk drive 7   JBOD mode      Cloudera Data
Hard disk drive 8   JBOD mode      Cloudera Data
Hard disk drive 9   JBOD mode      Cloudera Data
Hard disk drive 10  JBOD mode      Cloudera Data
Hard disk drive 11  JBOD mode      Cloudera Data

For more information about how to configure and manage RAID, refer to the ThinkServer 12 Gb/s MegaRAID SAS Software User Guide. This document is available on the Lenovo Web site at:
http://www.lenovo.com/support

Installing the operating system

To install the Red Hat Enterprise Linux 6.5 operating system on both management nodes and data nodes, do one of the following:

Install the operating system manually.
    To install the operating system manually, refer to the related information in the ThinkServer RD550 and RD650 Operating System Installation Guide. This document is available on the Lenovo Web site at:
    http://www.lenovo.com/usermanuals

Use the Lenovo ThinkServer Deployment Manager program to automate the operating system installation process.
    To use Lenovo ThinkServer Deployment Manager to install the operating system, do the following:
    1. Turn on the server. Press F10 as soon as you see the logo screen. Then, wait for several seconds. Deployment Manager opens.
    2. Read and accept the license agreement. Then, select the language in which you want to view the program.
    3. Select Deployment on the left pane. Then, follow the on-screen instructions to install the operating system.

    Notes:
    Select Full Installation when installing the operating system.
    It is recommended that you use the following partition sizes on the RAID 1 volume:
        For ThinkServer RD550 servers:
            ROOT: 1 895 300 MB
            BOOT: 1024 MB
            SWAP: 10 240 MB
        For ThinkServer RD650 servers:
            ROOT: 3 803 000 MB
            BOOT: 1024 MB
            SWAP: 10 240 MB

Installing the management software

Note: Before installing the management software, install the operating system on all management nodes and data nodes.
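
One way to confirm that a node meets this prerequisite is to inspect its release file. The sketch below assumes that Red Hat Enterprise Linux records its version in /etc/redhat-release, which is the usual location. The function names are illustrative and are not part of any Lenovo or Cloudera tool.

```shell
#!/bin/sh
# is_rhel65 RELEASE_STRING
# Succeeds if the string identifies Red Hat Enterprise Linux release 6.5.
is_rhel65() {
    case "$1" in
        "Red Hat Enterprise Linux"*"release 6.5"*) return 0 ;;
        *) return 1 ;;
    esac
}

# check_local_node [FILE]
# Reads the release file (default: /etc/redhat-release) and reports
# whether this node runs the operating system that the guide requires.
check_local_node() {
    release_file=${1:-/etc/redhat-release}
    if [ ! -r "$release_file" ]; then
        echo "cannot read $release_file"
        return 1
    fi
    release=$(cat "$release_file")
    if is_rhel65 "$release"; then
        echo "OK: $release"
    else
        echo "unexpected operating system: $release"
        return 1
    fi
}
```

Run check_local_node on each management node and data node; a non-zero exit status means that the node is not yet ready for the management software.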
To install the latest version of the Cloudera Manager CDH 5.2 program on the management node, do the following:

1. Connect the management node to the Internet.
2. Use the following command lines to install the program:

       wget http://archive.cloudera.com/cm5/installer/latest/cloudera-manager-installer.bin
       chmod u+x cloudera-manager-installer.bin
       sudo ./cloudera-manager-installer.bin

For detailed information about installing and upgrading Cloudera Manager CDH 5.2, go to http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/installation.html.

Viewing the configuration information

You can use the Lenovo ThinkServer System Manager (TSM) program to view the configuration information of the management nodes and the data nodes. The default user name and password are as follows:

    Username = lenovo
    Password = len0vo

For detailed information about using the Lenovo ThinkServer System Manager program, refer to the help system of the program. You can also download the ThinkServer System Manager User Guide from the Lenovo Web site at:
http://www.lenovo.com/support
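
The three Cloudera Manager installation commands in "Installing the management software" can be wrapped in a small script that stops at the first failure. This is only a sketch: the URL is the one given in this guide and might have moved since publication, and the function names and the "run" argument are illustrative conventions, not part of the Cloudera installer.

```shell
#!/bin/sh
# Installer location as given in this guide.
INSTALLER_URL="http://archive.cloudera.com/cm5/installer/latest/cloudera-manager-installer.bin"

# installer_name URL -> prints the file name under which wget saves the download
installer_name() {
    basename "$1"
}

# install_cloudera_manager
# Downloads the installer, makes it executable, and runs it with sudo.
# Each step runs only if the previous one succeeded.
install_cloudera_manager() {
    file=$(installer_name "$INSTALLER_URL")
    wget "$INSTALLER_URL" || { echo "download failed"; return 1; }
    chmod u+x "$file" || return 1
    sudo "./$file"
}

# Start the installation only when the script is called with the
# argument "run", so that loading the file has no side effects.
if [ "${1:-}" = "run" ]; then
    install_cloudera_manager
fi
```

Save the script on the management node under a name of your choice and start it with the argument run, for example: sh install-cm.sh run (the file name is hypothetical).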
Chapter 4. Getting information, help, and service

This chapter contains information about help, service, and technical assistance for your product.

Solution support is available only when the components listed in "Product components" in Chapter 1 are used in your product. Lenovo provides service and support for the hardware components listed in the component list. When contacting Lenovo or Cloudera support, be prepared to provide the model and serial number information of the hardware components.

For hardware installation and troubleshooting, go to the Lenovo Support Web site at:
http://www.lenovo.com/support

This Web site is updated with the latest support information such as the following:
    Drivers and software
    Diagnostic solutions
    Product and service warranty
    Product and parts details
    User guides and manuals
    Knowledge base and frequently asked questions

For Cloudera software installation and troubleshooting, do the following:
1. Register your product with Cloudera by following the instructions at:
   http://support.cloudera.com
2. Be prepared to provide the Cloudera Cluster diagnostic log bundle of Cloudera Manager 5.2.

For service and support on the operating system, go to:
http://www.redhat.com/products/enterprise-linux/server/
Chapter 5. Documentation

The documentation helps you install, use, and maintain the product. You can find the most up-to-date documentation for your product from the following Web sites:

http://www.lenovo.com/usermanuals
    You can find and download the following documentation for Lenovo ThinkServer RD550 and RD650 servers:
        User Guide and Hardware Maintenance Manual
        Operating System Installation Guide

http://www.lenovo.com/support
    You can find and download the ThinkServer 12 Gb/s MegaRAID SAS Software User Guide.

http://www.cloudera.com/content/cloudera/en/documentation.html#clouderadocumentation
    You can view and download the documentation for Cloudera Distribution for Hadoop (CDH) 5.2 and Cloudera Manager CDH 5.2.
Appendix A. Trademarks

Lenovo, the Lenovo logo, and ThinkServer are trademarks of Lenovo in the United States, other countries, or both.

Linux is a registered trademark of Linus Torvalds.

Red Hat and Red Hat Enterprise Linux are registered trademarks of Red Hat, Inc. in the U.S. and other countries.

Other company, product, or service names may be trademarks or service marks of others.