第 2 回 Linux-HA Japan 勉強会

Similar documents

レッドハット製品プライスリスト Red Hat Enterprise Linux2013 新製品 (ベースサブスクリプション) 更新 :2015 年 4 22

レッドハット製品プライスリスト Red Hat Enterprise Linux 製品 (RHEL for HPC) 更新 :2015 年 4 22

CRM CLI (command line interface) tool

Cost Accounting 1. B r e a k e v e n A n a l y s i s. S t r a t e g y I m p l e m e n t a t i o n B a l a n c e d S c o r e c a r d s

Document and entity information

Discover the power of reading for university learners of Japanese in New Zealand. Mitsue Tabata-Sandom Victoria University of Wellington

Advanced Training Program for Teachers of the Japanese-Language Application Instructions For FY 2016

Graduate Program in Japanese Language and Culture (Master s Program) Application Instructions

Direct Marketing Production Printing & Value-Added Services: A strategy for growth

医学生物学一般問題 ( 問題用紙 1 枚解答用紙 2 枚 )

この外国弁護士による法律事務の取扱いに関する特別措置法施行規則の翻訳は平

さくらインターネット研究所上級研究員日本 Vyattaユーザー会運営委員松本直人

ใบสม ครเข าร วมโครงการน ส ตแลกเปล ยนมหาว ทยาล ยเกษตรศาสตร

Data Mining for Risk Management in Hospital Information Systems

Public Financial Assistance for Formal Education in Japan

How To Teach English At Kumon

Panasonic AC-DC Power Supply Design Support Service

Bushido: Way of the Samurai. Golden Screens. Chokuo TAMURA Hawks with pine and plum blossom x cm each

育デジ (Iku-Digi) Promoting further evolution of digital promotion

<HNAS_SNMP 監視取扱説明書別紙 2 イベントリスト> Event ID

Electricity Business Act ( Act No. 170 of July 11, 1964)

レッドハット製品プライスリスト標準価格. Red Hat Enterprise Linux 製品 (RHEL Server)

Overview: Clustering MySQL with DRBD & Pacemaker

TS-3GB-S.R v1.0 VoIP Supplementary Services Descriptions: Call Forwarding - Unconditional

Why today? What s next?

EDB-Report. 最新 Web 脆弱性トレンドレポート( ) ~ Exploit-DB( 公開されている内容に基づいた脆弱性トレンド情報です

Upper Secondary Education in Japan

Bluetooth FAQ. This document is an FAQ (Frequently Asked Questions) about Bluetooth in general and Logitech products using Bluetooth technology.

California Subject Examinations for Teachers

Teacher Training and Certificate System

Current Situation of Research Nurse Education and Future Perspectives in Japan

外部委託が進む中での地方自治体職員の雇用の保護 PROTECTION OF EMPLOYMENT FOR LOCAL GOVERNMENT WORKERS UNDER OUTSOURCING

How To Run A Server On A Microsoft Cloud Server (For A Small Amount Of Money)

Agenda. About Gengo. Our PostgreSQL usage. pgpool-ii. Lessons

SPECIFICATION. Not recommend for new design. MS6M Power Integrated Module 7MBR15UF060 MS6M Fuji Electric Device Technology Co.,Ltd.

Preschool Education and Care in Japan

Abbreviations. Learning Goals and Objectives 学習到達目標. Structure of Presentation

HIF 2015 Japanese Language and Japanese Culture Program

BULBOUS CELL MEDIA GROUP

Welcome Guide. SD/TF Card Reader. Model: AR200

JF Standard for Japanese-Language Education 2010

Network Camera SNC-Z20N/Z20P 設置説明書 JP. Installation Manual GB. Manuel d installation FR. Manual de instalación ES お買い上げいただきありがとうございます

For victims of traffic accidents

Wicket Hiroto Yamakawa

Most EFL teachers in Japan find that there are groups of verbs which consistently

7 myths about cars and free trade agreements

Development of Risk Assessment Methodology for Impact Based Business Continuity Management ビジネスインパクトを考慮した事業継続マネジメントのためのリスクアセスメント手法の開発

Ritsumeikan University Global 30 Project AY 2012 Follow-up

MPLS Configration 事例

EFL Information Gap Activities for Architecture Majors

Network Camera SNC-CS3N/CS3P 設置説明書 JP. Installation Manual GB. Manuel d installation FR. Manual de instalación ES お買い上げいただきありがとうございます

オープンソース NFV プラットフォームの取り組み

GRADUATE SCHOOL OF INTERNATIONAL RELATIONS

Cash, Receivables & Marketable Securities

Ibaraki University. Graduate School of Agriculture. (Master s Program)

Visitors International

New Publication of IIMA Global Market Volatility Index

My experience of Ruby Education in Taiwan

Hello, Chicago Okinawa Kenjinkai members,

第 9 回仮想政府セミナー Introduction Shared Servicesを考える ~Old but New Challenge~ 東京大学公共政策大学院奥村裕一 2014 年 2 月 21 日

LEAVING CERTIFICATE 2011 MARKING SCHEME JAPANESE HIGHER LEVEL

Gender Equality in Education in Japan

Chiba Institute of Technology Graduate School

JALT2009. The 35th JALT International Conference on Language Teaching and Learning The Teaching-Learning Dialogue: An Active Mirror

Addressing Japan s Healthcare Challenges with Information Technology

Digital Graphic Printer

How Should Remote Monitoring Sensor Be Accurate?

The Course of Study is the series of guidelines for subject

Research on Support for Persons with Mental Disability to Make Transition to Regular Employment

AW Travelling Light

Act on Special Measures Concerning Nuclear Emergency Preparedness (Act No. 156 of December 17, 1999)

Industrial Accident Compensation Insurance Application Guidance for Foreign Workers <Volume 2>

Design Patterns. Elemental Microformats. Compound Microformats KEY. Datetime Pattern <abbr class="foo"

Absolute Beginner s Guide to Hiragana (With an Introduction to Grammar and Kanji)

AUTOMOBILE LIABILITY SECURITY ACT

Educational Administration in Japan

Utilization & Promotion Activities of Cloud in Japan

Green Solution with Simegy

Interest in the relationship between religion and language

Posting and Reading Messages (PC version) 1/4

As the use of computers is becoming a more integral part of learning environments,

IHS Technology. IHS Technology Business Intelligence Enabling market leadership through research, analysis and strategy

Design Act ( Act No. 125 of 1959)

Using the Moodle Reader Module to Facilitate an Extensive. Reading Program

Linux Foundation Automotive Summit - Yokohama, Japan

Sompo Japan Nipponkoa Group's CSR Case Report

Christ Episcopal Church Sei Ko Kai

Welcome Guide. Ultra Compact Bluetooth Keyboard

The existence of credential effects in Canadian nursing education; Quebec versus the rest of Canada

Changing Views on Motivation in a Globalizing World

Corporate Management Strategy of a Life Insurance Company

Unwillingness to Use Social Networking Services for Autonomous Language Learning among Japanese EFL Students

2016 July 5. August 16

CIRCULAR. Block-4, Old JNU Campus New Mehrauli Road, New Delhi-67 Dated: the to" of September 2014

Electrical engineering have fun with building your own devices and travelling the world. Iwate Prefectural Mizusawa High School Sebastian DIEBOLD

Studies of Large-Scale Data Visualization and Visual Data Mining

Employment Security Act (Act No. 141 of 1947)

Transcription:

第 2 回 Linux-HA Japan 勉強会 2011 年 6 月 3 日 NTTデータ先端技術株式会社池田淳子

本日のお題 crm の歩き方目次 1.crm シェルとは 2.crm シェルのインストール 3.crm シェルでリソースを設定する 4.crm シェルでクラスタを管理する

1. crm シェルとは

crm シェルとは acemakerに付属したコマンドラインツールですリソースの設定やクラスタの管理を行うことができます

acemaker の設定ファイル /var/lib/heartbeat/crm/cib.xml <cib validate-with="pacemaker-1.0" crm_feature_set="3.0.1" have-quorum="1" dc-uuid="22222222-2222-2222-2222-222222222222" dmin_epoch="0" epoch="7" num_updates="0" cib-last-written="tue May 31 14:33:00 2011"> <configuration> <crm_config> <cluster_property_set id="cib-bootstrap-options"> <nvpair id="cib-bootstrap-options-dc-version" name="dc-version" value="1.0.11-db98485d06ed stable-1.0 tip"/> <nvpair id="cib-bootstrap-options-cluster-infrastructure" name="cluster-infrastructure" value="heartbeat"/> <nvpair id="cib-bootstrap-options-no-quorum-policy" name="no-quorum-policy" value="ignore"/> <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/> <nvpair id="cib-bootstrap-options-startup-fencing" name="startup-fencing" value="false"/> </cluster_property_set> </crm_config> <nodes> <node id="11111111-1111-1111-1111-111111111111" uname="xen01" type="normal"/> <node id="22222222-2222-2222-2222-222222222222" uname="xen02" type="normal"/> </nodes> <resources> <group id="grpha"> <primitive class="ocf" id="prmfs" provider="heartbeat" type="filesystem"> <instance_attributes id="prmfs-instance_attributes"> <nvpair id="prmfs-instance_attributes-fstype" name="fstype" value="ext3"/> <nvpair id="prmfs-instance_attributes-run_fsck" name="run_fsck" value="force"/> <nvpair id="prmfs-instance_attributes-device" name="device" value="/dev/sda1"/> <nvpair id="prmfs-instance_attributes-directory" name="directory" value="/mnt"/> </instance_attributes> <operations> <op id="prmfs-start-0s" interval="0s" name="start" on-fail="restart" timeout="480s"/> <op id="prmfs-monitor-10s" interval="10s" name="monitor" on-fail="restart" timeout="60s"/> </rsc_location> <rsc_colocation id="colocation-01" rsc="grpha" score="infinity" with-rsc="clningd"/> <rsc_order first="clningd" id="order-01" score="0" then="grpha"/> </constraints> <rsc_defaults> <meta_attributes id="rsc-options"> <nvpair id="rsc-options-resource-stickiness" name="resource-stickiness" value="infinity"/> <nvpair id="rsc-options-migration-threshold" name="migration-threshold" value="1"/> </meta_attributes> </rsc_defaults> </configuration> </cib>

Heartbeat v2 時代はかなり頑張って cib.xml を作ったり編集したりしていました (これがイヤで v1モードを使っている人も多いのでは ) またクラスタの状態を変更するためには crm_resourceやcibadminなど複雑な管理コマンドを使いこなす必要がありました acemaker を使用したクラスタ環境では crm シェルが使えます XMLの構文に悩まされることなくリソースの設定ができます crmシェルがこっそり cib.xml を編集してくれます管理コマンドがうろ覚えでもクラスタの管理ができます crmシェルがこっそり管理コマンドを呼び出してくれます対話モードとバッチモードが使えます TABキーでコマンドやオプションの補完ができますクラスタが起動していなくても使えますクラスタの管理はできませんがいろいろ遊べます

2. crm シェルのインストール

acemaker がインストールされた環境であれば crm シェルはすぐに使い始めることができます # which crm /usr/sbin/crm # file /usr/sbin/crm /usr/sbin/crm: python script text executable # rpm -qlp pacemaker-1.0.10-1.4.el6.x86_64.rpm grep "/usr/sbin/crm$" /usr/sbin/crm # crm crm(live)# help This is the CRM command line interface program. Available commands: cib manage shadow CIBs resource resources management リソースの管理を行います node nodes management ノードの管理を行います options user preferences configure CRM cluster configuration クラスタの設定や管理を行います ra resource agents information center RAの情報を表示します status show cluster status quit,bye,exit exit the program help show help end,cd,up go back one level

勉強会で使用する試験環境 Host OS(Domain0) OS RHEL 5.6 x86_64 (xen) Dell precision 370 Intel(R) entium(r) 4 CU 3.20GHz x2 Memory 2GB Guest (/dev/sda10) eth0 Guest (/dev/sda11) eth0 Guset OS(DomainU) OS RHEL 5.6 x86_64 eth1 eth2 eth1 eth2 インターコネクトLAN Cluster software pacemaker 1.0.11 cluster-glue 1.0.7 resource-agents 1.0.4 heartbeat 3.0.4 pm-extras 1.02-1 /dev/sda eth3 eth3 /dev/sda サービスLAN CU x1 Memory 512MB (/dev/sda12) 共有ディスクのふり (tap:aio)

勉強会で使用する試験環境 # cat /etc/ha.d/ha.cf pacemaker on logfacility local1 debug 0 udpport 694 keepalive 2 warntime 20 deadtime 24 initdead 48 bcast eth1 bcast eth2 node xen01 node xen02 watchdog /dev/watchdog respawn root /usr/lib64/heartbeat/ifcheckd

3. crm シェルでリソースを設定する

# vi sample.crm property no-quorum-policy="ignore" stonith-enabled="false" startup-fencing="false" rsc_defaults resource-stickiness="infinity" migration-threshold="1" Linux-HA Japanのサイトでもよく見る crm ファイルでもそもそも property って? primitive? group? group grpha prmfs prmi clone clningd prmingd primitive prmfs ocf:heartbeat:filesystem params fstype="ext3" run_fsck="force" device="/dev/sda1" directory="/mnt" op start interval="0s" timeout="480s" on-fail="restart" op monitor interval="10s" timeout="60s" on-fail="restart" op stop interval="0s" timeout="60s" on-fail="block

リソースの設定制約の設定 crm(live)# configure crm(live)configure# help Commands for resources are: - `primitive` - `monitor` - `group` - `clone` - `ms`/`master` (master-slave) There are three types of constraints: - `location` - `colocation` - `order` 設定関連はとりあえず configure! Finally, there are the cluster properties, resource meta attributes defaults, and operations defaults. クラスタの設定 - `property` - `rsc_defaults` - `op_defaults`

primitive 1コのリソースです例 ) 仮想 I ファイルシステムデータベース RA(Resource Agent)を登録します RAに必要なパラメータ (params) を設定します RAに必要なオペレーション (op) を設定します設定例 primitive prmi ocf:heartbeat:iaddr2 params ip="192.168.201.129" nic="eth3" cidr_netmask="24" op start interval="0s" timeout="60s" on-fail="restart" op monitor interval="10s" timeout="60s" on-fail="restart" op stop interval="0s" timeout="60s" on-fail="block"

でも RAに必要なパラメータとかオペレーションとかわからない

# crm crm(live)# ra RAの情報表示 crm(live)ra# list usage: list <class> [<provider>] crm(live)ra# help list crm(live)ra# list ocf さらに help crm(live)ra# info vmware Manages VMWare Server 2.0 virtual machines (ocf:heartbeat:vmware) OCF compliant script to control vmware server 2.0 virtual machines. arameters (* denotes required, [] the default): vmxpath* (string): VMX file path VMX configuration file path vimshbin (string, [/usr/bin/vmware-vim-cmd]): vmware-vim-cmd path vmware-vim-cmd executable path Operations' defaults (advisory minimum): start timeout=600 stop timeout=600 monitor interval=300 timeout=30

group 1コ以上のリソースを登録します start/stop の順序が保証されます group まるごと F/O できます設定例 group sister kana kayo グループの登録するリソースの名前グループの名前グループの登録するリソースの名前

crm(live)configure# property > no-quorum-policy="ignore" > stonith-enabled="false" > startup-fencing="false" crm(live)configure# primitive > prmi ocf:heartbeat:iaddr2 > params > ip="192.168.201.129" > nic="eth3" > cidr_netmask="24" > op start interval="0s" timeout="60s" on-fail="restart" > op monitor interval="10s" timeout="60s" on-fail="restart" > op stop interval="0s" timeout="60s" on-fail="block" crm(live)configure# show crm(live)configure# commit crm_verify[7851]: 2011/06/03_11:41:44 WARN: unpack_nodes: Blind faith: not fencing unseen nodes crm(live)configure# cd crm(live)# resource crm(live)resource# show prmi (ocf::heartbeat:iaddr2) Stopped STONITHを設定していないと警告がでるけどとりあえず無視 crm(live)resource# start prmi

crm(live)resource# cd crm(live)# configure crm(live)configure# primitive > kana ocf:pacemaker:dummy > op start interval="0s" timeout="100s" on-fail="restart" > op monitor interval="10s" timeout="100s" on-fail="restart" > op stop interval="0s" timeout="100s" on-fail="block" crm(live)configure# commit crm_verify[2699]: 2011/06/06_14:13:37 WARN: unpack_nodes: Blind faith: not fencing unseen nodes crm(live)configure# cd crm(live)# resource show prmi (ocf::heartbeat:iaddr2) Started kana (ocf::pacemaker:dummy) Started

crm(live)# configure crm(live)configure# edit node $id="11111111-1111-1111-1111-111111111111" xen01 node $id="22222222-2222-2222-2222-222222222222" xen02 primitive kana ocf:pacemaker:dummy op start interval="0s" timeout="100s" on-fail="restart" op monitor interval="10s" timeout="100s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="block" primitive kayo ocf:pacemaker:dummy op start interval="0s" timeout="100s" on-fail="restart" op monitor interval="10s" timeout="100s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="block" primitive prmi ocf:heartbeat:iaddr2 params ip="192.168.201.129" nic="eth3" cidr_netmask="24" op start interval="0s" timeout="60s" on-fail="restart" op monitor interval="10s" timeout="60s" on-fail="restart" op stop interval="0s" timeout="60s" on-fail="block" property $id="cib-bootstrap-options" dc-version="1.0.11-db98485d06ed stable-1.0 tip" cluster-infrastructure="heartbeat" no-quorum-policy="ignore" stonith-enabled="false" startup-fencing="false!wq edit の結果を書き込みます ( 今回の例では vi ですが好みのエディタに変更することも可能です) edit 操作は予想外の変更もクラスタに反映されてしまうことがあるため注意して実行してください編集には後述の rename, delete コマンドが推奨されていますコピー!

crm(live)configure# cd crm(live)# resource show prmi (ocf::heartbeat:iaddr2) Started kana (ocf::pacemaker:dummy) Started kayo (ocf::pacemaker:dummy) Started crm(live)# resource stop kayo crm(live)# resource stop kana crm(live)# resource show prmi (ocf::heartbeat:iaddr2) Started kana (ocf::pacemaker:dummy) Stopped kayo (ocf::pacemaker:dummy) Stopped crm(live)# configure crm(live)configure# group sister kana kayo

crm(live)configure# show node $id="11111111-1111-1111-1111-111111111111" xen01 node $id="22222222-2222-2222-2222-222222222222" xen02 primitive kana ocf:pacemaker:dummy op start interval="0s" timeout="100s" on-fail="restart" op monitor interval="10s" timeout="100s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="block" meta target-role="stopped" primitive kayo ocf:pacemaker:dummy op start interval="0s" timeout="100s" on-fail="restart" op monitor interval="10s" timeout="100s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="block" meta target-role="stopped" primitive prmi ocf:heartbeat:iaddr2 params ip="192.168.201.129" nic="eth3" cidr_netmask="24" op start interval="0s" timeout="60s" on-fail="restart" op monitor interval="10s" timeout="60s" on-fail="restart" op stop interval="0s" timeout="60s" on-fail="block" group sister kana kayo property $id="cib-bootstrap-options" dc-version="1.0.11-db98485d06ed stable-1.0 tip" cluster-infrastructure="heartbeat" no-quorum-policy="ignore" stonith-enabled="false" startup-fencing="false" crm(live)configure# commit crm_verify[2899]: 2011/06/06_14:21:11 WARN: unpack_nodes: Blind faith: not fencing unseen nodes

crm(live)configure# cd crm(live)# resource start sister crm(live)# show ERROR: syntax: show crm(live)# resource show prmi (ocf::heartbeat:iaddr2) Started Resource Group: sister kana (ocf::pacemaker:dummy) Started kayo (ocf::pacemaker:dummy) Started

crm(live)configure# cd crm(live)# resource stop prmi crm(live)# configure rename usage: rename <old_id> <new_id> crm(live)# configure rename prmi monja crm_verify[3041]: 2011/06/06_14:24:09 WARN: unpack_nodes: Blind faith: not fencing unseen nodes crm(live)# resource start monja crm(live)# resource show Resource Group: sister kana (ocf::pacemaker:dummy) Started kayo (ocf::pacemaker:dummy) Started monja (ocf::heartbeat:iaddr2) Started configureレベルの外から configureの変更を実施すると自動的に commit されてしまうので注意してください!

crm(live)# resource stop monja crm(live)# configure delete monja crm_verify[3897]: 2011/06/06_14:28:09 WARN: unpack_nodes: Blind faith: not fencing unseen nodes crm(live)# configure show node $id="11111111-1111-1111-1111-111111111111" xen01 node $id="22222222-2222-2222-2222-222222222222" xen02 primitive kana ocf:pacemaker:dummy op start interval="0s" timeout="100s" on-fail="restart" op monitor interval="10s" timeout="100s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="block" primitive kayo ocf:pacemaker:dummy op start interval="0s" timeout="100s" on-fail="restart" op monitor interval="10s" timeout="100s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="block" group sister kana kayo meta target-role="started" property $id="cib-bootstrap-options" dc-version="1.0.11-db98485d06ed stable-1.0 tip" cluster-infrastructure="heartbeat" monja の設定が削除されています no-quorum-policy="ignore" stonith-enabled="false" startup-fencing="false"

crm(live)# resource stop sister crm(live)# configure crm(live)configure# erase crm(live)configure# commit crm(live)configure# show node $id="11111111-1111-1111-1111-111111111111" xen01 node $id="22222222-2222-2222-2222-222222222222" xen02 クラスタの設定が全て削除されます delete と erase は注意して使い分けてください

ACT SBY group ACT group F/O SBY group

(1) ACT SBY shutdown group (1) 故障したACTノードを停止 (2) SBYで2 番目のprimitiveが故障 (3) SBYで2,3 番目のprimitiveが停止この場合 1 番目のprimitiveは起動継続する (2) (3) SBY group 仕様ホントは1 番目も停止してほしいところですがあんどりゅーくんとの闘いに敗れました SBY group

clone 1コの primitive または group を登録します設定例 clone clningd prmingd clone-max="2" clone-node-max="1" clone を配置するノード数デフォルト値 :クラスタ内のノード数 1ノードあたりの clone の数デフォルト値 :1

clone :0 clone :1 :1 インスタンス番号 clone :0 clone :1 ペアの clone が停止しても継続起動します

3ノード以上でもOK clone :0 clone :1 clone :2 clone :0 clone :1 全ノードに配置しなくてもOK

Master/Slave 1コの primitive または group を登録します設定例 ms msdrbd grpdrbd meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true" globally-unique="false"

Master Slave Master promote Master Slave promote Slave Master demote

両ノードとも Master でも OK Master Master Master

制約の設定 location colocation order 配置制約同居制約順序制約

location ( 配置制約 ) crm(live)configure# location usage: location <id> <rsc> {node_pref rules} node_pref :: <score>: <node> rules :: rule [id_spec] [$role=<role>] <score>: <expression> [rule [id_spec] [$role=<role>] <score>: <expression>...] id_spec :: $id=<id> $id-ref=<id> score :: <number> <attribute> [-]inf expression :: <simple_exp> [bool_op <simple_exp>...] bool_op :: or and simple_exp :: <attribute> [type:]<binary_op> <value> <unary_op> <attribute> date <date_expr> type :: string version number binary_op :: lt gt lte gte eq ne unary_op :: defined not_defined

location ( 配置制約 ) node1 group node2 group どちらのノードを優先? group

location ( 配置制約 ) node1 node2 grpha location の名前配置制約を設定したいリソース名 location location-01 grpha rule 200: #uname eq node1 rule 100: #uname eq node2 スコア値ノード名

location ( 配置制約 ) node1 node2 grpha location location-02 grpha rule -INFINITY: #uname eq node2 node2で起動させたくない場合は -INFINITYを設定すればよい

colocation ( 同居制約 ) crm(live)configure# colocation usage: colocation <id> <score>: <rsc>[:<role>] <rsc>[:<role>] 制約に関連するリソース名制約対象となるリソース名 colocation の名前 colocation nakayoshi-colocation INFINITY: kana kayo kana は kayo が起動しているノードに同居できる colocation nakawarui-colocation -INFINITY: kana kayo kana は kayo が起動しているノードに同居できない

order ( 順序制約 ) crm(live)configure# order usage: order <id> score-type: <first-rsc>[:<action>] <then-rsc>[:<action>] [symmetrical=<bool>] order の名前起動する順番にリソース名を並べる order order-01 0: kana kayo kana が起動した後に kayo が起動する停止順序は起動順序の逆となりますただし symmetrical=falseを設定すると起動順序と停止順序は同じになります symmetricalのデフォルト値はtrueです

location, colocation, order の設定例 xen01 clningd pingd:0 xen02 clningd pingd:1 grpha Filesystem IAddr2 clningd が起動した後に grpha を起動させたい order grpha は clningd と同じノードで起動させたい colocation grpha は xen01 で優先的に起動させたい location clningdの属性値が設定されていないまたは clnindの属性値が100より小さい場合 grphaはそのノードで起動してはならない location

設定例 < 省略 > group grpha prmfs prmi clone clningd prmingd meta clone-max="2" clone-node-max="1" primitive prmfs ocf:heartbeat:filesystem < 省略 > primitive prmi ocf:heartbeat:iaddr2 < 省略 > primitive prmingd ocf:pacemaker:pingd params name="default_ping_set" host_list="192.168.201.81" multiplier="100" dampen="1" op start interval="0s" timeout="90s" on-fail="restart" op monitor interval="10s" timeout="60s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="ignore" location location-01 grpha rule 200: #uname eq xen01 rule 100: #uname eq xen02 rule -INFINITY: not_defined default_ping_set or default_ping_set lt 100 colocation colocation-01 INFINITY: grpha clningd order order-01 0: clningd grpha clningdの属性値が設定されていないまたは clnindの属性値が100より小さい場合は起動しない(-INFINITY)

property クラスタ全体の設定 crm(live)configure# property usage: property [$id=<set_id>] <option>=<value> res_defaults リソース全体の設定 crm(live)configure# rsc_defaults usage: rsc_defaults [$id=<set_id>] <option>=<value> op_defaults オペレーション全体の設定 crm(live)configure# op_defaults usage: op_defaults [$id=<set_id>] <option>=<value>

設定例 ### Cluster Option ### property no-quorum-policy="ignore" stonith-enabled="false" startup-fencing="false" ### Resource Defaults ### rsc_defaults resource-stickiness="infinity" migration-threshold="1" STONITHを有効にする場合は stonith-enabled= true stonith-enabled= true の場合 op stop on-fail= fence stonith-enabled= false の場合 op stop on-fail= block Linux-HA Japan 月刊あんどりゅーくん(6 月号 )の知恵袋を参照時間の都合上説明を省略します

バッチモード用のファイルを作成する dummy.crm ( 拡張子は.crm でなくてもよい) ### Cluster Option ### property no-quorum-policy="ignore" stonith-enabled="false" startup-fencing="false" ### Resource Defaults ### rsc_defaults resource-stickiness="infinity" migration-threshold="1" ### rimitive Configuration ### primitive prmdummy ocf:pacemaker:dummy op start interval="0s" timeout="100s" on-fail="restart" op monitor interval="10s" timeout="100s" on-fail="restart" op stop interval="0s" timeout="100s" on-fail="block" location location-01 prmdummy rule 200: #uname eq xen01 rule 100: #uname eq xen02

バッチモードでリソースを設定するファイル名 [root@xen01 hastudy]# crm configure load update dummy.crm crm_verify[6592]: 2011/06/01_16:06:12 WARN: unpack_nodes: Blind faith: not fencing unseen nodes [root@xen01 hastudy]# crm_mon -1 -A ============ Last updated: Wed Jun 1 16:06:50 2011 Stack: Heartbeat Current DC: xen02 (22222222-2222-2222-2222-222222222222) - partition with quorum Version: 1.0.11-db98485d06ed stable-1.0 tip 2 Nodes configured, unknown expected votes 1 Resources configured. ============ Online: [ xen01 xen02 ] STONITHを設定していないと警告がでるけどとりあえず無視 prmdummy (ocf::pacemaker:dummy): Started xen01 Node Attributes: * Node xen01: + xen02-eth1 : up + xen02-eth2 : up * Node xen02: + xen01-eth1 : up + xen01-eth2 : up ifcheckd をインストールして ha.cfに設定しておくとインターコネクトLAN の情報も表示できます(crm_mon -A)

4. crm シェルでクラスタを管理する

(1) リソースの起動停止 # crm crm(live)# resource crm(live)resource# show Resource Group: grpha prmfs (ocf::heartbeat:filesystem) Started prmi (ocf::heartbeat:iaddr2) Started Clone Set: clningd Started: [ xen01 xen02 ] crm(live)resource# start usage: start <rsc> crm(live)resource# stop usage: stop <rsc> crm(live)resource# stop prmi crm(live)resource# start prmi crm(live)resource# stop prmfs crm(live)resource# start prmfs crm(live)resource# stop clningd crm(live)resource# start clningd clone はインスタンス番号を指定できません

(2) リソースの移動 # crm crm(live)# resource crm(live)resource# show Resource Group: grpha prmfs (ocf::heartbeat:filesystem) Started prmi (ocf::heartbeat:iaddr2) Started Clone Set: clningd Started: [ xen01 xen02 ] crm(live)resource# move usage: migrate <rsc> [<node>] [<lifetime>] [force] crm(live)resource# move prmi xen02 force crm(live)resource# unmove usage: unmigrate <rsc> crm(live)resource# unmove prmi move force は移動元のノードにスコア値 (-INFINITY INFINITY)を追加するため実行時に警告が表示されます move force でリソースを移動させた後は移動元のスコアを正常に戻すため必ず unmove unmove を実行してください

(3) 故障からの復旧 # ip addr del 192.168.201.129/24 dev eth3 # crm_mon i1 -f Online: [ xen01 xen02 ] Resource Group: grpha prmfs (ocf::heartbeat:filesystem): Started xen02 prmi (ocf::heartbeat:iaddr2): Started xen02 Clone Set: clningd Started: [ xen01 xen02 ] Migration summary: * Node xen02: * Node xen01: prmi: migration-threshold=1 fail-count=1 Failed actions: prmi_monitor_10000 (node=xen01, call=41, rc=7, status=complete): not running

# crm crm(live)# resource crm(live)resource# failcount usage: failcount <rsc> set <node> <value> failcount <rsc> delete <node> failcount <rsc> show <node> crm(live)resource# show Resource Group: grpha prmfs (ocf::heartbeat:filesystem) Started prmi (ocf::heartbeat:iaddr2) Started Clone Set: clningd Started: [ xen01 xen02 ] crm(live)resource# failcount prmi show xen01 scope=status name=fail-count-prmi value=1 crm(live)resource# failcount prmi delete xen01 crm(live)resource# failcount prmi show xen01 scope=status name=fail-count-prmi value=0 crm(live)resource# quit bye

# crm resource failcount prmi show xen01 scope=status name=fail-count-prmi value=0 バッチモード crmシェルがこっそり呼び出しているコマンドを表示 # crm -R crm(live)# resource failcount prmi show xen01.in: resource failcount prmi show xen01.ext crm_failcount -r 'prmi' -N 'xen01' G scope=status name=fail-count-prmi value=0

pingd によるサービスLAN 切断検知 # iptables -A INUT -i eth3 -j DRO; iptables -A OUTUT -o eth3 -j DRO # crm_mon i1 -Af Online: [ xen01 xen02 ] Resource Group: grpha prmfs (ocf::heartbeat:filesystem): Started xen02 prmi (ocf::heartbeat:iaddr2): Started xen02 Clone Set: clningd Started: [ xen01 xen02 ] スコア値が100 0 Node Attributes: * Node xen01: + default_ping_set : 0 : Connectivity is lost + xen02-eth1 : up + xen02-eth2 : up * Node xen02: + default_ping_set : 100 + xen01-eth1 : up + xen01-eth2 : up location 設定に従ってF/O Migration summary: * Node xen02: * Node xen01: fail-count は増加しない

# iptables -F # crm_mon i1 -Af Online: [ xen01 xen02 ] Resource Group: grpha prmfs (ocf::heartbeat:filesystem): Started xen02 prmi (ocf::heartbeat:iaddr2): Started xen02 Clone Set: clningd Started: [ xen01 xen02 ] Node Attributes: * Node xen01: + default_ping_set : 100 + xen02-eth1 : up + xen02-eth2 : up * Node xen02: + default_ping_set : 100 + xen01-eth1 : up + xen01-eth2 : up Migration summary: * Node xen02: * Node xen01: スコア値が0 100

(4) ノードのスタンバイ化 /オンライン化 crm(live)# node crm(live)node# help standby Set a node to standby status. The node parameter defaults to the node where the command is run. Additionally, you may specify a lifetime for the standby---if set to `reboot`, the node will be back online once it reboots. `forever` will keep the node in standby after reboot. Usage:... standby [<node>] [<lifetime>] lifetime :: reboot forever... ノード名を指定しなかった場合は crmシェルを実行中のノードがスタンバイ化

おわり