Page 1 of 1

Cluster

Posted: 2018/03/08 15:33:40
by aich
Hello,
unable to start the 1st resource of fencing name Fence-1, as i have created the stonith for hp server over fence_ilo4.

Full list of resources:

Fence-1 (stonith:fence_ilo4): Started hp02
Fence-2 (stonith:fence_ilo4): Started hp02

Failed Actions:
* Fence-1_start_0 on hp01 'unknown error' (1): call=41, status=Timed Out, exitreason='none',
last-rc-change='Thu Mar 8 18:16:14 2018', queued=0ms, exec=20052ms
* Fence-2_start_0 on hp01 'unknown error' (1): call=47, status=Timed Out, exitreason='none',
last-rc-change='Thu Mar 8 18:21:59 2018', queued=0ms, exec=20004ms
====================================================================================
as per the var log:
Mar 08 18:16:13 [2179] hp01 cib: info: cib_file_write_with_digest: Reading cluster configuration file /var/lib/pacemaker/cib/cib.3kmZ2q (digest: /var/lib/pacemaker/cib/cib.jD5q1d)
Mar 08 18:16:13 [2181] hp01 lrmd: info: process_lrmd_get_rsc_info: Resource 'Fence-1' not found (0 active resources)
Mar 08 18:16:13 [2181] hp01 lrmd: info: process_lrmd_rsc_register: Added 'Fence-1' to the rsc list (1 active resources)
Mar 08 18:16:13 [2184] hp01 crmd: info: do_lrm_rsc_op: Performing key=4:33:7:789140b4-6312-4826-b180-e1aaf9c82348 op=Fence-1_monitor_0
Mar 08 18:16:14 [2184] hp01 crmd: notice: process_lrm_event: Result of probe operation for Fence-1 on hp01: 7 (not running) | call=40 key=Fence-1_monitor_0 confirmed=true cib-update=43
Mar 08 18:16:14 [2179] hp01 cib: info: cib_process_request: Forwarding cib_modify operation for section status to all (origin=local/crmd/43)
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: Diff: --- 0.55.0 2
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: Diff: +++ 0.55.1 (null)
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: + /cib: @num_updates=1
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: ++ /cib/status/node_state[@id='1']/lrm[@id='1']/lrm_resources: <lrm_resource id="Fence-1" type="fence_ilo4" class="stonith"/>
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: ++ <lrm_rsc_op id="Fence-1_last_0" operation_key="Fence-1_monitor_0" operation="monitor" crm-debug-origin="do_update_resource" crm_feature_set="3.0.10" transition-key="4:33:7:789140b4-6312-4826-b180-e1aaf9c82348" transition-magic="0:7;4:33:7:789140b4-6312-4826-b180-e1aaf9c82348" on_node="hp01" call-id="40" rc-code="7" op-status="0" interval="0" last-run="1520513173" last-
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: ++ </lrm_resource>
Mar 08 18:16:14 [2179] hp01 cib: info: cib_process_request: Completed cib_modify operation for section status: OK (rc=0, origin=hp01/crmd/43, version=0.55.1)
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: Diff: --- 0.55.1 2
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: Diff: +++ 0.55.2 (null)
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: + /cib: @num_updates=2
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: ++ /cib/status/node_state[@id='2']/lrm[@id='2']/lrm_resources: <lrm_resource id="Fence-1" type="fence_ilo4" class="stonith"/>
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: ++ <lrm_rsc_op id="Fence-1_last_0" operation_key="Fence-1_monitor_0" operation="monitor" crm-debug-origin="do_update_resource" crm_feature_set="3.0.10" transition-key="6:33:7:789140b4-6312-4826-b180-e1aaf9c82348" transition-magic="0:7;6:33:7:789140b4-6312-4826-b180-e1aaf9c82348" on_node="hp02" call-id="52" rc-code="7" op-status="0" interval="0" last-run="1520512798" last-
Mar 08 18:16:14 [2179] hp01 cib: info: cib_perform_op: ++ </lrm_resource>
Mar 08 18:16:14 [2179] hp01 cib: info: cib_process_request: Completed cib_modify operation for section status: OK (rc=0, origin=hp02/crmd/105, version=0.55.2)
Mar 08 18:16:14 [2184] hp01 crmd: info: do_lrm_rsc_op: Performing key=7:33:0:789140b4-6312-4826-b180-e1aaf9c82348 op=Fence-1_start_0
Mar 08 18:16:14 [2181] hp01 lrmd: info: log_execute: executing - rsc:Fence-1 action:start call_id:41
Mar 08 18:16:14 [2180] hp01 stonith-ng: notice: stonith_device_register: Added 'Fence-1' to the device list (3 active devices)
Mar 08 18:16:19 [2179] hp01 cib: info: cib_process_ping: Reporting our current digest to hp02: 14afa89b4505c35bd0c006040315a0db for 0.55.2 (0x7f9e87d14e50 0)
Mar 08 18:16:34 [2180] hp01 stonith-ng: info: st_child_term: Child 4791 timed out, sending SIGTERM
Mar 08 18:16:34 [2180] hp01 stonith-ng: notice: stonith_action_async_done: Child process 4791 performing action 'monitor' timed out with signal 15
Mar 08 18:16:34 [2180] hp01 stonith-ng: notice: log_operation: Operation 'monitor' [4791] for device 'Fence-1' returned: -62 (Timer expired)
Mar 08 18:16:34 [2181] hp01 lrmd: info: log_finished: finished - rsc:Fence-1 action:start call_id:41 exit-code:1 exec-time:20052ms queue-time:0ms
Mar 08 18:16:35 [2184] hp01 crmd: error: process_lrm_event: Result of start operation for Fence-1 on hp01: Timed Out | call=41 key=Fence-1_start_0 timeout=20000ms

Re: Cluster

Posted: 2018/03/08 16:05:14
by TrevorH
Did you try running fence_ilo4 manually to see if it worked?