can not start messaging server resource group in cluster 3.2
800747Sep 26 2008 — edited Jan 30 2009Hi all,
Please help in the following issue.
I am not able to start resource group (msg-rg) and following is the error:
ms1@root# clrg online -M -e msg-rg
clrg: (C748634) Resource group msg-rg failed to start on chosen node and might fail over to other node(s)
clrg: (C135343) No primary node could be found for resource group msg-rg; it remains offline
scstat output (remove some for brief description)
-------------------
-- Device Group Servers --
Device Group Primary Secondary
------------ ------- ---------
Device group servers: SJMS ms1 ms2
-- Device Group Status --
Device Group Status
------------ ------
Device group status: SJMS Online
-- Resource Groups and Resources --
Group Name Resources
---------- ---------
Resources: msg-rg mail msg-hasp-rs msg-rs
-- Resources --
Resource Name Node Name State Status Message
------------- --------- ----- --------------
Resource: mail ms1 Offline Offline - LogicalHostname offline.
Resource: mail ms2 Offline Offline - LogicalHostname offline.
Resource: msg-hasp-rs ms1 Offline Offline
Resource: msg-hasp-rs ms2 Offline Offline
Resource: msg-rs ms1 Offline Offline - Stop Succeeded
Resource: msg-rs ms2 Offline Offline - Stop Succeeded
Following is the from /var/adm/messages (remove some for brief description)
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <ims_svc_start> for resource <msg-rs>, resou
rce group <msg-rg>, node <ms1>, timeout <300> seconds
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource msg-rs status on node ms1 change to R_FM_UNKNOWN
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource msg-rs status msg on node ms1 change to <Starting>
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/r
gm/rt/hafoip/hafoip_monitor_start>:tag=<msg-rg.mail.7>: Calling security_clnt_connect(..., host=<ms1>, sec_type {0:WEAK, 1:ST
RONG, 2:DES} =<1>, ...)
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 268902 daemon.notice] 45 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/sun/comms/msg
scha/bin/imssvc_start>:tag=<msg-rg.msg-rs.0>: Calling security_clnt_connect(..., host=<ms1>, sec_type {0:WEAK, 1:STRONG, 2:
DES} =<1>, ...)
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_monitor_start> completed successfully for reso
urce <mail>, resource group <msg-rg>, node <ms1>, time used: 0% of timeout <300 seconds>
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource mail state on node ms1 change to R_ONLINE
Sep 26 12:26:53 ms1 Cluster.PMF.pmfd: [ID 887656 daemon.notice] Process: tag="msg-rg,msg-rs,1.svc", cmd="/bin/sh -c /opt/sun/
comms/messaging64/bin/start-msg watcher", Failed to stay up.
Sep 26 12:26:55 ms1 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource msg-rs status on node ms1 change to R_FM_ONLINE
Sep 26 12:26:55 ms1 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource msg-rs status msg on node ms1 change to <Start succe
eded.>
Sep 26 12:26:55 ms1 Cluster.PMF.pmfd: [ID 819736 daemon.notice] PMF is restarting process that died: tag=msg-rg,msg-rs,1.svc,
cmd_path=/bin/sh -c /opt/sun/comms/messaging64/bin/start-msg watcher, max_retries=0, num_retries=0
Sep 26 12:27:25 ms1 SC[SUNW.ims:7.0,msg-rg,msg-rs,ims_svc_start]: [ID 141062 daemon.error] Failed to connect to host 192.168.
0.250 and port 27442: Connection refused.
Sep 26 12:29:55 ms1 last message repeated 6 times
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 764140 daemon.error] Method <ims_svc_start> on resource <msg-rs>, resource group <m
sg-rg>, node <ms1>: Timeout.
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource msg-rs state on node ms1 change to R_START_FAILED
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 529407 daemon.notice] resource group msg-rg state on node ms1 change to RG_PENDING_
OFF_START_FAILED
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource msg-rs status on node ms1 change to R_FM_FAULTED
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource msg-rs state on node ms1 change to R_STOPPING
S