Degraded - Service is degraded
Hi,
I'm running a zone cluster setup on Oracle Solaris Cluster 3.3u1 for Solaris 10 sparc.
The resource type in question is SUNW.gds:6. clrs status toggles between the states of online and degraded. Further i see a message in the messages file which says :
Dec 19 10:11:59 zc-node1 SC[,SUNW.gds:6,prod-rg,prod-application-rs,gds_probe]: [ID 335591 daemon.error] Failed to retrieve the resource group proper
ty RG_is_frozen: resource, resource type or resource group has been updated since last scha__open call.
When i traced back the time the messages appeared to be started and check the cluster eventlog with showev4 (Please suggest anything better known). I can see the below
Sat Dec 17 13:09:34 2011
class: EC_Cluster subclass: ESC_cluster_gds_probe
vendor: SUNW pub: gds pid: 2467
cluster_id: cluster: prod node: zc1: prod
ts_sec: 1324087774 ts_usec: 182309
severity: 0 (Info) initiator: 3 (Agent)
r_name: prod-application-rs
rg_name: prod-rg status_msg: Probe has been executed with exit code 1 [opt/app /cluster/scripts/prod-app-monitor.sh]
Sat Dec 17 13:09:34 2011
class: EC_Cluster subclass: ESC_cluster_gds_probe
vendor: SUNW pub: gds pid: 2467
cluster_id: cluster: prod node: zc1:prod
ts_sec: 1324087774 ts_usec: 184683
severity: 0 (Info) initiator: 3 (Agent)
r_name: prod-application-rs
rg_name: prod-rg status_msg: The probe result is 1
Sat Dec 17 13:09:34 2011
class: EC_Cluster subclass: ESC_cluster_fm_r_status_change
vendor: SUNW pub: rgm pid: 2489
cluster_id: 4 cluster: prod node: zc1
ts_sec: 1324087774 ts_usec: 189629
severity: 0 (Info) initiator: 3 (Agent)
r_name: prod-application-rs
rg_name: prod-rg node_name: zc1
old_status: R_FM_ONLINE
new_status: R_FM_DEGRADED
status_msg: Service is degraded.
What i can think is it looks like an issue with the monitor script. But i’d like to know what really could be going on. Application does not go offline. But it marks it as degraded.
Thanks for your time.
Edited by: user13843357 on Dec 18, 2011 6:19 PM