Resource start timeout always at 60 seconds
Hello all,
I'm using clusterware for a standard database cluster. As my storage is NetApp I want to use snapdrive software to load dynamically LUN (3 per database). I've a script correctly working by hand. Time needed for LUNs to be attached is variable between 80 and 120 seconds. Resource is called database.ofa.
crs_profile -print database.ofa
NAME=database.ofa
TYPE=application
ACTION_SCRIPT=/app/oracle/product/crs/crs/public/act_snapdrive.pl
ACTIVE_PLACEMENT=0
AUTO_START=restore
CHECK_INTERVAL=450
DESCRIPTION=mantprd.ofa
FAILOVER_DELAY=0
FAILURE_INTERVAL=0
FAILURE_THRESHOLD=0
HOSTING_MEMBERS=
OPTIONAL_RESOURCES=
PLACEMENT=balanced
REQUIRED_RESOURCES=database
RESTART_ATTEMPTS=1
SCRIPT_TIMEOUT=450
START_TIMEOUT=0
STOP_TIMEOUT=0
UPTIME_THRESHOLD=7d
USR_ORA_ALERT_NAME=
USR_ORA_CHECK_TIMEOUT=0
USR_ORA_CONNECT_STR=/ as sysdba
USR_ORA_DEBUG=0
USR_ORA_DISCONNECT=false
USR_ORA_FLAGS=
USR_ORA_IF=
USR_ORA_INST_NOT_SHUTDOWN=
USR_ORA_LANG=
USR_ORA_NETMASK=
USR_ORA_OPEN_MODE=
USR_ORA_OPI=false
USR_ORA_PFILE=
USR_ORA_PRECONNECT=none
USR_ORA_SRV=mantprd
USR_ORA_START_TIMEOUT=0
USR_ORA_STOP_MODE=immediate
USR_ORA_STOP_TIMEOUT=0
USR_ORA_VIP=
Starting manually resource will fail. crsd.log reports:
2009-03-24 16:01:23.538: [ CRSRES][1510144352]0Attempting to start `database.ofa` on member `server01`
2009-03-24 16:02:26.537: [ CRSEVT][1510144352]0CAAMonitorHandler :: 0:Could not join /app/oracle/product/crs/crs/public/act_snapdrive.pl(start)
category: 1234, operation: scls_process_join, loc: childcrash, OS error: 0, other: Abnormal termination of the child
2009-03-24 16:02:26.537: [ CRSEVT][1510144352]0CAAMonitorHandler :: 0:Action Script /app/oracle/product/crs/crs/public/act_snapdrive.pl(start) timed out for database.ofa! (timeout=60)
2009-03-24 16:02:26.537: [ CRSAPP][1510144352]0StartResource error for database.ofa error code = -2
2009-03-24 16:03:29.559: [ CRSEVT][1510144352]0CAAMonitorHandler :: 0:Could not join /app/oracle/product/crs/crs/public/act_snapdrive.pl(stop)
category: 1234, operation: scls_process_join, loc: childcrash, OS error: 0, other: Abnormal termination of the child
2009-03-24 16:03:29.559: [ CRSEVT][1510144352]0CAAMonitorHandler :: 0:Action Script /app/oracle/product/crs/crs/public/act_snapdrive.pl(stop) timed out for database.ofa! (timeout=60)
2009-03-24 16:03:29.559: [ CRSAPP][1510144352]0StopResource error for database.ofa error code = -2
2009-03-24 16:03:29.572: [ CRSRES][1510144352]0X_OP_StopResourceFailed : Stop Resource failed
(File: rti.cpp, line: 1803
2009-03-24 16:03:29.573: [ CRSRES][1510144352][ALERT]0`database.ofa` on member `server01` has experienced an unrecoverable failure.
2009-03-24 16:03:29.573: [ CRSRES][1510144352]0Human intervention required to resume its availability.
Seems that timeout reported is always 60seconds I've manually changed for all resources options st=450 as shown above but always crs fails at 60 seconds.
Any hints to debug?
Thanks
Stefano