node panic/reboot after enabling samba resource
807567Mar 28 2007 — edited Jun 21 2007Hi there,
I configured the samba data service as shown in the standard doc examples:
http://docs.sun.com/app/docs/doc/819-3063
resp. the example from sun cluster oasis:
http://blogs.sun.com/SC/entry/new_solaris_10_experience
I can even register it with the following settings:
----------------------------------------------------------
/opt/SUNWscsmb/util/samba_config:
#+++ Resource Specific Parameters +++
RS=samba-pdc
RG=samba-rg
RS_LH=samba-lh
RS_HAS=samba-has
RUN_NMBD=YES
SERVICES="smbd,nmbd"
#+++ Common Parameters +++
BINDIR=/opt/csw/bin
SBINDIR=/opt/csw/sbin
CFGDIR=/ZFSsamba/nero
LDPATH=/opt/csw/lib
FMUSER=fmuser
#+++ SMBD & NMBD Specific Parameters (See Note 1) +++
SAMBA_LOGDIR=/ZFSsamba/nero/logs
SAMBA_FMPASS=samba
SAMBA_FMDOMAIN=kfz
TIMEOUT=3000
----------------------------------------------------------
The clresource enable command (clreource enable samba-pdc) doesn't complain either,
but seconds later the node (single node test-cluster) panics and reboots.
here are the corresponding log messages:
----------------------------------------------------------
/var/adm/messages:
Mar 28 15:35:57 nero SC[SUNWscsmb.samba.start]:samba-rg:samba-pdc: [ID 287111 daemon.error] Validate - RUN_NMBD= is invalid - specify YES or NO
Mar 28 15:35:57 nero Cluster.PMF.pmfd: [ID 887656 daemon.notice] Process: tag="samba-rg,samba-pdc,0.svc", cmd="/bin/sh -c /opt/SUNWscsmb/samba/bin/start_samba -R 'samba-p dc' -G 'samba-rg' -X 'smbd nmbd' -B '/opt/csw/bin' -S '/opt/csw/sbin' -C '/ZFSsamba/nero' -L '/ZFSsamba/nero/logs' -U kfz\\\\fmuser%samba -M 'scmondir' -P '/opt/csw/lib' -H nero-lh1 ", Failed to stay up.
...
Mar 28 15:35:57 nero Cluster.PMF.pmfd: [ID 819736 daemon.notice] PMF is restarting process that died: tag=samba-rg,samba-pdc,0.svc, cmd_path=/bin/sh -c /opt/SUNWscsmb/sam ba/bin/start_samba -R 'samba-pdc' -G 'samba-rg' -X 'smbd nmbd' -B '/opt/csw/bin' -S '/opt/csw/sbin' -C '/ZFSsamba/nero' -L '/ZFSsamba/nero/logs' -U kfz\\\\fmuser%samba -M 'scmondir' -P '/opt/csw/lib' -H nero-lh1 , max_retries=0, num_retries=0
...
Mar 28 15:35:57 nero Cluster.PMF.pmfd: [ID 534408 daemon.notice] "samba-rg,samba-pdc,0.svc" restarting too often ... sleeping 1 seconds.
...
Mar 28 15:36:49 nero ^Mpanic[cpu0]/thread=fffffe80000b9c80:
Mar 28 15:36:49 nero genunix: [ID 562397 kern.notice] Failfast: Aborting zone "global" (zone ID 0) because "pmfd" died 35 seconds ago.
/var/cluster/logs/DS/samba-rg/samba-pdc/probe_log.txt:
03/28/2007 15:19:43 nero PROBE-INFO> Probe has been executed with exit code 0 [opt/SUNWscsmb/samba/bin/probe_samba -R 'samba-pdc' -G 'samba-rg' -X 'smbd nmbd' -B '/opt/csw/bin' -S '/opt/csw/sbin' -C '/ZFSsamba/nero' -L '/ZFSsamba/nero/logs' -U kfz\\\\fmuser%samba -M 'scmondir' -P '/opt/csw/lib' -H nero-lh1 ]
/var/cluster/logs/DS/samba-rg/samba-pdc/start_stop_log.txt:
03/28/2007 14:53:29 nero START-INFO> Start succeeded. [opt/SUNWscsmb/samba/bin/start_samba -R 'samba-pdc' -G 'samba-rg' -X 'smbd nmbd' -B '/opt/csw/bin' -S '/opt/csw/sbin' -C '/ZFSsamba/nero' -L '/ZFSsamba/nero/logs' -U kfz\\\\fmuser%samba -M 'scmondir' -P '/opt/csw/lib' -H nero-lh1 ]
03/28/2007 14:53:51 nero STOP-ERROR> Failed to take the resource out of PMF control. Sending SIGKILL now.
03/28/2007 14:53:51 nero STOP-ERROR> Failed to stop the application with SIGKILL. Returning with failure from stop method.
----------------------------------------------------------
As you can see although I set it explicitly to YES in samba_config: Validate - RUN_NMBD= is invalid - specify YES or NO
OS: Solaris 10 11/06
Sun Cluster 3.2
Maybe someone knows how to solve this.
Greetings
Jochem Ippers