Skip to Main Content

Infrastructure Software

Announcement

For appeals, questions and feedback about Oracle Forums, please email oracle-forums-moderators_us@oracle.com. Technical questions should be asked in the appropriate category. Thank you!

getting started with SCAT - Solaris Crash Analysis Tool

807557Aug 28 2008
I recently installed an Areca 1260 PCI-E HBA in a Supermicro x86-based server running Solaris 10, and it failed within 18 hours. Solaris crashed, and the machine wouldn't POST with the card installed.

The card had 3 1TB drives attached, arranged in a zpool as a raidz device.

I think it's probably a faulty card (remote server, haven't been near enough to troubleshoot), but I could use some help poking around with SCAT to make sure. Here's where I'm at:
# /opt/SUNWscat/bin/scat 0

  Solaris[TM] CAT 5.0 for Solaris 10 64-bit x86
    SV4622M, Jul  3 2008

  Copyright ? 2008 Sun Microsystems, Inc. All rights reserved.
  Use is subject to license terms.

  Feedback regarding the tool should be sent to SolarisCAT_Feedback@Sun.COM
  Visit the Solaris CAT blog at http://blogs.sun.com/SolarisCAT

opening unix.0 vmcore.0 ...dumphdr...symtab...core...done
loading core data: modules...symbols...CTF...done

core file:      /var/crash/fixationfx/vmcore.0
user:           Super-User (root:0)
release:        5.10 (64-bit)
version:        Generic_120012-14
machine:        i86pc
node name:      fixationfx
domain:         fx.mydomain.foo
system type:    i86pc
hostid:         4c56b3f0
dump_conflags:  0x10000 (DUMP_KERNEL) on /dev/dsk/c1d0s1(517M)
time of crash:  Sat Aug 23 02:40:27 PDT 2008
age of system:  18 hours 28 minutes 0.40 seconds
panic CPU:      0 (2 CPUs, 7.99G memory)
panic string:   ZFS: I/O failure (write on <unknown> off 0: zio ffffffffa1eae5c0 [L0 DMU dnode] 4000L/400P DVA[0]=<0:112bf9f1000:800> DVA[1]=<0:19400673000:800> fletcher4 lzjb LE contiguous birth=12835 fill=4 cksum=

sanity checks: settings...vmem...sysent...clock...misc...
WARNING: 3 severe kstat errors (run "kstat xck")
done
SolarisCAT(vmcore.0/10X)> kstat xck
loaded 35 kstat checks from /opt/SUNWscat/lib/kstat_xck
sd16,err:Transport Errors>0(89):device had transport errors
sd17,err:Transport Errors>0(89):device had transport errors
sd18,err:Transport Errors>0(101):device had transport errors
SolarisCAT(vmcore.0/10X)> 
It looks like zfs paniced and failed, probably because the hardware failed. But I'd like to dig deeper. What else can I do with SCAT to investigate this situation?

Many thanks
Comments
Locked Post
New comments cannot be posted to this locked post.
Post Details
Locked on Sep 25 2008
Added on Aug 28 2008
0 comments
427 views