Skip to Main Content

Hardware

Announcement

For appeals, questions and feedback about Oracle Forums, please email oracle-forums-moderators_us@oracle.com. Technical questions should be asked in the appropriate category. Thank you!

scsi hdd read/write errors, how serious are they?

807557Feb 3 2008 — edited Mar 17 2008
Hi,

I've got a Sun Netra 210 (Sol9) with the user complaining system just hangs & had to be hardbooted. Its a preliminary stage and I only support the application running on the server - not the OS or Hardware. Nevertheless I've to troubleshoot the basics-

These are the results
I'm faced with the following errors-
Jan  3 17:40:15 a0635TI40CF 
 scsi: [ID 107833 kern.warning] WARNING: /pci@1c,600000/LSILogic,sas@1/sd@0,0 (sd31):
 	Error for Command: read(10)                Error Level: Fatal
 scsi: [ID 107833 kern.notice] 	Requested Block: 17461052                  Error Block: 17461098
 scsi: [ID 107833 kern.notice] 	Vendor: SEAGATE                            Serial Number: 060510E9D5  
 scsi: [ID 107833 kern.notice] 	Sense Key: Aborted Command
 scsi: [ID 107833 kern.notice] 	ASC: 0x4b (<vendor unique code 0x4b>), ASCQ: 0x4, FRU: 0x0
 scsi: [ID 107833 kern.warning] WARNING: /pci@1c,600000/LSILogic,sas@1/sd@0,0 (sd31):
 	Error for Command: read(10)                Error Level: Retryable
 scsi: [ID 107833 kern.notice] 	Requested Block: 17461036                  Error Block: 17461039
 scsi: [ID 107833 kern.notice] 	Vendor: SEAGATE                            Serial Number: 060510E9D5  
 scsi: [ID 107833 kern.notice] 	Sense Key: Aborted Command
 scsi: [ID 107833 kern.notice] 	ASC: 0x4b (<vendor unique code 0x4b>), ASCQ: 0x4, FRU: 0x0
 scsi: [ID 365881 kern.info] /pci@1c,600000/LSILogic,sas@1 (mpt0):
-----
scsi: [ID 107833 kern.warning] WARNING: /pci@1c,600000/LSILogic,sas@1/sd@0,0 (sd31):
	Error for Command: read                    Error Level: Retryable
scsi: [ID 107833 kern.notice] 	Requested Block: 250544                    Error Block: 250559
scsi: [ID 107833 kern.notice] 	Vendor: SEAGATE                            Serial Number: 060510E9D5  
scsi: [ID 107833 kern.notice] 	Sense Key: Aborted Command
scsi: [ID 107833 kern.notice] 	ASC: 0x4b (<vendor unique code 0x4b>), ASCQ: 0x4, FRU: 0x0
scsi: [ID 365881 kern.info] /pci@1c,600000/LSILogic,sas@1 (mpt0):
	Log info 31120300 received for target 0.
	scsi_status=0, ioc_status=804b, scsi_state=c
scsi: [ID 365881 kern.info] /pci@1c,600000/LSILogic,sas@1 (mpt0):
	Log info 31120300 received for target 0.
	scsi_status=0, ioc_status=804b, scsi_state=c
genunix: [ID 672855 kern.notice] syncing file systems...
genunix: [ID 904073 kern.notice]  done
First off I did an fsck after booting from cd, fixed couple of file incosistencies still on boot the fsck shows the followin output-
/dev/dsk/c1t0d0s0 IS CURRENTLY MOUNTED READ/WRITE.
CONTINUE? 
** /dev/dsk/c1t0d0s0
** Currently Mounted on /
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3a - Check Connectivity
** Phase 3b - Verify Shadows/ACLs
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cylinder Groups
FILESYSTEM MAY STILL BE INCONSISTENT.
48752 files, 1361458 used, 2768268 free (75132 frags, 336642 blocks, 1.8% fragmentation)
***** FILE SYSTEM IS BAD *****

***** PLEASE RERUN FSCK *****
/dev/dsk/c1t0d0s3 IS CURRENTLY MOUNTED READ/WRITE.
CONTINUE? 
** /dev/dsk/c1t0d0s3
** Currently Mounted on /cms
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3a - Check Connectivity
** Phase 3b - Verify Shadows/ACLs
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cylinder Groups
FILESYSTEM MAY STILL BE INCONSISTENT.
1889 files, 290360 used, 2808391 free (103 frags, 351036 blocks, 0.0% fragmentation)
***** FILE SYSTEM IS BAD *****

***** PLEASE RERUN FSCK *****
iostat -En doesnt show any errors-
a0635TI40CF# iostat -En
c0t0d0          Soft Errors: 1 Hard Errors: 0 Transport Errors: 0
Vendor: TSSTcorp Product: CD/DVDW TS-L532U Revision: SI02 Serial No:
Size: 0.00GB <0 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 1 Predictive Failure Analysis: 0
c1t0d0          Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: SEAGATE  Product: ST973401LSUN72G  Revision: 0556 Serial No: 060510E9D5
Size: 73.40GB <73400057856 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
The application running on the server does write a lot of data into the HDD. Another thing I noticed was that the system date had also reset to 01.01.2000 ... which in itself will create lot of problem.Should I consider the SCSI issues serious enough to replace the Hard Disk?

Edited by: rookiebot on Feb 3, 2008 3:19 AM

Edited by: rookiebot on Feb 3, 2008 3:34 AM
Comments
Locked Post
New comments cannot be posted to this locked post.
Post Details
Locked on Apr 14 2008
Added on Feb 3 2008
2 comments
1,585 views