Hi all,
I got the following error message from solaris 10 system.
Security Violations
=-=-=-=-=-=-=-=-=-=
Jun 1 01:48:21 unix009 scsi: [ID 107833 kern.notice] ASC: 0x44 (internal target failure), ASCQ: 0x0, FRU: 0xbc
The following are from /var/adm/messages
Jun 1 01:48:21 unix009 scsi: [ID 107833 kern.warning] WARNING: /pci@1c,600000/scsi@2/sd@0,0 (sd3):
Jun 1 01:48:21 unix009 Error for Command: write(10) Error Level: Retryable
Jun 1 01:48:21 unix009 scsi: [ID 107833 kern.notice] Requested Block: 25502736
Error Block: 25502736
Jun 1 01:48:21 unix009 scsi: [ID 107833 kern.notice] Vendor: SEAGATE
Serial Number: 0534329T1T
Jun 1 01:48:21 unix009 scsi: [ID 107833 kern.notice] Sense Key: Hardware Error
Jun 1 01:48:21 unix009 scsi: [ID 107833 kern.notice] ASC: 0x44 (internal target failure), ASC
Q: 0x0, FRU: 0xbc
This looks like a disk error message ?. What exactly it means.
What should be done to take care of this. Please advice me
Thanks in advance!
You have a bad disk block. Disks have reserve blocks to move bad block data into.
If you still have errors on the same disk, the disk could not 'heal' itself. It is time to back up the disks data and then replace the hardware, then restore data to the new disk.
One-off occurrences are not usually an issue.
If a disk throws these errors regularly it is going bad, and needs to be replaced as well.
1 Like
beta17
June 14, 2011, 3:36am
3
show us an output from iostat:
iostat -E|grep -i error
Hi beta17,
below is the output.
c0t1d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: SEAGATE Product: ST373207LSUN72G Revision: 045A Serial No: 0534329T1K
Size: 73.40GB <73400057856 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
c0t0d0 Soft Errors: 0 Hard Errors: 1 Transport Errors: 0
Vendor: SEAGATE Product: ST373207LSUN72G Revision: 045A Serial No: 0534329T1T
Size: 73.40GB <73400057856 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
beta17
June 15, 2011, 2:40am
5
as you can see your disk has only one Hard Error, not critical but i recommend you to backup the date and plan to replace the disk!
Hi beta17,
Could you tell me how do i back it up and restore to new hard disk.
Thanks in advance!
Your disk c0t1d0 is having Hard error. Better provide the output of the command format.Also shutdown the server and do a dummy replacement.
Hi solarishumble,
here is the format output.
unix009# format
Searching for disks...done
AVAILABLE DISK SELECTIONS:
0. c0t0d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
/pci@1c,600000/scsi@2/sd@0,0
1. c0t1d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
/pci@1c,600000/scsi@2/sd@1,0
Specify disk (enter its number): 0
selecting c0t0d0
[disk formatted]
/dev/dsk/c0t0d0s0 is part of SVM volume stripe:d10. Please see metaclear(1M).
/dev/dsk/c0t0d0s1 is part of SVM volume stripe:d11. Please see metaclear(1M).
/dev/dsk/c0t0d0s3 contains an SVM mdb. Please see metadb(1M).
/dev/dsk/c0t0d0s4 is part of SVM volume stripe:d14. Please see metaclear(1M).
/dev/dsk/c0t0d0s5 contains an SVM mdb. Please see metadb(1M).
/dev/dsk/c0t0d0s6 contains an SVM mdb. Please see metadb(1M).
/dev/dsk/c0t0d0s7 contains an SVM mdb. Please see metadb(1M).
partition> p
Current partition table (original):
Total disk cylinders available: 14087 + 2 (reserved cylinders)
Part Tag Flag Cylinders Size Blocks
0 root wm 1649 - 12417 52.25GB (10769/0/0) 109585344
1 swap wu 0 - 1648 8.00GB (1649/0/0) 16780224
2 backup wm 0 - 14086 68.35GB (14087/0/0) 143349312
3 unassigned wm 12418 - 12422 24.84MB (5/0/0) 50880
4 var wm 12423 - 14071 8.00GB (1649/0/0) 16780224
5 unassigned wm 14072 - 14076 24.84MB (5/0/0) 50880
6 unassigned wm 14077 - 14081 24.84MB (5/0/0) 50880
7 unassigned wm 14082 - 14086 24.84MB (5/0/0) 50880