[solved] How to blink faulty disk in Solaris hardware?

Hi Guys,

One of two disks in my solaris machine has failed, the name is disk0, this is SUN physical sparc machine
But I work remotely, so people working near that physical server are not that technical, so from OS command prompt can run some command to bink faulty disk at front panel of Server.

Thanks,

What kind of hardware is it?

1 Like

like bartus11 mentioned, this can vary server to server.

In my experience, on newer Sun.Oracle machines, unconfiguring the disk using cfgadm (if possible) will switch the LED to indicate the drive is 'safe to remove'.

1 Like

Its sun4u Sun Fire V210

But Oracle Engineer said that it is hot swappable.

Moreover do I need to do metadetach and metaclear before removing the old disk? as the disks are mirrored.

I think I can directly remove old disk and insert new disk right?

Please advise, I really want to know this.

Thanks.

Did you just recently become an admin for that server? Maybe it would be best to ask Oracle Engineer to provide you the steps necessary to replace the faulty disk on that server as any mistake could be fatal to the server and applications running there.

1 Like

Agreed... And 'hot-pluggable' is subtly, crucially different from 'can be unplugged without properly warning the server'.

1 Like

If you have a service contract, the caseowner should provide a todo list for the intervention. If there is no action plan, ask the caseowner for help.

Outputs of metastat und format would be helpfull for detailed advice on your problem. If you have explorer installed, you should provide an explorer output to the caseowner.

1 Like

Below there is a link to Help with faulty Disk on Sun OS
Furthermore, I found an old article:

And another article:

Last but not least, Google finds
http://docs.oracle.com/cd/E18752_01/html/816-4520/troubleshoottasks-95.html\#troubleshoottasks-96

1 Like

Hi Guys!

This is what I did to replace failed disk:-

-bash-3.00$ /usr/local/bin/sudo /usr/sbin/cfgadm -la
Ap_Id                          Type         Receptacle   Occupant     Condition
c0                             scsi-bus     connected    configured   unknown
c0::dsk/c0t0d0                 CD-ROM       connected    configured   unknown
c1                             scsi-bus     connected    configured   unknown
c1::dsk/c1t0d0                 disk         connected    configured   unknown
c1::dsk/c1t1d0                 disk         connected    configured   unknown
c2                             scsi-bus     connected    unconfigured unknown
usb0/1                         unknown      empty        unconfigured ok
usb0/2                         unknown      empty        unconfigured ok
-bash-3.00$ 

- cfgadm -c unconfigure c1::dsk/c1t0d0 


2 .Verify that the device is now unconfigured. 


-bash-3.00$ /usr/local/bin/sudo /usr/sbin/cfgadm -la


3. Confirm that the hard drive you want to remove from the server 
is no longer visible to the operating system. 


- /usr/local/bin/sudo  /usr/sbin/format


4. The drive blue LED should now be lit

5. Open front bezel on the server.

6. Slide the catch to the right, remove the drive   <<<<-- REMOVE FAILED DISK

7. Slide the catch to the right, install the new drive. <<<<-- INSERT NEW DISK

8. Push the metal level until the HDD clicks in place.

9. Close the front bezel.

10. Configure the newly installed hard drive e.g.

- cfgadm -la

- cfgadm -c configure c1::dsk/c1t0d0 

I have removed the failed disk and inserted the new disk.

However I am unable to configure the disk getting below error:-

-bash-3.00$ /usr/local/bin/sudo /usr/sbin/cfgadm -c configure c1::dsk/c1t0d0
cfgadm: Hardware specific failure: failed to configure SCSI device: No such device or address
-bash-3.00$

This is what my cfgadm -al output shows:-

-bash-3.00$ /usr/local/bin/sudo /usr/sbin/cfgadm -la
Ap_Id Type Receptacle Occupant Condition
c0 scsi-bus connected configured unknown
c0::dsk/c0t0d0 CD-ROM connected configured unknown
c1 scsi-bus connected configured unknown
c1::dsk/c1t0d0 disk connected unconfigured unknown
c1::dsk/c1t1d0 disk connected configured unknown
c2 scsi-bus connected unconfigured unknown
usb0/1 unknown empty unconfigured ok
usb0/2 unknown empty unconfigured ok
-bash-3.00$ 

This is what my format command showing:-

-bash-3.00$ /usr/local/bin/sudo  /usr/sbin/format
Password:
Searching for disks...done


AVAILABLE DISK SELECTIONS:
       0. c1t1d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>  ROOTMIRR
          /pci@1c,600000/scsi@2/sd@1,0
Specify disk (enter its number):

is it safe to run this command on Server -->>>>

devfsadm �C �c disks

will it make my new inserted disk visible on system?

Please advise.

Thanks a lot.

This is solved.

Thanks a lot guys.