Failure to boot v445

Hi Guys,

I have a small problem with a v445 which I have been informed will only boot with the reconfigure option enabled.

It is attached to HP SAN storage using qla2300 FCA's with a Veritas encapsulated rootvoldg (No Laughing here please) when I try a reboot I get the following error.

Rebooting with command: boot -r
Boot device: /pci@1e,600000/pci@0/pci@2/scsi@0/disk@0,0:a  File and args: -r
SunOS Release 5.10 Version Generic_120011-14 64-bit
Copyright 1983-2007 Sun Microsystems, Inc.  All rights reserved.
Use is subject to license terms.
Hardware watchdog enabled
Hostname: ss071a
QLogic Fibre Channel Driver v4.20 Instance: 0
hba0: QLogic QLA2300 Fibre Channel Host Adapter fcode version 2.00.05 01/29/03
hba0: Firmware v3.3.12 (ipx)
QLogic Fibre Channel Driver v4.20 Instance: 2
hba2: QLogic QLA2300 Fibre Channel Host Adapter fcode version 2.00.05 01/29/03
hba2: Firmware v3.3.12 (ipx)
NOTICE: VxVM vxdmp V-5-0-34 added disk array DISKS, datype = Disk

NOTICE: VxVM vxdmp V-5-3-1700 dmpnode 280/0x0 has migrated from enclosure FAKE_ENCLR_SNO to enclosure DISKS

ERROR: svc:/system/filesystem/usr:default failed to mount /  (see 'svcs -x' for details)
Mar 25 18:15:36 svc.startd[7]: svc:/system/filesystem/usr:default: Method "/lib/svc/method/fs-usr" failed with exit status 95.
Mar 25 18:15:36 svc.startd[7]: system/filesystem/usr:default failed fatally: transitioned to maintenance (see 'svcs -xv' for details)
Requesting System Maintenance Mode
(See /lib/svc/share/README for more information.)
Console login service(s) cannot run

Root password for system maintenance (control-d to bypass):

I've been told that the IBM vpath drivers were installed in anticipation of the server being moved onto the new SAN, but there was no reboot carried out.

I have manually mounted the root plexes and removed the vpath drivers from /kernel/drv/sparcv9 but the damn thing won't boot.

Regards

Dave

the usr filesystem can not be mounted... so i assume there is something wrong with the slice/plex/volume (whatever) or the vfstab entry...

Hi DukeNuke2,

I thought that as well at the begining of this, but the muppet that built this system didn't build a separate /usr file system it's all under the root fs.

The /etc/vfstab file is valid (I think, but messy) as far as I can see, just to let you know I have been called out on this system - the last time I worked on it was about 10 years ago when it was on an E10K.

The /etc/vfstab file looks like;

ss071a-root>cat vfstab
#device         device          mount           FS      fsck    mount   mount
#to mount       to fsck         point           type    pass    at boot options
#
fd      -       /dev/fd fd      -       no      -
/proc   -       /proc   proc    -       no      -
/dev/vx/dsk/bootdg/swapvol      -       -       swap    -       no      nologging
/dev/vx/dsk/bootdg/rootvol      /dev/vx/rdsk/bootdg/rootvol     /       ufs     1       no      nologging
/dev/vx/dsk/bootdg/var  /dev/vx/rdsk/bootdg/var /var    ufs     1       no      -
/dev/vx/dsk/bootdg/opt  /dev/vx/rdsk/bootdg/opt /opt    ufs     2       yes     -
/dev/vx/dsk/bootdg/local        /dev/vx/rdsk/bootdg/local       /usr/local      ufs     2       yes     -
/dev/vx/dsk/bootdg/oraagent     /dev/vx/rdsk/bootdg/oraagent    /oracle/AGENT   ufs     2       yes     -
/devices        -       /devices        devfs   -       no      -
ctfs    -       /system/contract        ctfs    -       no      -
objfs   -       /system/object  objfs   -       no      -
swap    -       /tmp    tmpfs   -       yes     -
#NOTE: volume rootvol (/) encapsulated partition c0t0d0s0
#NOTE: volume swapvol (swap) encapsulated partition c0t0d0s1
#NOTE: volume local (/usr/local) encapsulated partition c0t0d0s5
#NOTE: volume opt (/opt) encapsulated partition c0t0d0s6
#NOTE: volume var (/var) encapsulated partition c0t0d0s7
## /cis/nbsp removed as not required. Awaiting confirmation from John Hibbs/Dave Dunn
## IM5740366
#/dev/vx/dsk/bootdg/nbsptmp     /dev/vx/rdsk/bootdg/nbsptmp     /cis/nbsp       ufs     2       yes     -
/dev/vx/dsk/nbs1dg/cis  /dev/vx/rdsk/nbs1dg/cis /cis    vxfs    2       yes     -
/dev/vx/dsk/nbs1dg/sag  dev/vx/rdsk/nbs1dg/sag  /sag    vxfs    2       yes     -
/dev/vx/dsk/nbs1dg/nbsp /dev/vx/rdsk/nbs1dg/nbsp        /cis/nbsl       vxfs    2       yes     -
/dev/vx/dsk/nbs1dg/u11 /dev/vx/rdsk/nbs1dg/u11 /u11 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/u12 /dev/vx/rdsk/nbs1dg/u12 /u12 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/u13 /dev/vx/rdsk/nbs1dg/u13 /u13 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/u14 /dev/vx/rdsk/nbs1dg/u14 /u14 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/u15 /dev/vx/rdsk/nbs1dg/u15 /u15 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/u16 /dev/vx/rdsk/nbs1dg/u16 /u16 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/u17 /dev/vx/rdsk/nbs1dg/u17 /u17 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/u18 /dev/vx/rdsk/nbs1dg/u18 /u18 vxfs        2       yes     -
/dev/vx/dsk/nbs1dg/ehome /dev/vx/rdsk/nbs1dg/ehome /home vxfs   2       yes     -
/dev/vx/dsk/nbs1dg/projberyl /dev/vx/rdsk/nbs1dg/projberyl /proj/beryl vxfs     2       yes     -
##-----------------------------------------------
#/dev/vx/dsk/nbsappsdg/berylapps /dev/vx/rdsk/nbsappsdg/berylapps /proj/beryl vxfs      2       yes     -
#/dev/vx/dsk/nbsappsdg/egateapps /dev/vx/rdsk/nbsappsdg/egateapps /egate vxfs   2       yes     -
##/dev/vx/dsk/nbsexportdg/nbsexport /dev/vx/rdsk/nbsexportdg/nbsexport /nbsexport vxfs 2 yes suid
##/dev/vx/dsk/nbstempdg/nbstemp /dev/vx/rdsk/nbstempdg/nbstemp /nbstemp vxfs 2 yes suid

So as you'll be able to see there isn't a separate mount for the /usr file system.

Regards

Dave

is the content of /usr present? and is there a /usr/local directory present? can you login in single user mode? if yes what is the output of svcs -xv and what is in the logfile for the error?

Hi DukeNuke2,

This is quite interesting, it would seem that the pkgadd for the vpath IBM drivers has an interesting side effect. It over writes the /etc/system file (note over write and not append) with;

forceload: drv/vpathdd

All the stuff that was in there before is just trashed, recover the old file from tape and edit is the answer at the moment. This just goes to show that making changes to a live system and not going for a reboot, is just asking for problems!

Regards

Dave

This isn't unlike a question I answered the other day as far as the actual boot error is concerned. I advised to recreate the filesystem device paths.

The thread is here:-

Take a backup first. Perhaps the attachment of the SAN changes the device paths. The error that you posted seems to indicate it is having a problem with root!!!