E4500 Disk board issues

Hi there,

I'm building a couple of E4500's both with the same spec and seem to be having an issue with the disk boards on each.

I�m running the following config

E4500 6 x 400MHz, 6GB, 2 x X2612A with cards, 1 x disk board with 2 x 9.1GB 10K Disk

I have configured the machine as follows

Front
Slot (0) 1 x X2602A with 2 x 400Mhz & 2GB
Slot (2) 1 x X2602A with 2 x 400Mhz & 2GB
Slot (4) 1 x X2602A with 2 x 400Mhz & 2GB

Back
Slot (1) 1 x X2612A with 1 x X1065A & 2 x X1049A
Slot (3) 1 x X2612A with 2 x LP9002S-E
Slot (5) 1 x X5163A with 2 x X5235A

The disk board is terminated with a SCSI SE terminator on the out port and the in port is connected via a 68Pin SCSI to the onboard SCSI connector on the second X2612A in Slot (3).

The machine seems to run diags etc all fine and sees both the disk and the DVD in probe-scsi-all. I was able to install Solaris 10 Sparc edition 10/09 without a problem and ran prtdiag etc all seemed ok as far as I could tell.

However when I type boot disk the machine is not able to boot from the installed disk it keeps defaulting to

/sbus@3,0/SUNW,fas@3,8800000/sd@0,0 

which is the incorrect address for the boot disk.

I did the following

{0} ok

probe-scsi-all

/sbus@7,0/lpfs@0,0

/sbus@7,0/SUNW,fas@3,8800000

Target c
Unit 0 Disk SEAGATE ST39204LCSUN9.0G42070038V0YNQW

Target d
Unit 0 Disk SEAGATE ST39204LCSUN9.0G42030038V0YY19

                /sbus@6,0/lpfs@1,0

/sbus@3,0/QLGC,isp@0,10000
invalid command

/sbus@3,0/SUNW,fas@3,8800000

Target 6
Unit 0 Removable Read Only device TOSHIBA DVD-ROM SD-M1401100912/20/00


{0} ok

devalias

disk /sbus@3,0/SUNW,fas@3,8800000/sd@0,0
diskbrd /sbus@3,0/SUNW,fas@3,8800000/sd@a,0
diskisp /sbus@3,0/QLGC,isp@0,10000/sd@0,0
net /sbus@3,0/SUNW,hme@3,8c00000
cdrom /sbus@3,0/SUNW,fas@3,8800000/sd@6,0:f
tape /sbus@3,0/SUNW,fas@3,8800000/st@4,0
scsi /sbus@3,0/SUNW,fas@3,8800000
disk0 /sbus@3,0/SUNW,fas@3,8800000/sd@0,0
disk1 /sbus@3,0/SUNW,fas@3,8800000/sd@1,0
disk2 /sbus@3,0/SUNW,fas@3,8800000/sd@2,0
disk3 /sbus@3,0/SUNW,fas@3,8800000/sd@3,0
disk4 /sbus@3,0/SUNW,fas@3,8800000/sd@4,0
disk5 /sbus@3,0/SUNW,fas@3,8800000/sd@5,0
tape0 /sbus@3,0/SUNW,fas@3,8800000/st@4,0
tape1 /sbus@3,0/SUNW,fas@3,8800000/st@5,0
ttya /central/fhc/zs@0,902000:a
ttyb /central/fhc/zs@0,902000:b
keyboard /central/fhc/zs@0,904000
keyboard! /central/fhc/zs@0,904000:forcemode

To get around this I created a second alias �disk� and tried to remove the existing one

 
nvalias disk /sbus@7,0/SUNW,fas@3,8800000/sd@c,0
 
nvunalias disk /sbus@3,0/SUNW,fas@3,8800000/sd@0,0
 
setenv boot-device disk
 
reset

Upon reset the machines failed to remove the original disk, however now it has two alias�s which are �disk� the one I have added is in front of the old one and now if I type � boot disk� it boots the OS


{0} ok

 devalias

disk /sbus@7,0/SUNW,fas@3,8800000/sd@c,0
disk /sbus@3,0/SUNW,fas@3,8800000/sd@0,0
diskbrd /sbus@3,0/SUNW,fas@3,8800000/sd@a,0
diskisp /sbus@3,0/QLGC,isp@0,10000/sd@0,0
net /sbus@3,0/SUNW,hme@3,8c00000
cdrom /sbus@3,0/SUNW,fas@3,8800000/sd@6,0:f
tape /sbus@3,0/SUNW,fas@3,8800000/st@4,0
scsi /sbus@3,0/SUNW,fas@3,8800000
disk0 /sbus@3,0/SUNW,fas@3,8800000/sd@0,0
disk1 /sbus@3,0/SUNW,fas@3,8800000/sd@1,0
disk2 /sbus@3,0/SUNW,fas@3,8800000/sd@2,0
disk3 /sbus@3,0/SUNW,fas@3,8800000/sd@3,0
disk4 /sbus@3,0/SUNW,fas@3,8800000/sd@4,0
disk5 /sbus@3,0/SUNW,fas@3,8800000/sd@5,0
tape0 /sbus@3,0/SUNW,fas@3,8800000/st@4,0
tape1 /sbus@3,0/SUNW,fas@3,8800000/st@5,0
ttya /central/fhc/zs@0,902000:a
ttyb /central/fhc/zs@0,902000:b
keyboard /central/fhc/zs@0,904000
keyboard! /central/fhc/zs@0,904000:forcemode


 
show-post-results

Slot 0 - Status=Okay, Type: CPU/Memory

Cpu0=P Cpu0-OK=P FailCode=0 Cpu1=P Cpu1-OK=P FailCode=0
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
Bank0=0 Bank1=0 DTag0=P DTag1=P JTAG=P CntrPl=P
Bank0=P Bank1=P DC=ff

Slot 1 - Status=Okay, Type: I/O Type 4

Sysio0=P Sysio1=P FEPS=P FEPSFC=0 SOC=P
Sbus0=P Sbus1=P Sbus2=P
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
TODC=P JTAG=P CntrPl=P DC=ff

Slot 2 - Status=Okay, Type: CPU/Memory

Cpu0=P Cpu0-OK=P FailCode=0 Cpu1=P Cpu1-OK=P FailCode=0
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
Bank0=0 Bank1=0 DTag0=P DTag1=P JTAG=P CntrPl=P
Bank0=P Bank1=P DC=ff

Slot 3 - Status=Okay, Type: I/O Type 4

Sysio0=P Sysio1=P FEPS=P FEPSFC=0 SOC=P
Sbus0=P Sbus1=P Sbus2=P
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
TODC=P JTAG=P CntrPl=P DC=ff

Slot 4 - Status=Okay, Type: CPU/Memory

Cpu0=P Cpu0-OK=P FailCode=0 Cpu1=P Cpu1-OK=P FailCode=0
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
Bank0=0 Bank1=0 DTag0=P DTag1=P JTAG=P CntrPl=P
Bank0=P Bank1=P DC=ff

Slot 5 - Status=Okay, Type: Disk

Disk0=c Disk1=d VDDOK=P Fan=P JTAG=P

Slot 16 - Status=Okay, Type: Clock

Clock=P Serial=P KbdMse=P PPS-DC=P DCReg0=P DCReg1=P
AC=P ACFan=P KeyFan=P PSFail=0 Ovtemp=Not TODC=P
V5-P=P V12-P=P V5-Aux=P V5P-PC=P V12-PC=P V3-PC=P V5-PC=P
RckFan=P 3.3V=P 5.0V=P Triger=P Coolng=P AC-REV=P

P = Present or Passed
*** = Failed Component
Not = Not present

#

prtdiag �v

System Configuration: Sun Microsystems sun4u 8-slot Sun Enterprise E4500/E5500
System clock frequency: 100 MHz
Memory size: 6144Mb

========================= CPUs =========================

                Run   Ecache   CPU    CPU

Brd CPU Module MHz MB Impl. Mask
--- --- ------- ----- ------ ------ ----
0 0 0 400 8.0 US-II 10.0
0 1 1 400 8.0 US-II 10.0
2 4 0 400 8.0 US-II 10.0
2 5 1 400 8.0 US-II 10.0
4 8 0 400 8.0 US-II 10.0
4 9 1 400 8.0 US-II 10.0

========================= Memory =========================

                                          Intrlv.  Intrlv.

Brd Bank MB Status Condition Speed Factor With
--- ----- ---- ------- ---------- ----- ------- -------
0 0 1024 Active OK 60ns 4-way A
0 1 1024 Active OK 60ns 4-way A
2 0 1024 Active OK 60ns 4-way A
2 1 1024 Active OK 60ns 2-way B
4 0 1024 Active OK 60ns 4-way A
4 1 1024 Active OK 60ns 2-way B

========================= IO Cards =========================

 Bus   Freq

Brd Type MHz Slot Name Model
--- ---- ---- ---------- ---------------------------- --------------------
1 SBus 25 0 QLGC,isp/sd (block) QLGC,ISP1000U
1 SBus 25 1 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 1 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 1 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 1 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 3 SUNW,hme
1 SBus 25 3 SUNW,fas/sd (block)
1 SBus 25 13 SUNW,socal/sf (scsi-3) 501-3060
3 SBus 25 0 lpfs/sd (block) LP9002S
3 SBus 25 1 lpfs/sd (block) LP9002S
3 SBus 25 3 SUNW,hme
3 SBus 25 3 SUNW,fas/sd (block)
3 SBus 25 13 SUNW,socal/sf (scsi-3) 501-3060

Detached Boards

Slot State Type Info
---- --------- ------ -----------------------------------------
5 disabled disk Disk 0: Target: 12 Disk 1: Target: 13

No failures found in System

Detected System Faults

PROM detected failure
Detected Mon May 23 11:15:05 2011

========================= Environmental Status =========================
Keyswitch position is in Normal Mode
System Power Status: Redundant
System LED Status: GREEN YELLOW GREEN
WARNING ON ON BLINKING

Fans:
-----
Unit Status
---- ------
Rack OK
Key OK
AC OK

System Temperatures (Celsius):
------------------------------
Brd State Current Min Max Trend
--- ------- ------- --- --- -----
0 OK 31 30 32 stable
1 OK 38 37 38 stable
2 OK 33 31 34 stable
3 OK 43 42 43 stable
4 OK 31 28 32 stable
CLK OK 29 29 29 stable

Power Supplies:
---------------
Supply Status
--------- ------
0 OK
1 OK
2 OK
3 OK
PPS OK
System 3.3v OK
System 5.0v OK
Peripheral 5.0v OK
Peripheral 12v OK
Auxiliary 5.0v OK
Peripheral 5.0v precharge OK
Peripheral 12v precharge OK
System 3.3v precharge OK
System 5.0v precharge OK
AC Power OK

========================= HW Revisions =========================

ASIC Revisions:
---------------
Brd FHC AC SBus0 SBus1 PCI0 PCI1 FEPS Board Type Attributes
--- --- -- ----- ----- ---- ---- ---- ---------- ----------
0 1 5 CPU 100MHz Capable
1 1 5 1 1 22 Dual-SBus-SOC+ 100MHz Capable
2 1 5 CPU 100MHz Capable
3 1 5 1 1 22 Dual-SBus-SOC+ 100MHz Capable
4 1 5 CPU 100MHz Capable
5 Unknown 100MHz Capable

System Board PROM revisions:
----------------------------
Board 0: OBP 3.2.30 2002/10/25 14:03 POST 3.9.30 2002/10/25 14:04
Board 1: FCODE 1.8.30 2002/10/25 14:02 iPOST 3.4.30 2002/10/25 14:03
Board 2: OBP 3.2.30 2002/10/25 14:03 POST 3.9.30 2002/10/25 14:04
Board 3: FCODE 1.8.30 2002/10/25 14:02 iPOST 3.4.30 2002/10/25 14:03
Board 4: OBP 3.2.30 2002/10/25 14:03 POST 3.9.30 2002/10/25 14:04


The machines both seem to be fine now as long as I don�t set-defaults which change�s the boot disk back to the old one.

The only bit that looks a strange to me is that the disk boards on both machines are labelled as disabled in prtdiag -v and both turn on their amber lights while booting into the OS and the amber light remains on while in the OS, although they both boot fine.

I seem to be runnig the latest OBP available for these machines. I thought it maybe because I was using 10K disks and the board were not a high enough rev to run them, but when I re-installed on some 7.2K RPM disks they behaved the same. I have also run 3 different cables from the disk board to the X2612A and all do the same. The machines apart from that seem fine, but I don't know if the amber light is normal or if the is an underlying problem

Any advice is much appreciated