Enterprise 3500 trouble

Hello,
The power to the SCSI side and the internal FC-AL disk connectors is gone. When I turn the key, the fans under my disks spin for just 2 seconds and then die. On board 1 all three LEDs are on; on the power supply board (the one with the 1/0 switch and the power cable) the yellow light is on. I tried a lot in the OBP, but it all ends with "login-all? failed". It won't boot from any disks (it does recognize them, however), and it won't boot from the CD-ROM drive. I have tried removing one board after another to find a faulty one, but with no result other than the one described: "login-all? failed". I did a "reset-all" between every "probe-scsi-all" and "probe-fcal-all".
probe-scsi-all gives:
/pci@13,4000/SUNW,isptwo@3
/sbus@2,0/SUNW,fas@2,8800000
/sbus@3,0/SUNW,fas@3,8800000
probe-fcal-all gives:
/sbus@2,0/SUNW,socal@d,10000/sf@1,0
login-all? failed
login-all? failed
/sbus@2,0/SUNW,socal@d,10000/sf@0,0
login-all? failed
login-all? failed
{6} ok
Ideas? :frowning:

Are you able to boot from CD-ROM? Please try.

I cannot boot from CD-ROM; the whole SCSI side of the machine is non-functional. The CD-ROM drive won't even open, the disk(s) don't spin, and neither do the fans underneath. They turn on for sometimes 3 seconds, sometimes 3 minutes. It must be the power supply, but I have never heard of one going bad, so I thought I might accidentally have turned it off with some strange misspelled command.
Here is the serial output:
Hardware Power ON
3,0>ERROR: TEST=Environmental Probe,SUBTEST=Environmental Probe ID=1f.1
3,0>Component under test: Board 3 System Interrupt
3,0>Peripherial Power Supply PPSS_0 Failed
3,0>ERROR: TEST=Environmental Probe,SUBTEST=Environmental Probe ID=1f.1
3,0>Component under test: Board 3 System Interrupt
3,0>Peripherial Power Supply PPSS_0 Failed
3,0>ERROR: TEST=System Interrupts,SUBTEST=System Interrupts ID=1f.1
3,0>Component under test: Board 3 System Interrupt
3,0>Peripherial Power Supply PPSS_0 Failed
3,0>ERROR: TEST=System Interrupts,SUBTEST=System Interrupts ID=1f.1
3,0>Component under test: Board 3 System Interrupt
3,0>Peripherial Power Supply PPSS_0 Failed

5-slot Sun Enterprise E3500, No Keyboard
OpenBoot 3.2.30, 8192 MB memory installed, Serial #10563790.
Copyright 2002 Sun Microsystems, Inc. All rights reserved
Ethernet address 8:0:20:a1:30:ce, Host ID: 80a130ce.

{6} ok
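
In case it helps anyone reading a longer capture: the POST errors above repeat the same failure several times, so it can be easier to collapse them into a count per distinct message. This is only a rough Python sketch, assuming the serial output was saved as plain text and that failure lines look like the `board,cpu>... Failed` lines in this capture; the function name `summarize_post_errors` is made up for illustration.

```python
import re
from collections import Counter

def summarize_post_errors(serial_text):
    """Count each distinct '<board>,<cpu>>... Failed' message in a POST capture."""
    counts = Counter()
    for line in serial_text.splitlines():
        # Assumed line shape, taken from this capture: "3,0>Some Component Failed"
        match = re.match(r"\s*\d+,\d+>(.*\bFailed)\s*$", line)
        if match:
            counts[match.group(1).strip()] += 1
    return counts

# Excerpt copied from the serial output in this thread.
SAMPLE_POST = """\
3,0>ERROR: TEST=Environmental Probe,SUBTEST=Environmental Probe ID=1f.1
3,0>Component under test: Board 3 System Interrupt
3,0>Peripherial Power Supply PPSS_0 Failed
3,0>ERROR: TEST=System Interrupts,SUBTEST=System Interrupts ID=1f.1
3,0>Component under test: Board 3 System Interrupt
3,0>Peripherial Power Supply PPSS_0 Failed
"""

for message, count in summarize_post_errors(SAMPLE_POST).items():
    print(count, "x", message)
```

On this capture every failure line names the same component, PPSS_0.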

You can always do a `show-post-results`.

However, I see power supplies die constantly (in a datacenter with several thousand pieces of equipment). Because of the type of server it is, Sun may not fully support all spare parts. Still, give them a call. If you have a service contract with them, call them up; they will want an explorer sent to them. If not, grab the part number. It's usually an XXX-XXXX number, something like 300-1358. Buy one and replace it. :smiley:

{6} ok show-post-results

Slot 1 - Status=Okay, Type: I/O Type 4

Sysio0=P Sysio1=P FEPS=P FEPSFC=0 SOC=P
Sbus0=P Sbus1=P Sbus2=P
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
TODC=P JTAG=P CntrPl=P DC=ff

Slot 3 - Status=Okay, Type: CPU/Memory

Cpu0=P    Cpu0-OK=P   FailCode=0       Cpu1=P    Cpu1-OK=P   FailCode=0   
  AC=P        FHC=P       SRAM=P      FPROM=P     LabCon=Not   Ovtemp=Not 

Bank0=0 Bank1=0 DTag0=P DTag1=P JTAG=P CntrPl=P
Bank0=P Bank1=P DC=ff

Slot 5 - Status=Okay, Type: I/O Type 3

Sysio0=P Sysio1=P FEPS=*** FEPSFC=0 SOC=***
Sbus0=*** Sbus2=***
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
TODC=P JTAG=P CntrPl=P DC=ff

Slot 7 - Status=Okay, Type: CPU/Memory

Cpu0=P    Cpu0-OK=P   FailCode=0       Cpu1=P    Cpu1-OK=P   FailCode=0   
  AC=P        FHC=P       SRAM=P      FPROM=P     LabCon=Not   Ovtemp=Not 

Bank0=0 Bank1=0 DTag0=P DTag1=P JTAG=P CntrPl=P
Bank0=P Bank1=P DC=ff

Slot 9 - Status=Okay, Type: I/O Type 3

Sysio0=P Sysio1=P FEPS=*** FEPSFC=0 SOC=***
Sbus0=*** Sbus2=***
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not
TODC=P JTAG=P CntrPl=P DC=ff

Slot 16 - Status=Fail, Type: Clock

Clock=P Serial=P KbdMse=P PPS-DC=*** DCReg0=P DCReg1=P
AC=P ACFan=P KeyFan=P PSFail=0 Ovtemp=Not TODC=P
V5-P=P V12-P=P V5-Aux=P V5P-PC=P V12-PC=P V3-PC=P V5-PC=P
RckFan=*** 3.3V=P 5.0V=P Triger=P Coolng=P AC-REV=P

P = Present or Passed
*** = Failed Component
Not = Not present

{6} ok
Can you tell me what is broken from reading this?
M

Replace your power supply unit.
There's probably not enough current going through the power distribution board to the MB and its components, causing the fans to spin at low speed or even die. That in turn causes the CPUs to heat up; the temperature sensors detect that and force the system to shut off. :eek:
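
If you want to pull the failed entries out of a `show-post-results` dump without reading every line, something like the following could work. It is a minimal Python sketch, assuming the output was saved as text in the same layout as the one posted above, where `***` marks a failed component according to the legend. The function name `failed_components` is hypothetical, not part of any Sun tool.

```python
import re

FAILED = "***"  # marker for a failed component, per the legend in the output

def failed_components(post_text):
    """Map each slot header to the component names marked *** under it."""
    results = {}
    slot = None
    for line in post_text.splitlines():
        line = line.strip()
        header = re.match(r"Slot\s+(\d+)\s+-\s+Status=(\w+),\s+Type:\s+(.+)", line)
        if header:
            slot = "Slot {} ({}, Status={})".format(
                header.group(1), header.group(3), header.group(2))
            continue
        # Skip text before the first slot and the legend line "*** = Failed Component".
        if slot is None or line.startswith(FAILED):
            continue
        for name, value in re.findall(r"([\w.\-]+)=(\S+)", line):
            if value == FAILED:
                results.setdefault(slot, []).append(name)
    return results

# Excerpt copied from the show-post-results output in this thread.
SAMPLE_RESULTS = """\
Slot 5 - Status=Okay, Type: I/O Type 3

Sysio0=P Sysio1=P FEPS=*** FEPSFC=0 SOC=***
Sbus0=*** Sbus2=***
AC=P FHC=P SRAM=P FPROM=P LabCon=Not Ovtemp=Not

Slot 16 - Status=Fail, Type: Clock

Clock=P Serial=P KbdMse=P PPS-DC=*** DCReg0=P DCReg1=P
RckFan=*** 3.3V=P 5.0V=P Triger=P Coolng=P AC-REV=P

P = Present or Passed
*** = Failed Component
Not = Not present
"""

for slot, names in failed_components(SAMPLE_RESULTS).items():
    print(slot + ":", ", ".join(names))
```

On the posted output the only slot with `Status=Fail` is slot 16 (the clock board), with `PPS-DC` and `RckFan` marked failed. That looks consistent with the "Peripherial Power Supply PPSS_0 Failed" messages in the serial output, though I am reading the field names by their labels only.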

Yes, that has been the conclusion of many people so far, but since I live in Sweden I haven't been able to come across a cheap one yet; it would cost me 400 dollars to get one from the States. The Sun part number is 300-1400-01, and if someone has one lying around and can give me a better deal, I would appreciate it.
Magnus
vatch23@riseup.net
:smiley: