Hi,
We had a hardware problem with an IBM System p5 server running AIX 5.2.
We restored the last backup we had from tape, but the server does not boot as expected.
The server tries to mount some file systems from external storage but cannot communicate with it. We checked the Fibre Channel (FC) connections and everything looks fine.
A second alert reports a misconfiguration on an Ethernet adapter.
After these alerts, the server enters a loop trying to find the storage.
I'm stuck on this problem. Any clues?
Thank you in advance.
Here is the log:
Welcome to AIX.
boot image timestamp: 22:18 05/29
The current time and date: 18:02:15 05/31/2019
number of processors: 2 size of memory: 1904Mb
boot device: /pci@800000020000003/pci@2,4/pci1069,b166@1/scsi@0/sd@5:2
closing stdin and stdout...
-------------------------------------------------------------------------------
Saving Base Customize Data to boot disk
Starting the sync daemon
Mounting the platform dump file system, /var/adm/ras/platform
Starting the error daemon
System initialization completed.
Setting tunable parameters...complete
Starting Multi-user Initialization
Performing auto-varyon of Volume Groups
Activating all paging spaces
0517-075 swapon: Paging device /dev/hd6 is already active.
/dev/rhd1 (/users): ** Unmounted cleanly - Check suppressed
/dev/rhd10opt (/opt): ** Unmounted cleanly - Check suppressed
Performing all automatic mounts
mount: 0506-324 Cannot mount /dev/fslv00 on /home: A file or directory in the path name does not exist.
Replaying log for /dev/fslv01.
mount: 0506-324 Cannot mount /dev/fslv03 on /home/monitor_logs: A file or directory in the path name does not exist.
mount: 0506-324 Cannot mount /dev/archdwdg_S_01 on /usr/hacmp_fs/archdw_dg: A file or directory in the path name does not exist.
mount: 0506-324 Cannot mount /dev/archdg_S_01 on /usr/hacmp_fs/arch_dg: A file or directory in the path name does not exist.
Multi-user initialization completed
nsmb0 Available
Checking for srcmstr active...complete
Starting tcpip daemons:
0513-059 The syslogd Subsystem has been started. Subsystem PID is 27108.
0513-059 The portmap Subsystem has been started. Subsystem PID is 27546.
0513-059 The inetd Subsystem has been started. Subsystem PID is 24010.
0513-059 The snmpd Subsystem has been started. Subsystem PID is 24672.
May 31 14:03:57 localhost snmpd[24672]: EXCEPTIONS: open_device: Unable to connect to device driver.
May 31 14:03:57 localhost last message repeated 2 times
0513-059 The dpid2 Subsystem has been started. Subsystem PID is 24938.
0513-059 The hostmibd Subsystem has been started. Subsystem PID is 26714.
0513-059 The aixmibd Subsystem has been started. Subsystem PID is 25218.
0513-059 The muxatmd Subsystem has been started. Subsystem PID is 27162.
Finished starting tcpip daemons.
Starting NFS services:
May 31 14:04:09 localhost syslog: /usr/sbin/ifconfig -l
0513-059 The biod Subsystem has been started. Subsystem PID is 25658.
0513-059 The nfsd Subsystem has been started. Subsystem PID is 29758.
0513-059 The rpc.mountd Subsystem has been started. Subsystem PID is 28002.
0513-059 The rpc.lockd Subsystem has been started. Subsystem PID is 22564.
May 31 14:04:13 localhost syslog: /usr/sbin/ifconfig -l
May 31 14:04:13 localhost automountd[25508]: svc_create: no well known address for autofs on transport udp
May 31 14:04:13 localhost syslog: dlopen(/usr/ldap/lib/libibmldapn.a) failed: A file or directory in the path name does not exist.
May 31 14:04:13 localhost syslog: WARNING: ldap is not loaded
May 31 14:04:13 localhost syslog: WARNING: ldap is not configured
May 31 14:04:13 localhost unix:
May 31 14:04:13 localhost unix:
May 31 14:04:13 localhost unix:
May 31 14:04:13 localhost unix:
Completed NFS services.
May 31 14:04:20 localhost no[30460]: Network option tcp_keepinit was set to the value 40
May 31 14:04:20 localhost no[30462]: Network option tcp_keepidle was set to the value 20
May 31 14:04:23 localhost no[30208]: Network option tcp_keepintvl was set to the value 15
May 31 14:04:26 localhost no[30210]: Network option tcp_sendspace was set to the value 262144
May 31 14:04:29 localhost no[30212]: Network option tcp_recvspace was set to the value 262144
May 31 14:04:32 localhost no[30214]: Network option udp_sendspace was set to the value 65536
May 31 14:04:35 localhost no[30216]: Network option udp_recvspace was set to the value 262144
May 31 14:04:38 localhost no[30218]: Network option rfc1323 was set to the value 1
May 31 14:04:41 localhost no[30220]: Network option udp_sendspace was set to the value 65536
May 31 14:04:41 localhost no[30222]: Network option udp_recvspace was set to the value 262144
May 31 14:04:42 localhost su: from root to imdba at /dev/tty??
0513-059 The clcomdES Subsystem has been started. Subsystem PID is 31510.
0513-059 The nmbd Subsystem has been started. Subsystem PID is 31274.
0513-059 The smbd Subsystem has been started. Subsystem PID is 30226.
May 31 14:04:42 localhost last message repeated 6 times
May 31 14:04:50 localhost no[33086]: Network option routerevalidate was set to the value 1
May 31 14:05:03 localhost no[33864]: Network option nonlocsrcroute was set to the value 1
May 31 14:05:03 localhost no[33866]: Network option ipsrcroutesend was set to the value 1
May 31 14:05:03 localhost no[33866]: Network option ipsrcrouterecv was set to the value 1
May 31 14:05:03 localhost no[33866]: Network option ipsrcrouteforward was set to the value 1
May 31 14:05:06 localhost topsvcs[24034]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6OP0ZW0GnKwQ/fhG./... z/...................:::Reference ID: :::Template ID: a29426da::: Details File: :::Location: rsct,nim_control.C,1.39.1.2,4359 :::TS_MISCFG_ER Local adapter misconfiguration detected Adapter interface name en 3 Adapter offset 0 Adapter expected IP address 10.189.125.27
May 31 14:05:06 localhost topsvcs[24034]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6OP0ZW0GnKwQ/p/I./... z/...................:::Reference ID: :::Template ID: a29426da::: Details File: :::Location: rsct,nim_control.C,1.39.1.2,4359 :::TS_MISCFG_ER Local adapter misconfiguration detected Adapter interface name en 1 Adapter offset 1 Adapter expected IP address 10.189.126.27
May 31 14:05:06 localhost topsvcs[24034]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6FYVDG0GnKwQ/GlJ./... z/...................:::Reference ID: :::Template ID: 923e1911::: Details File: :::Location: rsct,nim_control.C,1.39.1.2,4428 :::TS_NIM_OPEN_ERROR_ER Failed to open NIM connection Interface name rhdisk29 Description 1 SYSCALL Description 2 openx() Value 1 19 Value 2 0
May 31 14:05:19 localhost clstrmgrES[36390]: Fri May 31 14:05:19 HACMP/ES Cluster Manager Started
May 31 14:05:19 localhost clstrmgrES[36390]: Fri May 31 14:05:19 IpcInit: called
0513-029 The ctrmc Subsystem is already active.
Multiple instances are not supported.
May 31 14:05:53 localhost HACMP for AIX: EVENT START: node_up hd2z
May 31 14:05:55 localhost HACMP for AIX: EVENT FAILED: 1: node_up hd2z 1
May 31 14:05:55 localhost HACMP for AIX: EVENT START: event_error 1 TE_JOIN_NODE
WARNING: Cluster his_cluster Failed while running event [JOIN], exit status was 1
May 31 14:05:55 localhost HACMP for AIX: EVENT FAILED: -1: event_error 1 TE_JOIN_NODE -1
Fri May 31 14:10:39 AST 2019
Automatic Error Log Analysis for sysplanar0 has detected a problem.
The Service Request Number is B7006970: I/O subsystem (hub, bridge, bus) Unrecovered Error, general. Refer to the system service documentation for more information.
Additional Words: 2-00000062 3-00010002 4-24030230 5-00000000 6-00001140 7-00020000 8-00000000 9-00000000.
May 31 14:11:50 localhost HACMP for AIX: EVENT START: config_too_long 360 /usr/es/sbin/cluster/events/node_up.rp
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 360 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 390 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 420 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 450 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 480 seconds. Please check cluster status.
Fri May 31 14:13:54 AST 2019
Automatic Error Log Analysis for sysplanar0 has detected a problem.
The Service Request Number is BA210000: Platform Firmware Unrecovered Error, general. Refer to the system service documentation for more information.
Additional Words: 2-20202020 3-20202020 4-20202020 5-20202020 6-20202020 7-20202020 8-20202020 9-20202020.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 540 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 600 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 660 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 720 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 780 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 900 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 1020 seconds. Please check cluster status.
WARNING: Cluster his_cluster has been running recovery program '/usr/es/sbin/cluster/events/node_up.rp' for 1140 seconds. Please check cluster status.
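
To make the failures in the log above easier to spot, here is a minimal sketch in Python that pulls the failed mounts and the failed HACMP events out of a saved copy of the console output. It only matches the two message shapes that appear in this log ("mount: 0506-324 Cannot mount ..." and "HACMP for AIX: EVENT FAILED: ..."). The function name summarize and the SAMPLE text are just illustrative, not part of any AIX or HACMP tool, and the script would be run on another machine where Python is available.

```python
import re

# AIX mount failures in this log look like:
#   mount: 0506-324 Cannot mount <device> on <mount point>: <reason>
MOUNT_FAIL = re.compile(r"^mount: 0506-324 Cannot mount (\S+) on (\S+):")

# HACMP event failures in this log look like:
#   <timestamp> localhost HACMP for AIX: EVENT FAILED: <code>: <event text>
EVENT_FAIL = re.compile(r"HACMP for AIX: EVENT FAILED: (-?\d+): (.*)$")


def summarize(log_text):
    """Return (failed_mounts, failed_events) found in the log text.

    failed_mounts is a list of (device, mount_point) tuples.
    failed_events is a list of (exit_code, event_text) tuples.
    """
    failed_mounts = []
    failed_events = []
    for line in log_text.splitlines():
        line = line.strip()
        mount = MOUNT_FAIL.match(line)
        if mount:
            failed_mounts.append((mount.group(1), mount.group(2)))
            continue
        event = EVENT_FAIL.search(line)
        if event:
            failed_events.append((int(event.group(1)), event.group(2).strip()))
    return failed_mounts, failed_events


# A few lines copied from the log above, used as sample input.
SAMPLE = """\
mount: 0506-324 Cannot mount /dev/fslv00 on /home: A file or directory in the path name does not exist.
Replaying log for /dev/fslv01.
May 31 14:05:55 localhost HACMP for AIX: EVENT FAILED: 1: node_up hd2z 1
"""

if __name__ == "__main__":
    mounts, events = summarize(SAMPLE)
    for device, mount_point in mounts:
        print(f"failed mount: {device} -> {mount_point}")
    for code, event_text in events:
        print(f"failed event (exit {code}): {event_text}")
```

Run against the full log, this should list the four logical volumes that could not be mounted (fslv00, fslv03, archdwdg_S_01, archdg_S_01) and the two failed cluster events, which are the parts I cannot explain.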