I have troubles making clstat
work. All the "usual suspects" have been covered but still no luck. The topology is a two-node active/passive with only one network-interface (it is a test-setup). The application running is SAP with DB/2 as database. We do not use SmartAssists or other gadgets.
Here are the OS and HACMP-versions:
# oslevel -s
7100-03-02-1412
# lslpp -L "cluster*"
Fileset Level State Type Description (Uninstaller)
----------------------------------------------------------------------------
cluster.adt.es.client.include
7.1.3.1 C F PowerHA SystemMirror Client
Include Files
cluster.adt.es.client.samples.clinfo
7.1.3.0 C F PowerHA SystemMirror Client
CLINFO Samples
cluster.es.client.clcomd 7.1.3.1 C F Cluster Communication
Infrastructure
cluster.es.client.lib 7.1.3.1 C F PowerHA SystemMirror Client
Libraries
cluster.es.client.rte 7.1.3.1 C F PowerHA SystemMirror Client
Runtime
cluster.es.client.utils 7.1.3.0 C F PowerHA SystemMirror Client
Utilities
cluster.es.cspoc.cmds 7.1.3.1 C F CSPOC Commands
cluster.es.cspoc.rte 7.1.3.1 C F CSPOC Runtime Commands
cluster.es.migcheck 7.1.3.0 C F PowerHA SystemMirror Migration
support
cluster.es.nfs.rte 7.1.3.0 C F NFS Support
cluster.es.server.diag 7.1.3.1 C F Server Diags
cluster.es.server.events 7.1.3.1 C F Server Events
cluster.es.server.rte 7.1.3.1 C F Base Server Runtime
cluster.es.server.testtool
7.1.3.0 C F Cluster Test Tool
cluster.es.server.utils 7.1.3.1 C F Server Utilities
cluster.license 7.1.3.0 C F PowerHA SystemMirror
Electronic License
cluster.man.en_US.es.data 7.1.3.1 C F Man Pages - U.S. English
cldump
works and all other cluster services are working as expected too. Alas, calling clstat:
# clstat -a
Failed retrieving cluster information.
There are a number of possible causes:
clinfoES or snmpd subsystems are not active.
snmp is unresponsive.
snmp is not configured correctly.
Cluster services are not active on any nodes.
Refer to the HACMP Administration Guide for more information.
I followed this procedure and double-checked everything mentioned there:
# tail -3 /etc/snmpdv3.conf
smux 1.3.6.1.4.1.2.3.1.2.1.2 gated_password
VACM_VIEW defaultView 1.3.6.1.4.1.2.3.1.2.1.5 - included -
smux 1.3.6.1.4.1.2.3.1.2.1.5 clsmuxpd_password ::1 128
# snmpinfo -m dump -v -o /usr/es/sbin/cluster/hacmp.defs cluster
clusterId.0 = 1560242040
clusterName.0 = "<mycluster>"
clusterConfiguration.0 = ""
clusterState.0 = 2
clusterPrimary.0 = 1
clusterLastChange.0 = 1412260986
clusterGmtOffset.0 = -3600
clusterSubState.0 = 32
clusterNodeName.0 = "<my-node-name-a>"
clusterPrimaryNodeName.0 = "<my-node-name-a>"
clusterNumNodes.0 = 2
clusterNodeId.0 = 1
clusterNumSites.0 = 0
I also made sure the services are up and snmpd is the correct one:
# lssrc -g cluster
Subsystem Group PID Status
clstrmgrES cluster 10027094 active
clinfoES cluster 18743412 active
# lssrc -a
aixmibd tcpip 27263194 active
snmpmibd tcpip 5046514 active
hostmibd tcpip 30802078 active
[...]
snmpd tcpip 24772704 active
# ls -l /usr/sbin/snmpd
lrwxrwxrwx 1 root system 9 Feb 5 2014 /usr/sbin/snmpd -> snmpdv3ne
The loopback-addresses for IPv6 are there in the /etc/hosts
:
# head -2 /etc/hosts
127.0.0.1 loopback localhost # loopback (lo0) name/address
::1 loopback localhost # IPv6 loopback (lo0) name/address
In the cited document it is mentioned to remove the comments in /etc/snmpdv3.conf
as a last-ditch effort which i did. The services were restarted as described there and finally the whole system rebooted. I also did a cluster verification and synchronisation (in fact several times, before and after the reboot).
To be honest i am out of ideas what i still could do.
bakunin