Greetings!
I'm testing a failover solution for NFSv4 on RHEL6 with the latest updates.
My script unmounts the faulty NFS share (umount -lf /share) when it sees that the share is hanging on the client (the NFS daemon is down on the NFS server), and then mounts the share from another, healthy NFS server.
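The failover step described above could be sketched roughly as follows. This is only an illustration under assumptions, not the actual script: the function names (run, share_is_hung, failover), the DRY_RUN switch, the 5-second default timeout, and the export backup.example.com:/export are hypothetical placeholders. Only 'umount -lf' and the /share mount point come from the setup described here.

```shell
#!/bin/bash
# Sketch of the failover step. All function names, DRY_RUN and the
# example export are hypothetical placeholders.

# Run a command, or only print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Succeeds (returns 0) when a stat on the mount point fails or does not
# finish within the given number of seconds (default 5, an assumption).
share_is_hung() {
    local mountpoint="$1"
    local secs="${2:-5}"
    ! timeout -s KILL "$secs" stat "$mountpoint" >/dev/null 2>&1
}

# Lazily force-unmount the faulty share, then mount the replacement
# export on the same mount point.
failover() {
    local new_export="$1"
    local mountpoint="$2"
    run umount -lf "$mountpoint"
    run mount -t nfs4 "$new_export" "$mountpoint"
}
```

A possible call would be 'share_is_hung /share && failover backup.example.com:/export /share'; setting DRY_RUN=1 first only prints the two commands instead of running them.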
Sometimes I can see that a process/thread is still in memory and keeps producing the message from the subject line: 'kernel: nfs: server SERVER not responding, timed out'
At that point I already have the new, healthy NFS share mounted from another server, and 'nfsstat -m' shows that it is the only share on the NFS client system.
I've tried the following commands to find the stuck process/thread:
- lsof -i | egrep 'SERVER|SERVER_IP'
- lsof +d /share
- fuser -fvm /share
- netstat -anop | egrep 'SERVER|SERVER_IP'
- ... and a couple of others with lsof, but none of them shows a PID.
Could it be a thread? The thread-related options of the ps command don't show IP addresses or host names, but the kernel keeps logging the error.
Any suggestion on how to detect it is welcome.
Thank you,
Arsene