Our VSAN cluster has been stuck for several days showing a warning "vSphere HA detected an HA cluster state version inconsistency in cluster DEV in datacenter X". A task "Edit VSAN iSCSI target service in cluster" has been stuck at 50% completion for two days. I am unable to view any VSAN status or configuration info in the vSphere client (all selections hang), and eventually an error pops up "The query execution timed out because of a back-end property provider 'com.vmware.vsphere.client.vsan.health.VsanHealthPropertyProvider' which took more than 120 seconds". vCenter does not seem to be able to contact one of the ESXi hosts and I can't put it in maintenance mode or manually migrate any VMs off it..
One weird thing I am seeing is /locker/var/core (which is ver small, only 256MB) is filling up with core files named python-zdump.000, .001. (there is only room for 2)Any ideas what those are?
The VMs in the cluster seems to be running OK. How do I get control of VSAN bacK?
This is vSphere Version 6.5.0.10000 Build 6671409, ESXi 6.5.0, 5969303