I have a cluster of 4 servers, after apt upgrade one and rebooting, now all my servers are showing a blank lxc list, and lxc cluster show say there is no cluster.
The interesting thing, thankfully, the actual running containers are working, and I can even see inside them. But of course all LXC commands don’t work on them. lxd --debug --group lxd seems to be fine on 3 out 4 server. The containers that are not running are blank.
So at this point I am afraid of rebooting production servers, and making problem worse. I tried reboot fourth server and that does not seem make it better.
It seems the LXD cluster database is bad or stuck, is there a way to rebuild it. Or is this thing bound to crash and burn. Any and all help, ideas, etc… are welcomed.
lxc --debug list
DBUG[02-08|00:02:16] Connecting to a local LXD over a Unix socket
DBUG[02-08|00:02:16] Sending request to LXD method=GET url=http://unix.socket/1.0 etag=
Error: Get http://unix.socket/1.0: EOF
may be it is a socket error