Seeming cluster failure with: ""Retry failed db interaction (driver: bad connection)""

I tried to reproduce this with a few tricks, but I couldn’t.

Getting to the bottom of this might require several iterations of (adding debug messages and re-testing). I see a couple of options, in order of preference:

  1. Give me SSH address to those 4 VMs, I’ll then build lxd there and test
  2. I upload somewhere a modified version of the lxd .snap file, then you download it and install it by hand, testing again at each new build and sending me the logs.

Let me know how feasible those options would be for you.

The second option would be workable :slight_smile:

However, we recently redid the underlying networking so I’ll give it a little bit of time to see if that actually cures the issue. After all, I suspect unreliable networking was at the core of the issue