Hmm ...Set keepalive to 5s for *one* endpoint and rebooted the central gateway.All 4 branch offices (OPNSense based) came back and reconnected nicely.Set keepalive back to 25s and tried again.Same result.Last thursday I tried the reboot (with 25s keepalive) for three times and each time only the OpenWRT based branch offices reconnected. On the OPNSense based branch offices I had to restart wireguard I'm a bit clueless now, looks like some strange race condition ...
A race would be reproduceable
Can you setup a FreeBSD 13 VM at one of the remotes and see if you get the same results? The fix will likely need to come from upstream. Anything in the logs?
Did some more testing:Whether I restart or power cycle the "central" OPNsense seem's not to make a real difference.I typically ended up with some or all "branch" gateways not reconnecting via wireguard unless wireguard is restarted on the branch gateway. Rarely (2 of 10 attempts) they all came up again.The OpenWRT based branch routers (3 of 7) always reconnected.I think, I will fix this with a cron script checking the wireguard tunnel on branch gateways and restarting if necessary