Strange new reboot couple times a day

Started by zzup, March 04, 2024, 11:02:04 PM

Previous topic - Next topic
I'm just gonna chime in to say I'm having a similar issue. I think with as many people as there are this issue doesn't seem to be hardware specific. I just configured the OPNsense (24.1.19) today, an I'm running it inside a vm with NICs passed through. Couple times during the day I hit the aforementioned issue: `configd.py   action rfc2136.reload.wan not found for user root` I got similar entries for my 2 other interfaces. At the same time as the last crash, I saw the following entries in the General log (newest first):

```log
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : wireguard_sync(,wan))   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : webgui_configure_do(,wan))   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : vxlan_configure_do())   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : unbound_configure_do(,wan))   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : openssh_configure_do(,wan))   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : opendns_configure_do())   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : ntpd_configure_do())   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (execute task : dnsmasq_configure_do())   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure newwanip (,wan)   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure vpn (execute task : wireguard_configure_do(,wan))   
Notice   opnsense   /usr/local/etc/rc.newwanip: Resyncing OpenVPN instances for interface WAN.   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure vpn (execute task : openvpn_configure_do(,wan))   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure vpn (execute task : ipsec_configure_do(,wan))   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure vpn (,wan)   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure monitor (execute task : dpinger_configure_do(,WAN_GW))   
Notice   opnsense   /usr/local/etc/rc.newwanip: plugins_configure monitor (,WAN_GW)   
Notice   opnsense   /usr/local/etc/rc.newwanip: ROUTING: keeping inet default route to www.www.www.www   
Notice   opnsense   /usr/local/etc/rc.newwanip: ROUTING: configuring inet default gateway on wan   
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure dns (execute task : unbound_configure_do())   
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure dns (execute task : dnsmasq_configure_do())
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure dns (execute task : dnsmasq_configure_do())
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure dns ()   
Notice   opnsense   /usr/local/etc/rc.newwanip: ROUTING: entering configure using 'wan'   
Notice   opnsense   /usr/local/etc/rc.newwanip: IP renewal starting (new: xxx.xxx.xxx.xxx, old: xxx.xxx.xxx.xxx, interface: wan, device: igc2, force: yes)   
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure dhcp (execute task : dhcpd_dhcp_configure())   
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure dhcp ()
2024-06-19T03:13:58-04:00   Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure ipsec (execute task : ipsec_configure_do(,wan))   
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure ipsec (,wan)   
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure monitor (execute task : dpinger_configure_do(,WAN_GW))   
Notice   opnsense   /usr/local/etc/rc.linkup: plugins_configure monitor (,WAN_GW)   
Notice   opnsense   /usr/local/etc/rc.linkup: ROUTING: setting inet default route to www.www.www.www   
Notice   opnsense   /usr/local/etc/rc.linkup: ROUTING: configuring inet default gateway on wan   
Notice   opnsense   /usr/local/etc/rc.linkup: ROUTING: entering configure using 'wan'   
Notice   dhclient   dhclient-script: Creating resolv.conf   
Notice   dhclient   dhclient-script: New Routers (igc2): www.www.www.www   
Notice   dhclient   dhclient-script: New Broadcast Address (igc2): zzz.zzz.zzz.zzz
Notice   dhclient   dhclient-script: New Subnet Mask (igc2): yyy.yyy.yyy.yyy   
Notice   dhclient   dhclient-script: New IP Address (igc2): xxx.xxx.xxx.xxx   
Notice   dhclient   dhclient-script: New Hostname (igc2): hhhhhhhhhh
Notice   dhclient   dhclient-script: Reason REBOOT on igc2 executing
Notice   kernel   <6>igc2: link state changed to UP   
Notice   dhclient   dhclient-script: Reason PREINIT on igc2 executing   
Notice   opnsense   /usr/local/etc/rc.linkup: DEVD: Ethernet attached event for wan(igc2)   
Notice   kernel   <6>igc2: link state changed to DOWN   
Critical   dhclient   exiting.   
Error   dhclient   connection closed   
Notice   opnsense   /usr/local/etc/rc.linkup: DEVD: Ethernet detached event for wan(igc2)   
Error   opnsense   /usr/local/etc/rc.newwanip: The command '/usr/sbin/daemon -f -p '/var/run/updaterrd.pid' '/var/db/rrd/updaterrd.sh'' returned exit code '3', the output was 'daemon: process already running, pid: 14295'   
```

Of note is that I don't have any vpn nor dns services enabled at this moment, tho it seems configuring them is just a part of the process.

I don't seem to have leaky memory either, nor any other significant metric during the time of the crash. Any ideas?

Even an update to the latest version 24.1.9_3 did not change the problem. Does anyone have another idea?

For those having this issue, do you happen to have ntopng installed on your OPNsense box?

I was having a similar issue after updating to 24.1.8 where my WAN gateways would drop every hour and I would have to bounce the WAN interface if I caught it soon enough or have to reboot to recover internet access. I was getting the 'configd.py   action rfc2136.reload.wan not found for user root' and similar rfc2136.reload errors in the system log files. I was also seeing dhcpd and dchpd6 services stopping and restarting.

Reverting back to 24.1.7 didn't fix anything so I did a clean reinstall of OPNsense (no backup restore) and noticed the issue went away. I slowly turned plugins back on and noticed my issues returned right after I installed ntopng. After deactivating the ntopng plugin the issues went away again.

I also commented on reddit and there was at least one other person who have seen a similar issue with ntopng.

https://www.reddit.com/r/opnsense/comments/1d39a61/comment/l6ca7we/

https://www.reddit.com/r/opnsense/comments/1ddm4sk/comment/l88mrbj/?context=3

I'm now on 24.1.9_3 without issue. I'm not sure what ntopng modifies in the system but it clearly was causing some conflict with my wan gateways and interface.

ntopng is not installed on my system. After a restart, the internet connection can be accessed for a short time, then only e.g. with pkg updates Network is unreachable.

I'm having the same issue with both the "community" and the "business" editions.

Having the same issue, do not have ntopng installed ... any other suggestions?


Quote from: joe26 on June 25, 2024, 12:34:59 AM
Having the same issue, do not have ntopng installed ... any other suggestions?
Imagine a thread with every possible reason for a machine rebooting.
Best to create your own thread with your own settings and we take it from there. It could be hardware problem but each hardware & settings combinations are different.

Quote from: cookiemonster on June 25, 2024, 12:46:08 AM
Quote from: joe26 on June 25, 2024, 12:34:59 AM
Having the same issue, do not have ntopng installed ... any other suggestions?
Imagine a thread with every possible reason for a machine rebooting.
Best to create your own thread with your own settings and we take it from there. It could be hardware problem but each hardware & settings combinations are different.

Thank you, created a new post here: https://forum.opnsense.org/index.php?topic=41237

Hello,
The same error and symptioms is repeating every few weeks since we upgrade to 24.7. Is it a recognized bug by the community? Is there any root cause identified?
Thank you.

getting 3 reboots today just noticed...... latest version..... :'( :'(

This is a newbie to Opnsense. Got my first protectli device with opnsense preinstalled to some 24.1.7 version and arrived on March 11. It was stable for 3 days and then out of excitement I installed the latest version and moved to 25.xx series. Random reboots started. First I thought it to be ipv6, then thought of system overload with IDS/IPS, Zenarmor, ntopng. Having 32GB of ram and new to the system, thought of scenarios ipv6 faulty with ISP, then realised the session from ISP is shown stable and sticky for almost a week, but my protectli kept on asking for DHCP renewals thinking the interface is down. Then I removed ntopng and re-enabled ipv6, it became stable for some days. But now again started to reboot. I feel the logs in opnsense are not clear enough which point to a scenario where it required a reboot. All I get is a sudden log of core boot and <BOOT>.

I can eliminate any issue with ISP, as they have reported and I also can see from their website that my session is stable. Also tried a couple of knock out hook on their website and saw a new ip being issued. So, all good from ISP. Should be something wrong with opnsense.