Opnsense randomly (?) crashes

Started by meikel, June 02, 2026, 09:06:59 AM

Previous topic - Next topic
Quote from: Nullman on June 03, 2026, 08:02:35 PMIts right there in his signature.
I see signatures of users but his is empty ?!

Quote from: meikel on June 03, 2026, 08:10:22 PMI also have to correct myself: the SSD is not running in raid1.

It's a single SSD but as a live OS is also crashing I assume it's not the SSD.
I also doubt that a faulty SSD crashes the whole machine.
Sometimes Live CD/ISO Images try to mount local storage automatically and if that fails in a certain way it can cause issues too so I would check that again just to be sure !!

- Which model is it ?
- Are there firmware updates available ?
- What does smartctl/nvmecontrol say ?
- Are there known issues with it in certain scenarios ?

And indeed : Memory testing should be done for at least 24 hours !!
Weird guy who likes everything Linux and *BSD on PC/Laptop/Tablet/Mobile and funny little ARM based boards :)

Today at 08:31:06 AM #16 Last Edit: Today at 08:35:38 AM by meikel Reason: added memtest info
The memtest was fine:




So I think I found the culprit:


Well I guess it was the SSD all along. I didn't think a system drive failing would have such symptoms.

Sadly the story didn't end here:

* I tried to clone the drive onto a new drive which took way to long and didn't work at the end. The OS booted but couldn't find some files (don't ask me what exactly).
* So I took another road: Backup opnsense via UI and import in a freshly installed version. Should be straightforward, right? Wrong
* I didn't ever bothering migrating from "deprecated" features as long as they worked or I get the message "This will no longer work in the next version". It's just my homelab after all, if it's working it's good enough.
* Today I learned ISC DCHP is so hard deprecated that it is not even backed up when you create a backup. It backups the config values but does not backup (now a plugin) ISC from what I could tell.
* I booted the original drive (and hoped it would survive as long as necessary (spoiler: it did not - had to restart multiple times) and migrated everything to Kea according to this guide
* Now everything is working as before and I hope the device survives longer this time

Afterthought: Shouldn't opnsense warn me about bad smart values? Is there any way to enable this?

Thanks for everyone helping me, bringing ideas to the table and for your valuable time.

There is a SMARTS plugin you can install and run tests from OPNsense directly.
It comes with a widget to show the health of the latest test.

Regards,
S.
Networking is love. You may hate it, but in the end, you always come back to it.

OPNSense HW
N355 - i226-V | AQC113C | 16G | 500G - PROD

PRXMX
N5105 - i226-V | 2x8G | 512G - NODE #1
N100 - i226-V | 16G | 1T - NODE #2

Hah! Called the failing SSD!

Quote from: meikel on Today at 08:31:06 AM* Today I learned ISC DCHP is so hard deprecated that it is not even backed up when you create a backup. It backups the config values but does not backup (now a plugin) ISC from what I could tell.

I just checked a backup that I made last week, when I still had old(disabled) ISC DHCP config left behind after transitioning to Kea. That config is included in the backup XML, under sections "dhcpd" and "dhcpdv6".

AFAIK, you do need to install the plugin (on a fresh installation) before you can restore the config for it, though.