OPNsense 25.7.10 . Noticekernel[370] nvme0: temperature above threshold

Started by dmacgowan, January 23, 2026, 06:10:16 AM

Previous topic - Next topic
Received the following alert this evening for my firewall. It corresponded to stopping of Internet access.

2026-01-22T22:51:13-06:00Noticekernel[370] nvme0: temperature above threshold

My firewall is installed in my garage in Minnesota and the outside air temperature is below -20 F degrees. Health report states the CPUTEMP is 0 Degrees C. My garage is probably below -5 right now.

The graph seems to not allow below O measurements.
Any advice besides heating my firewall with a hair dryer?

You can look at the actual NVME temps with "smartctl -a /dev/nvme0ns1" after having installed the os-smart plugin.

These things tend to get very hot, especially with high usage. You might suffer the "hostwatch" write problem discussed here.
Intel N100, 4* I226-V, 2* 82559, 16 GByte, 500 GByte NVME, ZTE F6005

1100 down / 800 up, Bufferbloat A+

Quote from: meyergru on January 23, 2026, 11:37:37 AMYou can look at the actual NVME temps with "smartctl -a /dev/nvme0ns1" after having installed the os-smart plugin.
Now I know why smartctl isn't available by default...

IMHO it should be :)

QuoteThese things tend to get very hot, especially with high usage.
IMHO mainly the Samsung "Pro" Series NVMe SSD's have that issue and the rest not as much.

Their controller gets a lot hotter compared to other brands/solutions AFAIK.
Weird guy who likes everything Linux and *BSD on PC/Laptop/Tablet/Mobile and funny little ARM based boards :)

Quote from: nero355 on January 23, 2026, 04:46:33 PM[...]IMHO mainly the Samsung "Pro" Series NVMe SSD's have that issue and the rest not as much.[...]

Overall, it's just a matter of:
  • power targets
  • thermal mass
  • radiative area
  • convection/airflow

I get "enterprise" or "data center" devices (a stretch for M.2, but hey), which tend to nail themselves at maximum power (~8W for M.2) for maximum performance. Keeping such devices in the M.2 form factor cool is a challenge, especially where radiative area is constrained. Consumer-type devices keep cool(er) mainly through low power targets (which hurt performance) and PSLC caching (which greatly improves performance, but has a limited endurance). Same old tradeoffs.

os-smart was enabled on my machine.
Information section says that
Warning  Comp. Temp. Threshold:     100 Celsius
Critical Comp. Temp. Threshold:     110 Celsius

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
- temperature is above or below threshold

SMART/Health Information (NVMe Log 0x02, NSID 0xffffffff)
Critical Warning:                   0x02
Temperature:                        -1 Celsius

It would appear that the program doesn't know what to do with a negative temperature reading. It certainly isn't overheating in my -28 degree C garage in the middle of winter.

Did you check the suggested operating temperature for the drive in the specs?

Quote from: dmacgowan on January 23, 2026, 08:33:54 PM[...]It would appear that the program doesn't know what to do with a negative temperature reading. It certainly isn't overheating in my -28 degree C garage in the middle of winter.

Impressive. I'd really worry about thermal shock.

Sheesh. It's about to freeze here. I have a cheap house, so I'd be in a bad way if it got down that low. And now I'm off to try to keep the ice buildup on my porch to a minimum.

Quote from: pfry on January 23, 2026, 05:23:16 PMI get "enterprise" or "data center" devices (a stretch for M.2, but hey)
At least those are still REAL SSD's with decent Power Loss Protection unlike the earlier mentioned Samsung "Pro" Series and many of their competitors !! ;)

Quote from: dmacgowan on January 23, 2026, 08:33:54 PMIt certainly isn't overheating in my -28 degree C garage in the middle of winter.
CPU's have this thing called "Cold Boot bug" often when the temperature drops that low so maybe your SSD has something similar too ?!
Weird guy who likes everything Linux and *BSD on PC/Laptop/Tablet/Mobile and funny little ARM based boards :)