OPNsense 25.7.10 . Noticekernel[370] nvme0: temperature above threshold

Started by dmacgowan, Today at 06:10:16 AM

Previous topic - Next topic
Received the following alert this evening for my firewall. It corresponded to stopping of Internet access.

2026-01-22T22:51:13-06:00Noticekernel[370] nvme0: temperature above threshold

My firewall is installed in my garage in Minnesota and the outside air temperature is below -20 F degrees. Health report states the CPUTEMP is 0 Degrees C. My garage is probably below -5 right now.

The graph seems to not allow below O measurements.
Any advice besides heating my firewall with a hair dryer?

You can look at the actual NVME temps with "smartctl -a /dev/nvme0ns1" after having installed the os-smart plugin.

These things tend to get very hot, especially with high usage. You might suffer the "hostwatch" write problem discussed here.
Intel N100, 4* I226-V, 2* 82559, 16 GByte, 500 GByte NVME, ZTE F6005

1100 down / 800 up, Bufferbloat A+

Quote from: meyergru on Today at 11:37:37 AMYou can look at the actual NVME temps with "smartctl -a /dev/nvme0ns1" after having installed the os-smart plugin.
Now I know why smartctl isn't available by default...

IMHO it should be :)

QuoteThese things tend to get very hot, especially with high usage.
IMHO mainly the Samsung "Pro" Series NVMe SSD's have that issue and the rest not as much.

Their controller gets a lot hotter compared to other brands/solutions AFAIK.
Weird guy who likes everything Linux and *BSD on PC/Laptop/Tablet/Mobile and funny little ARM based boards :)

Quote from: nero355 on Today at 04:46:33 PM[...]IMHO mainly the Samsung "Pro" Series NVMe SSD's have that issue and the rest not as much.[...]

Overall, it's just a matter of:
  • power targets
  • thermal mass
  • radiative area
  • convection/airflow

I get "enterprise" or "data center" devices (a stretch for M.2, but hey), which tend to nail themselves at maximum power (~8W for M.2) for maximum performance. Keeping such devices in the M.2 form factor cool is a challenge, especially where radiative area is constrained. Consumer-type devices keep cool(er) mainly through low power targets (which hurt performance) and PSLC caching (which greatly improves performance, but has a limited endurance). Same old tradeoffs.