2024-08-13T21:45:01 Notice kernel (nda0:nvme0:0:0:1): Error 5, Retries exhausted 2024-08-13T21:45:01 Notice kernel (nda0:nvme0:0:0:1): CAM status: Unknown (0x420) 2024-08-13T21:45:01 Notice kernel (nda0:nvme0:0:0:1): READ. NCB: opc=2 fuse=0 nsid=1 prp1=0 prp2=0 cdw=11e0c7d0 0 27 0 0 0 2024-08-13T21:45:01 Notice kernel nvme0: UNRECOVERED READ ERROR (02/81) crd:0 m:0 dnr:0 p:1 sqid:2 cid:118 cdw0:0 2024-08-13T21:45:01 Notice kernel nvme0: READ sqid:2 cid:118 nsid:1 lba:299943888 len:40
dd if=/dev/nda0 of=/dev/null bs=512 skip=299943888 count=40
dd if=/dev/nda0 of=/dev/null bs=512 skip=299943888 count=40dd: /dev/nda0: Input/output error32+0 records in32+0 records out16384 bytes transferred in 0.007795 secs (2101909 bytes/sec)
smartctl -a /dev/nvme0smartctl 7.4 2023-08-01 r5530 [FreeBSD 14.1-RELEASE-p3 amd64] (local build)Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org=== START OF INFORMATION SECTION ===Model Number: UMIS LENSE40256GMSP34MESTB3ASerial Number: SS0L25152X3RC0AF114XFirmware Version: 2.3.7182PCI Vendor/Subsystem ID: 0x1cc4IEEE OUI Identifier: 0x044a50Total NVM Capacity: 256,060,514,304 [256 GB]Unallocated NVM Capacity: 0Controller ID: 6059NVMe Version: 1.3Number of Namespaces: 1Namespace 1 Size/Capacity: 256,060,514,304 [256 GB]Namespace 1 Utilization: 0Namespace 1 Formatted LBA Size: 512Namespace 1 IEEE EUI-64: 504a04 c500000000Local Time is: Wed Aug 14 20:27:21 2024 AESTFirmware Updates (0x12): 1 Slot, no Reset requiredOptional Admin Commands (0x0017): Security Format Frmw_DL Self_TestOptional NVM Commands (0x0016): Wr_Unc DS_Mngmt Sav/Sel_FeatLog Page Attributes (0x03): S/H_per_NS Cmd_Eff_LgMaximum Data Transfer Size: 32 PagesWarning Comp. Temp. Threshold: 80 CelsiusCritical Comp. Temp. Threshold: 84 CelsiusSupported Power StatesSt Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat 0 + 6.50W 6.50W - 0 0 0 0 0 0 1 + 4.60W 4.60W - 1 1 1 1 5 5 2 + 3.90W 3.90W - 2 2 2 2 5 5 3 - 1.50W 1.50W - 3 3 3 3 4000 4000 4 - 0.0050W 0.50W - 4 4 4 4 20000 30000Supported LBA Sizes (NSID 0x1)Id Fmt Data Metadt Rel_Perf 0 + 512 0 1=== START OF SMART DATA SECTION ===SMART overall-health self-assessment test result: PASSEDSMART/Health Information (NVMe Log 0x02)Critical Warning: 0x00Temperature: 43 CelsiusAvailable Spare: 98%Available Spare Threshold: 3%Percentage Used: 100%Data Units Read: 5,950,084 [3.04 TB]Data Units Written: 985,528,704 [504 TB]Host Read Commands: 82,219,540Host Write Commands: 10,135,637,294Controller Busy Time: 492,801Power Cycles: 53Power On Hours: 32,993Unsafe Shutdowns: 16Media and Data Integrity Errors: 2,809Error Information Log Entries: 3,018Warning Comp. Temperature Time: 0Critical Comp. Temperature Time: 0Temperature Sensor 1: 43 CelsiusError Information (NVMe Log 0x01, 16 of 64 entries)Num ErrCount SQId CmdId Status PELoc LBA NSID VS Message 0 3018 2 0x005f 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 1 3017 3 0x0068 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 2 3016 2 0x0063 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 3 3015 4 0x0073 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 4 3014 2 0x006d 0x0281 0xe800 0 1 - Unknown Command Specific Status 0x40 5 3013 1 0x0061 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 6 3012 4 0x007b 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 7 3011 3 0x006f 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 8 3010 3 0x006c 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 9 3009 1 0x006b 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 10 3008 2 0x007b 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 11 3007 2 0x0079 0x0281 0x7801 0 1 - Unknown Command Specific Status 0x40 12 3006 2 0x007d 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40 13 3005 2 0x0079 0x0281 0x7801 0 1 - Unknown Command Specific Status 0x40 14 3004 2 0x0072 0x0281 0x7d1 0 1 - Unknown Command Specific Status 0x40 15 3003 1 0x006b 0x0281 0x000 0 1 - Unknown Command Specific Status 0x40... (48 entries not read)Self-test Log (NVMe Log 0x06)Self-test status: No self-test in progressNum Test_Description Status Power_on_Hours Failing_LBA NSID Seg SCT Code 0 Extended Completed: failed segments 32993 61616 1 7 - -
Failing_LBA61616
Except I also see posts like this, which suggests that the failures might not be what they seem to be.
I thought routers/firewalls weren't that disk intensive? I don't have OPN configured to do excessive firewall logging.