LAN drops connectivity and Unable to access opnsense every few days

Started by KirikParty, June 24, 2023, 10:58:05 AM

Previous topic - Next topic
All,

I am very new to opnsense and have been facing a weird issue over the last few weeks. I loose connectivity to opnsense at random every few days.

My setup is quite simple:
I have a PPPoE connection to a bare metal opnsense machine on the wan interface. A single LAN interface then connects to a Gigabit switch,  when then connects to two wifi AP, NAS and a desktop PC via ethernet.

Full specs below:
Opnsense Machine:
version - 23.1.10_1
AMD ryzen 5 1400 on a A320M motherboard
16GB ram
WAN: RTL8111G onboard PCI
LAN: RTL8111G PCIe network card on a PCIE X1 slot.

Services running: Apart from the normal opnsense services, only

  • Adguard home
  • CrowdSec - without any configuration

Switch: Tp-link TL-SG108E - 8 port Gigabit network switch

wifi AP: 2 x netgear wax202

Every few days I loose connectivity to the opnsense machine. The system is still turned on and the fans running, however I cannot access the web UI, all devices report there is no internet and cannot access via SSH either.

All devices connected to thee switch are still accessible. I can ping my NAS from any device connected to wifi etc. A reboot fixes the issue.
Its kind of like there issues,  but none of them apply to me.
https://forum.opnsense.org/index.php?topic=4533.0
https://forum.opnsense.org/index.php?topic=33469.0

I have tried doing a fresh install without saving configuration, but the issue still persists.

I need help is looking at the logs to determine where the issue is. Are the logs saved even though I do a hard reset?
Where can I find the logs? so that I can share it here? Any help is greatly appreciated.

Quote from: KirikParty on June 24, 2023, 10:58:05 AM
All,

I am very new to opnsense and have been facing a weird issue over the last few weeks. I loose connectivity to opnsense at random every few days.

My setup is quite simple:
I have a PPPoE connection to a bare metal opnsense machine on the wan interface. A single LAN interface then connects to a Gigabit switch,  when then connects to two wifi AP, NAS and a desktop PC via ethernet.

Full specs below:
Opnsense Machine:
version - 23.1.10_1
AMD ryzen 5 1400 on a A320M motherboard
16GB ram
WAN: RTL8111G onboard PCI
LAN: RTL8111G PCIe network card on a PCIE X1 slot.

Services running: Apart from the normal opnsense services, only

  • Adguard home
  • CrowdSec - without any configuration

Switch: Tp-link TL-SG108E - 8 port Gigabit network switch

wifi AP: 2 x netgear wax202

Every few days I loose connectivity to the opnsense machine. The system is still turned on and the fans running, however I cannot access the web UI, all devices report there is no internet and cannot access via SSH either.

All devices connected to thee switch are still accessible. I can ping my NAS from any device connected to wifi etc. A reboot fixes the issue.
Its kind of like there issues,  but none of them apply to me.
https://forum.opnsense.org/index.php?topic=4533.0
https://forum.opnsense.org/index.php?topic=33469.0

I have tried doing a fresh install without saving configuration, but the issue still persists.

I need help is looking at the logs to determine where the issue is. Are the logs saved even though I do a hard reset?
Where can I find the logs? so that I can share it here? Any help is greatly appreciated.

Are you able to switch to Intel NICs?  I know some people have had trouble with Realteks.

Is the connectivity loss limited to the LAN side?  Is it just accessing the OPNSense UI or all traffic?  Does it come back on it's own or do you have to do something?

Have you logged into the console when this happens?  Can you ping anything, WAN or LAN?

Go to the Gateway page and uncheck the disable gateway monitoring checkbox.  The next time you have connectivity issues, check the Quality report.

I have ordered two intel based NIC's this week. They should be arriving sometime next week.

Connectivity loss is to opnsense and as a result to the internet. Everything between the switch and other connected devices in LAN can ping and see each other.

This opnsense machine is not in an easy spot where I can connect a monitor to it. I was hoping logs would show the issue.
It doesn't come back on its own. Sometimes the issue is overnight and I reset the machine in the morning.

I have enabled gateway monitoring. How/where can I check the quality report? And will this report be saved even though I power cycle the machine?


Quote from: CJRoss on June 24, 2023, 04:53:26 PM
Go to the Gateway page and uncheck the disable gateway monitoring checkbox.  The next time you have connectivity issues, check the Quality report.

Can you please let me know what logs I should be looking at. Where to find them and what I should be looking at?

Quote from: KirikParty on June 24, 2023, 10:57:25 PM
I have ordered two intel based NIC's this week. They should be arriving sometime next week.

Good.  Not sure if that's your issue but it should help.

Quote from: KirikParty on June 24, 2023, 10:57:25 PM
Connectivity loss is to opnsense and as a result to the internet. Everything between the switch and other connected devices in LAN can ping and see each other.

My point was to determine where in the chain the issue is.  If your ISP fails that's one thing.  If your WAN fails, that's another.  And lastly, your LAN could fail.  Being able to access the OPNSense UI, ping the box and/or the internet, etc helps determine where in the process it fails.

Quote from: KirikParty on June 24, 2023, 10:57:25 PM
This opnsense machine is not in an easy spot where I can connect a monitor to it. I was hoping logs would show the issue.

Having a monitor connected lets you do some investigation while things are happening instead of waiting for afterwards.  What logs have you need looking at?  Have you adjusted the log level when you check them?  Have you looked at the Reporting section?

Quote from: KirikParty on June 24, 2023, 10:57:25 PM
It doesn't come back on its own. Sometimes the issue is overnight and I reset the machine in the morning.

Interesting.  So it's probably something failing, not anything like line noise, etc.

Quote from: KirikParty on June 24, 2023, 10:57:25 PM
I have enabled gateway monitoring. How/where can I check the quality report? And will this report be saved even though I power cycle the machine?

Reporting->Health->Quality

And yes, everything under Health gets saved.