Flowd_aggregate - Dies all of the time

Started by sirebral, September 15, 2023, 01:30:47 AM

Previous topic - Next topic
Hey all,

My flowd collector is dying about one time per hour.   I searched for the error, yet couldn't find any references to it or a fix.  I am running bare-metal on ZFS, latest release version of OPNSense.

flowd_aggregate.py   flowd aggregate died with message Traceback (most recent call last): File "/usr/local/opnsense/scripts/netflow/flowd_aggregate.py", line 160, in run aggregate_flowd(self.config, do_vacuum) File "/usr/local/opnsense/scripts/netflow/flowd_aggregate.py", line 86, in aggregate_flowd stream_agg_object.cleanup(do_vacuum) File "/usr/local/opnsense/scripts/netflow/lib/aggregates/__init__.py", line 209, in cleanup self.commit() File "/usr/local/opnsense/scripts/netflow/lib/aggregates/__init__.py", line 160, in commit self._db_connection.commit() sqlite3.OperationalError: database is locked

This kills the process, I'm not sure why sqllite would be locked.  I have no other issues that are blatent.  I do have a few of these as well.

[2a9c303b-b7ff-420b-af01-a1374bce2b09] Script action failed with Command '/usr/local/opnsense/scripts/netflow/get_top_usage.py --provider 'FlowDstPortTotals' --start_time '1694703600' --end_time '1694711555' --key_fields 'dst_port,protocol' --value_field 'octets' --filter 'if=mlxen0' --max_hits '25'' returned non-zero exit status 1. at Traceback (most recent call last): File "/usr/local/opnsense/service/modules/actions/script_output.py", line 44, in execute subprocess.check_call(script_command, env=self.config_environment, shell=True, File "/usr/local/lib/python3.9/subprocess.py", line 373, in check_call raise CalledProcessError(retcode, cmd) subprocess.CalledProcessError: Command '/usr/local/opnsense/scripts/netflow/get_top_usage.py --provider 'FlowDstPortTotals' --start_time '1694703600' --end_time '1694711555' --key_fields 'dst_port,protocol' --value_field 'octets' --filter 'if=mlxen0' --max_hits '25'' returned non-zero exit status 1.

Currently I have monit watching the pid and restarting when it's not running, yet it's dying off frequently.  Anyone happen to know what may be causing this?  Appreciate the assistance!


i don't have no monitoring tools, everything dies for me

netflow Insight
unable to open database file

ntopng
exited on signal 11

i can't help but looking for another tool to simply see a bandwith chart and have a look at the actors.