Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.

Messages - mfedv

Pages: 1 [2] 3

20.1 Legacy Series / [BUG] inc/filter.inc: check for 'kill_states' / state flush on ruleset update

« on: July 14, 2020, 03:36:54 pm »

With no changes yet being made to "Firewall / Settings / Advanced":
if gateway monitoring is active, and any monitored gateway is down,
opnsense will flush all pfctl states ("/sbin/pfctl -Fs") whenever
firewall rulesets are updated. Such updates happen e.g. when a gateway
goes down and /usr/local/etc/rc.syshook.d/monitor/10-dpinger invokes
"configctl filter reload". But this also happens when firewall rules are
changed using the admin GUI.

Killed sessions include the current admin sessions to the firewall via
web GUI and ssh.

After the flush, TCP ACK packets from the browser (in reply to GUI
output) are no longer part of an established session and get dropped by
"Default deny rule".

The admin GUI web request actually succeeds in installing the new
firewall rules, but the GUI HTTP response will time out and the browser
will display a corresponding error message.

Perhaps this is even related to some instances of the famous "slow GUI"
problem seen with opnsense clusters?

Firewall / Settings / Advanced:
Gateway Monitoring
Kill states (default: checked)
Disable State Killing on Gateway Failure

Help text: The monitoring process will flush states
for a gateway that goes down if this box is not
checked. Check this box to disable this behavior.

The corresponding config value "kill_states" has a slightly confusing
name: kill_states=1 means states should _not_ be killed on gateway
failures.

The help text also is misleading: "flush states for a gateway" sounds
as if only states using the failed gateway were involved, while
actually each and every pfctl connection state will get flushed.

So far I have seen three possible values for "kill_states" in
/conf/config.xml:

a) default after install (20.1 ISO)
+ upgrades to 20.1.8_1: <kill_states/>

b) after unchecking: ... no kill_states entry at all ...

c) after checking again: <kill_states>1</kill_states>

Bug 1) misinterpreting "kill_states" default value

/usr/local/etc/inc/filter.inc:

128 function filter_delete_states_for_down_gateways()
129 {
...
145 if ($any_gateway_down == true) {
146 mwexec("/sbin/pfctl -Fs");
147 }
148 }

...

211 function filter_configure_sync($verbose = false, $flush_states = false, $load_aliases = true)
212 {
...
561 if (empty($config['system']['kill_states'])) {
562 filter_delete_states_for_down_gateways();
563 }
...
570 if ($flush_states) {
571 mwexec('/sbin/pfctl -Fs');
572 }
...
590 unlock($filterlck);
591 }

line 561 works for case c) ("<kill_states>1</kill_states>"), but not for
case a) ("<kill_states/>"). Perhaps array_key_exists() would be better?

Bug 2) flushing state on ruleset update

lines 561-563 happen to be in the same codepath for "gateway down" and
for "admin ruleset update"; they ought to be in the "gateway down"
codepath only.

Function "filter_configure_sync()" already has an explicit parameter
"flush_states", which is used e.g. from /usr/local/etc/rc.newwanip
(reacting to an updated WAN ip address).

I think lines 561-563 should be removed from
/usr/local/etc/inc/filter.inc, and be moved to
/usr/local/etc/rc.syshook.d/monitor/10-dpinger instead.

Regards
Matthias Ferdinand

20.1 Legacy Series / Re: IPsec VPN Problem 20.1.4

« on: April 16, 2020, 11:29:37 pm »

with log lines un-reversed and some entries left out:

Quote from: HerrPenaten on April 16, 2020, 09:57:42 am

2020-04-16T08:39:12 kernel: [HBSD SEGVGUARD] [charon (24103)] Suspension expired.
2020-04-16T08:39:12 kernel: -> pid: 24103 ppid: 6338 p_pax: 0xa50<SEGVGUARD,ASLR,NOSHLIBRANDOM,NODISALLOWMAP32BIT>
2020-04-16T08:39:34 ipsec_starter[6338]: charon has died -- restart scheduled (5sec)
2020-04-16T08:39:34 kernel: pid 24103 (charon), uid 0: exited on signal 6 (core dumped)
2020-04-16T08:39:39 ipsec_starter[6338]: charon (73791) started after 20 ms
2020-04-16T08:40:45 ipsec_starter[6338]: charon has died -- restart scheduled (5sec)
2020-04-16T08:40:45 kernel: pid 73791 (charon), uid 0: exited on signal 6 (core dumped)
2020-04-16T08:40:50 ipsec_starter[6338]: charon (39754) started after 20 ms

charon (IKE daemon) keeps crashing (signal 6 = ABRT), usually an indication of memory problems, and SIGVGUARD feature of Hardened BSD has kicked in (s. first line) and has suspended charon execution for some time because of repeated crashes.

This looks like some serious problem. Not sure if memory pressure alone can cause this. Can you check memory usage (dashboard), log entries under System/Log Files/General or better yet "dmesg" output from a root command line (console or ssh login)?

20.1 Legacy Series / Re: Update 20.1.3 to 20.1.4: NAT problems

« on: April 16, 2020, 10:28:30 pm »

Quote from: StP on April 16, 2020, 04:18:22 pm

I have not done anything special regarding gateway configuration.
IPV4 Upstream Gateway is set to Auto-Detect.

not sure what auto-detect does, but can you try setting the gateway address instead?

in https://forum.opnsense.org/index.php?topic=13456.0 there was a similar problem, and setting the gateway address seems to have solved it.

20.1 Legacy Series / Re: Reflection Shows Router IP

« on: April 09, 2020, 03:57:32 pm »

sorry for late answer.

I told you to check the packet counters, but alas the GUI does not even provide that feature for NAT rules :-/ You could check them in a ssh session (pfctl -s nat -v).

I think it is worth checking if the NAT rule really applies. The rule itself looks good to me, but maybe some other rule hits first so it does not apply.

The additional fw rule is just to make sure the re-written packet will be allowed. Also it gives you opportunity to activate logging for these packets (and not for others, which might be too much).

So activate logging for the outbound NAT rule and the additional fw rule, and try find these in the Log Files.

20.1 Legacy Series / Re: Reflection Shows Router IP

« on: March 31, 2020, 05:09:22 pm »

strange. please check if that traffic actually hits the fw. Or if other rules keep the packets from hitting the new rules (packet counters via "Inspect")

Hardware and Performance / Re: OpenVPN slow throughput

« on: March 31, 2020, 05:05:17 pm »

just to check if encryption is the limiting factor: can you try with no encryption / no authentication ?
(what is your current Auth Digest Algorithm?)

20.1 Legacy Series / Re: Internet goes down.

« on: March 28, 2020, 10:21:37 pm »

Connect keyboard and monitor and hit a key (e.g. space) to see if the system is still alive.

Other than a hardware fault, it might be some powersave setting; check BIOS for that.

20.1 Legacy Series / Re: Reflection Shows Router IP

« on: March 28, 2020, 09:39:58 pm »

Hi,

a little awkward, but you can achieve this using two additional rules:

- outbound NAT rule: on LAN interface, src LAN Net, dst <ip> port 25, NAT address <external service ip>
- fw rule on LAN: src address <external service ip>, dst <ip> port 25

the second rule is necessary to avoid the "force gw" part of the automatic "let out anything from firewall host itself (force gw)" floating rule.

Hardware and Performance / Re: OpenVPN slow throughput

« on: March 26, 2020, 09:26:22 pm »

If you use TCP as transport protocol, then please disregard; TCP will not have fragmentation

If you use UDP as transport protocol, then take a packet capture: Interfaces / Diagnostics / Packet Capture. Select UDP as protocol, select the peer ip address if you know it, but leave port number unset; follow-up fragments do not carry port numbers.

Start the capture and start a file transfer through OpenVPN.

If you see something like
20:58:29.321685 IP a.b.c.d > u.v.w.x: ip-proto-17

then you have fragmentation issues. 17 is the protocol number for UDP, but no port numbers are displayed because they are missing in any but the initial fragment.

Depending on how big the reassembled packet is, you may also see "bad length x > y" for the initial fragment (where port numbers are shown).

If that is the case, start with something like "mssfix 1300". This is low enough you should not have UDP fragmentation. You can experiment with higher values and find the optimum value that still works without fragmentation. The exact value will also depend on the client's internet connection.

This only helps for TCP connections inside the tunnel, large UDP packets will still be fragmented.

Hardware and Performance / Re: OpenVPN slow throughput

« on: March 26, 2020, 04:48:48 pm »

You have explicitly disabled mssfix. Can you check if openvpn encapsulated traffic gets fragmented? That should be avoided, as fragment reassembly is rather slow.

20.1 Legacy Series / Re: High Availability Setup with Single WAN IP

« on: March 26, 2020, 04:29:12 pm »

It is possible to configure a CARP address that does not fall in the network range(s) of the interfaces used, but it has downsides, especially on a WAN interface.

If your only usable public address is the CARP address, only the master fw will have outside connectivity out of the box.

While you could use some trickery, using a gateway monitoring with a (directly connected!) upstream WAN address and a LAN CARP address with lower priority, such that outbound traffic from the slave would use the LAN address of the master as upstream gateway, you simply should not. Such a setup is hardly maintainable. You will get way more admin-caused malfunctions than you could expect to have hardware failures, and debugging will become almost impossible. Just don't.

But you say you have a /29 from your ISP. Standard setup would be to assign a different address from that range to each WAN interface, and use a third address from that range as the CARP address. Is this not possible in your case?

If your FTP server IP is not from that /29, then do the standard setup from above and add the FTP server IP as an additional CARP address.

20.1 Legacy Series / Re: IPSEC Multiple SPIs State Installed?

« on: March 26, 2020, 03:13:18 pm »

Hi,

lots of possible reasons, probably ipsec logs and perhaps packet filter logs required for further analysis.

On the HA opnsense side, please check if your fw rules allow incoming IKE/ESP traffic from everywhere (or some subnets, if you know the dynamic ip address comes from a certain pool). Fw rules for IKE/ESP are not auto-generated if you use a CARP address. It might still work sometimes, if the HA opnsense initiates the connection and the "let out anything" rule kicks in.

DPD keeps up the IKE connection, and with non-UDP-encapsulated ESP you may need traffic inside the tunnel to keep up connection state in the packet filters. Auto-Ping is perhaps not enough, it sends 3 packets every 4 minutes. If no other traffic is sent, 4 minutes may be too long to keep connection state for ESP up. This should not be a problem if your fw rules allow ESP traffic even without existing connection state as explained above.

20.1 Legacy Series / IPsec sending all CA certs, even with PSK auth only

« on: March 22, 2020, 08:39:27 pm »

Note: this does not prevent IPsec connection setup, it just inflates IKE_AUTH packages more than strictly necessary.

On one installation, for HTTPS reverse proxying I use os-acme plugin, starting with the staging environment, later switching to production environment. Also for OpenVPN I setup a local CA on the firewall.

Now I added an IPsec connection with PSK authentication, and now all three CA certs above are being used in IKE_AUTH messages:

Mar 22 17:54:08 OPNsense1 charon: 10[IKE] <con4|7> sending cert request for "CN=Fake LE Intermediate X1"
Mar 22 17:54:08 OPNsense1 charon: 10[IKE] <con4|7> sending cert request for "C=US, O=Let's Encrypt, CN=Let's Encrypt Authority X3"
Mar 22 17:54:08 OPNsense1 charon: 10[IKE] <con4|7> sending cert request for "C=DE, ST=Hessen, L=Darmstadt, O=MyCorp, E=tech@mycorp.corp, CN=MyCorp-OVPN-RootCA"
Mar 22 17:54:08 OPNsense1 charon: 10[IKE] <con4|7> authentication of 'a.b.c.d' (myself) with pre-shared key
Mar 22 17:54:08 OPNsense1 charon: 10[IKE] <con4|7> establishing CHILD_SA con4{11}
Mar 22 17:54:08 OPNsense1 charon: 10[ENC] <con4|7> generating IKE_AUTH request 1 [ IDi N(INIT_CONTACT) CERTREQ IDr AUTH N(ESP_TFC_PAD_N) SA TSi TSr N(MULT_AUTH) N(EAP_ONLY) N(MSG_ID_SYN_SUP) ]

(it is con4 now not con1 because I have added some more PSK only connections to experiment with)

All three CA certs are in /usr/local/etc/ipsec.d/cacerts/. Strongswan adds them all to IKE_AUTH packages, although the config says leftauth=psk / rightauth=psk. Is this a bug in strongswan?

Still the GUI could perhaps be more selective in adding CA certs.
In /usr/local/etc/inc/plugins.inc.d/ipsec.inc at line 1093 it writes every CA cert from config.xml to /usr/local/etc/ipsec.d/cacerts/. At line 1117 it writes user/server certificates to /usr/local/etc/ipsec.d/certs, but only if they are referenced in any enabled phase1 definitions. Perhaps CA certs could be restricted the same way.

20.1 Legacy Series / Re: HA XMLRPC sync cleartext password: recommendations for username?

« on: March 18, 2020, 07:27:03 pm »

Tried signing up at Github in several variations (all involving Tor), but they all failed. Sorry, for the time being I can't bring this as a feature request to Github.

---------------
boring details: following their troubleshooting-the-captcha link, all external captchas report success. Entering account details, I see a green checkmark at "Verify your account". When clicking on "Next: Select a plan" it reports a failed captcha (but no captcha was ever displayed). It seems that even the pros at github have difficulties binding together their own services with external ones (captcha providers) into a working logical unit.

General Discussion / Re: IPsec+OVPN config failed

« on: March 18, 2020, 07:13:35 pm »

Not sure if "NAT address : 192.168.100.0/24" will work (haven't tried); for a start, choose a single ip address from that range that is otherwise unused. Other than that, the outbound NAT rule looks ok.

"Manual SPD" entry: in VPN / IPsec / Tunnel Settings, edit the phase 2 entry (contains the remote network). The last option under "Advanced Options" is "Manual SPD entries". Add your OpenVPN network there (also click on the "i" and read the help text).

For IPsec to act upon a packet, it must recognize it as a packet to be encrypted even before NAT kicks in. Thats what the "Manual SPD" entry provides. It is not used on the wire in IKE exchanges, it only tricks IPsec into (later) acting on the packet. Actual packets are then both recognized as to be handled by IPsec and as to be handled by NAT rules.

Pages: 1 [2] 3