OPNsense Forum

Archive => 15.1 Legacy Series => Topic started by: Yordan Yordanov on April 05, 2015, 04:44:34 pm

Title: [SOLVED] IPsec and TCP flows
Post by: Yordan Yordanov on April 05, 2015, 04:44:34 pm
The system is running version 15.1.8.3-c6240d38f (amd64). I have configured three interfaces - 1 LAN and two ISP lines. Currently a rule is sending all the traffic into the first line only which has a public static IPv4 address. Outbound NAT is set to automatic mode.

It seems to work okay until I tried to set up several IPsec tunnels. Most of them were connected although the interface shows that they are disconnected, but this is a known issue. (https://forum.opnsense.org/index.php?topic=82.msg303) The problem is that all the VPN connections are very unstable. When pinging remote hosts, there are no lost packets at all. However, when I log on using Remote Desktop the connection is lost every 30-35 seconds and it takes about 20 seconds to reconnect itself. The tunnel itself does not get disconnected - after my Remote Desktop session stops responding, I continue to receive ICMP echo replies. I have not tested with UDP traffic as I don't have an application that uses UDP. Additionally, RDP connections to the Internet directly work OK. This is what I have tested so far:

1. Changing IKE version - tunnels do not connect. Only one tunnel connects, but the other side is running pfSense which supports IKEv2. However the issue persists with IKEv2 too.
2. Disabling ISP balancing (I had previously configured ISP balancing but disabled it to troubleshoot the issue), enabling only ISP Failover to alternate line. The issue persists.
3. Setting Prefer older IPSec SAs. The issue persists.
4. Setting Do not install LAN SPD - unchecks itself automatically after Save and reloading the page. The issue persists.
5. Setting Enable TCP MSS clamping on VPN traffic - tried with 1200 and 1400 bytes, the issue still persists.

I also did a tcpdump for one of the tunnels during which I just typed some text in Notepad on the remote computer which looks like this:

Code: [Select]
16:49:37.542263 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa2d), length 84
16:49:37.554964 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa92), length 92
16:49:37.575476 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa2e), length 84
16:49:37.586123 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa93), length 92
16:49:37.607720 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa2f), length 84
16:49:37.617368 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa94), length 92
16:49:37.641175 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa30), length 84
16:49:37.648702 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa95), length 92
16:49:37.674312 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa31), length 84
16:49:37.680109 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa96), length 92
16:49:37.707601 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa32), length 84
16:49:37.711110 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa97), length 92
16:49:37.739768 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa33), length 84
16:49:37.742396 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa98), length 92
16:49:37.773296 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa34), length 84
16:49:37.789533 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa99), length 156
16:49:37.806428 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa35), length 84
16:49:37.820509 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa9a), length 132
16:49:37.839801 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa36), length 84
16:49:37.851763 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa9b), length 100
16:49:37.872443 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa37), length 84
16:49:37.883013 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa9c), length 92
16:49:37.905261 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa38), length 84
16:49:37.914280 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa9d), length 92
16:49:37.938347 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa39), length 84
16:49:37.945486 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa9e), length 92
16:49:37.971371 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa3a), length 84

When the RDP session stopped responding, this is what I captured:

Code: [Select]
16:49:37.976871 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xa9f), length 92
16:49:38.101861 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa0), length 92
16:49:38.195728 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa1), length 116
16:49:38.852116 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa2), length 116
16:49:39.133557 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa3), length 372
16:49:39.289521 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa4), length 84
16:49:40.055345 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa5), length 412
16:49:40.133263 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa6), length 100
16:49:41.133313 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa7), length 92
16:49:41.398770 IP my.side > their.side: ESP(spi=0xc3028c6c,seq=0xa3b), length 84
16:49:41.399912 IP their.side > my.side: ESP(spi=0xcda44f21,seq=0xaa8), length 76

So my side stops responding for a period, but I don't know why. The line quality is excellent, when plugging it into another router, there are no issues, the VPN connections are established successfully and operate normally. However I have to replace the old router with OPNsense.

I am very frustrated by this issue as I have been trying to work it out for weeks, but no result. Could someone help me with this, maybe I am missing something?
Title: Re: IPsec and TCP flows
Post by: AdSchellevis on April 05, 2015, 06:41:55 pm
Is it already unstable with 2 active tunnels? I would like to reproduce your issue on a set of machines on my side, but I'm looking for a method to trigger it as fast as possible.
Can you deliver a stripped down version of a config.xml with the issue, but without your personal data? Then I will look into it next week to see what's going on by reproducing it on our side with some freshly installed machines.
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 05, 2015, 08:28:08 pm
Out of 7 configured tunnels, 5 were active at the time of testing and they all experience this issue. I think I haven't tested with one tunnel only, but I believe this shouldn't be a consideration. By config.xml, do you mean that I can extract the VPN connection profile somehow and send it to you? Or just to prepare a file with the VPN parameters so that you can test with the same Phase1/2 parameters? Or maybe the whole configuration of the device? Thanks for engaging into this problem!

By the way, this is the device (https://www.applianceshop.eu/opnsense-a10-quad-core-ssd-rack.html) if it matters.
Title: Re: IPsec and TCP flows
Post by: AdSchellevis on April 06, 2015, 12:29:45 pm
Can you test with one tunnel also? If it also fails as well then it's a little bit easier to reproduce on my side.

If possible I would really like a config with the issue in it with the as less as options enabled but with the issue, so we can concentrate on this one by installing it on some fresh machines (the same kind actually). 
You can backup your full configuration using the backup feature found in /diag_backup.php.

Because it's a regular text file, you can strip your personal information from the file before sending it over.

Maybe it's best if you email me the configuration directly ( ad at the project domain), just to be sure we're not posting any harmful data on the forum. ( posting a (part of) the config.xml on the forum is also fine by me, but then be sure to replace all the external ip's and passwords in it).
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 06, 2015, 03:42:08 pm
All right, I think I'll be able to test that in the next 2 days as I need to do that outside of business hours. I'll report back when I'm ready.
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 10, 2015, 05:50:47 pm
I tested with only one IPsec tunnel (the other 6 configured but disabled) and the issue still persists. I have sent the configuration to Ad per e-mail as requested. If anybody else wishes to test, I can provide it, just send me a PM.
Title: Re: IPsec and TCP flows
Post by: ristridin on April 12, 2015, 12:19:18 pm
I'm using an ipsec connection between homeoffice - office since 15.1.6 without any issue. What's your endpoint? Maybe this issue isn't really an ipsec case but loadbalances / nat issue..?
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 12, 2015, 01:44:33 pm
The problem is at my side for sure as this happens with each of the 5 tunnels I tested. The endpoints are different devices and the connection is OK when I switch the OPNsense router with another one. So it may or may not be the IPsec component that is at fault but the whole configuration as a whole.
Title: Re: IPsec and TCP flows
Post by: AdSchellevis on April 13, 2015, 07:59:50 pm
Unfortunately I was not able to reproduce the issue with your config file. Although we have fixed the status page and the non functioning option "Do not install LAN SPD".

I tested with 2 OPNsense firewalls connected with each other using a direct cable connection and on one device your config filled with my own ip addresses and secrets. I was not able to test with multiple gateways.
To send traffic I used a ssh session to copy some files and connect to a machine behind the other machine.

Maybe your issue has something to do with the 2 WAN connections and routing of packets, if you have the time you might test again with only one WAN connection enabled.
ipsec itself doesn't really seem to be the issue,  your firewall rules don't explain such strange behaviour either for as far as I could see on my box.


Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 13, 2015, 08:11:23 pm
Thanks for testing it, I'll remove one of the WAN interfaces and the associated firewall rules and see if it helps. If it doesn't, I'm restoring factory defaults and starting from scratch with one WAN and one tunnel.
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 14, 2015, 07:50:36 pm
Removing one of the WANs didn't change anything. What is more, I restored to factory defaults and used the wizard to configure the LAN interface and one of the WANs. Then I created one IPsec tunnel and the issue is still there. :( So, Multi-WAN is not causing it. It's not only Remote Desktop, I tried to copy some files using SMB (Windows File Sharing), the transfer doesn't start at all - network error. I noticed that the issue is caused by packets not being sent TO the other endpoint - I observed a clock ticking in a RDP session and the second hand on the clock didn't stop moving while I was unable to do anything in the session after which it just reconnects and this repeats every 30 seconds.

Now I'm taking the device with me at home and will test a tunnel to the old router that OPNsense is supposed to replace. If someone wants to help me further troubleshoot this, I'm ready to record all the steps in a video to show what I am doing and what exactly happens.
Title: Re: IPsec and TCP flows
Post by: AdSchellevis on April 14, 2015, 08:00:42 pm
Strange, this setup really sounds quite straightforward. If you record your steps I will certainly take a look at it. 
What version of pfSense is used on the other box you connect to?
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 15, 2015, 04:16:03 pm
The other endpoint was running:

2.2.1-RELEASE (i386)
built on Fri Mar 13 08:16:53 CDT 2015
FreeBSD 10.1-RELEASE-p6

However it happens regardless of the other device - we have Cisco ASA, Lancom, Linksys and Cisco RVS 4000.
I tested yesterday on another site (using a completely different Internet connection) by building the configuration from scratch. I established the VPN from my home to the office (which runs Linksys RV082). The issue occurs exactly in the same manner. This time I also tried SSH and it's the same experience, the only difference being that SSH can't overcome the problem and doesn't reconnect, so I get "Software caused connection abort" after about 30-40 seconds. On Friday I'll record everything in a video and get back.
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 17, 2015, 01:17:59 am
Today I installed OPNsense on a desktop PC with 2 Ethernet cards and tested my VPN using the exact same parameters (and ISP lines). The issue DOES NOT occur. I used the x86 image however and I see that the device is running x64. I'll check whether the Pentium 4 CPU I used supports 64-bit to test with it. I'd like to try this before recording the video.
Title: Re: IPsec and TCP flows
Post by: AdSchellevis on April 17, 2015, 09:20:33 am
Ok, just let me know if I can do anything.
If you have the opportunity to let me test from my side to your office using the same type of machine that might also be an option (just send me an email).
I tested with the 64bit version using 2 OPNsense installs and the same hardware. But if there's any weirdness going on with your specific machine we should be able to find it.

One last question, have you tried using one of the other network ports for your connection? It's probably not the issue, but you never know.
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 19, 2015, 01:42:25 am
So, finally, I was able to produce the video. Now you can watch the entire process - initial configuration of OPNsense, then IPsec and two tests - with Remote Desktop connection and SSH (10 minutes in total):

https://www.youtube.com/watch?v=1l4IJ60CTpw (Switch to HD for a clearer video. Part of the GUI is not in English but that should not be a problem).

Before I did that I changed the network port with one of the unused as you suggested and there was no change, the issue persists. However I was not able to install the x64 edition on my test computer as the CPU is not 64-bit capable.

Have a look at it when you have time and there's no problem to schedule a tunnel test with my side when you're available. Thanks again.
Title: Re: IPsec and TCP flows
Post by: AdSchellevis on April 19, 2015, 12:13:55 pm
I've looked at the video, but I don't see the issue either. The only thing I noticed was the difference in ping times, when the connection was just enabled you had approx. 25 msec when you had the rdp session running they seemed to have dropped to 3 msec....

I would really like to setup a similar box from my end to test, If you like we could do that on Monday or Tuesday just send me an email so we can arrange that.
We must be missing something, but without the issue at hand it's very difficult to solve.

 
Title: Re: IPsec and TCP flows
Post by: Yordan Yordanov on April 19, 2015, 12:48:27 pm
I think there is some convergence time after the tunnel is established until the times reach the normal values of 3-4ms. This was the case with both machines I tested (the hardware appliance and the box I installed OPNsense on).

I'll send you an e-mail today to schedule a test with you tomorrow.
Title: Re: IPsec and TCP flows
Post by: AdSchellevis on April 21, 2015, 09:18:40 am
Yesterday we we're able to reproduce the issue, and it seems to be related to the non standard kernel patches we've inherited.
At the moment we're re-evaluating the need for those patches and perform cleanups, as soon as we're done it shouldn't happen again  :) .... for now there's a temporary fix by installing a custom kernel and base available at our server.

Title: Re: [SOLVED] IPsec and TCP flows
Post by: Yordan Yordanov on April 21, 2015, 09:14:49 pm
Yes, thank you very much for the time you spent to troubleshoot this.

I'd just like to add that I installed today OPNsense x64 as a Hyper-V virtual machine and set up some VPN tunnels. The issue did not occur and everything works as expected. So it remains unknown why it is only present on the hardware appliance.
Title: Re: [SOLVED] IPsec and TCP flows
Post by: AdSchellevis on May 01, 2015, 05:07:07 pm
Hi Yordan,

If you have the time, could you please test again with our new kernel? It will be released on Monday, but if you want to try it before Monday you could upgrade using:

opnsense-update -r 15.1.10 && reboot


Thanks,
Title: Re: [SOLVED] IPsec and TCP flows
Post by: Yordan Yordanov on May 02, 2015, 03:30:30 pm
Hi Ad,

I'll try it out today. Since the device is already in production, how do I revert to the previous kernel if it doesn't work? I think it is 15.1.9.1 with NoPatches.
Title: Re: [SOLVED] IPsec and TCP flows
Post by: AdSchellevis on May 03, 2015, 11:53:23 am
Hi Yordan,

I think the base/kernel image are already removed from our server, but I do have the same build configuration in GitHub (clean_kernel branch) so I can build it again for you.
As soon as the old test base/kernel are back on-line I will drop you a note, so you have a way to revert to the current config. 

Cheers,

Ad
Title: Re: [SOLVED] IPsec and TCP flows
Post by: Yordan Yordanov on May 03, 2015, 02:26:47 pm
Hi Ad,

I upgraded to build 15.1.10 and the issue is not present which is great news! My dashboard still shows 15.1.9, maybe because I upgraded manually. I checked for updates after it and it showed me that the package manager pkg requires updating. I updated and then it showed that 29 updates are available.

(http://imageshack.us/download/901/G7fN1G.png)
I'm not clicking Upgrade here to avoid reverting to the previous version which will probably have the issue.

This means I won't be needing the custom build for now. Thanks again for everything you did about this. By the way is it possible that the upgrade was not done? The console shows:

Enter an option:


*** Welcome to OPNsense 15.1.9.1-44b610abb (amd64) on gateway ***


However the OS version is from 30.04 which looks quite new:

FreeBSD 10.1-RELEASE-p9 (SMP) #0 b717d68(master): Thu Apr 30 08:38:44 CEST 2015

No error was shown during the manual upgrade, it ended with "Please reboot".
Title: Re: [SOLVED] IPsec and TCP flows
Post by: AdSchellevis on May 03, 2015, 04:03:49 pm
Hi Yordan,

Great to hear your issue is fixed in our latests version, about the updates it shows,  these are the none base/kernel updates provided by pkg (like core and all software packages). You could update now or wait a day, tomorrow there will be a new version containing the latests versions.

Looking at the kernel build date and branch, your using the correct version and the update was performed nicely.
(....(master): Thu Apr 30 08:38:44 CEST 2015)

Thanks for testing!

Cheers,

Ad
Title: Re: [SOLVED] IPsec and TCP flows
Post by: franco on May 03, 2015, 09:33:58 pm
Yordan, you should not see any real update beyond 15.1.9.2. It's safe to upgrade from the GUI. The console upgrade will reinstall an older kernel.

And don't confuse the kernel+system update with the GUI+packages update. Tomorrow you'll be able to bring the GUI up to date with 15.1.10. ;)
Title: Re: [SOLVED] IPsec and TCP flows
Post by: Yordan Yordanov on May 03, 2015, 09:59:21 pm
Great, thanks for the info and the cleaner kernel. :)