Messages - opnsense-noc

#1
We operate two identical OPNsense servers in an HA setup. The slave (backup) node intermittently hits a kernel panic during an HA switchover, while it is taking over as master.

This is an excerpt from the trace dump taken when "Enter Persistent maintenance mode" was activated on the master (for the upgrade to version 25.1.1):
<6>carp: 3@igb4: BACKUP -> MASTER (preempting a slower master)
<6>carp: 6@vlan0.102: BACKUP -> MASTER (preempting a slower master)
<6>carp: 2@vlan0.73: BACKUP -> MASTER (preempting a slower master)
<6>carp: 8@igb5: BACKUP -> MASTER (preempting a slower master)
<6>carp: 9@vlan0.777: BACKUP -> MASTER (preempting a slower master)
<6>carp: 1@igb3: BACKUP -> MASTER (preempting a slower master)
<6>carp: 4@vlan0.101: BACKUP -> MASTER (preempting a slower master)
<6>igb1: link state changed to DOWN
<6>igb1: link state changed to UP
<6>igb1: link state changed to DOWN
<6>igb1: link state changed to UP
<6>igb1: link state changed to DOWN
<6>igb1: link state changed to UP
<6>igb1: link state changed to DOWN
<6>igb1: link state changed to UP
<6>carp: 4@vlan0.101: MASTER -> BACKUP (more frequent advertisement received)
<6>carp: 3@igb4: MASTER -> BACKUP (more frequent advertisement received)
<6>carp: 6@vlan0.102: MASTER -> BACKUP (more frequent advertisement received)
<6>carp: 9@vlan0.777: MASTER -> BACKUP (more frequent advertisement received)
<6>carp: 1@igb3: MASTER -> BACKUP (more frequent advertisement received)
<6>carp: 8@igb5: MASTER -> BACKUP (more frequent advertisement received)
<6>in_scrubprefix: err=65, prefix delete failed
<6>in_scrubprefix: err=65, prefix delete failed
<6>arp: 145.253.103.181 moved from 04:42:1a:ca:b0:4d to 00:00:5e:00:01:08 on igb5
<6>in_scrubprefix: err=65, prefix delete failed
<6>in_scrubprefix: err=65, prefix delete failed
<6>carp: 2@vlan0.73: MASTER -> BACKUP (more frequent advertisement received)
<6>in_scrubprefix: err=65, prefix delete failed
kernel trap 12 with interrupts disabled


Fatal trap 12: page fault while in kernel mode
cpuid = 0; apic id = 00
fault virtual address    = 0x0
fault code        = supervisor write data, page not present
instruction pointer    = 0x20:0xffffffff80bdc73d
stack pointer            = 0x28:0xfffffe00101acba0
frame pointer            = 0x28:0xfffffe00101acc00
code segment        = base 0x0, limit 0xfffff, type 0x1b
            = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags    = resume, IOPL = 0
current process        = 11 (idle: cpu0)
rdi: fffff800074279d0 rsi: 0000000000000007 rdx: 0000000000000000
rcx: 0000000000000000  r8: 0000000000000000  r9: 00000000000007d0
rax: 0000000000000000 rbx: ffffffff824e4300 rbp: fffffe00101acc00
r10: 0000000089705ba0 r11: 0000000000002710 r12: 00026cef827cf1c4
r13: 00000000026cef86 r14: 00026cef6d21fe8a r15: 00026cef8d000000
trap number        = 12
panic: page fault
cpuid = 0
time = 1739545932
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame 0xfffffe00101ac890
vpanic() at vpanic+0x131/frame 0xfffffe00101ac9c0
panic() at panic+0x43/frame 0xfffffe00101aca20
trap_fatal() at trap_fatal+0x40b/frame 0xfffffe00101aca80
trap_pfault() at trap_pfault+0x46/frame 0xfffffe00101acad0
calltrap() at calltrap+0x8/frame 0xfffffe00101acad0
--- trap 0xc, rip = 0xffffffff80bdc73d, rsp = 0xfffffe00101acba0, rbp = 0xfffffe00101acc00 ---
callout_process() at callout_process+0x1ad/frame 0xfffffe00101acc00
handleevents() at handleevents+0x180/frame 0xfffffe00101acc40
timercb() at timercb+0x24c/frame 0xfffffe00101acc90
lapic_handle_timer() at lapic_handle_timer+0xab/frame 0xfffffe00101accb0
Xtimerint() at Xtimerint+0xb1/frame 0xfffffe00101accb0
--- interrupt, rip = 0xffffffff804c294a, rsp = 0xfffffe00101acd80, rbp = 0xfffffe00101acdb0 ---
acpi_cpu_idle() at acpi_cpu_idle+0x2da/frame 0xfffffe00101acdb0
cpu_idle_acpi() at cpu_idle_acpi+0x46/frame 0xfffffe00101acdd0
cpu_idle() at cpu_idle+0x9d/frame 0xfffffe00101acdf0
sched_idletd() at sched_idletd+0x576/frame 0xfffffe00101acef0
fork_exit() at fork_exit+0x7f/frame 0xfffffe00101acf30
fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe00101acf30
--- trap 0, rip = 0, rsp = 0, rbp = 0 ---
KDB: enter: panic
panic.txt: page fault
version.txt: FreeBSD 14.2-RELEASE-p1 stable/25.1-n269632-cc316253c68 SMP
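
In case it helps anyone compare several of these dumps, here is a minimal sketch of how the excerpt could be summarised: CARP state transitions per VHID and interface, link flap counts, and the first function listed after the `--- trap 0x...` line. The regular expressions are assumptions based only on the message formats visible in this excerpt, the function name `summarize` and the `SAMPLE` text are made up for illustration, and none of this is an OPNsense or FreeBSD tool.

```python
import re
from collections import Counter

# A few lines copied from the excerpt above; a real run would read the full dump.
SAMPLE = """\
<6>carp: 3@igb4: BACKUP -> MASTER (preempting a slower master)
<6>igb1: link state changed to DOWN
<6>igb1: link state changed to UP
<6>carp: 3@igb4: MASTER -> BACKUP (more frequent advertisement received)
--- trap 0xc, rip = 0xffffffff80bdc73d, rsp = 0xfffffe00101acba0, rbp = 0xfffffe00101acc00 ---
callout_process() at callout_process+0x1ad/frame 0xfffffe00101acc00
handleevents() at handleevents+0x180/frame 0xfffffe00101acc40
"""

# Patterns are assumptions derived from the lines shown in this post.
CARP_RE = re.compile(r"carp: (\d+)@(\S+): (\w+) -> (\w+)")
LINK_RE = re.compile(r"(\w+): link state changed to (UP|DOWN)")
TRAP_RE = re.compile(r"^--- trap 0x[0-9a-f]+,")
FRAME_RE = re.compile(r"^(\w+)\(\) at \w+\+0x[0-9a-f]+")


def summarize(text):
    """Return (carp_transitions, link_event_counts, first_frame_after_trap)."""
    carp = []            # (vhid, interface, old_state, new_state) in log order
    links = Counter()    # (interface, "UP" or "DOWN") -> number of events
    faulting = None      # first backtrace function after the trap marker
    after_trap = False
    for line in text.splitlines():
        m = CARP_RE.search(line)
        if m:
            carp.append((int(m.group(1)), m.group(2), m.group(3), m.group(4)))
            continue
        m = LINK_RE.search(line)
        if m:
            links[(m.group(1), m.group(2))] += 1
            continue
        if TRAP_RE.match(line):
            after_trap = True
            continue
        if after_trap and faulting is None:
            m = FRAME_RE.match(line)
            if m:
                faulting = m.group(1)
    return carp, links, faulting


if __name__ == "__main__":
    carp, links, faulting = summarize(SAMPLE)
    for vhid, ifname, old, new in carp:
        print(f"vhid {vhid} on {ifname}: {old} -> {new}")
    for (ifname, state), count in sorted(links.items()):
        print(f"{ifname} {state}: {count}")
    print("first frame after trap:", faulting)
```

Run against the full excerpt, this should make it easier to see which VHIDs flipped to MASTER and back again, and how often igb1 flapped, before the page fault in the callout code.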