[FIPS 9.2] Bluetooth: L2CAP: Fix l2cap_global_chan_by_psm #95

gvrose8192 · 2025-01-29T18:28:37Z

jira VULN-211
cve CVE-2022-42896

Builds and Loads
build.log

`  CLEAN   include/config include/generated arch/x86/include/generated .config .config.old .version Module.symvers certs/signing_key.pem certs/signing_key.x509 certs/x509.genkey
[TIMER]{MRPROPER}: 19s
x86_64 architecture detected, copying config
'configs/kernel-5.14.0-x86_64.config' -> '.config'
Setting Local Version for build
CONFIG_LOCALVERSION="-gvrose_fips-9-compliant_5.14.0-284.30.1"
Making olddefconfig
  HOSTCC  scripts/basic/fixdep
  HOSTCC  scripts/kconfig/conf.o
  HOSTCC  scripts/kconfig/confdata.o
  HOSTCC  scripts/kconfig/expr.o
  LEX     scripts/kconfig/lexer.lex.c
  YACC    scripts/kconfig/parser.tab.[ch]
`
[SNIP]

`  INSTALL /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/sound/x86/snd-hdmi-lpe-audio.ko
  STRIP   /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/sound/x86/snd-hdmi-lpe-audio.ko
  SIGN    /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/sound/x86/snd-hdmi-lpe-audio.ko
  INSTALL /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/sound/xen/snd_xen_front.ko
  STRIP   /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/sound/xen/snd_xen_front.ko
  SIGN    /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/sound/xen/snd_xen_front.ko
  INSTALL /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/virt/lib/irqbypass.ko
  STRIP   /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/virt/lib/irqbypass.ko
  SIGN    /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+/kernel/virt/lib/irqbypass.ko
  DEPMOD  /lib/modules/5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+
[TIMER]{MODULES}: 85s
Making Install
sh ./arch/x86/boot/install.sh \
        5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+ arch/x86/boot/bzImage \
        System.map "/boot"
[TIMER]{INSTALL}: 56s
Checking kABI
Checking kABI
kABI check passed
Setting Default Kernel to /boot/vmlinuz-5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+ and Index to 0
Hopefully Grub2.0 took everything ... rebooting after time metrices
[TIMER]{MRPROPER}: 19s
[TIMER]{BUILD}: 3054s
[TIMER]{MODULES}: 85s
[TIMER]{INSTALL}: 56s
[TIMER]{TOTAL} 3240s
Rebooting in 10 seconds

[g.v.rose@rocky92-lts-base ~]$ uname -a
Linux rocky92-lts-base 5.14.0-gvrose_fips-9-compliant_5.14.0-284.30.1+ #1 SMP PREEMPT_DYNAMIC Wed Jan 29 08:19:21 PST 2025 x86_64 x86_64 x86_64 GNU/Linux

Kernel Selftests
No significant difference between the before and after kernel selftest log files - some flap in the raw_skew test but not related to our change so ignored.
kernel-selftests-before.log
kernel-selftests-after.log

Kernel Selftests with lockdep, kmemleak and stress
I ran the tests with additional kernel debugging and stressing the system - no panics or unexpected traces (the kernel selftests for locking always throw some traces).
kernel-selftests-ldp_stress_on.log

Similar to several other PRs in this line for CVE-2022-42896.

jira VULN-211 cve CVE-2022-42896 commit-author Luiz Augusto von Dentz <[email protected]> commit f937b75 l2cap_global_chan_by_psm shall not return fixed channels as they are not meant to be connected by (S)PSM. Signed-off-by: Luiz Augusto von Dentz <[email protected]> Reviewed-by: Tedd Ho-Jeong An <[email protected]> (cherry picked from commit f937b75) Signed-off-by: Greg Rose <[email protected]>

PlaidCat · 2025-01-29T21:33:50Z

Actually I take back the

Could you pull this change in but also change it from pull_request_target -> pull_request, which technically needs fixed in 9.2 as well it seems
92a2ad4

gvrose8192 · 2025-01-29T21:50:19Z

Actually I take back the

Could you pull this change in but also change it from pull_request_target -> pull_request, which technically needs fixed in 9.2 as well it seems 92a2ad4

I can do that!

gvrose8192 · 2025-01-29T22:02:15Z

Actually I take back the
Could you pull this change in but also change it from pull_request_target -> pull_request, which technically needs fixed in 9.2 as well it seems 92a2ad4

I can do that!

Done!

gvrose8192 · 2025-01-29T22:38:50Z

Actually I take back the
Could you pull this change in but also change it from pull_request_target -> pull_request, which technically needs fixed in 9.2 as well it seems 92a2ad4

I can do that!

Done!

Uh, wait... I did not do the pull_request_target -> pull_request change. Let me fix that up.

Since we need to make sure external contributors code actually compiles prior to merging. To get access to the forked repos merge request we need to switch over our push/pull_request to pull_request_target. In addition we're fixing up some Naming Conventions, adding aarch64 to this branch and fixing the naming so that we can quickly identify if the CI is for x86_64 or aarch64. Removes pull request checker it is being rewritten and doesn't work as intented for fork merges.

gvrose8192 · 2025-01-29T22:43:38Z

Actually I take back the
Could you pull this change in but also change it from pull_request_target -> pull_request, which technically needs fixed in 9.2 as well it seems 92a2ad4

I can do that!

Done!

Uh, wait... I did not do the pull_request_target -> pull_request change. Let me fix that up.

OK, now.

PlaidCat

…address into one operation jira LE-3460 cve CVE-2024-56559 Rebuild_History Non-Buildable kernel-6.12.0-55.18.1.el10_0 commit-author Adrian Huang <[email protected]> commit 9e9e085 When compiling kernel source 'make -j $(nproc)' with the up-and-running KASAN-enabled kernel on a 256-core machine, the following soft lockup is shown: watchdog: BUG: soft lockup - CPU#28 stuck for 22s! [kworker/28:1:1760] CPU: 28 PID: 1760 Comm: kworker/28:1 Kdump: loaded Not tainted 6.10.0-rc5 #95 Workqueue: events drain_vmap_area_work RIP: 0010:smp_call_function_many_cond+0x1d8/0xbb0 Code: 38 c8 7c 08 84 c9 0f 85 49 08 00 00 8b 45 08 a8 01 74 2e 48 89 f1 49 89 f7 48 c1 e9 03 41 83 e7 07 4c 01 e9 41 83 c7 03 f3 90 <0f> b6 01 41 38 c7 7c 08 84 c0 0f 85 d4 06 00 00 8b 45 08 a8 01 75 RSP: 0018:ffffc9000cb3fb60 EFLAGS: 00000202 RAX: 0000000000000011 RBX: ffff8883bc4469c0 RCX: ffffed10776e9949 RDX: 0000000000000002 RSI: ffff8883bb74ca48 RDI: ffffffff8434dc50 RBP: ffff8883bb74ca40 R08: ffff888103585dc0 R09: ffff8884533a1800 R10: 0000000000000004 R11: ffffffffffffffff R12: ffffed1077888d39 R13: dffffc0000000000 R14: ffffed1077888d38 R15: 0000000000000003 FS: 0000000000000000(0000) GS:ffff8883bc400000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 00005577b5c8d158 CR3: 0000000004850000 CR4: 0000000000350ef0 Call Trace: <IRQ> ? watchdog_timer_fn+0x2cd/0x390 ? __pfx_watchdog_timer_fn+0x10/0x10 ? __hrtimer_run_queues+0x300/0x6d0 ? sched_clock_cpu+0x69/0x4e0 ? __pfx___hrtimer_run_queues+0x10/0x10 ? srso_return_thunk+0x5/0x5f ? ktime_get_update_offsets_now+0x7f/0x2a0 ? srso_return_thunk+0x5/0x5f ? srso_return_thunk+0x5/0x5f ? hrtimer_interrupt+0x2ca/0x760 ? __sysvec_apic_timer_interrupt+0x8c/0x2b0 ? sysvec_apic_timer_interrupt+0x6a/0x90 </IRQ> <TASK> ? asm_sysvec_apic_timer_interrupt+0x16/0x20 ? smp_call_function_many_cond+0x1d8/0xbb0 ? __pfx_do_kernel_range_flush+0x10/0x10 on_each_cpu_cond_mask+0x20/0x40 flush_tlb_kernel_range+0x19b/0x250 ? srso_return_thunk+0x5/0x5f ? kasan_release_vmalloc+0xa7/0xc0 purge_vmap_node+0x357/0x820 ? __pfx_purge_vmap_node+0x10/0x10 __purge_vmap_area_lazy+0x5b8/0xa10 drain_vmap_area_work+0x21/0x30 process_one_work+0x661/0x10b0 worker_thread+0x844/0x10e0 ? srso_return_thunk+0x5/0x5f ? __kthread_parkme+0x82/0x140 ? __pfx_worker_thread+0x10/0x10 kthread+0x2a5/0x370 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x30/0x70 ? __pfx_kthread+0x10/0x10 ret_from_fork_asm+0x1a/0x30 </TASK> Debugging Analysis: 1. The following ftrace log shows that the lockup CPU spends too much time iterating vmap_nodes and flushing TLB when purging vm_area structures. (Some info is trimmed). kworker: funcgraph_entry: | drain_vmap_area_work() { kworker: funcgraph_entry: | mutex_lock() { kworker: funcgraph_entry: 1.092 us | __cond_resched(); kworker: funcgraph_exit: 3.306 us | } ... ... kworker: funcgraph_entry: | flush_tlb_kernel_range() { ... ... kworker: funcgraph_exit: # 7533.649 us | } ... ... kworker: funcgraph_entry: 2.344 us | mutex_unlock(); kworker: funcgraph_exit: $ 23871554 us | } The drain_vmap_area_work() spends over 23 seconds. There are 2805 flush_tlb_kernel_range() calls in the ftrace log. * One is called in __purge_vmap_area_lazy(). * Others are called by purge_vmap_node->kasan_release_vmalloc. purge_vmap_node() iteratively releases kasan vmalloc allocations and flushes TLB for each vmap_area. - [Rough calculation] Each flush_tlb_kernel_range() runs about 7.5ms. -- 2804 * 7.5ms = 21.03 seconds. -- That's why a soft lock is triggered. 2. Extending the soft lockup time can work around the issue (For example, # echo 60 > /proc/sys/kernel/watchdog_thresh). This confirms the above-mentioned speculation: drain_vmap_area_work() spends too much time. If we combine all TLB flush operations of the KASAN shadow virtual address into one operation in the call path 'purge_vmap_node()->kasan_release_vmalloc()', the running time of drain_vmap_area_work() can be saved greatly. The idea is from the flush_tlb_kernel_range() call in __purge_vmap_area_lazy(). And, the soft lockup won't be triggered. Here is the test result based on 6.10: [6.10 wo/ the patch] 1. ftrace latency profiling (record a trace if the latency > 20s). echo 20000000 > /sys/kernel/debug/tracing/tracing_thresh echo drain_vmap_area_work > /sys/kernel/debug/tracing/set_graph_function echo function_graph > /sys/kernel/debug/tracing/current_tracer echo 1 > /sys/kernel/debug/tracing/tracing_on 2. Run `make -j $(nproc)` to compile the kernel source 3. Once the soft lockup is reproduced, check the ftrace log: cat /sys/kernel/debug/tracing/trace # tracer: function_graph # # CPU DURATION FUNCTION CALLS # | | | | | | | 76) $ 50412985 us | } /* __purge_vmap_area_lazy */ 76) $ 50412997 us | } /* drain_vmap_area_work */ 76) $ 29165911 us | } /* __purge_vmap_area_lazy */ 76) $ 29165926 us | } /* drain_vmap_area_work */ 91) $ 53629423 us | } /* __purge_vmap_area_lazy */ 91) $ 53629434 us | } /* drain_vmap_area_work */ 91) $ 28121014 us | } /* __purge_vmap_area_lazy */ 91) $ 28121026 us | } /* drain_vmap_area_work */ [6.10 w/ the patch] 1. Repeat step 1-2 in "[6.10 wo/ the patch]" 2. The soft lockup is not triggered and ftrace log is empty. cat /sys/kernel/debug/tracing/trace # tracer: function_graph # # CPU DURATION FUNCTION CALLS # | | | | | | | 3. Setting 'tracing_thresh' to 10/5 seconds does not get any ftrace log. 4. Setting 'tracing_thresh' to 1 second gets ftrace log. cat /sys/kernel/debug/tracing/trace # tracer: function_graph # # CPU DURATION FUNCTION CALLS # | | | | | | | 23) $ 1074942 us | } /* __purge_vmap_area_lazy */ 23) $ 1074950 us | } /* drain_vmap_area_work */ The worst execution time of drain_vmap_area_work() is about 1 second. Link: https://lore.kernel.org/lkml/[email protected]/ Link: https://lkml.kernel.org/r/[email protected] Fixes: 282631c ("mm: vmalloc: remove global purge_vmap_area_root rb-tree") Signed-off-by: Adrian Huang <[email protected]> Co-developed-by: Uladzislau Rezki (Sony) <[email protected]> Signed-off-by: Uladzislau Rezki (Sony) <[email protected]> Tested-by: Jiwei Sun <[email protected]> Reviewed-by: Baoquan He <[email protected]> Cc: Alexander Potapenko <[email protected]> Cc: Andrey Konovalov <[email protected]> Cc: Andrey Ryabinin <[email protected]> Cc: Christoph Hellwig <[email protected]> Cc: Dmitry Vyukov <[email protected]> Cc: Vincenzo Frascino <[email protected]> Cc: <[email protected]> Signed-off-by: Andrew Morton <[email protected]> (cherry picked from commit 9e9e085) Signed-off-by: Jonathan Maple <[email protected]>

gvrose8192 requested review from PlaidCat and bmastbergen January 29, 2025 18:29

bmastbergen approved these changes Jan 29, 2025

View reviewed changes

PlaidCat approved these changes Jan 29, 2025 •

edited

Loading

View reviewed changes

PlaidCat self-requested a review January 29, 2025 21:31

PlaidCat changed the title ~~[LTS 9.2] Bluetooth: L2CAP: Fix l2cap_global_chan_by_psm~~ [FIPS 9.2] Bluetooth: L2CAP: Fix l2cap_global_chan_by_psm Jan 29, 2025

gvrose8192 force-pushed the gvrose_fips-9-compliant_5.14.0-284.30.1 branch from a8f1a4d to 61eb857 Compare January 29, 2025 22:40

PlaidCat approved these changes Jan 30, 2025

View reviewed changes

gvrose8192 merged commit 8791d08 into fips-9-compliant/5.14.0-284.30.1 Jan 30, 2025
3 checks passed

gvrose8192 deleted the gvrose_fips-9-compliant_5.14.0-284.30.1 branch January 30, 2025 17:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FIPS 9.2] Bluetooth: L2CAP: Fix l2cap_global_chan_by_psm #95

[FIPS 9.2] Bluetooth: L2CAP: Fix l2cap_global_chan_by_psm #95

Uh oh!

gvrose8192 commented Jan 29, 2025 •

edited by PlaidCat

Loading

Uh oh!

PlaidCat commented Jan 29, 2025 •

edited

Loading

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

PlaidCat left a comment

Uh oh!

Uh oh!

Uh oh!

[FIPS 9.2] Bluetooth: L2CAP: Fix l2cap_global_chan_by_psm #95

[FIPS 9.2] Bluetooth: L2CAP: Fix l2cap_global_chan_by_psm #95

Uh oh!

Conversation

gvrose8192 commented Jan 29, 2025 • edited by PlaidCat Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

PlaidCat commented Jan 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

gvrose8192 commented Jan 29, 2025

Uh oh!

PlaidCat left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

gvrose8192 commented Jan 29, 2025 •

edited by PlaidCat

Loading

PlaidCat commented Jan 29, 2025 •

edited

Loading