Hello,
syzbot found the following issue on:
HEAD commit: 795c58e4c7fc Merge tag 'trace-v6.10-rc6' of git://git.kern..
git tree: upstream
console output: https://syzkaller.appspot.com/x/log.txt?x=11bf7976980000
kernel config: https://syzkaller.appspot.com/x/.config?x=c950e46ec3ea637a
dashboard link: https://syzkaller.appspot.com/bug?extid=572f6e36bc6ee6f16762
compiler: gcc (Debian 12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40
Unfortunately, I don't have any reproducer for this issue yet.
Downloadable assets:
disk image: https://storage.googleapis.com/syzbot-assets/393fdf1d08dd/disk-795c58e4.raw....
vmlinux: https://storage.googleapis.com/syzbot-assets/724001a1b4be/vmlinux-795c58e4.x...
kernel image: https://storage.googleapis.com/syzbot-assets/1ede352392b0/bzImage-795c58e4.x...

IMPORTANT: if you fix the issue, please add the following tag to the commit:
Reported-by: syzbot+572f6e36bc6ee6f16762@syzkaller.appspotmail.com
watchdog: BUG: soft lockup - CPU#0 stuck for 143s! [kworker/u8:8:2833]
Modules linked in:
irq event stamp: 1708674
hardirqs last enabled at (1708673): [<ffffffff81e178c0>] ___slab_alloc+0x870/0x1870 mm/slub.c:3577
hardirqs last disabled at (1708674): [<ffffffff8ae84a1e>] sysvec_apic_timer_interrupt+0xe/0xb0 arch/x86/kernel/apic/apic.c:1043
softirqs last enabled at (1708660): [<ffffffff8aa82f2d>] spin_unlock_bh include/linux/spinlock.h:396 [inline]
softirqs last enabled at (1708660): [<ffffffff8aa82f2d>] batadv_iv_ogm_queue_add net/batman-adv/bat_iv_ogm.c:661 [inline]
softirqs last enabled at (1708660): [<ffffffff8aa82f2d>] batadv_iv_ogm_schedule_buff+0x97d/0x1500 net/batman-adv/bat_iv_ogm.c:833
softirqs last disabled at (1708658): [<ffffffff8aa82ee3>] spin_lock_bh include/linux/spinlock.h:356 [inline]
softirqs last disabled at (1708658): [<ffffffff8aa82ee3>] batadv_iv_ogm_queue_add net/batman-adv/bat_iv_ogm.c:639 [inline]
softirqs last disabled at (1708658): [<ffffffff8aa82ee3>] batadv_iv_ogm_schedule_buff+0x933/0x1500 net/batman-adv/bat_iv_ogm.c:833
CPU: 0 PID: 2833 Comm: kworker/u8:8 Not tainted 6.10.0-rc6-syzkaller-00069-g795c58e4c7fc #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
Workqueue: bat_events batadv_iv_send_outstanding_bat_ogm_packet
RIP: 0010:arch_atomic_read arch/x86/include/asm/atomic.h:23 [inline]
RIP: 0010:raw_atomic_read include/linux/atomic/atomic-arch-fallback.h:457 [inline]
RIP: 0010:atomic_read include/linux/atomic/atomic-instrumented.h:33 [inline]
RIP: 0010:kfence_alloc include/linux/kfence.h:127 [inline]
RIP: 0010:slab_alloc_node mm/slub.c:3986 [inline]
RIP: 0010:kmem_cache_alloc_noprof+0x1bd/0x2f0 mm/slub.c:4009
Code: 5d 41 5e 41 5f 5d e9 2d aa 2e 09 31 c9 e9 71 ff ff ff 41 8b 44 24 08 a8 82 0f 84 5b ff ff ff a8 08 41 0f 45 de e9 50 ff ff ff <8b> 05 fd b8 f4 0b 85 c0 0f 85 c0 fe ff ff 4c 89 e7 44 89 f6 44 89
RSP: 0018:ffffc90009717840 EFLAGS: 00000246
RAX: 0000000000000000 RBX: 0000000000002120 RCX: 0000000000000000
RDX: 1ffffffff1bace70 RSI: 0000000000000028 RDI: ffff88802ba3b3c4
RBP: ffffc90009717888 R08: 00000000ffffffff R09: ffff8880250de500
R10: 0000000000000000 R11: 0000000000000004 R12: ffff88801544f8c0
R13: 0000000000002120 R14: 0000000000000028 R15: 00000000250de500
FS: 0000000000000000(0000) GS:ffff8880b9200000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f64bd0ce2d8 CR3: 000000006623e000 CR4: 0000000000350ef0
Call Trace:
 <IRQ>
 </IRQ>
 <TASK>
 fill_pool+0x26b/0x5d0 lib/debugobjects.c:168
 debug_objects_fill_pool lib/debugobjects.c:615 [inline]
 __debug_object_init+0xf7/0x480 lib/debugobjects.c:627
 __init_work+0x4c/0x60 kernel/workqueue.c:678
 batadv_iv_ogm_aggregate_new+0x2dd/0x4a0 net/batman-adv/bat_iv_ogm.c:584
 batadv_iv_ogm_queue_add net/batman-adv/bat_iv_ogm.c:670 [inline]
 batadv_iv_ogm_schedule_buff+0x99b/0x1500 net/batman-adv/bat_iv_ogm.c:833
 batadv_iv_ogm_schedule net/batman-adv/bat_iv_ogm.c:868 [inline]
 batadv_iv_ogm_schedule net/batman-adv/bat_iv_ogm.c:861 [inline]
 batadv_iv_send_outstanding_bat_ogm_packet+0x31e/0x8d0 net/batman-adv/bat_iv_ogm.c:1712
 process_one_work+0x9c8/0x1b40 kernel/workqueue.c:3248
 process_scheduled_works kernel/workqueue.c:3329 [inline]
 worker_thread+0x6c8/0xf30 kernel/workqueue.c:3409
 kthread+0x2c4/0x3a0 kernel/kthread.c:389
 ret_from_fork+0x48/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244
 </TASK>
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1 skipped: idling at native_safe_halt arch/x86/include/asm/irqflags.h:48 [inline]
NMI backtrace for cpu 1 skipped: idling at arch_safe_halt arch/x86/include/asm/irqflags.h:86 [inline]
NMI backtrace for cpu 1 skipped: idling at acpi_safe_halt+0x1a/0x20 drivers/acpi/processor_idle.c:112
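For triage, it can help to pull the symbolized frames out of a trace like the one above. The following is a minimal sketch, not part of syzbot or the kernel tooling; the names `FRAME_RE` and `parse_frames` are hypothetical, and the regular expression assumes the `func+0xoff/0xsize file:line` layout seen in this report.

```python
import re

# Hypothetical helper, written for this report's frame layout only.
# Matches one symbolized frame, for example:
#   "fill_pool+0x26b/0x5d0 lib/debugobjects.c:168"
#   "debug_objects_fill_pool lib/debugobjects.c:615 [inline]"
FRAME_RE = re.compile(
    r"(?P<func>[A-Za-z_][\w.]*)"
    r"(?:\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+))?"
    r"\s+(?P<file>[\w./-]+\.[chS]):(?P<line>\d+)"
    r"(?P<inline>\s+\[inline\])?"
)


def parse_frames(text):
    """Return a list of dicts, one per frame found in `text`.

    Works on a trace with one frame per line as well as on a trace
    that has been flattened onto a single line.
    """
    frames = []
    for m in FRAME_RE.finditer(text):
        frames.append({
            "func": m.group("func"),
            # Inlined frames carry no offset/size pair.
            "offset": int(m.group("off"), 16) if m.group("off") else None,
            "size": int(m.group("size"), 16) if m.group("size") else None,
            "file": m.group("file"),
            "line": int(m.group("line")),
            "inline": m.group("inline") is not None,
        })
    return frames


sample = (
    "fill_pool+0x26b/0x5d0 lib/debugobjects.c:168 "
    "debug_objects_fill_pool lib/debugobjects.c:615 [inline] "
    "__debug_object_init+0xf7/0x480 lib/debugobjects.c:627"
)
for frame in parse_frames(sample):
    print(frame["func"], frame["file"], frame["line"], frame["inline"])
```

Filtering the result on `file` starting with `net/batman-adv/` is one way to isolate the batman-adv frames from the allocator and workqueue frames around them.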
---
This report is generated by a bot. It may contain errors.
See https://goo.gl/tpsmEJ for more information about syzbot.
syzbot engineers can be reached at syzkaller@googlegroups.com.

syzbot will keep track of this issue. See:
https://goo.gl/tpsmEJ#status for how to communicate with syzbot.

If the report is already addressed, let syzbot know by replying with:
#syz fix: exact-commit-title

If you want to overwrite report's subsystems, reply with:
#syz set subsystems: new-subsystem
(See the list of subsystem names on the web dashboard)

If the report is a duplicate of another one, reply with:
#syz dup: exact-subject-of-another-report

If you want to undo deduplication, reply with:
#syz undup
syzbot has found a reproducer for the following issue on:
HEAD commit: a430d95c5efa Merge tag 'lsm-pr-20240911' of git://git.kern..
git tree: upstream
console output: https://syzkaller.appspot.com/x/log.txt?x=17e87f00580000
kernel config: https://syzkaller.appspot.com/x/.config?x=44d46e514184cd24
dashboard link: https://syzkaller.appspot.com/bug?extid=572f6e36bc6ee6f16762
compiler: gcc (Debian 12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40
syz repro: https://syzkaller.appspot.com/x/repro.syz?x=1481cca9980000
C reproducer: https://syzkaller.appspot.com/x/repro.c?x=14929607980000

Downloadable assets:
disk image: https://storage.googleapis.com/syzbot-assets/bdf130384fad/disk-a430d95c.raw....
vmlinux: https://storage.googleapis.com/syzbot-assets/c62ff195641a/vmlinux-a430d95c.x...
kernel image: https://storage.googleapis.com/syzbot-assets/4069702199e2/bzImage-a430d95c.x...

IMPORTANT: if you fix the issue, please add the following tag to the commit:
Reported-by: syzbot+572f6e36bc6ee6f16762@syzkaller.appspotmail.com
rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
rcu: Tasks blocked on level-0 rcu_node (CPUs 0-1): P1119/1:b..l
rcu: (detected by 0, t=10503 jiffies, g=23913, q=347 ncpus=2)
task:kworker/u8:6 state:R running task stack:24576 pid:1119 tgid:1119 ppid:2 flags:0x00004000
Workqueue: bat_events batadv_iv_send_outstanding_bat_ogm_packet
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5188 [inline]
 __schedule+0xe37/0x5490 kernel/sched/core.c:6529
 preempt_schedule_irq+0x51/0x90 kernel/sched/core.c:6851
 irqentry_exit+0x36/0x90 kernel/entry/common.c:354
 asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:702
RIP: 0010:lock_acquire+0x1f2/0x560 kernel/locking/lockdep.c:5727
Code: c1 05 ea b0 98 7e 83 f8 01 0f 85 ea 02 00 00 9c 58 f6 c4 02 0f 85 d5 02 00 00 48 85 ed 74 01 fb 48 b8 00 00 00 00 00 fc ff df <48> 01 c3 48 c7 03 00 00 00 00 48 c7 43 08 00 00 00 00 48 8b 84 24
RSP: 0018:ffffc900045b7a70 EFLAGS: 00000206
RAX: dffffc0000000000 RBX: 1ffff920008b6f50 RCX: 0000000000000001
RDX: 0000000000000001 RSI: ffffffff8b4cddc0 RDI: ffffffff8bb118a0
RBP: 0000000000000200 R08: 0000000000000000 R09: fffffbfff2d39ae0
R10: ffffffff969cd707 R11: 0000000000000000 R12: 0000000000000000
R13: 0000000000000000 R14: ffffffff8ddba6a0 R15: 0000000000000000
 rcu_lock_acquire include/linux/rcupdate.h:326 [inline]
 rcu_read_lock include/linux/rcupdate.h:838 [inline]
 batadv_iv_ogm_slide_own_bcast_window net/batman-adv/bat_iv_ogm.c:754 [inline]
 batadv_iv_ogm_schedule_buff+0x5ac/0x14d0 net/batman-adv/bat_iv_ogm.c:825
 batadv_iv_ogm_schedule net/batman-adv/bat_iv_ogm.c:868 [inline]
 batadv_iv_ogm_schedule net/batman-adv/bat_iv_ogm.c:861 [inline]
 batadv_iv_send_outstanding_bat_ogm_packet+0x31e/0x8d0 net/batman-adv/bat_iv_ogm.c:1712
 process_one_work+0x9c5/0x1b40 kernel/workqueue.c:3231
 process_scheduled_works kernel/workqueue.c:3312 [inline]
 worker_thread+0x6c8/0xf00 kernel/workqueue.c:3393
 kthread+0x2c1/0x3a0 kernel/kthread.c:389
 ret_from_fork+0x45/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244
 </TASK>
rcu: rcu_preempt kthread starved for 10529 jiffies! g23913 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x0 ->cpu=0
rcu: Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt state:R running task stack:27680 pid:17 tgid:17 ppid:2 flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5188 [inline]
 __schedule+0xe37/0x5490 kernel/sched/core.c:6529
 __schedule_loop kernel/sched/core.c:6606 [inline]
 schedule+0xe7/0x350 kernel/sched/core.c:6621
 schedule_timeout+0x136/0x2a0 kernel/time/timer.c:2581
 rcu_gp_fqs_loop+0x1eb/0xb00 kernel/rcu/tree.c:2034
 rcu_gp_kthread+0x271/0x380 kernel/rcu/tree.c:2236
 kthread+0x2c1/0x3a0 kernel/kthread.c:389
 ret_from_fork+0x45/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.11.0-syzkaller-02574-ga430d95c5efa #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 08/06/2024
RIP: 0010:native_irq_disable arch/x86/include/asm/irqflags.h:37 [inline]
RIP: 0010:arch_local_irq_disable arch/x86/include/asm/irqflags.h:92 [inline]
RIP: 0010:acpi_safe_halt+0x1a/0x20 drivers/acpi/processor_idle.c:112
Code: 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 65 48 8b 05 78 a2 eb 74 48 8b 00 a8 08 75 0c 66 90 0f 00 2d 68 56 a4 00 fb f4 <fa> c3 cc cc cc cc 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90
RSP: 0018:ffffffff8da07d70 EFLAGS: 00000246
RAX: 0000000000004000 RBX: 0000000000000001 RCX: ffffffff8b181979
RDX: 0000000000000001 RSI: ffff8880212b3000 RDI: ffff8880212b3064
RBP: ffff8880212b3064 R08: 0000000000000001 R09: ffffed1017106fd9
R10: ffff8880b8837ecb R11: 0000000000000000 R12: ffff8880212be800
R13: ffffffff8e9faa20 R14: 0000000000000000 R15: 0000000000000000
FS: 0000000000000000(0000) GS:ffff8880b8800000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007ff2dde0dd58 CR3: 000000002ad40000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 </IRQ>
 <TASK>
 acpi_idle_enter+0xc5/0x160 drivers/acpi/processor_idle.c:702
 cpuidle_enter_state+0xaa/0x4f0 drivers/cpuidle/cpuidle.c:264
 cpuidle_enter+0x4e/0xa0 drivers/cpuidle/cpuidle.c:385
 cpuidle_idle_call kernel/sched/idle.c:230 [inline]
 do_idle+0x313/0x3f0 kernel/sched/idle.c:326
 cpu_startup_entry+0x4f/0x60 kernel/sched/idle.c:424
 rest_init+0x16b/0x2b0 init/main.c:747
 start_kernel+0x3e4/0x4d0 init/main.c:1105
 x86_64_start_reservations+0x18/0x30 arch/x86/kernel/head64.c:507
 x86_64_start_kernel+0xb2/0xc0 arch/x86/kernel/head64.c:488
 common_startup_64+0x13e/0x148
 </TASK>
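The stall message above reports its duration in jiffies (`t=10503 jiffies`). A rough sketch of turning that into seconds follows. The helper name `parse_rcu_stall` is hypothetical, and the tick rate `hz=100` is an assumption, not something stated in this report; the actual value is the `CONFIG_HZ` setting in the linked kernel config.

```python
import re

# Hypothetical helper for the "(detected by ...)" line of an RCU stall report.
STALL_RE = re.compile(
    r"\(detected by (?P<cpu>\d+), t=(?P<t>\d+) jiffies, "
    r"g=(?P<g>\d+), q=(?P<q>\d+) ncpus=(?P<ncpus>\d+)\)"
)


def parse_rcu_stall(text, hz=100):
    """Extract the stall fields and convert jiffies to seconds.

    `hz` is an ASSUMED tick rate; check CONFIG_HZ in the kernel config
    that produced the log before trusting the computed seconds.
    """
    m = STALL_RE.search(text)
    if m is None:
        return None
    jiffies = int(m.group("t"))
    return {
        "detected_by_cpu": int(m.group("cpu")),
        "jiffies": jiffies,
        "grace_period": int(m.group("g")),
        "queued_callbacks": int(m.group("q")),
        "ncpus": int(m.group("ncpus")),
        "seconds": jiffies / hz,
    }


line = "rcu: (detected by 0, t=10503 jiffies, g=23913, q=347 ncpus=2)"
info = parse_rcu_stall(line)
print(info["jiffies"], "jiffies on", info["ncpus"], "CPUs")
```

Passing a different `hz` (for example 250 or 1000) changes only the `seconds` field; the other fields are read directly from the log line.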
---
If you want syzbot to run the reproducer, reply with:
#syz test: git://repo/address.git branch-or-commit-hash
If you attach or paste a git patch, syzbot will apply it before testing.
syzbot suspects this issue was fixed by commit:
commit c662e2b1e8cfc3b6329704dab06051f8c3ec2993
Author: Peter Zijlstra <peterz@infradead.org>
Date: Thu Sep 5 15:02:24 2024 +0000

    sched: Fix sched_delayed vs sched_core
bisection log: https://syzkaller.appspot.com/x/bisect.txt?x=14886e87980000
start commit: a430d95c5efa Merge tag 'lsm-pr-20240911' of git://git.kern..
git tree: upstream
kernel config: https://syzkaller.appspot.com/x/.config?x=44d46e514184cd24
dashboard link: https://syzkaller.appspot.com/bug?extid=572f6e36bc6ee6f16762
syz repro: https://syzkaller.appspot.com/x/repro.syz?x=1481cca9980000
C reproducer: https://syzkaller.appspot.com/x/repro.c?x=14929607980000
If the result looks correct, please mark the issue as fixed by replying with:
#syz fix: sched: Fix sched_delayed vs sched_core
For information about bisection process see: https://goo.gl/tpsmEJ#bisection
b.a.t.m.a.n@lists.open-mesh.org