atlantic kernel module fails to load after resume from suspend
by perrin4869 from LinuxQuestions.org on (#6NTP4)
I am running slackware64-current, currently on kernel 6.9.6, but this issue has been present for as long as I can remember.
I am using an AM5 build with an MSI X670E ACE board, which has a 10G port driven by an Aquantia Corp. AQtion AQC113CS NBase-T/IEEE 802.3an Ethernet Controller (Antigua 10G). After resuming from suspend, the wired interface randomly fails to come up. As far as I can recall, it has never happened on the first suspend -> resume cycle, but when it does happen, I see this error in dmesg:
Code:
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 71)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 14e0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 14e1
[Wed May 15 12:09:33 2024] ? srso_alias_return_thunk+0x5/0xfbef5
[Wed May 15 12:09:33 2024] ? __alloc_pages_direct_compact+0xaf/0x280
[Wed May 15 12:09:33 2024] __alloc_pages_slowpath.constprop.0+0xd0d/0xd60
[Wed May 15 12:09:33 2024] ? aq_ring_alloc+0x22/0x80 [atlantic]
[Wed May 15 12:09:33 2024] __alloc_pages+0x329/0x360
[Wed May 15 12:09:33 2024] __kmalloc_large_node+0x6d/0x120
[Wed May 15 12:09:33 2024] __kmalloc+0x300/0x3e0
[Wed May 15 12:09:33 2024] ? aq_ring_alloc+0x22/0x80 [atlantic]
[Wed May 15 12:09:33 2024] aq_ring_alloc+0x22/0x80 [atlantic]
[Wed May 15 12:09:33 2024] aq_vec_ring_alloc+0xfd/0x1c0 [atlantic]
[Wed May 15 12:09:33 2024] aq_nic_init+0x114/0x1d0 [atlantic]
[Wed May 15 12:09:33 2024] atl_resume_common+0x43/0xd0 [atlantic]
[Wed May 15 12:09:33 2024] ? __pfx_pci_pm_resume+0x10/0x10
[Wed May 15 12:09:33 2024] dpm_run_callback+0x1d/0xf0
[Wed May 15 12:09:33 2024] device_resume+0x9c/0x210
[Wed May 15 12:09:33 2024] async_resume+0x19/0x30
[Wed May 15 12:09:33 2024] async_run_entry_fn+0x23/0xb0
[Wed May 15 12:09:33 2024] process_one_work+0x17d/0x3a0
[Wed May 15 12:09:33 2024] worker_thread+0x2ed/0x400
[Wed May 15 12:09:33 2024] ? __pfx_worker_thread+0x10/0x10
[Wed May 15 12:09:33 2024] kthread+0xda/0x110
[Wed May 15 12:09:33 2024] ? __pfx_kthread+0x10/0x10
[Wed May 15 12:09:33 2024] ret_from_fork+0x2d/0x50
[Wed May 15 12:09:33 2024] ? __pfx_kthread+0x10/0x10
[Wed May 15 12:09:33 2024] ret_from_fork_asm+0x1a/0x30
[Wed May 15 12:09:33 2024] </TASK>
[Wed May 15 12:09:33 2024] Mem-Info:
[Wed May 15 12:09:33 2024] active_anon:3864139 inactive_anon:0 isolated_anon:0
active_file:4065874 inactive_file:3073388 isolated_file:0
unevictable:30 dirty:1 writeback:0
slab_reclaimable:337974 slab_unreclaimable:77010
mapped:343328 shmem:121722 pagetables:53419
sec_pagetables:0 bounce:0
kernel_misc_reclaimable:0
free:910815 free_pcp:0 free_cma:0
[Wed May 15 12:09:33 2024] Node 0 active_anon:15456556kB inactive_anon:0kB active_file:16263496kB inactive_file:12293552kB unevictable:120kB isolated(anon):0kB isolated(file):0kB mapped:1373312kB dirty:4kB writeback:0kB shmem:486888kB shmem_thp:0kB shmem_pmdmapped:0kB anon_thp:0kB writeback_tmp:0kB kernel_stack:63816kB pagetables:213676kB sec_pagetables:0kB all_unreclaimable? no
[Wed May 15 12:09:33 2024] Node 0 DMA free:11264kB boost:0kB min:12kB low:24kB high:36kB reserved_highatomic:0KB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB writepending:0kB present:15996kB managed:15360kB mlocked:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB
[Wed May 15 12:09:33 2024] lowmem_reserve[]: 0 2626 63425 63425 63425
[Wed May 15 12:09:33 2024] Node 0 DMA32 free:258396kB boost:12256kB min:15052kB low:17740kB high:20428kB reserved_highatomic:0KB active_anon:358352kB inactive_anon:0kB active_file:2948kB inactive_file:1423612kB unevictable:0kB writepending:0kB present:2820232kB managed:2711000kB mlocked:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB
[Wed May 15 12:09:33 2024] lowmem_reserve[]: 0 0 60799 60799 60799
[Wed May 15 12:09:33 2024] Node 0 Normal free:3373600kB boost:283920kB min:348688kB low:410944kB high:473200kB reserved_highatomic:2048KB active_anon:15098204kB inactive_anon:0kB active_file:16260548kB inactive_file:10869940kB unevictable:120kB writepending:4kB present:63404544kB managed:62258552kB mlocked:120kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB
[Wed May 15 12:09:33 2024] lowmem_reserve[]: 0 0 0 0 0
[Wed May 15 12:09:33 2024] Node 0 DMA: 0*4kB 0*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 1*1024kB (U) 1*2048kB (M) 2*4096kB (M) = 11264kB
[Wed May 15 12:09:33 2024] Node 0 DMA32: 26127*4kB (UM) 11962*8kB (UM) 3635*16kB (UM) 4*32kB (UM) 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 258492kB
[Wed May 15 12:09:33 2024] Node 0 Normal: 287019*4kB (UME) 199768*8kB (UME) 37645*16kB (UME) 643*32kB (UME) 7*64kB (UM) 0*128kB 0*256kB 0*512kB 0*1024kB 1*2048kB (H) 0*4096kB = 3371612kB
[Wed May 15 12:09:33 2024] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
[Wed May 15 12:09:33 2024] Node 0 hugepages_total=5000 hugepages_free=5000 hugepages_surp=0 hugepages_size=2048kB
[Wed May 15 12:09:33 2024] 7260984 total pagecache pages
[Wed May 15 12:09:33 2024] 0 pages in swap cache
[Wed May 15 12:09:33 2024] Free swap = 0kB
[Wed May 15 12:09:33 2024] Total swap = 0kB
[Wed May 15 12:09:33 2024] 16560193 pages RAM
[Wed May 15 12:09:33 2024] 0 pages HighMem/MovableOnly
[Wed May 15 12:09:33 2024] 313965 pages reserved
[Wed May 15 12:09:33 2024] 0 pages cma reserved
[Wed May 15 12:09:33 2024] 0 pages hwpoisoned
[Wed May 15 12:09:33 2024] atlantic 0000:07:00.0: PM: dpm_run_callback(): pci_pm_resume+0x0/0xe0 returns -12
[Wed May 15 12:09:33 2024] atlantic 0000:07:00.0: PM: failed to resume async: error -12
I tried to rmmod atlantic, but that hangs. Bringing the interface up or down also hangs.
The only solution is to hard reboot the PC.
I haven't tried any other operating system.
I have tried adjusting the module's power-related parameters, but without any luck.
Could anyone give me some hints on how to troubleshoot this further?
Should I contact MSI support? File a bug report against the kernel?
I wish I had some kernel development experience, but unfortunately at my current skill level I am not able to debug the issue...
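A note on reading the log above: error -12 is ENOMEM, and the trace shows the failure inside a large kmalloc called from aq_ring_alloc during resume. The header line that would state the requested allocation order is not in the excerpt, so the size the driver asked for is unknown. The per-zone free-list lines do show which contiguous block sizes were still available. The Python sketch below parses two of those lines; the helper names are made up for illustration, and it assumes that the letters in parentheses are migrate types and that (H) marks the high-atomic reserve, which ordinary allocations generally cannot use.

```python
import errno
import re

# Two free-list lines copied from the dmesg output above (timestamps dropped).
BUDDY_LINES = [
    "Node 0 DMA32: 26127*4kB (UM) 11962*8kB (UM) 3635*16kB (UM) 4*32kB (UM) "
    "0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 258492kB",
    "Node 0 Normal: 287019*4kB (UME) 199768*8kB (UME) 37645*16kB (UME) "
    "643*32kB (UME) 7*64kB (UM) 0*128kB 0*256kB 0*512kB 0*1024kB "
    "1*2048kB (H) 0*4096kB = 3371612kB",
]

# Matches entries such as "643*32kB (UME)" or "0*128kB".
ENTRY = re.compile(r"(\d+)\*(\d+)kB(?: \(([A-Z]+)\))?")


def largest_free_block_kb(line, ignore_highatomic=True):
    """Return the largest block size in kB that has at least one free block.

    Assumption: a block tagged only "(H)" belongs to the high-atomic
    reserve and is skipped unless ignore_highatomic is False.
    """
    largest = 0
    for count, size, flags in ENTRY.findall(line):
        if int(count) == 0:
            continue
        if ignore_highatomic and flags == "H":
            continue
        largest = max(largest, int(size))
    return largest


def zone_name(line):
    """Return the text before the first colon, e.g. 'Node 0 Normal'."""
    return line.split(":", 1)[0]


if __name__ == "__main__":
    # The kernel reports errors as negative errno values, so -12 is errno 12.
    print("error -12 is", errno.errorcode[12])
    for line in BUDDY_LINES:
        print(zone_name(line), "largest free block:",
              largest_free_block_kb(line), "kB")
```

For the lines above this reports 32 kB as the largest free block in DMA32 and 64 kB in Normal (2048 kB if the (H) block is counted). If the ring allocation needs more contiguous memory than that, fragmentation rather than the total amount of free memory would explain the failure, but that is an inference from the log, not a confirmed diagnosis.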