Kernel ring buffer overflowing with AER errors (rtl8821ce)

Since the upgrade to kernel version 6.0.5, my dmesg is overflowing with AER errors like these:

[ 4467.652829] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4467.652839] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4467.781242] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4467.781279] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4467.781288] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4467.781298] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4468.014984] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4468.015018] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4468.015027] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4468.015090] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4468.102322] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4468.102358] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4468.102367] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4468.102377] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4468.495161] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4468.495197] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4468.495204] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4468.495214] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4468.675257] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4468.675289] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4468.675298] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4468.675307] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4469.248111] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4469.248144] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4469.248153] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4469.248163] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4470.835463] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4470.835504] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4470.835514] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4470.835526] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4471.213764] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4471.213800] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4471.213809] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4471.213818] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4471.276599] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4471.276636] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4471.276646] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4471.276657] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4471.278776] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4471.278806] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4471.278814] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4471.278823] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4471.394159] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4471.394195] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4471.394204] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4471.394214] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)
[ 4471.664311] pcieport 0000:00:1d.1: AER: Multiple Corrected error received: 0000:03:00.0
[ 4471.664338] rtl8821ce 0000:03:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4471.664345] rtl8821ce 0000:03:00.0:   device [10ec:c821] error status/mask=00000001/0000e000
[ 4471.664352] rtl8821ce 0000:03:00.0:    [ 0] RxErr                  (First)

I am currently using the wifi driver from here, which I had to update due to build failing for kernel 6.0.5 (this driver is working despite the flood of errors). The same issue happens with the rtw88_8821ce kernel driver, with the added problem of not being able to connect to my network.

Should I report this as a kernel bug?

Sounds like a kernel bug, specifically related to that driver.

2 Likes

Indeed, and that driver specifically has a lot of bugs. I ended up replacing my 882x with Intel wifi cards and very much did not regret it.

2 Likes

I agree, a pain in my Linux experience. However I’m stuck with it…
I should file a bug report then. (Hoping that it won’t take realtek an eternity to fix).

3 Likes

I found the solution:
sudo grubby --update-kernel=ALL --args="pcie_aspm=off"
This disables PCIe ASPM. Apparently the kernel was turning it on? Because according to dmesg my laptop doesn’t support ASPM so it should be disabled.

dmesg | grep aspm -i
[    0.000000] Command line: <boot volumes> rhgb quiet intel_iommu=on pcie_aspm=off
[    0.066562] Kernel command line: BOOT_IMAGE=(hd1,gpt2)/vmlinuz-6.0.5-200.fc36.x86_64 root=UUID=<> ro rootflags=subvol=root rd.luks.uuid=<> rhgb quiet intel_iommu=on pcie_aspm=off
[    0.066761] PCIe ASPM is disabled
[    0.157331] ACPI FADT declares the system doesn't support PCIe ASPM, so disable it
[    0.379810] acpi PNP0A08:00: _OSC: not requesting OS control; OS requires [ExtendedConfig ASPM ClockPM MSI]
[    0.379983] acpi PNP0A08:00: FADT indicates ASPM is unsupported, using BIOS configuration
1 Like