1

Kernel panic in 4.11.3 using nouveau driver.

asked 2017-06-09 09:34:24 +0000

nasos117

updated 2017-06-11 08:24:47 +0000

Hello people. I had Fedora 25 with the 4.10 kernel and it worked fine, but when the 4.11 update came in I couldn't boot: I got either a frozen screen at the Fedora logo or an error message. I did a fresh UEFI install (F25, kernel 4.8) and updated to the newest kernel, 4.11.3-202.fc25.x86_64, hoping it would fix the problem in the previous release (4.11.3-200.fc25.x86_64), but the same thing happens. Yesterday I was able to boot the 4.11.3-200 kernel and log in, but after a short time it froze again. My hardware is an Alienware 17 laptop (2013 model) with an i7-4700MQ and an NVIDIA 770M.


6 answers

2

answered 2017-06-10 22:43:24 +0000

nullcat

I was also able to work around this problem by disabling the nouveau video driver (I'm using my motherboard's built-in video instead of my Nvidia card.)

  1. Switch to superuser with sudo -i.
  2. Add modprobe.blacklist=nouveau to GRUB_CMDLINE_LINUX in /etc/default/grub.
  3. Run grub2-mkconfig -o /boot/efi/EFI/fedora/grub.cfg.
  4. Reboot.
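
For anyone who would rather script step 2 than edit the file by hand, here is a minimal sketch. The add_kernel_arg helper is a hypothetical name, not a Fedora tool, and it assumes GNU sed and a GRUB_CMDLINE_LINUX value written on one line in double quotes.

```shell
#!/bin/sh
# Hypothetical helper (not part of Fedora): append a kernel argument to
# GRUB_CMDLINE_LINUX unless it is already there.
# Assumes the value sits on a single line in double quotes.
add_kernel_arg() {
    arg=$1
    file=${2:-/etc/default/grub}
    # Skip the edit if the argument is already one of the words on the line.
    if grep -Eq "^GRUB_CMDLINE_LINUX=\"(.* )?${arg}( .*)?\"" "$file"; then
        return 0
    fi
    # Insert the argument just before the closing double quote (GNU sed -i).
    sed -i "s|^GRUB_CMDLINE_LINUX=\"\(.*\)\"|GRUB_CMDLINE_LINUX=\"\1 ${arg}\"|" "$file"
}

# Usage, as root (steps 2 and 3 from the list above):
#   add_kernel_arg modprobe.blacklist=nouveau
#   grub2-mkconfig -o /boot/efi/EFI/fedora/grub.cfg
```

Running the helper twice leaves the file unchanged the second time, so it should be safe to repeat. It is worth keeping a backup copy of /etc/default/grub before trying it.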

Comments

I can boot the new kernel, but only at low resolution. I guess I'll have to get NVIDIA's driver after all. Thanks for your time! You were great!

nasos117 ( 2017-06-10 23:26:12 +0000 )

OK, I got the NVIDIA driver! Everything is fine, I can boot the 4.11.3-202 kernel without problems! Blacklisting nouveau did the trick! Thanks again!

nasos117 ( 2017-06-10 23:39:46 +0000 )

You're welcome. Are you able to edit the title of this post? Could you change it to something that will help other people find it? For example,

Kernel panic in version 4.11.3 using nouveau driver

nullcat ( 2017-06-11 01:15:41 +0000 )

I'm on it!

nasos117 ( 2017-06-11 08:22:27 +0000 )

When you installed the nvidia driver, did you use dkms so it updates with kernel changes? I have had no problems with kernel upgrades for the past 6 versions.

steveebey73742 ( 2017-06-11 12:46:25 +0000 )
1

answered 2017-06-09 20:26:02 +0000

nullcat

I can reproduce this bug 100% of the time with 4.11.3-200.fc25 and 4.11.3-202.fc25. I have screenshots of the kernel stack trace here:

https://imgur.com/a/fHTCk

If you look at the top of the stack trace in the first image, you can see the unhandled kernel paging request occurs immediately after lines that begin with fbcon, which makes me think the bug is in the video subsystem.

To work around this problem:

  1. Press e at the boot menu.
  2. Delete the word quiet from the line that begins with linuxefi.
  3. Press Control+X to continue the boot process.
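
The boot-menu edit above only lasts for a single boot. To confirm which parameters the running kernel actually received, one option is to look at /proc/cmdline. The sketch below uses a hypothetical has_kernel_arg helper (my naming, not a standard tool) and assumes the parameters are separated by single spaces, as they normally are in that file.

```shell
#!/bin/sh
# Hypothetical helper: succeed if a kernel parameter appears as a whole word
# in a kernel command line file (defaults to the running kernel's).
has_kernel_arg() {
    arg=$1
    file=${2:-/proc/cmdline}
    # Put each parameter on its own line, then look for an exact match.
    tr ' ' '\n' < "$file" | grep -qxF -- "$arg"
}

# Usage after booting with the edited entry:
#   if has_kernel_arg quiet; then echo "quiet is still set"; fi
```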

Here is my hardware:

❯ uname -a
Linux norfair 4.11.3-202.fc25.x86_64 #1 SMP Mon Jun 5 16:38:21 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

❯ lspci
00:00.0 Host bridge: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor DRAM Controller (rev 09)
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor PCI Express Root Port (rev 09)
00:02.0 VGA compatible controller: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor Graphics Controller (rev 09)
00:14.0 USB controller: Intel Corporation 7 Series/C210 Series Chipset Family USB xHCI Host Controller (rev 04)
00:16.0 Communication controller: Intel Corporation 7 Series/C216 Chipset Family MEI Controller #1 (rev 04)
00:19.0 Ethernet controller: Intel Corporation 82579V Gigabit Network Connection (rev 04)
00:1a.0 USB controller: Intel Corporation 7 Series/C216 Chipset Family USB Enhanced Host Controller #2 (rev 04)
00:1b.0 Audio device: Intel Corporation 7 Series/C216 Chipset Family High Definition Audio Controller (rev 04)
00:1c.0 PCI bridge: Intel Corporation 7 Series/C216 Chipset Family PCI Express Root Port 1 (rev c4)
00:1c.6 PCI bridge: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 7 (rev c4)
00:1c.7 PCI bridge: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 8 (rev c4)
00:1d.0 USB controller: Intel Corporation 7 Series/C216 Chipset Family USB Enhanced Host Controller #1 (rev 04)
00:1f.0 ISA bridge: Intel Corporation Z77 Express Chipset LPC Controller (rev 04)
00:1f.2 SATA controller: Intel Corporation 7 Series/C210 Series Chipset Family 6-port SATA Controller [AHCI mode] (rev 04)
00:1f.3 SMBus: Intel Corporation 7 Series/C216 Chipset Family SMBus Controller (rev 04)
01:00.0 VGA compatible controller: NVIDIA Corporation GK106 [GeForce GTX 660] (rev a1)
01:00.1 Audio device: NVIDIA Corporation GK106 HDMI Audio Controller (rev a1)
03:00.0 Network controller: Broadcom Limited BCM43228 802.11a/b/g/n
04:00.0 USB controller: ASMedia Technology Inc. ASM1042 SuperSpeed USB Host Controller

I was unable to get a core dump by following the instructions in https://ask.fedoraproject.org/en/ques... because adding crashkernel=128M to the boot parameters prevents the crash from happening.


Comments

Thanks for taking the time to answer, I really appreciate it. I did as you suggested, but it didn't help.

nasos117 ( 2017-06-10 00:23:59 +0000 )

Would you please post the output of lspci on your computer?

Are you able to capture a kernel core dump?

  1. Switch to superuser with sudo -i.
  2. Add crashkernel=128M to GRUB_CMDLINE_LINUX in /etc/default/grub.
  3. Run grub2-mkconfig -o /boot/efi/EFI/fedora/grub.cfg.
  4. Run systemctl enable kdump.
  5. Reboot into your good kernel.
  6. Run systemctl status kdump and look for errors.
  7. If kdump reports no errors, reboot into your bad kernel and wait 60 seconds after the kernel panic.
  8. Reboot into your good kernel and see if there are files under /var/crash.
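
Two of the checks above can be scripted. This is only a sketch: the function names are hypothetical, and it assumes the kernel exposes the reserved crash memory size in /sys/kernel/kexec_crash_size and that kdump writes files named vmcore into a subdirectory of /var/crash.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the kernel reserved memory for a crash
# kernel, i.e. the crashkernel= parameter from step 2 took effect.
crash_memory_reserved() {
    size_file=${1:-/sys/kernel/kexec_crash_size}
    [ -r "$size_file" ] || return 1
    size=$(cat "$size_file")
    # A size of 0 bytes means nothing was reserved.
    [ "${size:-0}" -gt 0 ]
}

# Hypothetical helper: print the path of every vmcore file below a directory.
list_crash_dumps() {
    dir=${1:-/var/crash}
    find "$dir" -type f -name 'vmcore*' 2>/dev/null
}

# Usage in the good kernel (steps 6 and 8):
#   crash_memory_reserved || echo "no crash memory reserved; check step 2"
#   list_crash_dumps
```

If crash_memory_reserved fails, the crashkernel=128M argument probably never reached the kernel, which would point at step 2 or step 3 rather than at kdump itself.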
nullcat ( 2017-06-10 20:41:50 +0000 )

No files inside the /var/crash dir. The good kernel result of step #6 is this. I don't know if I did step #2 correctly. I changed the owner of the grub file to my account (sudo chown) so I could save it.

nasos117 ( 2017-06-10 22:18:46 +0000 )

Also, I noticed that the errors on the kernel panic screen ended with [nouveau], so I guess you were right about the video subsystem being at fault.

The thing is that the first time I installed the NVIDIA driver the whole system went crazy (the screen started blinking the moment the Fedora boot screen came up), so I didn't risk installing it again.

nasos117 ( 2017-06-10 22:25:33 +0000 )

https://bugzilla.redhat.com/show_bug.cgi?id=1461337

efi/bgrt: Skip efi_bgrt_init() in case of non-EFI boot

"The cause is that on this machine the firmware provides the EFI ACPI BGRT table even on legacy non-EFI bootups - which table should be EFI only.

The garbage BGRT data causes the efi_bgrt_init() panic.

Add a check to skip efi_bgrt_init() in case non-EFI bootup to work around this firmware bug."

In other words, the 4.11.x kernels have issues on certain hardware setups, but a fix should trickle down soon.

budious ( 2017-06-14 12:44:39 +0000 )
1

answered 2017-06-16 18:37:40 +0000

nullcat

Good news: version 4.11.5 of the kernel fixes this problem for me. I upgraded using this command:

sudo dnf update --enablerepo=updates-testing kernel
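
After rebooting, it may help to confirm that the running kernel is at least 4.11.5. A minimal sketch, assuming GNU sort with the -V (version sort) option; version_at_least is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: succeed if version $1 is greater than or equal to
# version $2, using GNU sort's version ordering.
version_at_least() {
    have=$1
    want=$2
    # If the wanted version sorts first (or the two are equal), $have is new enough.
    [ "$(printf '%s\n%s\n' "$want" "$have" | sort -V | head -n 1)" = "$want" ]
}

# Usage: compare only the upstream part of the release string,
# e.g. 4.11.5 out of 4.11.5-200.fc25.x86_64.
#   version_at_least "$(uname -r | cut -d- -f1)" 4.11.5 && echo "kernel is 4.11.5 or newer"
```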

0

answered 2017-06-09 15:56:43 +0000

dpeterson

Same thing here. Exactly. Both kernel upgrades, 4.11.2 and 4.11.3, go into a full kernel panic.

Sorry I can't help, but I thought I'd add some weight to the issue.


Comments

Thanks! :)

nasos117 ( 2017-06-10 00:26:17 +0000 )
0

answered 2017-06-10 21:22:57 +0000

nullcat

I have reported this to Red Hat as bug #1460456:

https://bugzilla.redhat.com/show_bug....

0

answered 2017-06-13 15:07:46 +0000

I'm also having similar issues with kernel 4.11.3-202.fc25.x86_64 after performing a system upgrade from Fedora 24 to 25. I can still boot the previously installed 4.10.10-100.fc24.x86_64 kernel without issue.

My nouveau driver is blacklisted. My repositories are configured to use the akmod nvidia-driver packages from negativo17 and RPMFusion. With "quiet" removed from the boot line, the only line displayed is "Probing EDD (edd=off to disable)... ok" and then the kernel hangs. My google-fu indicated this message was related to CPU microcode. I tried the suggestion to add "dis_ucode_ldr" to the boot line, but no luck either.

https://bugzilla.redhat.com/show_bug.cgi?id=1353103 https://bugzilla.redhat.com/show_bug.cgi?id=1351943

I bring this up because nullcat's lspci looks very similar to my hardware (Xeon E3-1230v2 CPU).

[root@fedora ~]# lspci
00:00.0 Host bridge: Intel Corporation Xeon E3-1200 v2/Ivy Bridge DRAM Controller (rev 09)
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor PCI Express Root Port (rev 09)
00:16.0 Communication controller: Intel Corporation 6 Series/C200 Series Chipset Family MEI Controller #1 (rev 04)
00:1a.0 USB controller: Intel Corporation 6 Series/C200 Series Chipset Family USB Enhanced Host Controller #2 (rev 05)
00:1b.0 Audio device: Intel Corporation 6 Series/C200 Series Chipset Family High Definition Audio Controller (rev 05)
00:1c.0 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 1 (rev b5)
00:1c.4 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 5 (rev b5)
00:1c.5 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 6 (rev b5)
00:1c.6 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 7 (rev b5)
00:1c.7 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 8 (rev b5)
00:1d.0 USB controller: Intel Corporation 6 Series/C200 Series Chipset Family USB Enhanced Host Controller #1 (rev 05)
00:1f.0 ISA bridge: Intel Corporation P67 Express Chipset Family LPC Controller (rev 05)
00:1f.2 SATA controller: Intel Corporation 6 Series/C200 Series Chipset Family 6 port Desktop SATA AHCI Controller (rev 05)
00:1f.3 SMBus: Intel Corporation 6 Series/C200 Series Chipset Family SMBus Controller (rev 05)
01:00.0 VGA compatible controller: NVIDIA Corporation GP106 [GeForce GTX 1060 6GB] (rev a1)
01:00.1 Audio device: NVIDIA Corporation GP106 High Definition Audio Controller (rev a1)
03:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9120 SATA 6Gb/s Controller (rev 12)
04:00.0 USB controller: Etron Technology, Inc. EJ168 USB 3.0 Host Controller (rev 01)
05:00.0 USB controller: Etron Technology, Inc. EJ168 USB 3.0 Host Controller (rev 01)
06:00.0 PCI bridge: PLX Technology, Inc. PEX 8608 8-lane, 8-Port ...

Comments

There are some instructions in the official documentation, but I guess you have already tried those. There was a new update today; did it help in any way? You can always post a new question so people can find it more easily! Sorry I can't help any more, I'm new to Fedora!

nasos117 ( 2017-06-15 21:05:13 +0000 )


Stats

Asked: 2017-06-09 09:34:24 +0000

Seen: 826 times

Last updated: Jun 16