x11 fails after installing nvidia driver

Issues related to hardware problems
timmerov
Posts: 2
Joined: 2015/06/06 15:25:15

Post by timmerov » 2020/12/19 00:42:39

i have a brand new computer: amd ryzen 5, geforce 1650 super, gigabyte motherboard, 1 tb ssd.
i have installed centos 8 successfully.
it boots fine with the nouveau drivers.
however... this is my gaming machine.
i need my nvidia drivers. ;->

in the past i have done the following:
0. somehow get nouveau.modeset=0 into the kernel command line.
1. if yum update gives me a new kernel, do the rest of these steps:
2. download the latest nvidia long lived stable unix driver.
3. sudo systemctl set-default multi-user.target
4. reboot
5. install the latest nvidia driver.
6. reboot
7. init 5 to ensure everything is working.
8. sudo systemctl set-default graphical.target
this has literally been working for years with centos 7.
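
Step 0 above can be scripted rather than done "somehow". What follows is only a minimal sketch, assuming the `grubby` tool is available on the CentOS 8 install; the helper name `has_karg` is made up for illustration and is not a system command. It checks a kernel command line string for an argument, so the `grubby` call is only needed when the argument is missing.

```shell
# has_karg CMDLINE ARG
# Succeeds if ARG appears as a whole, space-separated word in CMDLINE.
# (has_karg is a hypothetical helper name, not a system command.)
has_karg() {
    case " $1 " in
        *" $2 "*) return 0 ;;
        *)        return 1 ;;
    esac
}

# Typical use on the running system (needs root, so left commented out):
#   if ! has_karg "$(cat /proc/cmdline)" nouveau.modeset=0; then
#       sudo grubby --update-kernel=ALL --args="nouveau.modeset=0"
#   fi
```

Note that `/proc/cmdline` shows the command line of the kernel that is currently running, so an argument added with `grubby` only shows up there after the next reboot.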

centos 8 has been a pain in the backside.
the above procedure no longer works.
it worked better with earlier centos 8 kernels,
ie those before kernel-4.18.0-240.1.1.el8_3.x86_64,
though there were issues.
namely, sometimes it would fail to start x.
sometimes x could not read the configuration file.
so my screens were sideways.
anywho...
a clean centos install ensued.

i have tried the following nvidia drivers:
NVIDIA-Linux-x86_64-450.80.02.run (long lived stable)
NVIDIA-Linux-x86_64-455.45.01.run (short lived stable)
NVIDIA-Linux-x86_64-460.27.04.run (latest beta)
i have tried all of the nvidia drivers with and without the nvidia xorg.conf.
i have tried all of the nvidia drivers with a minimal xorg.conf:

Code:

Section "Device"
	Identifier  "Videocard0"
	Driver      "nvidia"
EndSection

init 5 ends up with a black screen.
alt-f2 gets me back to a shell login prompt.
startx is more productive.
every one of the above cases ends with:

Code:

[    23.546] (EE) NVIDIA(GPU-0): Failed to initialize the NVIDIA GPU at PCI:8:0:0.  Please
[    23.546] (EE) NVIDIA(GPU-0):     check your system's kernel log for additional error
[    23.546] (EE) NVIDIA(GPU-0):     messages and refer to Chapter 8: Common Problems in the
[    23.546] (EE) NVIDIA(GPU-0):     README for additional information.
[    23.546] (EE) NVIDIA(GPU-0): Failed to initialize the NVIDIA graphics device!
[    23.546] (EE) NVIDIA(0): Failing initialization of X screen
[    23.546] (EE) Screen(s) found, but none have a usable configuration.
[    23.546] (EE) Fatal server error:
[    23.546] (EE) no screens found(EE) 
[    23.546] (EE) Please consult the The X.Org Foundation support at http://wiki.x.org for help. 
[    23.546] (EE) Please also check the log file at "/var/log/Xorg.0.log" for additional information.
[    23.547] (EE) Server terminated with error (1). Closing log file.
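
The error text itself says to check the kernel log, so it helps to pull out just the relevant lines from both places. A rough sketch, assuming the NVIDIA kernel module tags its kernel messages with `NVRM` or `nvidia`; the function names `xorg_errors` and `gpu_kernel_messages` are invented here for illustration:

```shell
# xorg_errors [LOGFILE]
# Print only the timestamped (EE) lines from an X server log.
# Defaults to /var/log/Xorg.0.log, the path named in the log itself.
# The pattern requires "(EE)" right after the timestamp, so the
# "Markers:" legend line near the top of the log is not matched.
xorg_errors() {
    grep -E '^\[ *[0-9]+\.[0-9]+\] \(EE\)' "${1:-/var/log/Xorg.0.log}"
}

# gpu_kernel_messages
# Filter kernel log text on stdin down to lines that mention the
# NVIDIA or nouveau kernel modules (case-insensitive).
gpu_kernel_messages() {
    grep -iE 'nvrm|nvidia|nouveau'
}

# Typical use (left commented out; dmesg may need root):
#   xorg_errors
#   dmesg | gpu_kernel_messages
```

Reading the kernel log from stdin means the same filter works with `dmesg` or with `journalctl -k`.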

this section of xorg log is suspicious:

Code:

[    18.901] (==) Matched nvidia as autoconfigured driver 0
[    18.901] (==) Matched nouveau as autoconfigured driver 1
[    18.901] (==) Matched modesetting as autoconfigured driver 2
[    18.901] (==) Matched fbdev as autoconfigured driver 3
[    18.901] (==) Matched vesa as autoconfigured driver 4
[    18.901] (==) Assigned the driver to the xf86ConfigLayout
[    18.901] (II) LoadModule: "nvidia"
[    18.901] (II) Loading /usr/lib64/xorg/modules/drivers/nvidia_drv.so
[    18.903] (II) Module nvidia: vendor="NVIDIA Corporation"
[    18.903] 	compiled for 1.6.99.901, module version = 1.0.0
[    18.903] 	Module class: X.Org Video Driver
[    18.903] (II) LoadModule: "nouveau"
[    18.903] (II) Loading /usr/lib64/xorg/modules/drivers/nouveau_drv.so
[    18.905] (II) Module nouveau: vendor="X.Org Foundation"
[    18.905] 	compiled for 1.20.3, module version = 1.0.15
[    18.905] 	Module class: X.Org Video Driver
[    18.905] 	ABI class: X.Org Video Driver, version 24.0

what the heck? why is nouveau matching and loading?
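
One thing worth separating here: the `nouveau` lines in this log refer to the X server's userspace driver (`nouveau_drv.so`), which X autoconfiguration can load as a fallback candidate. That is a different piece from the `nouveau` kernel module that `nouveau.modeset=0` is aimed at. To see whether the kernel module is actually loaded, one can look at `lsmod`. A small sketch follows; `module_loaded` is a hypothetical helper name, not an existing command:

```shell
# module_loaded NAME
# Reads `lsmod` output on stdin and succeeds if NAME is listed in the
# first column (the module name column).
module_loaded() {
    awk -v mod="$1" '$1 == mod { found = 1 } END { exit !found }'
}

# Typical use (commented out so the block has no side effects):
#   if lsmod | module_loaded nouveau; then
#       echo "nouveau kernel module is loaded"
#   else
#       echo "nouveau kernel module is not loaded"
#   fi
```

Matching on the first column only avoids false positives from the "Used by" column, where `nouveau` can appear next to modules it depends on.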

okay, so i followed the advice found elsewhere in these forums: use elrepo.
that is better: the nouveau driver is no longer loaded.
but it still gets the same "no screens found" error.
here's the entire Xorg.0.log:

Code:

[    18.564] 
X.Org X Server 1.20.8
X Protocol Version 11, Revision 0
[    18.565] Build Operating System:  4.19.34-300.el7.x86_64 
[    18.565] Current Operating System: Linux 2020sux 4.18.0-240.1.1.el8_3.x86_64 #1 SMP Thu Nov 19 17:20:08 UTC 2020 x86_64
[    18.565] Kernel command line: BOOT_IMAGE=(hd0,msdos1)/vmlinuz-4.18.0-240.1.1.el8_3.x86_64 root=/dev/mapper/cs_2020sux-root ro crashkernel=auto resume=/dev/mapper/cs_2020sux-swap rd.lvm.lv=cs_2020sux/root rd.lvm.lv=cs_2020sux/swap rhgb quiet nouveau.modeset=0 rd.driver.blacklist=nouveau plymouth.ignore-udev
[    18.566] Build Date: 24 August 2020  06:49:47PM
[    18.566] Build ID: xorg-x11-server 1.20.8-6.el8 
[    18.566] Current version of pixman: 0.38.4
[    18.566] 	Before reporting problems, check http://wiki.x.org
	to make sure that you have the latest version.
[    18.567] Markers: (--) probed, (**) from config file, (==) default setting,
	(++) from command line, (!!) notice, (II) informational,
	(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[    18.568] (==) Log file: "/var/log/Xorg.0.log", Time: Fri Dec 18 15:58:30 2020
[    18.569] (==) Using config file: "/etc/X11/xorg.conf"
[    18.569] (==) Using config directory: "/etc/X11/xorg.conf.d"
[    18.569] (==) Using system config directory "/usr/share/X11/xorg.conf.d"
[    18.570] (==) No Layout section.  Using the first Screen section.
[    18.570] (==) No screen section available. Using defaults.
[    18.570] (**) |-->Screen "Default Screen Section" (0)
[    18.570] (**) |   |-->Monitor "<default monitor>"
[    18.570] (==) No device specified for screen "Default Screen Section".
	Using the first device section listed.
[    18.570] (**) |   |-->Device "Videocard0"
[    18.570] (==) No monitor specified for screen "Default Screen Section".
	Using a default monitor configuration.
[    18.570] (==) Automatically adding devices
[    18.570] (==) Automatically enabling devices
[    18.570] (==) Automatically adding GPU devices
[    18.570] (==) Automatically binding GPU devices
[    18.570] (==) Max clients allowed: 256, resource mask: 0x1fffff
[    18.570] (==) FontPath set to:
	catalogue:/etc/X11/fontpath.d,
	built-ins
[    18.570] (==) ModulePath set to "/usr/lib64/xorg/modules"
[    18.570] (II) The server relies on udev to provide the list of input devices.
	If no devices become available, reconfigure udev or disable AutoAddDevices.
[    18.570] (II) Loader magic: 0x55fe9279d020
[    18.570] (II) Module ABI versions:
[    18.570] 	X.Org ANSI C Emulation: 0.4
[    18.570] 	X.Org Video Driver: 24.1
[    18.570] 	X.Org XInput driver : 24.1
[    18.570] 	X.Org Server Extension : 10.0
[    18.571] (++) using VT number 1

[    18.574] (II) systemd-logind: took control of session /org/freedesktop/login1/session/_31
[    18.575] (II) xfree86: Adding drm device (/dev/dri/card0)
[    18.575] (II) Platform probe for /sys/devices/pci0000:00/0000:00:03.1/0000:08:00.0/drm/card0
[    18.575] (II) systemd-logind: got fd for /dev/dri/card0 226:0 fd 11 paused 0
[    18.577] (--) PCI:*(8@0:0:0) 10de:2187:1462:8d95 rev 161, Mem @ 0xfb000000/16777216, 0xd0000000/268435456, 0xe0000000/33554432, I/O @ 0x0000e000/128, BIOS @ 0x????????/131072
[    18.577] (II) LoadModule: "glx"
[    18.578] (II) Loading /usr/lib64/xorg/modules/extensions/libglx.so
[    18.581] (II) Module glx: vendor="X.Org Foundation"
[    18.581] 	compiled for 1.20.8, module version = 1.0.0
[    18.581] 	ABI class: X.Org Server Extension, version 10.0
[    18.581] (II) LoadModule: "nvidia"
[    18.581] (II) Loading /usr/lib64/xorg/modules/drivers/nvidia_drv.so
[    18.584] (II) Module nvidia: vendor="NVIDIA Corporation"
[    18.584] 	compiled for 1.6.99.901, module version = 1.0.0
[    18.584] 	Module class: X.Org Video Driver
[    18.584] (II) NVIDIA dlloader X Driver  450.80.02  Wed Sep 23 00:53:01 UTC 2020
[    18.584] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[    18.584] (II) systemd-logind: releasing fd for 226:0
[    18.584] (II) Loading sub module "fb"
[    18.584] (II) LoadModule: "fb"
[    18.584] (II) Loading /usr/lib64/xorg/modules/libfb.so
[    18.586] (II) Module fb: vendor="X.Org Foundation"
[    18.586] 	compiled for 1.20.8, module version = 1.0.0
[    18.586] 	ABI class: X.Org ANSI C Emulation, version 0.4
[    18.586] (II) Loading sub module "wfb"
[    18.586] (II) LoadModule: "wfb"
[    18.586] (II) Loading /usr/lib64/xorg/modules/libwfb.so
[    18.586] (II) Module wfb: vendor="X.Org Foundation"
[    18.586] 	compiled for 1.20.8, module version = 1.0.0
[    18.586] 	ABI class: X.Org ANSI C Emulation, version 0.4
[    18.586] (II) Loading sub module "ramdac"
[    18.586] (II) LoadModule: "ramdac"
[    18.586] (II) Module "ramdac" already built-in
[    18.587] (II) NVIDIA(0): Creating default Display subsection in Screen section
	"Default Screen Section" for depth/fbbpp 24/32
[    18.587] (==) NVIDIA(0): Depth 24, (==) framebuffer bpp 32
[    18.587] (==) NVIDIA(0): RGB weight 888
[    18.587] (==) NVIDIA(0): Default visual is TrueColor
[    18.587] (==) NVIDIA(0): Using gamma correction (1.0, 1.0, 1.0)
[    18.587] (**) NVIDIA(0): Enabling 2D acceleration
[    18.587] (II) Loading sub module "glxserver_nvidia"
[    18.587] (II) LoadModule: "glxserver_nvidia"
[    18.587] (II) Loading /usr/lib64/xorg/modules/extensions/libglxserver_nvidia.so
[    18.612] (II) Module glxserver_nvidia: vendor="NVIDIA Corporation"
[    18.612] 	compiled for 1.6.99.901, module version = 1.0.0
[    18.612] 	Module class: X.Org Server Extension
[    18.612] (II) NVIDIA GLX Module  450.80.02  Wed Sep 23 00:51:32 UTC 2020
[    18.612] (II) NVIDIA: The X server supports PRIME Render Offload.
[    23.188] (EE) NVIDIA(GPU-0): Failed to initialize the NVIDIA GPU at PCI:8:0:0.  Please
[    23.188] (EE) NVIDIA(GPU-0):     check your system's kernel log for additional error
[    23.188] (EE) NVIDIA(GPU-0):     messages and refer to Chapter 8: Common Problems in the
[    23.188] (EE) NVIDIA(GPU-0):     README for additional information.
[    23.188] (EE) NVIDIA(GPU-0): Failed to initialize the NVIDIA graphics device!
[    23.188] (EE) NVIDIA(0): Failing initialization of X screen
[    23.188] (II) UnloadModule: "nvidia"
[    23.188] (II) UnloadSubModule: "glxserver_nvidia"
[    23.188] (II) Unloading glxserver_nvidia
[    23.188] (II) UnloadSubModule: "wfb"
[    23.188] (II) UnloadSubModule: "fb"
[    23.188] (EE) Screen(s) found, but none have a usable configuration.
[    23.188] (EE) 
Fatal server error:
[    23.188] (EE) no screens found(EE) 
[    23.188] (EE) 
Please consult the The X.Org Foundation support 
	 at http://wiki.x.org
 for help. 
[    23.188] (EE) Please also check the log file at "/var/log/Xorg.0.log" for additional information.
[    23.188] (EE) 
[    23.189] (EE) Server terminated with error (1). Closing log file.

help please.
i'm stumped.

DarkNemos
Posts: 3
Joined: 2018/04/11 16:32:03

Re: x11 fails after installing nvidia driver

Post by DarkNemos » 2020/12/23 19:16:55

I have exactly the same problem. I have been reinstalling CentOS 8 and the drivers for more than a week with different combinations of ELRepo, the driver straight from NVIDIA, CUDA, and who knows what else. Always the same issue.

Please help!

pxthron
Posts: 1
Joined: 2021/02/04 16:07:36

Re: x11 fails after installing nvidia driver

Post by pxthron » 2021/02/04 16:12:23

Facing the same problem: my PC keeps rebooting after I install the NVIDIA driver, whichever method I use. Did you guys solve the problem? I had to go back to RHEL 7.

DarkNemos
Posts: 3
Joined: 2018/04/11 16:32:03

Re: x11 fails after installing nvidia driver

Post by DarkNemos » 2021/02/04 18:08:11

NVIDIA replied to me; they said to update my BIOS to the latest version. Honestly, I haven't tried it; I'm on CentOS 7.

jlehtone
Posts: 4523
Joined: 2007/12/11 08:17:33
Location: Finland

Re: x11 fails after installing nvidia driver

Post by jlehtone » 2021/02/04 18:22:54

Code:

sudo yum install elrepo-release
sudo yum install nvidia-detect
sudo yum install $(nvidia-detect)
reboot
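
In the third command, `$(nvidia-detect)` is shell command substitution: whatever `nvidia-detect` prints becomes the argument to `yum install`. A more cautious variant checks that output before installing anything. This is only a sketch, and it assumes the printed package name starts with `kmod-nvidia`, which is an assumption about ELRepo's package naming that should be verified locally; `pick_driver_pkg` is a hypothetical helper name:

```shell
# pick_driver_pkg DETECT_OUTPUT
# Prints DETECT_OUTPUT and succeeds only if it looks like a kmod-nvidia
# package name; fails on empty or unexpected output.
# ASSUMPTION: ELRepo driver packages are named kmod-nvidia*.
pick_driver_pkg() {
    case "$1" in
        kmod-nvidia*) printf '%s\n' "$1" ;;
        *)            return 1 ;;
    esac
}

# Typical use (needs root and ELRepo, so left commented out):
#   pkg=$(pick_driver_pkg "$(nvidia-detect)") && sudo yum install "$pkg"
```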
