nVidia Drivers not detecting Quadro card

Issues related to hardware problems
Post Reply
philbax2
Posts: 3
Joined: 2018/03/01 21:57:00

nVidia Drivers not detecting Quadro card

Post by philbax2 » 2018/03/01 22:14:42

Hi! I had a CentOS 7 box with a 1080 in it, and I was able to use ELRepo to easily install the latest nVidia drivers.

I'm now setting up a CentOS 7 box with what dmesg says is a Quadro P2000 card in it. According to nvidia-detect, I need the latest drivers. According to the notes from nVidia on those drivers, this card is supported.

However, when I install kmod-nvidia and the associated nvidia drivers, I am unable to boot into a graphical interface. The Xorg logs state that the nvidia kernel module couldn't be loaded:

Code: Select all

[    52.768] 
X.Org X Server 1.19.3
Release Date: 2017-03-15
[    52.768] X Protocol Version 11, Revision 0
[    52.768] Build Operating System:  3.10.0-693.2.2.el7.x86_64 
[    52.768] Current Operating System: Linux seg02-h214 3.10.0-693.17.1.el7.x86_64 #1 SMP Thu Jan 25 20:13:58 UTC 2018 x86_64
[    52.768] Kernel command line: BOOT_IMAGE=/vmlinuz-3.10.0-693.17.1.el7.x86_64 root=/dev/mapper/centos-root ro rd.lvm.lv=centos/root rd.lvm.lv=centos/swap rhgb quiet LANG=en_US.UTF-8 nouveau.modeset=0 rd.driver.blacklist=nouveau plymouth.ignore-udev
[    52.768] Build Date: 25 January 2018  04:04:44PM
[    52.769] Build ID: xorg-x11-server 1.19.3-11.el7_4.1 
[    52.769] Current version of pixman: 0.34.0
[    52.769] 	Before reporting problems, check http://wiki.x.org
	to make sure that you have the latest version.
[    52.769] Markers: (--) probed, (**) from config file, (==) default setting,
	(++) from command line, (!!) notice, (II) informational,
	(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[    52.769] (==) Log file: "/var/log/Xorg.0.log", Time: Thu Mar  1 12:51:46 2018
[    52.769] (==) Using config file: "/etc/X11/xorg.conf"
[    52.769] (==) Using config directory: "/etc/X11/xorg.conf.d"
[    52.770] (==) Using system config directory "/usr/share/X11/xorg.conf.d"
[    52.770] (==) No Layout section.  Using the first Screen section.
[    52.770] (==) No screen section available. Using defaults.
[    52.770] (**) |-->Screen "Default Screen Section" (0)
[    52.770] (**) |   |-->Monitor "<default monitor>"
[    52.770] (==) No device specified for screen "Default Screen Section".
	Using the first device section listed.
[    52.770] (**) |   |-->Device "Videocard0"
[    52.770] (==) No monitor specified for screen "Default Screen Section".
	Using a default monitor configuration.
[    52.770] (==) Automatically adding devices
[    52.770] (==) Automatically enabling devices
[    52.770] (==) Automatically adding GPU devices
[    52.770] (==) Automatically binding GPU devices
[    52.770] (==) Max clients allowed: 256, resource mask: 0x1fffff
[    52.770] (==) FontPath set to:
	catalogue:/etc/X11/fontpath.d,
	built-ins
[    52.770] (**) ModulePath set to "/usr/lib64/xorg/modules/extensions/nvidia,/usr/lib64/xorg/modules"
[    52.770] (II) The server relies on udev to provide the list of input devices.
	If no devices become available, reconfigure udev or disable AutoAddDevices.
[    52.770] (II) Loader magic: 0x5624bc5c1020
[    52.770] (II) Module ABI versions:
[    52.770] 	X.Org ANSI C Emulation: 0.4
[    52.770] 	X.Org Video Driver: 23.0
[    52.770] 	X.Org XInput driver : 24.1
[    52.770] 	X.Org Server Extension : 10.0
[    52.776] (--) PCI:*(0:179:0:0) 10de:1c30:1028:11b3 rev 161, Mem @ 0xfa000000/16777216, 0xe0000000/268435456, 0xf0000000/33554432, I/O @ 0x0000f000/128, BIOS @ 0x????????/524288
[    52.776] (II) LoadModule: "glx"
[    52.776] (II) Loading /usr/lib64/xorg/modules/extensions/nvidia/libglx.so
[    52.781] (II) Module glx: vendor="NVIDIA Corporation"
[    52.781] 	compiled for 4.0.2, module version = 1.0.0
[    52.781] 	Module class: X.Org Server Extension
[    52.781] (II) NVIDIA GLX Module  390.25  Wed Jan 24 19:23:51 PST 2018
[    52.781] (II) LoadModule: "nvidia"
[    52.781] (II) Loading /usr/lib64/xorg/modules/drivers/nvidia_drv.so
[    52.781] (II) Module nvidia: vendor="NVIDIA Corporation"
[    52.781] 	compiled for 4.0.2, module version = 1.0.0
[    52.781] 	Module class: X.Org Video Driver
[    52.781] (II) NVIDIA dlloader X Driver  390.25  Wed Jan 24 18:57:05 PST 2018
[    52.781] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[    52.781] (++) using VT number 2

[    52.782] xf86EnableIOPorts: failed to set IOPL for I/O (Operation not permitted)
<=================== BEGIN IMPORTANT DIFFERENCES ============================= > 
[    52.782] (II) Loading sub module "fb"
[    52.782] (II) LoadModule: "fb"
[    52.782] (II) Loading /usr/lib64/xorg/modules/libfb.so
[    52.782] (II) Module fb: vendor="X.Org Foundation"
[    52.782] 	compiled for 1.19.3, module version = 1.0.0
[    52.782] 	ABI class: X.Org ANSI C Emulation, version 0.4
[    52.782] (II) Loading sub module "wfb"
[    52.782] (II) LoadModule: "wfb"
[    52.782] (II) Loading /usr/lib64/xorg/modules/libwfb.so
[    52.782] (II) Module wfb: vendor="X.Org Foundation"
[    52.782] 	compiled for 1.19.3, module version = 1.0.0
[    52.782] 	ABI class: X.Org ANSI C Emulation, version 0.4
[    52.782] (II) Loading sub module "ramdac"
[    52.782] (II) LoadModule: "ramdac"
[    52.782] (II) Module "ramdac" already built-in
[    52.799] (EE) NVIDIA: Failed to initialize the NVIDIA kernel module. Please see the
[    52.799] (EE) NVIDIA:     system's kernel log for additional error messages and
[    52.799] (EE) NVIDIA:     consult the NVIDIA README for details.
<=================== END IMPORTANT DIFFERENCES ============================= > 
[    52.799] (EE) No devices detected.
[    52.799] (EE) 
Fatal server error:
[    52.799] (EE) no screens found(EE) 
[    52.799] (EE) 
Please consult the The X.Org Foundation support 
	 at http://wiki.x.org
 for help. 
[    52.799] (EE) Please also check the log file at "/var/log/Xorg.0.log" for additional information.
[    52.799] (EE) 
[    52.806] (EE) Server terminated with error (1). Closing log file.
I noticed that no devices were detected as well (right below the marked portion). Out of curiosity, I tried informing Xorg of the device's bus ID. I obtained it with:

Code: Select all

lspci -v | grep -i VGA
0000:b3:00.0 VGA compatible controller: NVIDIA Corporation GP106GL [Quadro P2000] (rev a1) (prog-if 00 [VGA controller])
And entered it into the xorg.conf file like so:

Code: Select all

BusID = "PCI:0:b3:0:0" (I also tried just "PCI:b3:0:0")
In both cases, rebooting resulted in no more kernel module initialization error. Oddly, it also got rid of the rest of the portion marked in the previous log. I'm not sure what module "fb" and "wfb" have to do with anything? Perhaps they are loaded to try to determine the bus ID of the card if none is specified?

At any rate, this is the new output:

Code: Select all

[   250.840] 
X.Org X Server 1.19.3
Release Date: 2017-03-15
[   250.842] X Protocol Version 11, Revision 0
[   250.842] Build Operating System:  3.10.0-693.2.2.el7.x86_64 
[   250.843] Current Operating System: Linux seg02-h214 3.10.0-693.17.1.el7.x86_64 #1 SMP Thu Jan 25 20:13:58 UTC 2018 x86_64
[   250.843] Kernel command line: BOOT_IMAGE=/vmlinuz-3.10.0-693.17.1.el7.x86_64 root=/dev/mapper/centos-root ro rd.lvm.lv=centos/root rd.lvm.lv=centos/swap rhgb quiet LANG=en_US.UTF-8 nouveau.modeset=0 rd.driver.blacklist=nouveau plymouth.ignore-udev
[   250.844] Build Date: 25 January 2018  04:04:44PM
[   250.845] Build ID: xorg-x11-server 1.19.3-11.el7_4.1 
[   250.846] Current version of pixman: 0.34.0
[   250.847] 	Before reporting problems, check http://wiki.x.org
	to make sure that you have the latest version.
[   250.847] Markers: (--) probed, (**) from config file, (==) default setting,
	(++) from command line, (!!) notice, (II) informational,
	(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[   250.849] (==) Log file: "/var/log/Xorg.0.log", Time: Thu Mar  1 14:10:51 2018
[   250.850] (==) Using config file: "/etc/X11/xorg.conf"
[   250.851] (==) Using config directory: "/etc/X11/xorg.conf.d"
[   250.852] (==) Using system config directory "/usr/share/X11/xorg.conf.d"
[   250.852] (==) No Layout section.  Using the first Screen section.
[   250.852] (==) No screen section available. Using defaults.
[   250.852] (**) |-->Screen "Default Screen Section" (0)
[   250.852] (**) |   |-->Monitor "<default monitor>"
[   250.852] (==) No device specified for screen "Default Screen Section".
	Using the first device section listed.
[   250.852] (**) |   |-->Device "Videocard0"
[   250.852] (==) No monitor specified for screen "Default Screen Section".
	Using a default monitor configuration.
[   250.852] (==) Automatically adding devices
[   250.852] (==) Automatically enabling devices
[   250.852] (==) Automatically adding GPU devices
[   250.852] (==) Automatically binding GPU devices
[   250.852] (==) Max clients allowed: 256, resource mask: 0x1fffff
[   250.852] (==) FontPath set to:
	catalogue:/etc/X11/fontpath.d,
	built-ins
[   250.852] (**) ModulePath set to "/usr/lib64/xorg/modules/extensions/nvidia,/usr/lib64/xorg/modules"
[   250.852] (II) The server relies on udev to provide the list of input devices.
	If no devices become available, reconfigure udev or disable AutoAddDevices.
[   250.852] (II) Loader magic: 0x564748b0d020
[   250.852] (II) Module ABI versions:
[   250.852] 	X.Org ANSI C Emulation: 0.4
[   250.852] 	X.Org Video Driver: 23.0
[   250.852] 	X.Org XInput driver : 24.1
[   250.852] 	X.Org Server Extension : 10.0
[   250.857] (--) PCI:*(0:179:0:0) 10de:1c30:1028:11b3 rev 161, Mem @ 0xfa000000/16777216, 0xe0000000/268435456, 0xf0000000/33554432, I/O @ 0x0000f000/128, BIOS @ 0x????????/524288
[   250.857] (II) LoadModule: "glx"
[   250.857] (II) Loading /usr/lib64/xorg/modules/extensions/nvidia/libglx.so
[   250.862] (II) Module glx: vendor="NVIDIA Corporation"
[   250.862] 	compiled for 4.0.2, module version = 1.0.0
[   250.862] 	Module class: X.Org Server Extension
[   250.862] (II) NVIDIA GLX Module  390.25  Wed Jan 24 19:23:51 PST 2018
[   250.862] (II) LoadModule: "nvidia"
[   250.862] (II) Loading /usr/lib64/xorg/modules/drivers/nvidia_drv.so
[   250.862] (II) Module nvidia: vendor="NVIDIA Corporation"
[   250.862] 	compiled for 4.0.2, module version = 1.0.0
[   250.862] 	Module class: X.Org Video Driver
[   250.862] (II) NVIDIA dlloader X Driver  390.25  Wed Jan 24 18:57:05 PST 2018
[   250.862] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[   250.862] (++) using VT number 3

[   250.862] xf86EnableIOPorts: failed to set IOPL for I/O (Operation not permitted)
[   250.862] (EE) No devices detected.
[   250.862] (EE) 
Fatal server error:
[   250.862] (EE) no screens found(EE) 
[   250.862] (EE) 
Please consult the The X.Org Foundation support 
	 at http://wiki.x.org
 for help. 
[   250.862] (EE) Please also check the log file at "/var/log/Xorg.0.log" for additional information.
[   250.862] (EE) 
[   250.868] (EE) Server terminated with error (1). Closing log file.
Note that still no devices were detected.

I would greatly appreciate some help! Am I doing something wrong to make the driver not find the device? Is there something more sinister at work?

Thanks in advance for your time and help!

User avatar
TrevorH
Site Admin
Posts: 33202
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: nVidia Drivers not detecting Quadro card

Post by TrevorH » 2018/03/01 23:27:39

Any clues in /var/log/messages?
The future appears to be RHEL or Debian. I think I'm going Debian.
Info for USB installs on http://wiki.centos.org/HowTos/InstallFromUSBkey
CentOS 5 and 6 are deadest, do not use them.
Use the FAQ Luke

philbax2
Posts: 3
Joined: 2018/03/01 21:57:00

Re: nVidia Drivers not detecting Quadro card

Post by philbax2 » 2018/03/02 15:54:16

Ah! Right, I should've posted that as well. I didn't notice anything, but I forgot to take another look there. Digging into the logs, I found this:

Code: Select all

Mar  1 12:51:22 <mymachine> kernel: [   29.070399] Request for unknown module key 'The ELRepo Project (http://elrepo.org): ELRepo.org Secure Boot Key: f365ad3481a7b20e3427b61b2a26635b83fe427b' err -11
Mar  1 12:51:22 <mymachine> gdm: GdmDisplay: display lasted 0.049749 seconds
Mar  1 12:51:22 <mymachine> kernel: [   29.119008] Request for unknown module key 'The ELRepo Project (http://elrepo.org): ELRepo.org Secure Boot Key: f365ad3481a7b20e3427b61b2a26635b83fe427b' err -11
Mar  1 12:51:22 <mymachine> gdm: GdmDisplay: display lasted 0.044567 seconds
Mar  1 12:51:22 <mymachine> kernel: [   29.167851] Request for unknown module key 'The ELRepo Project (http://elrepo.org): ELRepo.org Secure Boot Key: f365ad3481a7b20e3427b61b2a26635b83fe427b' err -11
Mar  1 12:51:22 <mymachine> gdm: GdmDisplay: display lasted 0.044460 seconds
Mar  1 12:51:22 <mymachine> systemd[1]: Started Postfix Mail Transport Agent.
Mar  1 12:51:22 <mymachine> kernel: [   29.239969] Request for unknown module key 'The ELRepo Project (http://elrepo.org): ELRepo.org Secure Boot Key: f365ad3481a7b20e3427b61b2a26635b83fe427b' err -11
Mar  1 12:51:22 <mymachine> gdm: GdmDisplay: display lasted 0.066340 seconds
Mar  1 12:51:22 <mymachine> kernel: [   29.296816] Request for unknown module key 'The ELRepo Project (http://elrepo.org): ELRepo.org Secure Boot Key: f365ad3481a7b20e3427b61b2a26635b83fe427b' err -11
Mar  1 12:51:22 <mymachine> gdm: GdmDisplay: display lasted 0.053012 seconds
Mar  1 12:51:22 <mymachine> gdm: GdmLocalDisplayFactory: maximum number of X display failures reached: check X server log for errors
I'm now looking into either disabling secure boot (which I don't believe was enabled on the other machine I was on; it wasn't booting via uefi), or adding the key with mokutil (which there's apparently a bug with mokutil right now: https://bugs.centos.org/view.php?id=14050).

I think this'll get me fixed up though.

Thank you for pointing me in the right direction! I tried so many things yesterday, and forgot to go back to the basics and dig through /var/log/messages again.
Last edited by philbax2 on 2018/03/02 19:35:46, edited 1 time in total.

User avatar
TrevorH
Site Admin
Posts: 33202
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: nVidia Drivers not detecting Quadro card

Post by TrevorH » 2018/03/02 16:13:50

ELrepo have instructions on how to add their key to the approved list so you could use that and secure boot.
The future appears to be RHEL or Debian. I think I'm going Debian.
Info for USB installs on http://wiki.centos.org/HowTos/InstallFromUSBkey
CentOS 5 and 6 are deadest, do not use them.
Use the FAQ Luke

philbax2
Posts: 3
Joined: 2018/03/01 21:57:00

Re: nVidia Drivers not detecting Quadro card

Post by philbax2 » 2018/03/02 19:37:19

Yeah, the only holdup would be that bug I linked to; apparently the shim doesn't ask you for the password for the new module's key during the next boot.

I ended up turning off secure boot and I'm up and running! Thanks again for the pointers. Sorry I didn't look there first. Hopefully someone else will see this and benefit from it someday.

Post Reply