Dual GeForce RTX 2080 Ti cards can't communicate in Base Mosaic configuration using NVIDIA 410.57 drivers
Posted: 2018/10/17 22:22:26
Software Environment
Distribution: CentOS 7.5.1804 (Core) "GNOME Shell 3.25.4"
Linux Kernel: 4.18.14-1.el7.elrepo.x86_64
Nvidia Drivers: 410.57
Hardware Environment:
Monitors: (Quantity: 2) Asus MG279 Monitors
Graphics Cards: (Quantity: 2) Asus Dual GeForce RTX 2080 Ti
Motherboard: X399 AORUS XTREME SocketTR4
CPU: Threadripper AMD 16 Core
Introduction
I started a discussion on the NVIDIA DevTalk forums. I hope it's OK if I link to that post instead of typing it all out again here. Here is the link: https://devtalk.nvidia.com/default/topi ... 13#5290775.
I have created a public repository on GitLab that contains pictures, screenshots, debug reports and other useful output. It is available at the following link:
https://gitlab.com/shanedora/dual_monit ... driver_410
I have struggled for a solid two weeks to get my new PC build up and running with CentOS 7, specifically with two Asus Dual GeForce RTX 2080 Ti cards. The main issue is that I cannot get dual monitors to work when each monitor is connected to a different card, as in the following physical setup:
Monitor_1 DisplayPort <----Cable----> GraphicsCard_1 DisplayPort
Monitor_2 DisplayPort <----Cable----> GraphicsCard_2 DisplayPort
Only one monitor works in this case. The other monitor is just gray with a CentOS "7" logo on it. Running the nvidia-debug-report.sh script produced a log file indicating that, in a Base Mosaic configuration, the two graphics cards can't communicate with each other.
[ 19.572] (II) NVIDIA GLX Module 410.57 Tue Sep 18 23:27:13 CDT 2018
[ 22.088] (EE) NVIDIA(GPU-0): Failed to find a valid Base Mosaic configuration.
[ 22.088] (EE) NVIDIA(GPU-0): Invalid Base Mosaic configuration 1 of 1:
[ 22.088] (EE) NVIDIA(GPU-0): GPUs:
[ 22.088] (EE) NVIDIA(GPU-0): 1) NVIDIA GPU at PCI:10:0:0
[ 22.088] (EE) NVIDIA(GPU-0): 2) NVIDIA GPU at PCI:66:0:0
[ 22.088] (EE) NVIDIA(GPU-0): Errors:
[ 22.088] (EE) NVIDIA(GPU-0): - The video link was not detected
[ 22.088] (WW) NVIDIA(GPU-0): Failed to find a valid Base Mosaic configuration for the
[ 22.088] (WW) NVIDIA(GPU-0): NVIDIA graphics device PCI:10:0:0. Please see Chapter 28:
[ 22.088] (WW) NVIDIA(GPU-0): Configuring SLI and Multi-GPU FrameRendering in the README
[ 22.088] (WW) NVIDIA(GPU-0): for troubleshooting suggestions.
[ 22.149] (EE) NVIDIA(GPU-0): Only one GPU will be used for this X screen.
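For anyone comparing their own logs against the excerpt above, here is a minimal Python sketch that pulls the (EE) messages and the GPU PCI addresses out of log text in that format. The function names parse_xorg_log and to_lspci are hypothetical helpers made up for illustration, not part of any NVIDIA or X.Org tool. The conversion assumes the X log prints PCI addresses in decimal while lspci prints them in hexadecimal, so PCI:10:0:0 and PCI:66:0:0 would correspond to 0a:00.0 and 42:00.0.

```python
import re

# Matches lines such as:
#   [ 22.088] (EE) NVIDIA(GPU-0): - The video link was not detected
LINE_RE = re.compile(r"^\[\s*(?P<time>\d+\.\d+)\]\s+\((?P<level>\S\S)\)\s+(?P<msg>.*)$")
# Matches PCI addresses as printed in the log, e.g. PCI:10:0:0
BUS_RE = re.compile(r"PCI:(\d+):(\d+):(\d+)")


def parse_xorg_log(text):
    """Return (errors, bus_ids) for the given log text.

    errors  : list of message strings from lines marked (EE)
    bus_ids : list of unique (bus, device, function) tuples, in order seen
    """
    errors, bus_ids = [], []
    for raw in text.splitlines():
        match = LINE_RE.match(raw.strip())
        if not match:
            continue  # skip anything that is not a timestamped log line
        for parts in BUS_RE.findall(match["msg"]):
            bus_id = tuple(int(p) for p in parts)
            if bus_id not in bus_ids:
                bus_ids.append(bus_id)
        if match["level"] == "EE":
            errors.append(match["msg"])
    return errors, bus_ids


def to_lspci(bus_id):
    """Format a decimal (bus, device, function) tuple the way lspci prints it."""
    bus, dev, fn = bus_id
    return f"{bus:02x}:{dev:02x}.{fn:x}"


# A few lines copied from the log excerpt in this post.
SAMPLE = """\
[ 22.088] (EE) NVIDIA(GPU-0): Failed to find a valid Base Mosaic configuration.
[ 22.088] (EE) NVIDIA(GPU-0): 1) NVIDIA GPU at PCI:10:0:0
[ 22.088] (EE) NVIDIA(GPU-0): 2) NVIDIA GPU at PCI:66:0:0
[ 22.088] (EE) NVIDIA(GPU-0): - The video link was not detected
[ 22.088] (WW) NVIDIA(GPU-0): NVIDIA graphics device PCI:10:0:0. Please see Chapter 28:
"""

errors, bus_ids = parse_xorg_log(SAMPLE)
for message in errors:
    print("error:", message)
for bus_id in bus_ids:
    print("GPU at", to_lspci(bus_id))
```

Run against the full Xorg log, this gives a quick list of which devices the driver tried to pair and which error it reported for them, which can then be checked against the lspci output.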
Installation
I was able to install CentOS with the nouveau drivers, upgrade to the latest mainline kernel (4.18), download kernel-ml-devel and kernel-ml-headers from the elrepo.org repository (release 7.0-3), and build the NVIDIA driver against the 4.18 kernel. I used 410.57, the latest driver and the one recommended for the 2080 Ti. The entire process went smoothly without any noticeable errors.
I realize "without noticeable errors" is ambiguous. By it I simply mean there were no error messages that caused the installation to halt or fail, or that otherwise alerted me in a way that could not be ignored.
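Since the install started on nouveau and then moved to the NVIDIA driver, one sanity check is to confirm which of the two kernel modules is actually loaded. The following is a rough Python sketch, assuming the usual /proc/modules layout where the module name is the first whitespace-separated field on each line. The helper names and the sample text (including its sizes and reference counts) are made up for illustration and are not output from my machine.

```python
def loaded_modules(proc_modules_text):
    """Return the set of module names found in /proc/modules-style text."""
    names = set()
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if fields:  # ignore blank lines
            names.add(fields[0])
    return names


def gpu_driver_status(proc_modules_text):
    """Report whether the nvidia and nouveau modules appear in the text."""
    modules = loaded_modules(proc_modules_text)
    return {"nvidia": "nvidia" in modules, "nouveau": "nouveau" in modules}


# Illustrative sample only; the numbers here are invented.
SAMPLE_MODULES = """\
nvidia_drm 45056 4 - Live 0x0000000000000000
nvidia_modeset 1048576 7 nvidia_drm, Live 0x0000000000000000
nvidia 16621568 300 nvidia_modeset, Live 0x0000000000000000
"""

status = gpu_driver_status(SAMPLE_MODULES)
print("nvidia loaded: ", status["nvidia"])
print("nouveau loaded:", status["nouveau"])
```

On a real system the text would come from reading /proc/modules. Seeing nouveau listed after the NVIDIA install would be the kind of silent problem that the installer output alone might not make obvious.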
I didn't receive any guidance or feedback from the NVIDIA team on the forum, other than from one random chap who was really helpful but eventually ran out of ideas and suspected it could be a bug in the driver. I have since reached out directly to NVIDIA support and a ticket has been created. This just happened, so I probably won't receive feedback until late tonight or tomorrow.
I've heard of people running these bleeding-edge RTX 2080 Ti cards on Linux, so I would like to get more exposure on this issue. That is why I'm posting here, in case someone else is in the same boat or has a suggestion.