Dual GeForce RTX 2080 TI's can't communicate in Base Mosaic Configuration using Nvidia 410.57 drivers

Issues related to applications and software problems
Post Reply
shanedora
Posts: 9
Joined: 2018/10/17 21:40:17

Dual GeForce RTX 2080 TI's can't communicate in Base Mosaic Configuration using Nvidia 410.57 drivers

Post by shanedora » 2018/10/17 22:22:26

Software Environment

Distribution: CentOS 7.5.1804 (Core) "GNOME Shell 3.25.4"
Linux Kernel: 4.18.14-1.el7.elrepo.x86_64
Nvidia Drivers: 410.57

Hardware Environment:

Monitors: (Quantity: 2) Asus MG279 Monitors
Graphic Cards: (Quantity: 2) Asus Dual GeForce RTX 2080TI's
Motherboard: X399 AORUS XTREME SocketTR4
CPU: Threadripper AMD 16 Core


Introduction

I started a discussion on the nvidia dev talk forums. I hope it's ok if I redirect to the forum post instead of typing it all out again here. Here is the link https://devtalk.nvidia.com/default/topi ... 13#5290775.

I have created a repository on gitlab that contains pictures, screenshots, debug reports and other useful output. This repository is public and is available in the following link.
https://gitlab.com/shanedora/dual_monit ... driver_410

I have really struggled :( here for a solid two weeks on getting my new PC build up and running with CentOS 7 specifically with two Asus DUAL GeForce RTX 2080 TI's. The main issue here is that I cannot get dual monitors to work with two RTX 2080TI's that are connected to my two monitors in the following physical setup...

Monitor_1 DisplayPort <----Cable----> GraphicsCard_1 DisplayPort
Monitor_2 DisplayPort <----Cable----> GraphicsCard_2 DisplayPort

Only one monitor is working in this case. The other monitor is just gray with a centos "7" logo on it. Running the nvidia-debug-report.sh script has created a log file indicating that in a Base Mosaic configuration the two graphic cards can't communicate with each other.

Code: Select all

[    19.572] (II) NVIDIA GLX Module  410.57  Tue Sep 18 23:27:13 CDT 2018
[    22.088] (EE) NVIDIA(GPU-0): Failed to find a valid Base Mosaic configuration.
[    22.088] (EE) NVIDIA(GPU-0): Invalid Base Mosaic configuration 1 of 1:
[    22.088] (EE) NVIDIA(GPU-0): GPUs:
[    22.088] (EE) NVIDIA(GPU-0):     1) NVIDIA GPU at PCI:10:0:0
[    22.088] (EE) NVIDIA(GPU-0):     2) NVIDIA GPU at PCI:66:0:0
[    22.088] (EE) NVIDIA(GPU-0): Errors:
[    22.088] (EE) NVIDIA(GPU-0):     - The video link was not detected
[    22.088] (WW) NVIDIA(GPU-0): Failed to find a valid Base Mosaic configuration for the
[    22.088] (WW) NVIDIA(GPU-0):     NVIDIA graphics device PCI:10:0:0. Please see Chapter 28:
[    22.088] (WW) NVIDIA(GPU-0):     Configuring SLI and Multi-GPU FrameRendering in the README
[    22.088] (WW) NVIDIA(GPU-0):     for troubleshooting suggestions.
[    22.149] (EE) NVIDIA(GPU-0): Only one GPU will be used for this X screen.

Installation

I've been able to install CentOS with the nouveau drivers, upgrade to the latest mainline kernel being 4.18, download the kernel-ml-devel, kernel-ml-headers from the elrepo.org repository release 7.0-3 and rebuild the 4.18 kernel with the correct NVIDIA Drivers (recommended for the 2080TI's) which are the latest drivers being 410.57. This entire process went very smoothly without any noticeable errors.

I realize the term "without noticeable errors" is ambiguous. So by it I simply mean error messages that would cause the installation to halt/fail or otherwise alert the user in some fashion that cannot be ignored.

I didn't receive any guidance or feedback from the nvidia team other than one random chap who was actually really helpful but eventually ran out of ideas and assumed it could be a bug in the driver . I reached out directly to the Nvidia support team and a ticket has been created. This just happened so I probably won't receive feedback until tomorrow or late tonight.

I've heard of people running these bleeding edge RTX 2080TI's with Linux which is why I would like to get some more exposure on this issue. This is why I wanted to post something here should someone else be in the same boat or may have a suggestion.

espressobeanies
Posts: 10
Joined: 2018/09/09 20:16:00

Re: Dual GeForce RTX 2080 TI's can't communicate in Base Mosaic Configuration using Nvidia 410.57 drivers

Post by espressobeanies » 2018/10/18 17:07:21

shanedora,

I haven't heard of a person in CentOS getting native NVIDIA or ATI drivers working. Usually people would change to Ubuntu since it's more of a cutting-edge distro.

Ahmed_barakat
Posts: 18
Joined: 2017/08/17 13:46:12

Re: Dual GeForce RTX 2080 TI's can't communicate in Base Mosaic Configuration using Nvidia 410.57 drivers

Post by Ahmed_barakat » 2018/10/18 21:44:25

I am no expert and I struggled a lot till I managed to install the Nvidia Native drivers on my system, I a have the GTX 1080.
every guide I found was mentioning disabling Nouveau and the gdm and booting to non-GUI and install new kernel-devel and then install drivers.
but it never worked for me.
I only managed to do this when I installed Centos fresh without Nouveau driver. you have this option while installing.
then when it first booted I would install the Nvidia drivers.
Hope this helps.
Ahmed

shanedora
Posts: 9
Joined: 2018/10/17 21:40:17

Re: Dual GeForce RTX 2080 TI's can't communicate in Base Mosaic Configuration using Nvidia 410.57 drivers

Post by shanedora » 2018/10/22 22:22:05

Thanks Guys ~

I believe I've resolved my issue. I went off and bought a Nvlink SLI Adapter since the new RTX 2080 TI's require this new type of SLI bridge. The system has no problem now in the Base Mosaic configuration with dual monitors. I'm not sure why this was needed or even if it was the root cause. However, after installing the Nvidia 410.57 drivers with the Nvlink adapter hooked up the two cards were able to see each other. I ran the nvidia debug script to detect any current issues/errors and there were none. The system has been running smoothly now for 4 days. This post can now be closed.

amk
Posts: 16
Joined: 2015/06/02 08:02:58

Re: Dual GeForce RTX 2080 TI's can't communicate in Base Mosaic Configuration using Nvidia 410.57 drivers

Post by amk » 2018/10/24 11:30:49

espressobeanies wrote:
2018/10/18 17:07:21
shanedora,

I haven't heard of a person in CentOS getting native NVIDIA or ATI drivers working. Usually people would change to Ubuntu since it's more of a cutting-edge distro.
I think this is bot, because the level of nonsense and another so called distro advertisement present is just striking. ;)


Post Reply