I have 2 production machines with Centos 7.7. They've been off for the summer, and when turned back on X11 won't come up. They use the NVidia manual install driver, so I reinstalled it, but no go. I've tried many things:
- reinstalled the nvidia driver (390.27)
- installed the lastest nvidia drier (430.50)
- switched to nouveau
- tried the onboard intel graphic card
- tried the previous kernel
- switched from gdm to sddm and back
I always end up with variations of a black screen. Sometimes with boot messages. Sometimes I can switch to Ctrl-Alt-F2 console, sometimes all I get is a blinking cursor (I can ssh to the machines though)
$ uname -a Linux MyMachine 3.10.0-1062.1.2.el7.x86_64 #1 SMP Fri Sep 20 08:24:10 CDT 2019 x86_64 x86_64 x86_64 GNU/Linux $ cat /etc/*release* NAME="Scientific Linux" VERSION="7.7 (Nitrogen)" $ lspci | grep -i "graph\|nv" 00:02.0 Display controller: Intel Corporation HD Graphics 630 (rev 04) 01:00.0 VGA compatible controller: NVIDIA Corporation GP107GL [Quadro P400] (rev a1) $ lsmod | grep -i "nv\|nouv\|intel\|915" nvidia_drm 43690 0 nvidia_modeset 1112578 1 nvidia_drm nvidia 19040853 1 nvidia_modeset intel_pmc_core 18348 0 intel_powerclamp 14451 0 intel_rapl 19451 0 kvm_intel 188644 0 kvm 621480 1 kvm_intel ghash_clmulni_intel 13273 0 snd_hda_intel 40260 2 snd_hda_codec 136060 4 snd_hda_codec_realtek,snd_hda_codec_hdmi,snd_hda_codec_generic,snd_hda_intel snd_hda_core 90971 5 snd_hda_codec_realtek,snd_hda_codec_hdmi,snd_hda_codec_generic,snd_hda_codec,snd_hda_intel aesni_intel 189456 0 lrw 13286 1 aesni_intel glue_helper 13990 1 aesni_intel ablk_helper 13597 1 aesni_intel cryptd 21190 3 ghash_clmulni_intel,aesni_intel,ablk_helper snd_pcm 105777 4 snd_hda_codec_hdmi,snd_hda_codec,snd_hda_intel,snd_hda_core intel_wmi_thunderbolt 12826 0 ipmi_msghandler 56728 2 ipmi_devintf,nvidia snd 83941 14 snd_hda_codec_realtek,snd_hwdep,snd_timer,snd_hda_codec_hdmi,snd_pcm,snd_seq,snd_hda_codec_generic,snd_hda_codec,snd_hda_intel,snd_seq_device pinctrl_intel 23466 1 pinctrl_sunrisepoint i915 1850573 1 i2c_algo_bit 13413 1 i915 iosf_mbi 15582 2 i915,intel_rapl drm_kms_helper 186531 2 i915,nvidia_drm crc32c_intel 22094 1 drm 456166 4 i915,drm_kms_helper,nvidia_drm wmi 21636 4 dell_smbios,dell_wmi_descriptor,dell_wmi,intel_wmi_thunderbolt video 24538 1 i915 $ dmesg | grep -i "graph\|nvidia\|error\|warn" ACPI Error: [\_SB_.PCI0.XHC_.RHUB.HS11] Namespace lookup failure, AE_NOT_FOUND (20130517/dswload-194) ERST: Error Record Serialization Table (ERST) support is initialized. input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input18 input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input19 input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input20 nvidia: loading out-of-tree module taints kernel. nvidia: module license 'NVIDIA' taints kernel. nvidia: module verification failed: signature and/or required key missing - tainting kernel nvidia-nvlink: Nvlink Core is being initialized, major device number 240 NVRM: loading NVIDIA UNIX x86_64 Kernel Module 430.50 Thu Sep 5 22:36:31 CDT 2019 nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 430.50 Thu Sep 5 22:39:50 CDT 2019 [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 1 nvidia 0000:01:00.0: irq 132 for MSI/MSI-X nvidia 0000:01:00.0: irq 132 for MSI/MSI-X nvidia 0000:01:00.0: irq 132 for MSI/MSI-X nvidia 0000:01:00.0: irq 132 for MSI/MSI-X nvidia 0000:01:00.0: irq 132 for MSI/MSI-X nvidia 0000:01:00.0: irq 132 for MSI/MSI-X $ grep -i "graph\|nv\|nouv\|error\|warn" /var/log/messages kernel: Command line: BOOT_IMAGE=/vmlinuz-3.10.0-1062.1.2.el7.x86_64 root=/dev/mapper/sl_MyMachine-root ro rd.lvm.lv=sl_MyMachine/root rd.lvm.lv=sl_MyMachine/swap rd.driver.blacklist=nouveau nouveau.modeset=0 rhgb quiet kernel: BIOS-e820: [mem 0x000000007da26000-0x000000007da26fff] ACPI NVS kernel: BIOS-e820: [mem 0x000000008b6e7000-0x000000008c007fff] ACPI NVS kernel: Kernel command line: BOOT_IMAGE=/vmlinuz-3.10.0-1062.1.2.el7.x86_64 root=/dev/mapper/sl_MyMachine-root ro rd.lvm.lv=sl_MyMachine/root rd.lvm.lv=sl_MyMachine/swap rd.driver.blacklist=nouveau nouveau.modeset=0 rhgb quiet kernel: ACPI Error: [\_SB_.PCI0.XHC_.RHUB.HS11] Namespace lookup failure, AE_NOT_FOUND (20130517/dswload-194) kernel: DMAR-IR: Queued invalidation will be enabled to support x2apic and Intr-remapping. kernel: PM: Registering ACPI NVS region [mem 0x7da26000-0x7da26fff] (4096 bytes) kernel: PM: Registering ACPI NVS region [mem 0x8b6e7000-0x8c007fff] (9572352 bytes) kernel: ERST: Error Record Serialization Table (ERST) support is initialized. kernel: rtc_cmos 00:06: alarms up to one month, y3k, 242 bytes nvram, hpet irqs kernel: tg3 0000:04:00.0: invalid short VPD tag 00 at offset 51 systemd-udevd: invalid key/value pair in file /usr/lib/udev/rules.d/59-fc-wwpn-id.rules on line 10, starting at character 26 (';') systemd-udevd: invalid key/value pair in file /usr/lib/udev/rules.d/59-fc-wwpn-id.rules on line 11, starting at character 29 (';') systemd-udevd: invalid key/value pair in file /usr/lib/udev/rules.d/59-fc-wwpn-id.rules on line 12, starting at character 25 (';') kernel: input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input18 kernel: input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input19 kernel: input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input20 kernel: nvidia: loading out-of-tree module taints kernel. kernel: nvidia: module license 'NVIDIA' taints kernel. kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 240 kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module 430.50 Thu Sep 5 22:36:31 CDT 2019 kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 430.50 Thu Sep 5 22:39:50 CDT 2019 kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver kernel: [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 1 smartd[1339]: Monitoring 1 ATA/SATA, 0 SCSI/SAS and 0 NVMe devices systemd: Reached target Graphical Interface. gdm: GdmLocalDisplayFactory: maximum number of X display failures reached: check X server log for errors ModemManager[1338]: <warn> Could not grab port (tty/ttyS1): 'Cannot add port 'tty/ttyS1', unhandled serial type' ModemManager[1338]: <warn> Couldn't create modem for device at '/sys/devices/pci0000:00/0000:00:16.3': Failed to find primary AT port $ cat /var/log/Xorg.0.log X.Org X Server 1.20.4 X Protocol Version 11, Revision 0 Build Operating System: 3.10.0-957.12.2.el7.x86_64 Current Operating System: Linux MyMachine 3.10.0-1062.1.2.el7.x86_64 #1 SMP Fri Sep 20 08:24:10 CDT 2019 x86_64 Kernel command line: BOOT_IMAGE=/vmlinuz-3.10.0-1062.1.2.el7.x86_64 root=/dev/mapper/sl_MyMachine-root ro rd.lvm.lv=sl_MyMachine/root rd.lvm.lv=sl_MyMachine/swap rd.driver.blacklist=nouveau nouveau.modeset=0 rhgb quiet Build Date: 07 August 2019 08:52:04AM Build ID: xorg-x11-server 1.20.4-7.el7 Current version of pixman: 0.34.0 Before reporting problems, check http://wiki.x.org to make sure that you have the latest version. Markers: (--) probed, (**) from config file, (==) default setting, (++) from command line, (!!) notice, (II) informational, (WW) warning, (EE) error, (NI) not implemented, (??) unknown. (==) Log file: "/var/log/Xorg.0.log", Time: Tue Oct 8 08:41:13 2019 (==) Using config file: "/etc/X11/xorg.conf" (==) Using config directory: "/etc/X11/xorg.conf.d" (==) Using system config directory "/usr/share/X11/xorg.conf.d" (==) ServerLayout "Layout0" (**) |-->Screen "Screen0" (0) (**) | |-->Monitor "Monitor0" (**) | |-->Device "Device0" (**) |-->Input Device "Keyboard0" (**) |-->Input Device "Mouse0" (==) Automatically adding devices (==) Automatically enabling devices (==) Automatically adding GPU devices (==) Automatically binding GPU devices (==) Max clients allowed: 256, resource mask: 0x1fffff (==) FontPath set to: catalogue:/etc/X11/fontpath.d, built-ins (==) ModulePath set to "/usr/lib64/xorg/modules" (WW) Hotplugging is on, devices using drivers 'kbd', 'mouse' or 'vmmouse' will be disabled. (WW) Disabling Keyboard0 (WW) Disabling Mouse0 (II) Loader magic: 0x556179962020 (II) Module ABI versions: X.Org ANSI C Emulation: 0.4 X.Org Video Driver: 24.0 X.Org XInput driver : 24.1 X.Org Server Extension : 10.0 (II) xfree86: Adding drm device (/dev/dri/card1) (II) xfree86: Adding drm device (/dev/dri/card0) (--) PCI: (0@0:2:0) 8086:5912:1028:06b7 rev 4, Mem @ 0xed000000/16777216, 0xc0000000/268435456, I/O @ 0x0000f000/64, BIOS @ 0x????????/65536 (--) PCI:*(1@0:0:0) 10de:1cb3:1028:11be rev 161, Mem @ 0xee000000/16777216, 0xd0000000/268435456, 0xe0000000/33554432, I/O @ 0x0000e000/128, BIOS @ 0x????????/524288 (II) LoadModule: "glx" (II) Loading /usr/lib64/xorg/modules/extensions/libglx.so (II) Module glx: vendor="X.Org Foundation" compiled for 1.19.5, module version = 1.0.0 ABI class: X.Org Server Extension, version 10.0 (II) LoadModule: "nvidia" (II) Loading /usr/lib64/xorg/modules/drivers/nvidia_drv.so (II) Module nvidia: vendor="NVIDIA Corporation" compiled for 1.6.99.901, module version = 1.0.0 Module class: X.Org Video Driver (II) NVIDIA dlloader X Driver 430.50 Thu Sep 5 22:43:53 CDT 2019 (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs (++) using VT number 1 (II) Loading sub module "fb" (II) LoadModule: "fb" (II) Loading /usr/lib64/xorg/modules/libfb.so (II) Module fb: vendor="X.Org Foundation" compiled for 1.20.4, module version = 1.0.0 ABI class: X.Org ANSI C Emulation, version 0.4 (II) Loading sub module "wfb" (II) LoadModule: "wfb" (II) Loading /usr/lib64/xorg/modules/libwfb.so (II) Module wfb: vendor="X.Org Foundation" compiled for 1.20.4, module version = 1.0.0 ABI class: X.Org ANSI C Emulation, version 0.4 (II) Loading sub module "ramdac" (II) LoadModule: "ramdac" (II) Module "ramdac" already built-in (**) NVIDIA(0): Depth 24, (--) framebuffer bpp 32 (==) NVIDIA(0): RGB weight 888 (==) NVIDIA(0): Default visual is TrueColor (==) NVIDIA(0): Using gamma correction (1.0, 1.0, 1.0) (**) NVIDIA(0): Enabling 2D acceleration (II) Loading sub module "glxserver_nvidia" (II) LoadModule: "glxserver_nvidia" (II) Loading /usr/lib64/xorg/modules/extensions/libglxserver_nvidia.so (II) Module glxserver_nvidia: vendor="NVIDIA Corporation" compiled for 1.6.99.901, module version = 1.0.0 Module class: X.Org Server Extension (II) NVIDIA GLX Module 430.50 Thu Sep 5 22:41:46 CDT 2019 (--) NVIDIA(0): Valid display device(s) on GPU-0 at PCI:1:0:0 (--) NVIDIA(0): DFP-0 (--) NVIDIA(0): DFP-1 (--) NVIDIA(0): DFP-2 (--) NVIDIA(0): DFP-3 (--) NVIDIA(0): DFP-4 (boot) (--) NVIDIA(0): DFP-5 (II) NVIDIA(0): NVIDIA GPU Quadro P400 (GP107GL-A) at PCI:1:0:0 (GPU-0) (--) NVIDIA(0): Memory: 2097152 kBytes (--) NVIDIA(0): VideoBIOS: 86.07.3b.00.50 (II) NVIDIA(0): Detected PCI Express Link width: 16X (--) NVIDIA(GPU-0): DFP-0: disconnected (--) NVIDIA(GPU-0): DFP-0: Internal DisplayPort (--) NVIDIA(GPU-0): DFP-0: 1440.0 MHz maximum pixel clock (--) NVIDIA(GPU-0): (--) NVIDIA(GPU-0): DFP-1: disconnected (--) NVIDIA(GPU-0): DFP-1: Internal TMDS (--) NVIDIA(GPU-0): DFP-1: 165.0 MHz maximum pixel clock (--) NVIDIA(GPU-0): (--) NVIDIA(GPU-0): DFP-2: disconnected (--) NVIDIA(GPU-0): DFP-2: Internal DisplayPort (--) NVIDIA(GPU-0): DFP-2: 1440.0 MHz maximum pixel clock (--) NVIDIA(GPU-0): (--) NVIDIA(GPU-0): DFP-3: disconnected (--) NVIDIA(GPU-0): DFP-3: Internal TMDS (--) NVIDIA(GPU-0): DFP-3: 165.0 MHz maximum pixel clock (--) NVIDIA(GPU-0): (--) NVIDIA(GPU-0): DELL U2412M (DFP-4): connected (--) NVIDIA(GPU-0): DELL U2412M (DFP-4): Internal DisplayPort (--) NVIDIA(GPU-0): DELL U2412M (DFP-4): 1440.0 MHz maximum pixel clock (--) NVIDIA(GPU-0): (--) NVIDIA(GPU-0): DFP-5: disconnected (--) NVIDIA(GPU-0): DFP-5: Internal TMDS (--) NVIDIA(GPU-0): DFP-5: 165.0 MHz maximum pixel clock (--) NVIDIA(GPU-0): (==) NVIDIA(0): (==) NVIDIA(0): No modes were requested; the default mode "nvidia-auto-select" (==) NVIDIA(0): will be used as the requested mode. (==) NVIDIA(0): (II) NVIDIA(0): Validated MetaModes: (II) NVIDIA(0): "DFP-4:nvidia-auto-select" (II) NVIDIA(0): Virtual screen size determined to be 1920 x 1200 (--) NVIDIA(0): DPI set to (93, 95); computed from "UseEdidDpi" X config (--) NVIDIA(0): option (II) NVIDIA: Using 24576.00 MB of virtual memory for indirect memory (II) NVIDIA: access. (II) NVIDIA(0): ACPI: failed to connect to the ACPI event daemon; the daemon (II) NVIDIA(0): may not be running or the "AcpidSocketPath" X (II) NVIDIA(0): configuration option may not be set correctly. When the (II) NVIDIA(0): ACPI event daemon is available, the NVIDIA X driver will (II) NVIDIA(0): try to use it to receive ACPI event notifications. For (II) NVIDIA(0): details, please see the "ConnectToAcpid" and (II) NVIDIA(0): "AcpidSocketPath" X configuration options in Appendix B: X (II) NVIDIA(0): Config Options in the README. (II) NVIDIA(0): Setting mode "DFP-4:nvidia-auto-select" (==) NVIDIA(0): Disabling shared memory pixmaps (==) NVIDIA(0): Backing store enabled (==) NVIDIA(0): Silken mouse enabled (**) NVIDIA(0): DPMS enabled (II) Loading sub module "dri2" (II) LoadModule: "dri2" (II) Module "dri2" already built-in Setup complete VDPAU driver: nvidia (II) Initializing extension Generic Event Extension ... (II) Initializing extension GLX (II) AIGLX: Screen 0 is not DRI2 capable (EE) AIGLX: reverting to software rendering (II) IGLX: enabled GLX_MESA_copy_sub_buffer (EE) (EE) Backtrace: (EE) 0: /usr/bin/X (xorg_backtrace+0x55) [0x5561796d44f5] (EE) 1: /usr/bin/X (0x556179523000+0x1b5179) [0x5561796d8179] (EE) 2: /lib64/libpthread.so.0 (0x7fde404a0000+0xf630) [0x7fde404af630] (EE) 3: /usr/lib64/xorg/modules/extensions/libglx.so (0x7fde3d7a1000+0x29341) [0x7fde3d7ca341] (EE) 4: /usr/lib64/xorg/modules/extensions/libglx.so (0x7fde3d7a1000+0x28772) [0x7fde3d7c9772] (EE) 5: /usr/lib64/xorg/modules/extensions/libglx.so (0x7fde3d7a1000+0x27b5a) [0x7fde3d7c8b5a] (EE) 6: /usr/bin/X (InitExtensions+0x5d) [0x5561795eb7fd] (EE) 7: /usr/bin/X (0x556179523000+0x60356) [0x556179583356] (EE) 8: /lib64/libc.so.6 (__libc_start_main+0xf5) [0x7fde400f4545] (EE) 9: /usr/bin/X (0x556179523000+0x4a61e) [0x55617956d61e] (EE) (EE) Segmentation fault at address 0x7fde3bb479a0 (EE) Fatal server error: (EE) Caught signal 11 (Segmentation fault). Server aborting