Bug 1137067

Summary: latest 4.12.14-lp151.28.4-default , gets kernel NULL pointer dereference during nouveau drm establishment.
Product: [openSUSE] openSUSE Distribution Reporter: Ross Green <greenfross>
Component: KernelAssignee: E-mail List <kernel-maintainers>
Status: RESOLVED DUPLICATE QA Contact: E-mail List <qa-bugs>
Severity: Critical    
Priority: P5 - None CC: greenfross, meissner, mrmazda, paul.richards, ralf.koelmel, tcj, tiwai
Version: Leap 15.1   
Target Milestone: ---   
Hardware: x86-64   
OS: Other   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Attachments: copy of boot.msg on failed boot
journalctl -b -1 after previously booted to lp151.28.4
journalctl -b via remote login booted to lp151.28.4
boot.msg from NVIDIA geforce RTX 2070 card showing stack trace.

Description Ross Green 2019-06-02 16:47:36 UTC
User-Agent:       Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Firefox/60.0
Build Identifier: 

The following boot.msg extract from the latest kernel latest 4.12.14-lp151.28.4-default after update.
The previous kernel 4.12.14-lp151.27-default does successfully boots and brings up the normal desktop screen.

The new kernel does boot but because of the NULL pointer problem, the graphic screen never appears.


boot.msg from 4.12.14-lp151.28.4-default

<6>[    0.000000] microcode: microcode updated early to revision 0x2f, date = 2019-02-17
<5>[    0.000000] Linux version 4.12.14-lp151.28.4-default (geeko@buildhost) (gcc version 7.4.0 (SUSE Linux) ) #1 SMP Fri May 24 07:57:46 UTC 2019 (af35fd1)
<6>[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-4.12.14-lp151.28.4-default root=UUID=be592af4-7121-428e-85c2-798265147fd4
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
<6>[    0.000000] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256
<6>[    0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'standard' format.
<6>[    0.000000] e820: BIOS-provided physical RAM map:
......
<6>[    5.897389] nouveau 0000:01:00.0: fb: 3072 MiB GDDR5
<6>[    5.920728] sr 2:0:0:0: [sr0] scsi3-mmc drive: 62x/62x writer dvd-ram cd/rw xa/form2 cdda tray
<6>[    5.920743] cdrom: Uniform CD-ROM driver Revision: 3.20
<7>[    5.920910] sr 2:0:0:0: Attached scsi CD-ROM sr0
<6>[    5.956994] [TTM] Zone  kernel: Available graphics memory: 8196364 kiB
<6>[    5.957002] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
<6>[    5.957005] [TTM] Initializing pool allocator
<6>[    5.957013] [TTM] Initializing DMA pool allocator
<6>[    5.957042] nouveau 0000:01:00.0: DRM: VRAM: 3072 MiB
<6>[    5.957046] nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
<6>[    5.957052] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
<6>[    5.957056] nouveau 0000:01:00.0: DRM: DCB version 4.0
<6>[    5.957061] nouveau 0000:01:00.0: DRM: DCB outp 00: 01800323 00010034
<6>[    5.957065] nouveau 0000:01:00.0: DRM: DCB outp 01: 02011300 00000000
<6>[    5.957069] nouveau 0000:01:00.0: DRM: DCB outp 02: 08022382 00020010
<6>[    5.957073] nouveau 0000:01:00.0: DRM: DCB conn 00: 00000040
<6>[    5.957077] nouveau 0000:01:00.0: DRM: DCB conn 01: 00000100
<6>[    5.957081] nouveau 0000:01:00.0: DRM: DCB conn 02: 00010261
<6>[    5.957766] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
<6>[    5.957770] [drm] Driver supports precise vblank timestamp query.
<6>[    5.958580] nouveau 0000:01:00.0: DRM: MM: using COPY0 for buffer copies
<1>[    5.990023] BUG: unable to handle kernel NULL pointer dereference at           (null)
<1>[    5.990034] IP:           (null)
<6>[    5.990038] PGD 0 P4D 0 
<4>[    5.990043] Oops: 0010 [#1] SMP PTI
<4>[    5.990056] CPU: 0 PID: 268 Comm: systemd-udevd Not tainted 4.12.14-lp151.28.4-default #1 openSUSE Leap 15.1
<4>[    5.990062] Hardware name: ASUSTeK Computer Inc. G74Sx/G74Sx, BIOS G74Sx.201 04/27/2011
<4>[    5.990067] task: ffff88040ab06080 task.stack: ffffc90002380000
<4>[    5.990070] RIP: 0010:          (null)
<4>[    5.990074] RSP: 0018:ffffc900023837a8 EFLAGS: 00010246
<4>[    5.990078] RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc9000238383f
<4>[    5.990081] RDX: ffff88040ab06080 RSI: ffffc900023837b8 RDI: ffff88040762c490
<4>[    5.990085] RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88040ea23980
<4>[    5.990088] R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
<4>[    5.990092] R13: ffff88040762c490 R14: 0000000000000000 R15: ffff880409995c18
<4>[    5.990096] FS:  00007fc2135b0d40(0000) GS:ffff88040ea00000(0000) knlGS:0000000000000000
<4>[    5.990101] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[    5.990105] CR2: 0000000000000000 CR3: 000000040999a003 CR4: 00000000000606f0
<4>[    5.990109] Call Trace:
<4>[    5.990122]  ? drm_dp_dpcd_access+0x69/0xf0 [drm_kms_helper]
<4>[    5.990129]  ? drm_dp_dpcd_read+0x2e/0xc0 [drm_kms_helper]
<4>[    5.990137]  ? drm_dp_cec_set_edid+0x4e/0x140 [drm_kms_helper]
<4>[    5.990183]  ? nouveau_connector_detect+0x1fd/0x440 [nouveau]
<4>[    5.990220]  ? nouveau_connector_detect_lvds+0x132/0x180 [nouveau]
<4>[    5.990254]  ? nouveau_connector_detect_lvds+0x132/0x180 [nouveau]
<4>[    5.990262]  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_helper]
<4>[    5.990270]  ? __alloc_pages_nodemask+0xda/0x230
<4>[    5.990278]  ? drm_setup_crtcs+0x139/0xa80 [drm_kms_helper]
<4>[    5.990285]  ? __drm_fb_helper_initial_config_and_unlock+0x2d/0x400 [drm_kms_helper]
<4>[    5.990293]  ? mutex_lock+0xe/0x30
<4>[    5.990327]  ? nouveau_fbcon_init+0x14f/0x1c0 [nouveau]
<4>[    5.990361]  ? nouveau_drm_device_init+0x184/0x7f0 [nouveau]
<4>[    5.990396]  ? nouveau_drm_probe+0x1eb/0x2b0 [nouveau]
<4>[    5.990400]  ? __pm_runtime_resume+0x54/0x70
<4>[    5.990406]  ? local_pci_probe+0x42/0xa0
<4>[    5.990410]  ? pci_device_probe+0x13c/0x160
<4>[    5.990416]  ? driver_probe_device+0x2f7/0x440
<4>[    5.990420]  ? __driver_attach+0xb8/0xe0
<4>[    5.990424]  ? driver_probe_device+0x440/0x440
<4>[    5.990428]  ? bus_for_each_dev+0x5e/0x90
<4>[    5.990432]  ? bus_add_driver+0x161/0x260
<4>[    5.990436]  ? 0xffffffffa04da000
<4>[    5.990440]  ? driver_register+0x57/0xc0
<4>[    5.990443]  ? 0xffffffffa04da000
<4>[    5.990448]  ? do_one_initcall+0x4e/0x190
<4>[    5.990453]  ? __vunmap+0x6d/0xb0
<4>[    5.990456]  ? __vunmap+0x6d/0xb0
<4>[    5.990461]  ? do_init_module+0x5b/0x1e4
<4>[    5.990467]  ? load_module+0x18ec/0x1f80
<4>[    5.990473]  ? SYSC_finit_module+0xb7/0xd0
<4>[    5.990477]  ? SYSC_finit_module+0xb7/0xd0
<4>[    5.990482]  ? do_syscall_64+0x7b/0x160
<4>[    5.990487]  ? entry_SYSCALL_64_after_hwframe+0x3d/0xa2
<4>[    5.990490] Code:  Bad RIP value.
<4>[    5.990496] Modules linked in: uas usb_storage sr_mod cdrom nouveau(+) mxm_wmi i2c_algo_bit xhci_pci drm_kms_helper ehci_pci syscopyarea sysfillrect xhci_hcd ahci sysimgblt ehci_hcd fb_sys_fops libahci ttm serio_raw drm usbcore drm_panel_orientation_quirks wmi video button sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua
<4>[    5.990524] CR2: 0000000000000000
<4>[    5.990533] ---[ end trace ade108cc5d3d80f8 ]---
<4>[    5.990536] RIP: 0010:          (null)
<4>[    5.990540] RSP: 0018:ffffc900023837a8 EFLAGS: 00010246
<4>[    5.990543] RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc9000238383f
<4>[    5.990547] RDX: ffff88040ab06080 RSI: ffffc900023837b8 RDI: ffff88040762c490
<4>[    5.990550] RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88040ea23980
<4>[    5.990554] R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
<4>[    5.990557] R13: ffff88040762c490 R14: 0000000000000000 R15: ffff880409995c18
<4>[    5.990561] FS:  00007fc2135b0d40(0000) GS:ffff88040ea00000(0000) knlGS:0000000000000000
<4>[    5.990566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[    5.990570] CR2: ffffffffffffffd5 CR3: 000000040999a003 CR4: 00000000000606f0
<6>[    6.063983] usb 2-1.4: new high-speed USB device number 5 using ehci-pci
<6>[    6.173872] usb 2-1.4: New USB device found, idVendor=0bda, idProduct=0139
<6>[    6.173896] usb 2-1.4: New USB device strings: Mfr=1, Product=2, SerialNumber=3
<6>[    6.173917] usb 2-1.4: Product: USB2.0-CRW
<6>[    6.173929] usb 2-1.4: Manufacturer: Generic
<6>[    6.173942] usb 2-1.4: SerialNumber: 20100201396000000
<7>[    6.202921] systemd-journald[191]: Received SIGTERM from PID 1 (systemd).
<4>[    6.224934] systemd: 13 output lines suppressed due to ratelimiting
<6>[    6.548052] EXT4-fs (sda2): re-mounted. Opts: acl,user_xattr
<4>[    6.684209] ACPI: Deprecated procfs I/F for AC is loaded, please retry with CONFIG_ACPI_PROCFS_POWER cleared
<6>[    6.684274] ACPI: AC Adapter [AC0] (on-line)
<6>[    6.707406] thermal LNXTHERM:00: registered as thermal_zone0
<6>[    6.707415] ACPI: Thermal Zone [THRM] (62 C)
<4>[    6.711673] ACPI Warning: SystemIO range 0x0000000000000428-0x000000000000042F conflicts with OpRegion 0x0000000000000400-0x000000000000044F (\GPIS) (20170303/utaddress-213)
<4>[    6.711696] ACPI Warning: SystemIO range 0x0000000000000428-0x000000000000042F conflicts with OpRegion 0x0000000000000400-0x000000000000047F (\PMIO) (20170303/utaddress-213)
<6>[    6.711716] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711727] ACPI Warning: SystemIO range 0x0000000000000540-0x000000000000054F conflicts with OpRegion 0x0000000000000500-0x000000000000057F (\GPIO) (20170303/utaddress-213)
<4>[    6.711744] ACPI Warning: SystemIO range 0x0000000000000540-0x000000000000054F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GP01) (20170303/utaddress-213)
<6>[    6.711761] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711772] ACPI Warning: SystemIO range 0x0000000000000530-0x000000000000053F conflicts with OpRegion 0x0000000000000500-0x000000000000057F (\GPIO) (20170303/utaddress-213)
<4>[    6.711789] ACPI Warning: SystemIO range 0x0000000000000530-0x000000000000053F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GP01) (20170303/utaddress-213)
<6>[    6.711807] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711818] ACPI Warning: SystemIO range 0x0000000000000500-0x000000000000052F conflicts with OpRegion 0x0000000000000500-0x000000000000057F (\GPIO) (20170303/utaddress-213)
<4>[    6.711833] ACPI Warning: SystemIO range 0x0000000000000500-0x000000000000052F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GP01) (20170303/utaddress-213)
<6>[    6.711850] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711860] lpc_ich: Resource conflict(s) found affecting gpio_ich
<4>[    6.721318] ACPI: Deprecated procfs I/F for battery is loaded, please retry with CONFIG_ACPI_PROCFS_POWER cleared
<6>[    6.721335] ACPI: Battery Slot [BAT0] (battery present)
......

Reproducible: Always

Steps to Reproduce:
1.boot system
2.selecting leap 15.1 kernel 4.12.14-lp151.28.4-default
3.System Freezes after NULL pointer dereference
Actual Results:  
boot.msg from 4.12.14-lp151.28.4-default

<6>[    0.000000] microcode: microcode updated early to revision 0x2f, date = 2019-02-17
<5>[    0.000000] Linux version 4.12.14-lp151.28.4-default (geeko@buildhost) (gcc version 7.4.0 (SUSE Linux) ) #1 SMP Fri May 24 07:57:46 UTC 2019 (af35fd1)
<6>[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-4.12.14-lp151.28.4-default root=UUID=be592af4-7121-428e-85c2-798265147fd4
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
<6>[    0.000000] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256
<6>[    0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'standard' format.
<6>[    0.000000] e820: BIOS-provided physical RAM map:
......
<6>[    5.897389] nouveau 0000:01:00.0: fb: 3072 MiB GDDR5
<6>[    5.920728] sr 2:0:0:0: [sr0] scsi3-mmc drive: 62x/62x writer dvd-ram cd/rw xa/form2 cdda tray
<6>[    5.920743] cdrom: Uniform CD-ROM driver Revision: 3.20
<7>[    5.920910] sr 2:0:0:0: Attached scsi CD-ROM sr0
<6>[    5.956994] [TTM] Zone  kernel: Available graphics memory: 8196364 kiB
<6>[    5.957002] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
<6>[    5.957005] [TTM] Initializing pool allocator
<6>[    5.957013] [TTM] Initializing DMA pool allocator
<6>[    5.957042] nouveau 0000:01:00.0: DRM: VRAM: 3072 MiB
<6>[    5.957046] nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
<6>[    5.957052] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
<6>[    5.957056] nouveau 0000:01:00.0: DRM: DCB version 4.0
<6>[    5.957061] nouveau 0000:01:00.0: DRM: DCB outp 00: 01800323 00010034
<6>[    5.957065] nouveau 0000:01:00.0: DRM: DCB outp 01: 02011300 00000000
<6>[    5.957069] nouveau 0000:01:00.0: DRM: DCB outp 02: 08022382 00020010
<6>[    5.957073] nouveau 0000:01:00.0: DRM: DCB conn 00: 00000040
<6>[    5.957077] nouveau 0000:01:00.0: DRM: DCB conn 01: 00000100
<6>[    5.957081] nouveau 0000:01:00.0: DRM: DCB conn 02: 00010261
<6>[    5.957766] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
<6>[    5.957770] [drm] Driver supports precise vblank timestamp query.
<6>[    5.958580] nouveau 0000:01:00.0: DRM: MM: using COPY0 for buffer copies
<1>[    5.990023] BUG: unable to handle kernel NULL pointer dereference at           (null)
<1>[    5.990034] IP:           (null)
<6>[    5.990038] PGD 0 P4D 0 
<4>[    5.990043] Oops: 0010 [#1] SMP PTI
<4>[    5.990056] CPU: 0 PID: 268 Comm: systemd-udevd Not tainted 4.12.14-lp151.28.4-default #1 openSUSE Leap 15.1
<4>[    5.990062] Hardware name: ASUSTeK Computer Inc. G74Sx/G74Sx, BIOS G74Sx.201 04/27/2011
<4>[    5.990067] task: ffff88040ab06080 task.stack: ffffc90002380000
<4>[    5.990070] RIP: 0010:          (null)
<4>[    5.990074] RSP: 0018:ffffc900023837a8 EFLAGS: 00010246
<4>[    5.990078] RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc9000238383f
<4>[    5.990081] RDX: ffff88040ab06080 RSI: ffffc900023837b8 RDI: ffff88040762c490
<4>[    5.990085] RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88040ea23980
<4>[    5.990088] R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
<4>[    5.990092] R13: ffff88040762c490 R14: 0000000000000000 R15: ffff880409995c18
<4>[    5.990096] FS:  00007fc2135b0d40(0000) GS:ffff88040ea00000(0000) knlGS:0000000000000000
<4>[    5.990101] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[    5.990105] CR2: 0000000000000000 CR3: 000000040999a003 CR4: 00000000000606f0
<4>[    5.990109] Call Trace:
<4>[    5.990122]  ? drm_dp_dpcd_access+0x69/0xf0 [drm_kms_helper]
<4>[    5.990129]  ? drm_dp_dpcd_read+0x2e/0xc0 [drm_kms_helper]
<4>[    5.990137]  ? drm_dp_cec_set_edid+0x4e/0x140 [drm_kms_helper]
<4>[    5.990183]  ? nouveau_connector_detect+0x1fd/0x440 [nouveau]
<4>[    5.990220]  ? nouveau_connector_detect_lvds+0x132/0x180 [nouveau]
<4>[    5.990254]  ? nouveau_connector_detect_lvds+0x132/0x180 [nouveau]
<4>[    5.990262]  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_helper]
<4>[    5.990270]  ? __alloc_pages_nodemask+0xda/0x230
<4>[    5.990278]  ? drm_setup_crtcs+0x139/0xa80 [drm_kms_helper]
<4>[    5.990285]  ? __drm_fb_helper_initial_config_and_unlock+0x2d/0x400 [drm_kms_helper]
<4>[    5.990293]  ? mutex_lock+0xe/0x30
<4>[    5.990327]  ? nouveau_fbcon_init+0x14f/0x1c0 [nouveau]
<4>[    5.990361]  ? nouveau_drm_device_init+0x184/0x7f0 [nouveau]
<4>[    5.990396]  ? nouveau_drm_probe+0x1eb/0x2b0 [nouveau]
<4>[    5.990400]  ? __pm_runtime_resume+0x54/0x70
<4>[    5.990406]  ? local_pci_probe+0x42/0xa0
<4>[    5.990410]  ? pci_device_probe+0x13c/0x160
<4>[    5.990416]  ? driver_probe_device+0x2f7/0x440
<4>[    5.990420]  ? __driver_attach+0xb8/0xe0
<4>[    5.990424]  ? driver_probe_device+0x440/0x440
<4>[    5.990428]  ? bus_for_each_dev+0x5e/0x90
<4>[    5.990432]  ? bus_add_driver+0x161/0x260
<4>[    5.990436]  ? 0xffffffffa04da000
<4>[    5.990440]  ? driver_register+0x57/0xc0
<4>[    5.990443]  ? 0xffffffffa04da000
<4>[    5.990448]  ? do_one_initcall+0x4e/0x190
<4>[    5.990453]  ? __vunmap+0x6d/0xb0
<4>[    5.990456]  ? __vunmap+0x6d/0xb0
<4>[    5.990461]  ? do_init_module+0x5b/0x1e4
<4>[    5.990467]  ? load_module+0x18ec/0x1f80
<4>[    5.990473]  ? SYSC_finit_module+0xb7/0xd0
<4>[    5.990477]  ? SYSC_finit_module+0xb7/0xd0
<4>[    5.990482]  ? do_syscall_64+0x7b/0x160
<4>[    5.990487]  ? entry_SYSCALL_64_after_hwframe+0x3d/0xa2
<4>[    5.990490] Code:  Bad RIP value.
<4>[    5.990496] Modules linked in: uas usb_storage sr_mod cdrom nouveau(+) mxm_wmi i2c_algo_bit xhci_pci drm_kms_helper ehci_pci syscopyarea sysfillrect xhci_hcd ahci sysimgblt ehci_hcd fb_sys_fops libahci ttm serio_raw drm usbcore drm_panel_orientation_quirks wmi video button sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua
<4>[    5.990524] CR2: 0000000000000000
<4>[    5.990533] ---[ end trace ade108cc5d3d80f8 ]---
<4>[    5.990536] RIP: 0010:          (null)
<4>[    5.990540] RSP: 0018:ffffc900023837a8 EFLAGS: 00010246
<4>[    5.990543] RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc9000238383f
<4>[    5.990547] RDX: ffff88040ab06080 RSI: ffffc900023837b8 RDI: ffff88040762c490
<4>[    5.990550] RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88040ea23980
<4>[    5.990554] R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
<4>[    5.990557] R13: ffff88040762c490 R14: 0000000000000000 R15: ffff880409995c18
<4>[    5.990561] FS:  00007fc2135b0d40(0000) GS:ffff88040ea00000(0000) knlGS:0000000000000000
<4>[    5.990566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[    5.990570] CR2: ffffffffffffffd5 CR3: 000000040999a003 CR4: 00000000000606f0
<6>[    6.063983] usb 2-1.4: new high-speed USB device number 5 using ehci-pci
<6>[    6.173872] usb 2-1.4: New USB device found, idVendor=0bda, idProduct=0139
<6>[    6.173896] usb 2-1.4: New USB device strings: Mfr=1, Product=2, SerialNumber=3
<6>[    6.173917] usb 2-1.4: Product: USB2.0-CRW
<6>[    6.173929] usb 2-1.4: Manufacturer: Generic
<6>[    6.173942] usb 2-1.4: SerialNumber: 20100201396000000
<7>[    6.202921] systemd-journald[191]: Received SIGTERM from PID 1 (systemd).
<4>[    6.224934] systemd: 13 output lines suppressed due to ratelimiting
<6>[    6.548052] EXT4-fs (sda2): re-mounted. Opts: acl,user_xattr
<4>[    6.684209] ACPI: Deprecated procfs I/F for AC is loaded, please retry with CONFIG_ACPI_PROCFS_POWER cleared
<6>[    6.684274] ACPI: AC Adapter [AC0] (on-line)
<6>[    6.707406] thermal LNXTHERM:00: registered as thermal_zone0
<6>[    6.707415] ACPI: Thermal Zone [THRM] (62 C)
<4>[    6.711673] ACPI Warning: SystemIO range 0x0000000000000428-0x000000000000042F conflicts with OpRegion 0x0000000000000400-0x000000000000044F (\GPIS) (20170303/utaddress-213)
<4>[    6.711696] ACPI Warning: SystemIO range 0x0000000000000428-0x000000000000042F conflicts with OpRegion 0x0000000000000400-0x000000000000047F (\PMIO) (20170303/utaddress-213)
<6>[    6.711716] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711727] ACPI Warning: SystemIO range 0x0000000000000540-0x000000000000054F conflicts with OpRegion 0x0000000000000500-0x000000000000057F (\GPIO) (20170303/utaddress-213)
<4>[    6.711744] ACPI Warning: SystemIO range 0x0000000000000540-0x000000000000054F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GP01) (20170303/utaddress-213)
<6>[    6.711761] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711772] ACPI Warning: SystemIO range 0x0000000000000530-0x000000000000053F conflicts with OpRegion 0x0000000000000500-0x000000000000057F (\GPIO) (20170303/utaddress-213)
<4>[    6.711789] ACPI Warning: SystemIO range 0x0000000000000530-0x000000000000053F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GP01) (20170303/utaddress-213)
<6>[    6.711807] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711818] ACPI Warning: SystemIO range 0x0000000000000500-0x000000000000052F conflicts with OpRegion 0x0000000000000500-0x000000000000057F (\GPIO) (20170303/utaddress-213)
<4>[    6.711833] ACPI Warning: SystemIO range 0x0000000000000500-0x000000000000052F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GP01) (20170303/utaddress-213)
<6>[    6.711850] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
<4>[    6.711860] lpc_ich: Resource conflict(s) found affecting gpio_ich
<4>[    6.721318] ACPI: Deprecated procfs I/F for battery is loaded, please retry with CONFIG_ACPI_PROCFS_POWER cleared
<6>[    6.721335] ACPI: Battery Slot [BAT0] (battery present)
......

Expected Results:  
 boot.msg from 4.12.14-lp151.27-default

<5>[    0.000000] Linux version 4.12.14-lp151.27-default (geeko@buildhost) (gcc version 7.4.0 (SUSE Linux) ) #1 SMP Fri May 10 14:13:15 UTC 2019 (862c838)
<6>[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-4.12.14-lp151.27-default root=UUID=be592af4-7121-428e-85c2-798265147fd4 splash=silent resume=/dev/disk/by-uuid/2321f826-14ea-4c0b-a2ba-bb47c1252d62 mitigations=auto quiet
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
<6>[    0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
<6>[    0.000000] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256
<6>[    0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'standard' format.
<6>[    0.000000] e820: BIOS-provided physical RAM map:
.....
<6>[    4.235214] nouveau 0000:01:00.0: bios: version 70.26.29.00.0c
<6>[    4.257965] nouveau 0000:01:00.0: fb: 3072 MiB GDDR5
<6>[    4.317849] [TTM] Zone  kernel: Available graphics memory: 8196366 kiB
<6>[    4.317850] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
<6>[    4.317851] [TTM] Initializing pool allocator
<6>[    4.317855] [TTM] Initializing DMA pool allocator
<6>[    4.317869] nouveau 0000:01:00.0: DRM: VRAM: 3072 MiB
<6>[    4.317870] nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
<6>[    4.317873] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
<6>[    4.317875] nouveau 0000:01:00.0: DRM: DCB version 4.0
<6>[    4.317877] nouveau 0000:01:00.0: DRM: DCB outp 00: 01800323 00010034
<6>[    4.317879] nouveau 0000:01:00.0: DRM: DCB outp 01: 02011300 00000000
<6>[    4.317880] nouveau 0000:01:00.0: DRM: DCB outp 02: 08022382 00020010
<6>[    4.317882] nouveau 0000:01:00.0: DRM: DCB conn 00: 00000040
<6>[    4.317891] nouveau 0000:01:00.0: DRM: DCB conn 01: 00000100
<6>[    4.317892] nouveau 0000:01:00.0: DRM: DCB conn 02: 00010261
<6>[    4.318543] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
<6>[    4.318640] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
<6>[    4.318894] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
<6>[    4.318895] [drm] Driver supports precise vblank timestamp query.
<6>[    4.319279] ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
<7>[    4.319317] ata1.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
<6>[    4.319320] ata1.00: ACPI cmd ef/10:03:00:00:00:a0 (SET FEATURES) filtered out
<4>[    4.319537] ata1.00: supports DRM functions and may not be fully accessible
<6>[    4.319852] nouveau 0000:01:00.0: DRM: MM: using COPY0 for buffer copies
<7>[    4.320547] ata3.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
<6>[    4.320553] ata1.00: ATA-10: Crucial_CT2050MX300SSD1,  M0CR031, max UDMA/133
<6>[    4.320556] ata1.00: 4004704368 sectors, multi 16: LBA48 NCQ (depth 31/32), AA
<6>[    4.321786] ata3.00: ATAPI: MATSHITABD-MLT UJ240AS, 1.01, max UDMA/100
<6>[    4.323131] ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
<7>[    4.323171] ata1.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
<6>[    4.323175] ata1.00: ACPI cmd ef/10:03:00:00:00:a0 (SET FEATURES) filtered out
<4>[    4.323451] ata1.00: supports DRM functions and may not be fully accessible
<7>[    4.323670] ata3.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
<6>[    4.324926] ata3.00: configured for UDMA/100
<6>[    4.325987] ata1.00: configured for UDMA/133
<5>[    4.326238] scsi 0:0:0:0: Direct-Access     ATA      Crucial_CT2050MX R031 PQ: 0 ANSI: 5
<5>[    4.326555] sd 0:0:0:0: Attached scsi generic sg0 type 0
.....

This seems to be have been recently introduced during the latest kernel update for Leap 15.1.

It occurs within the nouveau drm establishment, shortly after TMDS table announcement.

<6>[    5.958580] nouveau 0000:01:00.0: DRM: MM: using COPY0 for buffer copies
<1>[    5.990023] BUG: unable to handle kernel NULL pointer dereference at           (null)
<1>[    5.990034] IP:           (null)
<6>[    5.990038] PGD 0 P4D 0

by booting the previous kernel release can recover the system.

There seems to be many problems within the nouveau drm driver, however this one causes a boot to crash.
Comment 1 Ross Green 2019-06-02 16:51:20 UTC
Created attachment 806604 [details]
copy of boot.msg on failed boot
Comment 2 Ross Green 2019-06-02 16:59:09 UTC
Another problem associated with the nouveau driver seems to be the loading of the firmware for the nouveau hardware.

I believe this use to not be a problem, however it may cause some extra complications.

In general the nouveau driver seems to less stable in Leap 15.1 compared to Leap 15.0 or even 42.3.

This system runs well on older releases.

Other hardware with NVIDIA drivers or radeon hardware has no problems with Leap 15.1 kernel.
Comment 3 Felix Miata 2019-06-02 19:34:37 UTC
Created attachment 806608 [details]
journalctl -b -1 after previously booted to lp151.28.4

reproduces with NVIDIA GT218 [GeForce 210] chip ID: 10de:0a65
Comment 4 Ross Green 2019-06-02 23:03:30 UTC
Looking at both these stack traces it appears that these systems are
caught somewhere within the KMS as drm_kms_helper is caught in the stack call unwind.
Seems like the nouveau_connector_detect area is involved.

Might be worth looking at drm_nouveau changes in this area between the kernel releases.

Thanks Felix for verifying the above condition on different hardware,
all be it with a NVIDIA display adapter.

On the whole the nouveau driver seems much less stable in Leap 15.1 compared to Leap 15.0. Often get lookups in 15.1 where the display just completely freezes and can't even get back to a virtual terminal to investigate.

Ross

Jun 02 15:17:24 p5bse kernel:  ? drm_dp_dpcd_access+0x69/0xf0 [drm_kms_helper]
Jun 02 15:17:24 p5bse kernel:  ? drm_dp_dpcd_read+0x2e/0xc0 [drm_kms_helper]
Jun 02 15:17:24 p5bse kernel:  ? drm_dp_cec_set_edid+0x4e/0x140 [drm_kms_helper]
Jun 02 15:17:24 p5bse kernel:  ? nouveau_connector_detect+0x1fd/0x440 [nouveau]
Jun 02 15:17:24 p5bse kernel:  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_helper]
Jun 02 15:17:24 p5bse kernel:  ? nouveau_connector_force+0x80/0x80 [nouveau]
Jun 02 15:17:24 p5bse kernel:  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_helper]
Jun 02 15:17:24 p5bse kernel:  ? dev_vprintk_emit+0x108/0x220
Jun 02 15:17:24 p5bse kernel:  ? drm_setup_crtcs+0x139/0xa80 [drm_kms_helper]
Jun 02 15:17:24 p5bse kernel:  ? __drm_fb_helper_initial_config_and_unlock+0x2d/0x400 [drm_kms_helper]
Jun 02 15:17:24 p5bse kernel:  ? mutex_lock+0xe/0x30
Jun 02 15:17:24 p5bse kernel:  ? nouveau_fbcon_init+0x14f/0x1c0 [nouveau]
Comment 5 Ross Green 2019-06-03 01:34:44 UTC
I have just been checking the upstream additions to the drm/nouveau driver
in linus' git tree.

There have not been a lot of new code changes to the nouveau drivers during May, April, or March.

In fact I had to go all the way back to 2019-02-20 before I found any large amount of code added to this driver.

So I suspect if this has crept in to the nouveau area it goes way back to at least kernel 5.0 development or earlier, maybe 4.19 or 4.20.

Of course there may be a problem in more generic code, but on the whole there has not been a lot of fixes directly to the nouveau area over quite some period.

Have not had time to go back further than 2019-02-20 kernel changes on linus git tree.

It may be useful to talk to some of the up stream kernel developers who have worked in this area. Maybe Ben Skeggs, Colin Ian King, Daniel Vetter, ...

To see if they can shed some more light on this problem. The stack traces may help the above people understand what might be happening here.

There are lots more people using 4.19+ kernels in other distributions these days so looking to some of the upstream kernel people could be useful.

Certainly the kernel used in the release Leap 15.1 did not exhibit this problem.
It is just the latest update where I noticed it. Very obvious, it happens during boot up. That is to say 4.12.14-lp151.27-default boots but 4.12.14-lp151.28.4-default does not.

The other thing to note is in the earlier 4.12.14-lp151.27-default kernel there are still freezes that occur during usage. On the whole Leap 15.1 has not been a happy experience regarding the nouveau driver. I have been using NVIDIA drivers on other 15.1 systems and have no such freezes.

So there are several problems in this area I suspect. But failing to boot is obviously the most pressing. ;-)


Ross
Comment 6 Felix Miata 2019-06-03 03:51:21 UTC
Created attachment 806612 [details]
journalctl -b via remote login booted to lp151.28.4

Also reproduces with NVIDIA G98 [GeForce 8400 GS Rev. 2] chip ID: 10de:06e4.

lp151.28.4 does boot with usable video using nomodeset.
Comment 7 Ross Green 2019-06-03 11:14:34 UTC
Yes I suspected calling with nomodeset might work.
If you look at the stack unwind the first three calls come from drm_kms_helper.
So Kernel modesetting was likely to be involved.

I guess the point is we have a reproducible stack trace for kernel 4.12.14-lp151.28.4-default. So it should be possible to work out what has happened here.
This problem did not happen with the earlier 15.1 kernel 4.12.14-lp151.27-default,
so a few people who work closely with the nouveau driver might have a few ideas as to what changes have brought upon this outcome.

With the exception of some of the additions for newer hardware there has not been a lot of activity in the nouveau driver for several months.

Most of the major changes happened way back during the 5.0 release cycle.
Likewise the 4.20 and 4.19 releases might be useful candidate changes that bought upon the problem.

I think I did notice there were also a few kernel modeset changes along the way as well, when looking at kernel updates to the drm driver. So I suspect a few of the driver maintainers might be able to quickly shed some light on some likely potential changes that have brought on this problem.

Ross
Comment 8 Felix Miata 2019-06-04 05:38:25 UTC
Does this differ from bug 1136978 ?
Comment 9 Ross Green 2019-06-04 13:02:00 UTC
Just looked at the Stack Trace unwind and it does look similar to bug 1136978.

In particular to the calls from drm_kms_helper.

While setting nomodeset might make some of the driver work.
It really is just hiding the problem considering that the earlier release kernel functioned better, i.e booted cleanly.

As mentioned before the stack trace should help work exactly what the problem is.

I understand this later kernel has drawn heavily from 5.0 kernel driver,
it may pay to check further with both 5.1 and 5.2 updates to see if there is already an upstream fix. It may also be that there are other things missing from upstream kernels that might be required

Ross

2019-05-31T11:08:23.970542+02:00 linux-ilj4 kernel: [    3.574019] BUG: unable to handle kernel NULL pointer dereference at         
  (null)
2019-05-31T11:08:23.970543+02:00 linux-ilj4 kernel: [    3.574023] IP:           (null)
2019-05-31T11:08:23.970543+02:00 linux-ilj4 kernel: [    3.574025] PGD 0 P4D 0 
2019-05-31T11:08:23.970544+02:00 linux-ilj4 kernel: [    3.574027] Oops: 0010 [#1] SMP PTI
2019-05-31T11:08:23.970546+02:00 linux-ilj4 kernel: [    3.574030] CPU: 1 PID: 312 Comm: systemd-udevd Not tainted 4.12.14-lp151.28.
4-default #1 openSUSE Leap 15.1
2019-05-31T11:08:23.970546+02:00 linux-ilj4 kernel: [    3.574032] Hardware name: Hewlett-Packard HP Z420 Workstation/1589, BIOS J61
 v03.15 05/09/2013
2019-05-31T11:08:23.970547+02:00 linux-ilj4 kernel: [    3.574033] task: ffff880808c04040 task.stack: ffffc90004090000
2019-05-31T11:08:23.970547+02:00 linux-ilj4 kernel: [    3.574035] RIP: 0010:          (null)
2019-05-31T11:08:23.970547+02:00 linux-ilj4 kernel: [    3.574036] RSP: 0018:ffffc900040937d8 EFLAGS: 00010246
2019-05-31T11:08:23.970548+02:00 linux-ilj4 kernel: [    3.574038] RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc9000409386f
2019-05-31T11:08:23.970548+02:00 linux-ilj4 kernel: [    3.574040] RDX: ffff880808c04040 RSI: ffffc900040937e8 RDI: ffff88080766b490
2019-05-31T11:08:23.970550+02:00 linux-ilj4 kernel: [    3.574042] RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88080f223980
2019-05-31T11:08:23.970551+02:00 linux-ilj4 kernel: [    3.574043] R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
2019-05-31T11:08:23.970551+02:00 linux-ilj4 kernel: [    3.574045] R13: ffff88080766b490 R14: 0000000000000000 R15: ffff880808b5b218
2019-05-31T11:08:23.970552+02:00 linux-ilj4 kernel: [    3.574047] FS:  00007f2be4dfed40(0000) GS:ffff88080f240000(0000) knlGS:00000
00000000000
2019-05-31T11:08:23.970552+02:00 linux-ilj4 kernel: [    3.574049] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2019-05-31T11:08:23.970552+02:00 linux-ilj4 kernel: [    3.574050] CR2: 0000000000000000 CR3: 0000000808aa4001 CR4: 00000000000606e0
2019-05-31T11:08:23.970554+02:00 linux-ilj4 kernel: [    3.574052] Call Trace:
2019-05-31T11:08:23.970555+02:00 linux-ilj4 kernel: [    3.574063]  ? drm_dp_dpcd_access+0x69/0xf0 [drm_kms_helper]
2019-05-31T11:08:23.970555+02:00 linux-ilj4 kernel: [    3.574067]  ? drm_dp_dpcd_read+0x2e/0xc0 [drm_kms_helper]
2019-05-31T11:08:23.970556+02:00 linux-ilj4 kernel: [    3.574072]  ? drm_dp_cec_set_edid+0x4e/0x140 [drm_kms_helper]
2019-05-31T11:08:23.970556+02:00 linux-ilj4 kernel: [    3.574116]  ? nouveau_connector_detect+0x1fd/0x440 [nouveau]
2019-05-31T11:08:23.970557+02:00 linux-ilj4 kernel: [    3.574123]  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_he
lper]
2019-05-31T11:08:23.970557+02:00 linux-ilj4 kernel: [    3.574152]  ? nouveau_connector_force+0x80/0x80 [nouveau]
2019-05-31T11:08:23.970559+02:00 linux-ilj4 kernel: [    3.574158]  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_he
lper]
2019-05-31T11:08:23.970560+02:00 linux-ilj4 kernel: [    3.574164]  ? dev_vprintk_emit+0x108/0x220
2019-05-31T11:08:23.970560+02:00 linux-ilj4 kernel: [    3.574170]  ? drm_setup_crtcs+0x139/0xa80 [drm_kms_helper]
2019-05-31T11:08:23.970560+02:00 linux-ilj4 kernel: [    3.574176]  ? __drm_fb_helper_initial_config_and_unlock+0x2d/0x400 [drm_kms_
helper]
2019-05-31T11:08:23.970561+02:00 linux-ilj4 kernel: [    3.574182]  ? mutex_lock+0xe/0x30
2019-05-31T11:08:23.970561+02:00 linux-ilj4 kernel: [    3.574210]  ? nouveau_fbcon_init+0x14f/0x1c0 [nouveau]
2019-05-31T11:08:23.970563+02:00 linux-ilj4 kernel: [    3.574236]  ? nouveau_drm_device_init+0x184/0x7f0 [nouveau]
2019-05-31T11:08:23.970564+02:00 linux-ilj4 kernel: [    3.574263]  ? nouveau_drm_probe+0x1eb/0x2b0 [nouveau]
2019-05-31T11:08:23.970564+02:00 linux-ilj4 kernel: [    3.574266]  ? __pm_runtime_resume+0x54/0x70
2019-05-31T11:08:23.970565+02:00 linux-ilj4 kernel: [    3.574271]  ? local_pci_probe+0x42/0xa0
2019-05-31T11:08:23.970565+02:00 linux-ilj4 kernel: [    3.574274]  ? pci_device_probe+0x13c/0x160
2019-05-31T11:08:23.970566+02:00 linux-ilj4 kernel: [    3.574279]  ? driver_probe_device+0x2f7/0x440
2019-05-31T11:08:23.970566+02:00 linux-ilj4 kernel: [    3.574282]  ? __driver_attach+0xb8/0xe0
2019-05-31T11:08:23.970568+02:00 linux-ilj4 kernel: [    3.574285]  ? driver_probe_device+0x440/0x440
2019-05-31T11:08:23.970568+02:00 linux-ilj4 kernel: [    3.574288]  ? bus_for_each_dev+0x5e/0x90
2019-05-31T11:08:23.970569+02:00 linux-ilj4 kernel: [    3.574291]  ? bus_add_driver+0x161/0x260
2019-05-31T11:08:23.970569+02:00 linux-ilj4 kernel: [    3.574293]  ? 0xffffffffa05cf000
2019-05-31T11:08:23.970570+02:00 linux-ilj4 kernel: [    3.574296]  ? driver_register+0x57/0xc0
2019-05-31T11:08:23.970570+02:00 linux-ilj4 kernel: [    3.574299]  ? 0xffffffffa05cf000
2019-05-31T11:08:23.970570+02:00 linux-ilj4 kernel: [    3.574303]  ? do_one_initcall+0x4e/0x190
2019-05-31T11:08:23.970573+02:00 linux-ilj4 kernel: [    3.574308]  ? __vunmap+0x6d/0xb0
2019-05-31T11:08:23.970574+02:00 linux-ilj4 kernel: [    3.574310]  ? __vunmap+0x6d/0xb0
2019-05-31T11:08:23.970574+02:00 linux-ilj4 kernel: [    3.574314]  ? do_init_module+0x5b/0x1e4
2019-05-31T11:08:23.970575+02:00 linux-ilj4 kernel: [    3.574319]  ? load_module+0x18ec/0x1f80
2019-05-31T11:08:23.970575+02:00 linux-ilj4 kernel: [    3.574324]  ? SYSC_finit_module+0xb7/0xd0
2019-05-31T11:08:23.970575+02:00 linux-ilj4 kernel: [    3.574327]  ? SYSC_finit_module+0xb7/0xd0
2019-05-31T11:08:23.970577+02:00 linux-ilj4 kernel: [    3.574330]  ? do_syscall_64+0x7b/0x160
2019-05-31T11:08:23.970578+02:00 linux-ilj4 kernel: [    3.574334]  ? entry_SYSCALL_64_after_hwframe+0x3d/0xa2
Comment 10 Ross Green 2019-06-04 13:03:47 UTC
Just looked at the Stack Trace unwind and it does look similar to bug 1136978.

In particular to the calls from drm_kms_helper.

While setting nomodeset might make some of the driver work.
It really is just hiding the problem considering that the earlier release kernel functioned better, i.e booted cleanly.

As mentioned before the stack trace should help work exactly what the problem is.

I understand this later kernel has drawn heavily from 5.0 kernel driver,
it may pay to check further with both 5.1 and 5.2 updates to see if there is already an upstream fix. It may also be that there are other things missing from upstream kernels that might be required

Ross

2019-05-31T11:08:23.970542+02:00 linux-ilj4 kernel: [    3.574019] BUG: unable to handle kernel NULL pointer dereference at         
  (null)
2019-05-31T11:08:23.970543+02:00 linux-ilj4 kernel: [    3.574023] IP:           (null)
2019-05-31T11:08:23.970543+02:00 linux-ilj4 kernel: [    3.574025] PGD 0 P4D 0 
2019-05-31T11:08:23.970544+02:00 linux-ilj4 kernel: [    3.574027] Oops: 0010 [#1] SMP PTI
2019-05-31T11:08:23.970546+02:00 linux-ilj4 kernel: [    3.574030] CPU: 1 PID: 312 Comm: systemd-udevd Not tainted 4.12.14-lp151.28.
4-default #1 openSUSE Leap 15.1
2019-05-31T11:08:23.970546+02:00 linux-ilj4 kernel: [    3.574032] Hardware name: Hewlett-Packard HP Z420 Workstation/1589, BIOS J61
 v03.15 05/09/2013
2019-05-31T11:08:23.970547+02:00 linux-ilj4 kernel: [    3.574033] task: ffff880808c04040 task.stack: ffffc90004090000
2019-05-31T11:08:23.970547+02:00 linux-ilj4 kernel: [    3.574035] RIP: 0010:          (null)
2019-05-31T11:08:23.970547+02:00 linux-ilj4 kernel: [    3.574036] RSP: 0018:ffffc900040937d8 EFLAGS: 00010246
2019-05-31T11:08:23.970548+02:00 linux-ilj4 kernel: [    3.574038] RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc9000409386f
2019-05-31T11:08:23.970548+02:00 linux-ilj4 kernel: [    3.574040] RDX: ffff880808c04040 RSI: ffffc900040937e8 RDI: ffff88080766b490
2019-05-31T11:08:23.970550+02:00 linux-ilj4 kernel: [    3.574042] RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88080f223980
2019-05-31T11:08:23.970551+02:00 linux-ilj4 kernel: [    3.574043] R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
2019-05-31T11:08:23.970551+02:00 linux-ilj4 kernel: [    3.574045] R13: ffff88080766b490 R14: 0000000000000000 R15: ffff880808b5b218
2019-05-31T11:08:23.970552+02:00 linux-ilj4 kernel: [    3.574047] FS:  00007f2be4dfed40(0000) GS:ffff88080f240000(0000) knlGS:00000
00000000000
2019-05-31T11:08:23.970552+02:00 linux-ilj4 kernel: [    3.574049] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2019-05-31T11:08:23.970552+02:00 linux-ilj4 kernel: [    3.574050] CR2: 0000000000000000 CR3: 0000000808aa4001 CR4: 00000000000606e0
2019-05-31T11:08:23.970554+02:00 linux-ilj4 kernel: [    3.574052] Call Trace:
2019-05-31T11:08:23.970555+02:00 linux-ilj4 kernel: [    3.574063]  ? drm_dp_dpcd_access+0x69/0xf0 [drm_kms_helper]
2019-05-31T11:08:23.970555+02:00 linux-ilj4 kernel: [    3.574067]  ? drm_dp_dpcd_read+0x2e/0xc0 [drm_kms_helper]
2019-05-31T11:08:23.970556+02:00 linux-ilj4 kernel: [    3.574072]  ? drm_dp_cec_set_edid+0x4e/0x140 [drm_kms_helper]
2019-05-31T11:08:23.970556+02:00 linux-ilj4 kernel: [    3.574116]  ? nouveau_connector_detect+0x1fd/0x440 [nouveau]
2019-05-31T11:08:23.970557+02:00 linux-ilj4 kernel: [    3.574123]  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_he
lper]
2019-05-31T11:08:23.970557+02:00 linux-ilj4 kernel: [    3.574152]  ? nouveau_connector_force+0x80/0x80 [nouveau]
2019-05-31T11:08:23.970559+02:00 linux-ilj4 kernel: [    3.574158]  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_he
lper]
2019-05-31T11:08:23.970560+02:00 linux-ilj4 kernel: [    3.574164]  ? dev_vprintk_emit+0x108/0x220
2019-05-31T11:08:23.970560+02:00 linux-ilj4 kernel: [    3.574170]  ? drm_setup_crtcs+0x139/0xa80 [drm_kms_helper]
2019-05-31T11:08:23.970560+02:00 linux-ilj4 kernel: [    3.574176]  ? __drm_fb_helper_initial_config_and_unlock+0x2d/0x400 [drm_kms_
helper]
2019-05-31T11:08:23.970561+02:00 linux-ilj4 kernel: [    3.574182]  ? mutex_lock+0xe/0x30
2019-05-31T11:08:23.970561+02:00 linux-ilj4 kernel: [    3.574210]  ? nouveau_fbcon_init+0x14f/0x1c0 [nouveau]
2019-05-31T11:08:23.970563+02:00 linux-ilj4 kernel: [    3.574236]  ? nouveau_drm_device_init+0x184/0x7f0 [nouveau]
2019-05-31T11:08:23.970564+02:00 linux-ilj4 kernel: [    3.574263]  ? nouveau_drm_probe+0x1eb/0x2b0 [nouveau]
2019-05-31T11:08:23.970564+02:00 linux-ilj4 kernel: [    3.574266]  ? __pm_runtime_resume+0x54/0x70
2019-05-31T11:08:23.970565+02:00 linux-ilj4 kernel: [    3.574271]  ? local_pci_probe+0x42/0xa0
2019-05-31T11:08:23.970565+02:00 linux-ilj4 kernel: [    3.574274]  ? pci_device_probe+0x13c/0x160
2019-05-31T11:08:23.970566+02:00 linux-ilj4 kernel: [    3.574279]  ? driver_probe_device+0x2f7/0x440
2019-05-31T11:08:23.970566+02:00 linux-ilj4 kernel: [    3.574282]  ? __driver_attach+0xb8/0xe0
2019-05-31T11:08:23.970568+02:00 linux-ilj4 kernel: [    3.574285]  ? driver_probe_device+0x440/0x440
2019-05-31T11:08:23.970568+02:00 linux-ilj4 kernel: [    3.574288]  ? bus_for_each_dev+0x5e/0x90
2019-05-31T11:08:23.970569+02:00 linux-ilj4 kernel: [    3.574291]  ? bus_add_driver+0x161/0x260
2019-05-31T11:08:23.970569+02:00 linux-ilj4 kernel: [    3.574293]  ? 0xffffffffa05cf000
2019-05-31T11:08:23.970570+02:00 linux-ilj4 kernel: [    3.574296]  ? driver_register+0x57/0xc0
2019-05-31T11:08:23.970570+02:00 linux-ilj4 kernel: [    3.574299]  ? 0xffffffffa05cf000
2019-05-31T11:08:23.970570+02:00 linux-ilj4 kernel: [    3.574303]  ? do_one_initcall+0x4e/0x190
2019-05-31T11:08:23.970573+02:00 linux-ilj4 kernel: [    3.574308]  ? __vunmap+0x6d/0xb0
2019-05-31T11:08:23.970574+02:00 linux-ilj4 kernel: [    3.574310]  ? __vunmap+0x6d/0xb0
2019-05-31T11:08:23.970574+02:00 linux-ilj4 kernel: [    3.574314]  ? do_init_module+0x5b/0x1e4
2019-05-31T11:08:23.970575+02:00 linux-ilj4 kernel: [    3.574319]  ? load_module+0x18ec/0x1f80
2019-05-31T11:08:23.970575+02:00 linux-ilj4 kernel: [    3.574324]  ? SYSC_finit_module+0xb7/0xd0
2019-05-31T11:08:23.970575+02:00 linux-ilj4 kernel: [    3.574327]  ? SYSC_finit_module+0xb7/0xd0
2019-05-31T11:08:23.970577+02:00 linux-ilj4 kernel: [    3.574330]  ? do_syscall_64+0x7b/0x160
2019-05-31T11:08:23.970578+02:00 linux-ilj4 kernel: [    3.574334]  ? entry_SYSCALL_64_after_hwframe+0x3d/0xa2
Comment 11 Ross Green 2019-06-04 13:06:45 UTC
Sorry typo accident
Comment 12 Ross Green 2019-06-05 13:33:31 UTC
Created attachment 806836 [details]
boot.msg from NVIDIA geforce RTX 2070 card showing stack trace.

This  attachment shows the stack trace after boot with 
kernel  4.12.14-lp151.28.4-default , gets kernel NULL pointer dereference during nouveau drm establishment.

The last few function calls are from within drm_kms_helper.

Ross
Comment 13 Ross Green 2019-06-05 13:47:53 UTC
After doing a new install from the usb .iso on new hardware with AMD 2700X and the NVIDIA geforce RTX 2070.

If you select network configuration and updates from update repository,
the new kernel 4.12.14-lp151.28.4-default is one of the new updates downloaded and installed.

The installation goes as planned with all the graphics working and presenting as expected. The installation finished the system then does its first reboot.

After the reboot, the poor user is left with nothing ... but a blank screen!
Virtually no indication what has gone on or in what state the machine is in.

Not a comfortable or informative state for a new user experience with openSuSE.
Even an experienced user is left just wondering what has just happened.

I just got trapped with such a scenario with the above installation
and I knew about the problem!

Ross
Comment 14 Paul Richards 2019-06-06 13:48:41 UTC
If the bug I just reported ( https://bugzilla.opensuse.org/show_bug.cgi?id=1137512 ) is a dupe of this, I'd be happy to help investigate further.

In particular, I'd be happy to bisect the kernel versions between these two releases to help isolate the offending changes.
Comment 15 Ross Green 2019-06-06 14:29:26 UTC
I suspect this is the same problem.

The trick to finding out for sure is have a look at the stack trace.

Even though the nouveau driver baffs, with a little bit of luck you might get enough written to the /var/log/boot.msg that will include the stack trace of the Null pointer. If you look for the nouveau establishment announcments,
you might see the lines:

Call Trace:
<4>[    5.990122]  ? drm_dp_dpcd_access+0x69/0xf0 [drm_kms_helper]
<4>[    5.990129]  ? drm_dp_dpcd_read+0x2e/0xc0 [drm_kms_helper]
<4>[    5.990137]  ? drm_dp_cec_set_edid+0x4e/0x140 [drm_kms_helper]
<4>[    5.990183]  ? nouveau_connector_detect+0x1fd/0x440 [nouveau]
<4>[    5.990220]  ? nouveau_connector_detect_lvds+0x132/0x180 [nouveau]
<4>[    5.990254]  ? nouveau_connector_detect_lvds+0x132/0x180 [nouveau]
<4>[    5.990262]  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_helper]
<4>[    5.990270]  ? __alloc_pages_nodemask+0xda/0x230
<4>[    5.990278]  ? drm_setup_crtcs+0x139/0xa80 [drm_kms_helper]
<4>[    5.990285]  ? __drm_fb_helper_initial_config_and_unlock+0x2d/0x400 [drm_kms_helper]
<4>[    5.990293]  ? mutex_lock+0xe/0x30
<4>[    5.990327]  ? nouveau_fbcon_init+0x14f/0x1c0 [nouveau]
<4>[    5.990361]  ? nouveau_drm_device_init+0x184/0x7f0 [nouveau]
<4>[    5.990396]  ? nouveau_drm_probe+0x1eb/0x2b0 [nouveau]


this is the stack unwind of the calls leading up to the problem.


there are several calls from within drm_kms_helper.
plus nouveau_connector_dector calls.

This would indicate kernel modeset is involved, with spurious data.

Indeed if you set nomodeset you can boot the kernel, but you end up with a screen that is usually in a VGA resolution and not the real functionality you want from the system.

Paul if you are happy to undertake a bisect to narrow down the offending problem code by all means do.

I suspect part of the problem might be using some of the upstream nouveau driver code without all the required extras.

I now have several systems that exhibit the above problem.
Unfortunately from a work perspective NVIDIA drivers are used to allow use with various bits of CUDA hardware and software.

What I find very frustrating with this bug is that it can affect standard installs, where the newer driver gets downloaded and the resulting system ends with just a blank screen and no other visible information to tell what is going on.

I spent several hours the other day looking at Linus' tree following all the updates to the nouveau driver. It might be worth trying one of the more recent kernels with nouveau driver and see if the problem does exist in the upstream code.

Ross
Comment 16 Paul Richards 2019-06-06 15:14:10 UTC
> Even though the nouveau driver baffs, with a little bit of luck you might get enough written to the /var/log/boot.msg that will include the stack trace of the Null pointer.

In my case "/" isn't mounted yet (encrypted LVM partition hasn't been unlocked yet), so I don't expect anything in /var/log to be present.

> Paul if you are happy to undertake a bisect to narrow down the offending problem code by all means do.

I'd be happy to.  I am a software engineer by trade, but I don't have any specific openSUSE or kernel development experience.  I assume I can rebuild kernel RPMs locally from source somehow, and do a bisect.  Are there any instructions for that?

If not, I'll try to figure out something following the instructions here: https://en.opensuse.org/openSUSE:Kernel_git
Comment 17 Ross Green 2019-06-06 22:19:47 UTC
Paul in my case I construct ext4 file systems for /, /var, /usr/local, and /home.

Certainly lvm plus encryption adds an extra couple of layers of complications that might get in the way. I do notice even with a blank screen, there is at least some periodic disk activity going on that hopefully allows enough of the data to end up in the file system or at least in journals that can be recovered.

There seems to be a number of "problems" that have crept into the nouveau driver.

Another annoying habit seems to be "lockups" that occur after the driver has returned from suspend.

All in all, this driver is now very unstable. At least under Leap 15.0 it was usable. Now it is very difficult to use for normal expected usage. Not quite fit for purpose!

As an alternative NVIDIA propriety drivers could be used, but that does not suit everyone.

In terms of building kernels, I will normally compile and install by hand, placing all the components including kernel modules into appropriate places. I don't worry too much about producing rpms. However if you want to share with other people packaged rpms can allow other people to easily test your efforts.

Bisects will be good to help narrow down explicit code additions that have caused the issues. However, good old fashioned eye balling the code helps to clarify in your mind as to what is actually happening and intended with the code additions. 

As I mention earlier I suspect there might be some missing pieces that are necessary with this code that have some how not made it into the blended driver that has come from younger kernel drivers.

Please report back any findings and I am sure there will be a few people that a re more than willing to test anything you produce. I have not yet tried a tumbleweed install to see how a more recent kernel behaves with nouveau. Should do this as it has a complete nouveau driver that is from later kernels. The difficulty of cherry picking fixes from later kernels is that there is always a chance that not all of the fixes make it into the back port.

Given the range of NVIDIA graphics cards that seem to be affected it should not be too difficult to find a few test environments and people willing to give it a burn.

Ross
Comment 18 Ross Green 2019-06-08 00:14:48 UTC
I notice that there appears to be a number of sync problems with the NVIDIA firmware blobs. Getting the right versions of those binaries should be necessary to at least handle some of the clock frequency changes that might be required.
Especially in the area with Power Management will adjust clocks to save or match power budget.
A number of the freezes associated with a running nouveau driver seem to occur after the system has been in suspend or hibernate.

While this is just another problem within the nouveau drm driver it should be addressed.

The following warnings occur on booted system with NVIDIA card and nouveau DRM driver.  

019-06-01T22:21:29.800857+10:00 red plasmashell[4432]: trying to show an empty dialog
2019-06-01T22:21:30.027741+10:00 red plasmashell[4432]: libpng warning: iCCP: known incorrect sRGB profile
2019-06-01T22:21:31.760431+10:00 red kwin_x11[4426]: qt.qpa.xcb: QXcbConnection: XCB error: 3 (BadWindow), sequence: 16110, resource
 id: 37748815, major code: 18 (ChangeProperty), minor code: 0
2019-06-01T22:21:32.835597+10:00 red kernel: [  329.854993] nouveau 0000:01:00.0: Direct firmware load for nouveau/nvcf_fuc084 faile
d with error -2
2019-06-01T22:21:32.835621+10:00 red kernel: [  329.855007] nouveau 0000:01:00.0: Direct firmware load for nouveau/nvcf_fuc084d fail
ed with error -2
2019-06-01T22:21:32.835622+10:00 red kernel: [  329.855009] nouveau 0000:01:00.0: msvld: unable to load firmware data
2019-06-01T22:21:32.835623+10:00 red kernel: [  329.855011] nouveau 0000:01:00.0: msvld: init failed, -19
2019-06-01T22:21:43.923950+10:00 red plasmashell[4432]: QQuickItem::stackAfter: Cannot stack StatusNotifierItem_QMLTYPE_263(0x55cd33
769d80, parent=0x55cd32c84650, geometry=0,0 0x0) after StatusNotifierItem_QMLTYPE_263(0x55cd3328fe50), which must be a sibling
2019-06-01T22:25:02.856679+10:00 red org.kde.powerdevil.backlighthelper: QDBusArgument: read from a write-only object



I also see within the nouveau drm tree that there have been a number of additions to various modesetting changes needed for a number of the NVIDIA cards.

I was also reading comments made by Michael Larabel thst point to the need to be using the appropriate firmware binaries when using the nouveau drm driver.

An updated firmware configuration should help some GeForce GTX 1000 "Pascal" users with their limited open-source driver support, but the situation remains a mess. Besides the fact of being binary blobs, it's more complicated this time around with the interfaces changing for what is expected by the Nouveau DRM kernel driver.

Hitting today in linux-firmware.git were changing the GP102/GP104/GP106/GP107 firmware blobs around the SEC2 RTOS block to point to what's used by the GP108 graphics card. This firmware update/change was needed since the existing signed firmware files weren't working correctly on newer Pascal graphics cards.

Usually it's just a matter of overwriting/updating the firmware files, but due to interface breakage, it means an updated kernel DRM driver is needed to properly use the new files and to keep the old files still in the linux-firmware tree as not to regress users on older kernels.

The updated DRM kernel driver bits for dealing with the changed SEC2 RTOS firmware interfaces can currently be found in the Nouveau DRM tree maintained by Red Hat's Ben Skeggs.

But even with this latest Pascal firmware switching, the open-source Pascal support remains in poor shape... In particular, the same problem plagues Pascal as it does Maxwell/Volta (and ultimately Turing too, but there we're still waiting on the initial firmware files) of not having PMU access with the current firmware images in order to implement proper re-clocking support. As such, all the cards past the GTX 950 series remain very slow with Nouveau due to being stuck to their boot clock frequencies rather than their optimal base/boost clock frequencies. But if/when NVIDIA will help straight out this situation remains to be seen. For now those wanting open-source NVIDIA GPU support are best off with the GeForce GTX 600/700 Kepler series.
Comment 19 Paul Richards 2019-06-08 20:00:24 UTC
I tried to follow up on my earlier promise to try and bisect this.  Unfortunately I couldn't figure out how to build/install openSUSE kernels from source.  I tried following the instructions on the wiki ( https://en.opensuse.org/openSUSE:Kernel_git ), and also the instructions in the packages linked from there; but unfortunately I couldn't past the initial "sequence-patch.sh" step.

I'll not pollute the comments on this bug report further, but if folks are able to share either a place which contains RPMs for the intermediate versions, or some getting started guides for building openSUSE kernels; I'd be happy to try again.
Comment 20 Tomas Jensen 2019-06-11 10:57:08 UTC
Seems I have hit the same issue:
Kernel command line: BOOT_IMAGE=/vmlinuz-4.12.14-lp151.28.4-default root=UUID=64bb719b-a265-49b8-bde9-4ccf702792ce resume=/dev/disk/by-uuid/e740e737-a332-4f69-8650-ea8e83f17b7b splash=silent quiet showopts

jun 11 12:27:18 b-006 kernel: nouveau 0000:01:00.0: bios: M0203T not found
jun 11 12:27:18 b-006 kernel: nouveau 0000:01:00.0: bios: M0203E not matched!

jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: VRAM: 512 MiB
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: TMDS table version 2.0
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: DCB version 4.0
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: DCB outp 00: 02000300 00000028
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: DCB outp 01: 01000302 00000030
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: DCB outp 02: 04011310 00000028
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: DCB conn 00: 00001030
jun 11 12:27:19 b-006 kernel: nouveau 0000:01:00.0: DRM: DCB conn 01: 00000100
jun 11 12:27:19 b-006 systemd-udevd[208]: worker [253] terminated by signal 9 (KILL)
jun 11 12:27:19 b-006 systemd-udevd[208]: worker [253] failed while handling '/devices/pci0000:00/0000:00:01.0/0000:01:00.0'
jun 11 12:27:19 b-006 kernel: BUG: unable to handle kernel NULL pointer dereference at           (null)
jun 11 12:27:19 b-006 kernel: IP:           (null)
jun 11 12:27:19 b-006 kernel: PGD 0 P4D 0 
jun 11 12:27:19 b-006 kernel: Oops: 0010 [#1] SMP PTI
jun 11 12:27:19 b-006 kernel: CPU: 1 PID: 253 Comm: systemd-udevd Not tainted 4.12.14-lp151.28.4-default #1 openSUSE Leap 15.1
jun 11 12:27:19 b-006 kernel: Hardware name: Gigabyte Technology Co., Ltd. G31M-ES2L/G31M-S2L, BIOS F10 09/29/2009
jun 11 12:27:19 b-006 kernel: task: ffff88012667c100 task.stack: ffffc90000aac000
jun 11 12:27:19 b-006 kernel: RIP: 0010:          (null)
jun 11 12:27:19 b-006 kernel: RSP: 0018:ffffc90000aaf7d8 EFLAGS: 00010246
jun 11 12:27:19 b-006 kernel: RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc90000aaf86f
jun 11 12:27:19 b-006 kernel: RDX: ffff88012667c100 RSI: ffffc90000aaf7e8 RDI: ffff880125931490
jun 11 12:27:19 b-006 kernel: RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88012fc23980
jun 11 12:27:19 b-006 kernel: R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
jun 11 12:27:19 b-006 kernel: R13: ffff880125931490 R14: 0000000000000000 R15: ffff8801265faa18
jun 11 12:27:19 b-006 kernel: FS:  00007f069a985d40(0000) GS:ffff88012fc80000(0000) knlGS:0000000000000000
jun 11 12:27:19 b-006 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
jun 11 12:27:19 b-006 kernel: CR2: 0000000000000000 CR3: 0000000126638000 CR4: 00000000000406e0
jun 11 12:27:19 b-006 kernel: Call Trace:
jun 11 12:27:19 b-006 kernel:  ? drm_dp_dpcd_access+0x69/0xf0 [drm_kms_helper]
jun 11 12:27:19 b-006 kernel:  ? drm_dp_dpcd_read+0x2e/0xc0 [drm_kms_helper]
jun 11 12:27:19 b-006 kernel:  ? drm_dp_cec_set_edid+0x4e/0x140 [drm_kms_helper]
jun 11 12:27:19 b-006 kernel:  ? nouveau_connector_detect+0x1fd/0x440 [nouveau]
jun 11 12:27:19 b-006 kernel:  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_helper]
jun 11 12:27:19 b-006 kernel:  ? nouveau_connector_force+0x80/0x80 [nouveau]
jun 11 12:27:19 b-006 kernel:  ? drm_helper_probe_single_connector_modes+0xc3/0x6b0 [drm_kms_helper]
jun 11 12:27:19 b-006 kernel:  ? drm_setup_crtcs+0x139/0xa80 [drm_kms_helper]
jun 11 12:27:19 b-006 kernel:  ? __drm_fb_helper_initial_config_and_unlock+0x2d/0x400 [drm_kms_helper]
jun 11 12:27:19 b-006 kernel:  ? mutex_lock+0xe/0x30
jun 11 12:27:19 b-006 kernel:  ? nouveau_fbcon_init+0x14f/0x1c0 [nouveau]
jun 11 12:27:19 b-006 kernel:  ? nouveau_drm_device_init+0x184/0x7f0 [nouveau]
jun 11 12:27:19 b-006 kernel:  ? nouveau_drm_probe+0x1eb/0x2b0 [nouveau]
jun 11 12:27:19 b-006 kernel:  ? __pm_runtime_resume+0x54/0x70
jun 11 12:27:19 b-006 kernel:  ? local_pci_probe+0x42/0xa0
jun 11 12:27:19 b-006 kernel:  ? pci_device_probe+0x13c/0x160
jun 11 12:27:19 b-006 kernel:  ? driver_probe_device+0x2f7/0x440
jun 11 12:27:19 b-006 kernel:  ? __driver_attach+0xb8/0xe0
jun 11 12:27:19 b-006 kernel:  ? driver_probe_device+0x440/0x440
jun 11 12:27:19 b-006 kernel:  ? bus_for_each_dev+0x5e/0x90
jun 11 12:27:19 b-006 kernel:  ? bus_add_driver+0x161/0x260
jun 11 12:27:19 b-006 kernel:  ? 0xffffffffa0465000
jun 11 12:27:19 b-006 kernel:  ? driver_register+0x57/0xc0
jun 11 12:27:19 b-006 kernel:  ? 0xffffffffa0465000
jun 11 12:27:19 b-006 kernel:  ? do_one_initcall+0x4e/0x190
jun 11 12:27:19 b-006 kernel:  ? __vunmap+0x6d/0xb0
jun 11 12:27:19 b-006 kernel:  ? __vunmap+0x6d/0xb0
jun 11 12:27:19 b-006 kernel:  ? do_init_module+0x5b/0x1e4
jun 11 12:27:19 b-006 kernel:  ? load_module+0x18ec/0x1f80
jun 11 12:27:19 b-006 kernel:  ? SYSC_finit_module+0xb7/0xd0
jun 11 12:27:19 b-006 kernel:  ? SYSC_finit_module+0xb7/0xd0
jun 11 12:27:19 b-006 kernel:  ? do_syscall_64+0x7b/0x160
jun 11 12:27:19 b-006 kernel:  ? entry_SYSCALL_64_after_hwframe+0x3d/0xa2
jun 11 12:27:19 b-006 kernel: Code:  Bad RIP value.
jun 11 12:27:19 b-006 kernel: Modules linked in: sr_mod cdrom ata_generic nouveau(+) ata_piix ahci libahci video mxm_wmi wmi i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm drm_panel_orientation_quirks serio_raw ehci_pci uhci_hcd ehc>
jun 11 12:27:19 b-006 kernel: CR2: 0000000000000000
jun 11 12:27:19 b-006 kernel: ---[ end trace 608a6e05fc9a8bbd ]---
jun 11 12:27:19 b-006 kernel: RIP: 0010:          (null)
jun 11 12:27:19 b-006 kernel: RSP: 0018:ffffc90000aaf7d8 EFLAGS: 00010246
jun 11 12:27:19 b-006 kernel: RAX: 0000000000000000 RBX: 0000000000000020 RCX: ffffc90000aaf86f
jun 11 12:27:19 b-006 kernel: RDX: ffff88012667c100 RSI: ffffc90000aaf7e8 RDI: ffff880125931490
jun 11 12:27:19 b-006 kernel: RBP: 0000000000000001 R08: 0000000000000001 R09: ffff88012fc23980
jun 11 12:27:19 b-006 kernel: R10: 0000000000000000 R11: 0000000000000040 R12: 00000000fffffffb
jun 11 12:27:19 b-006 kernel: R13: ffff880125931490 R14: 0000000000000000 R15: ffff8801265faa18
jun 11 12:27:19 b-006 kernel: FS:  00007f069a985d40(0000) GS:ffff88012fc80000(0000) knlGS:0000000000000000
jun 11 12:27:19 b-006 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
jun 11 12:27:19 b-006 kernel: CR2: ffffffffffffffd5 CR3: 0000000126638000 CR4: 00000000000406e0




Working: Linux version 4.12.14-lp150.12.61-default

Graphics card is: NVidia G98 [GeForce 8400 GS Rev.2]
Comment 21 Takashi Iwai 2019-06-11 12:58:35 UTC
It's a dup of another bug.  Please check the test kernel mentioned there later.  Thanks.

*** This bug has been marked as a duplicate of bug 1136978 ***