Bug 1157428

Summary: drm/nouveau: call trace after recent update
Product: [openSUSE] openSUSE Tumbleweed
Reporter: Lubos Kocman <lubos.kocman>
Component: X.Org
Assignee: E-mail List <xorg-maintainer-bugs>
Status: RESOLVED FIXED
QA Contact: E-mail List <xorg-maintainer-bugs>
Severity: Major
Priority: P5 - None
CC: lubos.kocman
Version: Current
Target Milestone: ---
Hardware: Other
OS: Other
Whiteboard:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---
Attachments: full dmesg log including call trace

Description Lubos Kocman 2019-11-21 09:23:20 UTC
Created attachment 824645
full dmesg log including call trace

Hello team,

I got the "oops something went wrong" screen after yesterday's update of my Tumbleweed installation on a Dell Precision 5530.

The call trace from dmesg is in the attachment.

rpm -qa | grep nouveau
xf86-video-nouveau-1.0.15-3.5.x86_64
libvdpau_nouveau-19.2.3-232.1.x86_64
libdrm_nouveau2-32bit-2.4.100-2.1.x86_64
Mesa-dri-nouveau-19.2.3-232.1.x86_64
libdrm_nouveau2-2.4.100-2.1.x86_64


lkocman@linux-i0y4:~> dmesg | tail -60
[  186.412218] Code: 8b 40 10 48 8b 78 10 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 63 7c 79 d3 4c 89 ea 48 c7 c7 b8 5b 8a c0 48 89 c6 e8 af bc 1a d3 <0f> 0b eb a7 e8 c8 b9 1a d3 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00
[  186.412219] RSP: 0018:ffffb2a3007a7640 EFLAGS: 00010086
[  186.412220] RAX: 0000000000000000 RBX: ffff8cff1518d800 RCX: 0000000000000006
[  186.412220] RDX: 0000000000000007 RSI: 0000000000000002 RDI: ffff8cff1c499a10
[  186.412220] RBP: ffff8cf7e72ab0c8 R08: 0000002b6707a6ef R09: ffff8cff3c6da180
[  186.412221] R10: 0000000000001770 R11: 000000000006f830 R12: 0000000000000246
[  186.412221] R13: ffff8cff1887fe40 R14: ffff8cfef55068a0 R15: ffff8cff0f343360
[  186.412222] FS:  0000000000000000(0000) GS:ffff8cff1c480000(0000) knlGS:0000000000000000
[  186.412222] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  186.412222] CR2: 000056403a082038 CR3: 000000012780a002 CR4: 00000000003606e0
[  186.412223] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  186.412225] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  186.412225] Call Trace:
[  186.412241]  nv50_instobj_release+0x27/0x90 [nouveau]
[  186.412258]  nvkm_vmm_iter.constprop.0+0x765/0x810 [nouveau]
[  186.412275]  ? nvkm_vmm_map_choose+0x90/0x90 [nouveau]
[  186.412291]  ? gp100_vmm_pgt_sparse+0xc0/0xc0 [nouveau]
[  186.412306]  ? nvkm_vmm_sparse_unref_ptes+0x70/0x70 [nouveau]
[  186.412322]  nvkm_vmm_ptes_unmap_put+0x2c/0x40 [nouveau]
[  186.412337]  ? nvkm_vmm_map_choose+0x90/0x90 [nouveau]
[  186.412353]  ? gp100_vmm_pgt_sparse+0xc0/0xc0 [nouveau]
[  186.412369]  nvkm_vmm_put_locked+0x1c5/0x210 [nouveau]
[  186.412386]  nvkm_uvmm_mthd+0x4ec/0x780 [nouveau]
[  186.412397]  nvkm_ioctl+0xde/0x180 [nouveau]
[  186.412408]  nvif_object_mthd+0x104/0x130 [nouveau]
[  186.412410]  ? recalibrate_cpu_khz+0x10/0x10
[  186.412411]  ? ktime_get+0x3c/0x90
[  186.412412]  ? dma_fence_signal+0x29/0x100
[  186.412422]  nvif_vmm_put+0x5c/0x80 [nouveau]
[  186.412443]  nouveau_mem_fini+0x37/0x70 [nouveau]
[  186.412464]  nouveau_mem_del+0x11/0x30 [nouveau]
[  186.412468]  ttm_bo_move_accel_cleanup+0x176/0x250 [ttm]
[  186.412489]  nouveau_bo_move_m2mf.constprop.0+0x1c2/0x200 [nouveau]
[  186.412509]  nouveau_bo_move+0xa5/0x430 [nouveau]
[  186.412519]  ? nvif_vmm_unmap+0x38/0x60 [nouveau]
[  186.412539]  ? nouveau_vma_unmap+0x20/0x30 [nouveau]
[  186.412541]  ttm_bo_handle_move_mem+0x10b/0x4f0 [ttm]
[  186.412543]  ttm_bo_evict+0x150/0x1d0 [ttm]
[  186.412550]  ? drm_add_edid_modes+0xf96/0x18a0 [drm]
[  186.412553]  ttm_mem_evict_first+0x266/0x350 [ttm]
[  186.412555]  ttm_bo_force_list_clean+0xa4/0x170 [ttm]
[  186.412576]  nouveau_do_suspend+0x93/0x190 [nouveau]
[  186.412596]  nouveau_pmops_runtime_suspend+0x40/0xa0 [nouveau]
[  186.412597]  pci_pm_runtime_suspend+0x58/0x140
[  186.412598]  ? __switch_to_asm+0x40/0x70
[  186.412599]  ? pci_pm_thaw_noirq+0xa0/0xa0
[  186.412600]  __rpm_callback+0x81/0x140
[  186.412600]  ? pci_pm_thaw_noirq+0xa0/0xa0
[  186.412601]  rpm_callback+0x1f/0x70
[  186.412602]  rpm_suspend+0x10a/0x5c0
[  186.412604]  ? finish_task_switch+0x7e/0x290
[  186.412605]  pm_runtime_work+0x93/0xa0
[  186.412606]  process_one_work+0x1df/0x380
[  186.412608]  worker_thread+0x4d/0x400
[  186.412610]  kthread+0xf9/0x130
[  186.412611]  ? process_one_work+0x380/0x380
[  186.412612]  ? kthread_park+0x80/0x80
[  186.412613]  ret_from_fork+0x3a/0x50
[  186.412613] ---[ end trace 47fdfc0ea804d195 ]---
[  201.410530] nouveau 0000:01:00.0: DRM: failed to idle channel 0 [DRM]
Comment 1 Stefan Dirsch 2019-11-21 10:58:03 UTC
I suggest just disabling the nouveau kernel driver on this Optimus system (Intel/NVIDIA combo) by adding this kernel boot parameter:

   nouveau.modeset=0

This should already help. If you want to use the NVIDIA GPU for rendering, you probably want to install NVIDIA's proprietary driver anyway. Let me know whether this helps.
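
A minimal sketch for checking whether that parameter is actually in effect after a reboot. The helper name `nouveau_modeset_disabled` is made up for illustration; it only looks for the literal `nouveau.modeset=0` token in a kernel command line. The comments assume the usual openSUSE GRUB2 layout for making the setting persistent, which may differ on other setups.

```shell
# Hypothetical helper (not part of any package): succeeds if the given
# kernel command line contains the token nouveau.modeset=0.
nouveau_modeset_disabled() {
    case " $1 " in
        *" nouveau.modeset=0 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Check the command line of the running kernel.
if nouveau_modeset_disabled "$(cat /proc/cmdline 2>/dev/null)"; then
    echo "nouveau.modeset=0 is active"
else
    echo "nouveau.modeset=0 is not set"
    # Assumption: GRUB2 with the default openSUSE paths. To make the setting
    # persistent, append nouveau.modeset=0 to GRUB_CMDLINE_LINUX_DEFAULT in
    # /etc/default/grub, then run as root:
    #   grub2-mkconfig -o /boot/grub2/grub.cfg
fi
```

For a one-off test, the parameter can also be appended to the kernel line in the GRUB boot menu editor before booting.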

We're not looking into nouveau driver problems because ...

... Nouveau is an experimental driver under constant heavy development. This means that we cannot follow it closely, as we are not part of its team with reverse-engineered knowledge of NVIDIA cards. ...
Comment 2 Stefan Dirsch 2019-11-21 10:59:13 UTC
*** Bug 1157430 has been marked as a duplicate of this bug. ***
Comment 3 Lubos Kocman 2019-11-21 11:06:14 UTC
Thank you Stefan, will do.

Lubos
Comment 4 Lubos Kocman 2019-11-21 11:48:23 UTC
Hmmm, I tried one more thing. Dominique told me that an even newer Mesa is already available (233).

xf86-video-nouveau-1.0.15-3.5.x86_64
libdrm_nouveau2-32bit-2.4.100-2.1.x86_64
libvdpau_nouveau-19.2.4-233.1.x86_64
Mesa-dri-nouveau-19.2.4-233.1.x86_64
libdrm_nouveau2-2.4.100-2.1.x86_64

I can no longer reproduce the issue with these packages.
Comment 5 Stefan Dirsch 2019-11-21 13:20:07 UTC
Weird to see this, since Mesa 19.2.4 is only supposed to fix a rendering bug and the initial comment shows a kernel problem, but OK, whatever. Then let's close it as fixed ...