Bug 1209203

Summary: gnome-shell segfaults in libmutter-cogl-11.so.0
Product: [openSUSE] openSUSE Tumbleweed Reporter: Michal Koutný <mkoutny>
Component: GNOMEAssignee: E-mail List <gnome-bugs>
Status: RESOLVED DUPLICATE QA Contact: E-mail List <qa-bugs>
Severity: Normal    
Priority: P5 - None CC: alynx.zhou
Version: Current   
Target Milestone: ---   
Hardware: Other   
OS: Other   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---

Description Michal Koutný 2023-03-13 10:33:40 UTC
Approximately since an update past Thu 2023-03-09 I can't log into GNOME session, gnome shell/gdm crashes.

> Mar 11 11:46:53 blackbook kernel: gnome-shell[15241]: segfault at 70 ip 00007f226648fe3d sp 00007ffdc6729bf0 error 6 in libmutter-cogl-11.so.0.0.0[7f2266457000+4c000] likely on CPU 0 (core 0, socket 0)                                                                         
> Mar 11 11:46:53 blackbook kernel: Code: 48 89 ef 48 89 c6 e8 62 8a fc ff 48 89 c7 e8 da 8a fc ff 48 89 ef 49 89 c6 e8 1f fa ff ff 48 89 ef e8 47 90 fc ff 49 8b 75 30 <83> 48 70 08 49 89 c4 49 8b 46 48 48 8b 78 50 4c 8b bf e8 00 00 00
> Mar 11 11:46:53 blackbook systemd[1]: Created slice Slice /system/systemd-coredump.
> Mar 11 11:46:53 blackbook systemd[1]: Started Process Core Dump (PID 15548/UID 0).
> Mar 11 11:46:54 blackbook systemd-coredump[15549]: Process 15241 (gnome-shell) of user 484 dumped core.
> 
>                                                    Stack trace of thread 15241:
>                                                    #0  0x00007f226648fe3d n/a (libmutter-cogl-11.so.0 + 0x4de3d)
>                                                    #1  0x00007f226648ef18 cogl_xlib_renderer_handle_event (libmutter-cogl-11.so.0 + 0x4cf18)
>                                                    #2  0x00007f22668de110 n/a (libmutter-11.so.0 + 0xde110)
>                                                    #3  0x00007f22668e9d93 n/a (libmutter-11.so.0 + 0xe9d93)
>                                                    #4  0x00007f226698c090 n/a (libmutter-11.so.0 + 0x18c090)
>                                                    #5  0x00007f22668e576d n/a (libmutter-11.so.0 + 0xe576d)
>                                                    #6  0x00007f2267727a90 g_main_context_dispatch (libglib-2.0.so.0 + 0x58a90)
>                                                    #7  0x00007f2267727e48 n/a (libglib-2.0.so.0 + 0x58e48)
>                                                    #8  0x00007f226772810f g_main_loop_run (libglib-2.0.so.0 + 0x5910f)
>                                                    #9  0x00007f22668c28c5 meta_context_run_main_loop (libmutter-11.so.0 + 0xc28c5)
>                                                    #10 0x000055edad1ce904 n/a (gnome-shell + 0x2904)
>                                                    #11 0x00007f226662caf0 __libc_start_call_main (libc.so.6 + 0x27af0)
>                                                    #12 0x00007f226662cbb9 __libc_start_main_impl (libc.so.6 + 0x27bb9)
>                                                    #13 0x000055edad1ceb65 n/a (gnome-shell + 0x2b65)
> 

I could extract the following stacktrace with debuginfo:

> #0  0x00007f226648fe3d in cogl_onscreen_glx_notify_swap_buffers (swap_event=0x7ffdc6729e90, onscreen=0x55edae5cf120 [CoglOnscreenGlx])                                                                                                                                            
>     at ../cogl/cogl/winsys/cogl-onscreen-glx.c:991
> #1  notify_swap_buffers (context=<optimized out>, swap_event=0x7ffdc6729e90) at ../cogl/cogl/winsys/cogl-winsys-glx.c:184
> #2  glx_event_filter_cb (xevent=0x7ffdc6729e90, data=<optimized out>) at ../cogl/cogl/winsys/cogl-winsys-glx.c:224
> #3  0x00007f226648ef18 in _cogl_renderer_handle_native_event (renderer=<optimized out>, event=0x7ffdc6729e90) at ../cogl/cogl/cogl-renderer.c:636
> #4  cogl_xlib_renderer_handle_event (renderer=<optimized out>, event=0x7ffdc6729e90) at ../cogl/cogl/cogl-xlib-renderer.c:579
> #5  0x00007f22668de110 in cogl_xlib_filter (xevent=<optimized out>, event=<optimized out>, data=<optimized out>) at ../src/backends/x11/meta-clutter-backend-x11.c:94
> #6  0x00007f22668e9d93 in meta_clutter_backend_x11_process_event_filters
>     (clutter_backend_x11=0x55edae3f4040 [MetaClutterBackendX11], event=0x55edb0d83480, native=0x7ffdc6729e90) at ../src/backends/x11/meta-clutter-backend-x11.c:329
> #7  meta_clutter_backend_x11_translate_event (clutter_backend=0x55edae3f4040 [MetaClutterBackendX11], native=0x7ffdc6729e90, event=0x55edb0d83480)
>     at ../src/backends/x11/meta-clutter-backend-x11.c:363
> #8  0x00007f226698c090 in meta_x11_handle_event.isra.0 (backend=backend@entry=0x55edae33b1b0 [MetaBackendX11Cm], xevent=xevent@entry=0x7ffdc6729e90)
>     at ../src/backends/x11/meta-event-x11.c:82
> #9  0x00007f22668e576d in handle_host_xevent (event=0x7ffdc6729e90, backend=0x55edae33b1b0 [MetaBackendX11Cm]) at ../src/backends/x11/meta-backend-x11.c:421
> #10 x_event_source_dispatch (source=<optimized out>, callback=<optimized out>, user_data=<optimized out>) at ../src/backends/x11/meta-backend-x11.c:475
> #11 0x00007f2267727a90 in g_main_dispatch (context=0x55edae3289f0) at ../glib/gmain.c:3454
> #12 g_main_context_dispatch (context=context@entry=0x55edae3289f0) at ../glib/gmain.c:4172
> #13 0x00007f2267727e48 in g_main_context_iterate (context=0x55edae3289f0, block=block@entry=1, dispatch=dispatch@entry=1, self=<optimized out>) at ../glib/gmain.c:4248
> #14 0x00007f226772810f in g_main_loop_run (loop=0x55edafd3dbf0) at ../glib/gmain.c:4448
> #15 0x00007f22668c28c5 in meta_context_run_main_loop (context=<optimized out>, error=error@entry=0x7ffdc672a0f0) at ../src/core/meta-context.c:465
> #16 0x000055edad1ce904 in main (argc=<optimized out>, argv=<optimized out>) at ../src/main.c:582

I'm not sure if related or just red herring (those messages were present even _before_ gnome-shell started to crash) but there were also some X/Mesa errors:

> Mar 11 11:46:50 blackbook /usr/libexec/gdm/gdm-x-session[15160]: (EE) open /dev/dri/card0: No such file or directory
> Mar 11 11:46:50 blackbook /usr/libexec/gdm/gdm-x-session[15160]: (II) Initializing extension MIT-SCREEN-SAVER
> Mar 11 11:46:50 blackbook /usr/libexec/gdm/gdm-x-session[15160]: (EE) AIGLX error: dlopen of /usr/lib64/dri/i965_dri.so failed (/usr/lib64/dri/i965_dri.so: cannot open shared object file: No such file or directory)
> Mar 11 11:46:50 blackbook /usr/libexec/gdm/gdm-x-session[15160]: (EE) AIGLX error: unable to load driver i965
Comment 1 Alynx Zhou 2023-03-14 08:04:07 UTC
Hi, could you please tell me your Mesa version? I am wondering it is related to your driver update.
Comment 2 Alynx Zhou 2023-03-14 08:08:15 UTC
Also, what is your intel GPU model? The i965 error may not be related because it is some old driver and replaced by the iris one.
Comment 3 Michal Koutný 2023-03-14 08:55:39 UTC
Mesa-dri-23.0.0-345.1.x86_64
The GPU is Intel HD Graphics 5500 (on CPU i7-5600U).

I've grepped available journal logs and the 'AIGLX error: unable to load driver i965' had been present for a long time when I observed no (user) issues.
Comment 4 Michal Koutný 2023-03-14 09:05:18 UTC
I've forgotten:
gnome-shell-43.3-1.1.x86_64
gdm-43.0-6.1.x86_64

(If you needed more info, let me know.)
Comment 5 Michal Koutný 2023-03-20 12:29:54 UTC
An update fixed it. Not sure which one was it so a  list for reference:
gnome-shell-43.3-1.1.x86_64
gdm-43.0-6.1.x86_64
mutter-43.3+2-2.1.x86_64
kernel-default-6.3~rc3-1.1.gd72bdba.x86_64
Mesa-dri-23.0.0-346.1.x86_64

---
Actually, looking at changelogs, it seems to be the Mesa-dri update, i.e. dup of bug 1209005.

*** This bug has been marked as a duplicate of bug 1209005 ***
Comment 6 Stefan Dirsch 2023-03-20 12:54:27 UTC
Yes,, this issue has been fixed with TW 20230315

==== Mesa ====
Subpackages: Mesa-libEGL1 Mesa-libGL1 Mesa-libglapi0 libgbm1

- U_glx-Remove-pointless-GLX_INTEL_swap_event-paranoia.patch
  * reverse apply this patch to fix a regression caused by this
    commit, which resulted in gnome-shell constantly crashing, which
    is making a GNOME/X11 session impossible (boo#1209005)

==== Mesa-drivers ====
Subpackages: Mesa-dri Mesa-gallium Mesa-libva libxatracker2

- U_glx-Remove-pointless-GLX_INTEL_swap_event-paranoia.patch
  * reverse apply this patch to fix a regression caused by this
    commit, which resulted in gnome-shell constantly crashing, which
    is making a GNOME/X11 session impossible (boo#1209005)