Bug 770246

Summary: i915 screenblank of death
Product: [openSUSE] openSUSE 12.2 Reporter: Marcus Meissner <meissner>
Component: KernelAssignee: Egbert Eich <eich>
Status: RESOLVED DUPLICATE QA Contact: E-mail List <qa-bugs>
Severity: Critical    
Priority: P1 - Urgent CC: ismail, lnussel
Version: Factory   
Target Milestone: ---   
Hardware: Other   
OS: Other   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Attachments: dmesg
i195_error_state.txt dump

Description Marcus Meissner 2012-07-06 13:00:37 UTC
screenblanking on my thinkpad x61 occasionaly never returns.

dmesg has
[ 7076.696056] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7076.696067] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[ 7076.697356] [drm:kick_ring] *ERROR* Kicking stuck wait on render ring
[ 7076.697778] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
[ 7076.698348] [drm] Changing LVDS panel from (+hsync, +vsync) to (-hsync, -vsync)


full message attached.
Comment 1 Marcus Meissner 2012-07-06 13:00:57 UTC
Created attachment 497651 [details]
dmesg
Comment 2 Marcus Meissner 2012-07-09 09:20:37 UTC
Created attachment 497750 [details]
i195_error_state.txt dump

as suggested by the dmesg comments, the register dump
Comment 3 Marcus Meissner 2012-07-09 13:42:26 UTC
this annoying, i need to reboot my X61 Thinkpad every hour :(
Comment 4 Ludwig Nussel 2012-07-10 15:07:07 UTC
same here
Comment 5 Egbert Eich 2012-07-11 08:39:59 UTC
I need:
a. the device ID of the GFX chip (second line of lspci -n)
b. the machine itself to debug this.

Did this work with an earlier version of the kernel? If it did bisecting would probably be the best chance to find the origin.
Intel hardware has the tendency to hang only the latest can reliably be reset to a state to make it work again. I have numerous bugs with this across almost all gens of hardware.
Comment 6 Marcus Meissner 2012-07-11 08:50:16 UTC
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (primary) (rev 0c)

00:02.1 Display controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (secondary) (rev 0c)

lspci -n
00:02.0 0300: 8086:2a02 (rev 0c)
00:02.1 0380: 8086:2a03 (rev 0c)


b. its my Lenovo ThinkPad X61 laptop ... how do you need the machine? ;)

I had openSUSE 11.3 before which worked fine and then installed 12.2 Beta 2... Did not have any intermediate kernels.
Comment 7 Egbert Eich 2012-07-11 09:32:14 UTC
(In reply to comment #6)
> 00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960
> Integrated Graphics Controller (primary) (rev 0c)
> 
> 00:02.1 Display controller: Intel Corporation Mobile GM965/GL960 Integrated
> Graphics Controller (secondary) (rev 0c)
> 
> lspci -n
> 00:02.0 0300: 8086:2a02 (rev 0c)
> 00:02.1 0380: 8086:2a03 (rev 0c)
> 

This is GEN4 hardware. Not really the very latest.

> 
> b. its my Lenovo ThinkPad X61 laptop ... how do you need the machine? ;)

I'm not sure if I even want it ;) - I have 'nough SLES issues to look at.
and ...
> 
> I had openSUSE 11.3 before which worked fine and then installed 12.2 Beta 2...
> Did not have any intermediate kernels.

It is not even clear that this is triggered due to a kernel change. Something else in the desktop may have changed which is triggering this issue. It'd be important to break down which application is triggering this.
I've just installed a box to investigate the same issue on GEN2 hardware.
I may be able to dig up some GEN4 hardware and could try to reproduce this there.
Comment 8 Egbert Eich 2012-07-11 10:33:22 UTC
This is very likely an dup of #768584.

*** This bug has been marked as a duplicate of bug 768584 ***