Bug 276928

Summary: intel: System hangs when x-server is terminated (945GM)
Product: [openSUSE] openSUSE 10.3 Reporter: Casual J. Programmer <casualprogrammer>
Component: X.OrgAssignee: Stefan Dirsch <sndirsch>
Status: RESOLVED FIXED QA Contact: Stefan Dirsch <sndirsch>
Severity: Major    
Priority: P5 - None CC: casualprogrammer
Version: Alpha 4   
Target Milestone: ---   
Hardware: x86   
OS: openSUSE 10.3   
Whiteboard:
Found By: Beta-Customer Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Attachments: Screenshot of corrupted display
/etc/x11/xorg.conf
/var/log/Xorg.0.log
most recent screenshot of corrupted display
Fedora 7 rc2 xorg.conf
Sabayon xorg.conf
Screenshot of corrupted display
Most recent screenshot of frozen systems display
output from hwinfo --gfxcard

Description Casual J. Programmer 2007-05-22 12:07:18 UTC
After installing openSuSE 10.3 alpha4 "clean" from DVD, every other logout, reboot init3 results in system freeze with corrupted display. Only functional key left is Power-Switch.

Hardware is FSC Amilo Si1520 with Intel 945GM
Comment 1 Casual J. Programmer 2007-05-22 12:08:21 UTC
Created attachment 141451 [details]
Screenshot of corrupted display
Comment 2 Casual J. Programmer 2007-05-22 12:10:18 UTC
The Screenshot was taken May 9, as this has been happening before, but the effect is identical to todays.
Comment 3 JP Rosevear 2007-05-26 15:09:24 UTC
This looks like a corrupted X display similar to the radeon driver.
Comment 4 Stefan Dirsch 2007-05-26 15:20:27 UTC
Could you attach xorg.conf and Xorg.0.log? Thanks.
Comment 5 Casual J. Programmer 2007-05-26 15:52:53 UTC
Created attachment 142427 [details]
/etc/x11/xorg.conf
Comment 6 Casual J. Programmer 2007-05-26 15:53:46 UTC
Created attachment 142428 [details]
/var/log/Xorg.0.log
Comment 7 Casual J. Programmer 2007-05-26 15:58:25 UTC
Created attachment 142429 [details]
most recent screenshot of corrupted display

This gets nicer each time ;-)

Actually the distortion appears on every logout ( shutdown of the x-server ) sometimes it recovers normally with hickups, mostly it just freezes the system.

My guess is, it's some kind of leaking memory or plain memory allocation misbehavoiur.
Comment 8 Stefan Dirsch 2007-05-26 19:41:56 UTC
Freeze with intel driver. Hmm ...
Comment 9 Stefan Dirsch 2007-05-29 14:10:06 UTC
DRI is enabled. Does it still happen when you disable 3D support?
Comment 10 Casual J. Programmer 2007-05-29 18:26:39 UTC
It seems to be a good starting point, happened 4 times while trying to remove 3d. ( Computer/Control Center/Graphics Card and Monitor uncheck 3D).

It may help you to know that it hapens in Sabayon as well, but not in Fedora 7 rc2. 

Can't tell now whether it's getting better, will report back after a few logouts.

Comment 11 Casual J. Programmer 2007-05-29 19:07:07 UTC
I did a couple of logout/login sequences as well as reboot and shutdown. System hang did not occur so far. The display is still distorted when x is switched off, but much less massive than before. Imagine the screen shot, where the upper 1/8th was blue with white streaks emanating downward. Now only a few bright spots where the lower border of the distortion was. Alas cant make a screen shot, as this passes too fast.

In Sabayon it looks much like I documented with the screenshots but doesn't hang, in Fedora 7 the screen stays completely dark when switching (3D enabled in both).
Comment 12 Stefan Dirsch 2007-05-31 10:45:40 UTC
> It may help you to know that it hapens in Sabayon as well, but not in Fedora
> 7 rc2. 
Unfortunately this information is rather useless as long as I don't know, which gfx driver is used for your machine by these distributions. Maybe i810?
Maybe fbdev? Maybe vesa? Maybe intel? Which version?
Comment 13 Casual J. Programmer 2007-05-31 11:07:33 UTC
Well, I was under the impression they both use "intel", otherwise I wouldn't have bothered you. I will check the versions for you as well.
Comment 14 Casual J. Programmer 2007-05-31 20:11:17 UTC
OK, for Fedora 7 rc2 its "intel" in the xorg.conf, uname -a produces: Linux localhost.localdomain 2.6.21-1.3194.fc7 #1 SMP Wed May 23 22:35:01 EDT 2007 i686 i686 i386 GNU/Linux.

The package manager lists:

xorg-x11-drv-i128-1.2.0-5.fc7.i386
xorg-x11-drv-i740-1.1.0-3.fc7.i386
xorg-x11-drv-i810-2.0.0-3.fc7.i386




Comment 15 Casual J. Programmer 2007-05-31 20:12:10 UTC
Created attachment 143321 [details]
Fedora 7 rc2 xorg.conf
Comment 16 Stefan Dirsch 2007-05-31 20:30:38 UTC
So Fedora 7 uses the same driver as we do. Possibly RH applied a patch we don't have. To rule out a configuration issue, you can test once more with RH's xorg.conf.
Comment 17 Casual J. Programmer 2007-05-31 22:46:52 UTC
copied  attachment (id=143321) to /etc/X11/xorg.conf, reboot to make sure it's used.

Logging out causes the same distortion as with the openSuSE xorg.conf.

You may check the src rpm from http://ftp.netcraft.com.au/pub/fedora/linux/releases/7/Fedora/source/SRPMS/xorg-x11-drv-i810-2.0.0-3.fc7.src.rpm 
for patch level.
Comment 18 Stefan Dirsch 2007-06-01 01:52:03 UTC
Thanks for testing with RH's config file. I could only find an unrelated DRI patch in the source RPM. So it must be something different. What about driver version in Sabayon, which suffers from the same problem?
Comment 19 Casual J. Programmer 2007-06-01 09:04:39 UTC
Not sure what the Sabayon driver versions are, they have a completely different package management system, didn't find enough motivation so far to uncover it's secrets :-(

xorg.conf attached
Comment 20 Casual J. Programmer 2007-06-01 09:08:20 UTC
Created attachment 143463 [details]
Sabayon xorg.conf

P.S. since disabling 3D I haven't expeienced another freeze so far, only the corrupted picture while screen is in transition.
Comment 21 Stefan Dirsch 2007-06-01 09:25:16 UTC
Ok. Obviously Sabayon uses "i810" - which means nothing, since normally this is a symlink to "intel", which includes the required "i810" symbols. SUSE patched the "intel" sources, so we also can provide the old real "i810" driver to our customers, if there are any problem with the new "intel" driver.

> only the corrupted picture while screen is in transition.
So this is no permanent problem, just some garbage during reinitialization?
This would mean that DRI is the problem.
Comment 22 Casual J. Programmer 2007-06-01 10:13:29 UTC
Well, to me it's not looking like random "garbage" it looks pretty constant like  attachment (id=142429) in comment #7, only, since deactivation of 3D, it doesn't freeze there any more but the (distorted) display gets overwritten by console output.

As I mentioned before, to me it looks like several programs think they should use other peoples scratchpad. I.e. memory leak or memory allocation problem

With F7 the display just gets black and then console output starts on a clean slate, as you would expect.
Comment 23 Stefan Dirsch 2007-06-01 10:27:04 UTC
So the difference between enabled/disabled DRI is that with DRI enabled the machine freezes, whereas the display output is broken in both situations?
Comment 24 Casual J. Programmer 2007-06-01 10:31:38 UTC
Yes
Comment 25 Casual J. Programmer 2007-06-05 08:00:36 UTC
OK, now it has happened again. This time it is different though.

1) When switching from the x-session to tty1 with CTL+ALT+F1 the display showed the garble at the top for an instant.

2) When shutting the system down for reboot ( Computer / logout ) the display showed content of tty1 overlayed by garble ( see attached screenshot ) and froze.
Comment 26 Casual J. Programmer 2007-06-05 08:02:25 UTC
Created attachment 144095 [details]
Screenshot of corrupted display
Comment 27 Stefan Dirsch 2007-06-07 08:57:17 UTC
I think you should better go back to i810 driver. Replace intel with i810 in your xorg.conf. Does this help?
Comment 28 Stefan Dirsch 2007-06-22 13:49:09 UTC
Any news on this one?
Comment 29 Casual J. Programmer 2007-07-01 15:28:33 UTC
Not really, the picture has become colorful though :-)
Comment 30 Stefan Dirsch 2007-07-01 15:33:15 UTC
So the Intel driver did neve work for you. Interesting.
Comment 31 Casual J. Programmer 2007-07-01 15:47:47 UTC
Actually, in the meantime I found that logging out on Fedora 7 also sometimes freezes the system, only the screen is completely blank. So it may be upstream after all. Also I think it is the same issue as Bug 263147, if you agree, it should be marked duplicate.
Comment 32 Stefan Dirsch 2007-07-01 17:41:42 UTC
Ok.

*** This bug has been marked as a duplicate of bug 263147 ***
Comment 33 Casual J. Programmer 2007-07-01 22:12:32 UTC
Sorry if I caused confusion, alas now the duplication shows everything as resolved, which is not the case. I think rather than marking 276928 as duplicate of 263147 it should be vice versa. I.e. 263147 is a duplicate of 276928 and it's pretty much alive.
Comment 34 Casual J. Programmer 2007-07-01 22:13:53 UTC
Created attachment 149037 [details]
Most recent screenshot of frozen systems display
Comment 35 Stefan Dirsch 2007-07-02 05:55:56 UTC
Bug #263147 is already a duplicate of Bug #252509.
Comment 36 Stefan Dirsch 2007-07-02 06:07:13 UTC
The situation looks hopeless to me. Could you add the output of "hwinfo --gfxcard", so we can use the fbdev driver for this notebook by default? Thanks.
Comment 37 Casual J. Programmer 2007-07-02 06:47:05 UTC
OK, here you go.
Comment 38 Casual J. Programmer 2007-07-02 06:47:54 UTC
Created attachment 149051 [details]
output from hwinfo --gfxcard
Comment 39 Casual J. Programmer 2007-07-02 06:55:19 UTC
I am still content, that there are processes trying to use the same bit of memory. Is there a way I can log memory usage for the area in question ? 

Also this keeps popping up in dmesg:

workstation6l:/home/cjp # dmesg | grep agpgart
Linux agpgart interface v0.102 (c) Dave Jones
agpgart: Detected an Intel 945GM Chipset.
agpgart: Detected 7932K stolen memory.
agpgart: AGP aperture is 256M @ 0xc0000000

not sure what significance it has, but if it is possible to detect "stolen" memory, it should be possible to identify the "thief" or leave that memory alone, as the "thief" is probably trying to make use of it, thus "stealing" it again.

Comment 40 Stefan Dirsch 2007-07-04 11:13:44 UTC
Ignore the dmesg warnings. These are *not* the problem at all. Beginning with Alpha6 your machine will be configured with fbdev by default.