|
Bugzilla – Full Text Bug Listing |
| Summary: | kernel oops related to unusual USB device | ||
|---|---|---|---|
| Product: | [openSUSE] openSUSE 11.0 | Reporter: | Juergen Weigert <jw> |
| Component: | Kernel | Assignee: | Greg Kroah-Hartman <gregkh> |
| Status: | RESOLVED FIXED | QA Contact: | E-mail List <qa-bugs> |
| Severity: | Minor | ||
| Priority: | P5 - None | CC: | jack |
| Version: | Final | ||
| Target Milestone: | --- | ||
| Hardware: | Other | ||
| OS: | Other | ||
| Whiteboard: | |||
| Found By: | --- | Services Priority: | |
| Business Priority: | Blocker: | --- | |
| Marketing QA Status: | --- | IT Deployment: | --- |
| Bug Depends on: | |||
| Bug Blocks: | 357354 | ||
| Attachments: |
full dmesg
pm-suspend.log dmesg with another two oopses, may be unrelated, though... dmesg snippet with another oops. Patch fixing similarly looking problem in SLES10 |
||
|
Description
Juergen Weigert
2008-10-01 09:45:11 UTC
Created attachment 242744 [details]
full dmesg
Created attachment 242756 [details]
pm-suspend.log
suspend to ram reports error 11 now, and invites me to view the attached logfile.
error 11 is EAGAIN, which probably means that a process is in state D (probably hal-find-by-cap*) and can not be stopped. This is _NOT_ a suspend problem, but the problem is the oops that occured before. This is a plain old kernel bug. Created attachment 244539 [details]
dmesg with another two oopses, may be unrelated, though...
...will suspend start working when you reboot? Are the preceeding oopses repeatable? yes, after a reboot, the system can susepend/resume again, as normal. Created attachment 245617 [details]
dmesg snippet with another oops.
OOpses are reproducable.
It took me two attempts to get one.
Reducing environment, disconnecting external monitor, disconnecting USB devices, one by one. With all the oopses, one particular USB device was present: USBasp from http://www.fischl.de/usbasp/, latest firmware 2007-10-23 idVendor=16c0, idProduct=05dc It is a programmer dongle for embedded systems, controlled via libusb calls. I can drop such a device on seife's desk, if needed. Ok, if particular device breaks your kernel, debug that particular device ;-). It seems like those oopses will happen even without the suspend/resume, right? Agreed. Seems to be independant of suspend. I failed to identify a specific driver. My assumption is that generic usb is used via libusb. As mentioned earlier, I can provide the hardware. Reassigning to kernel-maintainers for advice. Okay, this is not related to suspend, and I would not know how to use the device. If you can reliably crash kernel using libusb and non-root uid, that's probably important. Is that the case? Can you reproduce the crash on latest 11.1beta and/or latest vanilla kernel? The oopses are unrelated to usb. It must be verified that indeed usage of the device causes. Mere presence is not proof. Never tried as non-root. Maybe I can chmod something under /dev/bus/usb to allow me non-root. ...ok, I guess you need to find reliable way to reproduce the oops, then we can decide if it is usb-related... Olivier, usb is currently our best hope :-). Jan, does this look like the weird memory corruption issue with usbfs you found? Definitely not exactly the one I saw as that is already fixed in 2.6.25 used in OpenSUSE 11. But yes, symptoms look similar (i.e., random corruption when USB devices are discovered / removed). Please test whether the mere presence of the device is sufficient or some kind of use is needed. Jan, as this is too similar to ignore, could you attach that fix? Created attachment 272295 [details]
Patch fixing similarly looking problem in SLES10
OK, here you have the fix from SLES10 SP2
Looking into the logs, all oopses happen during discovering USB devices (in particular some Broadcom USB device seems to be always nearby). I don't think I'm the right person to debug this. Greg, can you have a look? Big question, does this happen still on 11.1? I'll have access to the relevant hardware again this weekend. Is it sufficient to test against SLED11-RC3 ? RC4 would be best of course :) All is well with SLED11-RC3 Cannot provoke any oopses, not even when exercising suspend/resume. Thanks for fixing, whoever it was and whatever it was :-) I suggest RESOLVED FIXED. ok, marking it as such, thanks for testing. |