Bug 568307

Summary: Regression: backported rt2860 driver in kernel-default-2.6.31.8-0.1.1 triggers kernel panics
Product: [openSUSE] openSUSE 11.2 Reporter: Forgotten User cAXlJ_FoSf <forgotten_cAXlJ_FoSf>
Component: KernelAssignee: Greg Kroah-Hartman <gregkh>
Status: RESOLVED DUPLICATE QA Contact: E-mail List <qa-bugs>
Severity: Critical    
Priority: P2 - High CC: AxelKoellhofer, freespacer, kailed, meissner, ralf, sergey1369, uli.geins
Version: Final   
Target Milestone: ---   
Hardware: x86   
OS: openSUSE 11.2   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Attachments: Screen capture of the panic
My /var/log/messages with comments

Description Forgotten User cAXlJ_FoSf 2010-01-05 13:02:37 UTC
Created attachment 334984 [details]
Screen capture of the panic

User-Agent:       Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.6) Gecko/20091201 SUSE/3.5.6-1.1.1 Firefox/3.5.6

After the last kernel update to kernel-default-2.6.31.8-0.1.1 connecting to a WPA-encrypted WLAN with my MSI Wind U-100/Ralink RT2860 chipset using the rt2860 driver triggers a kernel panic. This is a regression in the backported rt2860 driver contained in the above kernel update, the driver in kernel-default-2.6.31.5-0.1.1 worked fine.

A screen capture of the panic is attached.

I know the driver is in staging but I expect a stable kernel update not to break wireless on my netbook.

Reproducible: Always

Steps to Reproduce:
Trying to connect to a WPA-encrypted WLAN using NetworkManager triggers a kernel panic every time.
Comment 1 Forgotten User cAXlJ_FoSf 2010-01-05 13:11:49 UTC
Bug #568162 might be related to or a duplicate of this one but there are not enough details to be sure.
Comment 2 Daniele Tombolini 2010-01-05 17:44:35 UTC
I' have the same chip so I think it's a dupe of:
#568120

but rt2860 apart, there is this:

http://lists.opensuse.org/opensuse-kernel/2009-12/msg00034.html
Comment 3 Forgotten User cAXlJ_FoSf 2010-01-05 18:37:22 UTC
*** Bug 568162 has been marked as a duplicate of this bug. ***
Comment 4 Forgotten User cAXlJ_FoSf 2010-01-05 18:44:35 UTC
(In reply to comment #2)
> I' have the same chip so I think it's a dupe of:
> #568120
> 
> but rt2860 apart, there is this:
> 
> http://lists.opensuse.org/opensuse-kernel/2009-12/msg00034.html

That issue is unrelated to this bug which happens inside the network stack when connecting to a WLAN. See the attached screenshot for the details.
Comment 5 Raphael Javaux 2010-01-05 19:21:43 UTC
Created attachment 335041 [details]
My /var/log/messages with comments

Same regression on Linksys WMP600N (rt2860). Worked fine with 2.6.31.5 but crash since update to 2.6.31.8.

I've add my /var/log/messages with comments after 3 crashs to the attachments.

It looks like a problem with dhcp.
Comment 6 uli geins 2010-01-05 20:25:37 UTC
the same problem have my pc. since update from kernel 2.6.31.5 to 2.6.31.8 my wlan-card hangs by connecting to the accespoint and want everytime to give the wpa password.the card is a longshine 300mbit with a rt2860pci/express. befor the update it work fine.
i have try all new drivers and firmware from the factory repos created by nicko. the same as....
i can not give a log-output, because the maschine freez during connecting.
Comment 7 Sebastian Siebert 2010-01-10 00:00:27 UTC
I have the same problem with kernel panic. I used the WLAN chipset of Ralink RT2860.

kernel-desktop 2.6.31.8-0.1.1 (from official Update-Repo)

Kernel-Modul: rt2860sta

If I connect to the WLAN-Router via NetworkManager (KDE4), the computer is freezing. (Scroll Lock && Caps Lock are flashing)

This is very very critical and this bug of rt2860 driver have to fixed ASAP. Where is our quality management? Who tests this kernel update before release?

Workaround: Downgrade to kernel 2.6.31.5
Comment 8 Greg Kroah-Hartman 2010-01-10 00:45:11 UTC
ick, looks like the driver update caused big problems, unfortunatly it works, and
is required by other people who now have a working system.

I'll look into this on monday, sorry about this.

As for QA, again, it works for me here, it is just that there is a wide range
of devices this driver supports, and I do not have all of them to test with.
Comment 9 Daniele Tombolini 2010-01-10 01:06:14 UTC
I don't know which is the driver version in .8 (downgraded to .5) but I've tried kernel 2.6.32 (maybe from factory, not sure..)two week ago without issue.
Hope it helps.
Comment 10 Axel Köllhofer 2010-01-10 11:33:57 UTC
If the problem is really caused by the updated drivers (rt2860sta.ko/rt2870sta.ko) in 2.6.31.8, then this here might be a temporary workaround until there is an official update.

http://download.opensuse.org/repositories/home:/Akoellh:/Kernelmodules/openSUSE_11.2_Update/

You will find packages "rt28x0-kmp-$FLAVOR-2.6.31.5_2.6.31.8_0.1-1.1.$ARCHITECTURE.rpm" which contain the above mentioned kernel modules from 2.6.31.5 compiled against 2.6.31.8. 

After installing the appropriate kmp-package for your architecture/kernel flavor and unloading/reloading the respective kernel module, you will be using the "old" drivers on the "new" kernel.
Comment 11 uli geins 2010-01-10 16:22:51 UTC
ok. i have installed the driver from this repo http://download.opensuse.org/repositories/home:/Akoellh:/Kernelmodules/openSUSE_11.2_Update/
and the card is now connecting to the router perfectly and did not freeze!!!!
but there is one issue. my card is a 300 mbit and the actual connection is only 54 mbit. before i have installed the new kernel, it was every time nearly 270 up to 300 mbit connection speed. can you go on.......

here is a state of lsmod:

Intel:/home/geins # lsmod | grep rt2
rt2860sta             573496  1
Intel:/home/geins #

and here the iwconfig:
 
wlan0     RT2860 Wireless  ESSID:"Geins"  Nickname:"RT2860STA"
          Mode:Managed  Frequency=2.412 GHz  Access Point: 00:1F:3F:A2:40:AD
          Bit Rate=54 Mb/s
          RTS thr:off   Fragment thr:off
          Encryption key:2103-49B3-D5FE-0BBC-F61A-48DE-0E02-3EF4
          Link Quality=100/100  Signal level:-43 dBm  Noise level:-71 dBm
          Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
          Tx excessive retries:0  Invalid misc:0   Missed beacon:0

and at last ifconfig:

wlan0     Link encap:Ethernet  Hardware Adresse 00:1F:1F:26:71:16
          inet Adresse:192.168.178.3  Bcast:192.168.178.255  Maske:255.255.255.0
          inet6 Adresse: fe80::21f:1fff:fe26:7116/64  Gültigkeitsbereich:Verbindung
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:7650 errors:0 dropped:0 overruns:0 frame:0
          TX packets:796 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 Sendewarteschlangenlänge:1000
          RX bytes:1722162 (1.6 Mb)  TX bytes:44055 (43.0 Kb)
          Interrupt:16

my card is a longshine 300mbit pci express with a rt2860 chipset.

thanks
uli
Comment 12 uli geins 2010-01-10 16:55:06 UTC
i did it. i have my 300 mbit connection back.
it was the firmware. i have installed the actual firmware. i did it before the kernelupdate too.
see link http://www.ralinktech.com/support.php?s=2. from here i get the actual firmware and also install the webgui configurationtool.

here is the iwconfig:

Intel:/home/geins # iwconfig

wlan0     RT2860 Wireless  ESSID:"Geins"  Nickname:"RT2860STA"
          Mode:Managed  Frequency=2.412 GHz  Access Point: 00:1F:3F:A2:40:AD
          Bit Rate=270 Mb/s
          RTS thr:off   Fragment thr:off
          Encryption key:629F-F01B-F756-00CC-D20B-B85A-11FA-BE3D
          Link Quality=100/100  Signal level:-43 dBm  Noise level:-71 dBm
          Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
          Tx excessive retries:0  Invalid misc:0   Missed beacon:0

Intel:/home/geins #

and ifconfig:

wlan0     Link encap:Ethernet  Hardware Adresse 00:1F:1F:26:71:16
          inet Adresse:192.168.178.3  Bcast:192.168.178.255  Maske:255.255.255.0
          inet6 Adresse: fe80::21f:1fff:fe26:7116/64 Gültigkeitsbereich:Verbindung
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:3616 errors:0 dropped:0 overruns:0 frame:0
          TX packets:454 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 Sendewarteschlangenlänge:1000
          RX bytes:831521 (812.0 Kb)  TX bytes:24203 (23.6 Kb)
          Interrupt:16
Comment 13 Axel Köllhofer 2010-01-10 22:38:55 UTC
(In reply to comment #12)
> i did it. i have my 300 mbit connection back.
> it was the firmware. i have installed the actual firmware. i did it before the
> kernelupdate too.

Be more specific, please.

The firmware from "Firmware RT2860PCI/mPCI/PCIe/CB(RT2760/RT2790/RT2860/RT2890)" is exactly the same version as the one provided by either the "kernel-firmware" or "ralink-firmware" RPMs for 11.2, have a look at this here:

md5sum /lib/firmware/rt2860.bin RT2860_Firmware_V11/rt2860.bin rt2860.bin 7f55011396eff4983f26bb7dd7339fb3  /lib/firmware/rt2860.bin
7f55011396eff4983f26bb7dd7339fb3  RT2860_Firmware_V11/rt2860.bin
7f55011396eff4983f26bb7dd7339fb3  rt2860.bin

However, the firmware proviced with latest Vendor driver "RT2860PCI/mPCI/CB/PCIe(RT2760/RT2790/RT2860/RT2890)" has another md5sum:

md5sum 2009_0918_RT2860_Linux_STA_v2.2.0.0/common/rt2860.bin
1d7d8c575a2a247e2d9998e5e3d833b8  2009_0918_RT2860_Linux_STA_v2.2.0.0/common/rt2860.bin

So what is the md5sum of the rt2860.bin you are using at the Moment (= the one you placed in /lib/firmware/)?
Comment 14 uli geins 2010-01-11 18:45:07 UTC
that is interesting.
ok, let's compare.
here is the md5sum of my actual installed packages:

md5sum /lib/modules/2.6.31.8-0.1-pae/kernel/drivers/staging/rt2860/rt2860sta.ko
7334f59c4925214c3a248ec7a5738fa4  rt2860sta.ko

md5sum /lib/firmware/rt2860.bin
7f55011396eff4983f26bb7dd7339fb3  rt2860.bin

md5sum /lib/modules/2.6.31.8-0.1-pae/updates/rt2860sta.ko
1d118babc5acfd19a7b0b624c70a7284  /lib/modules/2.6.31.8-0.1-pae/updates/rt2860sta.ko

md5sum /lib/modules/2.6.31.8-0.1-pae/kernel/drivers/staging/rt2860/rt2860sta.ko
7334f59c4925214c3a248ec7a5738fa4  /lib/modules/2.6.31.8-0.1-pae/kernel/drivers/staging/rt2860/rt2860sta.ko

i saw, that the "webgui" installer install and override also some drivers from the ralink page, witch is not working properly after installation.
but you see in my postings befor, that i have a 300 mbit connenction, so were is the different?
Comment 15 Axel Köllhofer 2010-01-12 13:41:48 UTC
(In reply to comment #14)

> here is the md5sum of my actual installed packages:
> 

> md5sum /lib/firmware/rt2860.bin
> 7f55011396eff4983f26bb7dd7339fb3  rt2860.bin
> 

So you ware using the exact same firmware as provided by the official packages "ralink-firmware" or "kernel-firmware".

The md5sums of kernel modules are of no relevance here.

> i saw, that the "webgui" installer install and override also some drivers from
> the ralink page, witch is not working properly after installation.

As these are vendor drivers not shipped and maintaned by openSUSE, this is also of no relevance.
Comment 16 Greg Kroah-Hartman 2010-01-13 23:40:11 UTC
Caused by bug #540589, reassigning...

*** This bug has been marked as a duplicate of bug 540589 ***