Bug 849028

Summary: grub2-probe create kernel segfault
Product: [openSUSE] openSUSE 13.1
Reporter: Bruno Friedmann <bruno>
Component: Kernel
Assignee: E-mail List <kernel-maintainers>
Status: RESOLVED WONTFIX
QA Contact: E-mail List <qa-bugs>
Severity: Normal
Priority: P5 - None
CC: arvidjaar, marvin24, tiwai
Version: RC 2
Target Milestone: ---
Hardware: x86-64
OS: SUSE Other
Whiteboard:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---
Attachments: dmesg with crash in

Description Bruno Friedmann 2013-11-05 13:51:06 UTC
User-Agent:       Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Firefox/24.0

During the last kernel update, mkinitrd and the grub2 installation were run automatically.
grub2-probe failed with an error about a nonexistent LVM logical volume.

A look at dmesg shows lots of segfaults and failures.

Reproducible: Always

Steps to Reproduce:
1. run RC1
2. upgrade to RC2 
3. run dmesg
Actual Results:  
dmesg shows segfaults during numerous attempts to detect filesystems

Expected Results:  
A warning or an "under control" message

[1117631.441675] EXT4-fs (sdg1): VFS: Can't find ext4 filesystem
[1117631.446774] EXT4-fs (sdg1): VFS: Can't find ext4 filesystem
[1117631.451864] EXT4-fs (sdg1): VFS: Can't find ext4 filesystem
[1117631.464230] ISOFS: Unable to identify CD-ROM format.
[1117631.474272] REISERFS warning (device sdg1): sh-2021 reiserfs_fill_super: can not find reiserfs on sdg1
[1117631.479559] XFS (sdg1): bad magic number
[1117631.479571] ffff88040d4b1000: 52 56 be 1b 81 e8 39 01 5e bf f4 81 66 8b 2d 83  RV....9.^...f.-.
[1117631.479578] ffff88040d4b1010: 7d 08 00 0f 84 e2 00 80 7c ff 00 74 46 66 8b 1d  }.......|..tFf..
[1117631.479584] ffff88040d4b1020: 66 8b 4d 04 66 31 c0 b0 7f 39 45 08 7f 03 8b 45  f.M.f1...9E....E
[1117631.479589] ffff88040d4b1030: 08 29 45 08 66 01 05 66 83 55 04 00 c7 04 10 00  .)E.f..f.U......
[1117631.479596] XFS (sdg1): Internal error xfs_sb_read_verify at line 780 of file /home/abuild/rpmbuild/BUILD/kernel-default-3.11.6/linux-3.11/fs/xfs/xfs_mount.c.  Caller 0xffffffffa079b295

[1117631.479606] CPU: 3 PID: 239 Comm: kworker/3:1H Tainted: G        W    3.11.6-1-default #1
[1117631.479612] Hardware name: Gigabyte Technology Co., Ltd. GA-890GPA-UD3H/GA-890GPA-UD3H, BIOS FF 11/24/2010
[1117631.479669] Workqueue: xfslogd xfs_buf_iodone_work [xfs]
[1117631.479674]  0000000000000001 ffffffff81560e60 ffff8803d7f52800 ffffffffa079d7d6
[1117631.479683]  0000030c7d08000f ffffffffa079b295 ffff88040ffe4e00 0000000000000016
[1117631.479691]  ffff8803d7f52800 ffffe8ffffcc1300 ffffffffa07eccdd ffffffffa079b295
[1117631.479699] Call Trace:
[1117631.479727]  [<ffffffff81004868>] dump_trace+0x78/0x2c0
[1117631.479741]  [<ffffffff81004b3f>] show_stack_log_lvl+0x8f/0x170
[1117631.479752]  [<ffffffff81005e0c>] show_stack+0x1c/0x50
[1117631.479764]  [<ffffffff81560e60>] dump_stack+0x41/0x51
[1117631.479814]  [<ffffffffa079d7d6>] xfs_corruption_error+0x56/0x80 [xfs]
[1117631.479946]  [<ffffffffa07eccdd>] xfs_sb_read_verify+0xfd/0x120 [xfs]
[1117631.480225]  [<ffffffffa079b295>] xfs_buf_iodone_work+0x75/0xd0 [xfs]
[1117631.480284]  [<ffffffff810687fd>] process_one_work+0x16d/0x420
[1117631.480296]  [<ffffffff810693c6>] worker_thread+0x116/0x3b0
[1117631.480308]  [<ffffffff8106f8df>] kthread+0xaf/0xc0
[1117631.480320]  [<ffffffff8156ee7c>] ret_from_fork+0x7c/0xb0
[1117631.480331] XFS (sdg1): Corruption detected. Unmount and run xfs_repair
[1117631.480371] XFS (sdg1): SB validate failed with error 22.
[1117631.490622] FAT-fs (sdg1): invalid media value (0xe2)
[1117631.490631] FAT-fs (sdg1): Can't find a valid FAT filesystem
[1117631.495570] FAT-fs (sdg1): invalid media value (0xe2)
[1117631.495578] FAT-fs (sdg1): Can't find a valid FAT filesystem
[1117631.500448] VFS: Can't find a Minix filesystem V1 | V2 | V3 on device sdg1.
[1117631.505431] hfsplus: unable to find HFS+ superblock
[1117631.510246] qnx4: wrong fsid in superblock.
[1117631.515189] You didn't specify the type of your ufs filesystem

mount -t ufs -o ufstype=sun|sunx86|44bsd|ufs2|5xbsd|old|hp|nextstep|nextstep-cd|openstep ...

>>>WARNING<<< Wrong ufstype may corrupt your filesystem, default is ufstype=old
[1117631.515399] ufs_read_super: bad magic number
[1117631.530029] hfs: can't find a HFS filesystem on dev sdg1
[1117631.665095] EXT4-fs (sdh1): VFS: Can't find ext4 filesystem
[1117631.669939] EXT4-fs (sdh1): VFS: Can't find ext4 filesystem
[1117631.674722] EXT4-fs (sdh1): VFS: Can't find ext4 filesystem
[1117631.687038] ISOFS: Unable to identify CD-ROM format.
[1117631.697183] REISERFS warning (device sdh1): sh-2021 reiserfs_fill_super: can not find reiserfs on sdh1
[1117631.702431] XFS (sdh1): bad magic number
[1117631.702444] ffff880328a37000: 52 56 be 1b 81 e8 39 01 5e bf f4 81 66 8b 2d 83  RV....9.^...f.-.
[1117631.702451] ffff880328a37010: 7d 08 00 0f 84 e2 00 80 7c ff 00 74 46 66 8b 1d  }.......|..tFf..
[1117631.702457] ffff880328a37020: 66 8b 4d 04 66 31 c0 b0 7f 39 45 08 7f 03 8b 45  f.M.f1...9E....E
[1117631.702461] ffff880328a37030: 08 29 45 08 66 01 05 66 83 55 04 00 c7 04 10 00  .)E.f..f.U......
[1117631.702469] XFS (sdh1): Internal error xfs_sb_read_verify at line 780 of file /home/abuild/rpmbuild/BUILD/kernel-default-3.11.6/linux-3.11/fs/xfs/xfs_mount.c.  Caller 0xffffffffa079b295

[1117631.702479] CPU: 3 PID: 239 Comm: kworker/3:1H Tainted: G        W    3.11.6-1-default #1
[1117631.702485] Hardware name: Gigabyte Technology Co., Ltd. GA-890GPA-UD3H/GA-890GPA-UD3H, BIOS FF 11/24/2010
[1117631.702540] Workqueue: xfslogd xfs_buf_iodone_work [xfs]
[1117631.702546]  0000000000000001 ffffffff81560e60 ffff880411087800 ffffffffa079d7d6
[1117631.702555]  0000030c7d08000f ffffffffa079b295 ffff880411304e40 0000000000000016
[1117631.702563]  ffff880411087800 ffffe8ffffcc1300 ffffffffa07eccdd ffffffffa079b295
[1117631.702571] Call Trace:
[1117631.702600]  [<ffffffff81004868>] dump_trace+0x78/0x2c0
[1117631.702614]  [<ffffffff81004b3f>] show_stack_log_lvl+0x8f/0x170
[1117631.702625]  [<ffffffff81005e0c>] show_stack+0x1c/0x50
[1117631.702638]  [<ffffffff81560e60>] dump_stack+0x41/0x51
[1117631.702687]  [<ffffffffa079d7d6>] xfs_corruption_error+0x56/0x80 [xfs]
[1117631.702821]  [<ffffffffa07eccdd>] xfs_sb_read_verify+0xfd/0x120 [xfs]
[1117631.703100]  [<ffffffffa079b295>] xfs_buf_iodone_work+0x75/0xd0 [xfs]
[1117631.703233]  [<ffffffff810687fd>] process_one_work+0x16d/0x420
[1117631.703269]  [<ffffffff810693c6>] worker_thread+0x116/0x3b0
[1117631.703296]  [<ffffffff8106f8df>] kthread+0xaf/0xc0
[1117631.703309]  [<ffffffff8156ee7c>] ret_from_fork+0x7c/0xb0
[1117631.703322] XFS (sdh1): Corruption detected. Unmount and run xfs_repair
[1117631.703344] XFS (sdh1): SB validate failed with error 22.
[1117631.713687] FAT-fs (sdh1): invalid media value (0xe2)
[1117631.713696] FAT-fs (sdh1): Can't find a valid FAT filesystem
[1117631.718548] FAT-fs (sdh1): invalid media value (0xe2)
[1117631.718557] FAT-fs (sdh1): Can't find a valid FAT filesystem
[1117631.723607] VFS: Can't find a Minix filesystem V1 | V2 | V3 on device sdh1.
[1117631.728585] hfsplus: unable to find HFS+ superblock
[1117631.733418] qnx4: wrong fsid in superblock.
[1117631.738176] You didn't specify the type of your ufs filesystem

mount -t ufs -o ufstype=sun|sunx86|44bsd|ufs2|5xbsd|old|hp|nextstep|nextstep-cd|openstep ...

>>>WARNING<<< Wrong ufstype may corrupt your filesystem, default is ufstype=old
[1117631.738433] ufs_read_super: bad magic number
[1117631.753066] hfs: can't find a HFS filesystem on dev sdh1
Comment 1 Olaf Hering 2013-11-05 14:15:45 UTC
[1117631.702431] XFS (sdh1): bad magic number
It's just noise.
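
For illustration, the "bad magic number" message can be checked by hand against the hex dump in the log. The sketch below is not kernel code. It assumes the dump starts at the beginning of the superblock buffer, and the helper names parse_hexdump_line and has_xfs_magic are made up for this example. The only hard fact it relies on is that an XFS superblock begins with the four bytes "XFSB".

```python
# Sketch: compare the first dumped bytes with the XFS superblock magic.
# Helper names are hypothetical; this is not how the kernel does the check.

XFS_SB_MAGIC = b"XFSB"  # 0x58465342, first four bytes of an XFS superblock


def parse_hexdump_line(line: str) -> bytes:
    """Return the 16 data bytes of a kernel hexdump line such as
    '[ts] ffff88040d4b1000: 52 56 be 1b ...  RV....'."""
    # Everything after the first ': ' is the hex columns plus the ASCII column.
    _, _, rest = line.partition(": ")
    # The first 16 whitespace-separated tokens are the hex bytes.
    tokens = rest.split()[:16]
    return bytes(int(token, 16) for token in tokens)


def has_xfs_magic(data: bytes) -> bool:
    """True if the buffer starts with the XFS superblock magic."""
    return data[:4] == XFS_SB_MAGIC


if __name__ == "__main__":
    first_line = (
        "[1117631.479571] ffff88040d4b1000: "
        "52 56 be 1b 81 e8 39 01 5e bf f4 81 66 8b 2d 83  RV....9.^...f.-."
    )
    print(has_xfs_magic(parse_hexdump_line(first_line)))  # prints False
```

The partition starts with 52 56 be 1b rather than 58 46 53 42, so the superblock check rejects it, which matches the reading that nothing on sdg1 or sdh1 is actually an XFS filesystem.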
Comment 2 Bruno Friedmann 2013-11-05 14:47:22 UTC
Created attachment 566178 [details]
dmesg with crash in

Thanks, Olaf, for the confirmation.
I'm adding the full dmesg just in case.
Comment 3 Andrei Borzenkov 2013-11-05 16:26:02 UTC
Do you still observe it after you remove os-prober (rpm -e --nodeps os-prober)?
Comment 4 Bruno Friedmann 2013-11-05 20:24:05 UTC
Andrei, there is no need to break package dependencies.

If I simply uncheck "use os-prober" in the YaST bootloader settings, the segfault doesn't appear, and mkinitrd and grub2-mkconfig run more smoothly:

mkinitrd

Kernel image:   /boot/vmlinuz-3.11.6-3-default
Initrd image:   /boot/initrd-3.11.6-3-default
KMS drivers:     radeon
Root device:    /dev/vgsys/lvsuse (mounted on / as ext4)
Resume device:  /dev/vgsys/lvswap
setup-md.sh: kusco.disney.interne:mdsys found multiple times
Kernel Modules: thermal_sys thermal processor fan dm-mod dm-log dm-region-hash dm-mirror dm-snapshot raid0 raid1 raid10 async_tx async_memcpy xor async_xor raid6_pq async_pq async_raid6_recov raid456 scsi_dh scsi_dh_hp_sw scsi_dh_alua scsi_dh_rdac scsi_dh_emc i2c-algo-bit drm drm_kms_helper ttm radeon usb-common usbcore ohci-hcd uhci-hcd ehci-hcd xhci-hcd usbhid hid-logitech-dj hid-generic hid-holtek-kbd hid-lenovo-tpkbd hid-ortek hid-roccat hid-roccat-common hid-roccat-arvo hid-roccat-isku hid-samsung ehci-pci ohci-pci linear
Firmware:       radeon/R520_cp.bin radeon/RS600_cp.bin radeon/RS690_cp.bin radeon/R420_cp.bin radeon/R300_cp.bin radeon/R200_cp.bin radeon/R100_cp.bin radeon/SUMO2_me.bin radeon/SUMO2_pfp.bin radeon/SUMO_me.bin radeon/SUMO_pfp.bin radeon/SUMO_rlc.bin radeon/PALM_me.bin radeon/PALM_pfp.bin radeon/CYPRESS_smc.bin radeon/CYPRESS_rlc.bin radeon/CYPRESS_me.bin radeon/CYPRESS_pfp.bin radeon/JUNIPER_smc.bin radeon/JUNIPER_rlc.bin radeon/JUNIPER_me.bin radeon/JUNIPER_pfp.bin radeon/REDWOOD_smc.bin radeon/REDWOOD_rlc.bin radeon/REDWOOD_me.bin radeon/REDWOOD_pfp.bin radeon/CEDAR_smc.bin radeon/CEDAR_rlc.bin radeon/CEDAR_me.bin radeon/CEDAR_pfp.bin radeon/R700_rlc.bin radeon/R600_rlc.bin radeon/RV710_smc.bin radeon/RV710_me.bin radeon/RV710_pfp.bin radeon/RV740_smc.bin radeon/RV730_smc.bin radeon/RV730_me.bin radeon/RV730_pfp.bin radeon/RV770_smc.bin radeon/RV770_me.bin radeon/RV770_pfp.bin radeon/RS780_me.bin radeon/RS780_pfp.bin radeon/RV670_me.bin radeon/RV670_pfp.bin radeon/RV635_me.bin radeon/RV635_pfp.bin radeon/RV620_me.bin radeon/RV620_pfp.bin radeon/RV630_me.bin radeon/RV630_pfp.bin radeon/RV610_me.bin radeon/RV610_pfp.bin radeon/R600_me.bin radeon/R600_pfp.bin radeon/ARUBA_rlc.bin radeon/ARUBA_me.bin radeon/ARUBA_pfp.bin radeon/CAYMAN_smc.bin radeon/CAYMAN_rlc.bin radeon/CAYMAN_mc.bin radeon/CAYMAN_me.bin radeon/CAYMAN_pfp.bin radeon/CAICOS_smc.bin radeon/CAICOS_mc.bin radeon/CAICOS_me.bin radeon/CAICOS_pfp.bin radeon/TURKS_smc.bin radeon/TURKS_mc.bin radeon/TURKS_me.bin radeon/TURKS_pfp.bin radeon/BTC_rlc.bin radeon/BARTS_smc.bin radeon/BARTS_mc.bin radeon/BARTS_me.bin radeon/BARTS_pfp.bin radeon/HAINAN_smc.bin radeon/HAINAN_rlc.bin radeon/HAINAN_mc.bin radeon/HAINAN_ce.bin radeon/HAINAN_me.bin radeon/HAINAN_pfp.bin radeon/OLAND_smc.bin radeon/OLAND_rlc.bin radeon/OLAND_mc.bin radeon/OLAND_ce.bin radeon/OLAND_me.bin radeon/OLAND_pfp.bin radeon/VERDE_smc.bin radeon/VERDE_rlc.bin radeon/VERDE_mc.bin radeon/VERDE_ce.bin radeon/VERDE_me.bin radeon/VERDE_pfp.bin 
radeon/PITCAIRN_smc.bin radeon/PITCAIRN_rlc.bin radeon/PITCAIRN_mc.bin radeon/PITCAIRN_ce.bin radeon/PITCAIRN_me.bin radeon/PITCAIRN_pfp.bin radeon/TAHITI_smc.bin radeon/TAHITI_rlc.bin radeon/TAHITI_mc.bin radeon/TAHITI_ce.bin radeon/TAHITI_me.bin radeon/TAHITI_pfp.bin radeon/BONAIRE_uvd.bin radeon/TAHITI_uvd.bin radeon/SUMO_uvd.bin radeon/CYPRESS_uvd.bin radeon/RV710_uvd.bin radeon/KABINI_sdma.bin radeon/KABINI_rlc.bin radeon/KABINI_mec.bin radeon/KABINI_ce.bin radeon/KABINI_me.bin radeon/KABINI_pfp.bin radeon/BONAIRE_sdma.bin radeon/BONAIRE_rlc.bin radeon/BONAIRE_mc.bin radeon/BONAIRE_mec.bin radeon/BONAIRE_ce.bin radeon/BONAIRE_me.bin radeon/BONAIRE_pfp.bin
Features:       acpi dm md kms plymouth block usb lvm2 resume.userspace resume.kernel

kuzco:~ # grub2-mkconfig -o /tmp/grub.cfg
Generating grub.cfg ...
Found theme: /boot/grub2/themes/openSUSE/theme.txt
Found linux image: /boot/vmlinuz-3.11.6-3-default
Found initrd image: /boot/initrd-3.11.6-3-default
done
Comment 5 Andrei Borzenkov 2013-11-06 17:04:54 UTC
If os-prober cannot detect the partition type (effectively it uses blkid for that), it simply tries to mount the partition with every filesystem currently loaded (/proc/filesystems). This is not in any way specific to the bios_grub partition and has nothing to do with grub2-probe.

Could you please show the output of

blkid -s TYPE

I'll check whether we can skip known partition types that cannot hold filesystems, but a kernel oops is a kernel bug; you have to report it against the kernel.
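
To make the mount fallback concrete, here is a rough sketch of how the list of candidate filesystems can be derived from /proc/filesystems. It is not os-prober's code (os-prober is a set of shell scripts); the function name block_filesystems is invented for the example. It relies only on the file's format: one filesystem per line, prefixed with "nodev" when the filesystem does not need a block device.

```python
# Sketch: list the filesystem types a blind mount attempt could try.
# block_filesystems is a hypothetical helper, not part of os-prober.
from typing import List


def block_filesystems(proc_filesystems_text: str) -> List[str]:
    """Return filesystem names that are not flagged 'nodev'."""
    names = []
    for line in proc_filesystems_text.splitlines():
        fields = line.split()
        # 'nodev<TAB>sysfs' has two fields; '<TAB>ext4' has one.
        if len(fields) == 1:
            names.append(fields[0])
    return names


if __name__ == "__main__":
    with open("/proc/filesystems") as handle:
        for name in block_filesystems(handle.read()):
            print(name)
```

Each name printed corresponds to one mount attempt on an unidentified partition, which would explain why ext4, isofs, reiserfs, xfs, vfat, minix, hfsplus, qnx4, ufs and hfs all complain about sdg1 and sdh1 within a fraction of a second.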
Comment 6 Bruno Friedmann 2013-11-06 18:13:25 UTC
blkid -s TYPE
/dev/sda1: TYPE="linux_raid_member"
/dev/sdb1: TYPE="linux_raid_member"
/dev/sdc1: TYPE="linux_raid_member"
/dev/sdd1: TYPE="linux_raid_member"
/dev/sde1: TYPE="linux_raid_member"
/dev/sdf1: TYPE="linux_raid_member"
/dev/sdg2: TYPE="linux_raid_member"
/dev/sdg3: TYPE="linux_raid_member"
/dev/sdh2: TYPE="linux_raid_member"
/dev/sdh3: TYPE="linux_raid_member"
/dev/md127: TYPE="LVM2_member"
/dev/mapper/vgsys-lvsuse: TYPE="ext4"
/dev/mapper/vgsys-lvswap: TYPE="swap"
/dev/mapper/vgsys-lvvar: TYPE="ext4"
/dev/md0: TYPE="LVM2_member"
/dev/md126: TYPE="ext4"

So we see that sdg1 and sdh1 are missing from the list.

And yes, in fact XFS is the only unhappy component :-)
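
The comparison above can be scripted. This is a minimal sketch that parses `blkid -s TYPE` output and reports which partitions got no TYPE at all. The partition list passed in is an assumption made for the example (the report does not include a full partition listing), and the helper names are made up.

```python
# Sketch: find partitions for which blkid reported no TYPE.
# parse_blkid_types and untyped_devices are hypothetical helpers.
import re
from typing import Dict, Iterable, List

BLKID_LINE = re.compile(r'^(?P<dev>\S+):\s+TYPE="(?P<type>[^"]*)"')


def parse_blkid_types(output: str) -> Dict[str, str]:
    """Map device path to the TYPE value printed by 'blkid -s TYPE'."""
    types = {}
    for line in output.splitlines():
        match = BLKID_LINE.match(line.strip())
        if match:
            types[match.group("dev")] = match.group("type")
    return types


def untyped_devices(devices: Iterable[str], blkid_output: str) -> List[str]:
    """Return the devices that do not appear in the blkid output."""
    typed = parse_blkid_types(blkid_output)
    return [dev for dev in devices if dev not in typed]


if __name__ == "__main__":
    blkid_output = "\n".join([
        '/dev/sdg2: TYPE="linux_raid_member"',
        '/dev/sdg3: TYPE="linux_raid_member"',
        '/dev/sdh2: TYPE="linux_raid_member"',
        '/dev/sdh3: TYPE="linux_raid_member"',
    ])
    # Assumed partition list for the two disks in question.
    partitions = ["/dev/sdg1", "/dev/sdg2", "/dev/sdg3",
                  "/dev/sdh1", "/dev/sdh2", "/dev/sdh3"]
    print(untyped_devices(partitions, blkid_output))
    # prints ['/dev/sdg1', '/dev/sdh1']
```

The two partitions it reports are exactly the ones that show up in the dmesg probe messages.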
Comment 7 Takashi Iwai 2014-03-13 10:29:58 UTC
*** Bug 849843 has been marked as a duplicate of this bug. ***
Comment 8 Marvin FourtyTwo 2014-03-13 10:47:41 UTC
Here it fails to detect the "extended" partition (sda4).
Comment 9 Takashi Iwai 2014-03-17 16:39:12 UTC
The kernel message from XFS is merely noise, as Olaf suggested.  It's not a real Oops.  It could have been handled better, but we can live with it, as it's actually harmless.  So let's close as WONTFIX (unless anyone can provide a better fix for upstream :)
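
As a rough aid for telling this kind of noise from a real crash, the sketch below scans dmesg text for a few strings that the kernel prints on an actual Oops, a kernel BUG, or a userspace segfault. The marker list is an assumption and is not exhaustive, and the function name fatal_lines is made up for the example.

```python
# Sketch: flag dmesg lines that look like a real crash.
# The marker list is a non-exhaustive assumption for illustration.
import re
from typing import List

FATAL_MARKERS = (
    re.compile(r"\bOops: "),                   # kernel oops header
    re.compile(r"\bBUG: "),                    # e.g. "BUG: unable to handle ..."
    re.compile(r"general protection fault"),
    re.compile(r"\bsegfault at\b"),            # userspace process segfault
)


def fatal_lines(dmesg_text: str) -> List[str]:
    """Return the lines that match any of the crash markers."""
    return [
        line
        for line in dmesg_text.splitlines()
        if any(marker.search(line) for marker in FATAL_MARKERS)
    ]


if __name__ == "__main__":
    excerpt = "\n".join([
        "[1117631.479559] XFS (sdg1): bad magic number",
        "[1117631.479699] Call Trace:",
        "[1117631.480331] XFS (sdg1): Corruption detected. Unmount and run xfs_repair",
        "[1117631.480371] XFS (sdg1): SB validate failed with error 22.",
    ])
    print(fatal_lines(excerpt))  # prints []
```

On the excerpt from this report nothing matches: the stack trace comes from xfs_corruption_error reporting a failed superblock check, not from a fault in the kernel.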