Bug 548132 - YaST2 automatic repair from DVD still fails to recognize RAID1 array; 11.2 unable to recognize 11.1 raid filesystem
Summary: YaST2 automatic repair from DVD still fails to recognize RAID1 array; 11.2 un...
Status: RESOLVED FIXED
Alias: None
Product: openSUSE 11.2
Classification: openSUSE
Component: YaST2 (show other bugs)
Version: RC 1
Hardware: x86-64 openSUSE 11.2
: P3 - Medium : Major with 5 votes (vote)
Target Milestone: ---
Assignee: Arvin Schnell
QA Contact: Jiri Srain
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-10-19 14:50 UTC by Ian Cheong
Modified: 2009-12-30 13:03 UTC (History)
6 users (show)

See Also:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---


Attachments
11.1 raid information (27.83 KB, application/x-gzip)
2009-10-24 23:09 UTC, Ian Cheong
Details
raid1 11.2rc1 (253.06 KB, application/x-gzip)
2009-10-24 23:14 UTC, Ian Cheong
Details
y2logs (6.17 MB, application/x-compressed-tar)
2009-11-04 23:01 UTC, Ian Cheong
Details
y2logs11.1 (3.53 MB, application/x-compressed-tar)
2009-11-06 10:10 UTC, Ian Cheong
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Ian Cheong 2009-10-19 14:50:59 UTC
User-Agent:       Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_5_8; en-us) AppleWebKit/531.9 (KHTML, like Gecko) Version/4.0.3 Safari/531.9

I thought this problem in 11.1 may have related to 11.1 RAID/Intel bugs which have been fixed.

Repair fails to recognize that drives are part of RAID and wants to repair filesystem on individual drives. When I let 11.1 repair do this in the past, it caused endless booting problems requiring reinstall.

Problem appears persistent.

Reproducible: Always

Steps to Reproduce:
1. Run automatic repair fron DVD with RAID 1 array present.
2. Follow automatic repair suggestions.
3.
Actual Results:  
on 11.1, unbootable system. Not game to let YaST2 repair ruin my 11.2-RC2 system.

Expected Results:  
Fixed any drive errors
Comment 1 Ian Cheong 2009-10-19 14:54:04 UTC
YaST2 repair fails to repair brand new ext4 partition after crash.
Comment 2 Ian Cheong 2009-10-19 15:00:11 UTC
(cont) 
System crashed (running Kaffeine).

Running automatic repair from DVD boot after reboot.

Reports damaged (brand new) ext4 partition and wants to repair. Repair fails in endless loop via repair button back to dialog box "Error detected...partition corrupted".
Comment 3 Ian Cheong 2009-10-19 23:09:06 UTC
Subsequently ext4 partitions are OK. Due to automatic repair during hard disk boot??

RAID array still not recognised. YAST partitioner lists unmounted RAID. Individual devices and array have no filesystem. Unable to mount array.
Comment 4 Josef Reidinger 2009-10-20 11:15:49 UTC
Repair is not adapted to changes done in 11.2 and is possible that this problem is not fixed to release of 11.2.
Comment 5 Ian Cheong 2009-10-24 23:09:25 UTC
Created attachment 324010 [details]
11.1 raid information

raid1 fully functional
Comment 6 Ian Cheong 2009-10-24 23:14:10 UTC
Created attachment 324011 [details]
raid1 11.2rc1

raid1 filesystem not recognizable to 11.2rc1
Comment 7 Ian Cheong 2009-10-27 10:20:40 UTC
Possibly this is similar to this bug:
https://bugzilla.novell.com/show_bug.cgi?id=525282
Comment 8 Ian Cheong 2009-10-28 22:09:41 UTC
It appers that the underlying problem could be a strange partition table created by YaST partitioner for RAID1.

mount on 11.1 is unable to recognise filesystem type but manages to mount ext3 which functions normally. (YaST partitioner also correctly mounts.)

mount: you didn't specify a filesystem type for /dev/md0
       I will try type ext3
/dev/md0 on /srv type ext3 (rw)

parted /dev/md0 on 11.1:
(parted) check 1                                                          
Error: File system has an incompatible feature enabled.   
(parted)  

mount on 11.2 is unable mount unable to detect filesystem.
parted on 11.2:
 # parted /dev/md0
GNU Parted 1.9.0
Using /dev/md0
Welcome to GNU Parted! Type 'help' to view a list of commands.
(parted) print all
Warning: /dev/md0 contains GPT signatures, indicating that it has a GPT table.
However, it does not have a valid fake msdos partition table, as it should.
Perhaps it was corrupted -- possibly by a program that doesn't understand GPT
partition tables.  Or perhaps you deleted the GPT table, and are now using an
msdos partition table.  Is this a GPT partition table?
parted: invalid token: all
Yes/No? Y                                                                 
(parted)
Comment 9 Josef Reidinger 2009-11-04 13:25:36 UTC
I check code and find, that repair should not try to repair md RAID. It doesn't know it. So it should not touch your system if there is not problem in raid detection. But I find that this bug report doesn't contain logs from repair and without it I cannot find what is going wrong
Comment 11 Ian Cheong 2009-11-04 23:01:05 UTC
Created attachment 325613 [details]
y2logs

(Have since given up and rebuilt raid array in 11.2.)

Note the problem was easy to replicate - build a new raid1 array from new drives using YsT partitioner 11.1 assigning whole drive. Try to mount using YaST pertitioner in 11.2 system.
Comment 12 Josef Reidinger 2009-11-05 10:13:21 UTC
Arvin - it looks like related to some chagnes in yast2-starage. Can you provide any information what is changed in raids between 11.1 and 11.2?
Comment 13 Arvin Schnell 2009-11-05 10:41:07 UTC
The software RAID md0 contains a (GPT) partition table. Likely simply
a leftover from one of the disks in the RAID.

The kernel in 11.1 did not have support for that. YaST in 11.2 does not
support this.

As a workaround delete the partition table on the RAID with dd.
Comment 14 Ian Cheong 2009-11-05 19:37:17 UTC
I believe the erroneous partition table was built by YaST partitioner 11.1 from clean drives after a BIOS format, on more than one occasion.

mdadm happily administers a healthy RAID1, which YaST in 11.1 and 11.2RC1 builds a new GPT partition table for. Is that what is supposed to happen? If one is not meant to use a GPT partition table on a RAID1, then that option should be unavailable.
Comment 16 Arvin Schnell 2009-11-06 09:00:39 UTC
I do not think that YaST created the partition table on md with 11.1.
Please provide YaST logs of such an event.
Comment 17 Ian Cheong 2009-11-06 10:10:05 UTC
Created attachment 325943 [details]
y2logs11.1

Being a linux/suse newb, I don't know enough to create a partition table from scratch. So I used the automated yast tools. Initially, the RAID1 was build using BIOS + dmraid via Yast. After problems, the RAID drives were BIOS formatted and software RAID1 built using mdraid via Yast.

Have certainly done a lot of poking round trying to clarify the exact problem trying to preserve what (negligible) data I had. But when nothing could be made to work manually, have resorted to BIOS format and clean partition/rebuild.

RAID1 has now been rebuilt using 11.2RC1, so I can only recover what has been logged in 11.1 prior to the rebuild.
Comment 18 Arvin Schnell 2009-11-06 16:52:51 UTC
YaST does not create a partition table in the logs from comment #17.

As I tried to explain in comment #13 the partition table was likely
already present on one of the disks.

Also the logs seem to be incomplete. Once the RAID is made out of
sdb1 and sdc1 (partitions on disks) and once out of sdb and sdc (the
whole disks). But there is also one log where YaST creates a software
RAID (on sdb1 and sdc1).

But I'm not going to analyse the problem further since as stated already
before partitioned software RAIDs are not supported by YaST on openSUSE 11.2.
Feel free to make a feature request at https://features.opensuse.org/.
Comment 19 Fred Blaise 2009-11-14 09:40:13 UTC
Resolved? That's nice. Upgrading 11.1 to 11.2 with raid setup leaves a system in an unbootable state.

I could manually mount my md devices in rescue, and see the devices (raid1 and 5)

How is that a "feature" request for those SW-raid users which will end up nuking their system? What can we do to avoid a full reinstall? Will reinstalling a fresh 11.2 with raid devices cause problems?
Comment 20 Fred Blaise 2009-11-14 09:50:03 UTC
Manually fixed the problem somehow:

- boot into rescue
- load raid1 and raid5 modules
- load dm-mod module (LVM)
- mount partitions
- mount proc, sysfs, and bind /dev mount
- chroot into root mount point
- assure that your /proc/mdstat shows all well
- regenerated initrd with raid1 (and raid5 just to be sure)
- sync to disk
- rebooted

It seems that the raid modules were not loaded on boot. Now they are. Weird.
Comment 21 Forgotten User eDStDj8Y1e 2009-11-17 01:03:46 UTC
Comment 18 is erroneous. This has been a working FEATURE. I say that because I was using it at (64bit) 10.3, 11.0, 11.1 and now 11.2. It is as was stated in Comment 20, that when you run the rescue system, THEN and ONLY THEN does the system apparently include the RAID support.

In my specific case, after getting 11.2 RC2 work work correctly (with RAID1), I was able to upgrade to 11.2 "GA" without a major glitch. 

Accordingly, I am marking this as reopened.
Comment 22 Arvin Schnell 2009-11-17 09:37:35 UTC
Apparently we disagree. But it's open source, we accept patches.
Comment 23 Arvin Schnell 2009-12-30 13:03:49 UTC
Support for partitioned software RAIDs has been added for openSUSE 11.3
in YaST. Should fix the problem.