Bug 821877

Summary: unzip can't detect Chinese charset due to missing rcc-runtime in librcc0
Product: [openSUSE] openSUSE 12.3
Reporter: Marguerite Su <i>
Component: Basesystem
Assignee: E-mail List <bnc-team-screening>
Status: RESOLVED FIXED
QA Contact: E-mail List <qa-bugs>
Severity: Major
Priority: P3 - Medium
CC: lazy.kent
Version: Final
Target Milestone: Final
Hardware: All
OS: openSUSE 12.3
Whiteboard:
Found By: Community User
Services Priority:
Business Priority:
Blocker: No
Marketing QA Status: ---
IT Deployment: ---
Attachments: zip with CP936

Description Marguerite Su 2013-05-27 15:51:02 UTC
Hi, 

Some time ago, I wanted to fix the "unrecognizable characters with Chinese encodings" problem in unzip.

So I adapted the iconv patch for unzip from Arch Linux and made a package.

patch: https://aur.archlinux.org/packages/unzip-iconv/?setlang=zh_CN

After some testing, I found that my adapted patch was overridden by our own librcc patch. Both patches can be applied to the source, but librcc then takes over everything related to codepages and the iconv code never gets a chance to run. This means we have already done something to resolve that problem.

But that librcc patch doesn't do anything for Chinese (and I can't apply my patch):

To reproduce, use the zip file I uploaded and try to unzip it.
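To illustrate the kind of garbling involved, here is a minimal Python sketch. It assumes that the archive stores file names as raw CP936 bytes and that the extracting tool falls back to CP437; the file name is a made-up example, not the content of the attachment, and the sketch does not use unzip or librcc at all.

```python
# Hypothetical file name, used only for illustration.
name = "中文.txt"

# Bytes as an archiver writing CP936 file names would store them (assumption).
raw = name.encode("cp936")

# What a tool that assumes CP437 would display for those same bytes.
garbled = raw.decode("cp437")

# The garbling is reversible once the real encoding is known.
recovered = garbled.encode("cp437").decode("cp936")

print(garbled)
print(recovered)
```

The round trip works because CP437 maps every byte value to some character, so no information is lost; the hard part is knowing (or detecting) that the original encoding was CP936.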
 
I tested it on 12.2 and 12.3 with unzip RPMs built from the Archiving repo (librcc version 0.2.10).

I built the source into a deb on Ubuntu 12.04, and it failed (librcc version 0.2.9).

But some Arch users told me it displays correctly on their systems with the same source (librcc version 0.2.6).

So the things we can do are:

Fix the patch itself or librcc (we have to confirm with librcc upstream whether CJK is included in the "many languages" mentioned in our librcc description).

If librcc can't do it, drop the patch and find another one that supports both CJK and European encodings well, because we can't fulfill some users' needs by compromising others'.

Actually, community users have found one: libnatspec, from ALT Linux. It seems to be a successor of the iconv patch (the iconv patch is named alt-iconv-*.patch), and it seems to be the best one available on the Internet.

I don't have the required knowledge to debug any deeper, so I need a developer to take a look at our librcc.

Greetings

Marguerite
Comment 1 Marguerite Su 2013-05-27 15:52:16 UTC
Created attachment 541403 [details]
zip with CP936

Try to unzip this.
Comment 2 Marguerite Su 2013-06-24 14:42:15 UTC
I got it: our librcc doesn't have libguess enabled, which is used for CJK charset detection according to the RusXMMS project page.
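For readers unfamiliar with charset detection, the general idea can be sketched as a naive trial-decoding heuristic in Python. This is an assumption-laden illustration only: it is not how libguess or librcc work internally, and `guess_encoding` and its candidate list are hypothetical names.

```python
def guess_encoding(raw, candidates=("utf-8", "gbk")):
    """Return the first candidate encoding that decodes raw without errors.

    Naive heuristic for illustration; real detectors use statistical
    or state-machine analysis rather than simple trial decoding.
    """
    for encoding in candidates:
        try:
            raw.decode(encoding)  # strict decoding raises on invalid bytes
        except UnicodeDecodeError:
            continue
        return encoding
    return None  # no candidate matched


# Hypothetical sample: a Chinese file name stored as GBK/CP936 bytes.
sample = "中文".encode("gbk")
print(guess_encoding(sample))
```

The order of candidates matters: strict UTF-8 rejects most legacy byte sequences, so it is tried first, while permissive single-byte encodings would accept almost anything and are left out here.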
Comment 3 Marguerite Su 2013-06-24 19:04:03 UTC
Fixed in SR180792.
Comment 4 Bernhard Wiedemann 2013-06-25 09:00:17 UTC
This is an autogenerated message for OBS integration:
This bug (821877) was mentioned in
https://build.opensuse.org/request/show/180887 Factory / librcc
Comment 5 Swamp Workflow Management 2013-07-03 10:04:40 UTC
openSUSE-RU-2013:1128-1: An update that has one recommended fix can now be installed.

Category: recommended (important)
Bug References: 821877
CVE References: 
Sources used:
openSUSE 12.3 (src):    librcc-0.2.10-2.4.1
openSUSE 12.2 (src):    librcc-0.2.9-2.4.1
Comment 6 Kyrill Detinov 2013-07-16 12:05:24 UTC
Marguerite, libguess still isn't present in Factory. sr#180887 was declined.
Comment 7 Marguerite Su 2013-07-16 13:48:07 UTC
Hi, Kent,

Thanks for the information. I'll contact libguess maintainer and push libguess to Factory.
Comment 8 Kyrill Detinov 2013-07-16 16:15:16 UTC
Marguerite, let me know when this is done. Then I'll submit the updated librcc back to Factory.