Bugzilla – Full Text Bug Listing
| Summary: | Wicked segfaults | | |
|---|---|---|---|
| Product: | [openSUSE] openSUSE Tumbleweed | Reporter: | Sid Boyce <sboyce> |
| Component: | Network | Assignee: | Olaf Hering <ohering> |
| Status: | RESOLVED FIXED | QA Contact: | E-mail List <qa-bugs> |
| Severity: | Normal | | |
| Priority: | P5 - None | CC: | mt, ohering, sboyce |
| Version: | 13.2 Milestone 0 | | |
| Target Milestone: | --- | | |
| Hardware: | x86-64 | | |
| OS: | Other | | |
| Whiteboard: | | | |
| Found By: | --- | Services Priority: | |
| Business Priority: | | Blocker: | --- |
| Marketing QA Status: | --- | IT Deployment: | --- |
Description
Sid Boyce
2014-02-04 21:32:29 UTC
The segfault is likely in one of the string functions.
It would be nice if a core dump were written by default. Maybe we have to hack around it by calling sh -c 'ulimit -c unlimited ; $binary' in the service files?
It's probably not fixed in git, but in any case, please add this repo:
zypper ar -c -f -G -K \
http://download.opensuse.org/repositories/network:/wicked:/master/openSUSE_Factory/ \
wicked_master
zypper -v -v dup -l -r wicked_master
Please also post your config:
head /etc/sysconfig/network/ifcfg-*
Thanks.
......
Google found LimitCORE=infinity as an additional setting for the /usr/lib/systemd/system/wicked*.service files.
I think /proc/sys/kernel/core_pattern has to be adjusted to this:
echo "/tmp/core.%e.%p.%t" > /proc/sys/kernel/core_pattern

# head /etc/sysconfig/network/ifcfg-*
==> /etc/sysconfig/network/ifcfg-enp0s19f2u4u4u4 <==
BOOTPROTO='dhcp'
BROADCAST=''
ETHTOOL_OPTIONS=''
IPADDR=''
MTU=''
NAME='RT73_Wireless'
NETMASK=''
NETWORK=''
REMOTE_IPADDR=''
STARTMODE='hotplug'
==> /etc/sysconfig/network/ifcfg-eth0 <==
BOOTPROTO='static'
BROADCAST=''
ETHTOOL_OPTIONS=''
IPADDR='192.168.10.2/24'
MTU=''
NAME='RTL8111/8168 PCI Express Gigabit Ethernet controller'
NETWORK=''
REMOTE_IPADDR=''
STARTMODE='auto'
==> /etc/sysconfig/network/ifcfg-eth1 <==
BOOTPROTO='static'
BROADCAST=''
ETHTOOL_OPTIONS=''
IPADDR='192.168.2.190/24'
MTU=''
NAME='RTL8169 PCI Gigabit Ethernet Controller'
NETWORK=''
REMOTE_IPADDR=''
STARTMODE='auto'
==> /etc/sysconfig/network/ifcfg-lo <==
# Loopback (lo) configuration
IPADDR=127.0.0.1/8
NETMASK=255.0.0.0
NETWORK=127.0.0.0
BROADCAST=127.255.255.255
STARTMODE=nfsroot
BOOTPROTO=static
USERCONTROL=no
FIREWALL=no
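The core-dump settings floated earlier in this thread (LimitCORE=infinity plus a core_pattern) could be applied roughly as sketched here. This is an assumption-laden sketch, not what the wicked packages do: it uses a systemd drop-in directory instead of editing the shipped unit files, and the helper name write_core_dropin and the file name coredump.conf are made up for illustration.

```shell
# Sketch only: write a systemd drop-in that lifts the core size limit for one unit.
# write_core_dropin and coredump.conf are hypothetical names, not part of wicked.
write_core_dropin() {
    local unit=$1
    # The target root is overridable so the helper can be tried in a scratch directory.
    local root=${2:-/etc/systemd/system}
    mkdir -p "$root/$unit.d" || return 1
    printf '[Service]\nLimitCORE=infinity\n' > "$root/$unit.d/coredump.conf"
}

# Intended use on a real system, as root (deliberately not executed here):
#   write_core_dropin wickedd.service
#   systemctl daemon-reload
#   echo "/tmp/core.%e.%p.%t" > /proc/sys/kernel/core_pattern
#   systemctl restart wickedd.service
```

A drop-in keeps the change separate from the packaged unit file, so a package update would not overwrite it. The core_pattern value is the one proposed above and is not persistent across reboots.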
I noticed that ulimit is already set by the wicked service files. A /core file should be there already. Please run this command and post the output:
gdb --readnow --core=/core -ex bt -ex q

# gdb --readnow --core=/core -ex bt -ex q
GNU gdb (GDB; openSUSE Factory) 7.6.50.20130731-cvs
Copyright (C) 2013 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying" and "show warranty" for details.
This GDB was configured as "x86_64-suse-linux".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://bugs.opensuse.org/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word".
[New LWP 26487]
Missing separate debuginfo for the main executable file
Try: zypper install -C "debuginfo(build-id)=52c8bf747f016ea5488cadaf6fb0f3ac80400a67"
Core was generated by `/usr/lib/wicked/bin/wickedd-dhcp4 --systemd --foreground'.
Program terminated with signal SIGSEGV, Segmentation fault.
#0 0x00007f33b6a82cca in ?? ()
#0 0x00007f33b6a82cca in ?? ()
#1 0x00007f33b6a829ee in ?? ()
#2 0x0000000000635850 in ?? ()
#3 0x0000000000000004 in ?? ()
#4 0x0000000000000000 in ?? ()
I have not rebooted since upgrading from the new repo advised in Comment 1. I shall probably be rebooting another box with the same problems later today.

(In reply to comment #4)
> # gdb --readnow --core=/core -ex bt -ex q
> Missing separate debuginfo for the main executable file
> Try: zypper install -C
> "debuginfo(build-id)=52c8bf747f016ea5488cadaf6fb0f3ac80400a67"
> Core was generated by `/usr/lib/wicked/bin/wickedd-dhcp4 --systemd
> --foreground'.
> Program terminated with signal SIGSEGV, Segmentation fault.
> #0 0x00007f33b6a82cca in ?? ()
> #0 0x00007f33b6a82cca in ?? ()
> #1 0x00007f33b6a829ee in ?? ()
> #2 0x0000000000635850 in ?? ()
Please try either
zypper in -C "debuginfo(build-id)=52c8bf747f016ea5488cadaf6fb0f3ac80400a67"
or
zypper -v -v in {glibc,wicked}-debug{source,info}
Then run the gdb command again; it should be able to resolve the locations.

I have not rebooted since upgrading from the new repo advised in Comment 1. I shall probably be rebooting another box with the same problems later today.
# gdb --readnow --core=/core -ex bt -ex q
GNU gdb (GDB; openSUSE Factory) 7.6.50.20130731-cvs
Copyright (C) 2013 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying" and "show warranty" for details.
This GDB was configured as "x86_64-suse-linux".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://bugs.opensuse.org/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word".
[New LWP 26487]
Missing separate debuginfo for the main executable file
Try: zypper install -C "debuginfo(build-id)=52c8bf747f016ea5488cadaf6fb0f3ac80400a67"
Core was generated by `/usr/lib/wicked/bin/wickedd-dhcp4 --systemd --foreground'.
Program terminated with signal SIGSEGV, Segmentation fault.
#0 0x00007f33b6a82cca in ?? ()
#0 0x00007f33b6a82cca in ?? ()
#1 0x00007f33b6a829ee in ?? ()
#2 0x0000000000635850 in ?? ()
#3 0x0000000000000004 in ?? ()
#4 0x0000000000000000 in ?? ()
slipstream:/usr/src/QSDR_with_QT522/qsdr # l /core
-rw------- 1 root root 1171456 Feb 4 20:30 /core

So even after installing the debuginfo packages gdb cannot resolve the backtrace? That's weird. I think it's best to dup to the wicked repo and try again.
This is on a second system.
tindog:~ # gdb --readnow --core=/core -ex bt -ex q
GNU gdb (GDB; openSUSE Factory) 7.6.50.20130731-cvs
Copyright (C) 2013 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying" and "show warranty" for details.
This GDB was configured as "x86_64-suse-linux".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://bugs.opensuse.org/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word".
"/core" is not a core dump: File format not recognized
No stack.
tindog:~ # l /core
-rw------- 1 root root 1183744 Feb 6 00:17 /core
tindog:~ # systemctl status wickedd.service
wickedd.service - wicked network management service daemon
Loaded: loaded (/usr/lib/systemd/system/wickedd.service; disabled)
Active: failed (Result: start-limit) since Thu 2014-02-06 00:17:09 GMT; 12h ago
Main PID: 943 (code=dumped, signal=SEGV)
Feb 06 00:17:08 tindog systemd[1]: wickedd.service: main process exited, code=dumped, status=11/SEGV
Feb 06 00:17:08 tindog systemd[1]: Unit wickedd.service entered failed state.
Feb 06 00:17:09 tindog systemd[1]: wickedd.service holdoff time over, scheduling restart.
Feb 06 00:17:09 tindog systemd[1]: Stopping wicked network management service daemon...
Feb 06 00:17:09 tindog systemd[1]: Starting wicked network management service daemon...
Feb 06 00:17:09 tindog systemd[1]: wickedd.service start request repeated too quickly, refusing to start.
Feb 06 00:17:09 tindog systemd[1]: Failed to start wicked network management service daemon.
Feb 06 00:17:09 tindog systemd[1]: Unit wickedd.service entered failed state.
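Regarding the "File format not recognized" message in the gdb output for this second system: one quick sanity check is to look at the ELF header of /core before handing it to gdb. The following is a minimal sketch, assuming the standard ELF layout (magic bytes 7f 45 4c 46, byte order flag at offset 5, e_type at offset 16, where the value 4 means a core file). The function name is_elf_core is hypothetical, and the check only inspects the header, so it would not detect a truncated dump.

```shell
# Sketch only: return 0 if the file starts with an ELF header of type ET_CORE.
# is_elf_core is a hypothetical helper; it does not validate the rest of the file.
is_elf_core() {
    local file=$1 magic data etype
    # First four bytes must be the ELF magic: 0x7f 'E' 'L' 'F'.
    magic=$(od -An -tx1 -N4 "$file" 2>/dev/null | tr -d ' \n')
    [ "$magic" = "7f454c46" ] || return 1
    # Byte 5 (EI_DATA): 01 = little-endian, 02 = big-endian.
    data=$(od -An -tx1 -j5 -N1 "$file" | tr -d ' \n')
    # Bytes 16-17 (e_type): the value 4 is ET_CORE.
    etype=$(od -An -tx1 -j16 -N2 "$file" | tr -d ' \n')
    case "$data:$etype" in
        01:0400|02:0004) return 0 ;;
        *) return 1 ;;
    esac
}

# Example (not executed here):
#   is_elf_core /core && echo "looks like a core dump" || echo "not an ELF core"
```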
tindog:~ # systemctl status wicked.service
wicked.service - wicked managed network interfaces
Loaded: loaded (/usr/lib/systemd/system/wicked.service; disabled)
Active: failed (Result: exit-code) since Thu 2014-02-06 00:17:06 GMT; 12h ago
Main PID: 780 (code=exited, status=1/FAILURE)
Feb 06 00:17:05 tindog wicked[780]: device wlp0s22f2u1u4u3 failed: device was deleted
Feb 06 00:17:06 tindog systemd[1]: wicked.service: main process exited, code=exited, status=1/FAILURE
Feb 06 00:17:06 tindog systemd[1]: Failed to start wicked managed network interfaces.
Feb 06 00:17:06 tindog systemd[1]: Unit wicked.service entered failed state.
I could not get networking to work with "ifconfig br0 192.168.10.10.up", "route add default gw 192.168.10.103".
I had to configure the interfaces separately.
tindog:~ # ifconfig
enp3s0 Link encap:Ethernet HWaddr 50:E5:49:C8:2E:8C
inet addr:192.168.2.197 Bcast:192.168.2.255 Mask:255.255.255.0
inet6 addr: fe80::52e5:49ff:fec8:2e8c/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:41815 errors:0 dropped:0 overruns:0 frame:0
TX packets:41333 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:8085235 (7.7 Mb) TX bytes:8111543 (7.7 Mb)
enp5s6 Link encap:Ethernet HWaddr 00:1C:F0:5B:65:5A
inet addr:192.168.10.10 Bcast:192.168.10.255 Mask:255.255.255.0
inet6 addr: fe80::21c:f0ff:fe5b:655a/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:419281 errors:0 dropped:0 overruns:0 frame:0
TX packets:766261 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:51932972 (49.5 Mb) TX bytes:1052372524 (1003.6 Mb)
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:65536 Metric:1
RX packets:130 errors:0 dropped:0 overruns:0 frame:0
TX packets:130 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:14032 (13.7 Kb) TX bytes:14032 (13.7 Kb)
tindog:~ # head /etc/sysconfig/network/ifcfg-*
==> /etc/sysconfig/network/ifcfg-br0 <==
BOOTPROTO='static'
BROADCAST=''
ETHTOOL_OPTIONS=''
IPADDR='192.168.10.10/24'
MTU=''
NAME='DGE-528T Gigabit Ethernet Adapter'
NETWORK=''
REMOTE_IPADDR=''
STARTMODE='auto'
USERCONTROL='no'
==> /etc/sysconfig/network/ifcfg-enp3s0 <==
BOOTPROTO='static'
BROADCAST=''
ETHTOOL_OPTIONS=''
IPADDR='192.168.2.197/24'
MTU=''
NAME='RTL8111/8168B PCI Express Gigabit Ethernet controller'
NETWORK=''
REMOTE_IPADDR=''
STARTMODE='auto'
USERCONTROL='no'
==> /etc/sysconfig/network/ifcfg-lo <==
# Loopback (lo) configuration
IPADDR=127.0.0.1/8
NETMASK=255.0.0.0
NETWORK=127.0.0.0
BROADCAST=127.255.255.255
STARTMODE=nfsroot
BOOTPROTO=static
USERCONTROL=no
FIREWALL=no
==> /etc/sysconfig/network/ifcfg-wlp0s22f2u1u4u3 <==
BOOTPROTO='dhcp'
BROADCAST=''
ETHTOOL_OPTIONS=''
IPADDR=''
MTU=''
NAME='RT73_Wireless'
NETMASK=''
NETWORK=''
REMOTE_IPADDR=''
STARTMODE='hotplug'

Networking now automatically configured except for the wlan.
tindog:~ # ifconfig -a
enp3s0 Link encap:Ethernet HWaddr 50:E5:49:C8:2E:8C
inet addr:192.168.2.197 Bcast:192.168.2.255 Mask:255.255.255.0
inet6 addr: fe80::52e5:49ff:fec8:2e8c/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:6941 errors:0 dropped:0 overruns:0 frame:0
TX packets:6109 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:3448333 (3.2 Mb) TX bytes:4776903 (4.5 Mb)
enp5s6 Link encap:Ethernet HWaddr 00:1C:F0:5B:65:5A
inet addr:192.168.10.10 Bcast:192.168.10.255 Mask:255.255.255.0
inet6 addr: fe80::21c:f0ff:fe5b:655a/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:15474 errors:0 dropped:0 overruns:0 frame:0
TX packets:15661 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:12341193 (11.7 Mb) TX bytes:13230640 (12.6 Mb)
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:65536 Metric:1
RX packets:18 errors:0 dropped:0 overruns:0 frame:0
TX packets:18 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:1060 (1.0 Kb) TX bytes:1060 (1.0 Kb)
wlp0s22f2 Link encap:Ethernet HWaddr 00:0E:2E:F1:36:99
BROADCAST MULTICAST MTU:1500 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:0 (0.0 b) TX bytes:0 (0.0 b)
tindog:~ # ps fax|grep wicked
723 ? SLs 0:00 /usr/sbin/wickedd --systemd --foreground
750 ? SLs 0:00 /usr/sbin/wickedd-nanny --systemd --foreground
751 ? SLs 0:00 /usr/lib/wicked/bin/wickedd-auto4 --systemd --foreground
752 ? SLs 0:00 /usr/lib/wicked/bin/wickedd-dhcp6 --systemd --foreground
753 ? SLs 0:00 /usr/lib/wicked/bin/wickedd-dhcp4 --systemd --foreground
One other problem is with setting up bridge br0 using enp5s6.
The bridge is set up and configured but does not work.
route -n and /etc/resolv.conf are all good.
# cat /etc/sysconfig/ifcfg-br0
BOOTPROTO='static'
BROADCAST=''
ETHTOOL_OPTIONS=''
IPADDR='192.168.10.10/24'
MTU=''
NAME='DGE-528T Gigabit Ethernet Adapter'
NETWORK=''
REMOTE_IPADDR=''
STARTMODE='auto'
USERCONTROL='no'
BRIDGE='yes'
BRIDGE_PORTS='enp5s6'
BRIDGE_STP='on'
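Two things in ifcfg files like the ones posted in this thread can be checked mechanically: a bridge with BRIDGE_STP='on' (STP holds ports in listening/learning states for a forwarding delay before traffic passes) and a wireless config without WIRELESS_ESSID. The sketch below is only an illustration. The helper names ifcfg_get and lint_ifcfg are hypothetical, and treating any file named ifcfg-wl* as wireless is an assumption based on the interface names seen here.

```shell
# Sketch only: tiny lint for sysconfig ifcfg files. Helper names are hypothetical.

# Print the value of key $1 from ifcfg file $2 with surrounding quotes stripped.
# If the key is assigned more than once, the last assignment is used.
ifcfg_get() {
    sed -n "s/^$1=//p" "$2" 2>/dev/null | tail -n 1 | sed "s/^['\"]//; s/['\"]\$//"
}

lint_ifcfg() {
    local file=$1
    # A bridge with STP on will not forward traffic until the forwarding delay has passed.
    if [ "$(ifcfg_get BRIDGE "$file")" = "yes" ] &&
       [ "$(ifcfg_get BRIDGE_STP "$file")" = "on" ]; then
        echo "$file: BRIDGE_STP is on; expect a delay before the bridge forwards traffic"
    fi
    # Assumption: wireless interface configs are named ifcfg-wl*.
    case "${file##*/}" in
        ifcfg-wl*)
            if [ -z "$(ifcfg_get WIRELESS_ESSID "$file")" ]; then
                echo "$file: WIRELESS_ESSID is missing"
            fi
            ;;
    esac
    return 0
}

# Example (not executed here):
#   for f in /etc/sysconfig/network/ifcfg-*; do lint_ifcfg "$f"; done
```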
- Good to know the segfault is gone.
- ifcfg-br0 has STP enabled. This causes delays, per specification. Please remove it and try again. If it persists, open a separate bug.
- ifcfg-wlp0s22f2 lacks the ESSID; it is mandatory. If it persists, open a separate bug.
Thanks for testing!

A (cleaned up a bit) config which should work:
==> /etc/sysconfig/network/ifcfg-lo <==
## ALL FINE
==> /etc/sysconfig/network/ifcfg-br0 <==
NAME='Bridge 0'
USERCONTROL='no'
STARTMODE='auto'
BOOTPROTO='static'
IPADDR='192.168.10.10/24'
BRIDGE='yes'
BRIDGE_STP='off'
BRIDGE_FORWARDDELAY='0'
BRIDGE_PORTS='enp5s6'
++> /etc/sysconfig/network/ifcfg-enp5s6 <++
NAME='DGE-528T Gigabit Ethernet Adapter'
USERCONTROL='no'
STARTMODE='auto'
BOOTPROTO='static'
==> /etc/sysconfig/network/ifcfg-enp3s0 <==
NAME='RTL8111/8168B PCI Express Gigabit Ethernet controller'
USERCONTROL='no'
STARTMODE='auto'
BOOTPROTO='static'
IPADDR='192.168.2.197/24'
==> /etc/sysconfig/network/ifcfg-wlp0s22f2u1u4u3 <==
NAME='RT73_Wireless'
BOOTPROTO='dhcp'
USERCONTROL='no'
STARTMODE='hotplug'
WIRELESS_ESSID='foo bar network'
WIRELESS_MODE='Managed'
WIRELESS_AUTH_MODE='psk'
WIRELESS_WPA_PSK='secret'
If it does not work, please open a separate bug report.
I currently see 2 bugs:
1) STP='on': we have to fix / adjust the timeouts dynamically when it is used
2) WIRELESS: it starts and works for me, but "wicked ifstatus wlp0s29u1u7" reports "device-not-started"
Would you be so kind as to confirm / open bug reports for these issues?

Bug 863371 submitted.

Thanks!

I'm reopening this bug, because I see a SEGV in Bug 863371:
"[...]
tindog:/home/lancelot # systemctl status wickedd.service
wickedd.service - wicked network management service daemon
Loaded: loaded (/usr/lib/systemd/system/wickedd.service; enabled)
Active: failed (Result: start-limit) since Tue 2014-02-11 20:37:37 GMT; 2min 2s ago
Process: 997 ExecStart=/usr/sbin/wickedd $WICKED_DEBUG_PARAM --systemd --foreground (code=dumped, signal=SEGV)
Main PID: 997 (code=dumped, signal=SEGV)
CGroup: /system.slice/wickedd.service
[...]"
Would you provide a "rpm -qa | grep wicked" please? And ensure the versions of libwicked0 and wicked match? (See https://github.com/openSUSE/wicked/issues/145. We have not yet made a release that increases the library version to mark it as incompatible, so you always need an exact version match.)
zypper in -f -r wicked_master libwicked0 wicked wicked-service libwicked0-debuginfo wicked-debuginfo wicked-debugsource
systemctl restart wickedd
wicked --debug most ifup all
When a SEGV happens, please provide the backtrace output ("gdb --readnow --core=/core -ex bt -ex q" to see which binary dumped core, then again with the additional parameter "/path/to/binary").

OK, the SEGV is fixed now, see bug 863371 comment 22.
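The two-step gdb procedure requested above (first find out which binary dumped core, then rerun gdb with that binary) could be scripted roughly as follows. This is a sketch under assumptions: it relies on gdb printing a "Core was generated by" line in the form shown earlier in this thread, and the function names core_binary and backtrace_core are hypothetical.

```shell
# Sketch only: helper names are hypothetical and not part of wicked or gdb.

# Read gdb output on stdin and print the executable path taken from the
# "Core was generated by `/path/to/binary args'." line.
core_binary() {
    sed -n "s/^Core was generated by \`\([^ ']*\).*/\1/p" | head -n 1
}

# Run gdb once to learn the binary, then again with the binary for the backtrace.
backtrace_core() {
    local core=${1:-/core} bin
    bin=$(gdb --readnow --core="$core" -ex bt -ex q 2>&1 | core_binary)
    if [ -z "$bin" ]; then
        echo "could not determine which binary produced $core" >&2
        return 1
    fi
    gdb --readnow "$bin" --core="$core" -ex bt -ex q
}

# Example (not executed here):
#   backtrace_core /core
```

For the backtrace to show function names instead of "??", the matching debuginfo packages still need to be installed, as discussed earlier in the thread.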