|
Bugzilla – Full Text Bug Listing |
| Summary: | Nvidia Jetson TX2: Installer kernel crashes while decompressing initrd | ||
|---|---|---|---|
| Product: | [openSUSE] openSUSE Tumbleweed | Reporter: | Yousaf Kaukab <yousaf.kaukab> |
| Component: | Kernel | Assignee: | Yousaf Kaukab <yousaf.kaukab> |
| Status: | RESOLVED FIXED | QA Contact: | E-mail List <qa-bugs> |
| Severity: | Normal | ||
| Priority: | P5 - None | CC: | afaerber, mbrugger, ptesarik |
| Version: | Current | ||
| Target Milestone: | --- | ||
| Hardware: | aarch64 | ||
| OS: | Other | ||
| Whiteboard: | |||
| Found By: | --- | Services Priority: | |
| Business Priority: | Blocker: | --- | |
| Marketing QA Status: | --- | IT Deployment: | --- |
|
Description
Yousaf Kaukab
2019-08-09 10:43:16 UTC
CPU-feature complains about unexpected variations. This could be the root cause for the crash: [ 0.112442] CPU features: SANITY CHECK: Unexpected variation in SYS_CTR_EL0. Boot CPU: 0x0000008444c004, CPU1: 0x0000009444c004 [ 0.112451] CPU features: SANITY CHECK: Unexpected variation in SYS_ID_AA64DFR0_EL1. Boot CPU: 0x00000010305106, CPU1: 0x00000010305116 [ 0.112466] CPU features: SANITY CHECK: Unexpected variation in SYS_ID_DFR0_EL1. Boot CPU: 0x00000003010066, CPU1: 0x00000003001066 [ 0.112477] CPU features: Unsupported CPU feature variation detected. [ 0.112522] CPU1: Booted secondary processor 0x0000000000 [0x4e0f0030] [ 0.113340] Detected PIPT I-cache on CPU2 [ 0.113351] CPU features: SANITY CHECK: Unexpected variation in SYS_CTR_EL0. Boot CPU: 0x0000008444c004, CPU2: 0x0000009444c004 [ 0.113360] CPU features: SANITY CHECK: Unexpected variation in SYS_ID_AA64DFR0_EL1. Boot CPU: 0x00000010305106, CPU2: 0x00000010305116 [ 0.113372] CPU features: SANITY CHECK: Unexpected variation in SYS_ID_DFR0_EL1. Boot CPU: 0x00000003010066, CPU2: 0x00000003001066 Following patch under discussion upstream: https://lkml.org/lkml/2019/9/29/451 (In reply to Mian Yousaf Kaukab from comment #2) > Following patch under discussion upstream: > https://lkml.org/lkml/2019/9/29/451 One suggestion in the above thread was to use no-memory-wc, however it results in other issues with bpmp driver. e.g a crash like following: [ 7.304592] Unable to handle kernel paging request at virtual address ffff800012ace08c [ 7.312835] Mem abort info: [ 7.315735] ESR = 0x96000061 [ 7.318912] EC = 0x25: DABT (current EL), IL = 32 bits [ 7.324442] SET = 0, FnV = 0 [ 7.327613] EA = 0, S1PTW = 0 [ 7.330877] Data abort info: [ 7.333868] ISV = 0, ISS = 0x00000061 [ 7.337852] CM = 0, WnR = 1 [ 7.340935] swapper pgtable: 4k pages, 48-bit VAs, pgdp=00000000f58a2000 [ 7.347909] [ffff800012ace08c] pgd=00000002771ff003, p4d=00000002771ff003, pud=00000002771fe003, pmd=000000026bf8a003, pte=006800003004ef07 [ 7.360961] Internal error: Oops: 96000061 [#1] PREEMPT SMP [ 7.366758] Modules linked in: [ 7.369936] CPU: 0 PID: 1 Comm: swapper/0 Tainted: G S 5.7.0-rc4-next-20200505 #1 [ 7.379088] Hardware name: nvidia p2771-0000/p2771-0000, BIOS 2020.04-rc3 03/25/2020 [ 7.387147] pstate: 00000085 (nzcv daIf -PAN -UAO BTYPE=--) [ 7.392950] pc : __memcpy+0x48/0x180 [ 7.396670] lr : __tegra_bpmp_channel_write+0x68/0x70 [ 7.401921] sp : ffff80001003ba10 [ 7.405365] x29: ffff80001003ba10 x28: ffff0001eb1ebf38 [ 7.410897] x27: 0000000000000000 x26: 0000000000000013 [ 7.416425] x25: ffff80001003baf4 x24: 0000000000000014 [ 7.421952] x23: 0000000000000001 x22: ffff0001eb1ebe80 [ 7.427481] x21: ffff80001003bb08 x20: ffff0001eb1ebf10 [ 7.433000] x19: ffff0001e961a080 x18: 00000000000000c0 [ 7.438520] x17: 0000000000000000 x16: 00000000f771c478 [ 7.444046] x15: 00002361a9ba9a10 x14: ffff800011ded000 [ 7.449571] x13: ffff800015842000 x12: ffff800015801000 [ 7.455093] x11: 0140000000000000 x10: 0000000000000041 [ 7.460620] x9 : 0000000000000003 x8 : 00000000fffc0000 [ 7.466139] x7 : 0000000000040000 x6 : ffff800012ace08c [ 7.471663] x5 : 0000000000000000 x4 : 000000000000000c [ 7.477190] x3 : 00040000fffc0000 x2 : 0000000000000008 [ 7.482717] x1 : ffff80001003bb00 x0 : ffff800012ace088 [ 7.488243] Call trace: [ 7.490781] __memcpy+0x48/0x180 [ 7.494134] tegra_bpmp_transfer+0x128/0x370 [ 7.498572] tegra_bpmp_init_debugfs+0xe8/0x1a8 [ 7.503280] tegra_bpmp_probe+0x2fc/0x320 [ 7.507455] platform_drv_probe+0x50/0xa0 [ 7.511623] really_probe+0xd4/0x318 [ 7.515337] driver_probe_device+0x54/0xb0 [ 7.519599] device_driver_attach+0x6c/0x78 [ 7.523949] __driver_attach+0x54/0xd0 [ 7.527847] bus_for_each_dev+0x6c/0xc0 [ 7.531837] driver_attach+0x20/0x28 [ 7.535552] bus_add_driver+0x140/0x1e8 [ 7.539543] driver_register+0x60/0x110 [ 7.543537] __platform_driver_register+0x44/0x50 [ 7.548437] tegra_bpmp_driver_init+0x18/0x20 [ 7.552972] do_one_initcall+0x50/0x190 [ 7.556964] kernel_init_freeable+0x1cc/0x23c [ 7.561502] kernel_init+0x10/0x100 [ 7.565128] ret_from_fork+0x10/0x18 [ 7.568845] Code: b8404423 b80044c3 36180064 f8408423 (f80084c3) [ 7.575193] ---[ end trace 7b94fc879e9d6b70 ]--- [ 7.579994] note: swapper/0[1] exited with preempt_count 1 [ 7.585708] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b [ 7.593683] SMP: stopping secondary CPUs [ 7.597774] Kernel Offset: 0xc0000 from 0xffff800010000000 [ 7.603481] PHYS_OFFSET: 0x80000000 [ 7.607106] CPU features: 0x490002,21806000 [ 7.611456] Memory Limit: none [ 7.614635] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b ]--- Started another discussion thread: https://lkml.org/lkml/2020/5/8/278 Fixed by following commit upstream (v5.15): fec29bf04994 misc: sram: Only map reserved areas in Tegra SYSRAM |