stan via users composed on 2022-10-26 20:07 (UTC-0700):
On Wed, 26 Oct 2022 14:06:04 -0400 Felix Miata wrote:
> # diff -u kcfg51819.txt kcfg51917.txt
> --- kcfg51819.txt 2022-10-26 13:45:37.000000000 -0400
> +++ kcfg51917.txt 2022-10-26 13:45:58.000000000 -0400
> @@ -3,6 +3,7 @@
> CONFIG_X86_MCE_AMD=y
> CONFIG_PERF_EVENTS_AMD_POWER=m
> CONFIG_PERF_EVENTS_AMD_UNCORE=y
> +CONFIG_PERF_EVENTS_AMD_BRS=y
> CONFIG_MICROCODE_AMD=y
> CONFIG_AMD_MEM_ENCRYPT=y
> # CONFIG_AMD_MEM_ENCRYPT_ACTIVE_BY_DEFAULT is not set
> BRS is "AMD Branch Sampling support". My Google-fu
hasn't found a way
> to disable it as yet, but maxcpus=1 works.
Dealing with this extra config solved the issue? Or just prevented
another issue?
I simply tried something that seemed would obviate BRS, but disabling cores is
a performance killer, so only done as test.
Maybe you can open a ticket asking that this be a module instead of
built in, if it is possible, in the fedora kernels.
I copied 5.19.17-300.fc37.x86_64 from the F37 installation to the F36 installation:
# ls -Gg /boot/ini*19.17*
-rw------- 1 19015128 Oct 25 19:11 /boot/initramfs-5.19.17-200.fc36.x86_64.img
-rw------- 1 34070416 Oct 25 23:02 /boot/initramfs-5.19.17-300.fc37.x86_64.img
#
The F37 version boots normally. Both installations have identical /etc/dracut.conf.d/*,
so I have no idea why the F37 is so much heavier, unless there's extra stuff in it
because F37 is still not yet released.
# rpm -qa | grep mwar
amd-gpu-firmware-20221012-141.fc36.noarch
linux-firmware-20221012-141.fc36.noarch
linux-firmware-whence-20221012-141.fc36.noarch
# rpm -qa | grep mwar
linux-firmware-whence-20221012-142.fc37.noarch
linux-firmware-20221012-142.fc37.noarch
# rpm -e amd-gpu-firmware
error: Failed dependencies:
amd-gpu-firmware is needed by (installed) linux-firmware-20221012-141.fc36.noarch
# rpm -e --nodeps amd-gpu-firmware
# <reboot using F36 kernel>
black
# <reboot using F37 kernel>
succeeds
# dnf install amd-gpu-firmware
# echo 'blacklist radeon' > /etc/modprobe.d/0radeon.conf
# <reboot using F36 kernel>
black
<remove blacklist,reboot with F36 kernel>
black
# lsmod | sort | grep -E 'vid|eon|amd'
amdgpu 8544256 0
drm_display_helper 172032 2 amdgpu,radeon
drm_ttm_helper 16384 2 amdgpu,radeon
edac_mce_amd 40960 0
gpu_sched 49152 1 amdgpu
iommu_v2 24576 1 amdgpu
radeon 1654784 0
ttm 90112 3 amdgpu,radeon,drm_ttm_helper
video 61440 0
#
So, amdgpu module loads, but:
# grep \(EE /var/log/Xorg.0.log
...
[ 319.021] (EE) open /dev/dri/card0: No such file or directory
...
# dmesg | grep -i amd | grep aile
[ 4.771180] amdgpu 0000:00:01.0: Direct firmware load for amdgpu/kaveri_pfp.bin failed
with error -2
[ 4.771187] amdgpu: gfx7: Failed to load firmware "amdgpu/kaveri_pfp.bin"
[ 4.771189] [drm:gfx_v7_0_sw_init.cold [amdgpu]] *ERROR* Failed to load gfx firmware!
[ 4.771559] [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* sw_init of IP block
<gfx_v7_0> failed -2
[ 4.771868] amdgpu 0000:00:01.0: amdgpu: amdgpu_device_ip_init failed
[ 4.772081] amdgpu: probe of 0000:00:01.0 failed with error -2
#
What to do next? Bugzilla?
--
Evolution as taught in public schools is, like religion,
based on faith, not based on science.
Team OS/2 ** Reg. Linux User #211409 ** a11y rocks!
Felix Miata