High Fan Speed Issue on Lenovo ThinkSystem Servers
-
I had some time to test today so I upgraded the FW on a server, disabled IPMI as suggested above in the kernel and the fans remained spun up.
I disabled ACPI in grub (acpi=off) and the fans didn't spin up but then dom0 failed to fully load so that isn't great lol
Is it just a matter of the ACPI kernel driver being outdated? I'm not sure how to check that
-
@rmaclachlan If it's an ACPI issue and since Lenovo doesn't seem to be very cooperative, one could try to downgrade the firmware to a working version (i.e. one that runs fans at normal speed) and dump the ACPI table. Then upgrade to the latest firmware, dump the ACPI tables again, and then compare them.
I don't know ACPI much but I can have a look if you can share them.
The ACPI tools should be already installed on any XCP-ng host.
- Dump the ACPI tables in binary format
Do so in an empty folder as this produces numerous files
# acpidump -b
- Decompile the
dsdt.dat
file
# iasl -e ssdt*.dat -d dsdt.dat
- Do the same operations for both firmwares and share the
dsdt.dsl
files.
The files are pretty big so don't hesitate to compress them before sharing.
- Dump the ACPI tables in binary format
-
-
@rmaclachlan Thanks for the files. Did not see anything obvious at first sight.
I forgot to ask you for ssdt files too. Would it be possible to do the same with these files ?
iasl -d ssdt*.dat
(I hope you kept the old firmware ones somewhere, otherwise don't bother to downgrade again. Just share the new firmware ssdt files)
-
@ThierryEscande I kept all the files from the acpidump from both new and old fw. I've ran that on both sets of acpi dumps which produced quite a few dsl files (one per ssdt) so I've just zipped both folders for you here:
-
@rmaclachlan Thanks a lot. Unfortunately I did not find any evidence of what could be wrong from the ACPI tables.
It obviously does not come from the IPMI devices as there is no modification in this area.
So without help from Lenovo it will be difficult for us to go further. If you manage to get Lenovo involved one way or another we will be happy to collaborate and help.
-
@ThierryEscande Has anyone made any progress on this? @Riven you got contact details at Lenovo for contacting regarding this?
-
@LennertvdBerg no update on that, sorry. As said before, it will be hard to tell what's going on without feedback from Lenovo.
-
@LennertvdBerg
We already tried getting into contact with Lenovo a while ago. But like I already stated, they weren't able to escalate the ticket because of the unsupported OS. That's the same response that @Riven got.Maybe you could drop Lenovo a ticket as well and point them to this thread. Let's see if it helps if more people report this issue. Otherwise we seem to be pretty much out of Luck.
We have one of our two servers now in production running the old UEFI, Sound Level is not great, but bearable.
Still definitely far from an optimal solution. -
@RIX_IT I've dropped today a ticket as well, hoping them to realise it would be beneficial for all parties if they could help solving this.