High CPU temperatures on multiple XCP-ng hosts (identical hardware, low load)
-
We're running a small cluster of physical servers on XCP-ng 8.3, all with identical hardware:
Model: Lenovo ThinkSystem ST250 V2
CPU: Intel Xeon E-2356G
Each host runs only 2 VMs: one small VM for XOA, and one production VM with very low CPU usage (below 20%).
The servers report consistently high CPU temperatures (~82–83 °C) even when idle. They are deployed at different client sites, not in the same facility.
What we've checked
Very low CPU load on all hosts (load average ~0.1).
No heavy processes in top or htop.
All report similar ambient temperature (~22 °C via IPMI sensor).
ipmitool sensor data:
Metric | Affected servers
CPU Temp | 82–83 °C
CPU Power | 80 W
Sys Power | 140–160 W
Fan 2 RPM | ~950–1275 RPM
All servers are on the same versions:
BIOS: TQE112D-3.10
BMC Firmware: 3.10
What we're looking for
Has anyone seen similar CPU thermal behavior on XCP-ng with Intel CPUs?
Can dynamic frequency scaling be enabled on XCP-ng reliably?
Should we be passing boot parameters like intel_pstate=enable or loading specific modules?
-
@sotero This may seem odd, but are the CPU fans actually working? I know there are hundreds of people who are using Intel CPUs (likely the same exact model) that aren't experiencing the same issue.
My initial thought is that these servers have some kind of hardware or firmware issue where the CPU fan simply isn't working.
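One way to check this from the BMC side is to read fan RPM next to CPU temperature in the `ipmitool sensor` output. Here is a minimal Python sketch, assuming the usual pipe-separated layout (name | reading | unit | status | ...). The sensor names and the sample text are hypothetical, built from the figures posted above rather than from a real capture, and the 80 °C limit is an arbitrary choice.

```python
def parse_sensors(text):
    """Parse pipe-separated `ipmitool sensor` text into {name: (value, unit)}.

    Rows without a numeric reading (e.g. "na" or discrete sensors) are skipped.
    """
    readings = {}
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 3:
            continue
        name, value, unit = fields[0], fields[1], fields[2]
        try:
            readings[name] = (float(value), unit)
        except ValueError:
            continue
    return readings


def fans_and_hot_temps(readings, temp_limit_c=80.0):
    """Return (all fan readings in RPM, temperature readings at or above the limit)."""
    fans = {n: v for n, (v, u) in readings.items() if u == "RPM"}
    hot = {
        n: v
        for n, (v, u) in readings.items()
        if u == "degrees C" and v >= temp_limit_c
    }
    return fans, hot


# Hypothetical sample; sensor names on real hardware will differ.
SAMPLE = """\
CPU Temp         | 82.000     | degrees C  | ok
Ambient Temp     | 22.000     | degrees C  | ok
Fan 2 Tach       | 950.000    | RPM        | ok
Fan 3 Tach       | na         | RPM        | na
"""

fans, hot = fans_and_hot_temps(parse_sensors(SAMPLE))
print("fans:", fans)
print("hot:", hot)
```

A fan that reports a plausible RPM while the CPU sits above the limit would point away from a dead fan and toward power management or the BMC fan curve.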
-
@DustinB Hi,
I believe the fans are working correctly in all cases:
Do you know if any firmware issue or a compatibility problem between Lenovo hardware and XCP-ng could explain this?
-
@sotero I'm not aware of any particular issue offhand. It seems like the hardware is operating as expected...
Were all of these units shipped from the manufacturer "ready to go" aside from having XCP-ng installed?
-
@DustinB Thanks for your reply!
Yes, all of these Lenovo ThinkSystem ST250 V2 servers were delivered preassembled from the manufacturer with identical hardware configurations. The only customization done was the installation of XCP-ng 8.3, and the deployment of the same two VMs on each host.
We didn't modify any BIOS settings manually after delivery. What worries us is that we don't see this problem on the same Lenovo ThinkSystem ST250 V2 hardware where we have installed Windows Server.
If you have any suggestions on how to verify or normalize power management behavior across identical units, that would be really helpful!
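For the verification part, one possible approach is to capture the hypervisor's power management state on each host and diff the captures. The Python sketch below assumes `xenpm` is available in dom0 and is run as root; on Xen-based hosts, frequency scaling and C-states are generally managed by the hypervisor rather than by dom0 kernel parameters, so `xenpm get-cpufreq-para` and `xenpm get-cpuidle-states` are the read-only queries used here. The function names and host labels are made up for illustration.

```python
import difflib
import subprocess

# Read-only xenpm queries; assumed to be run as root in dom0.
COMMANDS = [
    ["xenpm", "get-cpufreq-para"],
    ["xenpm", "get-cpuidle-states"],
]


def capture(commands=COMMANDS):
    """Run each command and return all output as one text blob."""
    parts = []
    for cmd in commands:
        result = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            universal_newlines=True,
        )
        parts.append("$ " + " ".join(cmd))
        parts.append(result.stdout.rstrip())
    return "\n".join(parts) + "\n"


def diff_captures(a, b, name_a="host-a", name_b="host-b"):
    """Return unified-diff lines between two captures (empty list if identical)."""
    return list(
        difflib.unified_diff(
            a.splitlines(),
            b.splitlines(),
            fromfile=name_a,
            tofile=name_b,
            lineterm="",
        )
    )
```

Typical use would be to save `capture()` to a file on each host, copy the files to one machine, and pass their contents to `diff_captures`. Live counters such as residency times and current frequency will always differ, so the lines worth reading are the governor, the frequency limits, and the list of available C-states.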