Posts made by tuxen
-
RE: Dedicated CPU topology
@fred974 Yep, see the docs about NUMA/core affinity (soft/hard pinning):
-
RE: error -104
@ptunstall when the GPU was pushed back to dom0, did you also remove the PCI address from the VM config?
What's the output of:
xe vm-param-get uuid=<...> param-name=other-config
?
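If the output is long, a small parser can make it easier to spot a leftover passthrough entry. This is only a sketch: it assumes xe prints the map as "key: value; key: value" pairs and that the passthrough entry lives under a key named pci. The sample string and the PCI address in it are made up for illustration.

```python
def parse_other_config(raw: str) -> dict:
    """Parse 'key: value; key: value' text into a dict (assumed xe map format)."""
    result = {}
    for item in raw.split(";"):
        item = item.strip()
        if not item:
            continue
        # Split on the first ': ' only, so PCI addresses keep their colons.
        key, sep, value = item.partition(": ")
        if sep:
            result[key.strip()] = value.strip()
    return result


# Made-up sample, not taken from a real host.
sample = "pci: 0/0000:08:00.0; base_template_name: Other install media"
config = parse_other_config(sample)
print(config.get("pci", "no pci key set"))
```

If a pci key still shows up after the GPU was given back to dom0, that would be the entry to remove.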
-
RE: Proper way to set default CPU Governor?
@sluflyer06 In order to persist across reboots, you must set the cpufreq boot option. There's no need to rebuild grub because the change happens at the Xen level (instead of dom0):
/opt/xensource/libexec/xen-cmdline --set-xen cpufreq=xen:ondemand
After that, change the System power profile to Performance Per Watt (OS) in BIOS.
Verifying the config:
Check if the attribute current_governor is set to ondemand:
xenpm get-cpufreq-para
Check the clock scaling:
xenpm start 1|grep "Avg freq"
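On a host with many cores, checking every current_governor line by eye gets tedious. Here is a rough sketch that collects the governor per CPU from the text output. It assumes the output has "cpu id : N" and "current_governor : NAME" lines; the sample text is a trimmed, made-up illustration, so adjust the patterns to what your host actually prints.

```python
import re


def governors(xenpm_output: str) -> dict:
    """Map cpu id -> current_governor from `xenpm get-cpufreq-para` text."""
    result = {}
    cpu = None
    for line in xenpm_output.splitlines():
        m = re.match(r"\s*cpu id\s*:\s*(\d+)", line)
        if m:
            cpu = int(m.group(1))
            continue
        m = re.match(r"\s*current_governor\s*:\s*(\S+)", line)
        if m and cpu is not None:
            result[cpu] = m.group(1)
    return result


# Made-up, trimmed sample (assumed layout, not copied from a real host).
sample = """\
cpu id               : 0
current_governor     : ondemand
cpu id               : 1
current_governor     : performance
"""

found = governors(sample)
# List the CPUs that are not on the expected governor.
print([cpu for cpu, gov in found.items() if gov != "ondemand"])
```

On a real host you would pass in the saved output of xenpm get-cpufreq-para instead of the sample.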
-
RE: HPC with 2x64core (256 threads) possible with XCP-ng?
@Forza Take a look:
https://xcp-ng.org/forum/post/49400
At the time of this topic, I remember asking a coworker to boot a CentOS 7.9 VM with more than 64 vCPUs on a 48C/96T Xeon server. The VM started normally, but it didn't recognize the vCPUs beyond 64.
I haven't tested the VM param platform:acpi=0 as a possible solution, nor its trade-offs. In the past, some old RHEL 5.x VMs without ACPI support would simply power off (like pulling the power cord) instead of doing a clean shutdown on a vm-shutdown command.
Regarding that CFD software, does it support a worker/farm design? vGPU offload? I'm not an HPC expert, but considering the EPYC MCM architecture, instead of one big VM, spreading the workload across many workers pinned to each CCD (or to each NUMA node in an NPS4 config) may be interesting.
Before buying those monsters, I would ask AMD to deploy a PoC using the target server model. For such demands, it's very important to do some sort of certification/validation.
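To give an idea of the worker-per-node layout, here is a hedged sketch that splits the host CPUs into one group per NUMA node and prints a pinning command per worker VM using the VCPUs-params:mask parameter. It assumes CPUs are numbered contiguously per node, which is not guaranteed; check the real layout with xenpm get-cpu-topology or xl info -n first. The UUID placeholders and the 16-CPU example are hypothetical.

```python
def node_cpu_groups(total_cpus: int, nodes: int) -> list:
    """Split CPU ids 0..total_cpus-1 into equal contiguous groups, one per node.

    Assumption: contiguous numbering per node (verify on the real host).
    """
    if nodes <= 0 or total_cpus % nodes != 0:
        raise ValueError("total_cpus must divide evenly across nodes")
    per_node = total_cpus // nodes
    return [list(range(n * per_node, (n + 1) * per_node)) for n in range(nodes)]


def pin_command(vm_uuid: str, cpus: list) -> str:
    """Build the xe command that pins a VM's vCPUs to the given host CPUs."""
    mask = ",".join(str(c) for c in cpus)
    return f"xe vm-param-set uuid={vm_uuid} VCPUs-params:mask={mask}"


# Hypothetical small example: 16 host CPUs, 4 nodes, one worker VM per node.
for index, cpus in enumerate(node_cpu_groups(16, 4), start=1):
    print(pin_command(f"<worker-{index} UUID>", cpus))
```

This only prints the commands, so the grouping can be reviewed before anything is applied.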
-
RE: Accedentally set up a pool on an xcp-ng server
It could be. From a user's point of view, a single-host pool wouldn't make any sense, so they created the "implicit/explicit" concept and treated everything as a pool internally.
-
RE: Accedentally set up a pool on an xcp-ng server
That's a question for the Citrix dev team
-
RE: Accedentally set up a pool on an xcp-ng server
Just FYI guys, XenCenter/XCP-ng Center have the menu option Pool > Make into standalone server. As pointed out by other members, every standalone host is in a pool, but that option reverts to an "implicit" one.
Hope this helps.
-
RE: XCP 8.2 VCPUs-max settings
@jeff In order to create a virtual NUMA topology and expose it to the guest, the vNUMA feature needs to be implemented at hypervisor level and accessible through XAPI. I'm not sure if that feature is fully supported at the moment. Maybe @olivierlambert can confirm this?
You could try adding the cores-per-socket attribute following the physical NUMA topology (96 / 4 nodes = 24):
xe vm-param-set platform:cores-per-socket=24 uuid=<VM UUID>
Let me know if it works.
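For other VM sizes, the same arithmetic can be sketched as below. The helper name is made up for illustration, and it simply refuses vCPU counts that don't divide evenly across the nodes.

```python
def cores_per_socket(vcpus: int, numa_nodes: int) -> int:
    """Return vCPUs per virtual socket so that sockets line up with NUMA nodes."""
    if numa_nodes <= 0 or vcpus % numa_nodes != 0:
        raise ValueError("vcpus must divide evenly across numa_nodes")
    return vcpus // numa_nodes


value = cores_per_socket(96, 4)
print(value)  # 24
print(f"xe vm-param-set platform:cores-per-socket={value} uuid=<VM UUID>")
```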
-
RE: Centos 8 is EOL in 2021, what will xcp-ng do?
@indyj said in Centos 8 is EOL in 2021, what will xcp-ng do?:
@jefftee I prefer Alpine Linux.
+1
Low resource footprint, no bloatware... They even have a pre-built Xen Hypervisor ISO flavor
-
RE: VDI_IO_ERROR(Device I/O errors) when you run scheduled backup
This got my attention:
Jan 15 19:17:40 xcp-ng-xen12-lon2 xapi: [error||623653 INET :::80||import] Caught exception in import handler: VDI_IO_ERROR: [ Device I/O errors ]
Jan 15 19:17:40 xcp-ng-xen12-lon2 xapi: [error||623653 INET :::80||backtrace] VDI.import D:378e6880299b failed with exception Unix.Unix_error(Unix.EPIPE, "single_write", "")
Jan 15 19:17:40 xcp-ng-xen12-lon2 xapi: [error||623653 INET :::80||backtrace] Raised Unix.Unix_error(Unix.EPIPE, "single_write", "")
This Unix.EPIPE error on the remote target means that the pipe stream is being closed before VDI.import receives all the data. The outcome is a VDI I/O error due to a broken, partially sent/received VDI.
Since a link over the internet can be more prone to latency/intermittency issues, you may need to adjust the remote NFS soft timeout/retries or mount the target with the hard option.
I would also check whether the remote target is running out of space during the backup process.
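For the free-space check, something like the sketch below could be run on the machine that has the backup target mounted, before and during a backup. The path and the required size are placeholders I made up; point it at the real mount point and use the size of the VDI being exported.

```python
import shutil


def has_room(path: str, needed_bytes: int) -> bool:
    """Return True if the filesystem holding `path` has at least needed_bytes free."""
    usage = shutil.disk_usage(path)
    return usage.free >= needed_bytes


# Placeholder values: "." stands in for the backup mount point,
# and 50 GiB stands in for the size of the VDI being sent.
backup_path = "."
needed = 50 * 1024**3
print("enough space" if has_room(backup_path, needed) else "target may run out of space")
```

For the timeout side, the NFS mount options to look at are hard (or soft), timeo and retrans.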
-
RE: XCP-ng 8.1 host loses network when running gateway/firewall VMs
Could the fcoe driver be causing the issue?
dmesg:
[ 42.363389] bnx2fc: QLogic FCoE Driver bnx2fc v2.12.5 (November 16, 2018)
[ 42.371336] bnx2fc: FCoE initialized for eth1.
[ 42.371641] bnx2fc: [04]: FCOE_INIT passed
[ 42.387017] bnx2fc: FCoE initialized for eth0.
[ 42.387305] bnx2fc: [04]: FCOE_INIT passed
lsmod:
fcoe 32768 0
libfcoe 77824 2 fcoe,bnx2fc
libfc 147456 3 fcoe,bnx2fc,libfcoe
scsi_transport_fc 69632 3 fcoe,libfc,bnx2fc
-
RE: No free virtual function found vGPU S7150
@erfant probably not because the nvme driver is loaded and there're no nvme errors in the logs.
@olivierlambert thank you and your team for this great project and community! It's a nice place to share knowledge and learn new stuff. I learn a lot here!
-
RE: No free virtual function found vGPU S7150
@erfant after seeing your uploaded dmesg, the boot options from steps 2 & 3 can be put aside for a while because the error isn't the same as in the other topics.
The log is showing MxGPU driver probe/initialization errors. After some digging, it could be a case of the GPU firmware being incompatible with UEFI. Do you have a spare server for testing an XCP-ng boot in legacy/BIOS mode with this GPU?
[ 119.418930] gim error:(gim_probe:123) gim_probe(08:00.0)
[ 121.145663] gim error:(wait_cmd_complete:2387) wait_cmd_complete -- time out after 0.003044131 sec
[ 121.145719] gim error:(wait_cmd_complete:2390) Cmd = 0x17, Status = 0x0, cmd_Complete=0
[ 121.145984] gim error:(init_register_init_state:4643) Failed to INIT PF for initial register 'init-state'
Edited for clarification.
-
RE: No free virtual function found vGPU S7150
@erfant could you upload lspci -k and dmesg? Also, have you checked/tried:
- SR-IOV enabled in BIOS;
- On vmlinuz entry, boot with pci=realloc;
- On vmlinuz entry, boot with pci=realloc pci=assign-busses
-
RE: Strange issue with booting XCP-NG
@Appollonius said in Strange issue with booting XCP-NG:
Its only when I install the GPU and dont connect it to a monitor that it will not boot properly.
Maybe because, when there's a GPU installed but no monitor attached, the motherboard POST fails at the EDID probe? As stated, some boards/BIOSes require an explicit configuration in order to boot without a monitor/keyboard/mouse plugged in, e.g.:
https://www.supermicro.com/support/faqs/faq.cfm?faq=11902
-
RE: Strange issue with booting XCP-NG
@Appollonius check the BIOS for display emulation or ACPI configuration. Some boards may refuse to boot if the BIOS isn't properly configured to allow headless mode.
-
RE: Firepro S7150x2 SR-IOV Errors
It's from the terminal/CLI. Alternatively, you can verify/change the boot options in /boot/grub/grub.cfg (for dom0 boot, see the module2 /boot/vmlinuz entries).
Found this Citrix KB adding one more pci option, take a look:
https://support.citrix.com/article/CTX250121
-
RE: Firepro S7150x2 SR-IOV Errors
@tbluml did you try the pci=realloc workaround, as stated in the RHEL link?
# /opt/xensource/libexec/xen-cmdline --set-dom0 pci=realloc
Edit: reboot the host after applying the change.
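After the reboot, one way to confirm the option was picked up is to look at the dom0 kernel command line. A minimal sketch, assuming the option appears as a separate word in /proc/cmdline; the sample string below is made up.

```python
def has_boot_option(cmdline: str, option: str) -> bool:
    """Return True if `option` appears as a whole word in the kernel command line."""
    return option in cmdline.split()


# Made-up sample; on the host, read the real one with:
#   cmdline = open("/proc/cmdline").read()
sample = "root=LABEL=root-example ro console=tty0 pci=realloc"
print(has_boot_option(sample, "pci=realloc"))
```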
-
RE: New XCP-ng documentation!
I like it as well. The topics are easy to find and the layout is good.
-
RE: 100,000 unique downloads for XCP-ng
@olivierlambert congrats to the team and also to this great community!