@jebrown OK good to know. This may be different on different OSes though, Windows may just report actually used RAM and leave out anything that is cached.
Posts
-
RE: Memory reporting incorrect values
-
RE: Memory reporting incorrect values
@fred974 This is normal behavior, most modern OSes will use up as much RAM as they possibly can for caching, unused RAM is wasted RAM.
I'd adjust your alert settings if you're getting high RAM usage alerts. You also could change things in the OS to prevent as much RAM being used for caching, but I'd avoid that if possible.
The OS reports, via the tools, to the hypervisor about it's RAM usage, so if the OS is caching a lot, and therefore using most of it's RAM, it will report that to the hypervisor.
I think, could be wrong, Windows doesn't report cached RAM via the tools so it may show "real" RAM usage, but this is OS specific stuff and not really XCP-ng or any hypervisor specific stuff.
-
RE: Wide VMs on XCP-ng
@plaidypus Ah gotcha, this makes sense.
I second scaling out instead of up.
If you're getting new hosts, I'd also keep in mind newer CPUs do have much higher per core performance (not sure what your current stuff is), so you also might be able to get away with less vCPUs and lower likelihood of NUMA spanning.
Either way though I think scaling out is the better direction to go.
-
RE: Wide VMs on XCP-ng
@plaidypus High CPU usage doesn't necessarily mean that NUMA spanning will be much of an issue, really comes down to latency at that point. I'd say you should be OK to just go with it, I get the hesitation though.
-
RE: Intel Flex GPU with SR-IOV for GPU accelarated VDIs
@jebrown Yeah I just meant in general, not specific to you. I think most businesses aren't looking for this kind of specific workload so the demand isn't very high for it.
Are you currently using Intel GPUs with Flex for this use case? Or NVidia right now and just looking to change? It might be worth considering leaving VDI, not sure if you're in the position to be able to argue that, but there are often better solutions now.
Either way, I would personally also love to see this, I just think there are other things that more companies are asking for from XCP-ng right now.
-
RE: Wide VMs on XCP-ng
@plaidypus I think the real question you should ask yourself is: do your workloads actually need to worry about the extra latency from a NUMA node span?
If you're not doing something pretty darn extreme here, I don't think it really will matter. There has been lots of talk for decades about this on VMware, but I just don't think it's that relevant anymore. Latency between sockets has gotten pretty good, so unless it's some really special workload, I don't think you should worry about this much.
The best way to validate is to just test and see how things go.
But do you have info on what workload this VM will be running?
-
RE: Intel Flex GPU with SR-IOV for GPU accelarated VDIs
@JamesG There probably is interest from homelab people, but for use in production setups, I don't really see a lot of businesses needing it.
VDI isn't really used that much by businesses now (at least the ones that do use it are slowly moving off it) and most that do use it don't need GPU acceleration. Often times it's for pretty basic applications, medical record systems, etc...
So I think that is where the lack of demand is coming from.
-
RE: Server Locks Up Periodically with ASRock X570D4I-2T AMD Ryzen 9 3900X and Intel X550-AT2
@R2rho Yeah that is really surprising.
I suppose it could be some kind of wider hardware incompatibility or something, but still crazy either way.
Glad you got that somewhat sorted out though.
-
RE: Server Locks Up Periodically with ASRock X570D4I-2T AMD Ryzen 9 3900X and Intel X550-AT2
@olivierlambert Yup, I've had exactly that a few times, usually on used boards.
@R2rho if possible, however annoying, I would also take the CPU out and check for pins on the motherboard being bent with a flashlight.
-
RE: Server Locks Up Periodically with ASRock X570D4I-2T AMD Ryzen 9 3900X and Intel X550-AT2
@olivierlambert Yeah @R2rho I am with this, it's strange to see memtest errors at all.
May be another component causing the failures though, and not the RAM itself. Possibly the board or the mem controller on the CPU.
You don't by chance have another AM4 CPU you can swap in do you?
-
RE: Server Locks Up Periodically with ASRock X570D4I-2T AMD Ryzen 9 3900X and Intel X550-AT2
Yeah wish I had a better response here but this is indeed odd.
Do you by chance have a PCIe ethernet card you can swap in to use for connectivity (and just not use the X550 ports), just to test and see if the X550 is causing the crashes.
It's a longshot though if I'm honest.
-
RE: UEFI guests not loading console
@Mefosheez I would try it on that other host you have and see if you run into the same issues, just for good measure.
Are they managed by the same XO? Maybe you can just migrate it to the other host so it's the exact same VM.
-
RE: UEFI guests not loading console
@Mefosheez Hmmm ok, yeah I would check a few of the logs on XCP-ng itself. https://docs.xcp-ng.org/troubleshooting/log-files/
Also, has this host been rebooted fairly recently?
-
RE: UEFI guests not loading console
@Mefosheez Hmmm lets double check all the advanced settings then, if this doesn't do it the might be worth checking the logs.
I'll give a reference point for an Ubuntu VM I have running, this is on an AMD Threadripper 1920X for what it's worth.
CPU Mask: none
CPU Weight: Default
CPU Cap: Default
Citrix PV drivers: disabled
HA: disabled
Affinity: none
GPUs: none
NIC: Realtek RTL8139
VGA: enabled
Video RAM: 8 MiB (might be worth bumping this to 16MiB to test, IIRC I had some VMs dislike 8MiB, don't recall the specifics)
Boot Firmware: UEFI
Secure Boot: enabled (probably leave disabled for now)
Viridian: disabled
CPU Limits: 4/4
Topology: Default
Memory is 2GiBNo VUSBs and no PCIs attached. Misc is all left default.
-
RE: UEFI guests not loading console
@Mefosheez Do you have secure boot enabled on these?
-
RE: Wide VMs on XCP-ng
I haven't had any issues with NUMA node balancing, do you need this for a specific high performance application? Generally speaking it should "just work".
There are some things you can do to optimize but I'd only go down that road if you are running into issues with very wide VMs.
Also, are we talking cross socket NUMA nodes (e.g. multiple CPUs) or just nodes within an EPYC CPU?
-
RE: Server Locks Up Periodically with ASRock X570D4I-2T AMD Ryzen 9 3900X and Intel X550-AT2
@R2rho Do you have a way to view the console output when this happens? Curious if you had a display attached, you may see the remnants of the crash.
And I presume you can't get ping responses from it right? The other thought is maybe it's lost network connectivity but isn't actually a fully locked up host.
There is also info here on the log files you can check.
-
RE: UEFI guests not loading console
@Mefosheez You definitely should be able to get console with UEFI, so this does sound like a configuration issue.
You aren't passing through any GPUs or anything like that right?
What OS are you trying to boot to?
I have a feeling it's hanging and isn't really something to do with XO or XCP-ng itself.
-
RE: Immutable S3 Backups (Backblaze) and Merging; A Little Confused
@florent Finally getting back to this post, I know it's been months, just haven't had a lot of time, sorry!
I think I am still a bit confused, but I will do some additional testing to see if I can confirm my suspicions.
My confusion is that, you can't merge the deltas into the key if the key is locked behind Object Lock, the file isn't writable so you can't do the merge operation, right?
So that being said, it sounds to me like maybe the retention of the object lock/immutability needs to be set to be less than the retention period in XOA, right?
This way the original key is not immutable and can be written to when the merge happens?
Or does XOA just "wait" until the key isn't locked and then do the merge operation?
-
RE: Immutable S3 Backups (Backblaze) and Merging; A Little Confused
@olivierlambert @florent Just following up on this again, I am going to be doing some more testing this week to see if I can discover any issues. Still very curious about how this should be handled though.