XCP-ng 8.3 updates announcements and testing
-
Thanks, my updates and my RPU are normally at different times. I also don't backup multiple times per day like I probably should.
-
I get this in dmesg after the latest updates :
[ 54.673443] python3[3691]: segfault at 200000 ip 00007f16eb8eca9f sp 00007ffd b84e9ff0 error 4 in libpython3.6m.so.1.0[7f16eb804000+28d000] [ 54.673450] Code: 01 00 00 8d 5f ff 48 8d 2d de 3a 3c 00 c1 eb 03 44 8d 24 1b 4e 8b 44 e5 00 49 8b 70 10 49 39 f0 74 5f 49 8b 40 08 41 83 00 01 <48> 8b 38 48 85 ff 49 89 78 08 74 0d 48 83 c4 10 5b 5d 41 5c c3 0f [ 84.587661] xapi[3697]: segfault at 7f28cacaea40 ip 00007f28c6df0ec2 sp 00007 f289a5b8af0 error 6 in libjemalloc.so.2[7f28c6d85000+85000] [ 84.587669] Code: 48 2b 73 08 44 8b 4d 84 ba 01 00 00 00 49 83 c2 01 49 0f af f1 4c 8d 0d ac 72 42 00 48 89 f1 48 c1 ee 26 48 c1 e9 20 48 d3 e2 <48> 31 54 f3 40 48 8b 8d 58 ff ff ff 48 8b 33 48 8d be 00 00 00 10Hi, if possible can you share us more information to troubleshoot, like a xen-bugtool --yestoall output ?
https://docs.xcp-ng.org/troubleshooting/log-files/If you can also do the usual hardware check (eg: memtest) that would help, because my intuition is that it is not related to this precise update but we like to be sure.
-
@rzr Hi. It appeared only once, after a host reboot. If it shows again, I'll let you know.
-
-
We pushed the updates to the
xcp-ng-updatesrepository: https://xcp-ng.org/blog/2026/05/21/may-2026-updates-3-for-xcp-ng-8-3-lts/Changed since the initial announcement,
xenwas updated with the proper vulnerability fix and an update tosmwas added to fix an issue on LVM-based SRs with CBT enabled.Thanks everyone for your feedback!
-
We pushed the updates to the
xcp-ng-updatesrepository: https://xcp-ng.org/blog/2026/05/21/may-2026-updates-3-for-xcp-ng-8-3-lts/Changed since the initial announcement,
xenwas updated with the proper vulnerability fix and an update tosmwas added to fix an issue on LVM-based SRs with CBT enabled.Thanks everyone for your feedback!
update main pool. so far so good. updates went well and vms are running.
-
Did my production this morning... Not a smooth upgrade this time.
The master took so long that the process timed out, I can't tell too much more because I was doing other work and was doing this remotely.
I handled the next host manually, and that took so much time I gathered my hearing protection and went over to the rack. It was stuck in a reboot phase but hadn't shut down for about 20 minutes. Held my finger on the power button. booted back up with a couple reboots in the process and it finally was ready.
Moved the VMs off the third host, manually triggered the update, and sat and watched looking at the VGA output to see what was happening. The reboot phase saw the XCP-ng animation progress all the way to an empty bar, but sat there another 5-10 minutes until I again held my finger on the power button to shut it off. Power up and after a bit of time it was ready again. All in all, updating three hosts took me around 2 hours today.
Here's the one condition that I think could contribute to this issue:
I have two NAS normally connected, an ISO and NFS connection on each. One of the servers is powered down for construction, but I did not disconnect it from the hosts. Could this severed connection be the reason why my updates took so long, something around not being able to purge or drain the state before the reboot?
I've disconnected those SR from the hosts, and I'll probably do a rolling pool reboot later today or next week and see if things go better.
And all that said, my XO sources is not happy with this update. XO6 isn't grabbing the data so dashboards are blank or take a long time with spinning wheels gathering the data. XO5 is immediate. One of my XO sources updated this morning, the other was yesterday so they should be pretty close to current.
-
I have two NAS normally connected, an ISO and NFS connection on each. One of the servers is powered down for construction, but I did not disconnect it from the hosts. Could this severed connection be the reason why my updates took so long, something around not being able to purge or drain the state before the reboot?
Don't look further, that's exactly the issue. Reboot would have occur in the end after 30 minutes (timeout) and all other operations will be extremely slow.
You must disconnect a SR for maintenance, otherwise you enter in a world of pain.
-
I have issue with rolling pool update with 1 of my 3 pools at work. It was the last pool to be updated. Host 1 updated no issues. vms stopped migrated off host 2 to complete updates.
Support ticket opened -
Ticket#7758427. Found 1 vm with cpu stuck at 100% and unresponsive. Force rebooted vm and proceed updates on host2. -
Now receiving
UUID_INVALIDwhen trying to disable CBT on a VDI.
Perhaps a result of fixing the "List index out of range"-bug?XO Source: 5811d
Node 24vdi.set { "id": "57e0db3e-3131-40df-a620-c1118047b9d4", "cbt": false } { "code": "UUID_INVALID", "params": [ "VDI", "7b179964-dec6-4e24-a13b-8c5c56efcd95" ], "call": { "duration": 2, "method": "VDI.get_by_uuid", "params": [ "* session id *", "7b179964-dec6-4e24-a13b-8c5c56efcd95" ] }, "message": "UUID_INVALID(VDI, 7b179964-dec6-4e24-a13b-8c5c56efcd95)", "name": "XapiError", "stack": "XapiError: UUID_INVALID(VDI, 7b179964-dec6-4e24-a13b-8c5c56efcd95) at XapiError.wrap (file:///opt/xen-orchestra/packages/xen-api/_XapiError.mjs:16:12) at file:///opt/xen-orchestra/packages/xen-api/transports/json-rpc.mjs:38:21 at runNextTicks (node:internal/process/task_queues:65:5) at processImmediate (node:internal/timers:472:9)" } -
That's what I thought, I have it disconnected now and I'll try a rolling reboot when I'm back at work.
-
acebmxer said:
I have issue with rolling pool update with 1 of my 3 pools at work. It was the last pool to be updated. Host 1 updated no issues. vms stopped migrated off host 2 to complete updates.
Support ticket opened -
Ticket#7758427. Found 1 vm with cpu stuck at 100% and unresponsive. Force rebooted vm and proceed updates on host2.Well I think i found the source of my problems. After having continues other odd issues with this remote pool. I decided i was going to reboot everything. That's when every vm started to fail. Logged into Synology rs1221+ and it was just very sluggish and not responsive. No new error alerts or anything to explain the odd behavior. Rebooted it and even after boot still odd behavior until finally disk error. Then the system started to respond.
Luckily I have a spare drive onsite but cant gain access until Monday possibly Tuesday. Fingers crossed. Lucky for backups. Looks like the important vms had a successful backup as of yesterday so thats good.
-
Now receiving UUID_INVALIDwhen trying to disable CBT on a VDI.
Perhaps a result of fixing the "List index out of range"-bug?Let me call @Team-Storage about this.
-
@probain Hello,
It's likely linked to the
List index out of rangebug.
That bug was linked to the SR scan failing to introduceCBT_metatadataVDI in the XAPI database, could you try to launch axe sr-scan uuid=<SR UUID>and try again to disable CBT?
If it does not work, could you share the/var/log/SMlogof around the time you are trying to disable CBT? -
acebmxer said:
acebmxer said:
I have issue with rolling pool update with 1 of my 3 pools at work. It was the last pool to be updated. Host 1 updated no issues. vms stopped migrated off host 2 to complete updates.
Support ticket opened -
Ticket#7758427. Found 1 vm with cpu stuck at 100% and unresponsive. Force rebooted vm and proceed updates on host2.Well I think i found the source of my problems. After having continues other odd issues with this remote pool. I decided i was going to reboot everything. That's when every vm started to fail. Logged into Synology rs1221+ and it was just very sluggish and not responsive. No new error alerts or anything to explain the odd behavior. Rebooted it and even after boot still odd behavior until finally disk error. Then the system started to respond.
Luckily I have a spare drive onsite but cant gain access until Monday possibly Tuesday. Fingers crossed. Lucky for backups. Looks like the important vms had a successful backup as of yesterday so thats good.
Still having issues with this remote pool. Synology is still rebuilding the storage pool, but the time seems unreal to complete 80+ days. It keeps dropping and increasing... Yet I tried to migrate vm from NFS SR to local storage and vm having issues boot. Try to determining but i think i have multiple issue just not sure which ones.
-
@probain Hello,
It's likely linked to the
List index out of rangebug.
That bug was linked to the SR scan failing to introduceCBT_metatadataVDI in the XAPI database, could you try to launch axe sr-scan uuid=<SR UUID>and try again to disable CBT?
If it does not work, could you share the/var/log/SMlogof around the time you are trying to disable CBT?I've sent you a DM for sharing the logs.. Unfortunately I "solved" the issue by deleting all snapshots related to each VM. Including CBT ones. That did make it so I could toggle CBT on the VDIs again.
But I've collected the logs for you.
This also seems like a good time to raise my suggestion to have somewhere at vates where we could upload details in a similar way to how TrueNAS does it. Suggested here: https://feedback.vates.tech/posts/69/suggesting-to-add-a-debug-file-option
-
New security and maintenance update candidates for XCP-ng 8.3 LTS (kernel)
This release batch contains security fixes on the Linux kernel in dom0, version updates, some bug fixes and a few improvements.
What changed
Virtualization & System
-
kernel: Update to 4.19.19-8.0.46.5- Fixes multiple vulnerabilities:
- CVE-2026-46300: A logic error in the network stack could allow an unprivileged local user to escalate its privileges to root by modifying page caches for file-backed files that were not supposed to be writable. The modifications are not persistent to a reboot (i.e. no disk corruption). This vulnerability is used by the public exploit
Fragnesia. - CVE-2026-46333: Incorrect tracking of users privilege level when a task is exiting in the ptrace sub-system could allow an unprivileged local user to escalate its privileges to root by writing to file descriptors they are not supposed to have access to. The changes made to potentially root-owned files are persisted across reboots. This vulnerability is used by the public exploits ssh-keysign-pwn as well as
ptrace_may_dream. - CVE-2026-43494: A double-free of pinned pages in the RDS kernel module in the transmit error path could allow an unprivileged local user to escalate its privileges to root by modifying page caches for file-backed files, allowing them to for example overwrite a SUID binary in page cache with a shellcode. Changes are not persistent across reboots. This vulnerability is used by the public exploit
pintheft.
- CVE-2026-46300: A logic error in the network stack could allow an unprivileged local user to escalate its privileges to root by modifying page caches for file-backed files that were not supposed to be writable. The modifications are not persistent to a reboot (i.e. no disk corruption). This vulnerability is used by the public exploit
- Fixes multiple vulnerabilities:
-
qemu: Fix a potential issue in guest memory mapping lookup. -
edk2:- Fix issues while booting from physical CD/DVD drive.
- Bump UEFI guest vCPU limit to 128 vCPU (was 96 vCPUs)
-
dmidecode: Update to 3.6-3- Version able to read type 42 tables (redfish)
-
varstored: Update to 1.3.2-2.1- Sync with upstream.
-
ipxe: PXE boot support of BIOS VMs on a VLAN with 802.1Q priority tags
Control plane
xapi: Enable USB passthrough of smartcards
Storage
blktap: No functional change. Only sync with upstream.
Network
openssh: Drop support of insecure clients- Old OpenSSH clients (version less than 7.2) can no longer connect with
ssh-rsa(due to SHA-1 being no longer accepted by the server). - The solution is either to update OpenSSH-clients (to a version >= 7.2), or to generate and use
ED25519keys.
- Old OpenSSH clients (version less than 7.2) can no longer connect with
Others
libtasn1: Update to 4.21.0 (hardening)fuse: Rebuildslang: Rebuildsystemtap: Rebuild
Optional packages
libreswan: Rebuildnetdata: Rebuild
Versions:
blktap: 3.55.5-6.7.xcpng8.3 -> 3.55.5-9.1.xcpng8.3dmidecode: 1:3.0-5.el7 -> 1:3.6-3.xcpng8.3edk2: 20220801-1.7.10.1.xcpng8.3 -> 20220801-1.7.11.1.xcpng8.3fuse: 2.9.2-10.xcpng8.3 -> 2.9.2-10.1.xcpng8.3ipxe: 20121005-1.0.7.xcpng8.3 -> 20121005-1.0.8.xcpng8.3kernel: 4.19.19-8.0.46.3.xcpng8.3 -> 4.19.19-8.0.46.5.xcpng8.3libreswam: 4.12-2.3.1.xcpng8.3 -> 4.12-2.3.2.xcpng8.3libtasn1: 4.10-1.el7 -> 4.21.0-1.xcpng8.3openssh: 9.8p1-1.2.3.xcpng8.3 -> 9.8p1-1.2.4.xcpng8.3netdata: 1.47.5-4.2.xcpng8.3 -> 1.47.5-4.3.xcpng8.3qemu: 2:4.2.1-5.2.17.1.xcpng8.3 -> 2:4.2.1-5.2.18.1.xcpng8.3slang: 2.3.2-11.xcpng8.3 -> 2.3.2-11.1.xcpng8.3systemtap: 4.0-5.2.xcpng8.3 -> 4.0-5.3.xcpng8.3varstored: 1.3.1-2.1.xcpng8.3 -> 1.3.2-2.1.xcpng8.3xapi: 26.1.4-3.1.xcpng8.3 -> 26.1.4-3.2.xcpng8.3
Test on XCP-ng 8.3
yum clean metadata --enablerepo=xcp-ng-testing,xcp-ng-candidates yum update --enablerepo=xcp-ng-testing,xcp-ng-candidates rebootThe usual update rules apply: pool coordinator first, etc.
What to test
As usual, normal use and anything else you want to test.
Test window before official release of the updates
~1 day
We would like to thank users who reported feedback since our last call for testing:
@Andrew, @acebmxer, @flakpyro, @greg_e, @jeffberntsen, @marcoi, @ovicz, @ph7, @probain.
-
-
@rzr
nslookupis broken, but it was before this update too.nslookup vates.com 8.8.8.8 Server: 8.8.8.8 Address: 8.8.8.8#53 Non-authoritative answer: Name: vates.com Address: 104.21.52.238 Name: vates.com Address: 172.67.205.118 openssl_link.c:132: INSIST(dst__memory_pool != ((void *)0)) failed, back trace #0 0x7fb705ab80e7 in ?? #1 0x7fb705ab803a in ?? #2 0x7fb7066c5780 in ?? #3 0x7fb704ed0df6 in ?? #4 0x7fb704f17464 in ?? #5 0x7fb704f17732 in ?? #6 0x7fb704f16b8d in ?? #7 0x7fb703681bd9 in ?? #8 0x7fb703681c27 in ?? #9 0x7fb70366a44c in ?? #10 0x405818 in ?? Aborted (core dumped) -
Updated both hosts at home with no issues. Will continue to test.
Dependency Installed: libtasn1-tools.x86_64 0:4.21.0-1.xcpng8.3 Updated: blktap.x86_64 0:3.55.5-9.1.xcpng8.3 dmidecode.x86_64 1:3.6-3.xcpng8.3 edk2.x86_64 0:20220801-1.7.11.1.xcpng8.3 forkexecd.x86_64 0:26.1.4-3.2.xcpng8.3 fuse-libs.x86_64 0:2.9.2-10.1.xcpng8.3 ipxe.noarch 0:20121005-1.0.8.xcpng8.3 kernel.x86_64 0:4.19.19-8.0.46.5.xcpng8.3 libtasn1.x86_64 0:4.21.0-1.xcpng8.3 libtasn1-devel.x86_64 0:4.21.0-1.xcpng8.3 message-switch.x86_64 0:26.1.4-3.2.xcpng8.3 openssh.x86_64 0:9.8p1-1.2.4.xcpng8.3 openssh-clients.x86_64 0:9.8p1-1.2.4.xcpng8.3 openssh-server.x86_64 0:9.8p1-1.2.4.xcpng8.3 qcow-stream-tool.x86_64 0:26.1.4-3.2.xcpng8.3 qemu.x86_64 2:4.2.1-5.2.18.1.xcpng8.3 rrdd-plugins.x86_64 0:26.1.4-3.2.xcpng8.3 slang.x86_64 0:2.3.2-11.1.xcpng8.3 sm-cli.x86_64 0:26.1.4-3.2.xcpng8.3 squeezed.x86_64 0:26.1.4-3.2.xcpng8.3 systemtap-runtime.x86_64 0:4.0-5.3.xcpng8.3 varstored.x86_64 0:1.3.2-2.1.xcpng8.3 varstored-guard.x86_64 0:26.1.4-3.2.xcpng8.3 varstored-tools.x86_64 0:1.3.2-2.1.xcpng8.3 vhd-tool.x86_64 0:26.1.4-3.2.xcpng8.3 wsproxy.x86_64 0:26.1.4-3.2.xcpng8.3 xapi-core.x86_64 0:26.1.4-3.2.xcpng8.3 xapi-nbd.x86_64 0:26.1.4-3.2.xcpng8.3 xapi-rrd2csv.x86_64 0:26.1.4-3.2.xcpng8.3 xapi-storage-script.x86_64 0:26.1.4-3.2.xcpng8.3 xapi-tests.x86_64 0:26.1.4-3.2.xcpng8.3 xapi-xe.x86_64 0:26.1.4-3.2.xcpng8.3 xcp-networkd.x86_64 0:26.1.4-3.2.xcpng8.3 xcp-rrdd.x86_64 0:26.1.4-3.2.xcpng8.3 xenopsd.x86_64 0:26.1.4-3.2.xcpng8.3 xenopsd-cli.x86_64 0:26.1.4-3.2.xcpng8.3 xenopsd-xc.x86_64 0:26.1.4-3.2.xcpng8.3 -
Installed on my test systems and all seems to be working well so far.
Hello! It looks like you're interested in this conversation, but you don't have an account yet.
Getting fed up of having to scroll through the same posts each visit? When you register for an account, you'll always come back to exactly where you were before, and choose to be notified of new replies (either via email, or push notification). You'll also be able to save bookmarks and upvote posts to show your appreciation to other community members.
With your input, this post could be even better 💗
Register Login