    XCP-ng 8.2 updates announcements and testing

    • stormi Vates 🪐 XCP-ng Team

      Urgent security update candidate (only a few hours to provide feedback)

      There's an escalation of privilege vulnerability in Intel CPUs of the last few years. It was silently mitigated by Intel in previous microcode updates for the most recent CPU generations, but older affected CPUs were only fixed in yesterday's microcode.

      See https://www.intel.com/content/www/us/en/security-center/advisory/intel-sa-00950.html

      Test on XCP-ng 8.2

      yum clean metadata --enablerepo=xcp-ng-testing
      yum update microcode_ctl --enablerepo=xcp-ng-testing
      reboot
      

      The usual update rules apply: pool coordinator first, etc.
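
For pools with several hosts, the "pool coordinator first" rule can be scripted. The following is only a minimal sketch, assuming it runs in dom0 where the `xe` CLI is available. `order_hosts` is a hypothetical helper name for illustration, not something shipped with XCP-ng.

```shell
#!/bin/sh
# order_hosts COORDINATOR HOSTS
#   COORDINATOR: UUID of the pool coordinator.
#   HOSTS: comma-separated host UUIDs (the format printed by `xe ... --minimal`).
# Prints one UUID per line: the coordinator first, then the other hosts
# in their original order. (Hypothetical helper, for illustration only.)
order_hosts() {
    coordinator=$1
    echo "$coordinator"
    echo "$2" | tr ',' '\n' | while IFS= read -r host; do
        if [ -n "$host" ] && [ "$host" != "$coordinator" ]; then
            echo "$host"
        fi
    done
    return 0
}

# Only query the pool when xe is present, i.e. when run on a host.
if command -v xe >/dev/null 2>&1; then
    pool_coordinator=$(xe pool-list params=master --minimal)
    pool_hosts=$(xe host-list --minimal)
    echo "Suggested update order:"
    order_hosts "$pool_coordinator" "$pool_hosts"
fi
```

The script only prints an order. Updating and rebooting each host is still done by hand with the commands above.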

      Versions

      • microcode_ctl: 2.1-26.xs26.2.xcpng8.2

      What to test

      Normal use and anything else you want to test. The closer to your actual use of XCP-ng, the better.

      If you don't have time for more, just installing the update, rebooting, and checking that one VM can start will be enough.

      Test window before official release of the updates

      A few hours.

      • Gheppy

        The server (Lenovo System x3650 M5) has started and all 4 VMs are started and functional

        • Andrew Top contributor @stormi

          @stormi Microcode updated on affected Gen11 i7. Running normally.

          • stormi Vates 🪐 XCP-ng Team

            Thanks for the feedback! Update published: https://xcp-ng.org/blog/2023/11/15/november-2023-security-update/

            The blog post also contains information about two vulnerabilities in Xen, which don't affect XCP-ng in a supported and/or default configuration.

            Users of PV guests who still haven't converted them to HVM should consider it, though.

            • stormi Vates 🪐 XCP-ng Team

              New security update candidates

              As promised in the announcement of the previous security update, here's a new one which includes changes for previously missing XSA updates as well as an updated AMD microcode.

              Security updates

              • xen-*:
                • Fix XSA-445 - x86/AMD: mismatch in IOMMU quarantine page table levels. On x86 AMD systems with IOMMU hardware, a device in quarantine mode, using dom_io, could access leaked data from previously quarantined pages. This is not enabled by default in XCP-ng, but can still be enabled at Xen boot time.
                • Fix XSA-446 - x86: BTC/SRSO fixes not fully effective. A PV guest could infer memory content from other guests. We do not recommend using PV guests and have been suggesting switching to HVM for a while, so we do hope most users were not impacted by this.
              • linux-firmware: Update AMD microcode to the 2023-10-19 drop, which updates family 19h (Zen 3, Zen 3+ and Zen 4). AMD Advisory here.

              Other updates

              We also plan to push other, non-security updates at the same time, to pave the way for the upcoming refreshed installation ISOs.

              • gpumon: suppression of logs which were needlessly written every 5s into /var/log/daemon.log.
              • tzdata: updated timezones.
              • vendor-drivers: pull new drivers into XCP-ng:
                • igc-module: Intel device drivers for I225/I226
                • r8125-module: Realtek r8125 device drivers
                • mpi3mr-module: Broadcom mpi3mr RAID device driver

              Test on XCP-ng 8.2

              yum clean metadata --enablerepo=xcp-ng-testing
              yum update "xen-*" linux-firmware gpumon vendor-drivers tzdata --enablerepo=xcp-ng-testing
              reboot
              

              The usual update rules apply: pool coordinator first, etc.

              Versions:

              • xen: 4.13.5-9.38.1.xcpng8.2
              • linux-firmware: 20190314-10.1.xcpng8.2 (Update: now 20190314-10.2.xcpng8.2, which adds firmware for rtl8125)
              • gpumon: 0.18.0-11.2.xcpng8.2
              • tzdata: 2023c-1.el7
              • vendor-drivers: 1.0.2-1.6.xcpng8.2
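
After the update, the installed versions can be compared against the list above. This is a rough sketch, assuming a dom0 shell with `rpm` available. `compare_version` is a hypothetical helper, and `xen-hypervisor` is an assumption standing in for the `xen-*` packages, since the list above only gives the version under the name `xen`.

```shell
#!/bin/sh
# Candidate versions from the list above, one "package version" pair per line.
# ASSUMPTION: xen-hypervisor stands in for the xen-* packages; adjust as needed.
expected_versions='xen-hypervisor 4.13.5-9.38.1.xcpng8.2
linux-firmware 20190314-10.2.xcpng8.2
gpumon 0.18.0-11.2.xcpng8.2
tzdata 2023c-1.el7
vendor-drivers 1.0.2-1.6.xcpng8.2'

# compare_version NAME EXPECTED ACTUAL
# Prints a one-line verdict. (Hypothetical helper, for illustration only.)
compare_version() {
    if [ "$2" = "$3" ]; then
        echo "OK $1 $3"
    else
        echo "MISMATCH $1 expected=$2 actual=$3"
    fi
}

# Only query the package database when rpm is present.
if command -v rpm >/dev/null 2>&1; then
    echo "$expected_versions" | while read -r name expected; do
        # rpm -q exits non-zero when the package is not installed.
        actual=$(rpm -q --queryformat '%{VERSION}-%{RELEASE}' "$name") || actual="not-installed"
        compare_version "$name" "$expected" "$actual"
    done
fi
```

A `MISMATCH` line usually just means the testing repository was not enabled for that package, or that a newer build has been pushed since.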

              What to test

              Normal use and anything else you want to test. The closer to your actual use of XCP-ng, the better.

              Test window before official release of the updates
              ~4 days

              Samuel, along with David and Gaël

              • olivierlambert Vates 🪐 Co-Founder CEO

                Update done and reboot successful 🙂

                • JeffBerntsen Top contributor @stormi

                  The update is installed and seems to be working without problems on my two test systems.

                  • Andrew Top contributor @stormi

                    @stormi Updated on several Intel Xeon servers. Updated on new Intel and AMD (zen3) systems with IGC and r8125 chips. One issue... the base install does not include the standard firmware for the 8125.

                    • gskger Top contributor @stormi

                      @stormi My two-host cluster (HP ProDesk 600 G6) updated without an issue. Let's see how the cluster performs over the coming days.

                      • stormi Vates 🪐 XCP-ng Team @Andrew

                        @Andrew I just pushed an updated linux-firmware to testing, with firmware for rtl8125. Should be available within 10 minutes.

                        • Andrew Top contributor @stormi

                          @stormi r8125 firmware loads.

                          • stormi Vates 🪐 XCP-ng Team

                            So, we found out the AMD vulnerability actually doesn't affect XCP-ng directly, because Xen doesn't use AMD's SEV features currently.

                            The other two vulnerabilities still need fixing, but they both can only be exploited if XCP-ng is used in an either unlikely or unsupported way. We'll fix them in due course, but won't push the update to everyone today as initially planned. We will delay them slightly to give them a chance to be grouped with future updates and thus cause less maintenance for users.

                            Thanks for the tests anyway: we will be able to publish these packages whenever we need now.

                            • bleader Vates 🪐 XCP-ng Team

                              New security update candidates (kernel)

                              A new XSA was published on the 23rd of January, so we have a new security update to include it.

                              Security updates

                              • kernel:
                                • Fix XSA-448 - Linux: netback processing of zero-length transmit fragment. An unprivileged guest can cause Denial of Service (DoS) of the host by sending network packets to the backend, causing the backend to crash. This was discovered through issues when using pfSense with wireguard causing random crashes of the host.

                              Test on XCP-ng 8.2

                              yum clean metadata --enablerepo=xcp-ng-testing
                              yum update kernel --enablerepo=xcp-ng-testing
                              reboot
                              

                              The usual update rules apply: pool coordinator first, etc.

                              Versions:

                              • kernel: 4.19.19-7.0.23.1.xcpng8.2

                              What to test

                              Normal use and anything else you want to test. The closer to your actual use of XCP-ng, the better.

                              Test window before official release of the updates
                              ~2 days due to security updates.

                              • stormi Vates 🪐 XCP-ng Team

                                Did anyone install it? The 2-day delay is over and we'll publish today.

                                • Andrew Top contributor @stormi

                                  @stormi Yes, I installed it on a few running hosts. I did not have any kernel crashes before, and none after...

                                  • olivierlambert Vates 🪐 Co-Founder CEO

                                    Installed here, works 🙂

                                    • CJ @NielsH

                                      @NielsH Kind of off topic but figured I'd mention it as I only recently discovered this.

                                      Not sure what VMs you're running, but if they can survive being off for a short time (redundancy of services or planned outage) you can reboot the host using Smart Reboot under the Advanced tab. While it incurs some downtime, it allows for a much faster reboot time than migrating the VMs to another server and back.

                                      I use local storage as well and it's been a game changer for dealing with pool patches.

                                      • bleader Vates 🪐 XCP-ng Team

                                        The update has been published, thanks for the feedback and tests.

                                        https://xcp-ng.org/blog/2024/01/26/january-2024-security-update/

                                        • NielsH @CJ

                                          Cheers, thanks for the suggestion. In our case we are actually phasing out XCP-ng and are in the process of migrating to Proxmox, since we can migrate at 30-35Gbit/s there. The disk performance is so much faster there that we can perform all the updates in a single day instead of 2 weeks 🙂

                                          Another issue we had was that migrations of very large VMs (usually 8+ cores) are quite impactful. Because we want to use VMs with 24-48 cores and 128GB RAM as well, it simply was not usable enough for us. There are several seconds, or sometimes even minutes, of downtime during the last phase of the migration with the large VMs.

                                          With Proxmox we have seen very little downtime (<1s) which we are very happy about.

                                          • bleader Vates 🪐 XCP-ng Team

                                            New security update candidates (xen)

                                            Two new XSAs were published on the 30th of January.

                                            • XSA-449 impacts PCI passthrough users.
                                            • XSA-450 only affects the case where Xen is compiled without HVM support, which is not the case in XCP-ng. We therefore chose not to include this fix yet (it will likely be included in future versions, maybe not as part of a critical security update).

                                            Security updates

                                            • xen-*:
                                              • Fix XSA-449 - pci: phantom functions assigned to incorrect contexts. A malicious VM assigned with a PCI device could in some cases access data of a guest previously using the same PCI device. This requires PCI passthrough on a device using phantom functions and reassigning the same device to a new VM to be exploitable.

                                            Test on XCP-ng 8.2

                                            yum clean metadata --enablerepo=xcp-ng-testing
                                            yum update "xen-*" --enablerepo=xcp-ng-testing
                                            reboot
                                            

                                            The usual update rules apply: pool coordinator first, etc.

                                            Versions:

                                            • xen: 4.13.5-9.38.2.xcpng8.2

                                            What to test

                                            Normal use and anything else you want to test. If you are using PCI passthrough devices, that's even better, but we would also be glad to have confirmation from others that their normal use case still works as intended.

                                            Test window before official release of the updates
                                            ~2 days because of security updates.
