XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Slow LiveMigration for VMs with large memory

    Scheduled Pinned Locked Moved Compute
    7 Posts 4 Posters 153 Views 4 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • K Offline
      KPS Top contributor
      last edited by

      Hi!

      When I migrate VMs between hosts (with shared storage), the migration takes significantly longer compared to VMware.

      Environment:

      AMD EPYC 9374F

      768 GB RAM

      6 × 10 GbE

      Observation:

      Migrating a VM with 8 GB RAM takes 0:37 min.
      Migrating a VM with 96 GB RAM takes around 5:20 min.

      Additionally, during the final step of the migration, large VMs experience a noticeable freeze of about 8-10 seconds although they are nearly idle, which makes live pool upgrades during business hours impractical.

      Is there anything, I can do to speed this up - especially the freeze?

      Best wishes
      KPS

      1 Reply Last reply Reply Quote 0
      • olivierlambertO Offline
        olivierlambert Vates 🪐 Co-Founder CEO
        last edited by

        You mean 37 seconds for 8GiB, right?

        The freeze at the end is a bit long but that depends on many factors. We could test an idea we discussed recently about emu-manager 🤔

        Adding @TeddyAstie in the convo

        K 1 Reply Last reply Reply Quote 0
        • K Offline
          KPS Top contributor @olivierlambert
          last edited by

          @olivierlambert
          Yes, it's 37s for the migration of a VM with 8GB RAM.
          For large VMs, that is critical (5:20 min for 96GB RAM).

          ...but the core-issue is the freeze, as everything above 4s can leads to disconnects, etc.

          1 Reply Last reply Reply Quote 1
          • P Offline
            Pilow
            last edited by

            what do you observe as transfer speeds on your nics when live migrating the large one ?
            I spin up a windows 2025 VM with 96Gb RAM, even if in-OS consumption is 2Gb/96Gb I get same time as you :
            00693335-d11c-4a7a-9ee3-a234a31e3ae7-{CF201379-1D33-4C64-90E8-3A554DAAA20D}.png

            a very large transfer is occuring on the host
            b9a47f9d-0c35-4fa9-8cfe-489335716407-{AB323215-3EB1-4F95-8279-CE661C62EC45}.png

            and we can see the 96Gb of RAM ingress on the host
            4f416858-0c0f-4e02-a848-c170b6511364-{60A12DC9-4742-4DCF-A90B-3D2326AD5A69}.png

            any way to "only" migrate the USED ram ?
            as you can see, only 2.22Gb of RAM is really used by the OS, but 96Gb is transfered... fe4195f2-60e7-4b37-9fca-5eda40d23538-{6FCEC286-22D7-4D38-964E-0D6EA2050E49}.png

            CBT for the RAM ? ^^'

            K 1 Reply Last reply Reply Quote 0
            • K Offline
              KPS Top contributor @Pilow
              last edited by

              @Pilow
              My migration speed is a bit lower than yours. I am at about 350MiB. What is your setup?
              d17c8e34-8cfc-42a7-b84a-3d8c42cc0a10-image.png

              But: I think, the "migration time" is not as important, as the freeze-time... (at least for me)

              P 1 Reply Last reply Reply Quote 0
              • P Offline
                Pilow @KPS
                last edited by

                @KPS HPE DL360 G11 + 2x Mellanox ConnectX-6 dual ports 10/25Gb Nics, XCPNG 8.3, latest XOA.

                1 Reply Last reply Reply Quote 0
                • S Online
                  spelon
                  last edited by

                  I get ~510MiB on DL380 G10, 2.6ghz, 40gbe cx3, 8.3.

                  1 Reply Last reply Reply Quote 1
                  • First post
                    Last post