XCP-ng

    Short VM freeze when migrating to another host

    • nikade Top contributor @arc1

      @arc1 So if you go to XOA and open the console of the VM, what happens then?
      Is the VM frozen for the duration of those 10 pings? Open Task Manager to see if there is any CPU activity.

      • arc1 @nikade

        @nikade Yes, the VM is frozen, with no CPU activity.

        • nikade Top contributor @arc1

          @arc1 said in Short VM freeze when migrating to another host:

          @nikade Yes, the VM is frozen, with no CPU activity.

          So the VM is actually frozen in the console?
          Because if it wasn't, I'd suggest adjusting the MAC address aging on your switches, since the VM's MAC address stays bound to the previous physical host's switch port for a period of time after migrating.

          • andrewperry

            We're seeing this issue when trying to migrate a Debian VM with 16GB of RAM.

            It is a worker node in a Kubernetes cluster so it is likely that the RAM changes a fair bit. It is not uncommon for the migration to fail due to the freeze hitting a 30 second time limit.

            A Windows 10 Pro VM with 16GB of RAM migrates fine, presumably because not much is changing in its RAM.

            Following along for recommendations! Our hosts sound very similar to @arc1 except our network speed is slower, which is one thing we are working on.

            • nikade Top contributor

              And you guys aren't using any kind of dynamic memory?
              Can you post a screenshot of the Advanced tab where it shows the memory configuration?

              We have VMs with 128 GB of RAM that migrate just fine. When migrating one between hosts, the network shows peaks at 7.6 Gbit/s and the migration takes about 20 seconds.
              Smaller VMs with 8, 16 or even 32 GB of RAM are migrated almost instantly.

              • robyt @nikade

                @nikade [image: 2568c5bf-5336-4461-8f1f-60cf093f93a2-immagine.png]
                Inside the VM (Linux), free shows 94 GB of total memory.

                • andrewperry @nikade

                  @nikade thanks for the ideas of where to look!

                  In my case we're testing and just have a 1Gb link between these hosts, which is what I was putting it down to.

                  This particular VM is a freshly migrated PV from Debian Xen with:

                  Memory limits (min/max)
                  Static: 16 MiB/16 GiB
                  Dynamic: 8 GiB/16 GiB

                  Could that Dynamic setting be the problem? As I recall, it reduces the VM to 8 GiB on migrate, so perhaps 8 GiB isn't enough for the VM while the migration is running.

                  I will try changing it to 16/16 and see if that has any noticeable impact. Thanks!
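
For a feel of what the 1 Gb link mentioned above implies, here is a minimal back-of-the-envelope sketch. It only computes how long one full pass over the VM's RAM would take at line rate; it ignores protocol overhead, compression and re-sent dirty pages, and the 10 Gbit/s comparison figure is a hypothetical value, not something reported in this thread.

```python
def full_pass_seconds(ram_gib: float, link_gbit_per_s: float) -> float:
    """Seconds needed to send every byte of RAM across the link exactly once.

    Idealised: assumes the link runs at its nominal rate with no overhead.
    """
    ram_bits = ram_gib * 1024 ** 3 * 8          # GiB -> bits
    return ram_bits / (link_gbit_per_s * 1e9)   # Gbit/s -> bits per second


if __name__ == "__main__":
    # 16 GiB is the VM size from the post; 10 Gbit/s is a made-up comparison.
    for link in (1.0, 10.0):
        secs = full_pass_seconds(16, link)
        print(f"16 GiB over {link:g} Gbit/s: about {secs:.0f} s per full pass")
```

On a 1 Gbit/s link a single pass over 16 GiB already takes more than two minutes, so any pages the guest dirties in that time have to be sent again afterwards.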

                  • olivierlambert Vates 🪐 Co-Founder CEO

                    Yes that's very likely.

                    • nikade Top contributor @andrewperry

                      @andrewperry Yeah, try setting it to 16/16 GiB instead, it will probably do some magic 🙂
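
If you prefer the command line to the Advanced tab, the same change can probably be made with the `xe` CLI. The sketch below only builds and prints the command; it assumes `xe vm-memory-dynamic-range-set` with `uuid`, `min` and `max` parameters given in bytes, and the UUID is a placeholder to be replaced with your own VM's. Check it against your host's `xe help` before running anything.

```python
import shlex
from typing import List

GIB = 1024 ** 3


def dynamic_range_cmd(vm_uuid: str, dyn_min_bytes: int, dyn_max_bytes: int) -> List[str]:
    """Build (but do not run) an xe command that sets the dynamic memory range."""
    if dyn_min_bytes > dyn_max_bytes:
        raise ValueError("dynamic min must not exceed dynamic max")
    return [
        "xe", "vm-memory-dynamic-range-set",
        f"uuid={vm_uuid}",          # placeholder UUID, replace with the real one
        f"min={dyn_min_bytes}",
        f"max={dyn_max_bytes}",
    ]


if __name__ == "__main__":
    # 16/16 GiB: dynamic min equals dynamic max, so the VM is not shrunk.
    cmd = dynamic_range_cmd("VM-UUID-PLACEHOLDER", 16 * GIB, 16 * GIB)
    print(shlex.join(cmd))
```

Printing the command instead of executing it keeps the sketch safe to run anywhere; paste the output into a shell on the pool master once you have checked it.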

                      • robyt @olivierlambert

                        @olivierlambert Hi, today I upgraded my host.
                        The big VM froze for ~7 minutes. It is a big VM (96 GB of RAM and 32 CPUs), but 7 minutes is a very long time (for the customer!).
                        I've set 96/06 in dynamic: is that a normal time?

                        • olivierlambert Vates 🪐 Co-Founder CEO

                          Can you provide more details? 96/96 in dynamic? Just doing live migration or suspending the VM?

                          • robyt @olivierlambert

                            @olivierlambert Live migration. The VM is very important (today, during the Christmas holiday, I received some phone calls about the 7 minutes of freeze).
                            [image: 17407a92-730e-4e68-885f-44a4141e863d-immagine.png]

                            • olivierlambert Vates 🪐 Co-Founder CEO

                              The topology looks insane 🤔 Also, a live migration shouldn't make the VM inaccessible for more than a few seconds, unless a LOT of memory pages are changing, at a pace that is close to the transfer speed.
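
To illustrate the point about memory changing at close to the transfer speed, here is a toy model of iterative pre-copy migration. It is a simplified sketch, not the actual Xen or XCP-ng logic: the stop threshold, the round limit and every rate below are hypothetical values chosen for illustration.

```python
from typing import Tuple


def precopy_rounds(
    ram_gib: float,
    link_gib_per_s: float,
    dirty_gib_per_s: float,
    stop_threshold_gib: float = 0.25,   # hypothetical "small enough to pause" size
    max_rounds: int = 30,               # hypothetical cap on copy rounds
) -> Tuple[int, float, float]:
    """Return (rounds, remaining_gib, final_pause_s) for a toy pre-copy loop.

    Each round sends what is still outstanding while the guest keeps running
    and dirties more pages; the final pause is the time to send the remainder
    with the VM stopped.
    """
    remaining = ram_gib
    rounds = 0
    while remaining > stop_threshold_gib and rounds < max_rounds:
        copy_time = remaining / link_gib_per_s
        # Pages dirtied during this round must be sent again (capped at all RAM).
        remaining = min(ram_gib, dirty_gib_per_s * copy_time)
        rounds += 1
    return rounds, remaining, remaining / link_gib_per_s


if __name__ == "__main__":
    for dirty in (0.1, 0.9, 1.0):
        rounds, left, pause = precopy_rounds(16, 1.0, dirty)
        print(f"dirty={dirty} GiB/s: {rounds} rounds, {left:.2f} GiB left, pause {pause:.2f} s")
```

In this model the outstanding data shrinks by the ratio of dirty rate to link rate each round. When that ratio approaches 1 the loop stops converging and the whole remainder has to be copied while the VM is paused, which is the long freeze described in this thread.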

                              • robyt @olivierlambert

                                @olivierlambert Oops... what is the best topology, then?

                                • olivierlambert Vates 🪐 Co-Founder CEO

                                  IIRC, just remove it (the small cross) and it should fall back to a default.
