XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    After installing updates: 0 bytes free, Control domain memory = 0B

    Scheduled Pinned Locked Moved XCP-ng
    92 Posts 7 Posters 28.8k Views 5 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • D Offline
      Dataslak @nikade
      last edited by

      @nikade Hello nikade. Thank you for pitching in.
      1874d12c-68d5-4e12-ad6f-a70f250101af-image.png

      nikadeN 1 Reply Last reply Reply Quote 0
      • nikadeN Offline
        nikade Top contributor @Dataslak
        last edited by

        @Dataslak said in After installing updates: 0 bytes free, Control domain memory = 0B:

        @nikade Hello nikade. Thank you for pitching in.
        1874d12c-68d5-4e12-ad6f-a70f250101af-image.png

        Thats really weird, whats the output of "ps aux|grep qemu" ?

        D 1 Reply Last reply Reply Quote 0
        • stormiS Offline
          stormi Vates πŸͺ XCP-ng Team @Dataslak
          last edited by

          @Dataslak The easiest for us is if you install XOA (you can without a license), open a support ticket (you don't need a support subscription to open one, to my best knowledge. We check the status once the ticket is open). Then open a support tunnel from XOA, which will allow us to connect.

          Or provide me with another way to SSH to your host, in private chat. No teamviewer :D. Real, actual, SSH.

          D 1 Reply Last reply Reply Quote 0
          • D Offline
            Dataslak @stormi
            last edited by

            @stormi
            I get this message on all afflicted hosts when trying to 832eb224-d6f1-40b8-af81-07ae5c7b5607-image.png

            5c04bb0e-81a2-4aae-bb16-7e733a94bb67-image.png

            I have an alternative; Will PM you about this.

            stormiS 1 Reply Last reply Reply Quote 0
            • stormiS Offline
              stormi Vates πŸͺ XCP-ng Team @Dataslak
              last edited by

              @Dataslak Yes, I didn't mention it, but it was obvious to me that you couldn't deploy a new VM on a host which can't start VMs :D. I thought you'd use the 8.2.1 one.

              D 1 Reply Last reply Reply Quote 0
              • D Offline
                Dataslak @nikade
                last edited by Dataslak

                @nikade asked : "Thats really weird, whats the output of "ps aux|grep qemu" ?"
                adbe2eb2-a2b2-4793-b8e9-e0247e270fa4-image.png

                nikadeN 1 Reply Last reply Reply Quote 0
                • D Offline
                  Dataslak @stormi
                  last edited by Dataslak

                  @stormi Forgive me for not keeping up πŸ˜„ I will strive to do better.
                  I meant to say that the hosts with the failed updates all struggle to install XOA. And as you correctly say: There's no point since they can not start VMs. Install on the working 8.2.1. This I managed to do, and Stormi connected to it but could not get in touch with the 8.3 server from there.
                  Stormi has now found another route and can SSH directly to both servers outside XOA.
                  I am super exited to hear what he discovers, and to know if he can save one VM on the 8.3 server (005) that my backup has failed to keep safe, and whether he can get the remaining 8.3 servers up and running normally again.

                  stormiS 1 Reply Last reply Reply Quote 0
                  • stormiS Offline
                    stormi Vates πŸͺ XCP-ng Team @Dataslak
                    last edited by

                    @Dataslak I haven't looked yet, but as long as you don't overwrite the disks, this VM is actually safe and can be recovered.

                    D 1 Reply Last reply Reply Quote 0
                    • D Offline
                      Dataslak @stormi
                      last edited by

                      @stormi
                      I am very relieved!
                      The 8.3 host (005) does not seem to want to become enabled. How can I then migrate the VM? Is it as easy as packing up the VM files in an archive through SSH and upload to a different working host?
                      Will I then meet problems trying to run a 8.3 VM on 8.2.1 ? Will this require XOA with more features (warm migration was mentioned above) unlocked through a subscription?

                      1 Reply Last reply Reply Quote 0
                      • nikadeN Offline
                        nikade Top contributor @Dataslak
                        last edited by

                        @Dataslak said in After installing updates: 0 bytes free, Control domain memory = 0B:

                        @nikade asked : "Thats really weird, whats the output of "ps aux|grep qemu" ?"
                        adbe2eb2-a2b2-4793-b8e9-e0247e270fa4-image.png

                        Im baffled, never seen this kind of issues before.
                        Are you sure the virtualization is enabled in BIOS?

                        D 1 Reply Last reply Reply Quote 0
                        • D Offline
                          Dataslak @nikade
                          last edited by Dataslak

                          @nikade Thank you for your help! I deeply appreciate your insights as well as @stormi .

                          I have six servers which have run 8.3 for 1/2 year, with 3 and 40 VMs on each working very nicely. Can I therefore assume that BIOS settings have been correct? Or will a recent update to XCP-ng 8.3 have triggered the need for additional BIOS settings to be made? Or a newer BIOS?

                          This is an AMD Ryzen 9 CPU running on B650 chipset motherboard. I have verified in BIOS that :

                          SVM Mode = Enabled (Enable/disable CPU Virtualization)
                          SR-IOV Support = Disabled
                          IOMMU = Disabled
                          I have not found the HAP setting yet. Don't know if it is called something else. When I google it I do not get any hits indicating that there's even a setting called "HAP" or "Hardware assistedΒ paging"?

                          I have found these four BIOS-settings to be of matter to XCP-ng in forums/troubleshooting docs I've found. Are there any other BIOS settings I have overlooked and should be mindful of?

                          The BIOS is more than 6 months old; I will try to update BIOS on one of the hosts to see if this makes any change to the problem.

                          Edit:
                          I have now updated the BIOS.
                          XOA still reports:
                          92e97566-9293-4409-8e46-e058c808ea42-image.png

                          The "Hardware-assisted virtualization is not enabled on this host" message appeared after installing these updates:
                          https://www.dropbox.com/scl/fi/fqh6gjc24j6fuvrpe9tdr/yum_history_info.txt?rlkey=cqbho9lm9pdbpds0dn409zub0&dl=1

                          1 Reply Last reply Reply Quote 0
                          • olivierlambertO Online
                            olivierlambert Vates πŸͺ Co-Founder CEO
                            last edited by

                            Enable IOMMU too

                            D 1 Reply Last reply Reply Quote 0
                            • D Offline
                              Dataslak @olivierlambert
                              last edited by Dataslak

                              @olivierlambert Thank you for your input. πŸ™
                              IOMMU changed from Auto to Enabled πŸ‘

                              Edit:
                              After the host comes up again XO still complains " Hardware-assisted virtualization is not enabled on this host".

                              Is there anything else I can do to debug this issue? On second thought: @stormi is on the case, so I should not do anything unless he tells me to.

                              1 Reply Last reply Reply Quote 0
                              • olivierlambertO Online
                                olivierlambert Vates πŸͺ Co-Founder CEO
                                last edited by

                                Is it better now?

                                D 1 Reply Last reply Reply Quote 0
                                • D Offline
                                  Dataslak @olivierlambert
                                  last edited by Dataslak

                                  @olivierlambert
                                  I can not see any change to the problem.
                                  The host still starts up in maintenance mode. Memory reports no space left despite no VMs running. Not even the host barebone OS itself is shown on the empty memory bar under the "General" tab.

                                  82d7f004-b3fb-48ba-a890-ed38e9eddba2-image.png

                                  0882f07a-3d06-4012-9fff-f72955f62eab-image.png

                                  8a1425c5-a916-45dd-a621-cb28bdac79b6-image.png

                                  3a8c9ebf-15b4-4645-a8c3-fbad945ce85a-image.png

                                  When trying to start a VM:

                                  46d96823-2db4-4f85-9a99-eddd35287a20-image.png

                                  8849a45d-c30f-423d-8663-a4286a4ea405-image.png

                                  The full message text:

                                  -1(list index out of range, , Traceback (most recent call last):
                                  File "/etc/xapi.d/plugins/xcpngutils/init.py", line 119, in wrapper
                                  return func(*args, **kwds)
                                  File "/etc/xapi.d/plugins/hyperthreading.py", line 17, in get_hyperthreading
                                  return json.dumps(int(lines[0]) > 1)
                                  IndexError: list index out of range

                                  When I google "AMD HyperThreading" (since this is not an Intel-system but AMD): Google says "Hyper-threading is the Intel technology name, and commonly known name, for the Computing concept known as simultaneous multithreading. AMD have an equivalent technology known as Cluster Multi-threading."
                                  This is enabled by default in BIOS (and I reset BIOS to defaults before upgrading BIOS just an hour ago).

                                  1 Reply Last reply Reply Quote 0
                                  • olivierlambertO Online
                                    olivierlambert Vates πŸͺ Co-Founder CEO
                                    last edited by

                                    What's the output of xl dmesg

                                    D 1 Reply Last reply Reply Quote 0
                                    • D Offline
                                      Dataslak @olivierlambert
                                      last edited by Dataslak

                                      @olivierlambert
                                      6222dd33-8819-441c-b834-bcc39e3d6c87-image.png

                                      1 Reply Last reply Reply Quote 0
                                      • olivierlambertO Online
                                        olivierlambert Vates πŸͺ Co-Founder CEO
                                        last edited by

                                        You have an issue: I'm pretty sure you are using Xen 4.13 while using the 4.17 tooling around, causing this problem. So the question is "why are you booting on 4.13?". Double check your Grub menu when you boot, to see that you actually boots on Xen 4.17

                                        D nikadeN 2 Replies Last reply Reply Quote 1
                                        • D Offline
                                          Dataslak @olivierlambert
                                          last edited by Dataslak

                                          @olivierlambert
                                          https://xcp-ng.org/forum/assets/uploads/files/1714683479351-b73237cc-8f27-4789-a9d3-5e322c2cd244-image.png

                                          Answer to your question : I don't know πŸ™‚
                                          Six servers have been running fine for months. Then i accepted XO's request to install these updates and the problem manifested on all six.
                                          I did nothing to XCP-ng for months before this happened. Did nothing other than reboot one host after, which then made me realize that something was seriously wrong since no VMs came up. I then also saw the other problems shown in screen clippings earlier in this chat.
                                          Luckily the VMs continued running on the other hosts. I am very happy I only rebooted one first. Or all VMs would have been down for days.

                                          Lacking in mental capacity to understand what has happened I am therefore hoping you and your eminent staff can "Sherlock Holmes" this :-).
                                          How can the abovementioned updates have triggered it?
                                          Can it happen to others?

                                          I may have waited too long between checking if updates were waiting to be installed. Could this have caused the problem? Were they installed in incorrect order?

                                          I can not find Xen 4.17 in the boot menu. How do I fix this? Should I create a new 8.3 installation media and try to upgrade the system? Can I do this safely without damaging VMs?

                                          1 Reply Last reply Reply Quote 0
                                          • olivierlambertO Online
                                            olivierlambert Vates πŸͺ Co-Founder CEO
                                            last edited by olivierlambert

                                            Well, first, you are on XCP-ng 8.3, right? It's still a beta, so by definition this could happen (especially a jump like this from Xen 4.13 to 4.17). If you want guarantee stability, you have to stick to our LTS version.

                                            Anyway, to know which Xen version you are running, you can do a: cat /sys/hypervisor/version/minor. If it's 17, then you are on Xen 4.17 and that's your issue. I don't know how this could happen in the first place, like if the Grub file was never updated or tinkered before.

                                            To check a bit more, when on Grub, press "e" key to edit the first entry and make a picture to see what's displayed there. I'm thinking on a way to regen Grub that will take the right Xen version into account (ping @stormi )

                                            D 1 Reply Last reply Reply Quote 1
                                            • First post
                                              Last post