XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    After installing updates: 0 bytes free, Control domain memory = 0B

    Scheduled Pinned Locked Moved XCP-ng
    92 Posts 7 Posters 28.8k Views 5 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • olivierlambertO Offline
      olivierlambert Vates 🪐 Co-Founder CEO
      last edited by

      So now we know the problem, the next step will be to solve it 🙂

      Let's check what's going on in /boot with a ls -la /boot

      D 1 Reply Last reply Reply Quote 1
      • nikadeN Offline
        nikade Top contributor @olivierlambert
        last edited by

        @olivierlambert said in After installing updates: 0 bytes free, Control domain memory = 0B:

        You have an issue: I'm pretty sure you are using Xen 4.13 while using the 4.17 tooling around, causing this problem. So the question is "why are you booting on 4.13?". Double check your Grub menu when you boot, to see that you actually boots on Xen 4.17

        Nice catch, this is what community is all about 🙂

        1 Reply Last reply Reply Quote 0
        • D Offline
          Dataslak @olivierlambert
          last edited by

          @olivierlambert
          I love and respect your attitude 🙂
          f956c2c2-c3dd-4ae4-9204-b61fbdae202f-image.png

          Agree wholeheartedly with @nikade 🙂

          yannY 1 Reply Last reply Reply Quote 0
          • olivierlambertO Offline
            olivierlambert Vates 🪐 Co-Founder CEO
            last edited by

            So /boot/xen.gz is pointing to xen-4.17.3-4.gz, which sounds correct. So why you are still running on Xen 4.13? 🤔 It's like you did not reboot, but since you showed me the Grub menu, I'm assuming you already did 🤔

            It would be interesting to compare the existing xen file and see if it's the right one from our repo. Something is fishy here 🤔

            1 Reply Last reply Reply Quote 0
            • olivierlambertO Offline
              olivierlambert Vates 🪐 Co-Founder CEO
              last edited by

              Can you md5sum xen-4.17.3-4.gz

              From the mirror & RPM, I have f011721be0c7b57563e29ed282558da3

              D 1 Reply Last reply Reply Quote 0
              • olivierlambertO Offline
                olivierlambert Vates 🪐 Co-Founder CEO
                last edited by

                Adding @yann in the loop in case I'm missing something obvious

                1 Reply Last reply Reply Quote 0
                • yannY Offline
                  yann Vates 🪐 XCP-ng Team @Dataslak
                  last edited by

                  @Dataslak what does lsblk -o name,mountpoint,label,size,uuid show?

                  D 1 Reply Last reply Reply Quote 0
                  • D Offline
                    Dataslak @olivierlambert
                    last edited by

                    @olivierlambert
                    c08e011f-a210-471f-87c9-ed421595dac5-image.png

                    1 Reply Last reply Reply Quote 0
                    • D Offline
                      Dataslak @yann
                      last edited by

                      @yann
                      Hello Yann, thank you for pitching in.
                      7c30db11-59d4-4898-a19b-83a933e8f9ee-image.png

                      yannY 1 Reply Last reply Reply Quote 0
                      • yannY Offline
                        yann Vates 🪐 XCP-ng Team @Dataslak
                        last edited by

                        @Dataslak can you please request a commandline from GRUB (hit c on the boot menu), and issue the following commands:

                        echo $root
                        search --label --set root root-eqjpzg
                        echo $root
                        
                        D 1 Reply Last reply Reply Quote 1
                        • olivierlambertO Offline
                          olivierlambert Vates 🪐 Co-Founder CEO
                          last edited by olivierlambert

                          Also a cat /proc/mdstat in the Dom0 would help.

                          D 1 Reply Last reply Reply Quote 0
                          • D Offline
                            Dataslak @yann
                            last edited by Dataslak

                            @yann
                            edbc1bca-dc7d-409a-85f4-9080ab4993df-image.png

                            Info: I am mirroring two M.2 SSDs ! Software RAID established by the installation routine of v8.3.
                            Could the mirror be broken and cause this somehow?

                            yannY 1 Reply Last reply Reply Quote 0
                            • D Offline
                              Dataslak @olivierlambert
                              last edited by Dataslak

                              @olivierlambert said in After installing updates: 0 bytes free, Control domain memory = 0B:

                              Also a cat /proc/mdstat in the Dom0 would help.

                              Please forgive my ignorance: How do I execute this command in Dom0 ?

                              I've read https://wiki.xenproject.org/wiki/Dom0 and it helped a little. Do I run the command in the console within XOA?

                              olivierlambertO 1 Reply Last reply Reply Quote 1
                              • yannY Offline
                                yann Vates 🪐 XCP-ng Team @Dataslak
                                last edited by

                                @Dataslak so it is choosing to "boot from the 1st disk of the raid1", we could try to tell him to boot from the 2nd one:

                                • on the grub menu hit e to edit the boot commands
                                • replace that search ... line with set root=hd1,gpt1
                                • then hit Ctrl-x to boot
                                D 1 Reply Last reply Reply Quote 0
                                • olivierlambertO Offline
                                  olivierlambert Vates 🪐 Co-Founder CEO @Dataslak
                                  last edited by

                                  @Dataslak Dom0 is "the host" (if you think it's the host it's not really but anyway), ie the machine you are connected to and showing results since the start 🙂

                                  D 1 Reply Last reply Reply Quote 0
                                  • D Offline
                                    Dataslak @yann
                                    last edited by Dataslak

                                    @yann
                                    Wohoo!!
                                    All VMs came up!
                                    Host is not in maintenance mode.
                                    Control domain memory = 12GiB
                                    Stats are back
                                    Etc....

                                    813787ec-7c49-41e1-8a59-af2fda8648bc-image.png

                                    1867f534-16d5-4da6-8418-17d61d664857-image.png

                                    As far as I can see (which is limited) everything looks good?

                                    How can I see the status of the RAID1 and see if the mirror is intact ?

                                    1 Reply Last reply Reply Quote 0
                                    • D Offline
                                      Dataslak @olivierlambert
                                      last edited by Dataslak

                                      @olivierlambert
                                      Thank you for explaining to me. I will look more into details when (if) I find time 😄

                                      fcb72f47-cdc2-46a6-b84e-b532b9089d14-image.png

                                      Ah - I see you were ahead of me !

                                      How can I interpret this? Raid1 OK? Synched? Ready to deal with a single drive failure?

                                      How will XO inform me if one of the drives fails? Will I have to scour through logs, or will there be a clear visible notice in the interface?

                                      1 Reply Last reply Reply Quote 0
                                      • olivierlambertO Offline
                                        olivierlambert Vates 🪐 Co-Founder CEO
                                        last edited by olivierlambert

                                        That's the problem. Your RAID1 lost the sync. And so it continued to boot on the disk out of sync, loading the old Xen from the boot while the rest (root partition) was up to date.

                                        D 1 Reply Last reply Reply Quote 0
                                        • D Offline
                                          Dataslak @olivierlambert
                                          last edited by Dataslak

                                          @olivierlambert
                                          Since this happened on six servers simultaneously when applying updates through XO I guess we may have found an error ?

                                          If so then all of this was not in vain, and I can be happy to have made a tiny tiny contribution to the development of 8.3 ?

                                          Will the modification of the Grub boot loader be safe to apply to all remaining 5 servers? Or should I do some verification on each before applying it?

                                          Is the modification of Grub what I will have to do if a drive fails? Change that one line from set root=hd1,gpt1 to set root=hd0,gpt1 or something?

                                          stormiS 1 Reply Last reply Reply Quote 0
                                          • olivierlambertO Offline
                                            olivierlambert Vates 🪐 Co-Founder CEO
                                            last edited by olivierlambert

                                            I don't know yet, but you lost one drive. Can you run xe host-call-plugin host-uuid=<uuid> plugin=raid.py fn=check_raid_pool? (replace with the UUID of the host)

                                            edit: check that on all your other hosts

                                            D 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post