XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    CBT: the thread to centralize your feedback

    Scheduled Pinned Locked Moved Backup
    439 Posts 37 Posters 386.5k Views 29 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • F Offline
      flakpyro @florent
      last edited by

      @florent Testing a storage migration i do see CBT get disabled and reset during the process which is expected! I do notice it leaves the .cbtlog file on the old SR after the storage migration is complete but that's easy enough to clean up manually.

      The issue i posted above however is just a VM migration from host to host on a shared NFS SR, the SR the VM is on is not changing.

      1 Reply Last reply Reply Quote 0
      • R Offline
        Rhodderz
        last edited by

        We appear to have a similar issue to @flakpyro
        We dont have NFS storage but using iSCSI from Dell SC5020's
        We had backups with NBD and CBT enabled
        We updated one of our pools to the latest (stable branch) yesterday to try and get rid of the iSCSI disconnecting bug, which meant all the vms where shuffled around and migrated.
        This morning majority of the vms failed the backup with "can't create a stream from a metadata VDI, fall back to a base"
        Quick searching brought me here and following what flak did i found one of the cbtlogs for one of the failed vms is also zero'd as shown below:

        [09:40 xcp101 VG_XenStorage-6c2ec0ce-01ba-6975-741c-e2e86bc45e21]# cbt-util get -c -n cc2f2443-eb13-4eeb-951b-5faa3c7b8c55.cbtlog
        00000000-0000-0000-0000-000000000000
        

        We have an enterprise support with a ticket already open about NBD being slow (was on 1 NBD Connections) with a support tunnel open which I will update as well.
        Hopefully that gives you another point of reference to check from.

        Is it possible to force a clean fresh start for the backups similar to Veeam "Active Full"?

        ForzaF R 2 Replies Last reply Reply Quote 0
        • ForzaF Offline
          Forza @Rhodderz
          last edited by

          @Rhodderz said in CBT: the thread to centralize your feedback:

          Is it possible to force a clean fresh start for the backups similar to Veeam "Active Full"?

          Perhaps delete the snapshots for each vm. When backup job starts, it should be a 'full' backup.

          R 1 Reply Last reply Reply Quote 0
          • R Offline
            rtjdamen @Rhodderz
            last edited by

            @Rhodderz are u also on 8.3?

            R 1 Reply Last reply Reply Quote 0
            • R Offline
              Rhodderz @rtjdamen
              last edited by Rhodderz

              @rtjdamen Having a look i assumed we where on 8.3 as we updated yesterday and there is no available patches, but on 8.2.1

              NAME="XCP-ng"
              VERSION="8.2.1"
              ID="xenenterprise"
              ID_LIKE="centos rhel fedora"
              VERSION_ID="8.2.1"
              PRETTY_NAME="XCP-ng 8.2.1"

              release/yangtze/master/58

              Apologies forgot to check that and (wrongfully) assumed

              R 1 Reply Last reply Reply Quote 0
              • R Offline
                Rhodderz @Forza
                last edited by

                @Forza Tested this on a VM and seems i still get the same error sadly.

                1 Reply Last reply Reply Quote 0
                • R Offline
                  rtjdamen @Rhodderz
                  last edited by

                  @Rhodderz ok so it’s not only related to v8.3 as we were assuming. Or somerhing else is going on in your env. What happens if u use normal backup without cbt?

                  R 1 Reply Last reply Reply Quote 0
                  • R Offline
                    Rhodderz @rtjdamen
                    last edited by Rhodderz

                    @rtjdamen Just trying that now
                    However it seems if i disable CBT on the vm, the backup (trying a new backup job for this testing) just re-enabled it.
                    Seems based on the job i can have NBD+CBT or neither.
                    Annoyingly we would like NBD to run to speed up backups as they take quite some time.

                    EDIT:
                    To add, the new test backup for the vm that failed before actually finished successfully
                    Just manually rrerunning it on the main job now
                    If it works there, the temproary workaround could be to just disable CBT and let the backup job re-enable it.

                    EDIT EDIT:
                    Re-running the backup on the vm in the original job still failed with the same error
                    Testing with the new job and making it the same with NBD connection set to 2, purge snapshots after, still passes fine
                    So i am guessing CBT is job dependant and no vm dependant?
                    Which would explain why a new job on the same VM to the same place works fine?

                    F 1 Reply Last reply Reply Quote 0
                    • F Offline
                      flakpyro @Rhodderz
                      last edited by

                      For our production pool i have CBT + NBD enabled but i have "Purge snapshot data when using CBT" disabled. The results in successful backups but the snapshot is retained. I assume it then ends up using that snapshot for the following delta backups.

                      R 1 Reply Last reply Reply Quote 0
                      • R Offline
                        Rhodderz @flakpyro
                        last edited by

                        @flakpyro ah I will try that once proxy for that pool is back
                        We upgrade XOA from stable channel to latest as we had another issue which is apparently resolved in that with NBD (causing some machines to go RO)
                        Once thats fixed I will try again to see if the above update and/or disabling "Purge snapshot" works as a workaround.

                        We have purge enabled (and would like it left enabled) as we use iSCSI (Dell SC5020's) so everything is a little fat, especially with some clients.
                        I shal update tommorow on what happens.

                        F 1 Reply Last reply Reply Quote 0
                        • F Offline
                          flakpyro @Rhodderz
                          last edited by

                          @Rhodderz I agree we are using NFS so snapshots are thin at least but we would love to be able to delete the snapshots after a backup run as well. Hopefully in time we can get this working!

                          1 Reply Last reply Reply Quote 0
                          • R Offline
                            Rhodderz
                            last edited by

                            To add an update and to not leave on a cliff hanger.
                            We have since updated our XOA to the latest channel to attempt to fix an NBD issue.
                            This move broke a proxy of ours, but also all the backups are going through the XOA and after this the backups have not had an issue since.
                            So either the new NBD fixes, it being only on an XOA or something somehwere else resolved this problem for now.

                            We will be enabling the same in our other pool soon so will update if we have the same issues there.

                            F 1 Reply Last reply Reply Quote 0
                            • F Offline
                              flakpyro @Rhodderz
                              last edited by

                              Sadly the latest XOA release from today does not resolve my strange CBT issue,

                              [08:32 xcpng-test-01 45e457aa-16f8-41e0-d03d-8201e69638be]#  cbt-util get -c -n 4d7f0341-bbce-4957-a4c4-d603725a807a.cbtlog 
                              1950d6a3-c6a9-4b0c-b79f-068dd44479cc
                              After Migration from Host 01 to Host 02 (Shared NFS SR):
                              [08:33 xcpng-test-01 45e457aa-16f8-41e0-d03d-8201e69638be]#  cbt-util get -c -n 4d7f0341-bbce-4957-a4c4-d603725a807a.cbtlog 
                              00000000-0000-0000-0000-000000000000
                              
                              1 Reply Last reply Reply Quote 0
                              • olivierlambertO Offline
                                olivierlambert Vates πŸͺ Co-Founder CEO
                                last edited by

                                I don't think that's an XO issue, but more something weird in your XCP-ng setup that nobody can reproduce 😒 (but it doesn't mean we couldn't solve it)

                                F 1 Reply Last reply Reply Quote 0
                                • F Offline
                                  flakpyro @olivierlambert
                                  last edited by

                                  @olivierlambert Hmm im really not sure whats unique about my two pools. One is AMD + TrueNAS the other Intel + Pure Storage. If this is actually unique to me only perhaps i would be better off submitting a ticket to help get to the bottom of this?

                                  1 Reply Last reply Reply Quote 0
                                  • olivierlambertO Offline
                                    olivierlambert Vates πŸͺ Co-Founder CEO
                                    last edited by

                                    You manage to find a CBT issue without using any XO command, which is great because we know it's not XO now πŸ˜„ I think @dthenot is already taking a look internally.

                                    dthenotD 1 Reply Last reply Reply Quote 1
                                    • dthenotD Offline
                                      dthenot Vates πŸͺ XCP-ng Team @olivierlambert
                                      last edited by

                                      @olivierlambert I am πŸ™‚

                                      F 1 Reply Last reply Reply Quote 2
                                      • F Offline
                                        flakpyro @dthenot
                                        last edited by

                                        @dthenot @olivierlambert thanks guys ill hold off on submitting a ticket for now to keep the conversation centralized here but if you need any more info, would like me to try anything or would like a remote support tunnel opened just let me know! πŸ™‚

                                        1 Reply Last reply Reply Quote 2
                                        • Tristis OrisT Offline
                                          Tristis Oris Top contributor
                                          last edited by

                                          can't run live migration to another pool because VDI_CBT_ENABLED. is it intended?

                                          Tristis OrisT R 2 Replies Last reply Reply Quote 0
                                          • Tristis OrisT Offline
                                            Tristis Oris Top contributor @Tristis Oris
                                            last edited by

                                            @Tristis-Oris even halted VMs can't migrate with snapshot. need to remove it.

                                            ForzaF 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post