XCP-ng

    Backup Fail: Trying to add data in unsupported state

      Delgado

      I can confirm I am seeing the same thing with a full mirror backup job to Backblaze B2 using encryption. I am on master, commit 732ca, in my homelab. I have attached the backup log as well: 2024-08-28T22_32_37.271Z - backup NG.json.txt

        olivierlambert Vates 🪐 Co-Founder CEO

        Thanks for the feedback, let me ping @florent internally

          olivierlambert Vates 🪐 Co-Founder CEO @nmadunich

          @nmadunich and same on XOA stable?

            florent Vates 🪐 XO Team

            Hi,

            What is the size of the failing VMs? Is there anything in the syslog before the cipher error message?

            To be fair, uploading a full backup (> 50 GB) without knowing its size first is full of hurdles, and XAPI doesn't report the size of exports. Incremental backups with block storage completely circumvent this by uploading separate blocks of known size.

            Regards
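
Editor's sketch of why an unknown total size is awkward for S3-style uploads. This assumes the hurdle is the multipart limit that Amazon S3 documents (at most 10,000 parts, each at least 5 MiB except the last); Backblaze B2 and Wasabi may use different limits, and nothing here is taken from Xen Orchestra's code. The function names are hypothetical.

```python
MIB = 1024 ** 2
GIB = 1024 ** 3

# Limits documented for Amazon S3 multipart uploads. Assumption: the
# S3-compatible remote in use enforces the same values.
MAX_PARTS = 10_000
MIN_PART_SIZE = 5 * MIB


def min_part_size(total_bytes: int) -> int:
    """Smallest part size (bytes) that fits total_bytes into MAX_PARTS parts."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    needed = -(-total_bytes // MAX_PARTS)  # integer ceiling division
    return max(MIN_PART_SIZE, needed)


def max_object_size(part_size: int) -> int:
    """Largest object that can be uploaded with a fixed part size."""
    return part_size * MAX_PARTS


if __name__ == "__main__":
    # With the minimum part size, the ceiling is 50,000 MiB (about 48.8 GiB).
    print(max_object_size(MIN_PART_SIZE) / GIB)
    # A 500 GiB stream needs parts of roughly 51.2 MiB, chosen before upload starts.
    print(min_part_size(500 * GIB) / MIB)
```

The part size has to be fixed when the upload starts, so a stream of unknown length forces a guess: too small and the part count runs out, too large and every part costs more memory. Uploading separate blocks of known size avoids the guess entirely.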

              nmadunich

              @olivierlambert Yes, I get the same result on both the stable and latest channels.

              @florent The ones that fail do seem to be some of my larger VMs. The Windows 10 VM that I have been testing with has about 88.3 GB used according to the OS.

              All of my VMs are thin provisioned and our NetApp storage uses de-duplication, so the VHD on my storage is significantly smaller; in this case it was about 3 GB.

              As a test, I created a new storage volume without thin provisioning and de-duplication. I migrated the disk to the new volume and the VHD is 103 GB. I also removed de-duplication and compression on my remote. I tried the backup again and it failed with the same error.

              I do see some errors from the xensource.log around the time it fails and I attached those here.

              xensource errors.txt

              I am editing my post after looking at the log file @Delgado posted; mine are slightly different, so I added mine for comparison. At some point during my testing the error also changed slightly and started stating "VDI must be free or attached to exactly one VM". It appears that after a failed backup the snapshots are not being cleaned up.

              2024-08-29T21_32_42.161Z - backup NG.json.txt

              2024-08-28T15_05_12.613Z - backup NG.json.txt

                Delgado

                Hello,

                My VMs are about 150 GB each. I was using compression when I backed up the VM to the remote before mirroring it to the S3 bucket. I did end up changing to delta backups and the error did go away, but I can create another normal backup and mirror it to the bucket again to see if I get the same results.

                  olivierlambert Vates 🪐 Co-Founder CEO

                  That's interesting 🤔 So it's only with full backup (XVAs) then.

                    nvoss @olivierlambert

                    @olivierlambert Yeah, my experience is also that deltas run without error, though what exactly they're backing up without a full in the remote is pretty questionable. I assume it's a delta off of the snapshot full, where the snapshot is completed without issue and it's just the copy to the encrypted remote that's failing.

                    These are definitely my larger VMs: more than 100 GB total disk.

                      daniel.grimm

                      Hi,

                      Same problem here.
                      It's an encrypted S3 remote to Backblaze.
                      Full mirror backup with selected VMs.
                      A small VM like Xen Orchestra works. As soon as a large VM is added (approx. 500 GB), the error occurs after about 3 hours.
                      Tried several times.

                      Xen Orchestra built from source.

                        olivierlambert Vates 🪐 Co-Founder CEO

                        And with or without backup compression?

                          daniel.grimm

                          Sorry, I forgot: with zstd compression.

                            olivierlambert Vates 🪐 Co-Founder CEO

                            Can you try without it and report?

                              daniel.grimm

                              Yes.

                              I am making an uncompressed backup now, and then I will try to mirror it with the mirroring job.

                              Report follows, but the uncompressed backup and upload will need some time.

                                daniel.grimm

                                So, same error after 3 hours of uploading/mirroring an uncompressed backup to the encrypted Backblaze remote.

                                transfer
                                Start: 2024-09-17 07:27
                                End: 2024-09-17 10:33
                                Duration: 3 hours
                                Error: Trying to add data in unsupported state
                                
                                  olivierlambert Vates 🪐 Co-Founder CEO

                                  I have the feeling it might be related to Backblaze and a potential timeout or something 🤔

                                    daniel.grimm

                                    Before this error, I had the following error:

                                    transfer
                                    Start: 2024-09-11 15:14
                                    End: 2024-09-11 16:07
                                    Duration: an hour
                                    Error: no tomes available
                                    Type: full

                                    I was able to fix this by giving the Xen Orchestra VM more RAM.
                                    I thought these errors were triggered by some kind of timeout.

                                    When the current error first occurred, I doubled the RAM again. Unfortunately, that didn't help.

                                      olivierlambert Vates 🪐 Co-Founder CEO

                                      Error: no tomes available

                                      🤔 Never heard of this before.

                                        olivierlambert Vates 🪐 Co-Founder CEO

                                        It seems to come from Backblaze, e.g. https://github.com/mastodon/mastodon/issues/30030

                                        Sadly, I'm not sure this is something we are able to fix on our side 😕

                                        Linked issue: mastodon/mastodon #30030, "Backblaze B2 'no tomes available' and related errors" (open, created by cadusilva)

                                          olivierlambert Vates 🪐 Co-Founder CEO

                                          It might be related to Backblaze being overloaded at some point. Our advice:

                                          • reduce backup concurrency
                                          • reduce block concurrency during upload (writeBlockConcurrency) and merge (mergeBlockConcurrency) in the config.toml
                                            nvoss @olivierlambert

                                            @olivierlambert @florent

                                            Of note: we use Wasabi (S3-compatible) as the remote in one case and a Synology NAS as our local remote in the other. Both of those remotes fail with the unsupported state error when the backups are encrypted.

                                            In the same encrypted job I have the following machines which have a backup size and duration of:

                                            VM1 - 31.55GB - 47 mins
                                            VM2 - 14.51GB - 22 mins
                                            VM3 - 30.28GB - 48 mins
                                            VM4 - 45.33GB - 24 mins
                                            VM5 - FAIL - 1hr 27 min
                                            VM6 - 2.14GB - 4 mins
                                            VM7 - FAIL - 1hr 28 min
                                            VM8 - 35.95GB - 1hr 5 min

                                            The two machines erroring have thin provisioned disks whose sizes are
                                            VM5 -- 128GB and 100GB which are 10.94GB and 86MB on disk
                                            VM7 -- 123GB and 128GB which are 11.09GB and 10.3MB on disk

                                            At first I thought it was size related or perhaps duration. But what's causing that extra duration for machines of these sizes? Something about activity on the Windows VMs?

                                            Or perhaps that it was related to having multiple disks on Windows machines?
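
Editor's sketch: as a rough check on whether duration tracks size, the figures in this post can be turned into throughput. The Python below uses only the sizes and durations listed above. The two failed VMs are left out because no transferred size was reported for them, and the variable names are illustrative.

```python
# Backup size (GB) and duration (minutes) of the successful VMs,
# copied from the list in this post.
backups = {
    "VM1": (31.55, 47),
    "VM2": (14.51, 22),
    "VM3": (30.28, 48),
    "VM4": (45.33, 24),
    "VM6": (2.14, 4),
    "VM8": (35.95, 65),
}

# Throughput in GB per minute for each VM.
rates = {vm: size_gb / minutes for vm, (size_gb, minutes) in backups.items()}

fastest = max(rates, key=rates.get)
slowest = min(rates, key=rates.get)

for vm, rate in sorted(rates.items(), key=lambda item: item[1], reverse=True):
    print(f"{vm}: {rate:.2f} GB/min")
print(f"fastest: {fastest}, slowest: {slowest}")
```

On these numbers VM4 moves data at about three times the rate of the others, which cluster between roughly 0.5 and 0.7 GB per minute. That spread suggests duration is not explained by backup size alone, though it does not say what the cause is.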
