XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Backup timeout - hang or didn't work

    Scheduled Pinned Locked Moved Xen Orchestra
    36 Posts 5 Posters 2.9k Views 5 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Tristis OrisT Offline
      Tristis Oris Top contributor @Darkbeldin
      last edited by

      @Darkbeldin got it, so it not a problem and no need to do anything.

      just started it manually, still error. But shapshot always created, i dont understand why it say failed.

      DarkbeldinD 1 Reply Last reply Reply Quote 0
      • DarkbeldinD Offline
        Darkbeldin Vates 🪐 Pro Support Team @Tristis Oris
        last edited by

        @Tristis-Oris @florent will have to take a look at it because i'm not sure.

        1 Reply Last reply Reply Quote 0
        • florentF Offline
          florent Vates 🪐 XO Team @Tristis Oris
          last edited by florent

          @Tristis-Oris there is a missing block in the backup of 22:11 . Either a failed transfer or a failed merge that can't be resumed securily

          Is it the same backups with the error message when you restart it manually ?

          Tristis OrisT 1 Reply Last reply Reply Quote 0
          • Tristis OrisT Offline
            Tristis Oris Top contributor @florent
            last edited by

            @florent yes, same error. But im already removed old backup and start new chain.

            1 Reply Last reply Reply Quote 0
            • Tristis OrisT Offline
              Tristis Oris Top contributor
              last edited by

              @olivierlambert More and more vms got this bug btw. i think a rotation can be a cause.

              a backup log i see

              "message": "merge"
              "message": "ENOENT: no such file or directory, rename 
              
              "syscall": "rename",
               "message": "ENOENT: no such file or directory, rename 
              

              and that still a successful backup, but with fail status. So it can't do something after backup?
              After 30 days it should remove old snapshots. But i can't remove them for some vms. when try it - got same error.
              f091b7ba-9dad-4a54-9d1f-fb6c0a06ca14-изображение.png

              ENOENT: no such file or directory, open '/mnt/mih-5300-2/xo-vm-backups/2cac5777-cc11-117a-f107-5027fdf10950/20220608T020201Z.json'
              

              Have ~300 old snapshots and can't remove, for a long. Is it the same problem or not?

              commit ce78d

              florentF 1 Reply Last reply Reply Quote 0
              • olivierlambertO Offline
                olivierlambert Vates 🪐 Co-Founder CEO
                last edited by

                Adding @florent or @julien-f

                1 Reply Last reply Reply Quote 0
                • florentF Offline
                  florent Vates 🪐 XO Team @Tristis Oris
                  last edited by florent

                  @Tristis-Oris hi tristis , can you show us the full stack trace got the error on rename ?

                  can you make a healthcheck on the suspicious backups and see if it can restore ?

                  Tristis OrisT 1 Reply Last reply Reply Quote 0
                  • Tristis OrisT Offline
                    Tristis Oris Top contributor @florent
                    last edited by

                    @florent
                    backup, 4\4 VMs with fail status https://pastebin.com/JxqEp41Q
                    error on remove old snapshots https://pastebin.com/sukzKCe1
                    done healthchecks, all failed https://pastebin.com/AnXLJH64

                    Now waiting for restore a copy of one vm.

                    florentF 1 Reply Last reply Reply Quote 0
                    • florentF Offline
                      florent Vates 🪐 XO Team @Tristis Oris
                      last edited by

                      @Tristis-Oris hi,I see two problem :

                      • an older merge failed for all theses VM , and some block ae missing ( 2/49 for the fisrt one, 2/52 for the second one) . These backup are irrecuperable and should be deleted

                      • there is a desynchro between the cached metadata and the file system, you can delete them by running
                        find /mnt/mih-5300-2/xo-vm-backups/ -name "cache.json.gz" -type f -delete

                      Tristis OrisT 2 Replies Last reply Reply Quote 1
                      • Tristis OrisT Offline
                        Tristis Oris Top contributor @florent
                        last edited by Tristis Oris

                        @florent i understand that this backup maybe corrupted. But if some VMs failed first time only yesterday, that mean something continues to corrupt this procedure.

                        Also some VMs with that error sometimes do backup without errors.

                        And i already tried to remove the whole backup chain or do backup with another task - to force full backup creation. But anyway got this error.

                        • restored one this backups - VM run and work fine.
                        florentF 1 Reply Last reply Reply Quote 0
                        • florentF Offline
                          florent Vates 🪐 XO Team @Tristis Oris
                          last edited by

                          @Tristis-Oris have you got a XOA which we could use to access your system ?

                          Tristis OrisT 1 Reply Last reply Reply Quote 0
                          • Tristis OrisT Offline
                            Tristis Oris Top contributor @florent
                            last edited by

                            @florent sure, i'm deployed XOA already.
                            what you need to get access?

                            florentF 1 Reply Last reply Reply Quote 0
                            • Tristis OrisT Offline
                              Tristis Oris Top contributor @florent
                              last edited by

                              @florent said in Backup timeout - hang or didn't work:

                              find /mnt/mih-5300-2/xo-vm-backups/ -name "cache.json.gz" -type f -delete

                              That works, removed them finally. Thanks.

                              1 Reply Last reply Reply Quote 0
                              • florentF Offline
                                florent Vates 🪐 XO Team @Tristis Oris
                                last edited by olivierlambert

                                @Tristis-Oris can you open a support tunnel and give me the access code ( by direct message ) ?
                                https://help.vates.fr/help/en-us/8-technical-support/16-create-a-support-tunnel

                                I would like to look into this in detail since we made quite a lot of changes on the merge and the caching recently
                                regards

                                1 Reply Last reply Reply Quote 0
                                • Tristis OrisT Offline
                                  Tristis Oris Top contributor
                                  last edited by

                                  any progress with that problem?

                                  1 Reply Last reply Reply Quote 0
                                  • STeffaineS Offline
                                    STeffaine
                                    last edited by

                                    any updates? im also having this issue.

                                    1 Reply Last reply Reply Quote 0
                                    • Tristis OrisT Offline
                                      Tristis Oris Top contributor
                                      last edited by

                                      After bunch of last fixes i removed some broken backup chains and now all errors gone. Thanks.

                                      1 Reply Last reply Reply Quote 1
                                      • Tristis OrisT Offline
                                        Tristis Oris Top contributor
                                        last edited by

                                        @olivierlambert to not create new topic, small continue here.

                                        timeout option still ignored.

                                        probably because of network problems i got backup error and job hangs every day.
                                        doesn't matter why it happens, but job will never get terminated by timeout.

                                        21h job with 7h timeout. https://pastebin.com/U5HgKEzM
                                        just 1 Gb.
                                        1c80c5b5-248f-4506-a351-5bca47ee4f8c-image.png
                                        7458ace9-3e08-4537-a7da-2d304033995e-image.png

                                        i setup scheduled task to reboot xo once a day, to close this endless tasks.

                                        1 Reply Last reply Reply Quote 0
                                        • First post
                                          Last post