XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Too many snapshots

    Scheduled Pinned Locked Moved Backup
    45 Posts 7 Posters 2.9k Views 4 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • P Offline
      Pilow @McHenry
      last edited by Pilow

      @McHenry I dont think more than 3 snapshots triggers an error, just tested on one VM a150619d-1010-4efc-80ec-9dc4bcc730b0-image.jpeg

      it is not recommended for "in production" VMs, but for a CR destination, it's OK (as you would need to start a copy anyway)

      your problem, failing CR jobs is probably due to garbage collection not finishing in the one hour timeframe when chain is long.

      M 1 Reply Last reply Reply Quote 1
      • M Offline
        McHenry @Pilow
        last edited by

        @Pilow

        CR jobs are not failing just XO reports too many snapshots under:
        Dashboard >> Health

        All good if I can just ignore this warning but thought best to check in case it was an issue.

        I got the value of 3 from here.
        https://docs.xen-orchestra.com/manage_infrastructure#too-many-snapshots

        P 1 Reply Last reply Reply Quote 0
        • P Offline
          Pilow @McHenry
          last edited by

          @McHenry could you screen the health page ?
          where we could see the chain length

          henri9813H M 2 Replies Last reply Reply Quote 0
          • henri9813H Offline
            henri9813 @Pilow
            last edited by henri9813

            Hello,

            I see also this behavior which is "new" since few weeks.

            Previously, when a backup start:

            • it stake a snapshot ( if there another one before, it delete it ).
            • it upload the snapshot as a backup
            • it coalesce the backup on the remote.
            • end of the game.

            Now, the old snapshots are not deleted anymore which can lead easily to some disk full.

            Even with a retention of 1, the problem is present.

            I observe this only in Backup job, not DR/CR job.

            I just updated my XO to latest version, i will see if the issue is fixed.

            1 Reply Last reply Reply Quote 0
            • M Offline
              McHenry @Pilow
              last edited by

              @Pilow
              eac3d82b-82a4-46b3-b8e1-1f4b64c57e35-image.jpeg

              The number of snapshots shows 16, which makes sense as I have two backup schedules, one with a retention of 15 and one with a retention of 1. The daily backup with a retention of 1 resets the chain, as it is a full backup.
              883856d8-222a-4593-a013-3204a340ecbc-image.jpeg

              henri9813H 1 Reply Last reply Reply Quote 0
              • henri9813H Offline
                henri9813 @McHenry
                last edited by

                Hello @McHenry .

                Yes but no, once the snapshot is exported, the previous one must be cleaned on local.

                Best regards,

                M 1 Reply Last reply Reply Quote 0
                • M Offline
                  McHenry @henri9813
                  last edited by

                  @henri9813

                  Thanks.
                  The old snapshots are being removed as the total never increases beyond 16, so when a new snapshot is added, the old one is removed.

                  P 1 Reply Last reply Reply Quote 0
                  • P Offline
                    Pilow @McHenry
                    last edited by

                    @McHenry said:

                    Thanks.
                    The old snapshots are being removed as the total never increases beyond 16, so when a new snapshot is added, the old one is removed.

                    immediatly removed, yes, but then Garbage collection takes place.
                    and perhaps with 19x16 GC to process it can't be done in one hour, and then next CR is launched, etc etc...

                    M 1 Reply Last reply Reply Quote 0
                    • M Offline
                      McHenry @Pilow
                      last edited by

                      @Pilow

                      I did check this and it definitely completes within the hour.

                      I am testing a lesser value for CR retention to see if this resolves it.

                      poddingueP 1 Reply Last reply Reply Quote 0
                      • poddingueP Online
                        poddingue Vates 🪐 @McHenry
                        last edited by

                        If the lower retention value gets things stable, that probably confirms Pilow's hypothesis. If it doesn't help, that's the signal that something heavier is going on, and a @Team-XO-Backend ping would make sense. Would you mind dropping the result back here either way? Helps the next person hitting the same wall.

                        1 Reply Last reply Reply Quote 0

                        Hello! It looks like you're interested in this conversation, but you don't have an account yet.

                        Getting fed up of having to scroll through the same posts each visit? When you register for an account, you'll always come back to exactly where you were before, and choose to be notified of new replies (either via email, or push notification). You'll also be able to save bookmarks and upvote posts to show your appreciation to other community members.

                        With your input, this post could be even better 💗

                        Register Login
                        • First post
                          Last post