XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Too many snapshots

    Scheduled Pinned Locked Moved Backup
    40 Posts 6 Posters 557 Views 3 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • P Offline
      Pilow @tjkreidl
      last edited by

      @tjkreidl said:

      It's better IMO to have a solid backup less frequently than have them fail on a regular basis.

      totally agree.

      1 Reply Last reply Reply Quote 0
      • M Online
        McHenry @tjkreidl
        last edited by

        @tjkreidl
        The offsite backup runs at 8 pm and takes 6/7 hours, whereas the hourly runs from 7 am to 7 pm and only take a few minutes.
        76eca590-f9f4-4f99-8964-23be858c62c0-image.jpeg

        The backup job has 19 VMs, suely this is not too many.

        P 1 Reply Last reply Reply Quote 0
        • P Offline
          Pilow @McHenry
          last edited by

          @McHenry 19 VMs is 19 Chains of 16 VDIs
          at each hourly run, a new snapshot is created (some minutes) and the oldest one is merged/garbage collected in the first snap (time undetermined)

          I guess 19 merge + chain garbage collected seems to not be able to be done in the one hour timeframe before next CR is done

          you possibly have a chain growing

          can you check in DASHBOARD/HEALTH the unhealthy VDI section at 11 am ?

          tjkreidlT 1 Reply Last reply Reply Quote 0
          • tjkreidlT Offline
            tjkreidl Ambassador @Pilow
            last edited by

            @Pilow Agree.... have to be sure that garbage collection is completed or it'll never catch up if backups continue to be run without the coalesce completing.

            M 1 Reply Last reply Reply Quote 0
            • M Online
              McHenry @tjkreidl
              last edited by

              @tjkreidl

              If each CR backup is now created as a snapshot, instead of a new VM, and the alert triggers after a VM has more than three snapshots, this logically means that the alert will trigger if the CR has a retention value greater than 3.

              Have I misunderstood how the CR backup process works?

              P 1 Reply Last reply Reply Quote 0
              • P Offline
                Pilow @McHenry
                last edited by Pilow

                @McHenry I dont think more than 3 snapshots triggers an error, just tested on one VM a150619d-1010-4efc-80ec-9dc4bcc730b0-image.jpeg

                it is not recommended for "in production" VMs, but for a CR destination, it's OK (as you would need to start a copy anyway)

                your problem, failing CR jobs is probably due to garbage collection not finishing in the one hour timeframe when chain is long.

                M 1 Reply Last reply Reply Quote 1
                • M Online
                  McHenry @Pilow
                  last edited by

                  @Pilow

                  CR jobs are not failing just XO reports too many snapshots under:
                  Dashboard >> Health

                  All good if I can just ignore this warning but thought best to check in case it was an issue.

                  I got the value of 3 from here.
                  https://docs.xen-orchestra.com/manage_infrastructure#too-many-snapshots

                  P 1 Reply Last reply Reply Quote 0
                  • P Offline
                    Pilow @McHenry
                    last edited by

                    @McHenry could you screen the health page ?
                    where we could see the chain length

                    henri9813H M 2 Replies Last reply Reply Quote 0
                    • henri9813H Offline
                      henri9813 @Pilow
                      last edited by henri9813

                      Hello,

                      I see also this behavior which is "new" since few weeks.

                      Previously, when a backup start:

                      • it stake a snapshot ( if there another one before, it delete it ).
                      • it upload the snapshot as a backup
                      • it coalesce the backup on the remote.
                      • end of the game.

                      Now, the old snapshots are not deleted anymore which can lead easily to some disk full.

                      Even with a retention of 1, the problem is present.

                      I observe this only in Backup job, not DR/CR job.

                      I just updated my XO to latest version, i will see if the issue is fixed.

                      1 Reply Last reply Reply Quote 0
                      • M Online
                        McHenry @Pilow
                        last edited by

                        @Pilow
                        eac3d82b-82a4-46b3-b8e1-1f4b64c57e35-image.jpeg

                        The number of snapshots shows 16, which makes sense as I have two backup schedules, one with a retention of 15 and one with a retention of 1. The daily backup with a retention of 1 resets the chain, as it is a full backup.
                        883856d8-222a-4593-a013-3204a340ecbc-image.jpeg

                        1 Reply Last reply Reply Quote 0

                        Hello! It looks like you're interested in this conversation, but you don't have an account yet.

                        Getting fed up of having to scroll through the same posts each visit? When you register for an account, you'll always come back to exactly where you were before, and choose to be notified of new replies (either via email, or push notification). You'll also be able to save bookmarks and upvote posts to show your appreciation to other community members.

                        With your input, this post could be even better 💗

                        Register Login
                        • First post
                          Last post