XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Coalesce failed, skipping - corrupted

    Scheduled Pinned Locked Moved Xen Orchestra
    6 Posts 2 Posters 631 Views 2 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • G Offline
      gojjily
      last edited by Danp

      My Xen orchestra backup is failing for one specific VM with the error below.
      How do I fix this?

      Jul 28 01:28:51 milsrv01 SMGC: [30191] Coalesce candidate: *c5bf9937(350.000G/105.039G) (tree height 6)
      Jul 28 01:28:51 milsrv01 SMGC: [30191] Coalescing *c5bf9937(350.000G/105.039G) -> *29153ed4(350.000G/214.587G)
      Jul 28 01:28:51 milsrv01 SMGC: [30191] coalesce: EXCEPTION <class 'util.SMException'>, VHD *c5bf9937(350.000G/105.039G) corrupted
      Jul 28 01:28:51 milsrv01 SMGC: [30191]   File "/opt/xensource/sm/cleanup.py", line 1753, in coalesce
      Jul 28 01:28:51 milsrv01 SMGC: [30191]     self._coalesce(vdi)
      Jul 28 01:28:51 milsrv01 SMGC: [30191]   File "/opt/xensource/sm/cleanup.py", line 1942, in _coalesce
      Jul 28 01:28:51 milsrv01 SMGC: [30191]     vdi._doCoalesce()
      Jul 28 01:28:51 milsrv01 SMGC: [30191]   File "/opt/xensource/sm/cleanup.py", line 764, in _doCoalesce
      Jul 28 01:28:51 milsrv01 SMGC: [30191] Coalesce failed, skipping
      
      1 Reply Last reply Reply Quote 0
      • olivierlambertO Online
        olivierlambert Vates 🪐 Co-Founder CEO
        last edited by

        This is a storage issue, you VHD chain seems to be corrupted sadly.

        1 Reply Last reply Reply Quote 0
        • G Offline
          gojjily
          last edited by

          right, but what are we saying the options are to fix this?

          1 Reply Last reply Reply Quote 0
          • olivierlambertO Online
            olivierlambert Vates 🪐 Co-Founder CEO
            last edited by

            There's no universal answer. You might try with the vhd-util repair tool, but if it's very corrupted, restoring a backup is the best approach.

            Corruption doesn't appear randomly, something went wrong at some point, but it's impossible to guess without more investigation.

            G 2 Replies Last reply Reply Quote 0
            • G Offline
              gojjily @olivierlambert
              last edited by gojjily

              @olivierlambert I would guess it was a power failure. I had an APC UPS blow up, still working to troubleshoot the UPS issue. APC send a new warranty battery, and now also a new warranty UPS.

              1 Reply Last reply Reply Quote 0
              • G Offline
                gojjily @olivierlambert
                last edited by

                @olivierlambert so I went to restore this VM from backup, and noticed that the monthly backup is currently running on this VM as we speak.

                Will restoring from this newest backup fix my issue? Or does it restore the issue back into production?

                1 Reply Last reply Reply Quote 0
                • First post
                  Last post