XCP-ng

    XOSTOR hyperconvergence preview

    • F Offline
      ferrao @lover
      last edited by

      @lover said in XOSTOR hyperconvergence preview:

      Anyone else getting a 301 error?

      http://mirrors.xcp-ng.org/8/8.2/base/x86_64/repodata/repomd.xml: [Errno 14] HTTPS Error 301 - Moved Permanently
      Trying other mirror.
      
      

      A 301 is not an error in the sense of a failure; it is a redirect. Here it redirects correctly to a nearby mirror. In my case: https://mirror.uepg.br/xcp-ng/8/8.2/base/x86_64/repodata/repomd.xml
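To make the distinction concrete, here is a minimal Python sketch using only the standard library. The helper name `is_redirect` is made up for illustration; it just checks that a status code falls in the 3xx range, which is the class of responses that point the client to another location rather than reporting a failure.

```python
from http import HTTPStatus


def is_redirect(code: int) -> bool:
    """Return True for 3xx codes, which redirect instead of failing."""
    return 300 <= code < 400


# 301 is the code from the yum message quoted above.
status = HTTPStatus(301)
print(status.value, status.phrase, "redirect" if is_redirect(status.value) else "not a redirect")
```

By the same check, a 403 (as returned by some package repositories) is not a redirect: the server is refusing the request.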

        • M Offline
          Mark C @lover
          last edited by

          @lover

          See /etc/yum.repos.d/xcp-ng.repo and update all references from http:// to https://
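A small sketch of that edit in Python, in case it helps. The function name `force_https` and the sample repo text are assumptions for illustration only (the real file on your host will have more sections and options), and you should back up the file before rewriting it.

```python
import re


def force_https(repo_text: str) -> str:
    """Rewrite every http:// URL in the text of a .repo file to https://."""
    # "https://" is left alone because the pattern requires "://" right after "http".
    return re.sub(r"\bhttp://", "https://", repo_text)


# Hypothetical sample content, NOT a copy of the real xcp-ng.repo.
SAMPLE = (
    "[xcp-ng-base]\n"
    "baseurl = http://mirrors.xcp-ng.org/8/8.2/base/x86_64/\n"
)

print(force_https(SAMPLE))

# To apply it to the real file (run as root, after making a backup):
#   from pathlib import Path
#   path = Path("/etc/yum.repos.d/xcp-ng.repo")
#   path.write_text(force_https(path.read_text()))
```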

          • J Offline
            Jonathon
            last edited by

            Hello all!

            I have an issue with backing up to S3. I am hoping someone can point out the mistake I am making.
            Our xcp-ng hosts are all up to date.

            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 remote l
            ╭─────────────────────────────────────────────────────────────────────────╮
            ┊ Name                 ┊ Type ┊ Info                                      ┊
            ╞═════════════════════════════════════════════════════════════════════════╡
            ┊ linbit-velero-backup ┊ S3   ┊ us-east-1.s3.wasabisys.com/velero-preprod ┊
            ╰─────────────────────────────────────────────────────────────────────────╯
            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 backup create linbit-velero-backup pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d
            SUCCESS:
                Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-vtest-k8s01-worker02' for snapshot
            SUCCESS:
                Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen02' for snapshot
            SUCCESS:
                Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen03' for snapshot
            SUCCESS:
                Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen01' for snapshot
            SUCCESS:
                Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-vtest-k8s01-worker02'
            SUCCESS:
                Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen02'
            SUCCESS:
                Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen03'
            SUCCESS:
                Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen01'
            SUCCESS:
                Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-vtest-k8s01-worker02' after snapshot
            SUCCESS:
                Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen01' after snapshot
            SUCCESS:
                Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen02' after snapshot
            SUCCESS:
                Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen03' after snapshot
            INFO:
                Generated snapshot name for backup of resourcepvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d to remote linbit-velero-backup
            INFO:
                Shipping of resource pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d to remote linbit-velero-backup in progress.
            SUCCESS:
                Started shipping of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d'
            SUCCESS:
                Started shipping of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d'
            SUCCESS:
                Started shipping of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d'
            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 snapshot l
            ╭─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
            ┊ ResourceName                             ┊ SnapshotName         ┊ NodeNames                                            ┊ Volumes   ┊ CreatedOn           ┊ State    ┊
            ╞═════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
            ┊ pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d ┊ back_20241002_191139 ┊ ovbh-pprod-xen01, ovbh-pprod-xen02, ovbh-pprod-xen03 ┊ 0: 50 GiB ┊ 2024-10-02 16:11:40 ┊ Shipping ┊
            ╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 backup list linbit-velero-backup
            ╭───────────────────────────────────────────────────────╮
            ┊ Resource ┊ Snapshot ┊ Finished at ┊ Based On ┊ Status ┊
            ╞═══════════════════════════════════════════════════════╡
            ╰───────────────────────────────────────────────────────╯
            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 snapshot l
            ╭───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
            ┊ ResourceName                             ┊ SnapshotName         ┊ NodeNames                                            ┊ Volumes   ┊ CreatedOn           ┊ State      ┊
            ╞═══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
            ┊ pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d ┊ back_20241002_191139 ┊ ovbh-pprod-xen01, ovbh-pprod-xen02, ovbh-pprod-xen03 ┊ 0: 50 GiB ┊ 2024-10-02 16:11:40 ┊ Successful ┊
            ╰───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
            

            Nothing shows up on S3.
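The mismatch in the output above (the snapshot ends up as Successful while `backup list` stays empty) can also be checked mechanically. This is only a rough sketch: the function names are made up, it parses the box-drawn tables printed by the client as plain text, and it assumes the `Snapshot` column of `backup list` would hold the same name as `SnapshotName`, which I have not verified.

```python
def parse_linstor_table(text: str) -> list[dict[str, str]]:
    """Parse a box-drawn table as printed by the linstor client into row dicts."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("┊"):
            continue  # skip borders, prompts and blank lines
        rows.append([cell.strip() for cell in line.strip("┊").split("┊")])
    if not rows:
        return []
    header, *body = rows
    return [dict(zip(header, row)) for row in body]


def successful_but_not_listed(snapshot_out: str, backup_out: str) -> list[tuple[str, str]]:
    """Return (resource, snapshot) pairs marked Successful but absent from the backup list."""
    listed = {(b["Resource"], b["Snapshot"]) for b in parse_linstor_table(backup_out)}
    return [
        (s["ResourceName"], s["SnapshotName"])
        for s in parse_linstor_table(snapshot_out)
        if s["State"] == "Successful"
        and (s["ResourceName"], s["SnapshotName"]) not in listed
    ]


# Rows copied from the outputs above (borders omitted).
SNAPSHOT_OUT = """
┊ ResourceName                             ┊ SnapshotName         ┊ NodeNames                                            ┊ Volumes   ┊ CreatedOn           ┊ State      ┊
┊ pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d ┊ back_20241002_191139 ┊ ovbh-pprod-xen01, ovbh-pprod-xen02, ovbh-pprod-xen03 ┊ 0: 50 GiB ┊ 2024-10-02 16:11:40 ┊ Successful ┊
"""
BACKUP_OUT = "┊ Resource ┊ Snapshot ┊ Finished at ┊ Based On ┊ Status ┊"

print(successful_but_not_listed(SNAPSHOT_OUT, BACKUP_OUT))
```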
            After enabling logging by modifying /usr/share/linstor-server/lib/conf/logback.xml, I see the following:

            [19:15 ovbh-pprod-xen01 ~]# tail /var/log/linstor-satellite/linstor-Satellite.log -n 20
            2024_10_02 19:11:41.511 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
            2024_10_02 19:11:41.512 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
            2024_10_02 19:11:41.513 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
            2024_10_02 19:11:41.516 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 82432 bytes remaining
            2024_10_02 19:11:41.543 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 42
            2024_10_02 19:11:41.543 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 43
            2024_10_02 19:11:41.552 [MainWorkerPool-5] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241002_191139' of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' registered.
            2024_10_02 19:11:41.553 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000 size from 52440040 KiB to 52441088 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
            2024_10_02 19:11:41.615 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' [DRBD] adjusted.
            2024_10_02 19:11:41.781 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 43
            2024_10_02 19:11:41.781 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 44
            2024_10_02 19:11:47.220 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr: Incomplete copy_data, 4194304 bytes missing.
            2024_10_02 19:11:47.295 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - Exception occurred while checking for support of requester-pays on remote linbit-velero-backup. Defaulting to false
            2024_10_02 19:11:47.307 [MainWorkerPool-7] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241002_191139' of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' registered.
            2024_10_02 19:11:47.309 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000 size from 52440040 KiB to 52441088 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
            2024_10_02 19:11:47.312 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] ERROR LINSTOR/Satellite - SYSTEM -  [Report number 66FDD1AE-3AE91-000000]
            
            2024_10_02 19:11:47.398 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' [DRBD] adjusted.
            2024_10_02 19:11:47.561 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 44
            2024_10_02 19:11:47.561 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 45
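A side note on the "Aligning ... size" lines in this log: the numbers are consistent with rounding the volume size up to the next multiple of the 4096 KiB extent size. The sketch below only reproduces that arithmetic; `align_up` is a made-up helper, not LINSTOR code, and I am not claiming the alignment is what causes the failure.

```python
def align_up(size_kib: int, extent_kib: int) -> int:
    """Round size_kib up to the next multiple of extent_kib."""
    # Ceiling division written with floor division on negated values.
    return -(-size_kib // extent_kib) * extent_kib


# Values taken from the DeviceManager log lines above.
print(align_up(52440040, 4096))  # size in KiB after alignment

# The "Incomplete copy_data" warning reports 4194304 bytes missing.
print(4194304 // 1024)  # the same amount expressed in KiB
```

The second value comes out to exactly one extent, which may or may not be a coincidence.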
            

            The error report:

            [19:12 ovbh-pprod-xen01 ~]# cat /var/log/linstor-satellite/ErrorReport-66FDD1AE-3AE91-000000.log
            ERROR REPORT 66FDD1AE-3AE91-000000
            
            ============================================================
            
            Application:                        LINBIT® LINSTOR
            Module:                             Satellite
            Version:                            1.26.1
            Build ID:                           12746ac9c6e7882807972c3df56e9a89eccad4e5
            Build time:                         2024-02-22T05:27:50+00:00
            Error time:                         2024-10-02 19:11:47
            Node:                               ovbh-pprod-xen01
            Thread:                             shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139
            
            ============================================================
            
            Reported error:
            ===============
            
            Category:                           RuntimeException
            Class name:                         AbortedException
            Class canonical name:               com.amazonaws.AbortedException
            Generated at:                       Method 'handleInterruptedException', Source file 'AmazonHttpClient.java', Line #880
            
            Error message:                      
            
            Call backtrace:
            
                Method                                   Native Class:Line number
                handleInterruptedException               N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:880
                execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:757
                access$500                               N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:715
                execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutionBuilderImpl:697
                execute                                  N      com.amazonaws.http.AmazonHttpClient:561
                execute                                  N      com.amazonaws.http.AmazonHttpClient:541
                invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5516
                invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5463
                abortMultipartUpload                     N      com.amazonaws.services.s3.AmazonS3Client:3620
                abortMultipart                           N      com.linbit.linstor.api.BackupToS3:199
                threadFinished                           N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:320
                run                                      N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:298
                run                                      N      java.lang.Thread:829
            
            Caused by:
            ==========
            
            Category:                           Exception
            Class name:                         SdkInterruptedException
            Class canonical name:               com.amazonaws.http.timers.client.SdkInterruptedException
            Generated at:                       Method 'checkInterrupted', Source file 'AmazonHttpClient.java', Line #935
            
            
            Call backtrace:
            
                Method                                   Native Class:Line number
                checkInterrupted                         N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:935
                checkInterrupted                         N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:921
                executeHelper                            N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:1115
                doExecute                                N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:814
                executeWithTimer                         N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:781
                execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:755
                access$500                               N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:715
                execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutionBuilderImpl:697
                execute                                  N      com.amazonaws.http.AmazonHttpClient:561
                execute                                  N      com.amazonaws.http.AmazonHttpClient:541
                invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5516
                invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5463
                abortMultipartUpload                     N      com.amazonaws.services.s3.AmazonS3Client:3620
                abortMultipart                           N      com.linbit.linstor.api.BackupToS3:199
                threadFinished                           N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:320
                run                                      N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:298
                run                                      N      java.lang.Thread:829
            
            
            END OF ERROR REPORT.
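For anyone following along: the report file shown above is named after the `[Report number ...]` entry in the satellite log. Here is a minimal sketch for pulling those IDs out of a log and building the matching file paths. The function name and the regular expression are my own assumptions based on the two log excerpts in this post, not a documented format.

```python
import re

# Matches IDs shaped like 66FDD1AE-3AE91-000000, as seen in the log above.
REPORT_RE = re.compile(r"\[Report number ([0-9A-F]+-[0-9A-F]+-[0-9]+)\]")


def error_report_paths(log_text: str, log_dir: str = "/var/log/linstor-satellite") -> list[str]:
    """Return the ErrorReport file path for every report number found in log_text."""
    return [f"{log_dir}/ErrorReport-{rid}.log" for rid in REPORT_RE.findall(log_text)]


# Log line copied from the satellite log above.
LOG_LINE = (
    "2024_10_02 19:11:47.312 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] "
    "ERROR LINSTOR/Satellite - SYSTEM -  [Report number 66FDD1AE-3AE91-000000]"
)

print(error_report_paths(LOG_LINE))
```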
            
            • J Offline
              Jonathon
              last edited by Jonathon

              Ok, so, it turns out this is caused by the thin-send-recv package I built from https://github.com/LINBIT/thin-send-recv/tree/master

              As a test, I swapped the version I built for the last one I was able to download, and it works.

              The last version I could get from any repository before they started returning 403 was thin-send-recv-1.0.1-1.x86_64.rpm.txt, which I downloaded from https://piraeus.daocloud.io/linbit/rpms/7/x86_64/thin-send-recv-1.0.1-1.x86_64.rpm. FYI, https://packages.linbit.com/yum/sles12-sp2/drbd-9.0/x86_64/Packages/ returns 403s too, so there is no point looking for it there even if they host it.

              I built thin-send-recv-1.1.2-1.xcpng8.2.x86_64.rpm.txt following this doc I put together: thin-send-recv.txt. But the package I built results in the error posted previously.

              So I am a bit at a loss. I want to use Velero to back up PVs that are not managed by an operator with backup capabilities, but I do not want to be stuck on this old version that I cannot update.

              Any advice would be greatly appreciated!

              • J Offline
                Jonathon
                last edited by Jonathon

                Ok great, I manually built 1.0.1 and it works just like the package I got online, which means my build process is correct.

                The bad news is that there is a breaking change in v1.1.2, and I think I am potentially SOL. 😭
                I was going to build and test v1.1.0 and v1.1.1 to see which ones work. Never mind: v1.1.0 is also broken.

                So the change that breaks it is in here: https://github.com/LINBIT/thin-send-recv/compare/6b7c9002cd7716ff6ef93f5a5e8908032b81f853...e44f566ea0c975e2baa475868ebc176065a5b22d

                v1.0.1 might simply be the version that works with this version of LINSTOR, and whenever LINSTOR gets updated it might call for a newer version of thin-send-recv.

                • olivierlambertO Offline
                  olivierlambert Vates 🪐 Co-Founder CEO
                  last edited by

                  @ronan-a might take a look if he finds a free minute (which is complicated right now 😄)

                  • ronan-aR Offline
                    ronan-a Vates 🪐 XCP-ng Team @olivierlambert
                    last edited by

                    @olivierlambert @Jonathon Unfortunately we don't maintain this package, so it's not available in our repositories. The simplest option is to raise this problem directly with LINBIT. Maybe there is a regression or something else?

                      • B Offline
                        BHellman 3rd party vendor
                        last edited by

                        I spoke with one of our developers. He said:

                        "When I search for

                        Device read short 40960 bytes remaining
                        

                        I only get results for XCP-ng/Citrix with LVM. So I think this is an issue with LVM on xcp-ng. One thing we changed between thin-send-recv 1.0.X and 1.1.X is the error handling, so now errors are properly propagated. So I would guess it was always broken but newer versions actually make the error visible."

                        Not sure if that helps, but if there is anything to relay I'm more than happy to pass it along.
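To illustrate the kind of change the developer describes, here is a toy Python model. It is not thin-send-recv's code and every name in it is made up; it only shows how the same short read can go unnoticed when the copy routine ignores it, and become a visible failure once the routine checks for it.

```python
class ShortReadError(IOError):
    """Raised when fewer bytes were read than requested."""


def copy_lenient(requested: int, available: int) -> int:
    """Copy what is available and return silently, even if bytes are missing."""
    return min(requested, available)


def copy_strict(requested: int, available: int) -> int:
    """Copy what is available, but fail loudly when bytes are missing."""
    copied = min(requested, available)
    missing = requested - copied
    if missing:
        raise ShortReadError(f"Device read short {missing} bytes remaining")
    return copied


# Hypothetical sizes: a 4 MiB chunk where the last 40960 bytes cannot be read.
requested, available = 4194304, 4194304 - 40960

print(copy_lenient(requested, available))  # returns normally, the problem stays hidden
try:
    copy_strict(requested, available)
except ShortReadError as exc:
    print("error surfaced:", exc)
```

In both cases the same bytes are missing; only the strict variant tells the caller.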

                        • J Offline
                          Jonathon
                          last edited by

                          @BHellman I appreciate you weighing in, and the time from your dev.

                          Ok yeah, I thought I was hallucinating lol. v1.0.1 was 100% working when I installed it at the time of my posting, and it was failing today. After restarting all the satellites it works again, though I assume it will break again.

                          When it actually works, I can see the PVC in the S3 remote:
                          da01e3ab-e7b2-4830-8aec-91e343d49196-image.png

                          Here is a scattering of commands and outputs. I restarted the satellites partway through, so it may be difficult to read, but I thought it would be better than nothing.
                          commands-and-outputs.txt

                          xen01-linstor-Satellite.txt
                          xen02-linstor-Satellite.txt
                          xen03-linstor-Satellite.txt

                          • J Offline
                            Jonathon
                            last edited by Jonathon

                            And now for something completely different lol. It's the same thing 😭

                            We have a new xcp-ng cluster that we would like to migrate everything to. We are not migrating the k8s clusters but creating new ones on a new RKE2 Rancher. To migrate the applications, it would simplify things if I could move the PVCs over.

                            The command that fails (same result if I add --target-storage-pool xcp-sr-linstor_group_thin_device):

                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 backup ship newCluster pvc-086a5817-d813-41fe-86d8-3fac2ae2028f pvc-086a5817-d813-41fe-86d8-3fac2ae2028f
                            ERROR:
                            Description:
                                Remote 'newCluster': Could not find suitable storage pool to receive backup
                            Cause:
                                ErrorReport id on target cluster: 66FF0E92-00000-000011
                            

                            Setting up the remotes:

                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 controller list-properties
                            ╭───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                            ┊ Key                                                ┊ Value                                                                                                                                                                                                                        ┊
                            ╞═══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                            ┊ Cluster/LocalID                                    ┊ 941fc610-acb9-484a-9837-d2c0df8a86aa     
                            
                            
                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 controller list-properties
                            ╭───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                            ┊ Key                                                ┊ Value                                                                                                                                                                                                                        ┊
                            ╞═══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                            ┊ Cluster/LocalID                                    ┊ 717be8f7-1aec-4830-9aab-cc0afba0dd3a
                            
                            linstor --controller 10.2.0.19 remote create linstor newCluster 10.2.0.10 --cluster-id 717be8f7-1aec-4830-9aab-cc0afba0dd3a
                            linstor --controller 10.2.0.10 remote create linstor sourceCluster 10.2.0.19 --cluster-id 941fc610-acb9-484a-9837-d2c0df8a86aa
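The pairing in the two `remote create` commands above is easy to get backwards, so here is a small sketch that builds them from the `Cluster/LocalID` values. The helper name is made up; the command shape and flags are copied from the commands in this post, and the rule it encodes is that each remote carries the cluster ID of the peer controller it points at.

```python
# Cluster/LocalID values as printed by `controller list-properties` above.
CLUSTER_IDS = {
    "10.2.0.19": "941fc610-acb9-484a-9837-d2c0df8a86aa",
    "10.2.0.10": "717be8f7-1aec-4830-9aab-cc0afba0dd3a",
}


def remote_create_cmd(local_controller: str, remote_name: str, peer_controller: str) -> list[str]:
    """Build the argv for registering peer_controller as a LINSTOR remote on local_controller."""
    return [
        "linstor", "--controller", local_controller,
        "remote", "create", "linstor", remote_name, peer_controller,
        "--cluster-id", CLUSTER_IDS[peer_controller],  # the peer's ID, not the local one
    ]


print(" ".join(remote_create_cmd("10.2.0.19", "newCluster", "10.2.0.10")))
print(" ".join(remote_create_cmd("10.2.0.10", "sourceCluster", "10.2.0.19")))
```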
                            

                            Nothing interesting in any of the satellite logs.
                            Error on the new cluster:

                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 err show 66FF0E92-00000-000011
                            ERROR REPORT 66FF0E92-00000-000011
                            
                            ============================================================
                            
                            Application:                        LINBIT® LINSTOR
                            Module:                             Controller
                            Version:                            1.26.1
                            Build ID:                           12746ac9c6e7882807972c3df56e9a89eccad4e5
                            Build time:                         2024-02-22T05:27:50+00:00
                            Error time:                         2024-10-07 14:51:35
                            Node:                               ovbh-pprod-xen01
                            Thread:                             MainWorkerPool-3
                            
                            ============================================================
                            
                            Reported error:
                            ===============
                            
                            Category:                           RuntimeException
                            Class name:                         ApiRcException
                            Class canonical name:               com.linbit.linstor.core.apicallhandler.response.ApiRcException
                            Generated at:                       Method 'restoreBackupL2LInTransaction', Source file 'CtrlBackupRestoreApiCallHandler.java', Line #1123
                            
                            Error message:                      Could not find suitable storage pool to receive backup
                            
                            Asynchronous stage backtrace:
                                
                                Error has been observed at the following site(s):
                                	*__checkpoint ⇢ restore backup
                                	*__checkpoint ⇢ Backupshipping L2L start receive
                                Original Stack Trace:
                            
                            Call backtrace:
                            
                                Method                                   Native Class:Line number
                                restoreBackupL2LInTransaction            N      com.linbit.linstor.core.apicallhandler.controller.backup.CtrlBackupRestoreApiCallHandler:1123
                            
                            Suppressed exception 1 of 1:
                            ===============
                            Category:                           RuntimeException
                            Class name:                         OnAssemblyException
                            Class canonical name:               reactor.core.publisher.FluxOnAssembly.OnAssemblyException
                            Generated at:                       Method 'restoreBackupL2LInTransaction', Source file 'CtrlBackupRestoreApiCallHandler.java', Line #1123
                            
                            Error message:                      
                            Error has been observed at the following site(s):
                            	*__checkpoint ⇢ restore backup
                            	*__checkpoint ⇢ Backupshipping L2L start receive
                            Original Stack Trace:
                            
                            Call backtrace:
                            
                                Method                                   Native Class:Line number
                                restoreBackupL2LInTransaction            N      com.linbit.linstor.core.apicallhandler.controller.backup.CtrlBackupRestoreApiCallHandler:1123
                                lambda$startReceivingInTransaction$4     N      com.linbit.linstor.core.apicallhandler.controller.backup.CtrlBackupL2LDstApiCallHandler:526
                                doInScope                                N      com.linbit.linstor.core.apicallhandler.ScopeRunner:149
                                lambda$fluxInScope$0                     N      com.linbit.linstor.core.apicallhandler.ScopeRunner:76
                                call                                     N      reactor.core.publisher.MonoCallable:72
                                trySubscribeScalarMap                    N      reactor.core.publisher.FluxFlatMap:127
                                subscribeOrReturn                        N      reactor.core.publisher.MonoFlatMapMany:49
                                subscribe                                N      reactor.core.publisher.Flux:8759
                                onNext                                   N      reactor.core.publisher.MonoFlatMapMany$FlatMapManyMain:195
                                request                                  N      reactor.core.publisher.Operators$ScalarSubscription:2545
                                onSubscribe                              N      reactor.core.publisher.MonoFlatMapMany$FlatMapManyMain:141
                                subscribe                                N      reactor.core.publisher.MonoJust:55
                                subscribe                                N      reactor.core.publisher.MonoDeferContextual:55
                                subscribe                                N      reactor.core.publisher.Flux:8773
                                onNext                                   N      reactor.core.publisher.MonoFlatMapMany$FlatMapManyMain:195
                                onNext                                   N      reactor.core.publisher.FluxMapFuseable$MapFuseableSubscriber:129
                                completePossiblyEmpty                    N      reactor.core.publisher.Operators$BaseFluxToMonoOperator:2071
                                onComplete                               N      reactor.core.publisher.MonoCollect$CollectSubscriber:145
                                onComplete                               N      reactor.core.publisher.MonoFlatMapMany$FlatMapManyInner:260
                                checkTerminated                          N      reactor.core.publisher.FluxFlatMap$FlatMapMain:847
                                drainLoop                                N      reactor.core.publisher.FluxFlatMap$FlatMapMain:609
                                drain                                    N      reactor.core.publisher.FluxFlatMap$FlatMapMain:589
                                onComplete                               N      reactor.core.publisher.FluxFlatMap$FlatMapMain:466
                                checkTerminated                          N      reactor.core.publisher.FluxFlatMap$FlatMapMain:847
                                drainLoop                                N      reactor.core.publisher.FluxFlatMap$FlatMapMain:609
                                innerComplete                            N      reactor.core.publisher.FluxFlatMap$FlatMapMain:895
                                onComplete                               N      reactor.core.publisher.FluxFlatMap$FlatMapInner:998
                                onComplete                               N      reactor.core.publisher.FluxMap$MapSubscriber:144
                                onComplete                               N      reactor.core.publisher.Operators$MultiSubscriptionSubscriber:2205
                                onComplete                               N      reactor.core.publisher.FluxSwitchIfEmpty$SwitchIfEmptySubscriber:85
                                complete                                 N      reactor.core.publisher.FluxCreate$BaseSink:460
                                drain                                    N      reactor.core.publisher.FluxCreate$BufferAsyncSink:805
                                complete                                 N      reactor.core.publisher.FluxCreate$BufferAsyncSink:753
                                drainLoop                                N      reactor.core.publisher.FluxCreate$SerializedFluxSink:247
                                drain                                    N      reactor.core.publisher.FluxCreate$SerializedFluxSink:213
                                complete                                 N      reactor.core.publisher.FluxCreate$SerializedFluxSink:204
                                apiCallComplete                          N      com.linbit.linstor.netcom.TcpConnectorPeer:506
                                handleComplete                           N      com.linbit.linstor.proto.CommonMessageProcessor:372
                                handleDataMessage                        N      com.linbit.linstor.proto.CommonMessageProcessor:296
                                doProcessInOrderMessage                  N      com.linbit.linstor.proto.CommonMessageProcessor:244
                                lambda$doProcessMessage$4                N      com.linbit.linstor.proto.CommonMessageProcessor:229
                                subscribe                                N      reactor.core.publisher.FluxDefer:46
                                subscribe                                N      reactor.core.publisher.Flux:8773
                                onNext                                   N      reactor.core.publisher.FluxFlatMap$FlatMapMain:427
                                drainAsync                               N      reactor.core.publisher.FluxFlattenIterable$FlattenIterableSubscriber:453
                                drain                                    N      reactor.core.publisher.FluxFlattenIterable$FlattenIterableSubscriber:724
                                onNext                                   N      reactor.core.publisher.FluxFlattenIterable$FlattenIterableSubscriber:256
                                drainFused                               N      reactor.core.publisher.SinkManyUnicast:319
                                drain                                    N      reactor.core.publisher.SinkManyUnicast:362
                                tryEmitNext                              N      reactor.core.publisher.SinkManyUnicast:237
                                tryEmitNext                              N      reactor.core.publisher.SinkManySerialized:100
                                processInOrder                           N      com.linbit.linstor.netcom.TcpConnectorPeer:419
                                doProcessMessage                         N      com.linbit.linstor.proto.CommonMessageProcessor:227
                                lambda$processMessage$2                  N      com.linbit.linstor.proto.CommonMessageProcessor:164
                                onNext                                   N      reactor.core.publisher.FluxPeek$PeekSubscriber:185
                                runAsync                                 N      reactor.core.publisher.FluxPublishOn$PublishOnSubscriber:440
                                run                                      N      reactor.core.publisher.FluxPublishOn$PublishOnSubscriber:527
                                call                                     N      reactor.core.scheduler.WorkerTask:84
                                call                                     N      reactor.core.scheduler.WorkerTask:37
                                run                                      N      java.util.concurrent.FutureTask:264
                                run                                      N      java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask:304
                                runWorker                                N      java.util.concurrent.ThreadPoolExecutor:1128
                                run                                      N      java.util.concurrent.ThreadPoolExecutor$Worker:628
                                run                                      N      java.lang.Thread:829
                            
                            
                            END OF ERROR REPORT.
                            

                            Info on the old cluster (controller 10.2.0.19)

                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 r l | grep -e "pvc-086a5817-d813-41fe-86d8-3fac2ae2028f"
                            | pvc-086a5817-d813-41fe-86d8-3fac2ae2028f        | ovbh-pprod-xen10                         | 7117 | Unused | Ok    | UpToDate | 2023-05-31 14:42:09 |
                            | pvc-086a5817-d813-41fe-86d8-3fac2ae2028f        | ovbh-pprod-xen12                         | 7117 | Unused | Ok    | UpToDate | 2023-05-31 14:42:09 |
                            | pvc-086a5817-d813-41fe-86d8-3fac2ae2028f        | ovbh-pprod-xen13                         | 7117 | Unused | Ok    | UpToDate | 2023-05-31 14:42:07 |
                            | pvc-086a5817-d813-41fe-86d8-3fac2ae2028f        | ovbh-vtest-k8s02-worker01 | 7117 | InUse  | Ok    | Diskless | 2024-08-09 11:31:25 |
                            | pvc-086a5817-d813-41fe-86d8-3fac2ae2028f        | ovbh-vtest-k8s02-worker03 | 7117 | Unused | Ok    | Diskless | 2024-06-13 14:15:57 |
                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 rd l | grep -e "pvc-086a5817-d813-41fe-86d8-3fac2ae2028f"
                            | pvc-086a5817-d813-41fe-86d8-3fac2ae2028f        | 7117 | sc-74e1434b-b435-587e-9dea-fa067deec898 | ok    |
                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 rg l 
                            ╭───────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                            ┊ ResourceGroup                           ┊ SelectFilter                                     ┊ VlmNrs ┊ Description ┊
                            ╞═══════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                            ┊ DfltRscGrp                              ┊ PlaceCount: 2                                    ┊        ┊             ┊
                            ╞┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄╡
                            ┊ sc-74e1434b-b435-587e-9dea-fa067deec898 ┊ PlaceCount: 3                                    ┊ 0      ┊             ┊
                            ┊                                         ┊ DisklessOnRemaining: True                        ┊        ┊             ┊
                            ┊                                         ┊ LayerStack: ['DRBD', 'STORAGE']                  ┊        ┊             ┊
                            ╞┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄╡
                            ┊ sc-b066e430-6206-5588-a490-cc91ecef53d6 ┊ PlaceCount: 1                                    ┊ 0      ┊             ┊
                            ┊                                         ┊ DisklessOnRemaining: True                        ┊        ┊             ┊
                            ┊                                         ┊ LayerStack: ['DRBD', 'STORAGE']                  ┊        ┊             ┊
                            ╞┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄╡
                            ┊ xcp-sr-linstor_group_thin_device        ┊ PlaceCount: 3                                    ┊ 0      ┊             ┊
                            ┊                                         ┊ StoragePool(s): xcp-sr-linstor_group_thin_device ┊        ┊             ┊
                            ┊                                         ┊ DisklessOnRemaining: True                        ┊        ┊             ┊
                            ┊                                         ┊ LayerStack: ['DRBD', 'STORAGE']                  ┊        ┊             ┊
                            ╰───────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 sp l 
                            ╭──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                            ┊ StoragePool                      ┊ Node                                     ┊ Driver   ┊ PoolName                  ┊ FreeCapacity ┊ TotalCapacity ┊ CanSnapshots ┊ State ┊ SharedName                                                    ┊
                            ╞══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                            ┊ DfltDisklessStorPool             ┊ ovbh-pprod-xen10                         ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-pprod-xen10;DfltDisklessStorPool                         ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-pprod-xen11                         ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-pprod-xen11;DfltDisklessStorPool                         ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-pprod-xen12                         ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-pprod-xen12;DfltDisklessStorPool                         ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-pprod-xen13                         ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-pprod-xen13;DfltDisklessStorPool                         ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vprod-k8s04-worker01 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vprod-k8s04-worker01;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vprod-k8s04-worker02 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vprod-k8s04-worker02;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vprod-k8s04-worker03 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vprod-k8s04-worker03;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vprod-k8s04-worker07 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vprod-k8s04-worker07;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vtest-k8s02-worker01 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vtest-k8s02-worker01;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vtest-k8s02-worker02 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vtest-k8s02-worker02;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vtest-k8s02-worker03 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vtest-k8s02-worker03;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vtest-k8s02-worker04 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vtest-k8s02-worker04;DfltDisklessStorPool ┊
                            ┊ xcp-sr-linstor_group_thin_device ┊ ovbh-pprod-xen10                         ┊ LVM_THIN ┊ linstor_group/thin_device ┊     2.48 TiB ┊      3.49 TiB ┊ True         ┊ Ok    ┊ ovbh-pprod-xen10;xcp-sr-linstor_group_thin_device             ┊
                            ┊ xcp-sr-linstor_group_thin_device ┊ ovbh-pprod-xen11                         ┊ LVM_THIN ┊ linstor_group/thin_device ┊     2.42 TiB ┊      3.49 TiB ┊ True         ┊ Ok    ┊ ovbh-pprod-xen11;xcp-sr-linstor_group_thin_device             ┊
                            ┊ xcp-sr-linstor_group_thin_device ┊ ovbh-pprod-xen12                         ┊ LVM_THIN ┊ linstor_group/thin_device ┊     2.83 TiB ┊      3.49 TiB ┊ True         ┊ Ok    ┊ ovbh-pprod-xen12;xcp-sr-linstor_group_thin_device             ┊
                            ┊ xcp-sr-linstor_group_thin_device ┊ ovbh-pprod-xen13                         ┊ LVM_THIN ┊ linstor_group/thin_device ┊     4.12 TiB ┊      4.99 TiB ┊ True         ┊ Ok    ┊ ovbh-pprod-xen13;xcp-sr-linstor_group_thin_device             ┊
                            ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                            

                            On the new cluster

                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 rg l
                            ╭───────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                            ┊ ResourceGroup                           ┊ SelectFilter                                     ┊ VlmNrs ┊ Description ┊
                            ╞═══════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                            ┊ DfltRscGrp                              ┊ PlaceCount: 2                                    ┊        ┊             ┊
                            ╞┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄╡
                            ┊ sc-74e1434b-b435-587e-9dea-fa067deec898 ┊ PlaceCount: 3                                    ┊ 0      ┊             ┊
                            ┊                                         ┊ DisklessOnRemaining: True                        ┊        ┊             ┊
                            ┊                                         ┊ LayerStack: ['DRBD', 'STORAGE']                  ┊        ┊             ┊
                            ╞┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄╡
                            ┊ xcp-ha-linstor_group_thin_device        ┊ PlaceCount: 3                                    ┊ 0      ┊             ┊
                            ┊                                         ┊ StoragePool(s): xcp-sr-linstor_group_thin_device ┊        ┊             ┊
                            ┊                                         ┊ DisklessOnRemaining: False                       ┊        ┊             ┊
                            ╞┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄╡
                            ┊ xcp-sr-linstor_group_thin_device        ┊ PlaceCount: 3                                    ┊ 0      ┊             ┊
                            ┊                                         ┊ StoragePool(s): xcp-sr-linstor_group_thin_device ┊        ┊             ┊
                            ┊                                         ┊ DisklessOnRemaining: False                       ┊        ┊             ┊
                            ╰───────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                            jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 sp l
                            ╭──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                            ┊ StoragePool                      ┊ Node                                     ┊ Driver   ┊ PoolName                  ┊ FreeCapacity ┊ TotalCapacity ┊ CanSnapshots ┊ State ┊ SharedName                                                    ┊
                            ╞══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                            ┊ DfltDisklessStorPool             ┊ ovbh-pprod-xen01                         ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-pprod-xen01;DfltDisklessStorPool                         ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-pprod-xen02                         ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-pprod-xen02;DfltDisklessStorPool                         ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-pprod-xen03                         ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-pprod-xen03;DfltDisklessStorPool                         ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vprod-rancher01      ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vprod-rancher01;DfltDisklessStorPool      ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vprod-rancher02      ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vprod-rancher02;DfltDisklessStorPool      ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vprod-rancher03      ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vprod-rancher03;DfltDisklessStorPool      ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vtest-k8s01-worker01 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vtest-k8s01-worker01;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vtest-k8s01-worker02 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vtest-k8s01-worker02;DfltDisklessStorPool ┊
                            ┊ DfltDisklessStorPool             ┊ ovbh-vtest-k8s01-worker03 ┊ DISKLESS ┊                           ┊              ┊               ┊ False        ┊ Ok    ┊ ovbh-vtest-k8s01-worker03;DfltDisklessStorPool ┊
                            ┊ xcp-sr-linstor_group_thin_device ┊ ovbh-pprod-xen01                         ┊ LVM_THIN ┊ linstor_group/thin_device ┊    13.75 TiB ┊     13.97 TiB ┊ True         ┊ Ok    ┊ ovbh-pprod-xen01;xcp-sr-linstor_group_thin_device             ┊
                            ┊ xcp-sr-linstor_group_thin_device ┊ ovbh-pprod-xen02                         ┊ LVM_THIN ┊ linstor_group/thin_device ┊    13.75 TiB ┊     13.97 TiB ┊ True         ┊ Ok    ┊ ovbh-pprod-xen02;xcp-sr-linstor_group_thin_device             ┊
                            ┊ xcp-sr-linstor_group_thin_device ┊ ovbh-pprod-xen03                         ┊ LVM_THIN ┊ linstor_group/thin_device ┊    13.75 TiB ┊     13.97 TiB ┊ True         ┊ Ok    ┊ ovbh-pprod-xen03;xcp-sr-linstor_group_thin_device             ┊
                            ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                            
                            • J Offline
                              Jonathon
                              last edited by Jonathon

                              It turns out socat was not installed on the new LINSTOR cluster, and that is why I was getting that error message 😭
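For anyone hitting the same thing, here is a minimal sketch of how the missing-tool check could be scripted on each node. The helper names `have_tool` and `missing_tools` are made up for illustration, and the only tool this thread confirms as required is socat; anything else you pass in is your own assumption.

```shell
# Return 0 if the named program is found on PATH, non-zero otherwise.
have_tool() {
    command -v "$1" >/dev/null 2>&1
}

# Print the name of every listed tool that is NOT installed on this node.
# (Hypothetical helper; only socat is confirmed as required by this thread.)
missing_tools() {
    for tool in "$@"; do
        have_tool "$tool" || printf '%s\n' "$tool"
    done
}

# Prints "socat" if it is missing, prints nothing if it is present.
missing_tools socat
```

Run it on every satellite of both the sending and the receiving cluster. If it prints anything, install that package with the node's package manager before retrying the backup.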

                              I am now able to run the command:

                              jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 backup create linbit-velero-preprod-backup  pvc-086a5817-d813-41fe-86d8-3fac2ae2028f
                              SUCCESS:
                                  Suspended IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-vtest-k8s02-worker01' for snapshot
                              SUCCESS:
                                  Suspended IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-vtest-k8s02-worker03' for snapshot
                              SUCCESS:
                                  Suspended IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen13' for snapshot
                              SUCCESS:
                                  Suspended IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen12' for snapshot
                              SUCCESS:
                                  Suspended IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen10' for snapshot
                              SUCCESS:
                                  Took snapshot of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-vtest-k8s02-worker01'
                              SUCCESS:
                                  Took snapshot of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-vtest-k8s02-worker03'
                              SUCCESS:
                                  Took snapshot of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen13'
                              SUCCESS:
                                  Took snapshot of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen12'
                              SUCCESS:
                                  Took snapshot of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen10'
                              SUCCESS:
                                  Resumed IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-vtest-k8s02-worker01' after snapshot
                              SUCCESS:
                                  Resumed IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-vtest-k8s02-worker03' after snapshot
                              SUCCESS:
                                  Resumed IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen13' after snapshot
                              SUCCESS:
                                  Resumed IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen12' after snapshot
                              SUCCESS:
                                  Resumed IO of '[pvc-086a5817-d813-41fe-86d8-3fac2ae2028f]' on 'ovbh-pprod-xen10' after snapshot
                              INFO:
                                  Generated snapshot name for backup of resourcepvc-086a5817-d813-41fe-86d8-3fac2ae2028f to remote linbit-velero-preprod-backup
                              INFO:
                                  Shipping of resource pvc-086a5817-d813-41fe-86d8-3fac2ae2028f to remote linbit-velero-preprod-backup in progress.
                              SUCCESS:
                                  Started shipping of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f'
                              SUCCESS:
                                  Started shipping of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f'
                              SUCCESS:
                                  Started shipping of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f'
                              

                              But over an hour later it has still not finished.

                              jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 s l
                              ╭────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                              ┊ ResourceName                             ┊ SnapshotName                                  ┊ NodeNames                                            ┊ Volumes   ┊ CreatedOn           ┊ State      ┊
                              ╞════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                              ┊ pvc-086a5817-d813-41fe-86d8-3fac2ae2028f ┊ back_20241009_161658_5ttp634a                 ┊ ovbh-pprod-xen01                                     ┊ 0: 8 GiB  ┊ 2024-10-09 13:17:02 ┊ Restoring  ┊
                              ╰────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                              jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 rd l
                              ╭──────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                              ┊ ResourceName                                    ┊ Port ┊ ResourceGroup                           ┊ State ┊
                              ╞══════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                              ┊ pvc-086a5817-d813-41fe-86d8-3fac2ae2028f        ┊      ┊ DfltRscGrp                              ┊ ok    ┊
                              ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                              

                              This looks like it might be the same issue I had with the S3 backup.
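In the satellite log that follows, most lines are routine DeviceManager output, and the one that stands out is the socat `connect(...)` warning. As a rough sketch (the function name `socat_connect_errors` is made up, and the pattern is based only on the single socat line in this log, so other socat versions may format it differently), such lines could be filtered out of a long log like this:

```shell
# Read a satellite log on stdin and print "<host:port> <reason>" for every
# socat connect failure. Lines that do not match are dropped.
# (Hypothetical helper; the pattern is derived from one log line in this thread.)
socat_connect_errors() {
    sed -n 's/.*socat\[[0-9]*] E connect([0-9]*, AF=[0-9]* \([^,]*\), [0-9]*): \(.*\)$/\1 \2/p'
}

# Example using the warning line from this thread:
socat_connect_errors <<'EOF'
2024_10_09 16:17:03.037 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr: 2024/10/09 16:17:03 socat[23463] E connect(5, AF=2 10.2.0.10:12012, 16): No route to host
EOF
# prints: 10.2.0.10:12012 No route to host
```

The extracted address is the endpoint the sending satellite tried to reach. "No route to host" suggests checking routing and firewall rules between that satellite and the target address and port.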

                              
                              2024_10_09 16:17:00.885 [MainWorkerPool-11] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241009_161658' of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' registered.
                              2024_10_09 16:17:00.886 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000 size from 8390440 KiB to 8392704 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
                              2024_10_09 16:17:01.034 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' [DRBD] adjusted.
                              2024_10_09 16:17:01.262 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 96
                              2024_10_09 16:17:01.262 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 97
                              2024_10_09 16:17:01.301 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 97
                              2024_10_09 16:17:01.301 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 98
                              2024_10_09 16:17:02.765 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 98
                              2024_10_09 16:17:02.766 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 99
                              2024_10_09 16:17:02.774 [MainWorkerPool-1] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241009_161658' of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' registered.
                              2024_10_09 16:17:02.774 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000 size from 8390440 KiB to 8392704 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
                              2024_10_09 16:17:03.012 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' [DRBD] adjusted.
                              2024_10_09 16:17:03.037 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr: 2024/10/09 16:17:03 socat[23463] E connect(5, AF=2 10.2.0.10:12012, 16): No route to host
                              2024_10_09 16:17:03.092 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
                              2024_10_09 16:17:03.094 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
                              2024_10_09 16:17:03.095 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 94208 bytes remaining
                              2024_10_09 16:17:03.095 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 57856 bytes remaining
                              2024_10_09 16:17:03.099 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 94208 bytes remaining
                              2024_10_09 16:17:03.100 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 57856 bytes remaining
                              2024_10_09 16:17:03.109 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 82432 bytes remaining
                              2024_10_09 16:17:03.248 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
                              2024_10_09 16:17:03.249 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
                              2024_10_09 16:17:03.250 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 94208 bytes remaining
                              2024_10_09 16:17:03.251 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 57856 bytes remaining
                              2024_10_09 16:17:03.254 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 94208 bytes remaining
                              2024_10_09 16:17:03.256 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 57856 bytes remaining
                              2024_10_09 16:17:03.266 [pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000_back_20241009_161658] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 82432 bytes remaining
                              2024_10_09 16:17:03.282 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 99
                              2024_10_09 16:17:03.282 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 100
                              2024_10_09 16:17:03.288 [MainWorkerPool-3] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241009_161658' of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' registered.
                              2024_10_09 16:17:03.289 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000 size from 8390440 KiB to 8392704 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
                              2024_10_09 16:17:03.421 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' [DRBD] adjusted.
                              2024_10_09 16:17:03.644 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 100
                              2024_10_09 16:17:03.644 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 101
                              2024_10_09 16:17:03.674 [MainWorkerPool-5] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241009_161658' of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' registered.
                              2024_10_09 16:17:03.674 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-086a5817-d813-41fe-86d8-3fac2ae2028f_00000 size from 8390440 KiB to 8392704 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
                              2024_10_09 16:17:03.807 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' [DRBD] adjusted.
                              2024_10_09 16:17:04.031 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 101
                              2024_10_09 16:17:04.031 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 102
                              2024_10_09 16:47:03.682 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 102
                              2024_10_09 16:47:03.682 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 103
                              

                              Full log: linstor-satellite.txt

                              No Error Report on either cluster
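
The socat line in the log above ends in `No route to host`, which would suggest that the sending satellite could not open a TCP connection to 10.2.0.10:12012 at all. On Linux this error is often what a firewall `REJECT` rule with `icmp-host-prohibited` produces, though a routing problem is also possible. A minimal sketch for classifying a connect attempt from the sending node, assuming bash (for `/dev/tcp`) and coreutils `timeout` are available; `check_port` is a hypothetical helper name, not part of LINSTOR:

```shell
# check_port HOST PORT -> prints open | refused | unreachable | timeout | failed
# Hypothetical helper; uses bash's /dev/tcp pseudo-device for the connect.
check_port() {
    host="$1"
    port="$2"
    # LC_ALL=C keeps the error text in English so the matching below works.
    err=$(LC_ALL=C timeout 3 bash -c "exec 3<>/dev/tcp/${host}/${port}" 2>&1)
    rc=$?
    if [ "$rc" -eq 0 ]; then
        echo "open"
    elif [ "$rc" -eq 124 ]; then
        echo "timeout"          # timeout(1) gave up after 3 seconds
    else
        case "$err" in
            *"No route to host"*)   echo "unreachable" ;;
            *"Connection refused"*) echo "refused" ;;
            *)                      echo "failed" ;;
        esac
    fi
}

# Example (address and port taken from the log above, not run here):
#   check_port 10.2.0.10 12012
```

If nothing is listening on the port outside of an active shipment, `refused` would be the expected answer; `unreachable` would point at filtering or routing between the two clusters rather than at LINSTOR itself.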

                              • J Offline
                                Jonathon
                                last edited by Jonathon

                                I am curious whether anyone else can replicate this, since it is just an attempt to move a resource between two XOSTOR clusters. If it is just me, I can continue troubleshooting; otherwise it would be nice to know that it is an exercise in futility.

                                I am well aware that the release a few days ago has everyone swamped and that this can wait. It would just be great to know, as it would change migration plans.

                                https://linbit.com/drbd-user-guide/linstor-guide-1_0-en/#s-linstor-snapshots-shipping

                                thin-send-recv is needed to ship data when using LVM thin-provisioned volumes

                                Yeah, this seems to apply to any type of shipping, S3 or otherwise.

                                • J Offline
                                  Jonathon
                                  last edited by Jonathon

                                  So, I did more testing. It looks like thin_send_recv is not the problem, but socat may be.
                                  I am able to manually migrate a resource between XOSTOR (LINSTOR) clusters using thin_send_recv. I have included all steps below so that it can be replicated.

                                  And we know socat is used, because LINSTOR complains if it is not there.

                                  jonathon@jonathon-framework:~$ linstor --controller 10.2.0.19 backup ship newCluster pvc-086a5817-d813-41fe-86d8-3fac2ae2028f pvc-086a5817-d813-41fe-86d8-3fac2ae2028f
                                  INFO:
                                      Cannot use node 'ovbh-pprod-xen10' as it does not support the tool(s): SOCAT
                                  INFO:
                                      Cannot use node 'ovbh-pprod-xen12' as it does not support the tool(s): SOCAT
                                  INFO:
                                      Cannot use node 'ovbh-pprod-xen13' as it does not support the tool(s): SOCAT
                                  ERROR:
                                      Backup shipping of resource 'pvc-086a5817-d813-41fe-86d8-3fac2ae2028f' cannot be started since there is no node available that supports backup shipping.
                                  

                                  Using 1.0.1 thin_send_recv.

                                  [16:16 ovbh-pprod-xen11 ~]# thin_send --version
                                  1.0.1
                                  [16:16 ovbh-pprod-xen01 ~]# thin_recv --version
                                  1.0.1
                                  

                                  Versions of socat match.

                                  [16:16 ovbh-pprod-xen11 ~]# socat -V
                                  socat by Gerhard Rieger and contributors - see www.dest-unreach.org
                                  socat version 1.7.3.2 on Aug  4 2017 04:57:10
                                     running on Linux version #1 SMP Tue Jan 23 14:12:55 CET 2024, release 4.19.0+1, machine x86_64
                                  features:
                                    #define WITH_STDIO 1
                                    #define WITH_FDNUM 1
                                    #define WITH_FILE 1
                                    #define WITH_CREAT 1
                                    #define WITH_GOPEN 1
                                    #define WITH_TERMIOS 1
                                    #define WITH_PIPE 1
                                    #define WITH_UNIX 1
                                    #define WITH_ABSTRACT_UNIXSOCKET 1
                                    #define WITH_IP4 1
                                    #define WITH_IP6 1
                                    #define WITH_RAWIP 1
                                    #define WITH_GENERICSOCKET 1
                                    #define WITH_INTERFACE 1
                                    #define WITH_TCP 1
                                    #define WITH_UDP 1
                                    #define WITH_SCTP 1
                                    #define WITH_LISTEN 1
                                    #define WITH_SOCKS4 1
                                    #define WITH_SOCKS4A 1
                                    #define WITH_PROXY 1
                                    #define WITH_SYSTEM 1
                                    #define WITH_EXEC 1
                                    #define WITH_READLINE 1
                                    #define WITH_TUN 1
                                    #define WITH_PTY 1
                                    #define WITH_OPENSSL 1
                                    #undef WITH_FIPS
                                    #define WITH_LIBWRAP 1
                                    #define WITH_SYCLS 1
                                    #define WITH_FILAN 1
                                    #define WITH_RETRY 1
                                    #define WITH_MSGLEVEL 0 /*debug*/
                                  ...
                                  [16:17 ovbh-pprod-xen01 ~]# socat -V
                                  socat by Gerhard Rieger and contributors - see www.dest-unreach.org
                                  socat version 1.7.3.2 on Aug  4 2017 04:57:10
                                     running on Linux version #1 SMP Tue Jan 23 14:12:55 CET 2024, release 4.19.0+1, machine x86_64
                                  features:
                                    #define WITH_STDIO 1
                                    #define WITH_FDNUM 1
                                    #define WITH_FILE 1
                                    #define WITH_CREAT 1
                                    #define WITH_GOPEN 1
                                    #define WITH_TERMIOS 1
                                    #define WITH_PIPE 1
                                    #define WITH_UNIX 1
                                    #define WITH_ABSTRACT_UNIXSOCKET 1
                                    #define WITH_IP4 1
                                    #define WITH_IP6 1
                                    #define WITH_RAWIP 1
                                    #define WITH_GENERICSOCKET 1
                                    #define WITH_INTERFACE 1
                                    #define WITH_TCP 1
                                    #define WITH_UDP 1
                                    #define WITH_SCTP 1
                                    #define WITH_LISTEN 1
                                    #define WITH_SOCKS4 1
                                    #define WITH_SOCKS4A 1
                                    #define WITH_PROXY 1
                                    #define WITH_SYSTEM 1
                                    #define WITH_EXEC 1
                                    #define WITH_READLINE 1
                                    #define WITH_TUN 1
                                    #define WITH_PTY 1
                                    #define WITH_OPENSSL 1
                                    #undef WITH_FIPS
                                    #define WITH_LIBWRAP 1
                                    #define WITH_SYCLS 1
                                    #define WITH_FILAN 1
                                    #define WITH_RETRY 1
                                    #define WITH_MSGLEVEL 0 /*debug*/
                                  

                                  Migrating using only thin_send_recv works.
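
Since LINSTOR skips any node that does not report socat (the INFO lines earlier in this post), it can help to confirm on each node that the tools involved are on the PATH. A minimal sketch; `missing_tools` is a hypothetical helper, and the tool list is only what this thread mentions, not an authoritative list of what LINSTOR checks:

```shell
# Print the name of every given tool that cannot be found on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# Tools named in this thread; prints nothing when all of them are installed.
missing_tools socat zstd thin_send thin_recv
```

Running this on every node of both clusters gives a quick picture of which nodes could be expected to show up in the "does not support the tool(s)" messages.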

                                  • J Offline
                                    Jonathon
                                    last edited by

                                    The reason I suspect socat is that the commands fail when I use it as instructed by https://github.com/LINBIT/thin-send-recv

                                    [13:03 ovbh-pprod-xen11 ~]# thin_send linstor_group/pvc-12aca72c-d94a-4c09-8102-0a6646906f8d_00000_s_test 2>/dev/null | zstd | socat STDIN TCP:10.2.0.10:4321
                                    2024/10/28 13:04:59 socat[25701] E write(5, 0x55da36101da0, 8192): Broken pipe
                                    ...
                                    [13:03 ovbh-pprod-xen01 ~]# socat TCP-LISTEN:4321 STDOUT | zstd -d | thin_recv linstor_group/pvc-12aca72c-d94a-4c09-8102-0a6646906f8d_00000 2>/dev/null
                                    2024/10/28 13:04:59 socat[27039] E read(1, 0x560ef6ff4350, 8192): Bad file descriptor
                                    

                                    And the same thing happens if I exclude zstd from both commands.
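
One thing that may be worth ruling out: the receiver's error is a `read` on file descriptor 1, which is its own stdout. By default socat copies data in both directions, so it also tries to read from `STDOUT`, and that can fail when stdout is the write end of a pipe; the sender's `Broken pipe` would then just be a consequence of the receiver exiting. socat's `-u` option restricts the transfer to one direction (first address to second). A minimal sketch, assuming that this is what is happening here; `recv_stream` and `send_stream` are hypothetical wrapper names:

```shell
# Receiver side: start this first.
# -u = unidirectional, network -> stdout only, so socat never reads from fd 1.
recv_stream() {
    port="$1"
    socat -u "TCP-LISTEN:${port}" STDOUT
}

# Sender side: stdin -> network only.
send_stream() {
    host="$1"
    port="$2"
    socat -u STDIN "TCP:${host}:${port}"
}

# Usage mirroring the commands above (not run here):
#   receiver: recv_stream 4321 | zstd -d | thin_recv linstor_group/pvc-12aca72c-d94a-4c09-8102-0a6646906f8d_00000
#   sender:   thin_send linstor_group/pvc-12aca72c-d94a-4c09-8102-0a6646906f8d_00000_s_test | zstd | send_stream 10.2.0.10 4321
```

If the transfer still fails with `-u`, the bidirectional default is not the cause and the network path between the two hosts is the next thing to look at.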

                                    • splastunovS Offline
                                      splastunov
                                      last edited by

                                      Hi!

                                      I'm looking for a new storage cluster for XCP-ng, because Ceph RBD performance is very poor.
                                      The main question now: is it possible to build an XOSTOR (LINSTOR) cluster separately from XCP-ng and connect it over Ethernet?
                                      There is no information about such a scenario in this article.
                                      I would like to have a "compute" cluster of XCP-ng nodes with fast local NVMe disks, plus a dedicated storage cluster with a large number of HDDs, connected via Ethernet.

                                      The second question is about scaling.
                                      How can this storage cluster be scaled? Is it possible to add storage nodes online without interrupting clients (VMs)?

                                      Thank you

                                      • olivierlambertO Offline
                                        olivierlambert Vates 🪐 Co-Founder CEO
                                        last edited by

                                        LINSTOR can be reached over NFS or iSCSI, if I remember correctly. Your question is more for LINSTOR than for XCP-ng 🙂

                                        • splastunovS Offline
                                          splastunov @olivierlambert
                                          last edited by splastunov

                                          @olivierlambert NFS and iSCSI have a single point of failure. Yes, it is possible to deploy multipath iSCSI, but it is too complicated. I like Ceph RBD because it does not have a single point of failure.
                                          So I'm looking for something similar.

                                          From my point of view XOSTOR is a good idea, but in some cases there is no need to use all nodes as XCP-ng hosts. For example, storage cluster nodes do not need a large amount of RAM or a fast modern CPU.
                                          I think the best solution in my case would be to deploy the XOSTOR controller in the XCP-ng cluster, connected to a separate storage cluster.

                                          At first glance, I assume that it should be possible to connect the storage cluster to XCP-ng with this command:
                                          linstor resource create node1 test1 --diskless

                                          So the basic idea is to use the XCP-ng nodes for linstor-controller/linstor-satellite and the "storage" nodes as linstor-satellite only.

                                          • M Offline
                                            Mark C @splastunov
                                            last edited by

                                            @splastunov / all.

                                            Just on multipath iSCSI: I had this running for years, with separate switches carrying the 'A side' and 'B side' multipath networks to the resilient, redundant controllers of the DotHill iSCSI array.

                                            Once you have it configured, it is incredibly solid for what is a very budget solution. It just takes a bit of careful planning up front to make sure you wire it up in a way that is properly resilient, plus reliable monitoring to make sure that you notice when one of the resilient elements fails.

                                            We had a number of switch power supply failures over the 10-year period that were completely transparent to the services running on the connected XCP-ng and XenServer hosts.

                                            Similarly, we were able to do a full DotHill controller replacement without any downtime after one of the two controllers failed.

                                            We also replaced the hypervisor hardware a number of times over the lifetime of the platform.
