XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    XOSTOR hyperconvergence preview

    Scheduled Pinned Locked Moved XOSTOR
    458 Posts 50 Posters 748.9k Views 53 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Maelstrom96M Offline
      Maelstrom96 @ronan-a
      last edited by

      @ronan-a Do you know of a way to update a node name in Linstor? I've tried to look in their documentation and checked through CLI commands but couldn't find a way.

      ronan-aR 1 Reply Last reply Reply Quote 0
      • ronan-aR Offline
        ronan-a Vates 🪐 XCP-ng Team @Maelstrom96
        last edited by

        @Maelstrom96 Well there is no simple helper to do that using the CLI.

        So you can create a new node using:

        linstor node create --node-type Combined <NAME> <IP>
        

        Then you must evacuate the old node to preserve the replication count:

        linstor node evacuate <OLD_NAME>
        

        Next, you can change your hostname an restart the services on each host:

        systemctl stop linstor-controller
        systemctl restart linstor-satellites
        

        Finally you can delete the node:

        linstor node delete <OLD_NAME>
        

        After that you must recreate the diskless resources if necessary. Exec linstor advise r to see the commands to execute.

        Maelstrom96M 1 Reply Last reply Reply Quote 2
        • Maelstrom96M Offline
          Maelstrom96 @ronan-a
          last edited by

          @ronan-a Thanks a lot for that procedure.

          Ended up needing to do a little bit more, since for some reason, "evacuate" failed. I deleted the node and then went and just manually recreated my resources using:

          linstor resource create --auto-place +1 <resource_name>
          

          Which didn't work at first because the new node didn't have a storage-pool configured, which required this command to work (NOTE - This is only valid if your SR was setup as thin):

          linstor storage-pool create lvmthin <node_name> xcp-sr-linstor_group_thin_device linstor_group/thin_device
          

          Also, worth nothing that before actually re-creating the resources, you might want to manually clean up the lingering Logical Volumes that weren't cleaned up if evacuate failed.

          Find volumes with:

          lvdisplay
          

          and then delete them with:

          lvremove <LV Path>
          

          example:

          lvremove /dev/linstor_group/xcp-persistent-database_00000
          
          1 Reply Last reply Reply Quote 0
          • F Offline
            flibbi
            last edited by flibbi

            I've using xcp-ng with NFS Shared Storage for some months now and I am happy with it so far.
            I've some ssds in a 3 server setup and I'd like to test xostor. Before I will setup xostor, there are some questions regarding it, as I am only familiar with virtuozzo/acronis-storage and ceph so far. Are there the same restrictions for using xostor, that exists in ceph e.g.?

            • only enterprise ssds because of power loss protection
            • do not use Raids ( especially Raid 0 ) if the controller is capable of using HBA Mode. I've an Dell H330 Controller and if Raid is no problem, I'd like to setup OS with Hardware Raid 1 and xostor on the rest of the ssds with raid0 arrays per each disk. If HBA mode is prefered, I need to stick with Software Raid 1, I think. Software Raid 1 is working fine, but I've had some problems in the past if the boot drive of the mirror died...
            • I've installed xen-orchestra manually. Once xostor is installed, will the xostor button in xen orchestra will have a function or is it only available within the XOA appliance?

            Thanks for your answers!

            1 Reply Last reply Reply Quote 0
            • olivierlambertO Offline
              olivierlambert Vates 🪐 Co-Founder CEO
              last edited by olivierlambert

              1. That shouldn't be a problem, if it's bad on one disk for one host, it should be resync. It's not a big filesystem shared, it's only a block space split between each created virtual disk.
              2. That should work fine (@ronan-a will confirm but I don't think there's very low level optimization that could be affected by a RAID card?)
              3. XOSTOR UI is only available in XOA, but you'll be able to manage and have all the features from the CLI
              J 1 Reply Last reply Reply Quote 0
              • J Offline
                JeffBerntsen Top contributor @olivierlambert
                last edited by

                @flibbi, @olivierlambert RAID shouldn't be problem for XOSTOR. During some of my testing shortly after the preview was released, I was running it on software RAID 10 arrays on each of my test servers. As long as the RAID isn't some sort of "fake RAID" and is done in hardware, it should work fine.

                1 Reply Last reply Reply Quote 0
                • H Offline
                  ha_tu_su
                  last edited by

                  I read on the blog that XOSTOR has been officially released and wanted to test it. I have installed v8.2.1 of XCP-ng on the server nodes. On a separate computer in the management network I have XO built from sources. I have updated the hosts to the latest packages.

                  Then I started following instructions from the first post in the thread. I am getting error at the sr-create step.

                  [15:11 xcp-ng-vh1 ~]# xe sr-create type=linstor name-label=XOSTOR host-uuid=382d49a5-7435-425e-8588-f56e7a7711f8  device-config:group-name=linstor_group/thin_device device-config:redundancy=2 shared=true device-config:provisioning=thin
                  Error code: SR_BACKEND_FAILURE_202
                  Error parameters: , General backend error [opterr=['XENAPI_PLUGIN_FAILURE', 'non-zero exit', '', 'Traceback (most recent call last):\n  File "/etc/xapi.d/plugins/linstor-manager", line 24, in <module>\n    from linstorjournaler import LinstorJournaler\n  File "/opt/xensource/sm/linstorjournaler.py", line 19, in <module>\n    from linstorvolumemanager import LinstorVolumeManager\n  File "/opt/xensource/sm/linstorvolumemanager.py", line 20, in <module>\n    import linstor\nImportError: No module named linstor\n']], 
                  

                  I tried to find possible causes on the forums and it was mentioned that the linstor packages are not yet mature for 8.3 release and that python versions between 8.2 and 8.3 versions of xcp-ng can cause issues. I am using 8.2 branch though so not sure what I am missing here:

                  [15:12 xcp-ng-vh1 ~]# cat /etc/os-release 
                  NAME="XCP-ng"
                  VERSION="8.2.1"
                  ID="xenenterprise"
                  ID_LIKE="centos rhel fedora"
                  VERSION_ID="8.2.1"
                  PRETTY_NAME="XCP-ng 8.2.1"
                  ANSI_COLOR="0;31"
                  HOME_URL="http://xcp-ng.org/"
                  BUG_REPORT_URL="https://github.com/xcp-ng/xcp"
                  

                  Packages related to linstor on the system:

                  [20:11 xcp-ng-vh1 ~]# yum list | grep linstor
                  drbd.x86_64                        9.27.0-1.el7             @xcp-ng-linstor     
                  drbd-bash-completion.x86_64        9.27.0-1.el7             @xcp-ng-linstor     
                  drbd-pacemaker.x86_64              9.27.0-1.el7             @xcp-ng-linstor     
                  drbd-reactor.x86_64                1.4.0-1                  @xcp-ng-linstor     
                  drbd-udev.x86_64                   9.27.0-1.el7             @xcp-ng-linstor     
                  drbd-utils.x86_64                  9.27.0-1.el7             @xcp-ng-linstor     
                  drbd-xen.x86_64                    9.27.0-1.el7             @xcp-ng-linstor     
                  kmod-drbd.x86_64                   9.2.8_4.19.0+1-1         @xcp-ng-linstor     
                  linstor-client.noarch              1.21.1-1.xcpng8.2        @xcp-ng-linstor     
                  linstor-common.noarch              1.26.1-1.el7             @xcp-ng-linstor     
                  linstor-controller.noarch          1.26.1-1.el7             @xcp-ng-linstor     
                  linstor-satellite.noarch           1.26.1-1.el7             @xcp-ng-linstor     
                  python-linstor.noarch              1.21.1-1.xcpng8.2        @xcp-ng-linstor     
                  sm.x86_64                          2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                              @xcp-ng-linstor     
                  sm-rawhba.x86_64                   2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                              @xcp-ng-linstor     
                  tzdata-java.noarch                 2023c-1.el7              @xcp-ng-linstor     
                  xcp-ng-linstor.noarch              1.1-3.xcpng8.2           @xcp-ng-updates     
                  xcp-ng-release-linstor.noarch      1.3-1.xcpng8.2           @xcp-ng-updates     
                  drbd-debuginfo.x86_64              9.27.0-1.el7             xcp-ng-linstor      
                  drbd-heartbeat.x86_64              9.27.0-1.el7             xcp-ng-linstor      
                  sm-debuginfo.x86_64                2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                              xcp-ng-linstor      
                  sm-test-plugins.x86_64             2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                              xcp-ng-linstor      
                  sm-testresults.x86_64              2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                              xcp-ng-linstor     
                  

                  Any help appreciated.

                  Thanks.

                  H 1 Reply Last reply Reply Quote 0
                  • H Offline
                    ha_tu_su @ha_tu_su
                    last edited by

                    @ha_tu_su said in XOSTOR hyperconvergence preview:

                    I read on the blog that XOSTOR has been officially released and wanted to test it. I have installed v8.2.1 of XCP-ng on the server nodes. On a separate computer in the management network I have XO built from sources. I have updated the hosts to the latest packages.

                    Then I started following instructions from the first post in the thread. I am getting error at the sr-create step.

                    [15:11 xcp-ng-vh1 ~]# xe sr-create type=linstor name-label=XOSTOR host-uuid=382d49a5-7435-425e-8588-f56e7a7711f8  device-config:group-name=linstor_group/thin_device device-config:redundancy=2 shared=true device-config:provisioning=thin
                    Error code: SR_BACKEND_FAILURE_202
                    Error parameters: , General backend error [opterr=['XENAPI_PLUGIN_FAILURE', 'non-zero exit', '', 'Traceback (most recent call last):\n  File "/etc/xapi.d/plugins/linstor-manager", line 24, in <module>\n    from linstorjournaler import LinstorJournaler\n  File "/opt/xensource/sm/linstorjournaler.py", line 19, in <module>\n    from linstorvolumemanager import LinstorVolumeManager\n  File "/opt/xensource/sm/linstorvolumemanager.py", line 20, in <module>\n    import linstor\nImportError: No module named linstor\n']], 
                    

                    I tried to find possible causes on the forums and it was mentioned that the linstor packages are not yet mature for 8.3 release and that python versions between 8.2 and 8.3 versions of xcp-ng can cause issues. I am using 8.2 branch though so not sure what I am missing here:

                    [15:12 xcp-ng-vh1 ~]# cat /etc/os-release 
                    NAME="XCP-ng"
                    VERSION="8.2.1"
                    ID="xenenterprise"
                    ID_LIKE="centos rhel fedora"
                    VERSION_ID="8.2.1"
                    PRETTY_NAME="XCP-ng 8.2.1"
                    ANSI_COLOR="0;31"
                    HOME_URL="http://xcp-ng.org/"
                    BUG_REPORT_URL="https://github.com/xcp-ng/xcp"
                    

                    Packages related to linstor on the system:

                    [20:11 xcp-ng-vh1 ~]# yum list | grep linstor
                    drbd.x86_64                        9.27.0-1.el7             @xcp-ng-linstor     
                    drbd-bash-completion.x86_64        9.27.0-1.el7             @xcp-ng-linstor     
                    drbd-pacemaker.x86_64              9.27.0-1.el7             @xcp-ng-linstor     
                    drbd-reactor.x86_64                1.4.0-1                  @xcp-ng-linstor     
                    drbd-udev.x86_64                   9.27.0-1.el7             @xcp-ng-linstor     
                    drbd-utils.x86_64                  9.27.0-1.el7             @xcp-ng-linstor     
                    drbd-xen.x86_64                    9.27.0-1.el7             @xcp-ng-linstor     
                    kmod-drbd.x86_64                   9.2.8_4.19.0+1-1         @xcp-ng-linstor     
                    linstor-client.noarch              1.21.1-1.xcpng8.2        @xcp-ng-linstor     
                    linstor-common.noarch              1.26.1-1.el7             @xcp-ng-linstor     
                    linstor-controller.noarch          1.26.1-1.el7             @xcp-ng-linstor     
                    linstor-satellite.noarch           1.26.1-1.el7             @xcp-ng-linstor     
                    python-linstor.noarch              1.21.1-1.xcpng8.2        @xcp-ng-linstor     
                    sm.x86_64                          2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                                @xcp-ng-linstor     
                    sm-rawhba.x86_64                   2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                                @xcp-ng-linstor     
                    tzdata-java.noarch                 2023c-1.el7              @xcp-ng-linstor     
                    xcp-ng-linstor.noarch              1.1-3.xcpng8.2           @xcp-ng-updates     
                    xcp-ng-release-linstor.noarch      1.3-1.xcpng8.2           @xcp-ng-updates     
                    drbd-debuginfo.x86_64              9.27.0-1.el7             xcp-ng-linstor      
                    drbd-heartbeat.x86_64              9.27.0-1.el7             xcp-ng-linstor      
                    sm-debuginfo.x86_64                2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                                xcp-ng-linstor      
                    sm-test-plugins.x86_64             2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                                xcp-ng-linstor      
                    sm-testresults.x86_64              2.30.8-10.1.0.linstor.2.xcpng8.2
                                                                                xcp-ng-linstor     
                    

                    Any help appreciated.

                    Thanks.

                    Ok, I had 3 hosts in the pool. Above error I was getting on 2 hosts. Just to repeat the process cleanly given in the first post I tried steps on 3rd host and SR creation was successful.

                    Initially on the 2 hosts I had used the 'thick' version of command to prepare disks. Then I had deleted the lvm and used wipefs on disks and then redid steps using the 'thin' version of command. My guess is that the disks were not 'wiped' completely and then I got error during SR creation.

                    I am going to use gparted to wipe the disks properly and then redo steps. If that doesn't work, then nuke the install of xcp-ng and reinstall and then check. Will update the post accordingly.

                    Cheers.

                    H 1 Reply Last reply Reply Quote 0
                    • H Offline
                      ha_tu_su @ha_tu_su
                      last edited by

                      @ha_tu_su
                      After using gparted to wiping out all disks, sr-create command works as expected to create XOSTOR.

                      1 Reply Last reply Reply Quote 3
                      • H ha_tu_su referenced this topic on
                      • F Offline
                        ferrao @ronan-a
                        last edited by ferrao

                        @ronan-a and @Maelstrom96 I didn't get this hostname issue.

                        Does XOSTOR needs a fully functional DNS setup to work? Or the failure was local due to the local change of the hostname?

                        I didn't understand if the communication is done by IP addresses directly or if DNS name resolution is needed.

                        I'm particularly interested in this because with XOSTOR I'm considering virtualizing my pfSense firewall directly and get rid of the physical servers. And in this scenario in a case of a entire pool reboot I must guarantee that I will have the two pfSense VMs up and running, with the option to auto start after reboot, so I can access the entire infrastructure or else I'll be locked from outside.

                        ronan-aR 1 Reply Last reply Reply Quote 0
                        • ronan-aR Offline
                          ronan-a Vates 🪐 XCP-ng Team @ferrao
                          last edited by

                          @ferrao said in XOSTOR hyperconvergence preview:

                          Does XOSTOR needs a fully functional DNS setup to work? Or the failure was local due to the local change of the hostname?

                          No. But your LINSTOR node name must match the hostname. We use IPs to communicate between nodes and in our driver.

                          F 1 Reply Last reply Reply Quote 0
                          • F Offline
                            ferrao @ronan-a
                            last edited by

                            @ronan-a thanks. I've deployed it already with the script on the first post. Seems to be working. I've opted to used redundancy=3 in a 3 hosts setup. It's a lot of 'wasted' resources but seems to be the best option for performance and reliability.

                            May I ask now a licensing issue: if we upgrade to Vates VM, does the deployment mode on the first message is considered supported or everything will need to be done again from XOA?

                            Thanks.

                            ronan-aR 1 Reply Last reply Reply Quote 0
                            • ronan-aR Offline
                              ronan-a Vates 🪐 XCP-ng Team @ferrao
                              last edited by

                              @ferrao said in XOSTOR hyperconvergence preview:

                              May I ask now a licensing issue: if we upgrade to Vates VM, does the deployment mode on the first message is considered supported or everything will need to be done again from XOA?

                              Regarding XOSTOR Support Licenses: In general, we prefer our users to use a trial license through XOA. And if they are interested, they can subscribe to a commercial license.
                              To be more precise: the manual steps in this thread are still valid to configure an SR LINSTOR, no difference with the XOA commands. However, if you wish to suscribe to a support license from a pool without XOA nor trial license, we are quite strict on the fact that the infrastructure must be in a stable state.

                              1 Reply Last reply Reply Quote 1
                              • L Offline
                                lover
                                last edited by lover

                                Anyone else getting a 301 error?

                                http://mirrors.xcp-ng.org/8/8.2/base/x86_64/repodata/repomd.xml: [Errno 14] HTTPS Error 301 - Moved Permanently
                                Trying other mirror.
                                
                                
                                F M 2 Replies Last reply Reply Quote 0
                                • F Offline
                                  ferrao @lover
                                  last edited by

                                  @lover said in XOSTOR hyperconvergence preview:

                                  Anyone else getting a 301 error?

                                  http://mirrors.xcp-ng.org/8/8.2/base/x86_64/repodata/repomd.xml: [Errno 14] HTTPS Error 301 - Moved Permanently
                                  Trying other mirror.
                                  
                                  

                                  301 is not an error (as a failure) it's a redirect. Here it redirects correctly to a mirror nearby. In my case: https://mirror.uepg.br/xcp-ng/8/8.2/base/x86_64/repodata/repomd.xml

                                  L 1 Reply Last reply Reply Quote 1
                                  • L Offline
                                    lover @ferrao
                                    last edited by

                                    This post is deleted!
                                    1 Reply Last reply Reply Quote 0
                                    • M Offline
                                      Mark C @lover
                                      last edited by

                                      @lover

                                      See /etc/yum.repos.d/xcp-ng.repo and update all references from http:/ to https:/

                                      1 Reply Last reply Reply Quote 0
                                      • J Offline
                                        Jonathon
                                        last edited by

                                        Hello all!

                                        I have an issue with backing up to S3. I am hoping someone can point out the mistake I am making.
                                        Our xcp-ng hosts are all up to date.

                                        jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 remote l
                                        ╭─────────────────────────────────────────────────────────────────────────╮
                                        ┊ Name                 ┊ Type ┊ Info                                      ┊
                                        ╞═════════════════════════════════════════════════════════════════════════╡
                                        ┊ linbit-velero-backup ┊ S3   ┊ us-east-1.s3.wasabisys.com/velero-preprod ┊
                                        ╰─────────────────────────────────────────────────────────────────────────╯
                                        jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 backup create linbit-velero-backup pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d
                                        SUCCESS:
                                            Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-vtest-k8s01-worker02' for snapshot
                                        SUCCESS:
                                            Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen02' for snapshot
                                        SUCCESS:
                                            Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen03' for snapshot
                                        SUCCESS:
                                            Suspended IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen01' for snapshot
                                        SUCCESS:
                                            Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-vtest-k8s01-worker02'
                                        SUCCESS:
                                            Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen02'
                                        SUCCESS:
                                            Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen03'
                                        SUCCESS:
                                            Took snapshot of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen01'
                                        SUCCESS:
                                            Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-vtest-k8s01-worker02' after snapshot
                                        SUCCESS:
                                            Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen01' after snapshot
                                        SUCCESS:
                                            Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen02' after snapshot
                                        SUCCESS:
                                            Resumed IO of '[pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d]' on 'ovbh-pprod-xen03' after snapshot
                                        INFO:
                                            Generated snapshot name for backup of resourcepvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d to remote linbit-velero-backup
                                        INFO:
                                            Shipping of resource pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d to remote linbit-velero-backup in progress.
                                        SUCCESS:
                                            Started shipping of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d'
                                        SUCCESS:
                                            Started shipping of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d'
                                        SUCCESS:
                                            Started shipping of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d'
                                        jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 snapshot l
                                        ╭─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                                        ┊ ResourceName                             ┊ SnapshotName         ┊ NodeNames                                            ┊ Volumes   ┊ CreatedOn           ┊ State    ┊
                                        ╞═════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                                        ┊ pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d ┊ back_20241002_191139 ┊ ovbh-pprod-xen01, ovbh-pprod-xen02, ovbh-pprod-xen03 ┊ 0: 50 GiB ┊ 2024-10-02 16:11:40 ┊ Shipping ┊
                                        ╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                                        jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 backup list linbit-velero-backup
                                        ╭───────────────────────────────────────────────────────╮
                                        ┊ Resource ┊ Snapshot ┊ Finished at ┊ Based On ┊ Status ┊
                                        ╞═══════════════════════════════════════════════════════╡
                                        ╰───────────────────────────────────────────────────────╯
                                        jonathon@jonathon-framework:~$ linstor --controller 10.2.0.10 snapshot l
                                        ╭───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
                                        ┊ ResourceName                             ┊ SnapshotName         ┊ NodeNames                                            ┊ Volumes   ┊ CreatedOn           ┊ State      ┊
                                        ╞═══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╡
                                        ┊ pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d ┊ back_20241002_191139 ┊ ovbh-pprod-xen01, ovbh-pprod-xen02, ovbh-pprod-xen03 ┊ 0: 50 GiB ┊ 2024-10-02 16:11:40 ┊ Successful ┊
                                        ╰───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
                                        

                                        Nothing shows up on S3.
                                        And after, enabling logs by modifying /usr/share/linstor-server/lib/conf/logback.xml I see the following

                                        [19:15 ovbh-pprod-xen01 ~]# tail /var/log/linstor-satellite/linstor-Satellite.log -n 20
                                        2024_10_02 19:11:41.511 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
                                        2024_10_02 19:11:41.512 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
                                        2024_10_02 19:11:41.513 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 40960 bytes remaining
                                        2024_10_02 19:11:41.516 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr:   Device read short 82432 bytes remaining
                                        2024_10_02 19:11:41.543 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 42
                                        2024_10_02 19:11:41.543 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 43
                                        2024_10_02 19:11:41.552 [MainWorkerPool-5] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241002_191139' of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' registered.
                                        2024_10_02 19:11:41.553 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000 size from 52440040 KiB to 52441088 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
                                        2024_10_02 19:11:41.615 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' [DRBD] adjusted.
                                        2024_10_02 19:11:41.781 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 43
                                        2024_10_02 19:11:41.781 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 44
                                        2024_10_02 19:11:47.220 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - stdErr: Incomplete copy_data, 4194304 bytes missing.
                                        2024_10_02 19:11:47.295 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] WARN  LINSTOR/Satellite - SYSTEM - Exception occurred while checking for support of requester-pays on remote linbit-velero-backup. Defaulting to false
                                        2024_10_02 19:11:47.307 [MainWorkerPool-7] INFO  LINSTOR/Satellite - SYSTEM - Snapshot 'back_20241002_191139' of resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' registered.
                                        2024_10_02 19:11:47.309 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Aligning /dev/linstor_group/pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000 size from 52440040 KiB to 52441088 KiB to be a multiple of extent size 4096 KiB (from Storage Pool)
                                        2024_10_02 19:11:47.312 [shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139] ERROR LINSTOR/Satellite - SYSTEM -  [Report number 66FDD1AE-3AE91-000000]
                                        
                                        2024_10_02 19:11:47.398 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Resource 'pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d' [DRBD] adjusted.
                                        2024_10_02 19:11:47.561 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - End DeviceManager cycle 44
                                        2024_10_02 19:11:47.561 [DeviceManager] INFO  LINSTOR/Satellite - SYSTEM - Begin DeviceManager cycle 45
                                        

                                        The error

                                        [19:12 ovbh-pprod-xen01 ~]# cat /var/log/linstor-satellite/ErrorReport-66FDD1AE-3AE91-000000.log
                                        ERROR REPORT 66FDD1AE-3AE91-000000
                                        
                                        ============================================================
                                        
                                        Application:                        LINBIT® LINSTOR
                                        Module:                             Satellite
                                        Version:                            1.26.1
                                        Build ID:                           12746ac9c6e7882807972c3df56e9a89eccad4e5
                                        Build time:                         2024-02-22T05:27:50+00:00
                                        Error time:                         2024-10-02 19:11:47
                                        Node:                               ovbh-pprod-xen01
                                        Thread:                             shipping_pvc-7746af6f-d37e-4c5d-9f44-9616f2f9b33d_00000_back_20241002_191139
                                        
                                        ============================================================
                                        
                                        Reported error:
                                        ===============
                                        
                                        Category:                           RuntimeException
                                        Class name:                         AbortedException
                                        Class canonical name:               com.amazonaws.AbortedException
                                        Generated at:                       Method 'handleInterruptedException', Source file 'AmazonHttpClient.java', Line #880
                                        
                                        Error message:                      
                                        
                                        Call backtrace:
                                        
                                            Method                                   Native Class:Line number
                                            handleInterruptedException               N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:880
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:757
                                            access$500                               N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:715
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutionBuilderImpl:697
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient:561
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient:541
                                            invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5516
                                            invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5463
                                            abortMultipartUpload                     N      com.amazonaws.services.s3.AmazonS3Client:3620
                                            abortMultipart                           N      com.linbit.linstor.api.BackupToS3:199
                                            threadFinished                           N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:320
                                            run                                      N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:298
                                            run                                      N      java.lang.Thread:829
                                        
                                        Caused by:
                                        ==========
                                        
                                        Category:                           Exception
                                        Class name:                         SdkInterruptedException
                                        Class canonical name:               com.amazonaws.http.timers.client.SdkInterruptedException
                                        Generated at:                       Method 'checkInterrupted', Source file 'AmazonHttpClient.java', Line #935
                                        
                                        
                                        Call backtrace:
                                        
                                            Method                                   Native Class:Line number
                                            checkInterrupted                         N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:935
                                            checkInterrupted                         N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:921
                                            executeHelper                            N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:1115
                                            doExecute                                N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:814
                                            executeWithTimer                         N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:781
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:755
                                            access$500                               N      com.amazonaws.http.AmazonHttpClient$RequestExecutor:715
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient$RequestExecutionBuilderImpl:697
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient:561
                                            execute                                  N      com.amazonaws.http.AmazonHttpClient:541
                                            invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5516
                                            invoke                                   N      com.amazonaws.services.s3.AmazonS3Client:5463
                                            abortMultipartUpload                     N      com.amazonaws.services.s3.AmazonS3Client:3620
                                            abortMultipart                           N      com.linbit.linstor.api.BackupToS3:199
                                            threadFinished                           N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:320
                                            run                                      N      com.linbit.linstor.backupshipping.BackupShippingS3Daemon:298
                                            run                                      N      java.lang.Thread:829
                                        
                                        
                                        END OF ERROR REPORT.
                                        
                                        1 Reply Last reply Reply Quote 0
                                        • J Offline
                                          Jonathon
                                          last edited by Jonathon

                                          Ok, so, turns out this is because of the thin-send-recv package I build from https://github.com/LINBIT/thin-send-recv/tree/master

                                          I just swapped out the version I built for the last one I was able to get online to test, and it works.

                                          The last version I was able to get from any repository before they went 403 was thin-send-recv-1.0.1-1.x86_64.rpm.txt, I was able to get this from https://piraeus.daocloud.io/linbit/rpms/7/x86_64/thin-send-recv-1.0.1-1.x86_64.rpm. FYI https://packages.linbit.com/yum/sles12-sp2/drbd-9.0/x86_64/Packages/ returns 403's too so no point in looking for it there if they have it hosted.

                                          I built thin-send-recv-1.1.2-1.xcpng8.2.x86_64.rpm.txt using this doc I put together thin-send-recv.txt. But this package I built is resulting in the error posted previously.

                                          So I am a bit at a loss, I want to be able to use velero for backing up pvs which are not managed by an operator with backup capabilities, but I do not want to be stuck with this old version I can not update.

                                          Any advice would be greatly appreciated!

                                          1 Reply Last reply Reply Quote 0
                                          • J Offline
                                            Jonathon
                                            last edited by Jonathon

                                            Ok great, I manually built 1.0.1, and it works just like the package I got online, that means that what I am doing is working and the build process is correct.

                                            The bad new is there is a breaking change with v1.1.2, and I think I am potentially SOL. 😭
                                            I am going to build and test v1.1.0 and v1.1.1 to see which ones work. NVM v1.1.0 is also broken.

                                            So the change that breaks it is in here: https://github.com/LINBIT/thin-send-recv/compare/6b7c9002cd7716ff6ef93f5a5e8908032b81f853...e44f566ea0c975e2baa475868ebc176065a5b22d

                                            v1.0.1 might just be the version that works with the version of linstor, and whenever that gets updated it might call for a newer version of thin-send-recv.

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post