XCP-ng

    XOSTOR hyperconvergence preview

    • BlueToast @Danp

      @Danp Success with this - thanks for the assist. 🙂 Executed with great success:

      yum install xcp-ng-linstor
      yum install xcp-ng-release-linstor
      ./install --disks /dev/nvme0n1 --thin
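
      For anyone reproducing this: the install script prepares the disks but does not create the SR itself, so a quick sanity check afterwards is to confirm the volume group and thin pool it created (a sketch, assuming the default names linstor_group/thin_device used later in this thread):

      # Confirm the VG and thin pool created by the install script
      vgs linstor_group
      lvs linstor_group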
      
      • gb.123 @olivierlambert

        @olivierlambert

        Since XOSTOR is now supported on XCP-ng 8.3 LTS, should we use the same script, or is some other method required?
        Can you remove the heading which states the script is only compatible with 8.2 ?

        • olivierlambert Vates 🪐 Co-Founder CEO

          Ping @Team-Storage

          • peter_webbird

            Is CBT meant to be supported on XOSTOR?

             I've been experimenting with XOSTOR recently, but upon testing a delta backup, I noticed this warning:
            couldn't enable CBT

            The error message behind this is SR_OPERATION_NOT_SUPPORTED when calling Async.VDI.enable_cbt.
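
             For what it's worth, the same failure should be reproducible directly from the CLI (a sketch; the VDI UUID is a placeholder):

             # Try to enable CBT on a VDI that lives on the LINSTOR SR;
             # on this setup it fails with SR_OPERATION_NOT_SUPPORTED
             xe vdi-enable-cbt uuid=<vdi-uuid>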

            Running xe sr-param-list uuid={uuid} shows the following:

            [~]# xe sr-param-list uuid={...}
                uuid ( RO) : {...}
                name-label ( RW): CD6
                name-description ( RW): Array of Kioxia CD6 U.2 drives, one in each Host.
                host ( RO): <shared>
                allowed-operations (SRO): unplug; plug; PBD.create; update; PBD.destroy; VDI.resize; VDI.clone; scan; VDI.snapshot; VDI.mirror; VDI.create; VDI.destroy
                {...etc}
                type ( RO): linstor
                content-type ( RO): user
                shared ( RW): true
                introduced-by ( RO): <not in database>
                is-tools-sr ( RO): false
                other-config (MRW): auto-scan: true
                sm-config (MRO):
                {...etc}
            

            Compared to another SR, the following allowed-operations are missing:

            VDI.enable_cbt;
            VDI.list_changed_blocks;
            VDI.disable_cbt;
            VDI.data_destroy;
            VDI.set_on_boot
            

            Is this the expected behaviour? Note that this is using XCP-ng 8.2 (I've yet to test out 8.3).
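
             A quick way to make that comparison is to pull just the allowed-operations field from each SR (a sketch; both UUIDs are placeholders):

             # Compare the LINSTOR SR against another SR type
             xe sr-param-get uuid=<linstor-sr-uuid> param-name=allowed-operations
             xe sr-param-get uuid=<other-sr-uuid> param-name=allowed-operations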

            • olivierlambert Vates 🪐 Co-Founder CEO

              Ping @ronan-a or @dthenot

              • ronan-a Vates 🪐 XCP-ng Team @peter_webbird

                 @peter_webbird We've already had feedback on CBT and LINSTOR/DRBD; we don't necessarily recommend enabling it. We have a blocking dev card regarding a bug where the LVM lvchange command may fail on CBT volumes used by a XOSTOR SR. We also have other issues related to migration with CBT.
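
                 For anyone who already turned it on, a minimal sketch of how to find and disable CBT on affected VDIs with standard xe commands (the UUID is a placeholder):

                 # List VDIs that currently have CBT enabled
                 xe vdi-list cbt-enabled=true params=uuid,name-label
                 # Disable CBT on a given VDI
                 xe vdi-disable-cbt uuid=<vdi-uuid>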

                • gb.123

                  @ronan-a @dthenot @Team-Storage

                   Can you please clarify which method to use for installing XOSTOR on XCP-ng 8.3?

                   Simple:

                  yum install xcp-ng-linstor
                  yum install xcp-ng-release-linstor
                  ./install --disks /dev/nvme0n1 --thin
                  

                   Or the script in the first post?
                   Or some other script?

                  • dthenot Vates 🪐 XCP-ng Team @gb.123

                    @gb.123 Hello,
                     The instructions in the first post are still the way to go 🙂

                    • JeffBerntsen Top contributor @dthenot

                      @dthenot said in XOSTOR hyperconvergence preview:

                      @gb.123 Hello,
                       The instructions in the first post are still the way to go 🙂

                       I'm curious about that as well, but the first post says that the installation script is only compatible with 8.2 and doesn't mention 8.3. Is that still the case, or is the installation script now compatible with 8.3 as well? If not, is there an installation script that is compatible with 8.3?

                       I know that using XO is the recommended method for installation, but I'm interested in an installation script, as I would like to integrate XOSTOR installation into an XCP-ng installation script I already have that runs via PXE boot.

                      • dthenot Vates 🪐 XCP-ng Team @JeffBerntsen

                         @JeffBerntsen That's what I meant: the installation method written in the first post still works in 8.3, and the script still works as expected; it basically only creates the VG/LV needed on the hosts before you create the SR.
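
                         For the PXE use case, that preparation step boils down to something like this (a sketch assuming a thin setup and the default group/pool names used elsewhere in this thread; the real script does more validation):

                         # Roughly what the install script does for a thin setup
                         vgcreate linstor_group /dev/nvme0n1
                         lvcreate -l 100%FREE -T linstor_group/thin_device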

                        • JeffBerntsen Top contributor @dthenot

                          @dthenot said in XOSTOR hyperconvergence preview:

                           @JeffBerntsen That's what I meant: the installation method written in the first post still works in 8.3, and the script still works as expected; it basically only creates the VG/LV needed on the hosts before you create the SR.

                          Got it. Thanks!

                          • henri9813 @JeffBerntsen

                            Hello,

                             I plan to install my XOSTOR cluster on a pool of 7 nodes with 3 replicas, but not all nodes at once, because the disks are in use.
                             Consider:

                             • node 1
                             • node 2
                             • node ...
                             • node 5
                             • node 6
                             • node 7

                             with 2 disks in each:

                             • sda: 128GB for the OS
                             • sdb: 1TB for a local SR (for now 😄)

                             I emptied nodes 6 & 7.

                             So, here is what I plan to do:

                             • On ALL NODES: set up the LINSTOR packages

                             Run the install script on nodes 6 & 7 to add their disks:

                            node6# install.sh --disks /dev/sdb
                            node7# install.sh --disks /dev/sdb
                            

                             Then, configure the SR and the LINSTOR plugin manager as follows:

                             xe sr-create \
                                 type=linstor name-label=pool-01 \
                                 host-uuid=XXXX \
                                 device-config:group-name=linstor_group/thin_device \
                                 device-config:redundancy=3 \
                                 device-config:provisioning=thin \
                                 shared=true
                            

                             Normally, I should have a LINSTOR cluster running on 2 nodes (2 satellites and one controller, randomly placed) with only 2 disks, and therefore only 2/3 working replicas.

                             The cluster SHOULD be usable (am I right on this point?).
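
                             One way to sanity-check that state once the SR exists (a sketch; run from the host currently holding the LINSTOR controller):

                             # Nodes and storage pools should show the 2 prepared hosts
                             linstor node list
                             linstor storage-pool list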

                             The next step would be to move the VMs from node 5 onto it to evacuate node 5, and then add node 5 to the cluster with the following:

                            node5# install.sh --disks /dev/sdb
                            node5# xe host-call-plugin \
                              host-uuid=node5-uuid \
                              plugin=linstor-manager \
                              fn=addHost args:groupName=linstor_group/thin_device
                            

                             That should deploy a satellite on node 5 and add the disk.

                             I should then have 3/3 working replicas and can start to deploy the other nodes progressively.
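
                             To confirm that, each resource should now report 3 deployed replicas (a sketch, same controller host as above):

                             # Replica count per resource should now be 3/3
                             linstor resource list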

                             Am I right about the process?

                             As mentioned on the Discord, I will post my feedback and results from my setup once I've finalized it (maybe through a blog post somewhere).

                             Thanks for providing XOSTOR as open source; it's clearly the missing piece for this open-source virtualization stack (vs Proxmox).

                              • Jonathon @Jonathon

                              I have amazing news!

                               After the upgrade to XCP-ng 8.3, I retested Velero backup, and it all just works 😁

                              Completed Backup

                              jonathon@jonathon-framework:~$ velero --kubeconfig k8s_configs/production.yaml backup describe grafana-test
                              Name:         grafana-test
                              Namespace:    velero
                              Labels:       objectset.rio.cattle.io/hash=c2b5f500ab5d9b8ffe14f2c70bf3742291df565c
                                            velero.io/storage-location=default
                              Annotations:  objectset.rio.cattle.io/applied=H4sIAAAAAAAA/4SSQW/bPgzFvwvPtv9OajeJj/8N22HdBqxFL0MPlEQlWmTRkOhgQ5HvPsixE2yH7iji8ffIJ74CDu6ZYnIcoIMTeYpcOf7vtIICji4Y6OB/1MdxgAJ6EjQoCN0rYAgsKI5Dyk9WP0hLIqmi40qjiKfMcRlAq7pBY+py26qmbEi15a5p78vtaqe0oqbVVsO5AI+K/Ju4A6YDdKDXqrVtXaNqzU5traVVY9d6Uyt7t2nW693K2Pa+naABe4IO9hEtBiyFksClmgbUdN06a9NAOtvr5B4DDunA8uR64lGgg7u6rxMUYMji6OWZ/dhTeuIPaQ6os+gTFUA/tR8NmXd+TELxUfNA5hslHqOmBN13OF16ZwvNQShIqpZClYQj7qk6blPlGF5uzC/L3P+kvok7MB9z0OcCXPiLPLHmuLLWCfVfB4rTZ9/iaA5zHovNZz7R++k6JI50q89BXcuXYR5YT0DolkChABEPHWzW9cK+rPQx8jgsH/KQj+QT/frzXCdduc/Ca9u1Y7aaFvMu5Ang5Xz+HQAA//8X7Fu+/QIAAA
                                            objectset.rio.cattle.io/id=e104add0-85b4-4eb5-9456-819bcbe45cfc
                                            velero.io/resource-timeout=10m0s
                                            velero.io/source-cluster-k8s-gitversion=v1.33.4+rke2r1
                                            velero.io/source-cluster-k8s-major-version=1
                                            velero.io/source-cluster-k8s-minor-version=33
                              
                              Phase:  Completed
                              
                              
                              Namespaces:
                                Included:  grafana
                                Excluded:  <none>
                              
                              Resources:
                                Included cluster-scoped:    <none>
                                Excluded cluster-scoped:    volumesnapshotcontents.snapshot.storage.k8s.io
                                Included namespace-scoped:  *
                                Excluded namespace-scoped:  volumesnapshots.snapshot.storage.k8s.io
                              
                              Label selector:  <none>
                              
                              Or label selector:  <none>
                              
                              Storage Location:  default
                              
                              Velero-Native Snapshot PVs:  true
                              Snapshot Move Data:          true
                              Data Mover:                  velero
                              
                              TTL:  720h0m0s
                              
                              CSISnapshotTimeout:    30m0s
                              ItemOperationTimeout:  4h0m0s
                              
                              Hooks:  <none>
                              
                              Backup Format Version:  1.1.0
                              
                              Started:    2025-10-15 15:29:52 -0700 PDT
                              Completed:  2025-10-15 15:31:25 -0700 PDT
                              
                              Expiration:  2025-11-14 14:29:52 -0800 PST
                              
                              Total items to be backed up:  35
                              Items backed up:              35
                              
                              Backup Item Operations:  1 of 1 completed successfully, 0 failed (specify --details for more information)
                              Backup Volumes:
                                Velero-Native Snapshots: <none included>
                              
                                CSI Snapshots:
                                  grafana/central-grafana:
                                    Data Movement: included, specify --details for more information
                              
                                Pod Volume Backups: <none included>
                              
                              HooksAttempted:  0
                              HooksFailed:     0
                              

                              Completed Restore

                              jonathon@jonathon-framework:~$ velero --kubeconfig k8s_configs/production.yaml restore describe restore-grafana-test --details
                              Name:         restore-grafana-test
                              Namespace:    velero
                              Labels:       objectset.rio.cattle.io/hash=252addb3ed156c52d9fa9b8c045b47a55d66c0af
                              Annotations:  objectset.rio.cattle.io/applied=H4sIAAAAAAAA/3yRTW7zIBBA7zJrO5/j35gzfE2rtsomymIM45jGBgTjbKLcvaKJm6qL7kDwnt7ABdDpHfmgrQEBZxrJ25W2/85rSOCkjQIBrxTYeoIEJmJUyAjiAmiMZWRtTYhb232Q5EC88tquJDKPFEU6GlpUG5UVZdpUdZ6WZZ+niOtNWtR1SypvqC8buCYwYkfjn7oBwwAC8ipHpbqC1LqqZZWrtse228isrLqywapSdS0z7KPU4EQgwN+mSI8eezSYMgWG22lwKOl7/MgERzJmdChPs9veDL9IGfSbQRcGy+96IjszCCiyCRLQRo6zIrVd5AHEfuHhkIBmmp4d+a/3e9Dl8LPoCZ3T5hg7FvQRcR8nxt6XL7sAgv1MCZztOE+01P23cvmnPYzaxNtwuF4/AwAA//8k6OwC/QEAAA
                                            objectset.rio.cattle.io/id=9ad8d034-7562-44f2-aa18-3669ed27ef47
                              
                              Phase:                       Completed
                              Total items to be restored:  33
                              Items restored:              33
                              
                              Started:    2025-10-15 15:35:26 -0700 PDT
                              Completed:  2025-10-15 15:36:34 -0700 PDT
                              
                              Warnings:
                                Velero:     <none>
                                Cluster:    <none>
                                Namespaces:
                                  grafana-restore:  could not restore, ConfigMap:elasticsearch-es-transport-ca-internal already exists. Warning: the in-cluster version is different than the backed-up version
                                                    could not restore, ConfigMap:kube-root-ca.crt already exists. Warning: the in-cluster version is different than the backed-up version
                              
                              Backup:  grafana-test
                              
                              Namespaces:
                                Included:  grafana
                                Excluded:  <none>
                              
                              Resources:
                                Included:        *
                                Excluded:        nodes, events, events.events.k8s.io, backups.velero.io, restores.velero.io, resticrepositories.velero.io, csinodes.storage.k8s.io, volumeattachments.storage.k8s.io, backuprepositories.velero.io
                                Cluster-scoped:  auto
                              
                              Namespace mappings:  grafana=grafana-restore
                              
                              Label selector:  <none>
                              
                              Or label selector:  <none>
                              
                              Restore PVs:  true
                              
                              CSI Snapshot Restores:
                                grafana-restore/central-grafana:
                                  Data Movement:
                                    Operation ID: dd-ffa56e1c-9fd0-44b4-a8bb-8163f40a49e9.330b82fc-ca6a-423217ee5
                                    Data Mover: velero
                                    Uploader Type: kopia
                              
                              Existing Resource Policy:   <none>
                              ItemOperationTimeout:       4h0m0s
                              
                              Preserve Service NodePorts:  auto
                              
                              Restore Item Operations:
                                Operation for persistentvolumeclaims grafana-restore/central-grafana:
                                  Restore Item Action Plugin:  velero.io/csi-pvc-restorer
                                  Operation ID:                dd-ffa56e1c-9fd0-44b4-a8bb-8163f40a49e9.330b82fc-ca6a-423217ee5
                                  Phase:                       Completed
                                  Progress:                    856284762 of 856284762 complete (Bytes)
                                  Progress description:        Completed
                                  Created:                     2025-10-15 15:35:28 -0700 PDT
                                  Started:                     2025-10-15 15:36:06 -0700 PDT
                                  Updated:                     2025-10-15 15:36:26 -0700 PDT
                              
                              HooksAttempted:   0
                              HooksFailed:      0
                              
                              Resource List:
                                apps/v1/Deployment:
                                  - grafana-restore/central-grafana(created)
                                  - grafana-restore/grafana-debug(created)
                                apps/v1/ReplicaSet:
                                  - grafana-restore/central-grafana-5448b9f65(created)
                                  - grafana-restore/central-grafana-56887c6cb6(created)
                                  - grafana-restore/central-grafana-56ddd4f497(created)
                                  - grafana-restore/central-grafana-5f4757844b(created)
                                  - grafana-restore/central-grafana-5f69f86c85(created)
                                  - grafana-restore/central-grafana-64545dcdc(created)
                                  - grafana-restore/central-grafana-69c66c54d9(created)
                                  - grafana-restore/central-grafana-6c8d6f65b8(created)
                                  - grafana-restore/central-grafana-7b479f79ff(created)
                                  - grafana-restore/central-grafana-bc7d96cdd(created)
                                  - grafana-restore/central-grafana-cb88bd49c(created)
                                  - grafana-restore/grafana-debug-556845ff7b(created)
                                  - grafana-restore/grafana-debug-6fb594cb5f(created)
                                  - grafana-restore/grafana-debug-8f66bfbf6(created)
                                discovery.k8s.io/v1/EndpointSlice:
                                  - grafana-restore/central-grafana-hkgd5(created)
                                networking.k8s.io/v1/Ingress:
                                  - grafana-restore/central-grafana(created)
                                rbac.authorization.k8s.io/v1/Role:
                                  - grafana-restore/central-grafana(created)
                                rbac.authorization.k8s.io/v1/RoleBinding:
                                  - grafana-restore/central-grafana(created)
                                v1/ConfigMap:
                                  - grafana-restore/central-grafana(created)
                                  - grafana-restore/elasticsearch-es-transport-ca-internal(failed)
                                  - grafana-restore/kube-root-ca.crt(failed)
                                v1/Endpoints:
                                  - grafana-restore/central-grafana(created)
                                v1/PersistentVolume:
                                  - pvc-e3f6578f-08b2-4e79-85f0-76bbf8985b55(skipped)
                                v1/PersistentVolumeClaim:
                                  - grafana-restore/central-grafana(created)
                                v1/Pod:
                                  - grafana-restore/central-grafana-cb88bd49c-fc5br(created)
                                v1/Secret:
                                  - grafana-restore/fpinfra-net-cf-cert(created)
                                  - grafana-restore/grafana(created)
                                v1/Service:
                                  - grafana-restore/central-grafana(created)
                                v1/ServiceAccount:
                                  - grafana-restore/central-grafana(created)
                                  - grafana-restore/default(skipped)
                                velero.io/v2alpha1/DataUpload:
                                  - velero/grafana-test-nw7zj(skipped)
                              

                              Image of working restore pod, with correct data in PV
                              34d87db1-19ae-4348-8d4e-6599375d7634-image.png
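
                               For reference, the backup and restore above were created along these lines (a sketch matching the names in the describe output; exact flags can vary by Velero version):

                               # Back up the grafana namespace, moving CSI snapshot data to object storage
                               velero backup create grafana-test \
                                 --include-namespaces grafana \
                                 --snapshot-move-data

                               # Restore it into a separate namespace
                               velero restore create restore-grafana-test \
                                 --from-backup grafana-test \
                                 --namespace-mappings grafana:grafana-restore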

                               Velero installed from Helm: https://vmware-tanzu.github.io/helm-charts
                               Chart version: velero:11.1.0
                               Values:

                              ---
                              image:
                                repository: velero/velero
                                tag: v1.17.0
                              
                               # Whether to deploy the node-agent daemonset.
                              deployNodeAgent: true
                              
                              initContainers:
                                 - name: velero-plugin-for-aws
                                   image: velero/velero-plugin-for-aws:latest
                                   imagePullPolicy: IfNotPresent
                                   volumeMounts:
                                     - mountPath: /target
                                       name: plugins
                              
                              configuration:
                                defaultItemOperationTimeout: 2h
                                features: EnableCSI
                                defaultSnapshotMoveData: true
                              
                                backupStorageLocation:
                                  - name: default
                                    provider: aws
                                    bucket: velero
                                    config:
                                      region: us-east-1
                                      s3ForcePathStyle: true
                                      s3Url: https://s3.location
                              
                                # Destination VSL points to LINSTOR snapshot class
                                volumeSnapshotLocation:
                                  - name: linstor
                                    provider: velero.io/csi
                                    config:
                                      snapshotClass: linstor-vsc
                              
                              credentials:
                                useSecret: true
                                existingSecret: velero-user
                              
                              
                              metrics:
                                enabled: true
                              
                                serviceMonitor:
                                  enabled: true
                              
                                prometheusRule:
                                  enabled: true
                                  # Additional labels to add to deployed PrometheusRule
                                  additionalLabels: {}
                                  # PrometheusRule namespace. Defaults to Velero namespace.
                                  # namespace: ""
                                  # Rules to be deployed
                                  spec:
                                    - alert: VeleroBackupPartialFailures
                                      annotations:
                                        message: Velero backup {{ $labels.schedule }} has {{ $value | humanizePercentage }} partialy failed backups.
                                      expr: |-
                                        velero_backup_partial_failure_total{schedule!=""} / velero_backup_attempt_total{schedule!=""} > 0.25
                                      for: 15m
                                      labels:
                                        severity: warning
                                    - alert: VeleroBackupFailures
                                      annotations:
                                        message: Velero backup {{ $labels.schedule }} has {{ $value | humanizePercentage }} failed backups.
                                      expr: |-
                                        velero_backup_failure_total{schedule!=""} / velero_backup_attempt_total{schedule!=""} > 0.25
                                      for: 15m
                                      labels:
                                        severity: warning
                              

                               Also create the following VolumeSnapshotClass:

                              apiVersion: snapshot.storage.k8s.io/v1
                              kind: VolumeSnapshotClass
                              metadata:
                                name: linstor-vsc
                                labels:
                                  velero.io/csi-volumesnapshot-class: "true"
                              driver: linstor.csi.linbit.com
                              deletionPolicy: Delete
                              

                               We are using the Piraeus operator to use XOSTOR in k8s:
                              https://github.com/piraeusdatastore/piraeus-operator.git
                              Version: v2.9.1
                              Values:

                              ---
                              operator: 
                                resources:
                                  requests:
                                    cpu: 250m
                                    memory: 500Mi
                                  limits:
                                    memory: 1Gi
                              installCRDs: true
                              imageConfigOverride:
                              - base: quay.io/piraeusdatastore
                                components:
                                  linstor-satellite:
                                    image: piraeus-server
                                    tag: v1.29.0
                              tls:
                                certManagerIssuerRef:
                                  name: step-issuer
                                  kind: StepClusterIssuer
                                  group: certmanager.step.sm
                              

                               Then we just connect to the XOSTOR cluster as an external LINSTOR controller.
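
                               With the Piraeus operator v2, that connection is declared on the LinstorCluster resource (a sketch; the controller URL is a placeholder for the XOSTOR controller address):

                               apiVersion: piraeus.io/v1
                               kind: LinstorCluster
                               metadata:
                                 name: linstorcluster
                               spec:
                                 # Point the in-cluster satellites and CSI driver at the external XOSTOR controller
                                 externalController:
                                   url: http://<xostor-controller-ip>:3370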
