@ronan-a Since XOSTOR is supposed to be stable now, I figured I would try it out with a new setup of 3 newly installed 8.2 nodes.
I used the CLI to deploy it. It all went well, and the SR was quickly ready. I was even able to migrate a disk to the Linstor SR and boot the VM. However, after rebooting the master, it seems like the SR doesn't want to allow any disk migration, and manual Scan are failing. I've tried unmounting/remounting the SR fully, restarting the toolstack, but nothing seems to help. The disk that was on Linstor is still accessible and the VM is able to boot.
Here is the error I'm getting:
sr.scan
{
"id": "e1a9bf4d-26ad-3ef6-b4a5-db98d012e0d9"
}
{
"code": "SR_BACKEND_FAILURE_47",
"params": [
"",
"The SR is not available [opterr=Database is not mounted]",
""
],
"task": {
"uuid": "a467bd90-8d47-09cc-b8ac-afa35056ff25",
"name_label": "Async.SR.scan",
"name_description": "",
"allowed_operations": [],
"current_operations": {},
"created": "20240502T21:40:00Z",
"finished": "20240502T21:40:01Z",
"status": "failure",
"resident_on": "OpaqueRef:b3e2f390-f45f-4614-a150-1eee53f204e1",
"progress": 1,
"type": "<none/>",
"result": "",
"error_info": [
"SR_BACKEND_FAILURE_47",
"",
"The SR is not available [opterr=Database is not mounted]",
""
],
"other_config": {},
"subtask_of": "OpaqueRef:NULL",
"subtasks": [],
"backtrace": "(((process xapi)(filename lib/backtrace.ml)(line 210))((process xapi)(filename ocaml/xapi/storage_access.ml)(line 32))((process xapi)(filename lib/xapi-stdext-pervasives/pervasiveext.ml)(line 24))((process xapi)(filename lib/xapi-stdext-pervasives/pervasiveext.ml)(line 35))((process xapi)(filename ocaml/xapi/message_forwarding.ml)(line 131))((process xapi)(filename lib/xapi-stdext-pervasives/pervasiveext.ml)(line 24))((process xapi)(filename ocaml/xapi/rbac.ml)(line 205))((process xapi)(filename ocaml/xapi/server_helpers.ml)(line 95)))"
},
"message": "SR_BACKEND_FAILURE_47(, The SR is not available [opterr=Database is not mounted], )",
"name": "XapiError",
"stack": "XapiError: SR_BACKEND_FAILURE_47(, The SR is not available [opterr=Database is not mounted], )
at Function.wrap (file:///opt/xo/xo-builds/xen-orchestra-202404270302/packages/xen-api/_XapiError.mjs:16:12)
at default (file:///opt/xo/xo-builds/xen-orchestra-202404270302/packages/xen-api/_getTaskResult.mjs:11:29)
at Xapi._addRecordToCache (file:///opt/xo/xo-builds/xen-orchestra-202404270302/packages/xen-api/index.mjs:1029:24)
at file:///opt/xo/xo-builds/xen-orchestra-202404270302/packages/xen-api/index.mjs:1063:14
at Array.forEach (<anonymous>)
at Xapi._processEvents (file:///opt/xo/xo-builds/xen-orchestra-202404270302/packages/xen-api/index.mjs:1053:12)
at Xapi._watchEvents (file:///opt/xo/xo-builds/xen-orchestra-202404270302/packages/xen-api/index.mjs:1226:14)"
}
I quickly glanced over the source code and the SM logs to see if I could identify what was going on but it doesn't seem to be a simple thing.
Logs from SM:
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] LinstorSR.scan for e1a9bf4d-26ad-3ef6-b4a5-db98d012e0d9
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] Raising exception [47, The SR is not available [opterr=Database is not mounted]]
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] lock: released /var/lock/sm/e1a9bf4d-26ad-3ef6-b4a5-db98d012e0d9/sr
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] ***** generic exception: sr_scan: EXCEPTION <class 'SR.SROSError'>, The SR is not available [opterr=Database is not mounted]
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] File "/opt/xensource/sm/SRCommand.py", line 110, in run
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] return self._run_locked(sr)
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] File "/opt/xensource/sm/SRCommand.py", line 159, in _run_locked
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] rv = self._run(sr, target)
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] File "/opt/xensource/sm/SRCommand.py", line 364, in _run
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] return sr.scan(self.params['sr_uuid'])
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] File "/opt/xensource/sm/LinstorSR", line 536, in wrap
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] return load(self, *args, **kwargs)
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] File "/opt/xensource/sm/LinstorSR", line 521, in load
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] return wrapped_method(self, *args, **kwargs)
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] File "/opt/xensource/sm/LinstorSR", line 381, in wrapped_method
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] return method(self, *args, **kwargs)
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] File "/opt/xensource/sm/LinstorSR", line 777, in scan
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242] opterr='Database is not mounted'
May 2 13:22:02 xcp-ng-labs-host01 SM: [19242]