XOA gets disconnected very often
-
Hello,
When we perform usual tasks such as creating, removing, or migrating a VM, or opening a console, XOA gets disconnected for about 30 seconds and then comes back.
We had many issues because our PoC pool (hosted on OVHcloud Baremetal servers) was first installed using public IPs and was then reconfigured to use private IPs. For that we:
- reconfigured the management IP of every host
- changed the reference master IP in /etc/xensource/pool.conf
- generated new SSL certificates for every node of the pool
We may have missed a reconfiguration step due to the lack of documentation.
I might be wrong, but when XOA loses its connection, /var/log/xensource.log on the pool's master node shows:
Dec 27 21:35:45 xcp-ng-317 xapi: [ info||160 /var/lib/xcp/xapi|session_check D:468110568dff|session_check] Session check failed: the client used an illegal or expired session ref 'OpaqueRef:f679a584-7f81-dc17-8dde-c3a575a072ba'
Dec 27 21:35:45 xcp-ng-317 xapi: [error||160 /var/lib/xcp/xapi|dispatch:VIF.get_by_uuid D:dd849736b2fb|backtrace] session_check D:468110568dff failed with exception Server_error(SESSION_INVALID, [ OpaqueRef:f679a584-7f81-dc17-8dde-c3a575a072ba ])
Dec 27 21:35:45 xcp-ng-317 xapi: [error||160 /var/lib/xcp/xapi|dispatch:VIF.get_by_uuid D:dd849736b2fb|backtrace] Raised Server_error(SESSION_INVALID, [ OpaqueRef:f679a584-7f81-dc17-8dde-c3a575a072ba ])
Dec 27 21:35:45 xcp-ng-317 xapi: [error||160 /var/lib/xcp/xapi|dispatch:VIF.get_by_uuid D:dd849736b2fb|backtrace] 1/4 xapi Raised at file ocaml/xapi/session_check.ml, line 88
Dec 27 21:35:45 xcp-ng-317 xapi: [error||160 /var/lib/xcp/xapi|dispatch:VIF.get_by_uuid D:dd849736b2fb|backtrace] 2/4 xapi Called from file ocaml/xapi/server_helpers.ml, line 75
Dec 27 21:35:45 xcp-ng-317 xapi: [error||160 /var/lib/xcp/xapi|dispatch:VIF.get_by_uuid D:dd849736b2fb|backtrace] 3/4 xapi Called from file ocaml/xapi/server_helpers.ml, line 97
Dec 27 21:35:45 xcp-ng-317 xapi: [error||160 /var/lib/xcp/xapi|dispatch:VIF.get_by_uuid D:dd849736b2fb|backtrace] 4/4 xapi Called from file ocaml/libs/log/debug.ml, line 250
Dec 27 21:35:45 xcp-ng-317 xapi: [error||160 /var/lib/xcp/xapi|dispatch:VIF.get_by_uuid D:dd849736b2fb|backtrace]
Dec 27 21:35:45 xcp-ng-317 xapi: [ info||160 /var/lib/xcp/xapi|session.login_with_password D:f354a161de1a|xapi_session] Session.create trackid=c5635894fdaa50bf600bcc507ae8a7e3 pool=false uname= originator= is_local_superuser=true auth_user_sid= parent=trackid=9834f5af41c964e225f24279aefe4e49
Is it normal that XOA often loses its connection to the pool master? Is there anything we can do to avoid this?
Thanks in advance for your help!
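PS: to check whether the disconnections line up with these errors, the log can be scanned for SESSION_INVALID entries. Below is a minimal Python sketch, assuming the line layout seen in the excerpt above. The regular expression and the helper names (`parse_line`, `invalid_session_refs`) are illustrative only and not part of any XCP-ng tooling.

```python
import re
from collections import Counter

# Assumed layout, inferred from the excerpt in this post:
# "<Mon DD HH:MM:SS> <host> xapi: [<level>||<thread> <socket>|<task>|<module>] <message>"
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+"
    r"(?P<host>\S+)\s+xapi:\s+"
    r"\[\s*(?P<level>\w+)\|\|(?P<thread>\d+)\s+(?P<rest>[^\]]*)\]\s*"
    r"(?P<msg>.*)$"
)
REF_RE = re.compile(r"OpaqueRef:[0-9a-f-]+")


def parse_line(line):
    """Split one xapi log line into its fields, or return None if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    parts = m.group("rest").split("|")  # socket path, task, module
    return {
        "timestamp": m.group("ts"),
        "host": m.group("host"),
        "level": m.group("level"),
        "task": parts[1] if len(parts) > 1 else "",
        "message": m.group("msg"),
    }


def invalid_session_refs(lines):
    """Count how many SESSION_INVALID log lines mention each session ref."""
    counts = Counter()
    for line in lines:
        entry = parse_line(line)
        if entry and "SESSION_INVALID" in entry["message"]:
            for ref in REF_RE.findall(entry["message"]):
                counts[ref] += 1
    return counts


if __name__ == "__main__":
    # Path taken from this post; run on the pool master.
    with open("/var/log/xensource.log", errors="replace") as log:
        for ref, n in invalid_session_refs(log).most_common():
            print(n, ref)
```

Comparing the timestamps of the matching lines with the moments XOA drops should show whether the two are related.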
-
Hi,
Out of curiosity, do you have HA enabled on that pool?
-
@olivierlambert Yes, we do, with XOSTOR as the SR for a few VMs (this is a PoC on OVHcloud Baremetal servers).
-
Can you just disable HA and see if you still have the same disconnection problem?
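If you prefer the command line, HA can be checked and turned off from the pool master with the `xe` CLI (`xe pool-param-get ... param-name=ha-enabled` and `xe pool-ha-disable`). Here is a minimal Python sketch wrapping those two commands. The helper names are hypothetical, the pool UUID is a placeholder you would take from `xe pool-list`, and the script defaults to a dry run so nothing changes until you pass `dry_run=False`.

```python
import subprocess


def ha_status_command(pool_uuid):
    """Build the xe command that reads the pool's ha-enabled parameter."""
    return ["xe", "pool-param-get", f"uuid={pool_uuid}", "param-name=ha-enabled"]


def ha_disable_command():
    """Build the xe command that disables HA on the pool."""
    return ["xe", "pool-ha-disable"]


def run(cmd, dry_run=True):
    """Return the command as a string (dry run) or execute it and return its output."""
    if dry_run:
        return " ".join(cmd)
    result = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return result.stdout.strip()


if __name__ == "__main__":
    pool_uuid = "<pool-uuid>"  # placeholder: take it from `xe pool-list`
    print(run(ha_status_command(pool_uuid)))
    print(run(ha_disable_command()))
```

Once you have tested, HA can be enabled again the same way you set it up originally.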
-
@olivierlambert After disabling HA and performing many actions (VM deploys, start/stop/restart, VM console, etc.), I cannot reproduce the XOA disconnection issue at all.
It looks like everything is smoother, starting with VM deploy/start actions.