
    XO server loses pool and hosts momentarily, timeout error

      olivierlambert Vates 🪐 Co-Founder CEO

      And it's good advice 🙂

        felibb @Andrew

        @Andrew said in XO server loses pool and hosts momentarily, timeout error:

        some issues with undici that were resolved in a later commit 0794a63

        Tried 79c9ef0 (1 day older than 0794a63), seeing timeouts.

        @olivierlambert said in XO server loses pool and hosts momentarily, timeout error:

        1. Can you try to use XOA in latest release channel in the same environment and see if you also have the issue?

        Unsure I understand what you are referring to, can you please clarify?

        1. Is your XO far away from the pool in terms of network latency?

        I would expect the latency to be quite low: the XOA VM lives on the same pool and has an IP in the same subnet as the 10Gx2 bond interface on each host. This is not, however, the same 1G network as the one marked with the "Management" blue bubble in the Host network tab; the two are different subnets. Can this have an effect?
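One way to put a number on "quite low" is to time TCP connections from the XO VM to a pool host on each network. The following is a minimal sketch, not an XO tool: the function names and the choice of port 443 are assumptions made for illustration, and TCP connect time is only a rough proxy for what XO actually experiences.

```python
import socket
import statistics
import time


def connect_latency(host: str, port: int = 443, timeout: float = 5.0) -> float:
    """Return the time in seconds needed to open one TCP connection."""
    start = time.perf_counter()
    # create_connection raises OSError on failure or timeout.
    with socket.create_connection((host, port), timeout=timeout):
        pass
    return time.perf_counter() - start


def summarize(samples: list[float]) -> dict[str, float]:
    """Summarize latency samples (seconds) as min/median/max in milliseconds."""
    return {
        "min_ms": min(samples) * 1000,
        "median_ms": statistics.median(samples) * 1000,
        "max_ms": max(samples) * 1000,
    }
```

Running `summarize([connect_latency(host) for _ in range(20)])` once with a host address on the bond subnet and once with its address on the Management subnet would show whether the two paths differ noticeably.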

        1. Your OS is Debian 11, IDK if that could cause the problem (XOA is on Debian 12).

        dist-upgrade is fast and easy, I can definitely try that.

        @julien-f said in XO server loses pool and hosts momentarily, timeout error:

        If you can, please test the xen-api-blocking branch and let me know if that helps.

        ce15ef6 deployed, seeing timeouts.

          olivierlambert Vates 🪐 Co-Founder CEO

          @felibb I'm talking about using our pre-baked/turnkey virtual appliance, which you can easily deploy from https://vates.tech/deploy

          1. Register
          2. Update and select "latest" release channel
          3. Test

          This will let you check whether it's your setup or XO.

            felibb @olivierlambert

            @olivierlambert right, XO vs. XOA, gotcha. XOA seems to work fine, no timeouts for about half an hour. I did select the "Management" LAN for it.

            I think the next step for me would be to upgrade my old XO to bookworm + latest commit in master. Then I probably can try a fresh VM with bookworm + XO latest commit in master + interface in mgmt LAN.

              olivierlambert Vates 🪐 Co-Founder CEO

              Okay so XOA works fine on both stable & latest channels, fully up to date right? Double checking to be 100% sure 🙂

                felibb @olivierlambert

                @olivierlambert both channels seem to work fine, yes.

                  olivierlambert Vates 🪐 Co-Founder CEO

                  Okay so it's clearly something related to your source installation and/or an interaction with your setup 🙂 Thanks for the feedback!

                    felibb @felibb

                    @felibb said in XO server loses pool and hosts momentarily, timeout error:

                    upgrade my old XO to bookworm + latest commit in master

                    Welp, that didn't help much, still seeing timeouts. Also, neither XO nor XOA shows the VM's own IP in the GUI anymore. The dist-upgrade renamed the interface from eth0 to enX0, and I had to edit /etc/network/interfaces to get the network back up. I can connect now, but the GUI still says "No IP record". Management agent 8.0.50-1 detected, in case it matters.
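The /etc/network/interfaces edit after an interface rename can be sketched as a whole-word substitution. This is an illustrative helper only: the function name is hypothetical, and the new name `enX0` is an assumption here, so use whatever `ip link` reports on the upgraded VM.

```python
import re


def rename_interface(config: str, old: str, new: str) -> str:
    """Replace whole-word occurrences of an interface name in ifupdown config text."""
    # Lookarounds keep names such as "eth01" untouched while still matching
    # alias and VLAN forms such as "eth0:1" or "eth0.100".
    pattern = re.compile(rf"(?<!\w){re.escape(old)}(?!\w)")
    return pattern.sub(new, config)
```

Applied to the file's text, `rename_interface(text, "eth0", "enX0")` would rewrite both the `auto` and `iface` stanzas; keeping a backup of the original file before writing the result back is a sensible precaution.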

                    Fresh VM setup to be tested another day.

                      felibb @felibb

                      (Replying to my previous post, a bit off-topic for the thread, but having installed https://github.com/xenserver/xe-guest-utilities/releases/tag/v8.4.0 manually, I see the IP in GUI now, but XOA says "Management agent 8.3.60-1 detected")

                        felibb

                        So I booted a pre-bookworm-upgrade XO VM I had created earlier, moved its network to the Management LAN, and installed 6444f88 (the latest as of writing this). No timeouts anymore. It seems that the Debian version doesn't matter here, and the code is fine (as we sort of determined already), but perhaps my networks are not routed fast enough for XO? It makes sense to put a mgmt appliance into the mgmt LAN, of course; however, the older code that I tested (pre-bfb8d3b) did not have this issue. So maybe a combination of the new undici piece and slightly higher latency is causing it for me?

                          olivierlambert Vates 🪐 Co-Founder CEO

                          That's great feedback @felibb, thanks!

                          We'll try to dig into this to see if that lead takes us somewhere 🙂
