XCP-ng

    Issues joining a pool with fewer PIFs on the newest host

      DustinB @afmart_dei

      @afmart_dei I'd be surprised that you can't add this host to the pool because it has a different number of PIFs... more likely than not the version mismatch is what is getting you hung up here.

      Is there any way to update everything to the same level? Once that is done, you should be able to add this host to the pool.
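One way to check for a version mismatch is to ask XAPI what each host reports. A minimal sketch, assuming it runs in dom0 with the standard `xe` CLI; the function name `list_host_versions` is made up for illustration:

```shell
# Sketch: print "<host name><TAB><product version>" for every host the local XAPI knows.
# list_host_versions is a hypothetical helper name, not part of xe.
list_host_versions() {
    for uuid in $(xe host-list --minimal | tr ',' ' '); do
        name=$(xe host-param-get uuid="$uuid" param-name=name-label)
        # software-version is a map; product_version is the key holding e.g. 8.2.1
        version=$(xe host-param-get uuid="$uuid" param-name=software-version param-key=product_version)
        printf '%s\t%s\n' "$name" "$version"
    done
}
```

Running `list_host_versions` on the pool master and again on the standalone new host should show whether the two sides differ.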

      As for Windows XP and CentOS 4: CentOS 4 was EoL'd in 2012 (only 14 years ago), so that's likely going to be a hard no. XP went EoL in 2014, so it's likely in the same boat.

      The issue is likely that no one wants to program on old-ass OSes to support something that the original dev no longer supports.

        acebmxer

        @afmart_dei

        I had similar issues recently, but not exactly the same. I was able to add the second host to the pool, but it did not work because in my case eth0 was host management, eth4 storage, eth5 migration, and eth7 the VM network. My second host at the time did not have eth7.

        Then I further messed myself up by using the second host as its own pool attached to the same storage, and it went downhill from there, but it was an easy fix.
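A mismatch like the missing eth7 can be spotted by comparing device names between an existing pool member and the new host. A rough sketch, assuming the `xe` CLI in dom0; `physical_devices` and `missing_devices` are hypothetical helper names, and the comparison itself works on plain comma-separated lists so it can be fed output collected on either side:

```shell
# Comma-separated device names (eth0,eth1,...) of the physical PIFs on one host.
# $1 is the host UUID as shown by "xe host-list".
physical_devices() {
    xe pif-list host-uuid="$1" physical=true params=device --minimal
}

# Print every device from the first comma-separated list that is absent from the second.
missing_devices() {
    for dev in $(printf '%s' "$1" | tr ',' ' '); do
        printf '%s\n' "$2" | tr ',' '\n' | grep -qxF -- "$dev" || printf '%s\n' "$dev"
    done
}
```

For example, `missing_devices "eth0,eth4,eth5,eth7" "$(physical_devices <new-host-uuid>)"` would list the devices the pool expects but the new host lacks.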

          afmart_dei @DustinB

          @DustinB The issue is a lack of pool resources to keep all prod machines up. If I can't join the new host without updating, I'll have to schedule some downtime.

            afmart_dei

            Update:
            I'm having issues resetting the network on the new host, but it looks like the issue is that eth6 and eth7 are the fiber ports.

            I'm going to reinstall the new host, reorder eth2 and eth3 to eth6 and eth7, and then try to rejoin.

              afmart_dei

              Update 2:
              Renaming eth2 and eth3 to eth6 and eth7 worked like a charm.

              I had to do a fresh install because the emergency network reset didn't work.

              It's hard to tell which PIFs belong to each bond in XO (at least in the source-compiled version); I needed to use the old XCP-ng Center to figure it out.
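Bond membership can also be read from the command line. A hedged sketch, assuming the `xe` CLI in dom0 and that the PIF field names `bond-slave-of`, `device` and `host-name-label` match what `xe pif-param-list` shows on your version; `list_bond_members` is a hypothetical helper name:

```shell
# Sketch: print one line per bond in the form "<host>: <bond device>: <member devices>".
list_bond_members() {
    for bond in $(xe bond-list --minimal | tr ',' ' '); do
        # The bond's master PIF carries the bondN device name and the host it lives on
        master=$(xe bond-list uuid="$bond" params=master --minimal)
        host=$(xe pif-param-get uuid="$master" param-name=host-name-label)
        device=$(xe pif-param-get uuid="$master" param-name=device)
        # Member PIFs point back at the bond through bond-slave-of
        members=$(xe pif-list bond-slave-of="$bond" params=device --minimal)
        printf '%s: %s: %s\n' "$host" "$device" "$members"
    done
}
```

Calling `list_bond_members` on the pool master should print a line for each bond on each host.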

              This thread can be closed.

              still having some network issues :S

                afmart_dei

                Update 3:
                On a fresh install, after renaming eth2 and eth3 to eth6 and eth7 to match the active interfaces on the pool and then joining the pool, something happens and the new host's dom0 loses its network interfaces.
                An emergency network reset does nothing: the network doesn't come up, and any xe command returns nothing.

                It seems that being two interfaces short of the eight on the other hosts is a real issue.

                Any feedback or recommendations are welcome. The docs don't cover this, but is it possible to delete unused networks associated with unused PIFs?

                  olivierlambert Vates 🪐 Co-Founder CEO

                  Ping @Team-XAPI-Network

                    afmart_dei

                    The issue seems to be synchronizing the network configurations.

                    The pool management interface is configured on bond1 (eth6 + eth7), which is set to LACP; both switch ports are already configured for LACP.

                    bond0 (eth0 + eth1) is created, but bond1 fails, and the management interface is never assigned to bond1.

                    Networks 2 and 3 are active on the pool even though they are never used.
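For checking where management ended up and whether LACP negotiated, something along these lines may help. This is only a sketch: it assumes the `xe` CLI in dom0 and that the host uses the Open vSwitch network backend (so `ovs-appctl` is available); `management_device` and `bond_lacp_status` are hypothetical helper names:

```shell
# Device name of the PIF currently flagged as the management interface on a host.
# $1 is the host UUID.
management_device() {
    xe pif-list host-uuid="$1" management=true params=device --minimal
}

# Show Open vSwitch's view of a bond and its LACP negotiation.
# $1 is the bond name as seen by dom0, e.g. bond1. Assumes the OVS backend.
bond_lacp_status() {
    ovs-appctl bond/show "$1"
    ovs-appctl lacp/show "$1"
}
```

If `management_device <host-uuid>` does not print `bond1` after the join, the management interface was not moved onto the bond.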

                      semarie @afmart_dei

                      @afmart_dei Could you share the /var/log/xensource.log of the joining host? Ideally the whole part after the host started (you could look at the file before rebooting and strip all lines before the last line you saw).
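Stripping everything up to the last line seen before the reboot can be scripted. A minimal sketch using only standard tools; `log_after_marker` is a hypothetical helper name, and the marker is whatever text you noted from the last line before rebooting:

```shell
# Print only the lines that come after the LAST line containing the marker text.
# $1 = marker text (matched literally), $2 = log file.
# If the marker is not found, the whole file is printed.
log_after_marker() {
    last=$(grep -nF -- "$1" "$2" | tail -n 1 | cut -d: -f1)
    if [ -z "$last" ]; then
        cat "$2"
        return
    fi
    tail -n +"$((last + 1))" "$2"
}
```

For example, `log_after_marker "<text of last line seen>" /var/log/xensource.log > xensource_after_boot.log` writes the trimmed log to a separate file (the output file name is arbitrary).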

                        afmart_dei

                        Update 4:

                        Hello,
                        After a few tries, I attempted once more to add the host to the pool, this time capturing the logs in another terminal, and it seems to have worked.
                        I think I followed the same steps the previous time, but maybe I did something different, like using pif-scan to introduce the reordered interfaces.

                        This is how I've managed to get it to work:

                        All hosts are on version 8.2.1 and fully up to date.

                        On the new host: disconnect management, forget the PIFs, reorder the interfaces, reboot, add the interfaces manually (using pif-introduce, not pif-scan), reboot, re-enable management, join the pool.

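                        # Take the old interfaces down and look up their PIF UUIDs
                        ifconfig eth2 down
                        ifconfig eth3 down
                        xe pif-list device=eth2
                        xe pif-list device=eth3
                        # Forget the old PIF records (UUIDs taken from the pif-list output above)
                        xe pif-forget uuid=9104e54c-6c82-5d83-b9fc-1d2b73d5d6f1
                        xe pif-forget uuid=27abec81-6861-796e-9abe-3b2653444c8f

                        # Rename the two NICs by MAC address so they become eth6 and eth7
                        interface-rename --update eth6=14:23:f2:24:5a:80 eth7=14:23:f2:24:5a:81

                        reboot now

                        # After the reboot, check the new naming and bring the interfaces up
                        interface-rename --list
                        ifconfig eth6 up
                        ifconfig eth7 up

                        # Get the host UUID needed by pif-introduce
                        xe host-list

                        # Recreate the PIF records under the new device names
                        xe pif-introduce device=eth6 host-uuid=edfaf68e-2c28-4486-8939-723bf2c72820 mac=14:23:f2:24:5a:80
                        xe pif-introduce device=eth7 host-uuid=edfaf68e-2c28-4486-8939-723bf2c72820 mac=14:23:f2:24:5a:81

                        reboot now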
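The forget and introduce steps above can be wrapped so the PIF UUID is looked up rather than pasted by hand. A sketch only, assuming the `xe` CLI in dom0; `forget_pif_by_device` and `introduce_pif` are hypothetical helper names, and the `physical=true` filter is an assumption meant to skip VLAN PIFs that share the device name:

```shell
# Forget the physical PIF record for one device on one host.
# $1 = host UUID, $2 = device name (e.g. eth2)
forget_pif_by_device() {
    pif_uuid=$(xe pif-list host-uuid="$1" device="$2" physical=true --minimal)
    if [ -z "$pif_uuid" ]; then
        echo "no physical PIF found for $2" >&2
        return 1
    fi
    xe pif-forget uuid="$pif_uuid"
}

# Recreate a PIF record under a new device name.
# $1 = host UUID, $2 = device name (e.g. eth6), $3 = MAC address
introduce_pif() {
    xe pif-introduce host-uuid="$1" device="$2" mac="$3"
}
```

With these, the sequence becomes `forget_pif_by_device <host-uuid> eth2`, then the `interface-rename` and reboot steps as above, then `introduce_pif <host-uuid> eth6 <mac>`.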
                        

                        I'm still baffled as to why it worked this time. I had tried a few times with all the hosts up to date, and was planning to source a dual-NIC PCIe card to solve this issue. However, on this host the bond ID doesn't match the pool ...
                        On the pool:
                        f68e687e-5896-459f-9463-8806ffe4bb63-image.png
                        On the new host:
                        b58bac6f-f7fa-484e-be30-de2c4d4304bc-image.png xensource_error_log.txt

                        xensource.log also shows lots of errors, so I'm going to leave it running over the weekend and will try to add the necessary data for our SR on Monday.

                        What is the best way to send the full log file?
