XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Minisforum MS-01 unstable and hangs running Xcp-ng 8.3

    Scheduled Pinned Locked Moved Hardware
    24 Posts 6 Posters 7.1k Views 6 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • M Offline
      manilx @steff22
      last edited by

      @steff22 I actually had the same issue when I got my MS01. Not only on xcpng but also on proxmox, which I tried before.
      I have sent the unit back for a refund and got a protectli VP6670, which is rock solid.
      Also it has a 12th gen Intel CPU and not one of the 13th/14th one as the MS01 (https://www.theverge.com/24216305/intel-13th-14th-gen-raptor-lake-cpu-crash-news-updates-patches-fixes-motherboards)

      I S 2 Replies Last reply Reply Quote 0
      • I Offline
        imaginapix @manilx
        last edited by

        @manilx

        To be fair the MS-01 is also available with 12th gen Intel CPU.

        M 1 Reply Last reply Reply Quote 0
        • S Offline
          steff22 @manilx
          last edited by

          @manilx Thanks for the info. I thought I had heard something that there is a fault with 13th gen intel, But I was unsure if it also applied to the mobile version.
          But actually feared it after I contacted Minisforum and explained the problem and was asked to send it back to them.

          So I'll probably send it back. I don't really like sending it if there had been a software error, But now I have confirmed that it has happened to several people too, so it should probably be safe to send it back

          M 2 Replies Last reply Reply Quote 0
          • M Offline
            manilx @steff22
            last edited by manilx

            @steff22 I have in the last days contacted Minisforum for an updated bios. They've sent me the latest but no fix for the Intel issue. And when I pressed the issue, this was their answer:

            There is no mention in the release notes of how to fix the Intel CPU bug that broke the 13th and 14th generation Raptor Lake cpus.
            No explanation

            Intel has therefore extended its warranty by two years. What is your policy on this?
            We don't have a policy on that

            I still have an NPB7 with the 13th gen chip. So if this one burns I'm on my own.

            I have moved all xcpng VM's from this and removed it from the main pool, now on it's own pool. Will have 2 unimportant VN's running on it, all else on Protectli.

            1 Reply Last reply Reply Quote 0
            • M Offline
              manilx @imaginapix
              last edited by

              @imaginapix True BUT this doesn't help if you pick the higher end one, which you would for running a hypervisor.
              AND their stance on this is well: you're on your own. See my other post.

              1 Reply Last reply Reply Quote 0
              • M Offline
                manilx @steff22
                last edited by

                @steff22 You should. It's been very popular on Youtube (most getting their units for free for promotion) BUT nobody speeks of this (to my knowledge).
                I had nothing but trouble with it.

                1 Reply Last reply Reply Quote 0
                • olivierlambertO Offline
                  olivierlambert Vates 🪐 Co-Founder CEO
                  last edited by

                  Speaking of Protectli, good news coming in the next months 😉 Stay tuned.

                  M 1 Reply Last reply Reply Quote 0
                  • M Offline
                    manilx @olivierlambert
                    last edited by

                    @olivierlambert NICE!

                    Can only speak VERY positive on them!!!!

                    1 Reply Last reply Reply Quote 0
                    • I Offline
                      iLix
                      last edited by

                      E+P cores are not support in the kernel, have you tried running with only E or P cores enabled?

                      My 13 Gen MS-01’s with Intel microcode installed work great with Proxmox. Running xcp-ng on D-1541’s still, so all good there 🙂

                      M S 2 Replies Last reply Reply Quote 0
                      • M Offline
                        manilx @iLix
                        last edited by

                        @iLix Good for you. Other's may vary as we have seen.....

                        1 Reply Last reply Reply Quote 0
                        • olivierlambertO Offline
                          olivierlambert Vates 🪐 Co-Founder CEO
                          last edited by

                          Well, at my surprise, XCP-ng works with P/E cores (even if it's more by luck than by design). I'll run extensive tests with a Protectli unit I just received 👼

                          S M I 3 Replies Last reply Reply Quote 0
                          • S Offline
                            steff22 @iLix
                            last edited by

                            @iLix No, have not tried with only E or P cores.

                            But I thought I had triggered an error when I tried to test the cpu by assigning all 20 cores to 1 windows vm and running the cpu test test program for 15 min with 100% cpu usage. This did not cause any problems. also no problems with it getting too high a temperature.

                            So it is a bit strange that this error occurs when ms 01 is almost idle, the vms that are running use very little cpu. is at 0% to 4% cpu use on the xcp-ng host mostly all day

                            Have installed the latest bios but have not done anything with the Intel microcode.
                            How do I see which version of Intel microcode I have and how do I update this?

                            I 1 Reply Last reply Reply Quote 0
                            • S Offline
                              steff22 @olivierlambert
                              last edited by

                              @olivierlambert But am I right about the backup that I can't trust them?
                              I must have held the power button in to force reboot opposite 5 times now I think. But notice no errors on xcp-ng or other vms that are running, everything just starts working again after a reboot.

                              How much can Xcp-ng 8.3 withstand this extreme treatment before I should fear corrupt system files on Xcp-ng itself?

                              1 Reply Last reply Reply Quote 0
                              • olivierlambertO Offline
                                olivierlambert Vates 🪐 Co-Founder CEO
                                last edited by olivierlambert

                                Just received it for testing purpose 👼

                                W 1 Reply Last reply Reply Quote 2
                                • M Offline
                                  manilx @olivierlambert
                                  last edited by manilx

                                  @olivierlambert Running also on a VP6670 with 64GB RAM (which seems is what you got). So anything you want me to test, fire away 😉

                                  1 Reply Last reply Reply Quote 0
                                  • olivierlambertO Offline
                                    olivierlambert Vates 🪐 Co-Founder CEO
                                    last edited by

                                    I wonder about the power draw difference with and without the P cores (leaving only the E cores).

                                    1 Reply Last reply Reply Quote 0
                                    • I Offline
                                      iLix @olivierlambert
                                      last edited by

                                      That is great to hear that E/P-cores are working. I now have a migration project on my hands 🙂

                                      1 Reply Last reply Reply Quote 0
                                      • I Offline
                                        iLix @steff22
                                        last edited by

                                        @steff22 You should not need the microcode if xcp-ng works out of the box with hybrid CPU's.

                                        When you force the power off, are the VM's reset time the same as for the host? Example: Does the kernel-power off event correspond with your hosts hard reboot, if you have Windows VM's running?

                                        S 1 Reply Last reply Reply Quote 0
                                        • S Offline
                                          steff22 @iLix
                                          last edited by

                                          @iLix don't know how to check the last activity log in the xcp-ng host before the ms-01 crashes.

                                          But according to the smart house log on windows vm, the crash time was 2 hours before I forced the power off. So expect this time to be the same on all running vms and the xcp-ng host itself.

                                          windows vm has lost direct contact with the m.2 storage at this point and cannot write more in the log. so to me it seems like the whole ms-01 freezes and crashes.

                                          I 1 Reply Last reply Reply Quote 0
                                          • I Offline
                                            iLix @steff22
                                            last edited by

                                            @steff22 Mine "crashed" before i added this to the grub.cfg "pcie_port_pm=off pcie_aspm.policy=performance" (Proxmox)
                                            The Windows event log would state kernel-power error matching the same time as the forced reset, so the VM's are still running while in this state.
                                            This happened while idle, I would see a lot of cluster sync errors, as if they lost connection to the network (Using the 2.5 NIC's)

                                            Maybe in the same realm as this issue:
                                            https://xcp-ng.org/forum/topic/8092/add-kernel-boot-params-for-dom0

                                            S 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post