XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    nVidia Tesla P4 for vgpu and Plex encoding

    Scheduled Pinned Locked Moved Solved Compute
    vgpu
    63 Posts 14 Posters 18.4k Views 16 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • splastunovS Offline
      splastunov @mohammadm
      last edited by

      @mohammadm
      I'm talking now about vGPU not passthrough

      1. old drivers
      2. no way to monitor GPU load
      3. Sometimes the GPU on Dom0 stops responding and the only thing that can be done to solve this problem is to reboot the entire server with all the virtual machines on it.
        and etc.... do not remember all troubles I had with it
      M A 2 Replies Last reply Reply Quote 0
      • M Offline
        mohammadm @splastunov
        last edited by

        @splastunov said in nVidia Tesla P4 for vgpu and Plex encoding:

        @mohammadm
        I'm talking now about vGPU not passthrough

        1. old drivers
        2. no way to monitor GPU load
        3. Sometimes the GPU on Dom0 stops responding and the only thing that can be done to solve this problem is to reboot the entire server with all the virtual machines on it.
          and etc.... do not remember all troubles I had with it

        I installed the Firepro S7150x2 yesterday without any issues. It's been about 24 hours, so far no issues. I do agree I am missing the nvidia-smi command to get a better overview.

        Why is the support regarding vGPU so bad and mostly outdated 😞

        1 Reply Last reply Reply Quote 0
        • olivierlambertO Offline
          olivierlambert Vates πŸͺ Co-Founder CEO
          last edited by

          I will have the opportunity to discuss more with AMD (on a regular basis, for some reasons), I'll try to see if I can connect to their GPU division πŸ™‚

          J 1 Reply Last reply Reply Quote 1
          • A Offline
            austinw @splastunov
            last edited by

            @splastunov Dom0 would need to have a vGPU in this scenario?

            1 Reply Last reply Reply Quote 0
            • J Offline
              JamesG @olivierlambert
              last edited by

              @olivierlambert As mentioned in another thread...Intel Flex GPU's seem primed for this. nVidia is closed and license greedy. AMD seems a little lost and wandering. Intel has said, "No licensing...Just use it." but they require some development.

              It should be relatively easy to incorporate the Intel Flex GPU's, but I'm not sure if the newer kernels are required. That might be where the wheels fall off for now.

              M 1 Reply Last reply Reply Quote 0
              • M Offline
                mohammadm @JamesG
                last edited by

                @JamesG This would indeed be awesome! I would prefer going the Intel route. Any contacts there @olivierlambert ?

                J 1 Reply Last reply Reply Quote 0
                • J Offline
                  JamesG @mohammadm
                  last edited by

                  From my perspective, there's literally money on the ground for any virtualization platform to pick up VDI with Intel. The GPU's are affordable and performant for VDI work. They currently work with Openshift and Proxmox is at work on it.

                  1 Reply Last reply Reply Quote 2
                  • stormiS Offline
                    stormi Vates πŸͺ XCP-ng Team
                    last edited by

                    @splastunov why do you extract the vgpu RPM rather than just installing the RPM directly?

                    1 Reply Last reply Reply Quote 1
                    • H Offline
                      high-voltages
                      last edited by

                      I am currently trying to get a NVIDIA Telsa M10 working with the latest updated Xcp-ng 8.2.1 Build Date 2024-07-17.

                      The Drivers "NVIDIA-vGPU-xenserver-8-550.54.10.x86_64.rpm" are installed and the system was rebooted. Now it is possible to select a vGPU via XOA for the VM.

                      Further the extracted vgpu file from the CitrixHypervisor-8.2.0-install-cd.iso was copied to /usr/lib64/xen/bin/vgpu and is executable.

                      The Start of the VM exited with:

                      "FAILED_TO_START_EMULATOR(OpaqueRef:0cc388f0-b606-469d-b68c-b4713c7f4abb, vgpu, Daemon exited unexpectedly)
                      "

                      Is there someone who has solved this Problem?

                      Platform:
                      HPE DL380 Gen10

                      msupportM 1 Reply Last reply Reply Quote 0
                      • msupportM Offline
                        msupport @high-voltages
                        last edited by

                        @high-voltages

                        download XenServer iso file (https://www.xenserver.com/downloads | XenServer8_2024-06-03.iso)
                        copy the file (vgpu-7.4.13-1.xs8.x86_64.rpm) in the packages directory ! Do not use CitrixHypervisor-8.2.0-install-cd file vgpu-7.4.8-1.x86_64
                        unpack file vgpu-7.4.13-1.xs8.x86_64
                        copy the file \usr\lib64\xen\bin\vgpu (size 129KB) to \usr\lib64\xen\bin\ on your XCP-NG host (chmod 755)

                        stormiS H 2 Replies Last reply Reply Quote 1
                        • stormiS Offline
                          stormi Vates πŸͺ XCP-ng Team @msupport
                          last edited by

                          Why unpacking and making manual changes to the filesystem instead of installing the RPM directly?

                          1 Reply Last reply Reply Quote 0
                          • H Offline
                            high-voltages @msupport
                            last edited by

                            Thanks...

                            I tried at first the driver from above:
                            NVIDIA-GRID-CitrixHypervisor-8.2-550.54.10-550.54.14-551.61
                            what doesn’t work.
                            After this I have found the newer Version:
                            NVIDIA-GRID-CitrixHypervisor-8.2-550.54.16-550.54.15-551.78
                            Now the Error on VM startup does not came up and the driver installation in the VM works.

                            The Driver packages what are specially for Xenserver8 are not installable on the Xcp-NG Version 8.2.1 because the different Base "https://xcp-ng.org/forum/post/80461https://xcp-ng.org/forum/post/80461"

                            now I get one VM working with a vGPU but a second one does not detect the mapped vGPU and the windows driver cant be installed
                            I am currently not sure why this happens, because a copy of the working one with another vGPU config will work.
                            The Xentools and Windowsversions are the same and the template also

                            the vGPU rpm is installable without any problems😊

                            M 1 Reply Last reply Reply Quote 0
                            • M Offline
                              mgformula @high-voltages
                              last edited by mgformula

                              @high-voltages hello i'm stuck at the power on issue, i can assign a vGPU profile but the VM wont power on, it sounds like a mis match in the binary file but i've tried vgpu-7.4.8.x and currently 7.4.13.x also tried host driver NVIDIA-GRID-CitrixHypervisor-8.2-550.54.10 and now running 54.16. i also tried 8.3 with their respective drivers, but same power on even issue so i did a clean install of 8.2

                              vm.start
                              {
                              "id": "1eadda44-c82d-69ad-5b83-514a8e421d65",
                              "bypassMacAddressesCheck": false,
                              "force": false
                              }
                              {
                              "code": "FAILED_TO_START_EMULATOR",
                              "params": [
                              "OpaqueRef:f44e209a-5d6f-4022-b9b0-263fddcc5f12",
                              "vgpu",
                              "Daemon exited unexpectedly"
                              ],
                              "call": {
                              "method": "VM.start",
                              "params": [
                              "OpaqueRef:f44e209a-5d6f-4022-b9b0-263fddcc5f12",
                              false,
                              false
                              ]
                              },
                              "message": "FAILED_TO_START_EMULATOR(OpaqueRef:f44e209a-5d6f-4022-b9b0-263fddcc5f12, vgpu, Daemon exited unexpectedly)",
                              "name": "XapiError",
                              "stack": "XapiError: FAILED_TO_START_EMULATOR(OpaqueRef:f44e209a-5d6f-4022-b9b0-263fddcc5f12, vgpu, Daemon exited unexpectedly)
                              at Function.wrap (file:///usr/local/lib/node_modules/xo-server/node_modules/xen-api/_XapiError.mjs:16:12)
                              at file:///usr/local/lib/node_modules/xo-server/node_modules/xen-api/transports/json-rpc.mjs:38:21
                              at runNextTicks (node:internal/process/task_queues:60:5)
                              at processImmediate (node:internal/timers:447:9)
                              at process.callbackTrampoline (node:internal/async_hooks:128:17)"
                              }

                              H 1 Reply Last reply Reply Quote 0
                              • H Offline
                                high-voltages @mgformula
                                last edited by

                                @mgformula i can take a look at my setup tomorrow, currently all runs fine.

                                8.2.1 with Tesla m10 will work with the correct drivers an patches.

                                M 1 Reply Last reply Reply Quote 0
                                • M Offline
                                  mgformula @high-voltages
                                  last edited by mgformula

                                  @high-voltages thanks so much!! I'm also testing with M10 cards, we are trying to find VMware alternatives and are a large vGPU shop πŸ™‚

                                  here is what i'm currently running
                                  xe host-param-get param-name=software-version uuid=$(xe host-list --minimal)
                                  product_version: 8.2.1; product_version_text: 8.2; product_version_text_short: 8.2; platform_name: XCP; platform_version: 3.2.1; product_brand: XCP-ng; build_number: release/yangtze/master/58; hostname: localhost; date: 2024-07-17; dbv: 0.0.1; xapi: 1.20; xen: 4.13.5-9.44; linux: 4.19.0+1; xencenter_min: 2.16; xencenter_max: 2.16; network_backend: openvswitch; db_schema: 5.603

                                  i installed the nvidia host driver using rpm -iv "nvidia.rpm ive also tried other methods copying the .iso and installing the supplemental pack. One thing to mention is i grabbed the binaries from XenServer8_2024-10-03
                                  14488ebb-1fe5-4553-a3e2-7c6a0decd8a6-image.png

                                  d65c947e-addb-4a07-bbae-548f236012de-image.png

                                  H 1 Reply Last reply Reply Quote 0
                                  • H Offline
                                    high-voltages @mgformula
                                    last edited by

                                    @mgformula
                                    I used this driverpackage "NVIDIA-vGPU-CitrixHypervisor-8.2-550.54.16.x86_64.iso" and installed the vgpu source with "rpm -ihv vgpu-7.4.8-1.x86_64.rpm" this I have got from the Citrix installation iso.

                                    The binary 7.4.13 should used for the Xcp-ng 8.3 release not for the 8.2.1.

                                    The Problem from my earlier post is also solved, a new created vm was working. some copies from a another host does not work. perhaps there some driverconflicts in this Windows Vm.

                                    M 3 Replies Last reply Reply Quote 1
                                    • olivierlambertO Offline
                                      olivierlambert Vates πŸͺ Co-Founder CEO
                                      last edited by

                                      Great news! Enjoy vGPU with XCP-ng πŸ™‚

                                      1 Reply Last reply Reply Quote 0
                                      • M Offline
                                        mgformula @high-voltages
                                        last edited by mgformula

                                        @high-voltages thanks, i'm trying this now, Just to confirm we are not downloading the NVidia host drivers from NVidia site, we are using them from the Citrix .ISO?

                                        and my install commands are just the binary.rpm and the nvidia host driver correct? I was basically just copying the binary and making it executable, maybe that was the issue i never installed it

                                        1 Reply Last reply Reply Quote 0
                                        • M Offline
                                          mgformula @high-voltages
                                          last edited by

                                          @high-voltages said in nVidia Tesla P4 for vgpu and Plex encoding:

                                          rpm -ihv vgpu-7.4.8-1.x86_64.rpm

                                          Here are my commands just to confirm I'm doing this right πŸ™‚ Do i still need to copy the binary to the location other have mentioned and make it executable?

                                          1e99e311-8bc2-44e3-a5fc-9edeb323bb39-image.png

                                          a490effb-9b63-49cd-b904-85283812ec93-image.png

                                          1 Reply Last reply Reply Quote 0
                                          • M Offline
                                            mgformula @high-voltages
                                            last edited by

                                            @high-voltages Just wanted to thank you again, its working now with the commands i did in the screenshot.

                                            1 Reply Last reply Reply Quote 1
                                            • olivierlambertO olivierlambert marked this topic as a question on
                                            • olivierlambertO olivierlambert has marked this topic as solved on
                                            • nathanael-hN nathanael-h referenced this topic
                                            • First post
                                              Last post