XCP-ng

    GPU support and Nvidia Grid vGPU

    • dangerd

      Have there been any changes to support GPU/vGPU with Nvidia Grid and/or similar?

      • olivierlambert Vates 🪐 Co-Founder CEO

        Hi,

        Short answer: no. You can't do it with XCP-ng, except if you decide to hack things around by replacing some binaries cherry picked from the XenServer ISO. Then it will work. Something we obviously can't do nor redistribute since it's not open source 🙂

        • EspenU @olivierlambert

          @olivierlambert Do you think it's possible for that support to be added, or are there proprietary code limitations that prevent it?
          What would it take to solve that limitation?

          We have delivered a project which uses VMware, and the current temporary licenses end in September (they could probably be extended a little).
          In our cluster we have a total of 10x Nvidia A2, and 3x Nvidia L40 cards.
          I would love to move to xcp-ng, as it really seems perfect for us, but the lack of Nvidia vgpu support stops us.

          • msupport @EspenU

            @EspenU
            I have completed the migration from VMware to XCP-ng. I am using Nvidia M10 and Nvidia A16 cards. I had to invest some time to get the Nvidia cards up and running. What I can reveal is that so far they do not work with XCP-ng release candidate 1.

            • EspenU @msupport

              @msupport said in GPU support and Nvidia Grid vGPU:

              @EspenU
              I have completed the migration from VMware to XCP-ng. I am using Nvidia M10 and Nvidia A16 cards. I had to invest some time to get the Nvidia cards up and running. What I can reveal is that so far they do not work with XCP-ng release candidate 1.

              I've seen your post from March 14, where you got it working on 8.3.
              So you're saying that something has broken between then and RC1?
              That's not good 😞 .
              I have it running in a test environment on 8.2 and since it was working fine there I was hoping it would still work in 8.3. sigh...

              • olivierlambert Vates 🪐 Co-Founder CEO

                8.3 isn't even officially released 😆 I don't see any reason why it would break between beta 2 and RC1, but if you can confirm it, we could investigate. There's no rush to get out of 8.2, it's still an LTS 🙂

                • msupport @EspenU

                  @EspenU
                  #*** Here are my insights for the nvidia drivers and xcp-ng 8.3
                  #*** Install xcp-ng beta2
                  #*** then start updates

                  yum update kernel device-mapper guest-templates-json guest-templates-json-data-linux intel-microcode openssh python2-scapy amd-microcode cisco* libblkid libcgroup libcgroup-tools libcurl libmount libuuid curl util-linux edk2 forkexecd fuse-libs gdisk guest-templates-json-data-other guest-templates-json-data-windows intel-ice python-fasteners python2-defusedxml python2-xapi-storage nss-sysinit nss-tools nspr nss nss-softokn nss-softokn-freebl nss-tools   nss-util   kernel-livepatch logrotate mellanox-mlnxen message-switch openssl* openvswitch swtpm* tzdata microsemi-smartpqi vendor-drivers sudo newt qlogic-qla2xxx qlogic-fastlinq gpumon sm vhd-tool vcputune
                  

                  *** The Nvidia driver must be installed before the operations below, because the driver installation is not compatible with python3 ****

                  yum remove xcp-python-libs-2.3.5-1.1.xcpng8.3.noarch
                  yum update ncurses-compat-libs python3-fasteners python3-pyudev python3-scapy python3-xcp-libs python36-future    
                  yum update net-snmp
                  yum update net-snmp-agent-libs net-snmp-libs
                  yum update xapi-storage-script
                  yum update xs-openssl-libs
                  yum update xapi-nbd
                  yum remove net-snmp
                  yum install net-snmp-libs net-snmp-agent-libs net-snmp  # subcommand missing in the original post; 'install' is assumed
                  yum update xcp-ng-plymouth-theme
                  yum update xen-crashdump-analyser
                  

                  *** The driver no longer works after one of these 40 updates. They depend on each other, so they cannot be applied one at a time:

                  Name	Description	Version	Release	Size	
                  blktap	blktap user space utilities	3.54.9	1.1.xcpng8.3	305.47 KiB	
                  kexec-tools	kexec/kdump userspace tools	2.0.15	20.xcpng8.3	67.8 KiB	
                  ncurses	Ncurses support utilities	6.4	3.xcpng8.3	394.83 KiB	
                  ncurses-base	Descriptions of common terminals	6.4	3.xcpng8.3	57.81 KiB	
                  ncurses-libs	Ncurses libraries	6.4	3.xcpng8.3	312.17 KiB	
                  qemu	qemu-dm device model	4.2.1	5.2.9.xcpng8.3	15.57 MiB	
                  rrdd-plugins	RRDD metrics plugin	24.16.0	1.2.xcpng8.3	4.29 MiB	
                  setup	A set of system configuration and setup files	2.8.71	9.1.xcpng8.3	169.24 KiB	
                  sm-cli	CLI for xapi toolstack storage managers	24.16.0	1.2.xcpng8.3	1.53 MiB	
                  squeezed	Memory ballooning daemon for the xapi toolstack	24.16.0	1.2.xcpng8.3	1.54 MiB	
                  varstored	EFI Variable Storage Daemon	1.2.0	2.3.xcpng8.3	46.55 KiB	
                  varstored-guard	Deprivileged XAPI socket Daemon for EFI variable storage	24.16.0	1.2.xcpng8.3	4.3 MiB	
                  varstored-tools	Tools for manipulating a guest's EFI variables offline	1.2.0	2.3.xcpng8.3	58.66 KiB	
                  vncterm	vncterm tty to vnc utility	10.2.1	2.xcpng8.3	43.94 KiB	
                  wsproxy	Websockets proxy for VNC traffic	24.16.0	1.2.xcpng8.3	932.78 KiB	
                  xapi-core	The xapi toolstack	24.16.0	1.2.xcpng8.3	24.55 MiB	
                  xapi-rrd2csv	A tool to output RRD values in CSV format	24.16.0	1.2.xcpng8.3	2.61 MiB	
                  xapi-tests	Toolstack test programs	24.16.0	1.2.xcpng8.3	6.25 MiB	
                  xapi-xe	The xapi toolstack CLI	24.16.0	1.2.xcpng8.3	1.13 MiB	
                  xcp-clipboardd	Daemon to share a virtualized Windows clipboard	1.0.3	8.xcpng8.3	22.53 KiB	
                  xcp-featured	XCP-ng feature daemon	1.1.7	2.xcpng8.3	1.25 MiB	
                  xcp-networkd	Simple host network management service for the xapi toolstack	24.16.0	1.2.xcpng8.3	4.15 MiB	
                  xcp-ng-release	XCP-ng release file	8.3.0	24	112.56 KiB	
                  xcp-ng-release-config	XCP-ng configuration	8.3.0	24	49.93 KiB	
                  xcp-ng-release-presets	XCP-ng presets file	8.3.0	24	18.44 KiB	
                  xcp-ng-xapi-plugins	XAPI additional plugins for XCP-ng	1.10.0	1.xcpng8.3	46.17 KiB	
                  xcp-rrdd	Statistics gathering daemon for the xapi toolstack	24.16.0	1.2.xcpng8.3	3.14 MiB	
                  xen-dom0-libs	Xen Hypervisor Domain 0 libraries	4.17.4	3.xcpng8.3	691.85 KiB	
                  xen-dom0-tools	Xen Hypervisor Domain 0 tools	4.17.4	3.xcpng8.3	1.9 MiB	
                  xen-hypervisor	The Xen Hypervisor	4.17.4	3.xcpng8.3	2.34 MiB	
                  xen-libs	Xen Hypervisor general libraries	4.17.4	3.xcpng8.3	54.05 KiB	
                  xen-livepatch	Live patches for Xen	2.0	1.xcpng8.3	2.91 KiB	
                  xen-tools	Xen Hypervisor general tools	4.17.4	3.xcpng8.3	35.66 KiB	
                  xenopsd	Simple VM manager	24.16.0	1.2.xcpng8.3	1.17 MiB	
                  xenopsd-cli	CLI for xenopsd, the xapi toolstack domain manager	24.16.0	1.2.xcpng8.3	1.61 MiB	
                  xenopsd-xc	Xenopsd using xc	24.16.0	1.2.xcpng8.3	4.61 MiB	
                  xenserver-hwdata	Additional hardware identification and configuration data	20240411	1.xcpng8.3	284.41 KiB	
                  xenserver-status-report	A program that generates status reports for a XenServer host	2.0.3	1.xcpng8.3	33.24 KiB	
                  xo-lite	Xen Orchestra Lite	0.2.3	1.xcpng8.3	816.19 KiB	
                  xsconsole	XCP-ng Host Configuration Console	11.0.2	1.1.xcpng8.3	304.44 KiB
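While narrowing down which update breaks the driver, one option is to apply everything else and hold back the suspect packages. The following is a minimal sketch, assuming yum's `--exclude` option is acceptable on your host. The helper name `exclude_args` and the `HOLD` list are hypothetical, and the four package names are picked from the table purely as an illustration. Because the updates depend on each other, yum may skip or refuse more than expected. The sketch only prints the command so it can be reviewed first.

```shell
#!/bin/sh
# Build one "--exclude=<pkg>" argument per package name given.
exclude_args() {
    for pkg in "$@"; do
        printf '%s%s ' '--exclude=' "$pkg"
    done
}

# Illustrative selection from the table above (assumption, not a diagnosis).
HOLD="xapi-core xenopsd xen-hypervisor qemu"

# Print the resulting command instead of running it.
# $HOLD is left unquoted on purpose so it splits into separate names.
echo "yum update $(exclude_args $HOLD)"
```

Once the printed command looks right, it can be run by hand as root on a test host.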
                  
                  • olivierlambert Vates 🪐 Co-Founder CEO

                    Have you taken the drivers for XS8, which is the equivalent of XCP-ng 8.3?

                    • EspenU @olivierlambert

                      @olivierlambert Would that mean that the vgpu binary should be taken from XS8 as well when using XCP-ng 8.3?
                      During testing I tried using that binary in XCP-ng 8.2, and it didn't work (VMs would not boot). I had to use the one from Citrix Hypervisor 8.2.

                      • msupport @olivierlambert

                        @olivierlambert
                        I have tested the Nvidia XenServer version 17.0.

                        • olivierlambert Vates 🪐 Co-Founder CEO

                          I don't know all the versions, but I can tell that:

                          • XCP-ng 8.2 == XS 8.2
                          • XCP-ng 8.3 == XS 8

                          So be sure to use the right/matching binary first 🙂
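The pairing above can be written down as a small lookup. This is only a sketch: the function name `matching_xenserver` is hypothetical, and treating point releases such as 8.2.1 as part of the 8.2 line is an assumption, not something stated in this thread.

```shell
#!/bin/sh
# Map an XCP-ng version to the XenServer release whose binaries match it,
# using only the two pairings listed above. Anything else is "unknown".
matching_xenserver() {
    case "$1" in
        8.2|8.2.*) echo "XS 8.2" ;;   # point releases assumed to follow 8.2
        8.3|8.3.*) echo "XS 8" ;;
        *) echo "unknown"; return 1 ;;
    esac
}

# Example call with a version typed in by hand.
matching_xenserver "8.3"
```

A check like this could sit at the top of an install script to stop early when the vgpu binary on hand comes from the wrong XenServer release.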

                          • msupport @olivierlambert

                            @olivierlambert
                            I have found the solution. I will test the whole thing again tomorrow with a clean installation with rc1.

                            • olivierlambert Vates 🪐 Co-Founder CEO

                              Oh great! Keep us posted!!

                              • tjkreidl Ambassador @msupport

                                @msupport Please write up all the steps involved, as this would be very useful documentation for anyone else wanting to accomplish this. Many have delayed switching to XCP-ng because of not being able to make use of NVIDIA GPUs.

                                • olivierlambert Vates 🪐 Co-Founder CEO

                                  It works on 8.2 already even if it's not official at all 😉

                                  • msupport

                                    Installation instructions XCP-NG (RC1) Nvidia M10 | A16 GPU

                                    1. Install XCP-ng 8.3 RC1.
                                    2. Download the Nvidia XenServer driver 17.1 (NVIDIA-GRID-XenServer-8-550.54.16-550.54.15-551.78).
                                    3. Unzip the driver package and copy the host driver (NVIDIA-vGPU-xenserver-8-550.54.16.x86_64.iso) to the host. I used WinSCP to copy it to the /tmp directory.
                                    4. Download the XenServer ISO file (https://www.xenserver.com/downloads | XenServer8_2024-06-03.iso).
                                    5. Copy the file vgpu-7.4.13-1.xs8.x86_64.rpm from the packages directory of that ISO. Do not use the file vgpu-7.4.8-1.x86_64 from CitrixHypervisor-8.2.0-install-cd!
                                    6. Unpack the file vgpu-7.4.13-1.xs8.x86_64.rpm.
                                    7. Copy the unpacked file /usr/lib64/xen/bin/vgpu (size 129 KB) to /usr/lib64/xen/bin/ on your XCP-ng host (chmod 755).
                                    8. Over SSH (PuTTY), in /tmp/, run: xe-install-supplemental-pack NVIDIA-vGPU-xenserver-8-550.54.16.x86_64.iso
                                    9. Reboot.
                                    10. Install the guest driver on the VM client (551.78_grid_win10_win11_server2022_dch_64bit_international.exe).
                                    11. Put the token file from Nvidia in place on the VM (C:\Program Files\Nvidia Corporation\vGPU Licensing\ClientConfigToken\*.tok).
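The host-side part of these steps (5 to 8) could be scripted roughly as follows. This is a sketch under assumptions, not a tested procedure: the function names `run` and `install_vgpu`, the scratch directory, and the location of the RPM in /tmp are all hypothetical, and unpacking with `rpm2cpio | cpio` is one common way to extract an RPM rather than necessarily the method used above. By default it runs in dry-run mode and only prints the commands.

```shell
#!/bin/sh
# DRY_RUN=1 (the default) prints each command instead of executing it.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

install_vgpu() {
    rpm_file="$1"                       # absolute path to vgpu-7.4.13-1.xs8.x86_64.rpm
    driver_iso="$2"                     # absolute path to the NVIDIA host driver ISO
    workdir="${3:-/tmp/vgpu-extract}"   # hypothetical scratch directory

    run mkdir -p "$workdir"
    # Step 6: unpack the RPM into the scratch directory without installing it.
    run sh -c "cd '$workdir' && rpm2cpio '$rpm_file' | cpio -idm"
    # Step 7: copy the vgpu binary into place with mode 755.
    run install -m 755 "$workdir/usr/lib64/xen/bin/vgpu" /usr/lib64/xen/bin/vgpu
    # Step 8: install the NVIDIA supplemental pack.
    run xe-install-supplemental-pack "$driver_iso"
    echo "Reboot the host to finish (step 9)."
}

install_vgpu /tmp/vgpu-7.4.13-1.xs8.x86_64.rpm \
             /tmp/NVIDIA-vGPU-xenserver-8-550.54.16.x86_64.iso
```

Review the printed commands first, then rerun as root with `DRY_RUN=0` on a test host.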

                                    Nvidia drivers 17.2 and 17.3 do not work yet (Guest driver crashes)
                                    I will stay tuned and inform you about new findings

                                    Have fun

                                    • msupport @olivierlambert

                                      @olivierlambert
                                      Thanks for the hint, that helped me a lot

                                      • olivierlambert Vates 🪐 Co-Founder CEO

                                        Thank you very much!

                                        • tjkreidl Ambassador @msupport

                                          @msupport Many thanks for your write-up! Have you experienced any issues communicating with the NVIDIA license server?

                                          • fatek

                                            I also used instructions from @msupport

                                            https://xcp-ng.org/forum/topic/8987/vgpu-nvidia-tesla-p4-xcp-ng-8-3-beta-2?_=1721015408249
