XCP-ng
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    XOA Recipe for Kubernetes cluster requiements?

    Scheduled Pinned Locked Moved Xen Orchestra
    8 Posts 6 Posters 1.4k Views 6 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • bullerwinsB Offline
      bullerwins
      last edited by

      Hi!
      I'm trying to deploy a k8s cluster but getting a "500 internal error" this is the log.

      Is there maybe a minimum requirement for free RAM/Disk that I'm not meeting? Is there anywhere where I can check the recipe config file or something similar to check it?

      The target SR free space is 350GB

      This are the host stats
      6d731b09-1ee8-4ec5-ad99-ba130de00bb6-image.png

      Error log file (i've only taken out the pub key):

      xoa.recipe.createKubernetesCluster
      {
        "clusterName": "k8sRodri",
        "controlPlanePoolSize": 1,
        "k8sVersion": "1.28.3-00",
        "nbNodes": 1,
        "network": "fe86cad9-df93-deb8-0f56-afe8e453b8e0",
        "sr": "a1316ff3-b8ad-fdd8-4bb3-4ed660bbdb8b",
        "sshKey": "AAAAB3xx[censored]xxxx2kQ=="
      }
      {
        "originalUrl": "https://192.168.10.198/import/?sr_id=OpaqueRef%3A3cec6886-1be2-4dce-b152-1e190fd8aa5b&session_id=OpaqueRef%3A3c71c7f9-27ce-470d-b5a5-524f0f73f887",
        "url": "https://192.168.10.198/import/?sr_id=OpaqueRef%3A3cec6886-1be2-4dce-b152-1e190fd8aa5b&session_id=OpaqueRef%3A3c71c7f9-27ce-470d-b5a5-524f0f73f887",
        "pool_master": {
          "uuid": "b5a406c7-e3c3-4656-a0c4-f01baf26d2b4",
          "name_label": "xcp-ng-12400",
          "name_description": "Default install",
          "memory_overhead": 655278080,
          "allowed_operations": [
            "vm_migrate",
            "provision",
            "vm_resume",
            "evacuate",
            "vm_start"
          ],
          "current_operations": {},
          "API_version_major": 2,
          "API_version_minor": 16,
          "API_version_vendor": "XenSource",
          "API_version_vendor_implementation": {},
          "enabled": true,
          "software_version": {
            "product_version": "8.2.1",
            "product_version_text": "8.2",
            "product_version_text_short": "8.2",
            "platform_name": "XCP",
            "platform_version": "3.2.1",
            "product_brand": "XCP-ng",
            "build_number": "release/yangtze/master/58",
            "hostname": "localhost",
            "date": "2023-10-18",
            "dbv": "0.0.1",
            "xapi": "1.20",
            "xen": "4.13.5-9.37",
            "linux": "4.19.0+1",
            "xencenter_min": "2.16",
            "xencenter_max": "2.16",
            "network_backend": "openvswitch",
            "db_schema": "5.603"
          },
          "other_config": {
            "agent_start_time": "1700030285.",
            "boot_time": "1700030236.",
            "rpm_patch_installation_time": "1699202788.539",
            "iscsi_iqn": "iqn.2023-07.com.example:aa9ef3b7"
          },
          "capabilities": [
            "xen-3.0-x86_64",
            "hvm-3.0-x86_32",
            "hvm-3.0-x86_32p",
            "hvm-3.0-x86_64",
            ""
          ],
          "cpu_configuration": {},
          "sched_policy": "credit",
          "supported_bootloaders": [
            "pygrub",
            "eliloader"
          ],
          "resident_VMs": [
            "OpaqueRef:9758ce9f-bfc4-49d4-a5c9-92ac9457fe64",
            "OpaqueRef:3da033b8-1a94-4e56-bed5-0edda49315ab",
            "OpaqueRef:8e78839e-826f-429f-bb8b-ac6141fe7abe",
            "OpaqueRef:b11b30e1-353a-42c1-b5d4-1c7e44c5ff24",
            "OpaqueRef:2f0bea77-1af8-4e25-a9a5-1b8758732343"
          ],
          "logging": {},
          "PIFs": [
            "OpaqueRef:d44da9d7-0dff-4919-b891-dc6342f73570"
          ],
          "suspend_image_sr": "OpaqueRef:9f3db79f-b06d-4ef9-adc2-df280d476d3c",
          "crash_dump_sr": "OpaqueRef:9f3db79f-b06d-4ef9-adc2-df280d476d3c",
          "crashdumps": [],
          "patches": [],
          "updates": [],
          "PBDs": [
            "OpaqueRef:feb86773-f35c-41b0-8e37-741c6a315613",
            "OpaqueRef:7619d6a7-db7c-4670-97cc-f4fc0fd06ec4",
            "OpaqueRef:5a1a4090-0602-4298-a837-fd718dc689a8",
            "OpaqueRef:561f9549-a6e9-41be-b711-1619bc54e309",
            "OpaqueRef:537ce3e9-f134-4de3-8274-06675517b111",
            "OpaqueRef:3088a871-d7ab-4011-94f0-d961dff9eb36",
            "OpaqueRef:19d98d1e-5166-4b88-b861-d5fe2335fc16",
            "OpaqueRef:07b0be80-9c78-4216-9745-02cd9ebe8b9a"
          ],
          "host_CPUs": [
            "OpaqueRef:0d0e820a-1785-450c-be71-2ba3bafd6185",
            "OpaqueRef:aeb53d4a-2a40-4713-b8a3-647ddf4820d1",
            "OpaqueRef:d5630f42-a5b6-4226-8611-4b2ff3204c2c",
            "OpaqueRef:27f06ef0-3a1b-48c0-b93b-435a91da639e",
            "OpaqueRef:267853ae-f85f-44b8-b07a-7d0233f84110",
            "OpaqueRef:057190e0-b7db-474b-b9e2-27f78c271cf3",
            "OpaqueRef:ac60e622-673e-47dd-8abb-66a8a4ab4ede",
            "OpaqueRef:66aa9e2d-65b0-4ba0-b3ef-0881d41f3ab6",
            "OpaqueRef:f441c719-c49f-4b75-be34-98701e36d684",
            "OpaqueRef:50cd74cb-bd30-48b0-b67e-62b2589b7fef",
            "OpaqueRef:0fe4767c-a5bd-46f8-a02c-a505d3313f1b",
            "OpaqueRef:29cadecc-d48f-4ad1-b95b-db6baab5a41e"
          ],
          "cpu_info": {
            "cpu_count": "12",
            "socket_count": "1",
            "vendor": "GenuineIntel",
            "speed": "2496.126",
            "modelname": "12th Gen Intel(R) Core(TM) i5-12400",
            "family": "6",
            "model": "151",
            "stepping": "2",
            "flags": "fpu de tsc msr pae mce cx8 apic sep mca cmov pat clflush acpi mmx fxsr sse sse2 ss ht syscall nx rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid pni pclmulqdq monitor est ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch cpuid_fault ssbd ibrs ibpb stibp ibrs_enhanced fsgsbase bmi1 avx2 bmi2 erms rdseed adx clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 gfni vaes vpclmulqdq rdpid arch_capabilities",
            "features_pv": "1fc9cbf5-f6f83203-2991cbf5-00000123-00000007-218c0329-00400700-00000000-00001000-ac000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e167-00000000-00000000-00000000-00000000-00000000",
            "features_hvm": "1fcbfbff-f7fa3223-2d93fbff-00000523-0000000f-219c07ab-0040070c-00000000-00001000-bc000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e167-00000000-00000000-00000000-00000000-00000000",
            "features_hvm_host": "1fcbfbff-f7fa3223-2c100800-00000121-0000000f-219c07ab-0040070c-00000000-00001000-bc000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e163-00000000-00000000-00000000-00000000-00000000",
            "features_pv_host": "1fc9cbf5-f6f83203-28100800-00000121-00000007-218c0329-00400700-00000000-00001000-ac000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e163-00000000-00000000-00000000-00000000-00000000"
          },
          "hostname": "xcp-ng-12400",
          "address": "192.168.10.198",
          "metrics": "OpaqueRef:ab7679c5-31e0-4975-8e72-ca800ad31e02",
          "license_params": {
            "restrict_vswitch_controller": "false",
            "restrict_lab": "false",
            "restrict_stage": "false",
            "restrict_storagelink": "false",
            "restrict_storagelink_site_recovery": "false",
            "restrict_web_selfservice": "false",
            "restrict_web_selfservice_manager": "false",
            "restrict_hotfix_apply": "false",
            "restrict_export_resource_data": "false",
            "restrict_read_caching": "false",
            "restrict_cifs": "false",
            "restrict_health_check": "false",
            "restrict_xcm": "false",
            "restrict_vm_memory_introspection": "false",
            "restrict_batch_hotfix_apply": "false",
            "restrict_management_on_vlan": "false",
            "restrict_ws_proxy": "false",
            "restrict_vlan": "false",
            "restrict_qos": "false",
            "restrict_pool_attached_storage": "false",
            "restrict_netapp": "false",
            "restrict_equalogic": "false",
            "restrict_pooling": "false",
            "enable_xha": "true",
            "restrict_marathon": "false",
            "restrict_email_alerting": "false",
            "restrict_historical_performance": "false",
            "restrict_wlb": "false",
            "restrict_rbac": "false",
            "restrict_dmc": "false",
            "restrict_checkpoint": "false",
            "restrict_cpu_masking": "false",
            "restrict_connection": "false",
            "platform_filter": "false",
            "regular_nag_dialog": "false",
            "restrict_vmpr": "false",
            "restrict_vmss": "false",
            "restrict_intellicache": "false",
            "restrict_gpu": "false",
            "restrict_dr": "false",
            "restrict_vif_locking": "false",
            "restrict_storage_xen_motion": "false",
            "restrict_vgpu": "false",
            "restrict_integrated_gpu_passthrough": "false",
            "restrict_vss": "false",
            "restrict_guest_agent_auto_update": "false",
            "restrict_pci_device_for_auto_update": "false",
            "restrict_xen_motion": "false",
            "restrict_guest_ip_setting": "false",
            "restrict_ad": "false",
            "restrict_nested_virt": "false",
            "restrict_live_patching": "false",
            "restrict_set_vcpus_number_live": "false",
            "restrict_pvs_proxy": "false",
            "restrict_igmp_snooping": "false",
            "restrict_rpu": "false",
            "restrict_pool_size": "false",
            "restrict_cbt": "false",
            "restrict_usb_passthrough": "false",
            "restrict_network_sriov": "false",
            "restrict_corosync": "true",
            "restrict_zstd_export": "false",
            "restrict_pool_secret_rotation": "false"
          },
          "ha_statefiles": [],
          "ha_network_peers": [],
          "blobs": {},
          "tags": [],
          "external_auth_type": "",
          "external_auth_service_name": "",
          "external_auth_configuration": {},
          "edition": "xcp-ng",
          "license_server": {
            "address": "localhost",
            "port": "27000"
          },
          "bios_strings": {
            "bios-vendor": "American Megatrends International, LLC.",
            "bios-version": "F11",
            "system-manufacturer": "Gigabyte Technology Co., Ltd.",
            "system-product-name": "B760M GAMING X DDR4",
            "system-version": "Default string",
            "system-serial-number": "Default string",
            "baseboard-manufacturer": "Gigabyte Technology Co., Ltd.",
            "baseboard-product-name": "B760M GAMING X DDR4",
            "baseboard-version": "x.x",
            "baseboard-serial-number": "Default string",
            "oem-1": "Xen",
            "oem-2": "MS_VM_CERT/SHA1/bdbeb6e0a816d43fa6d3fe8aaef04c2bad9d3e3d",
            "oem-3": "Default string",
            "hp-rombios": ""
          },
          "power_on_mode": "",
          "power_on_config": {},
          "local_cache_sr": "OpaqueRef:9f3db79f-b06d-4ef9-adc2-df280d476d3c",
          "chipset_info": {
            "iommu": "false"
          },
          "PCIs": [
            "OpaqueRef:c3ccbcef-757c-426e-a6c0-71ca59c392a8",
            "OpaqueRef:b634af83-c4f6-44c7-be28-aad0158d2b72",
            "OpaqueRef:2d18b304-b03a-45d7-9378-4055cddf68a0",
            "OpaqueRef:16189584-ccdd-4b31-acfd-28ccd80a0c9e",
            "OpaqueRef:0dc37b73-b7f5-4c67-a55e-18ae3c7e2588"
          ],
          "PGPUs": [
            "OpaqueRef:1b9f4c3b-1136-4670-969a-4d8e72946a43"
          ],
          "PUSBs": [],
          "ssl_legacy": false,
          "guest_VCPUs_params": {},
          "display": "enabled",
          "virtual_hardware_platform_versions": [
            0,
            1,
            2
          ],
          "control_domain": "OpaqueRef:2f0bea77-1af8-4e25-a9a5-1b8758732343",
          "updates_requiring_reboot": [],
          "features": [],
          "iscsi_iqn": "iqn.2023-07.com.example:aa9ef3b7",
          "multipathing": false,
          "uefi_certificates": "",
          "certificates": [],
          "editions": [
            "xcp-ng"
          ],
          "https_only": false
        },
        "SR": {
          "uuid": "a1316ff3-b8ad-fdd8-4bb3-4ed660bbdb8b",
          "name_label": "Local storage",
          "name_description": "",
          "allowed_operations": [
            "vdi_enable_cbt",
            "vdi_list_changed_blocks",
            "unplug",
            "plug",
            "pbd_create",
            "vdi_disable_cbt",
            "update",
            "pbd_destroy",
            "vdi_resize",
            "forget",
            "vdi_clone",
            "vdi_data_destroy",
            "scan",
            "vdi_snapshot",
            "vdi_mirror",
            "vdi_create",
            "vdi_destroy",
            "vdi_set_on_boot"
          ],
          "current_operations": {},
          "VDIs": [
            "OpaqueRef:e711b1d9-8a68-4774-9625-92d50da403e1",
            "OpaqueRef:9c46f21b-23d4-4aa4-9cfe-a047f7309fab"
          ],
          "PBDs": [],
          "virtual_allocation": 42949672960,
          "physical_utilisation": 31253938176,
          "physical_size": 448251158528,
          "type": "ext",
          "content_type": "user",
          "shared": false,
          "other_config": {
            "dirty": "",
            "i18n-original-value-name_label": "Local storage",
            "i18n-key": "local-storage"
          },
          "tags": [],
          "sm_config": {
            "devserial": "scsi-3500a075110f7b66d"
          },
          "blobs": {},
          "local_cache_enabled": true,
          "introduced_by": "OpaqueRef:NULL",
          "clustered": false,
          "is_tools_sr": false
        },
        "message": "500 Internal Error",
        "name": "Error",
        "stack": "Error: 500 Internal Error
          at Object.assertSuccess (/usr/local/lib/node_modules/xo-server/node_modules/http-request-plus/index.js:144:19)
          at httpRequestPlus (/usr/local/lib/node_modules/xo-server/node_modules/http-request-plus/index.js:211:22)
          at Xapi.putResource (/usr/local/lib/node_modules/xo-server/node_modules/xen-api/src/index.js:508:22)
          at Xapi._importVm (file:///usr/local/lib/node_modules/xo-server/src/xapi/index.mjs:677:19)
          at Xapi.importVm (file:///usr/local/lib/node_modules/xo-server/src/xapi/index.mjs:799:48)
          at Xoa._downloadAndInstallHubXva (/usr/local/lib/node_modules/xo-server-xoa/src/index.js:702:16)
          at Xoa.createCluster (/usr/local/lib/node_modules/xo-server-xoa/src/recipes/kubernetes-cluster.js:235:18)
          at Api.#callApiMethod (file:///usr/local/lib/node_modules/xo-server/src/xo-mixins/api.mjs:417:20)"
      }
      
      1 Reply Last reply Reply Quote 0
      • olivierlambertO Offline
        olivierlambert Vates 🪐 Co-Founder CEO
        last edited by

        It's hard to know without knowing more: how much resources you declared and how many nodes?

        bullerwinsB 1 Reply Last reply Reply Quote 0
        • bullerwinsB Offline
          bullerwins @olivierlambert
          last edited by

          @olivierlambert
          I declared the minimum just in case:
          88b19d38-bd89-49c8-9511-056c34dbaff7-image.png

          I rebooted the server and this time I didn't get any error (it seems there was a task stuck from previous attempts).

          It has created the master node, but no worker node.
          2966e0f1-1088-40c6-9fef-0e6509b2c822-image.png

          I didn't get a green confirmation message, the "create" button it's stuck spinning.
          8f2aaf56-d364-4908-925d-aaed77df82f4-image.png

          It's been like this for 15min.

          This is the console output for the master node:
          68b21064-4b58-40e9-9454-cc36ff395478-image.png

          I'm not sure where to look to see if there is any progress creating the worker node of if it has failed somewhere

          Thanks a lot for the help!

          dc31xxD 1 Reply Last reply Reply Quote 0
          • dc31xxD Offline
            dc31xx @bullerwins
            last edited by

            @bullerwins Same issue hier. I think the kubeadm: not found is a good indicator that waiting is not going to help. Seems like the recipe is broken.

            0ba9c74f-079f-40c2-acaf-bbfdae55e731-image.png

            T 1 Reply Last reply Reply Quote 0
            • T Offline
              typerlc @dc31xx
              last edited by typerlc

              So, the problem seems to be in this step where it tries to install the version of the tools you requested:

              apt-get install -y kubelet=1.28.5-00 kubeadm=1.28.5-00 kubectl=1.28.5-00
              

              If I login to the VM and run that manually, I get the following error:

              debian@cp:~$ sudo apt-get install -y kubelet=1.28.5-00 kubeadm=1.28.5-00 kubectl=1.28.5-00
              Reading package lists... Done
              Building dependency tree... Done
              Reading state information... Done
              E: Version '1.28.5-00' for 'kubelet' was not found
              E: Version '1.28.5-00' for 'kubeadm' was not found
              E: Version '1.28.5-00' for 'kubectl' was not found
              

              The latest version that seems to be available to the VM is 1.28.2 ... try that and see if it works for you. 1.28.0 also worked for me. Just specify one of these versions when creating the instances through the recipe.

              S 1 Reply Last reply Reply Quote 0
              • olivierlambertO Offline
                olivierlambert Vates 🪐 Co-Founder CEO
                last edited by

                Thanks, @shinuza will fix this ASAP 🙂

                1 Reply Last reply Reply Quote 0
                • S Offline
                  shinuza @typerlc
                  last edited by shinuza

                  @typerlc @dc31xx @olivierlambert

                  Currently the Kubernetes version list is pulled from:

                  https://api.github.com/repos/kubernetes/kubernetes/releases

                  Which lists every Kubernetes available releases, but the latest available version for the xenial repo is v1.28.2 as mentioned by @typerlc. You can use that while we figure out the best way to fix the issue on our side.

                  R 1 Reply Last reply Reply Quote 1
                  • R Offline
                    Rom1 @shinuza
                    last edited by

                    @shinuza
                    The version inside the Debian image and the version on GitHub may be different. It is likely that there will be more recent versions on GitHub. Just don't select a version and the latest version from the Debian repository will be installed.

                    const k8sInstall = k8sVersion !== undefined ? `apt-get install -y kubelet=${k8sVersion} kubeadm=${k8sVersion} kubectl=${k8sVersion}` : "apt-get install -y kubelet kubeadm kubectl";
                    
                    1 Reply Last reply Reply Quote 1
                    • First post
                      Last post