XOA Recipe for Kubernetes cluster requiements?
-
Hi!
I'm trying to deploy a k8s cluster but getting a "500 internal error" this is the log.Is there maybe a minimum requirement for free RAM/Disk that I'm not meeting? Is there anywhere where I can check the recipe config file or something similar to check it?
The target SR free space is 350GB
This are the host stats
Error log file (i've only taken out the pub key):
xoa.recipe.createKubernetesCluster { "clusterName": "k8sRodri", "controlPlanePoolSize": 1, "k8sVersion": "1.28.3-00", "nbNodes": 1, "network": "fe86cad9-df93-deb8-0f56-afe8e453b8e0", "sr": "a1316ff3-b8ad-fdd8-4bb3-4ed660bbdb8b", "sshKey": "AAAAB3xx[censored]xxxx2kQ==" } { "originalUrl": "https://192.168.10.198/import/?sr_id=OpaqueRef%3A3cec6886-1be2-4dce-b152-1e190fd8aa5b&session_id=OpaqueRef%3A3c71c7f9-27ce-470d-b5a5-524f0f73f887", "url": "https://192.168.10.198/import/?sr_id=OpaqueRef%3A3cec6886-1be2-4dce-b152-1e190fd8aa5b&session_id=OpaqueRef%3A3c71c7f9-27ce-470d-b5a5-524f0f73f887", "pool_master": { "uuid": "b5a406c7-e3c3-4656-a0c4-f01baf26d2b4", "name_label": "xcp-ng-12400", "name_description": "Default install", "memory_overhead": 655278080, "allowed_operations": [ "vm_migrate", "provision", "vm_resume", "evacuate", "vm_start" ], "current_operations": {}, "API_version_major": 2, "API_version_minor": 16, "API_version_vendor": "XenSource", "API_version_vendor_implementation": {}, "enabled": true, "software_version": { "product_version": "8.2.1", "product_version_text": "8.2", "product_version_text_short": "8.2", "platform_name": "XCP", "platform_version": "3.2.1", "product_brand": "XCP-ng", "build_number": "release/yangtze/master/58", "hostname": "localhost", "date": "2023-10-18", "dbv": "0.0.1", "xapi": "1.20", "xen": "4.13.5-9.37", "linux": "4.19.0+1", "xencenter_min": "2.16", "xencenter_max": "2.16", "network_backend": "openvswitch", "db_schema": "5.603" }, "other_config": { "agent_start_time": "1700030285.", "boot_time": "1700030236.", "rpm_patch_installation_time": "1699202788.539", "iscsi_iqn": "iqn.2023-07.com.example:aa9ef3b7" }, "capabilities": [ "xen-3.0-x86_64", "hvm-3.0-x86_32", "hvm-3.0-x86_32p", "hvm-3.0-x86_64", "" ], "cpu_configuration": {}, "sched_policy": "credit", "supported_bootloaders": [ "pygrub", "eliloader" ], "resident_VMs": [ "OpaqueRef:9758ce9f-bfc4-49d4-a5c9-92ac9457fe64", "OpaqueRef:3da033b8-1a94-4e56-bed5-0edda49315ab", "OpaqueRef:8e78839e-826f-429f-bb8b-ac6141fe7abe", "OpaqueRef:b11b30e1-353a-42c1-b5d4-1c7e44c5ff24", "OpaqueRef:2f0bea77-1af8-4e25-a9a5-1b8758732343" ], "logging": {}, "PIFs": [ "OpaqueRef:d44da9d7-0dff-4919-b891-dc6342f73570" ], "suspend_image_sr": "OpaqueRef:9f3db79f-b06d-4ef9-adc2-df280d476d3c", "crash_dump_sr": "OpaqueRef:9f3db79f-b06d-4ef9-adc2-df280d476d3c", "crashdumps": [], "patches": [], "updates": [], "PBDs": [ "OpaqueRef:feb86773-f35c-41b0-8e37-741c6a315613", "OpaqueRef:7619d6a7-db7c-4670-97cc-f4fc0fd06ec4", "OpaqueRef:5a1a4090-0602-4298-a837-fd718dc689a8", "OpaqueRef:561f9549-a6e9-41be-b711-1619bc54e309", "OpaqueRef:537ce3e9-f134-4de3-8274-06675517b111", "OpaqueRef:3088a871-d7ab-4011-94f0-d961dff9eb36", "OpaqueRef:19d98d1e-5166-4b88-b861-d5fe2335fc16", "OpaqueRef:07b0be80-9c78-4216-9745-02cd9ebe8b9a" ], "host_CPUs": [ "OpaqueRef:0d0e820a-1785-450c-be71-2ba3bafd6185", "OpaqueRef:aeb53d4a-2a40-4713-b8a3-647ddf4820d1", "OpaqueRef:d5630f42-a5b6-4226-8611-4b2ff3204c2c", "OpaqueRef:27f06ef0-3a1b-48c0-b93b-435a91da639e", "OpaqueRef:267853ae-f85f-44b8-b07a-7d0233f84110", "OpaqueRef:057190e0-b7db-474b-b9e2-27f78c271cf3", "OpaqueRef:ac60e622-673e-47dd-8abb-66a8a4ab4ede", "OpaqueRef:66aa9e2d-65b0-4ba0-b3ef-0881d41f3ab6", "OpaqueRef:f441c719-c49f-4b75-be34-98701e36d684", "OpaqueRef:50cd74cb-bd30-48b0-b67e-62b2589b7fef", "OpaqueRef:0fe4767c-a5bd-46f8-a02c-a505d3313f1b", "OpaqueRef:29cadecc-d48f-4ad1-b95b-db6baab5a41e" ], "cpu_info": { "cpu_count": "12", "socket_count": "1", "vendor": "GenuineIntel", "speed": "2496.126", "modelname": "12th Gen Intel(R) Core(TM) i5-12400", "family": "6", "model": "151", "stepping": "2", "flags": "fpu de tsc msr pae mce cx8 apic sep mca cmov pat clflush acpi mmx fxsr sse sse2 ss ht syscall nx rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid pni pclmulqdq monitor est ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch cpuid_fault ssbd ibrs ibpb stibp ibrs_enhanced fsgsbase bmi1 avx2 bmi2 erms rdseed adx clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 gfni vaes vpclmulqdq rdpid arch_capabilities", "features_pv": "1fc9cbf5-f6f83203-2991cbf5-00000123-00000007-218c0329-00400700-00000000-00001000-ac000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e167-00000000-00000000-00000000-00000000-00000000", "features_hvm": "1fcbfbff-f7fa3223-2d93fbff-00000523-0000000f-219c07ab-0040070c-00000000-00001000-bc000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e167-00000000-00000000-00000000-00000000-00000000", "features_hvm_host": "1fcbfbff-f7fa3223-2c100800-00000121-0000000f-219c07ab-0040070c-00000000-00001000-bc000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e163-00000000-00000000-00000000-00000000-00000000", "features_pv_host": "1fc9cbf5-f6f83203-28100800-00000121-00000007-218c0329-00400700-00000000-00001000-ac000400-00000000-00000000-00000000-00000001-00000000-00000000-0408e163-00000000-00000000-00000000-00000000-00000000" }, "hostname": "xcp-ng-12400", "address": "192.168.10.198", "metrics": "OpaqueRef:ab7679c5-31e0-4975-8e72-ca800ad31e02", "license_params": { "restrict_vswitch_controller": "false", "restrict_lab": "false", "restrict_stage": "false", "restrict_storagelink": "false", "restrict_storagelink_site_recovery": "false", "restrict_web_selfservice": "false", "restrict_web_selfservice_manager": "false", "restrict_hotfix_apply": "false", "restrict_export_resource_data": "false", "restrict_read_caching": "false", "restrict_cifs": "false", "restrict_health_check": "false", "restrict_xcm": "false", "restrict_vm_memory_introspection": "false", "restrict_batch_hotfix_apply": "false", "restrict_management_on_vlan": "false", "restrict_ws_proxy": "false", "restrict_vlan": "false", "restrict_qos": "false", "restrict_pool_attached_storage": "false", "restrict_netapp": "false", "restrict_equalogic": "false", "restrict_pooling": "false", "enable_xha": "true", "restrict_marathon": "false", "restrict_email_alerting": "false", "restrict_historical_performance": "false", "restrict_wlb": "false", "restrict_rbac": "false", "restrict_dmc": "false", "restrict_checkpoint": "false", "restrict_cpu_masking": "false", "restrict_connection": "false", "platform_filter": "false", "regular_nag_dialog": "false", "restrict_vmpr": "false", "restrict_vmss": "false", "restrict_intellicache": "false", "restrict_gpu": "false", "restrict_dr": "false", "restrict_vif_locking": "false", "restrict_storage_xen_motion": "false", "restrict_vgpu": "false", "restrict_integrated_gpu_passthrough": "false", "restrict_vss": "false", "restrict_guest_agent_auto_update": "false", "restrict_pci_device_for_auto_update": "false", "restrict_xen_motion": "false", "restrict_guest_ip_setting": "false", "restrict_ad": "false", "restrict_nested_virt": "false", "restrict_live_patching": "false", "restrict_set_vcpus_number_live": "false", "restrict_pvs_proxy": "false", "restrict_igmp_snooping": "false", "restrict_rpu": "false", "restrict_pool_size": "false", "restrict_cbt": "false", "restrict_usb_passthrough": "false", "restrict_network_sriov": "false", "restrict_corosync": "true", "restrict_zstd_export": "false", "restrict_pool_secret_rotation": "false" }, "ha_statefiles": [], "ha_network_peers": [], "blobs": {}, "tags": [], "external_auth_type": "", "external_auth_service_name": "", "external_auth_configuration": {}, "edition": "xcp-ng", "license_server": { "address": "localhost", "port": "27000" }, "bios_strings": { "bios-vendor": "American Megatrends International, LLC.", "bios-version": "F11", "system-manufacturer": "Gigabyte Technology Co., Ltd.", "system-product-name": "B760M GAMING X DDR4", "system-version": "Default string", "system-serial-number": "Default string", "baseboard-manufacturer": "Gigabyte Technology Co., Ltd.", "baseboard-product-name": "B760M GAMING X DDR4", "baseboard-version": "x.x", "baseboard-serial-number": "Default string", "oem-1": "Xen", "oem-2": "MS_VM_CERT/SHA1/bdbeb6e0a816d43fa6d3fe8aaef04c2bad9d3e3d", "oem-3": "Default string", "hp-rombios": "" }, "power_on_mode": "", "power_on_config": {}, "local_cache_sr": "OpaqueRef:9f3db79f-b06d-4ef9-adc2-df280d476d3c", "chipset_info": { "iommu": "false" }, "PCIs": [ "OpaqueRef:c3ccbcef-757c-426e-a6c0-71ca59c392a8", "OpaqueRef:b634af83-c4f6-44c7-be28-aad0158d2b72", "OpaqueRef:2d18b304-b03a-45d7-9378-4055cddf68a0", "OpaqueRef:16189584-ccdd-4b31-acfd-28ccd80a0c9e", "OpaqueRef:0dc37b73-b7f5-4c67-a55e-18ae3c7e2588" ], "PGPUs": [ "OpaqueRef:1b9f4c3b-1136-4670-969a-4d8e72946a43" ], "PUSBs": [], "ssl_legacy": false, "guest_VCPUs_params": {}, "display": "enabled", "virtual_hardware_platform_versions": [ 0, 1, 2 ], "control_domain": "OpaqueRef:2f0bea77-1af8-4e25-a9a5-1b8758732343", "updates_requiring_reboot": [], "features": [], "iscsi_iqn": "iqn.2023-07.com.example:aa9ef3b7", "multipathing": false, "uefi_certificates": "", "certificates": [], "editions": [ "xcp-ng" ], "https_only": false }, "SR": { "uuid": "a1316ff3-b8ad-fdd8-4bb3-4ed660bbdb8b", "name_label": "Local storage", "name_description": "", "allowed_operations": [ "vdi_enable_cbt", "vdi_list_changed_blocks", "unplug", "plug", "pbd_create", "vdi_disable_cbt", "update", "pbd_destroy", "vdi_resize", "forget", "vdi_clone", "vdi_data_destroy", "scan", "vdi_snapshot", "vdi_mirror", "vdi_create", "vdi_destroy", "vdi_set_on_boot" ], "current_operations": {}, "VDIs": [ "OpaqueRef:e711b1d9-8a68-4774-9625-92d50da403e1", "OpaqueRef:9c46f21b-23d4-4aa4-9cfe-a047f7309fab" ], "PBDs": [], "virtual_allocation": 42949672960, "physical_utilisation": 31253938176, "physical_size": 448251158528, "type": "ext", "content_type": "user", "shared": false, "other_config": { "dirty": "", "i18n-original-value-name_label": "Local storage", "i18n-key": "local-storage" }, "tags": [], "sm_config": { "devserial": "scsi-3500a075110f7b66d" }, "blobs": {}, "local_cache_enabled": true, "introduced_by": "OpaqueRef:NULL", "clustered": false, "is_tools_sr": false }, "message": "500 Internal Error", "name": "Error", "stack": "Error: 500 Internal Error at Object.assertSuccess (/usr/local/lib/node_modules/xo-server/node_modules/http-request-plus/index.js:144:19) at httpRequestPlus (/usr/local/lib/node_modules/xo-server/node_modules/http-request-plus/index.js:211:22) at Xapi.putResource (/usr/local/lib/node_modules/xo-server/node_modules/xen-api/src/index.js:508:22) at Xapi._importVm (file:///usr/local/lib/node_modules/xo-server/src/xapi/index.mjs:677:19) at Xapi.importVm (file:///usr/local/lib/node_modules/xo-server/src/xapi/index.mjs:799:48) at Xoa._downloadAndInstallHubXva (/usr/local/lib/node_modules/xo-server-xoa/src/index.js:702:16) at Xoa.createCluster (/usr/local/lib/node_modules/xo-server-xoa/src/recipes/kubernetes-cluster.js:235:18) at Api.#callApiMethod (file:///usr/local/lib/node_modules/xo-server/src/xo-mixins/api.mjs:417:20)" }
-
It's hard to know without knowing more: how much resources you declared and how many nodes?
-
@olivierlambert
I declared the minimum just in case:
I rebooted the server and this time I didn't get any error (it seems there was a task stuck from previous attempts).
It has created the master node, but no worker node.
I didn't get a green confirmation message, the "create" button it's stuck spinning.
It's been like this for 15min.
This is the console output for the master node:
I'm not sure where to look to see if there is any progress creating the worker node of if it has failed somewhere
Thanks a lot for the help!
-
@bullerwins Same issue hier. I think the
kubeadm: not found
is a good indicator that waiting is not going to help. Seems like the recipe is broken. -
So, the problem seems to be in this step where it tries to install the version of the tools you requested:
apt-get install -y kubelet=1.28.5-00 kubeadm=1.28.5-00 kubectl=1.28.5-00
If I login to the VM and run that manually, I get the following error:
debian@cp:~$ sudo apt-get install -y kubelet=1.28.5-00 kubeadm=1.28.5-00 kubectl=1.28.5-00 Reading package lists... Done Building dependency tree... Done Reading state information... Done E: Version '1.28.5-00' for 'kubelet' was not found E: Version '1.28.5-00' for 'kubeadm' was not found E: Version '1.28.5-00' for 'kubectl' was not found
The latest version that seems to be available to the VM is 1.28.2 ... try that and see if it works for you. 1.28.0 also worked for me. Just specify one of these versions when creating the instances through the recipe.
-
Thanks, @shinuza will fix this ASAP
-
@typerlc @dc31xx @olivierlambert
Currently the Kubernetes version list is pulled from:
https://api.github.com/repos/kubernetes/kubernetes/releases
Which lists every Kubernetes available releases, but the latest available version for the
xenial
repo isv1.28.2
as mentioned by @typerlc. You can use that while we figure out the best way to fix the issue on our side. -
@shinuza
The version inside the Debian image and the version on GitHub may be different. It is likely that there will be more recent versions on GitHub. Just don't select a version and the latest version from the Debian repository will be installed.const k8sInstall = k8sVersion !== undefined ? `apt-get install -y kubelet=${k8sVersion} kubeadm=${k8sVersion} kubectl=${k8sVersion}` : "apt-get install -y kubelet kubeadm kubectl";