How to kubernetes on xcp-ng (csi?)
-
@Theoi-Meteoroi said in How to kubernetes on xcp-ng (csi?):
I've been using this with NVMe on 3 Dell 7920 boxen with PCI passthru.
https://github.com/piraeusdatastore/piraeus-operator
It worked well enough that I installed the rest of the NVMe slots to have 7TB per node. I pin the master kubernetes nodes each to a physical node, I use 3 so I can roll updates and patches. The masters serve the storage out to containers - so the workers are basically "storage-less". Those worker nodes can move around. All the networking is 10G with 4 interfaces, so I have one specifically as the backend for this.
Just one note on handing devices to the operator - I use raw NVMe disk.
There can't be any partition or PV on the device. I put a PV on, then erase it so the disk is wiped. Then the operator finds the disk usable an initializes. It tries to not use a disk that seems in use already.I also played a bit with XOSTOR but on spinning rust. Its really robust with the DRBD backend once you get used to working with it. Figuring out object relationships will have you maybe drink more than usual.
Did you use the built-in Recipes to create the kubernetes cluster? I tried NVMe, iCSCI, SSD, NFS Share. All the same thing.
-
You have a network issue (well, a DNS one) inside your VM, are you using the right network?
-
@olivierlambert said in How to kubernetes on xcp-ng (csi?):
You have a network issue (well, a DNS one) inside your VM, are you using the right network?
Feel so dumb. When creating a VM, usually the top Network is the correct one. For Kubernetes, I had to scroll all the way down and select the correct network.
-
Is it working correctly now?
-
@olivierlambert said in How to kubernetes on xcp-ng (csi?):
Is it working correctly now?
Currently it is stuck on this.
[FAILED] Failed to start Execute cloud user/final scripts.
cp-1 login:
I did not specify login credentials.
-
Still a network problem (Cloud init can't reach something, no route to host)
-
@olivierlambert said in How to kubernetes on xcp-ng (csi?):
Still a network problem (Cloud init can't reach something, no route to host)
This is the same error I used to get on my homelab when manually installing Ubuntu and trying to deploy k3s with rancheros and longhorn.
This try is on our datacenter and not my homelab. I'll do another setup with DHCP.
-
@olivierlambert Nope, still the same error.
I don't think our Sonicwall (in the datacenter) is blocking anything. Since at home, I am using Unifi. -
fwiw, i'm about to set up a kubernetes cluster on xcp-ng. i'm still in the process, but i'm planning on just passing disks as hba storage to worker/storage nodes, then using openebs jiva (or maybe rook/ceph)
if anybody is interested in how that goes, i can post about it later on
-
Adding @shinuza also in the convo
-
@mohammadm said in How to kubernetes on xcp-ng (csi?):
@olivierlambert said in How to kubernetes on xcp-ng (csi?):
Is it working correctly now?
Currently it is stuck on this.
[FAILED] Failed to start Execute cloud user/final scripts.
cp-1 login:
I did not specify login credentials.
The error is probably earlier than that. You can see the full output of the cloudinit script in /var/log/cloud-init-output.log
The error you are seeing is most likely due to the fact the kubernetes cluster could not be initialized. (kubeadm init failed) -
@xyhhx said in How to kubernetes on xcp-ng (csi?):
fwiw, i'm about to set up a kubernetes cluster on xcp-ng. i'm still in the process, but i'm planning on just passing disks as hba storage to worker/storage nodes, then using openebs jiva (or maybe rook/ceph)
if anybody is interested in how that goes, i can post about it later on
Jiva is probably not the way to go, I believe the supported way is to use Mayastor (via nvme-tcp)
-
@shinuza oh, yeah. they've deprecated it in the last couple of months. thanks for the heads up
-
I should mention I use static IP for my VMs and I don't deploy using the hub recipes because my home setup is all Community bits and 8.3 XCP, updated regularly.
I use kubespray. I'm at kubernetes 1.27.7So I can't provide any help with cloud init or hub/recipes.
The working setup I have is built on CentOS7 ( need to replace ) VMs. I tried deploying on RockyOS 9.3 and spun my wheels for too long on issues there so I'm going to try 8.9.
I just updated my XO-CE bits and see some nice new features in the UI that will be helpful with all the PCI passthru I want to do. I was putting add/remove instructions in the VM notes and tags on the VMs just to remind me of what they show below. Its like they read my mind.
I need to learn KubeVirt.
Its christmas in may.
-
@Theoi-Meteoroi check out https://talos.dev
-
@xyhhx Why thanks! I'll take a look at it. I've done coreos and flatcar for work but find immutable OS no fun at home. I'm used to doing all the admin already and most of my VMs live long enough to be classed as "pets".
-
@Theoi-Meteoroi lol i've been seeing that analogy more and more lately. to each their own!
i think that's worth doing when the vms aren't being used as kubernetes nodes, but for kubernetes i'd prefer the vms be strictly for kubernetes
talos is purpose-built for this, too, so you get lots of great features, and you can be confident they're secure, and minimal. their support chat is very responsive too, if you're on matrix
ok i'm done shilling talos now
-
@xyhhx As I look further, it does seem to check a lot of boxes if I was building out a prod instance. I have enough exposure to dev-sec that I see the value. I feel like I sound like Linus and his comments about Debian but learning new ecosystems takes time and energy and sometimes is just distraction from building something. I kinda wore myself out learning all the details and quirks of DRBD and Linstor recently so I'll read some of the docs and hope the toolchain doesn't chase me away. Any product or project that genuinely and openly has a community like we have here would usually have my attention, just on those terms.
-
@Theoi-Meteoroi if you do check it out don't be shy to ping me either here or on matrix (
@xyhhx:matrix.org
) and i'll be happy to try to help -
So all of you have connected the storage directly to the vms?
I'm trying to do it on iscsi and nfs storage.