
K8s DaemonSet Incompatible with Autoscaling #888

Open · 11xor6 opened this issue Dec 17, 2024 · 3 comments

11xor6 commented Dec 17, 2024

When the sysbox DaemonSet is deployed against an autoscaling node pool (GKE, but probably relevant on other providers), pods fail to be scheduled on the node(s). The reason is that the RuntimeClass configuration adds the sysbox-runtime: running label to the pod's nodeSelector, which prevents the pod from matching the node pool and in turn prevents scale-up.
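For reference, the RuntimeClass the installer sets up looks roughly like the sketch below (the sysbox-runc name and exact fields are a reconstruction from the sysbox docs, so verify with kubectl get runtimeclass -o yaml on your cluster). The scheduling.nodeSelector is what gets merged into every pod that requests the RuntimeClass; since no node in the pool carries that label until the DaemonSet has finished installing, the autoscaler concludes no node group can ever satisfy the pod:

    # Sketch of the RuntimeClass installed by sysbox-deploy-k8s (reconstructed, not verbatim).
    kubectl apply -f - <<'EOF'
    apiVersion: node.k8s.io/v1
    kind: RuntimeClass
    metadata:
      name: sysbox-runc
    handler: sysbox-runc
    scheduling:
      nodeSelector:
        sysbox-runtime: running   # merged into the nodeSelector of every pod using this class
    EOF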

Switching the RuntimeClass to use a static label for node selection seems workable, given the taint added to the node during installation; however, I have randomly (and very rarely) seen issues with pods dying.
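A minimal sketch of that workaround, assuming the static sysbox-install=yes label that the install instructions already require on sysbox nodes (if that label is baked into the node pool's template, the autoscaler can match it at scale-up time, and the install-time taint still keeps pods off the node until sysbox is ready):

    # Workaround sketch: select nodes by a static, pool-level label instead of the
    # dynamically-added sysbox-runtime=running label. The label name is an assumption.
    kubectl apply -f - <<'EOF'
    apiVersion: node.k8s.io/v1
    kind: RuntimeClass
    metadata:
      name: sysbox-runc
    handler: sysbox-runc
    scheduling:
      nodeSelector:
        sysbox-install: "yes"   # static label defined in the node pool config
    EOF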

11xor6 commented Jan 14, 2025

Just a small update here: I am consistently seeing errors whenever a node scales up. Generally, all pods created during a scale-up will fail when they are scheduled to the new node. The failure seems to be that the pod gets scheduled to the node between the time the DaemonSet script removes the taint and the time that the RuntimeClass is actually ready and supported, and the pod fails because the RuntimeClass isn't supported on the node.

Constructs like Deployments and StatefulSets will generally hide this error by automatically restarting the pod; I only found it because my current application creates a pod directly. This could be mitigated by not removing the taint until after the RuntimeClass is actually available on the node.
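A rough sketch of that ordering, purely as an illustration (the taint key, the canary approach, and the RuntimeClass name are assumptions, not the actual sysbox-deploy-k8s logic): run a throwaway pod that requests the sysbox RuntimeClass on the new node, and only remove the taint once it becomes Ready.

    # Hypothetical gating of the taint removal on RuntimeClass readiness.
    NODE="$(hostname)"                                 # node the DaemonSet pod runs on (assumed)
    TAINT="sysbox-runtime=not-running:NoSchedule"      # placeholder; use the taint the installer actually adds

    # Canary pod pinned to this node, requesting the sysbox RuntimeClass and tolerating the taint.
    OVERRIDES='{"apiVersion":"v1","spec":{"runtimeClassName":"sysbox-runc","nodeName":"'"$NODE"'","tolerations":[{"operator":"Exists"}]}}'
    kubectl run sysbox-canary --image=alpine --restart=Never --overrides="$OVERRIDES" -- sleep 3600

    # Untaint only once the canary is actually Running under sysbox.
    kubectl wait --for=condition=Ready pod/sysbox-canary --timeout=180s && \
      kubectl taint nodes "$NODE" "${TAINT}-"
    kubectl delete pod sysbox-canary --wait=false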

ctalledo (Member) commented

Hi @11xor6, thanks for reporting the issue.

> The failure seems to be that the pod gets scheduled to the node between the time the DaemonSet script removes the taint and the time that the RuntimeClass is actually ready and supported, and the pod fails because the RuntimeClass isn't supported on the node.

Not sure how that can be the case though, because no sysbox pods will be scheduled on the node until it's labeled with sysbox-runtime=running, and that labeling already occurs **before** the taint is removed (see this excerpt from around line 1250 of the sysbox-deploy-k8s main script):

    add_label_to_node "crio-runtime=running"
    add_label_to_node "sysbox-runtime=running"
    rm_taint_from_node "${k8s_taints}"

So something else must be going on (?)
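If it helps narrow this down, a few checks along these lines (the sysbox-runc name is an assumption) would show what state a freshly scaled-up node is in when the failing pod lands on it:

    NODE="<new-node-name>"   # placeholder for the freshly added node
    kubectl get node "$NODE" --show-labels | tr ',' '\n' | grep -E 'sysbox|crio'
    kubectl get node "$NODE" -o jsonpath='{.spec.taints}{"\n"}'
    kubectl get runtimeclass sysbox-runc -o yaml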

ctalledo commented Jan 25, 2025

BTW, in case you want to play around with the sequencing of steps in sysbox-deploy-k8s, you can follow these steps (a rough command-line sketch follows the list):

  1. git clone the sysbox repo and edit the sysbox-deploy-k8s main script as needed.

  2. From within the sysbox-pkgr/k8s directory type make to build a new sysbox-deploy-k8s container image.

  3. Push that image to your repo.

  4. Modify the sysbox-install.yaml to point to your new image

  5. kubectl apply -f sysbox-install.yaml to apply it on your k8s cluster.
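As a sketch of those steps (the clone URL, registry name, and image tag below are placeholders; adjust to your fork and registry):

    # 1. Clone the sysbox repo (it uses git submodules) and edit the sysbox-deploy-k8s script.
    git clone --recursive https://github.com/nestybox/sysbox.git && cd sysbox
    # ...edit the sysbox-deploy-k8s main script as needed...

    # 2. Build a new sysbox-deploy-k8s container image.
    cd sysbox-pkgr/k8s && make

    # 3. Push the image to your own registry (names are placeholders).
    BUILT_IMAGE="<image produced by make>"    # check `docker images` after the build
    docker tag "$BUILT_IMAGE" registry.example.com/sysbox-deploy-k8s:dev
    docker push registry.example.com/sysbox-deploy-k8s:dev

    # 4. Point sysbox-install.yaml at the new image (edit the DaemonSet's image field), then
    # 5. apply it to the cluster.
    kubectl apply -f sysbox-install.yaml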
