-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compute nodes are in 'not ready' state when they are PXE booted and joined to the cluster again #242
Comments
I think for now reinitializing would be alright. We used to have functionality through the |
Currently this functionality will work but someone has to manually enter 'kubeadm reset' command in all the nodes, since there is no task that executes this command. Adding this 'kubeadm reset' task in the k8s_start_manager and k8s_start_workers roles will do the job. |
We do have a script ( |
Issue #242: Added kubeadm reset task to fix removing & adding of comp…
I'd like to revisit this issue. If you already have a k8s manager/head node up and running and you PXE boot computes, the system should not automatically reset with when adding or re-imaging nodes:
I think @blesson-james took care of the 1st bullet with the PR yesterday but we should also take into account the 2nd bullet to resolve close this issue. |
@j0hnL I have added checks for 'NotReady' compute nodes in PR #262 , this will take care of the below points:
For redeploying the whole cluster, user will have to PXE boot the head node along with the compute nodes. This can be later taken care of by giving user a rollback functionality by converting scuttle into a playbook, hence avoiding PXE boot. |
Issue #242: Added checks to join compute nodes without redeploying cl…
Describe the bug
In an existing kubernetes cluster with head and compute nodes, if the compute nodes are PXE booted, and user wants to join the same/new compute node to the cluster again and executes omnia.yml, then 'kubeadm join' task present in k8s_start_workers role is skipped.
Reasons:
Solution:
To Reproduce
Steps to reproduce the behavior:
Expected behavior
The compute node should be joined back to cluster and config details in the head node should be updated.
Screenshots
If applicable, add screenshots to help explain your problem.
Desktop (please complete the following information):
Smartphone (please complete the following information):
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: