Skip to content

Running asg-roller at the same time as cluster-autoscaler results in a cluster of unschedulable nodes #19

@tom-butler

Description

@tom-butler

I've been trying to get ASG Roller to work with Cluster Autoscaler but the two seem to be clashing and resulting in a cluster of unschedulable nodes.

I think the following is happening:

  1. ASG Roller notices difference in launch template
  2. ASG Roller scales up cluster
  3. Cluster Autoscaler notices new nodes with no usage, and taints then as PreferNoSchedule
  4. ASG Roller cordons and drains old nodes (all nodes are now unschedulable)

The issue seems to be that cluster-autoscaler taints nodes before it scales them down, the timing of the taint isn't configurable in cluster autoscaler.

Could ASG roller be updated to set the annotation "cluster-autoscaler.kubernetes.io/scale-down-disabled": "true" during scaling events?

I believe this will stop the clashing of ASG Roller and Cluster Autoscaler

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions