Evict to Rollout

This tool helps single-instance Kubernetes deployments survive node evictions (e.g., Karpenter consolidation or manual drains) without downtime.

The Problem

When a node is drained, single-instance pods are evicted. Kubernetes kills the pod and starts a new one elsewhere. This causes downtime. If you use a PodDisruptionBudget (PDB) with minAvailable: 1, the eviction is blocked indefinitely, preventing node scale-down.

The Solution

This script acts as a bridge. It detects:

Nodes that are draining (unschedulable: true).
Pods on those nodes with the annotation evict-to-rollout: "true".
Deployments that are currently stable.

When found, it triggers a rollout (per default: rolling restart) of the Deployment. This ensures a new pod is started on a different node before the old one is killed, guaranteeing zero downtime.

Usage

1. Annotate your Deployment/Pod

Add the annotation to your Deployment (which propagates to Pods):

apiVersion: apps/v1 kind: Deployment metadata: name: my-app spec: template: metadata: annotations: evict-to-rollout: "true"

Ensure you have a PDB that blocks eviction:

apiVersion: policy/v1 kind: PodDisruptionBudget metadata: name: my-app spec: minAvailable: 1 selector: matchLabels: app: my-app

2. Run locally (Dry Run)

export DRY_RUN=true ./evict_to_rollout.sh

Deployment Options

Helm (recommended)

This repository ships a Helm chart (chart/evict-to-rollout) so you can tweak the schedule, annotation selector, and naming without forking the manifest.

helm upgrade --install evict-to-rollout \ oci://ghcr.io/hivemindtechnologies/evict-to-rollout \ --version 0.1.0 \ --namespace kube-system --create-namespace \ --set schedule="*/2 * * * *" \ --set annotationSelector.key="evict-to-rollout" \ --set annotationSelector.value="true"

Key values:

Value	Description	Default
`schedule`	Cron expression for how often to scan nodes	`/1 * * *`
`annotationSelector.key`/`.value`	Annotation pair that marks pods for rollout	`evict-to-rollout` / `true`
`dryRun`	Set `true` to log would-be rollouts without patching deployments	`false`
`image.repository` / `.tag`	Container image that provides `kubectl` + `jq`	`ghcr.io/hivemindtechnologies/evict-to-rollout/kubectl-jq` / (empty = use chart `appVersion`)
`serviceAccount.create`	Whether to create a dedicated SA	`true`
`rbac.create`	Whether to install ClusterRole + binding	`true`
`nodeSelector` / `tolerations` / `affinity`	Optional scheduling hints	`{}`
`podAnnotations` / `podLabels`	Extra metadata for the CronJob pod	`{}`
`resources`	CPU/memory requests & limits for the CronJob	`{}`

See chart/evict-to-rollout/values.yaml for the full list.

Development & Testing

This repo ships a devbox.json so everyone (including CI) uses the same versions of helm, kubectl, kind, and jq.

# Start a dev shell with all tools: devbox shell # Lint the chart: devbox run lint # Run the end-to-end test (requires Docker since it spins up kind): devbox run test

The test script (scripts/test-kind.sh) creates a 3-node kind cluster, installs the Helm chart, deploys a sample annotated app, cordons a node, runs the controller job manually, and asserts that the deployment was restarted and rescheduled onto a different node.

GitHub Actions mirrors the same flow via .github/workflows/ci.yaml:

on every PR, it runs helm lint and the kind-based integration test.
on pushes to main, it additionally publishes:
- the multi-arch kubectl-jq image tagged as latest and ${LAST_TAG}-sha.${GITHUB_SHA::7}
- a Helm chart tagged as ${LAST_TAG}-sha.${GITHUB_SHA::7} to oci://ghcr.io/hivemindtechnologies/evict-to-rollout
on git tag pushes (e.g. v0.2.0), the same workflow publishes stable artifacts tagged with the release version

Release workflow

The CI pipeline keeps versions in sync automatically:

For pushes to main, it reads the most recent git tag (or 0.0.0 if none exists) and publishes snapshot artifacts tagged as <last-tag>-sha.<short-sha>.
For pushes to annotated tags (e.g. v0.3.0), it strips the v prefix and publishes both the Docker image and the Helm chart with the exact release version.
The pipeline patches chart/evict-to-rollout/Chart.yaml on the fly so that version and appVersion match the artifact tag, and the default image tag in the chart inherits from appVersion.

For local testing the kind script (devbox run test) builds the image and loads it directly into the cluster, so no registry push is required.

Operational gotchas

Node termination grace vs schedule: The CronJob only reacts on its schedule (default 1 minute). Ensure your node termination grace period comfortably exceeds schedule interval + controller runtime, otherwise the node may terminate before the rollout finishes.
Rolling update strategy required: Deployments must use the standard rolling update strategy so that a new pod starts before the old pod is deleted. StatefulSets or Deployments using Recreate will still experience downtime.
Single replica + PDB: Remember to pair single-replica workloads with a PodDisruptionBudget (minAvailable: 1 / maxUnavailable: 0). Without it, Kubernetes can evict the pod immediately even if the controller is running.
Annotation opt-in: Only pods whose template contains the configured annotation (default evict-to-rollout: "true") are handled. Forgetting the annotation means eviction proceeds as usual.
RBAC scope: The included ClusterRole grants read access to nodes/pods and patch access to deployments. Tighten or namespace-scope it if your environment requires stricter permissions.

Missing something? Open an issue with details so we can cover your use-case.

References

Related issues:

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
chart/evict-to-rollout		chart/evict-to-rollout
scripts		scripts
.envrc		.envrc
.gitignore		.gitignore
Dockerfile.kubectl-jq		Dockerfile.kubectl-jq
README.md		README.md
devbox.json		devbox.json
devbox.lock		devbox.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Evict to Rollout

The Problem

The Solution

Usage

1. Annotate your Deployment/Pod

2. Run locally (Dry Run)

Deployment Options

Helm (recommended)

Development & Testing

Release workflow

Operational gotchas

References

About

Uh oh!

Releases

Packages

Uh oh!

Languages

HivemindTechnologies/evict-to-rollout

Folders and files

Latest commit

History

Repository files navigation

Evict to Rollout

The Problem

The Solution

Usage

1. Annotate your Deployment/Pod

2. Run locally (Dry Run)

Deployment Options

Helm (recommended)

Development & Testing

Release workflow

Operational gotchas

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages