How it works

An operator that explains your autoscaling, then tunes it

stepscale installs as a Kubernetes operator in your own cluster. It reads your HPAs and KEDA ScaledObjects, analyzes the metrics, and writes recommendations you review with kubectl. Nothing is applied until you approve it and turn on apply mode. Metrics, analysis, and any applied change stay in your account.

The loop

Watch

The operator watches your HorizontalPodAutoscalers and KEDA ScaledObjects and reads the real metrics from metrics-server or Prometheus over a rolling window.

Detect

A deterministic rule engine flags candidates: replica floors that sit idle 24/7, CPU targets set too low, autoscalers that thrash, and scale-up that lags traffic.
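
As a rough illustration of what a deterministic detector can look like, here is a minimal Python sketch of two of the four checks (idle floor and thrashing). It is not stepscale's actual rule engine: the function names, the `Candidate` type, and the thresholds (`idle_ratio`, `max_flips`) are assumptions made up for this example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    kind: str       # which detector fired
    evidence: str   # human-readable evidence behind the finding


def detect_idle_floor(replicas: List[int], cpu_util: List[float],
                      min_replicas: int, target_util: float,
                      idle_ratio: float = 0.5) -> Optional[Candidate]:
    """Flag a floor pinned at minReplicas while CPU stays far below target.

    idle_ratio is a hypothetical threshold: peak CPU must stay under
    target_util * idle_ratio for the whole window.
    """
    if not replicas or len(replicas) != len(cpu_util):
        return None
    pinned = all(r == min_replicas for r in replicas)
    peak = max(cpu_util)
    if pinned and min_replicas > 1 and peak < target_util * idle_ratio:
        return Candidate(
            "idle-floor",
            f"replicas held at floor {min_replicas}; "
            f"peak CPU {peak:.0f}% vs target {target_util:.0f}%",
        )
    return None


def detect_thrash(replicas: List[int], max_flips: int = 4) -> Optional[Candidate]:
    """Flag an autoscaler whose replica count reverses direction too often."""
    # Keep only the non-zero changes between consecutive samples.
    deltas = [b - a for a, b in zip(replicas, replicas[1:]) if b != a]
    # A flip is a scale-up followed by a scale-down, or the reverse.
    flips = sum(1 for a, b in zip(deltas, deltas[1:]) if (a > 0) != (b > 0))
    if flips > max_flips:
        return Candidate("thrash", f"{flips} direction reversals in window")
    return None
```

Because both checks are pure functions of the metric window, the same input always yields the same candidate, which is what makes the output easy to review and diff.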

Judge

An LLM, running on your own OpenAI or Anthropic key, reviews each candidate and returns a decision on whether to act, a risk rating, plain-language reasoning, and a savings estimate. Without a key, you get rule-based output instead.

Recommend

Results are written as ScalingRecommendation custom resources you read and diff with kubectl. This is the default, and it never changes a workload.

Apply

With a license, you approve a recommendation and the operator patches the target, holds it on a probation window, and rolls back automatically if health regresses.
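
The patch, probation, and rollback sequence can be sketched in a few lines of Python. This is an illustrative outline only, not the operator's implementation: `apply_with_probation`, the `patch` and `is_healthy` callbacks, and the `checks` count are hypothetical names, and a real controller would space the health checks over the probation window rather than run them back to back.

```python
from typing import Callable, Dict


def apply_with_probation(current: Dict[str, int], proposed: Dict[str, int],
                         patch: Callable[[Dict[str, int]], None],
                         is_healthy: Callable[[], bool],
                         checks: int = 5) -> str:
    """Patch the target, check health `checks` times, revert on regression."""
    patch(proposed)
    for _ in range(checks):
        if not is_healthy():
            patch(current)  # restore the pre-change settings
            return "rolled-back"
    return "promoted"
```

Keeping the previous settings alongside the proposed ones is what makes the revert a single patch rather than a recomputation.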

What it does

Four detectors

Idle floors, over-provisioned targets, thrashing, and scale-up lag. Each candidate explains the evidence behind it.

LLM reasoning, your key

Bring your own OpenAI or Anthropic key, point at a proxy or local model, or run rules-only with no LLM at all.

Predictive schedules

For genuinely periodic workloads, forecast recurring peaks and pre-scale the floor ahead of them. Off until validated for your workload.
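
To make the idea of pre-scaling the floor concrete, here is a small Python sketch that sizes a floor per hour of day from past peaks and looks one hour ahead. The forecasting method stepscale uses is not described here, so treat this as one simple possibility: the function names, `per_pod_capacity`, and `lead_hours` are assumptions for the example.

```python
import math
from collections import defaultdict
from typing import Dict, List, Tuple


def hourly_floor_schedule(samples: List[Tuple[int, float]],
                          per_pod_capacity: float,
                          base_floor: int) -> Dict[int, int]:
    """Map hour-of-day to a replica floor sized for that hour's highest load.

    samples: (hour_of_day, requests_per_second) pairs from several past days.
    per_pod_capacity: assumed requests per second one pod can serve.
    """
    peak_by_hour: Dict[int, float] = defaultdict(float)
    for hour, rps in samples:
        peak_by_hour[hour] = max(peak_by_hour[hour], rps)
    return {hour: max(base_floor, math.ceil(peak / per_pod_capacity))
            for hour, peak in sorted(peak_by_hour.items())}


def prescale_floor(schedule: Dict[int, int], hour: int, base_floor: int,
                   lead_hours: int = 1) -> int:
    """Floor to set now so capacity is ready `lead_hours` before a peak."""
    upcoming = [schedule.get((hour + h) % 24, base_floor)
                for h in range(lead_hours + 1)]
    return max(upcoming)
```

For example, with a recurring 10:00 peak that needs 8 pods, `prescale_floor` called at 09:00 already returns 8, so the floor rises before the traffic does.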

HPA + KEDA

Reads and patches HorizontalPodAutoscalers and KEDA ScaledObjects. A KEDA workload gets one recommendation, not two.
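
KEDA creates and owns a HorizontalPodAutoscaler for each ScaledObject, so an operator that lists both kinds naively would see the same workload twice. One plausible way to avoid that, sketched below in Python over plain dicts shaped like Kubernetes object metadata, is to skip any HPA whose ownerReferences point at a ScaledObject. Whether stepscale deduplicates exactly this way is an assumption, and `autoscaler_targets` is a hypothetical helper.

```python
from typing import Dict, List


def autoscaler_targets(hpas: List[Dict], scaled_objects: List[Dict]) -> List[str]:
    """Return one target per workload, skipping HPAs that a ScaledObject owns."""
    targets = [f"ScaledObject/{so['metadata']['name']}" for so in scaled_objects]
    for hpa in hpas:
        owners = hpa["metadata"].get("ownerReferences", [])
        if any(o.get("kind") == "ScaledObject" for o in owners):
            continue  # KEDA-managed HPA, already covered by its ScaledObject
        targets.append(f"HorizontalPodAutoscaler/{hpa['metadata']['name']}")
    return targets
```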

Safety net

Applied changes run on probation; the operator verifies workload health and auto-reverts a change that degrades it.

Air-gapped

The license is verified offline with no phone-home, and rules-only mode needs no external model. Mirror the signed image and chart internally.

What it is not

- Not a hosted SaaS. There is no stepscale account and no cross-account access to your cloud.
- Not in your traffic path. It reconciles on an interval; your autoscaler keeps making every real-time decision.
- Not a data pipeline out of your cluster. The only outbound call is to your LLM provider, with your key, and only if you set one.
- Not automatic by default. Recommend-only is the default; applying is opt-in and licensed.

Install it and read the first recommendations

One helm install, your own LLM key, recommendations in about 30 minutes once metrics are flowing. No login, no license.
