Skip to main content

Scaling

Controls replica count and autoscaling behavior for app environments.

Attributes

AttributeTypeDescription
replicasUnion[int, Tuple[int, int]] = (0, 1)Number of replicas to run. Can be a single int or a tuple of two ints representing the min and max replicas.
metricOptional[Union[Concurrency, RequestRate]] = nullMetric to use for autoscaling. Can be a concurrency or request rate.
scaledown_after`inttimedelta

Constructor

Signature

def Scaling(
replicas: Union[int, Tuple[int, int]] = (0, 1),
metric: Optional[Union[Concurrency, RequestRate]] = None,
scaledown_after: int | timedelta | None = None
) - > null

Parameters

NameTypeDescription
replicasUnion[int, Tuple[int, int]] = (0, 1)Number of replicas. An int for fixed count, or a (min, max) tuple for autoscaling.
metricOptional[Union[Concurrency, RequestRate]] = NoneAutoscaling metric such as Scaling.Concurrency(val) or Scaling.RequestRate(val).
scaledown_after`inttimedelta

Methods


get_replicas()

@classmethod
def get_replicas() - > Tuple[int, int]

Retrieves the normalized replica range for the environment. This method ensures the output is always a tuple, converting fixed integer replica counts into a (min, max) pair where both values are identical.

Returns

TypeDescription
Tuple[int, int]A tuple containing the minimum and maximum number of replicas to be maintained by the autoscaler.