Skip to content

Shuttle Failure Modes

Failure MVP Behavior
Shuttle pod crashes Kubernetes restarts it. Cache rebuilds. ~10s downtime. Customer workloads unaffected.
Starbase unreachable Snapshots retry 3x then drop. Capacity retries 3x then drop. Desired State Runnable retries on next tick. Workloads stay running.
K8s API unreachable Informer's built-in resync handles reconnection. Brief gap in snapshots.
Desired state version mismatch Runnable always pulls latest state; no versioning conflicts in MVP.
Pod stuck in terminating Runnable logs warning, does not force delete in MVP.

Cross-references

Retry/loss semantics for snapshots → §19.4 · the level-driven loop that recovers from drift → §19.3 · the silent-staleness alert that catches a frozen Informer → §26.1 · resilience of running workloads during a control-plane outage → §32.