Shuttle Failure Modes¶

Failure	MVP Behavior
Shuttle pod crashes	Kubernetes restarts it. Cache rebuilds. ~10s downtime. Customer workloads unaffected.
Starbase unreachable	Snapshots retry 3x then drop. Capacity retries 3x then drop. Desired State Runnable retries on next tick. Workloads stay running.
K8s API unreachable	Informer's built-in resync handles reconnection. Brief gap in snapshots.
Desired state version mismatch	Runnable always pulls latest state; no versioning conflicts in MVP.
Pod stuck in terminating	Runnable logs warning, does not force delete in MVP.

Cross-references

Retry/loss semantics for snapshots → §19.4 · the level-driven loop that recovers from drift → §19.3 · the silent-staleness alert that catches a frozen Informer → §26.1 · resilience of running workloads during a control-plane outage → §32.