Failover is also supported using the Astro API.
Running, scheduled, and event-triggered tasks may be impacted during the failover window. Tasks may fail and require a retry.
In the Astro UI, go to Organization Settings > Clusters, select your primary cluster, and open the Disaster Recovery tab. Confirm the Status indicator shows the primary region is active.
On the Disaster Recovery tab, click Failover. Alternatively, open the cluster’s actions menu (⋯) at the top right of the page and select Failover to Secondary…. Follow the prompts to confirm.
The secondary cluster is promoted to active, and all Deployments and data become available in the secondary cluster.
After the primary region recovers, you can fail back to the original primary cluster.
Failback is also supported using the Astro API.
Running, scheduled, and event-triggered tasks may be impacted during the failback window. Tasks may fail and require a retry.
If you have Universal Metrics Export (UME) configured, the same UME configuration applies to both the primary and secondary clusters. Metrics exported from each cluster include a cloud_region attribute so you can distinguish data from each cluster in your metrics system.
After failover, update your UME settings if needed to reflect the new active cluster.