Actions Larger Runners Provisioning Delays

Severity: Major
Category: Misconfiguration
Service: GitHub

This summary is created by Generative AI and may differ from the actual content.

Overview

Between Feb 5, 2025 00:34 UTC and 11:16 UTC, up to 7% of organizations using GitHub-hosted larger runners with public IP addresses had those jobs fail to start during the impact window. The issue was caused by a backend migration in the public IP management system, which caused certain public IP address runners to be placed in a non-functioning state.

Impact

up to 7% of organizations using GitHub-hosted larger runners with public IP addresses had those jobs fail to start during the impact window.

Trigger

Actions larger runners are stuck in provisioning for some customers

Detection

We are currently investigating this issue.

Resolution

We have improved the rollback steps for this migration to reduce the time to mitigate any future recurrences, are working to improve automated detection of this error state, and are improving the resiliency of runners to handle this error state without customer impact.

Root Cause

The issue was caused by a backend migration in the public IP management system, which caused certain public IP address runners to be placed in a non-functioning state.