GitHub Actions broke helm/kind-action

Until about yesterday, it was possible to use the helm/kind-action action to spin up a Kubernetes cluster in GitHub Actions.

Starting today, I see those clusters fail with a network (CNI) error.

For example, see the commit "always collect logs" (ctron/kind-reproducer@4ff4f2a).

NOTE: The fact that the test run succeeded is itself a failure of the action, which reports the outcome as “ok” although the logs show a failure:

Run helm/kind-action@v1.1.0
Installing kind...
Adding kind directory to PATH...
Installing kubectl...
Adding kubectl directory to PATH...
kind v0.9.0 go1.15.2 linux/amd64
Client Version: version.Info{Major:"1", Minor:"19", GitVersion:"v1.19.1", GitCommit:"206bcadf021e76c27513500ca24182692aabd17e", GitTreeState:"clean", BuildDate:"2020-09-09T11:26:42Z", GoVersion:"go1.15", Compiler:"gc", Platform:"linux/amd64"}
Creating kind cluster...
Creating cluster "kind" ...
 • Ensuring node image (kindest/node:v1.19.1) 🖼  ...
 ✓ Ensuring node image (kindest/node:v1.19.1) 🖼
 • Preparing nodes 📦   ...
 ✓ Preparing nodes 📦 
 • Writing configuration 📜  ...
 ✓ Writing configuration 📜
 • Starting control-plane 🕹️  ...
 ✓ Starting control-plane 🕹️
 • Installing CNI 🔌  ...
 ✓ Installing CNI 🔌
 • Installing StorageClass 💾  ...
 ✓ Installing StorageClass 💾
 • Waiting ≤ 5m0s for control-plane = Ready ⏳  ...
 ✗ Waiting ≤ 5m0s for control-plane = Ready ⏳
 • WARNING: Timed out waiting for Ready ⚠️
Set kubectl context to "kind-kind"
You can now use your cluster with:

kubectl cluster-info --context kind-kind

Not sure what to do next? 😅  Check out https://kind.sigs.k8s.io/docs/user/quick-start/
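
Because the action reports "ok" even though the control plane never becomes Ready, a workflow could add its own check on the captured output. The following is only a minimal sketch, assuming the output of the cluster creation step has been saved as text; the function name `cluster_became_ready` is hypothetical and not part of kind or helm/kind-action. It looks for the two markers visible in the log above (a step prefixed with ✗ and the "Timed out waiting for Ready" warning).

```python
def cluster_became_ready(kind_output: str) -> bool:
    """Return False if the captured output shows a failed step or the Ready timeout.

    Hypothetical helper: it only inspects text, it does not talk to the cluster.
    """
    for line in kind_output.splitlines():
        text = line.strip()
        # kind prefixes failed steps with a cross mark and prints a timeout warning.
        if text.startswith("✗") or "Timed out waiting for Ready" in text:
            return False
    return True


# Example input: two lines copied from the log above.
sample = (
    " ✗ Waiting ≤ 5m0s for control-plane = Ready ⏳\n"
    " • WARNING: Timed out waiting for Ready ⚠️\n"
)
print("ready" if cluster_became_ready(sample) else "not ready")
```

A CI step could turn the `False` result into a non-zero exit code so that the job fails visibly instead of continuing with a broken cluster.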

Looking at the logs, I can see that the CNI plugin (kindnetd) crashes after repeatedly timing out while trying to reach the API server at 10.96.0.1:443:

2021-06-30T12:10:06.697532647Z stderr F I0630 12:10:06.696212       1 main.go:65] hostIP = 172.18.0.2
2021-06-30T12:10:06.697555049Z stderr F podIP = 172.18.0.2
2021-06-30T12:10:06.697560949Z stderr F I0630 12:10:06.696339       1 main.go:74] setting mtu 1500 for CNI 
2021-06-30T12:10:36.872151809Z stderr F I0630 12:10:36.872022       1 main.go:111] Failed to get nodes, retrying after error: Get https://10.96.0.1:443/api/v1/nodes: dial tcp 10.96.0.1:443: i/o timeout
2021-06-30T12:11:06.872474169Z stderr F I0630 12:11:06.872340       1 main.go:111] Failed to get nodes, retrying after error: Get https://10.96.0.1:443/api/v1/nodes: dial tcp 10.96.0.1:443: i/o timeout
2021-06-30T12:11:37.872999728Z stderr F I0630 12:11:37.872867       1 main.go:111] Failed to get nodes, retrying after error: Get https://10.96.0.1:443/api/v1/nodes: dial tcp 10.96.0.1:443: i/o timeout
2021-06-30T12:12:09.873430044Z stderr F I0630 12:12:09.873303       1 main.go:111] Failed to get nodes, retrying after error: Get https://10.96.0.1:443/api/v1/nodes: dial tcp 10.96.0.1:443: i/o timeout
2021-06-30T12:12:42.874130909Z stderr F I0630 12:12:42.873979       1 main.go:111] Failed to get nodes, retrying after error: Get https://10.96.0.1:443/api/v1/nodes: dial tcp 10.96.0.1:443: i/o timeout
2021-06-30T12:12:46.876398745Z stderr F panic: Reached maximum retries obtaining node list: Get https://10.96.0.1:443/api/v1/nodes: dial tcp 10.96.0.1:443: i/o timeout
2021-06-30T12:12:46.876426847Z stderr F 
2021-06-30T12:12:46.876433547Z stderr F goroutine 1 [running]:
2021-06-30T12:12:46.876438648Z stderr F main.main()
2021-06-30T12:12:46.876443448Z stderr F 	/go/src/cmd/kindnetd/main.go:115 +0x905
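
To spot this pattern across many runs without reading each log by hand, the retry and panic lines can be summarized with a small script. This is a sketch under assumptions: the regular expression is derived only from the log lines shown above, and the names `summarize_kindnet_log` and `KindnetSummary` are hypothetical, not part of kind.

```python
import re
from typing import NamedTuple, Optional

# Matches the retry lines shown above and captures the address that timed out.
RETRY_RE = re.compile(
    r"Failed to get nodes, retrying after error: .*dial tcp (\S+): i/o timeout"
)


class KindnetSummary(NamedTuple):
    retries: int           # number of "Failed to get nodes" lines
    panicked: bool         # True if a "panic:" line was seen
    target: Optional[str]  # address that timed out, e.g. "10.96.0.1:443"


def summarize_kindnet_log(log_text: str) -> KindnetSummary:
    """Count retry lines and detect the final panic in a kindnetd container log."""
    retries = 0
    panicked = False
    target = None
    for line in log_text.splitlines():
        match = RETRY_RE.search(line)
        if match:
            retries += 1
            target = match.group(1)
        elif " panic: " in line:
            panicked = True
    return KindnetSummary(retries, panicked, target)
```

For the log above, such a summary would make it obvious that every attempt to reach the same address timed out before the process gave up.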