GloriousFlywheel

Troubleshooting

Common issues with the runner infrastructure and how to resolve them.

Runner Not Registering

Symptom: Runner pod starts but does not appear in the GitLab group runner list.

Causes and fixes:

Pods Crashing (OOMKilled)

Symptom: Runner pods restart repeatedly. kubectl describe pod shows OOMKilled as the termination reason.

Fix: Increase the memory limit for the affected runner type in organization.yaml and run tofu apply. See HPA Tuning for resource limit configuration.

Common memory-hungry workloads:

Cache Misses on Nix Runner

Symptom: Nix builds download or compile everything from scratch despite previous builds having populated the cache.

Causes and fixes:

TOML Configuration Gotchas

The GitLab Runner TOML configuration has several pitfalls in Runner 17.x:

Runner Pods Pending

Symptom: Pods stay in Pending state and are not scheduled.

Causes and fixes: