Error Medic

Troubleshooting GitLab CI: Fixing Timeout, Permission Denied, and Stuck Pipeline Errors

Comprehensive guide to resolving GitLab CI job timeouts, 'permission denied' deployment errors, and stuck pipelines. Learn advanced runner debugging techniques.

Key Takeaways
  • GitLab CI job timeouts are usually caused by the default 60-minute project limit or runner-specific hard limits overriding project settings.
  • 'Permission denied' errors frequently stem from missing execution bits (chmod +x) on scripts, malformed SSH private keys in CI/CD variables, or insufficient Docker socket privileges in DinD setups.
  • Pipelines 'not working' or stuck in pending status almost always indicate a tag mismatch between the `.gitlab-ci.yml` job and registered runners, or runners dropping offline.
  • Enabling CI_DEBUG_TRACE is the fastest way to expose underlying permission issues before the job fully fails.
Fix Approaches Compared
| Method | When to Use | Time | Risk |
| --- | --- | --- | --- |
| Increase Project Timeout | Jobs legitimately take longer than 60m (e.g., heavy e2e tests, ML model builds) | 2 mins | Low |
| Deploy Specific Runners | Shared runners consistently time out or lack necessary compute/memory resources | 30 mins | Medium |
| Inject SSH Keys via ssh-agent | Git clone of private submodules or rsync deployments fail with 'publickey' errors | 10 mins | Low |
| Run GitLab Runner in Privileged Mode | Docker-in-Docker (DinD) builds fail with docker.sock permission denied | 15 mins | High |

Understanding the Errors

GitLab CI is a robust continuous integration tool, but complex deployment pipelines often run into execution limits, runner configuration mismatches, and access control roadblocks. When your pipeline halts, you are typically dealing with one of three primary symptoms: a GitLab CI timeout, a Permission denied error during execution or cloning, or a completely stalled pipeline where GitLab CI is not working at all.

This guide breaks down each error state, providing architectural context and exact technical steps to restore green builds.


Symptom 1: GitLab CI Timeout

The Error: ERROR: Job failed: execution took longer than 1h0m0s or ERROR: Job failed: execution took longer than 10m0s

The Context: Timeouts in GitLab CI occur at three distinct levels, and a misconfiguration in any of them will forcefully terminate your job. GitLab enforces timeouts to prevent runaway processes from consuming infinite compute hours, especially on shared SaaS runners.

  1. Project-Level Timeout: The default is 60 minutes. If your e2e test suite or docker image build takes 65 minutes, it will be killed.
  2. Runner-Level Timeout: The administrator of a specific runner can set a maximum job timeout. If the runner limit is 10 minutes, but your project limit is 60 minutes, the job will still fail at 10 minutes. The runner limit strictly overrides the project limit if the runner limit is lower.
  3. Job-Level Timeout: Defined directly in the .gitlab-ci.yml file using the timeout keyword.

Step 1: Diagnose the Timeout Layer

First, verify exactly how long the job ran before failing. If it failed at exactly 60 minutes, it's almost certainly the project default. If it failed at a seemingly random round number like 10 or 30 minutes, suspect the runner configuration.

Step 2: Fix the Project Timeout

If you have maintainer access to the repository:

  1. Navigate to your project in GitLab.
  2. Go to Settings > CI/CD > General pipelines.
  3. Scroll down to Timeout.
  4. Change the value from 60 (or whatever the current limit is) to a value that accommodates your longest job, plus a 20% buffer (e.g., 90 or 120).
  5. Save changes.

Step 3: Fix Job-Level Overrides

Sometimes, you only want one massive job to have a long timeout so you don't risk blocking runners for hours on simple linting jobs. Edit your .gitlab-ci.yml:

heavy_integration_test:
  stage: test
  script:
    - make test-all
  timeout: 3 hours 30 minutes
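If most jobs in the pipeline need a longer limit, the timeout can also be set once under the default: keyword rather than repeated per job (supported in recent GitLab versions; the job name below is illustrative):

```yaml
# Applies to every job that doesn't set its own timeout
default:
  timeout: 2 hours

quick_lint:
  stage: test
  script:
    - make lint
  timeout: 10 minutes   # a per-job value still overrides the default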

Step 4: Check Runner Constraints

If you increased the project timeout but the job still fails early, you are hitting the runner's hard limit. Note that the runner's /etc/gitlab-runner/config.toml does not contain a job timeout key; settings like output_limit control other limits entirely:

[[runners]]
  name = "heavy-lifter"
  url = "https://gitlab.com/"
  token = "YOUR_TOKEN"
  executor = "docker"
  # output_limit caps the job log size in KB -- it does NOT affect the timeout
  output_limit = 4096

The runner's maximum job timeout is instead set in the GitLab UI: Admin Area -> CI/CD -> Runners -> [Edit Runner] -> Maximum job timeout. If that value is lower than your project timeout, the runner limit wins.


Symptom 2: GitLab CI Permission Denied

The Error Variations:

  • bash: line 14: ./deploy.sh: Permission denied
  • Permission denied (publickey,keyboard-interactive)
  • Got permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock

The Context: Permissions issues manifest differently depending on whether you are executing a local script, pulling code via SSH, or interacting with a privileged daemon like Docker.

Scenario A: Script Execution Denial

If you see ./script.sh: Permission denied, the script was committed without its executable bit set. Git does track the executable bit, but Windows filesystems don't expose POSIX permissions, so Windows users frequently commit shell scripts as non-executable.

The Fix: Do not just run chmod +x script.sh in the CI pipeline (though that works as a band-aid). Fix it at the Git level so it persists:

git update-index --chmod=+x script.sh
git commit -m "Make script.sh executable"
git push
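To catch this class of error before it reaches CI, you can audit the mode bits Git has recorded for your scripts (a quick check assuming a POSIX shell; 100644 means no executable bit, 100755 means executable):

```shell
# List tracked shell scripts with the mode Git recorded for them
git ls-files -s -- '*.sh'

# Print only the scripts that are missing the executable bit
git ls-files -s -- '*.sh' | awk '$1 == "100644" {print $4 " is missing +x"}'
```

Run this from the repository root; any file it flags will fail with Permission denied when executed directly in a pipeline.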

Scenario B: SSH Key Access Errors

If your pipeline fails while running npm install for a private git package, cloning a submodule, or rsync-ing to a remote server, it lacks the proper SSH keys.

The Fix: You must inject the SSH key securely using GitLab CI/CD variables and ssh-agent.

  1. Go to Settings > CI/CD > Variables.
  2. Add a variable named SSH_PRIVATE_KEY. Paste the exact contents of your id_rsa or id_ed25519 key. Crucial: Ensure there is a trailing newline at the end of the key block in the variable text box, or ssh-add will silently fail.
  3. Update your .gitlab-ci.yml before_script:
before_script:
  - 'command -v ssh-agent >/dev/null || ( apt-get update -y && apt-get install openssh-client -y )'
  - eval $(ssh-agent -s)
  # Properly inject the key, handling line endings
  - echo "$SSH_PRIVATE_KEY" | tr -d '\r' | ssh-add -
  - mkdir -p ~/.ssh
  - chmod 700 ~/.ssh
  # Disable StrictHostKeyChecking for the pipeline environment
  - echo -e "Host *\n\tStrictHostKeyChecking no\n\n" > ~/.ssh/config
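Disabling StrictHostKeyChecking is convenient but accepts any host key, which opens the door to man-in-the-middle attacks. A slightly safer variant pins the server's key with ssh-keyscan instead; swap the last line of the before_script above for these (replace gitlab.com with whatever host you actually connect to):

```yaml
before_script:
  # ...ssh-agent setup as above, then pin the host key instead:
  - mkdir -p ~/.ssh
  - chmod 700 ~/.ssh
  - ssh-keyscan gitlab.com >> ~/.ssh/known_hosts
  - chmod 644 ~/.ssh/known_hosts
```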

Scenario C: Docker Socket Permissions

When running Docker-in-Docker (DinD) to build container images within a CI pipeline, you might be denied access to /var/run/docker.sock.

The Fix: If using a self-hosted runner, the runner executor must be configured in privileged mode to spawn inner Docker containers. Edit /etc/gitlab-runner/config.toml:

[runners.docker]
  tls_verify = false
  image = "docker:20.10.16"
  privileged = true     # THIS IS THE CRITICAL FIX
  disable_entrypoint_overwrite = false
  oom_kill_disable = false
  disable_cache = false
  volumes = ["/certs/client", "/cache"]

Restart the runner: sudo gitlab-runner restart.
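On the pipeline side, a minimal DinD build job typically pairs the docker image with a dind service. The image tags and variable below are common conventions rather than requirements; adjust versions to match your runner:

```yaml
build_image:
  stage: build
  image: docker:24.0
  services:
    - docker:24.0-dind
  variables:
    # Matches the /certs/client volume mounted in config.toml above
    DOCKER_TLS_CERTDIR: "/certs"
  script:
    - docker build -t "$CI_REGISTRY_IMAGE:$CI_COMMIT_SHORT_SHA" .
```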


Symptom 3: GitLab CI Not Working (Pending or Stuck)

The Error: This job is stuck because you don't have any active runners online with any of these tags assigned to them...

The Context: When a pipeline shows as "Pending" indefinitely, or seems to "not work" without producing logs, the job has not actually reached a runner. The GitLab coordinator is waiting for an available runner that matches the job's requirements to poll for work.

Step 1: Verify Tags

GitLab routes jobs to runners based on tags. If your .gitlab-ci.yml specifies a tag that no active runner possesses, the job will hang forever.

build_app:
  stage: build
  tags:
    - aws-linux-heavy   # Does a runner with this exact tag exist?
  script:
    - make all

Check Settings > CI/CD > Runners and ensure a runner with a green circle (online) has the tag aws-linux-heavy. If you intend to use shared runners, remove the tags: block from your YAML entirely so it can execute on any generic available runner.

Step 2: Check Runner Registration Status

If you host your own runners, log into the server hosting the runner and verify its connection to the GitLab instance:

sudo gitlab-runner verify

Output should look like:

Runtime platform arch=amd64 os=linux pid=1409 revision=... Verifying runner... is alive runner=xyz123

If it says is removed, the runner token was revoked or deleted from the GitLab UI. You must re-register the runner using sudo gitlab-runner register.

Step 3: Concurrency Limits

If your pipelines only "stop working" during busy hours, you are likely hitting concurrency limits. A runner will only execute a fixed number of jobs simultaneously. In /etc/gitlab-runner/config.toml, look at the very top line:

concurrent = 1

If it is set to 1 and job A is running, job B will be stuck in "Pending" until job A finishes. Increase this number (e.g., concurrent = 10) based on the CPU and RAM available on your runner host machine.
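A minimal sketch of the two relevant config.toml knobs (the values are examples; tune them to your host's resources):

```toml
# Global cap: total jobs this runner process will execute at once
concurrent = 4

[[runners]]
  name = "heavy-lifter"
  # Per-runner cap: this runner entry takes at most 2 of those 4 slots
  limit = 2
```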

Advanced Debugging: Unmasking Hidden Failures

When standard logs fail to explain why a job is timing out or failing with cryptic permissions errors, enable highly verbose logging.

Add this to your CI/CD Variables (or directly in the .gitlab-ci.yml under variables:):

CI_DEBUG_TRACE: "true"

This will expose the raw shell execution, variable expansion, and exact exit codes of every background command the GitLab Runner executes before and after your defined script block. Warning: This can expose masked secrets in logs, so use it only temporarily for debugging private repositories, and clear the logs afterward.
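To limit how long the verbose output (and any exposed secrets) lingers, it is safer to scope the flag to a single job in .gitlab-ci.yml rather than setting it project-wide (the job below is illustrative):

```yaml
flaky_deploy:
  stage: deploy
  variables:
    CI_DEBUG_TRACE: "true"   # verbose shell tracing for this job only
  script:
    - ./deploy.sh
```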

Quick Reference Commands

# --- Diagnostic & Fix Commands for GitLab Runner ---

# 1. Verify runner connectivity and status
sudo gitlab-runner verify

# 2. Check runner logs for hidden daemon errors
sudo journalctl -u gitlab-runner -f

# 3. Fix script execution permissions permanently in Git
git update-index --chmod=+x build.sh
git commit -m "chore: add execution permissions to build script"

# 4. Standard boilerplate to securely inject SSH keys in before_script
eval $(ssh-agent -s)
echo "$SSH_PRIVATE_KEY" | tr -d '\r' | ssh-add -
mkdir -p ~/.ssh
chmod 700 ~/.ssh
ssh-keyscan gitlab.com >> ~/.ssh/known_hosts
chmod 644 ~/.ssh/known_hosts

# 5. Fix Docker-in-Docker permissions (run on runner host)
# Ensure gitlab-runner user is part of the docker group
sudo usermod -aG docker gitlab-runner
sudo systemctl restart gitlab-runner

Error Medic Editorial

Error Medic Editorial is a collective of senior DevOps and SRE professionals dedicated to demystifying CI/CD pipelines, cloud infrastructure, and modern deployment architectures.
