debian-mirror-gitlab/doc/ci/troubleshooting.md

393 lines
18 KiB
Markdown
Raw Normal View History

2020-07-28 23:09:34 +05:30
---
stage: Verify
2021-09-04 01:27:46 +05:30
group: Pipeline Execution
2021-02-22 17:27:13 +05:30
info: To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#assignments
2020-07-28 23:09:34 +05:30
type: reference
---
2021-09-30 23:02:18 +05:30
# Troubleshooting CI/CD **(FREE)**
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
GitLab provides several tools to help make troubleshooting your pipelines easier.
2020-10-24 23:57:45 +05:30
2020-11-24 15:15:51 +05:30
This guide also lists common issues and possible solutions.
2020-10-24 23:57:45 +05:30
2020-11-24 15:15:51 +05:30
## Verify syntax
2020-10-24 23:57:45 +05:30
2020-11-24 15:15:51 +05:30
An early source of problems can be incorrect syntax. The pipeline shows a `yaml invalid`
badge and does not start running if any syntax or formatting problems are found.
2020-10-24 23:57:45 +05:30
2022-07-16 23:28:13 +05:30
### Edit `.gitlab-ci.yml` with the pipeline editor
2020-10-24 23:57:45 +05:30
2021-12-11 22:18:48 +05:30
The [pipeline editor](pipeline_editor/index.md) is the recommended editing
experience (rather than the single file editor or the Web IDE). It includes:
- Code completion suggestions that ensure you are only using accepted keywords.
- Automatic syntax highlighting and validation.
- The [CI/CD configuration visualization](pipeline_editor/index.md#visualize-ci-configuration),
a graphical representation of your `.gitlab-ci.yml` file.
2020-11-24 15:15:51 +05:30
2022-07-23 23:45:48 +05:30
### Edit `.gitlab-ci.yml` locally
If you prefer to edit your pipeline configuration locally, you can use the
GitLab CI/CD schema in your editor to verify basic syntax issues. Any
[editor with Schemastore support](https://www.schemastore.org/json/#editors) uses
the GitLab CI/CD schema by default.
If you need to link to the schema directly, it
is at:
```plaintext
https://gitlab.com/gitlab-org/gitlab/-/blob/master/app/assets/javascripts/editor/schema/ci.json.
```
The schema rules for custom YAML tags like `!reference` will not work until the
custom tags are set in the editor settings. For example, in VS Code, you can set
`vscode-yaml` to parse `customTags`:
```json
"yaml.customTags": [
"!reference sequence"
]
```
To see the full list of custom tags covered by the CI/CD schema, check the
latest version of the schema, linked above.
2020-11-24 15:15:51 +05:30
### Verify syntax with CI Lint tool
The [CI Lint tool](lint.md) is a simple way to ensure the syntax of a CI/CD configuration
2021-11-11 11:23:49 +05:30
file is correct. Paste in full `.gitlab-ci.yml` files or individual jobs configuration,
2020-11-24 15:15:51 +05:30
to verify the basic syntax.
When a `.gitlab-ci.yml` file is present in a project, you can also use the CI Lint
2022-04-04 11:22:00 +05:30
tool to [simulate the creation of a full pipeline](lint.md#simulate-a-pipeline).
2020-11-24 15:15:51 +05:30
It does deeper verification of the configuration syntax.
## Verify variables
A key part of troubleshooting CI/CD is to verify which variables are present in a
pipeline, and what their values are. A lot of pipeline configuration is dependent
on variables, and verifying them is one of the fastest ways to find the source of
a problem.
2021-09-30 23:02:18 +05:30
[Export the full list of variables](variables/index.md#list-all-environment-variables)
2020-11-24 15:15:51 +05:30
available in each problematic job. Check if the variables you expect are present,
and check if their values are what you expect.
## GitLab CI/CD documentation
2021-11-11 11:23:49 +05:30
The [complete `.gitlab-ci.yml` reference](yaml/index.md) contains a full list of
2021-11-18 22:05:49 +05:30
every keyword you can use to configure your pipelines.
2020-11-24 15:15:51 +05:30
2021-09-30 23:02:18 +05:30
You can also look at a large number of pipeline configuration [examples](examples/index.md)
and [templates](examples/index.md#cicd-templates).
2020-11-24 15:15:51 +05:30
### Documentation for pipeline types
Some pipeline types have their own detailed usage guides that you should read
if you are using that type:
2021-09-30 23:02:18 +05:30
- [Multi-project pipelines](pipelines/multi_project_pipelines.md): Have your pipeline trigger
2020-11-24 15:15:51 +05:30
a pipeline in a different project.
2021-09-30 23:02:18 +05:30
- [Parent/child pipelines](pipelines/parent_child_pipelines.md): Have your main pipeline trigger
2020-11-24 15:15:51 +05:30
and run separate pipelines in the same project. You can also
2021-09-30 23:02:18 +05:30
[dynamically generate the child pipeline's configuration](pipelines/parent_child_pipelines.md#dynamic-child-pipelines)
2020-11-24 15:15:51 +05:30
at runtime.
2022-04-04 11:22:00 +05:30
- [Merge request pipelines](pipelines/merge_request_pipelines.md): Run a pipeline
2020-11-24 15:15:51 +05:30
in the context of a merge request.
2022-04-04 11:22:00 +05:30
- [Merged results pipelines](pipelines/merged_results_pipelines.md):
Merge request pipelines that run on the combined source and target branch
- [Merge trains](pipelines/merge_trains.md):
Multiple merged results pipelines that queue and run automatically before
2020-11-24 15:15:51 +05:30
changes are merged.
### Troubleshooting Guides for CI/CD features
2021-11-18 22:05:49 +05:30
Troubleshooting guides are available for some CI/CD features and related topics:
2020-11-24 15:15:51 +05:30
- [Container Registry](../user/packages/container_registry/index.md#troubleshooting-the-gitlab-container-registry)
- [GitLab Runner](https://docs.gitlab.com/runner/faq/)
2021-09-30 23:02:18 +05:30
- [Merge Trains](pipelines/merge_trains.md#troubleshooting)
2020-11-24 15:15:51 +05:30
- [Docker Build](docker/using_docker_build.md#troubleshooting)
- [Environments](environments/deployment_safety.md#ensure-only-one-deployment-job-runs-at-a-time)
## Common CI/CD issues
A lot of common pipeline issues can be fixed by analyzing the behavior of the `rules`
or `only/except` configuration. You shouldn't use these two configurations in the same
pipeline, as they behave differently. It's hard to predict how a pipeline runs with
this mixed behavior.
If your `rules` or `only/except` configuration makes use of [predefined variables](variables/predefined_variables.md)
like `CI_PIPELINE_SOURCE`, `CI_MERGE_REQUEST_ID`, you should [verify them](#verify-variables)
as the first troubleshooting step.
### Jobs or pipelines don't run when expected
The `rules` or `only/except` keywords are what determine whether or not a job is
added to a pipeline. If a pipeline runs, but a job is not added to the pipeline,
it's usually due to `rules` or `only/except` configuration issues.
If a pipeline does not seem to run at all, with no error message, it may also be
due to `rules` or `only/except` configuration, or the `workflow: rules` keyword.
If you are converting from `only/except` to the `rules` keyword, you should check
2021-09-30 23:02:18 +05:30
the [`rules` configuration details](yaml/index.md#rules) carefully. The behavior
2020-11-24 15:15:51 +05:30
of `only/except` and `rules` is different and can cause unexpected behavior when migrating
between the two.
2021-09-04 01:27:46 +05:30
The [common `if` clauses for `rules`](jobs/job_control.md#common-if-clauses-for-rules)
2020-11-24 15:15:51 +05:30
can be very helpful for examples of how to write rules that behave the way you expect.
#### Two pipelines run at the same time
Two pipelines can run when pushing a commit to a branch that has an open merge request
associated with it. Usually one pipeline is a merge request pipeline, and the other
is a branch pipeline.
2020-10-24 23:57:45 +05:30
2021-11-18 22:05:49 +05:30
This situation is usually caused by the `rules` configuration, and there are several ways to
2021-09-04 01:27:46 +05:30
[prevent duplicate pipelines](jobs/job_control.md#avoid-duplicate-pipelines).
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
#### A job is not in the pipeline
2020-07-28 23:09:34 +05:30
2021-09-30 23:02:18 +05:30
GitLab determines if a job is added to a pipeline based on the [`only/except`](yaml/index.md#only--except)
or [`rules`](yaml/index.md#rules) defined for the job. If it didn't run, it's probably
2020-11-24 15:15:51 +05:30
not evaluating as you expect.
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
#### No pipeline or the wrong type of pipeline runs
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
Before a pipeline can run, GitLab evaluates all the jobs in the configuration and tries
to add them to all available pipeline types. A pipeline does not run if no jobs are added
to it at the end of the evaluation.
If a pipeline did not run, it's likely that all the jobs had `rules` or `only/except` that
blocked them from being added to the pipeline.
If the wrong pipeline type ran, then the `rules` or `only/except` configuration should
be checked to make sure the jobs are added to the correct pipeline type. For
example, if a merge request pipeline did not run, the jobs may have been added to
a branch pipeline instead.
2021-09-30 23:02:18 +05:30
It's also possible that your [`workflow: rules`](yaml/index.md#workflow) configuration
2020-11-24 15:15:51 +05:30
blocked the pipeline, or allowed the wrong pipeline type.
### A job runs unexpectedly
A common reason a job is added to a pipeline unexpectedly is because the `changes`
keyword always evaluates to true in certain cases. For example, `changes` is always
true in certain pipeline types, including scheduled pipelines and pipelines for tags.
2021-09-30 23:02:18 +05:30
The `changes` keyword is used in combination with [`only/except`](yaml/index.md#onlychanges--exceptchanges)
or [`rules`](yaml/index.md#ruleschanges)). It's recommended to use `changes` with
2020-11-24 15:15:51 +05:30
`rules` or `only/except` configuration that ensures the job is only added to branch
pipelines or merge request pipelines.
### "fatal: reference is not a tree" error
> [Introduced](https://gitlab.com/gitlab-org/gitlab/-/merge_requests/17043) in GitLab 12.4.
Previously, you'd have encountered unexpected pipeline failures when you force-pushed
a branch to its remote repository. To illustrate the problem, suppose you've had the current workflow:
1. A user creates a feature branch named `example` and pushes it to a remote repository.
1. A new pipeline starts running on the `example` branch.
2021-04-17 20:07:23 +05:30
1. A user rebases the `example` branch on the latest default branch and force-pushes it to its remote repository.
2020-11-24 15:15:51 +05:30
1. A new pipeline starts running on the `example` branch again, however,
the previous pipeline (2) fails because of `fatal: reference is not a tree:` error.
2021-11-18 22:05:49 +05:30
This occurs because the previous pipeline cannot find a checkout-SHA (which is associated with the pipeline record)
2020-11-24 15:15:51 +05:30
from the `example` branch that the commit history has already been overwritten by the force-push.
2022-04-04 11:22:00 +05:30
Similarly, [Merged results pipelines](pipelines/merged_results_pipelines.md)
might have failed intermittently due to [the same reason](pipelines/merged_results_pipelines.md#pipelines-fail-intermittently-with-a-fatal-reference-is-not-a-tree-error).
2020-11-24 15:15:51 +05:30
As of GitLab 12.4, we've improved this behavior by persisting pipeline refs exclusively.
To illustrate its life cycle:
1. A pipeline is created on a feature branch named `example`.
1. A persistent pipeline ref is created at `refs/pipelines/<pipeline-id>`,
which retains the checkout-SHA of the associated pipeline record.
This persistent ref stays intact during the pipeline execution,
even if the commit history of the `example` branch has been overwritten by force-push.
1. The runner fetches the persistent pipeline ref and gets source code from the checkout-SHA.
1. When the pipeline finishes, its persistent ref is cleaned up in a background process.
### Merge request pipeline messages
The merge request pipeline widget shows information about the pipeline status in
a merge request. It's displayed above the [ability to merge status widget](#merge-request-status-messages).
2022-04-04 11:22:00 +05:30
#### "Checking ability to merge automatically" message
There is a [known issue](https://gitlab.com/gitlab-org/gitlab/-/issues/229352)
where a merge request can be stuck with the `Checking ability to merge automatically`
message.
If your merge request has this message and it does not disappear after a few minutes,
you can try one of these workarounds:
- Refresh the merge request page.
- Close & Re-open the merge request.
- Rebase the merge request with the `/rebase` [quick action](../user/project/quick_actions.md).
- If you have already confirmed the merge request is ready to be merged, you can merge
it with the `/merge` quick action.
2020-11-24 15:15:51 +05:30
#### "Checking pipeline status" message
This message is shown when the merge request has no pipeline associated with the
latest commit yet. This might be because:
2020-07-28 23:09:34 +05:30
- GitLab hasn't finished creating the pipeline yet.
- You are using an external CI service and GitLab hasn't heard back from the service yet.
- You are not using CI/CD pipelines in your project.
2021-02-22 17:27:13 +05:30
- You are using CI/CD pipelines in your project, but your configuration prevented a pipeline from running on the source branch for your merge request.
2020-10-24 23:57:45 +05:30
- The latest pipeline was deleted (this is a [known issue](https://gitlab.com/gitlab-org/gitlab/-/issues/214323)).
2021-11-18 22:05:49 +05:30
- The source branch of the merge request is on a private fork.
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
After the pipeline is created, the message updates with the pipeline status.
### Merge request status messages
The merge request status widget shows the **Merge** button and whether or not a merge
request is ready to merge. If the merge request can't be merged, the reason for this
is displayed.
If the pipeline is still running, the **Merge** button is replaced with the
**Merge when pipeline succeeds** button.
2020-07-28 23:09:34 +05:30
2021-09-30 23:02:18 +05:30
If [**Merge Trains**](pipelines/merge_trains.md)
2020-11-24 15:15:51 +05:30
are enabled, the button is either **Add to merge train** or **Add to merge train when pipeline succeeds**. **(PREMIUM)**
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
#### "A CI/CD pipeline must run and be successful before merge" message
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
This message is shown if the [Pipelines must succeed](../user/project/merge_requests/merge_when_pipeline_succeeds.md#only-allow-merge-requests-to-be-merged-if-the-pipeline-succeeds)
setting is enabled in the project and a pipeline has not yet run successfully.
This also applies if the pipeline has not been created yet, or if you are waiting
for an external CI service. If you don't use pipelines for your project, then you
should disable **Pipelines must succeed** so you can accept merge requests.
2022-04-04 11:22:00 +05:30
#### "Merge blocked: pipeline must succeed. Push a new commit that fixes the failure" message
2021-03-11 19:13:27 +05:30
2021-09-30 23:02:18 +05:30
This message is shown if the [merge request pipeline](pipelines/merge_request_pipelines.md),
2022-04-04 11:22:00 +05:30
[merged results pipeline](pipelines/merged_results_pipelines.md),
2021-09-30 23:02:18 +05:30
or [merge train pipeline](pipelines/merge_trains.md)
2021-03-11 19:13:27 +05:30
has failed or been canceled.
If a merge request pipeline or merged result pipeline was canceled or failed, you can:
- Re-run the entire pipeline by clicking **Run pipeline** in the pipeline tab in the merge request.
- [Retry only the jobs that failed](pipelines/index.md#view-pipelines). If you re-run the entire pipeline, this is not necessary.
- Push a new commit to fix the failure.
If the merge train pipeline has failed, you can:
- Check the failure and determine if you can use the [`/merge` quick action](../user/project/quick_actions.md) to immediately add the merge request to the train again.
- Re-run the entire pipeline by clicking **Run pipeline** in the pipeline tab in the merge request, then add the merge request to the train again.
- Push a commit to fix the failure, then add the merge request to the train again.
If the merge train pipeline was canceled before the merge request was merged, without a failure, you can:
- Add it to the train again.
2021-12-11 22:18:48 +05:30
### Project `group/project` not found or access denied
This message is shown if configuration is added with [`include`](yaml/index.md#include) and one of the following:
- The configuration refers to a project that can't be found.
- The user that is running the pipeline is unable to access any included projects.
To resolve this, check that:
- The path of the project is in the format `my-group/my-project` and does not include
any folders in the repository.
- The user running the pipeline is a [member of the projects](../user/project/members/index.md#add-users-to-a-project)
that contain the included files. Users must also have the [permission](../user/permissions.md#job-permissions)
to run CI/CD jobs in the same projects.
### "The parsed YAML is too big" message
This message displays when the YAML configuration is too large or nested too deeply.
YAML files with a large number of includes, and thousands of lines overall, are
more likely to hit this memory limit. For example, a YAML file that is 200kb is
likely to hit the default memory limit.
To reduce the configuration size, you can:
- Check the length of the expanded CI/CD configuration in the pipeline editor's
[merged YAML](pipeline_editor/index.md#view-expanded-configuration) tab. Look for
duplicated configuration that can be removed or simplified.
- Move long or repeated `script` sections into standalone scripts in the project.
- Use [parent and child pipelines](pipelines/parent_child_pipelines.md) to move some
work to jobs in an independent child pipeline.
On a self-managed instance, you can [increase the size limits](../administration/instance_limits.md#maximum-size-and-depth-of-cicd-configuration-yaml-files).
2020-11-24 15:15:51 +05:30
## Pipeline warnings
Pipeline configuration warnings are shown when you:
2021-09-30 23:02:18 +05:30
- [Validate configuration with the CI Lint tool](yaml/index.md).
2020-11-24 15:15:51 +05:30
- [Manually run a pipeline](pipelines/index.md#run-a-pipeline-manually).
### "Job may allow multiple pipelines to run for a single action" warning
2022-01-26 12:08:38 +05:30
When you use [`rules`](yaml/index.md#rules) with a `when` clause without an `if`
2020-11-24 15:15:51 +05:30
clause, multiple pipelines may run. Usually this occurs when you push a commit to
a branch that has an open merge request associated with it.
2021-09-04 01:27:46 +05:30
To [prevent duplicate pipelines](jobs/job_control.md#avoid-duplicate-pipelines), use
2021-09-30 23:02:18 +05:30
[`workflow: rules`](yaml/index.md#workflow) or rewrite your rules to control
2020-11-24 15:15:51 +05:30
which pipelines can run.
2020-07-28 23:09:34 +05:30
2021-11-18 22:05:49 +05:30
### Console workaround if job using resource_group gets stuck **(FREE SELF)**
2021-03-11 19:13:27 +05:30
```ruby
# find resource group by name
resource_group = Project.find_by_full_path('...').resource_groups.find_by(key: 'the-group-name')
busy_resources = resource_group.resources.where('build_id IS NOT NULL')
2021-06-08 01:23:25 +05:30
# identify which builds are occupying the resource
2021-03-11 19:13:27 +05:30
# (I think it should be 1 as of today)
busy_resources.pluck(:build_id)
# it's good to check why this build is holding the resource.
# Is it stuck? Has it been forcefully dropped by the system?
# free up busy resources
busy_resources.update_all(build_id: nil)
```
2022-07-16 23:28:13 +05:30
### Job log slow to update
When you visit the job log page for a running job, there could be a delay of up to
60 seconds before the log updates. The default refresh time is 60 seconds, but after
the log is viewed in the UI, the following log updates should occur every 3 seconds.
2022-08-13 15:12:31 +05:30
## Disaster recovery
You can disable some important but computationally expensive parts of the application
to relieve stress on the database during ongoing downtime.
### Disable fair scheduling on shared runners
When clearing a large backlog of jobs, you can temporarily enable the `ci_queueing_disaster_recovery_disable_fair_scheduling`
[feature flag](../administration/feature_flags.md). This flag disables fair scheduling
on shared runners, which reduces system resource usage on the `jobs/request` endpoint.
When enabled, jobs are processed in the order they were put in the system, instead of
balanced across many projects.
### Disable CI/CD minutes quota enforcement
To disable the enforcement of CI/CD minutes quotas on shared runners, you can temporarily
enable the `ci_queueing_disaster_recovery_disable_quota` [feature flag](../administration/feature_flags.md).
This flag reduces system resource usage on the `jobs/request` endpoint.
When enabled, jobs created in the last hour can run in projects which are out of quota.
Earlier jobs are already canceled by a periodic background worker (`StuckCiJobsWorker`).
2020-11-24 15:15:51 +05:30
## How to get help
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
If you are unable to resolve pipeline issues, you can get help from:
2020-07-28 23:09:34 +05:30
2020-11-24 15:15:51 +05:30
- The [GitLab community forum](https://forum.gitlab.com/)
- GitLab [Support](https://about.gitlab.com/support/)