2019-10-12 21:52:04 +05:30
|
|
|
---
|
|
|
|
type: reference
|
|
|
|
---
|
|
|
|
|
|
|
|
# Directed Acyclic Graph
|
|
|
|
|
2020-04-22 19:07:51 +05:30
|
|
|
> [Introduced](https://gitlab.com/gitlab-org/gitlab-foss/issues/47063) in GitLab 12.2.
|
2019-10-12 21:52:04 +05:30
|
|
|
|
|
|
|
A [directed acyclic graph](https://www.techopedia.com/definition/5739/directed-acyclic-graph-dag) can be
|
|
|
|
used in the context of a CI/CD pipeline to build relationships between jobs such that
|
|
|
|
execution is performed in the quickest possible manner, regardless how stages may
|
|
|
|
be set up.
|
|
|
|
|
|
|
|
For example, you may have a specific tool or separate website that is built
|
|
|
|
as part of your main project. Using a DAG, you can specify the relationship between
|
|
|
|
these jobs and GitLab will then execute the jobs as soon as possible instead of waiting
|
|
|
|
for each stage to complete.
|
|
|
|
|
|
|
|
Unlike other DAG solutions for CI/CD, GitLab does not require you to choose one or the
|
|
|
|
other. You can implement a hybrid combination of DAG and traditional
|
|
|
|
stage-based operation within a single pipeline. Configuration is kept very simple,
|
|
|
|
requiring a single keyword to enable the feature for any job.
|
|
|
|
|
|
|
|
Consider a monorepo as follows:
|
|
|
|
|
2020-03-13 15:44:24 +05:30
|
|
|
```plaintext
|
2019-10-12 21:52:04 +05:30
|
|
|
./service_a
|
|
|
|
./service_b
|
|
|
|
./service_c
|
|
|
|
./service_d
|
|
|
|
```
|
|
|
|
|
|
|
|
It has a pipeline that looks like the following:
|
|
|
|
|
|
|
|
| build | test | deploy |
|
|
|
|
| ----- | ---- | ------ |
|
|
|
|
| build_a | test_a | deploy_a |
|
|
|
|
| build_b | test_b | deploy_b |
|
2019-12-04 20:38:33 +05:30
|
|
|
| build_c | test_c | deploy_c |
|
2019-10-12 21:52:04 +05:30
|
|
|
| build_d | test_d | deploy_d |
|
|
|
|
|
|
|
|
Using a DAG, you can relate the `_a` jobs to each other separately from the `_b` jobs,
|
|
|
|
and even if service `a` takes a very long time to build, service `b` will not
|
|
|
|
wait for it and will finish as quickly as it can. In this very same pipeline, `_c` and
|
|
|
|
`_d` can be left alone and will run together in staged sequence just like any normal
|
|
|
|
GitLab pipeline.
|
|
|
|
|
|
|
|
## Use cases
|
|
|
|
|
|
|
|
A DAG can help solve several different kinds of relationships between jobs within
|
|
|
|
a CI/CD pipeline. Most typically this would cover when jobs need to fan in or out,
|
|
|
|
and/or merge back together (diamond dependencies). This can happen when you're
|
|
|
|
handling multi-platform builds or complex webs of dependencies as in something like
|
|
|
|
an operating system build or a complex deployment graph of independently deployable
|
|
|
|
but related microservices.
|
|
|
|
|
|
|
|
Additionally, a DAG can help with general speediness of pipelines and helping
|
|
|
|
to deliver fast feedback. By creating dependency relationships that don't unnecessarily
|
|
|
|
block each other, your pipelines will run as quickly as possible regardless of
|
|
|
|
pipeline stages, ensuring output (including errors) is available to developers
|
|
|
|
as quickly as possible.
|
|
|
|
|
|
|
|
## Usage
|
|
|
|
|
|
|
|
Relationships are defined between jobs using the [`needs:` keyword](../yaml/README.md#needs).
|
|
|
|
|
|
|
|
Note that `needs:` also works with the [parallel](../yaml/README.md#parallel) keyword,
|
2019-12-04 20:38:33 +05:30
|
|
|
giving you powerful options for parallelization within your pipeline.
|
2019-10-12 21:52:04 +05:30
|
|
|
|
|
|
|
## Limitations
|
|
|
|
|
|
|
|
A directed acyclic graph is a complicated feature, and as of the initial MVC there
|
|
|
|
are certain use cases that you may need to work around. For more information:
|
|
|
|
|
|
|
|
- [`needs` requirements and limitations](../yaml/README.md#requirements-and-limitations).
|
2019-12-21 20:55:43 +05:30
|
|
|
- Related epic [tracking planned improvements](https://gitlab.com/groups/gitlab-org/-/epics/1716).
|