debian-mirror-gitlab/doc/development/gitaly.md

368 lines
13 KiB
Markdown
Raw Normal View History

2020-07-28 23:09:34 +05:30
---
stage: Create
group: Gitaly
2021-02-22 17:27:13 +05:30
info: To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#assignments
2020-07-28 23:09:34 +05:30
type: reference
---
2018-03-17 18:26:18 +05:30
2020-07-28 23:09:34 +05:30
# Gitaly developers guide
[Gitaly](https://gitlab.com/gitlab-org/gitaly) is a high-level Git RPC service used by GitLab Rails,
Workhorse and GitLab Shell.
2018-03-17 18:26:18 +05:30
2019-09-30 21:07:59 +05:30
## Deep Dive
2020-05-24 23:13:21 +05:30
In May 2019, Bob Van Landuyt hosted a Deep Dive (GitLab team members only: `https://gitlab.com/gitlab-org/create-stage/issues/1`)
2021-02-22 17:27:13 +05:30
on the [Gitaly project](https://gitlab.com/gitlab-org/gitaly) and how to contribute to it as a
2020-04-22 19:07:51 +05:30
Ruby developer, to share his domain specific knowledge with anyone who may work in this part of the
2021-02-22 17:27:13 +05:30
codebase in the future.
2019-09-30 21:07:59 +05:30
2020-04-22 19:07:51 +05:30
You can find the [recording on YouTube](https://www.youtube.com/watch?v=BmlEWFS8ORo), and the slides
on [Google Slides](https://docs.google.com/presentation/d/1VgRbiYih9ODhcPnL8dS0W98EwFYpJ7GXMPpX-1TM6YE/edit)
and in [PDF](https://gitlab.com/gitlab-org/create-stage/uploads/a4fdb1026278bda5c1c5bb574379cf80/Create_Deep_Dive__Gitaly_for_Create_Ruby_Devs.pdf).
Everything covered in this deep dive was accurate as of GitLab 11.11, and while specific details may
have changed since then, it should still serve as a good introduction.
2019-09-30 21:07:59 +05:30
2019-07-07 11:18:12 +05:30
## Beginner's guide
2019-12-21 20:55:43 +05:30
Start by reading the Gitaly repository's
2021-03-08 18:12:59 +05:30
[Beginner's guide to Gitaly contributions](https://gitlab.com/gitlab-org/gitaly/-/blob/master/doc/beginners_guide.md).
It describes how to set up Gitaly, the various components of Gitaly and what
they do, and how to run its test suites.
2019-07-07 11:18:12 +05:30
2018-10-15 14:42:47 +05:30
## Developing new Git features
2019-07-07 11:18:12 +05:30
To read or write Git data, a request has to be made to Gitaly. This means that
if you're developing a new feature where you need data that's not yet available
in `lib/gitlab/git` changes have to be made to Gitaly.
2018-10-15 14:42:47 +05:30
2021-03-08 18:12:59 +05:30
There should be no new code that touches Git repositories via disk access (for example,
Rugged, `git`, `rm -rf`) anywhere in the `gitlab` repository. Anything that
needs direct access to the Git repository *must* be implemented in Gitaly, and
exposed via an RPC.
2018-10-15 14:42:47 +05:30
2021-03-08 18:12:59 +05:30
It's often easier to develop a new feature in Gitaly if you make the changes to
GitLab that will use the new feature in a separate merge request, to be merged
immediately after the Gitaly one. This allows you to test your changes before
they are merged.
2018-10-15 14:42:47 +05:30
2021-03-08 18:12:59 +05:30
- See [below](#running-tests-with-a-locally-modified-version-of-gitaly) for instructions on running GitLab tests with a modified version of Gitaly.
2018-10-15 14:42:47 +05:30
- In GDK run `gdk install` and restart `gdk run` (or `gdk run app`) to use a locally modified Gitaly version for development
2020-04-22 19:07:51 +05:30
### `gitaly-ruby`
2018-10-15 14:42:47 +05:30
It is possible to implement and test RPC's in Gitaly using Ruby code,
in
2019-12-21 20:55:43 +05:30
[`gitaly-ruby`](https://gitlab.com/gitlab-org/gitaly/tree/master/ruby).
2018-10-15 14:42:47 +05:30
This should make it easier to contribute for developers who are less
comfortable writing Go code.
There is documentation for this approach in [the Gitaly
2020-06-23 00:09:42 +05:30
repository](https://gitlab.com/gitlab-org/gitaly/blob/master/doc/ruby_endpoint.md).
2018-10-15 14:42:47 +05:30
2018-03-17 18:26:18 +05:30
## Gitaly-Related Test Failures
If your test-suite is failing with Gitaly issues, as a first step, try running:
```shell
rm -rf tmp/tests/gitaly
```
2021-02-22 17:27:13 +05:30
During RSpec tests, the Gitaly instance writes logs to `gitlab/log/gitaly-test.log`.
2019-07-31 22:56:46 +05:30
2019-05-03 19:53:19 +05:30
## Legacy Rugged code
While Gitaly can handle all Git access, many of GitLab customers still
run Gitaly atop NFS. The legacy Rugged implementation for Git calls may
be faster than the Gitaly RPC due to N+1 Gitaly calls and other
reasons. See [the
2020-06-23 00:09:42 +05:30
issue](https://gitlab.com/gitlab-org/gitlab-foss/-/issues/57317) for more
2019-05-03 19:53:19 +05:30
details.
Until GitLab has eliminated most of these inefficiencies or the use of
NFS is discontinued for Git data, Rugged implementations of some of the
most commonly-used RPCs can be enabled via feature flags:
2019-09-30 21:07:59 +05:30
- `rugged_find_commit`
- `rugged_get_tree_entries`
- `rugged_tree_entry`
- `rugged_commit_is_ancestor`
- `rugged_commit_tree_entry`
- `rugged_list_commits_by_oid`
2019-05-03 19:53:19 +05:30
A convenience Rake task can be used to enable or disable these flags
all together. To enable:
2020-03-13 15:44:24 +05:30
```shell
2019-05-03 19:53:19 +05:30
bundle exec rake gitlab:features:enable_rugged
```
To disable:
2020-03-13 15:44:24 +05:30
```shell
2019-05-03 19:53:19 +05:30
bundle exec rake gitlab:features:disable_rugged
```
Most of this code exists in the `lib/gitlab/git/rugged_impl` directory.
2021-02-22 17:27:13 +05:30
NOTE:
2020-07-28 23:09:34 +05:30
You should NOT need to add or modify code related to
2019-05-03 19:53:19 +05:30
Rugged unless explicitly discussed with the [Gitaly
2021-02-22 17:27:13 +05:30
Team](https://gitlab.com/groups/gl-gitaly/group_members). This code does
2019-05-03 19:53:19 +05:30
NOT work on GitLab.com or other GitLab instances that do not use NFS.
2018-03-17 18:26:18 +05:30
## `TooManyInvocationsError` errors
During development and testing, you may experience `Gitlab::GitalyClient::TooManyInvocationsError` failures.
2021-02-22 17:27:13 +05:30
The `GitalyClient` attempts to block against potential n+1 issues by raising this error
2018-03-17 18:26:18 +05:30
when Gitaly is called more than 30 times in a single Rails request or Sidekiq execution.
2021-02-22 17:27:13 +05:30
As a temporary measure, export `GITALY_DISABLE_REQUEST_LIMITS=1` to suppress the error. This disables the n+1 detection
2018-03-17 18:26:18 +05:30
in your development environment.
Please raise an issue in the GitLab CE or EE repositories to report the issue. Include the labels ~Gitaly
~performance ~"technical debt". Please ensure that the issue contains the full stack trace and error message of the
`TooManyInvocationsError`. Also include any known failing tests if possible.
2021-02-22 17:27:13 +05:30
Isolate the source of the n+1 problem. This is normally a loop that results in Gitaly being called for each
2018-03-17 18:26:18 +05:30
element in an array. If you are unable to isolate the problem, please contact a member
of the [Gitaly Team](https://gitlab.com/groups/gl-gitaly/group_members) for assistance.
Once the source has been found, wrap it in an `allow_n_plus_1_calls` block, as follows:
```ruby
# n+1: link to n+1 issue
Gitlab::GitalyClient.allow_n_plus_1_calls do
# original code
commits.each { |commit| ... }
end
```
2021-02-22 17:27:13 +05:30
Once the code is wrapped in this block, this code path is excluded from n+1 detection.
2018-03-17 18:26:18 +05:30
## Request counts
2019-12-04 20:38:33 +05:30
Commits and other Git data, is now fetched through Gitaly. These fetches can,
2018-03-17 18:26:18 +05:30
much like with a database, be batched. This improves performance for the client
and for Gitaly itself and therefore for the users too. To keep performance stable
and guard performance regressions, Gitaly calls can be counted and the call count
can be tested against. This requires the `:request_store` flag to be set.
```ruby
describe 'Gitaly Request count tests' do
context 'when the request store is activated', :request_store do
it 'correctly counts the gitaly requests made' do
expect { subject }.to change { Gitlab::GitalyClient.get_request_count }.by(10)
end
end
end
```
## Running tests with a locally modified version of Gitaly
2019-12-04 20:38:33 +05:30
Normally, GitLab CE/EE tests use a local clone of Gitaly in
2018-10-15 14:42:47 +05:30
`tmp/tests/gitaly` pinned at the version specified in
2019-12-26 22:10:19 +05:30
`GITALY_SERVER_VERSION`. The `GITALY_SERVER_VERSION` file supports also
2020-03-13 15:44:24 +05:30
branches and SHA to use a custom commit in <https://gitlab.com/gitlab-org/gitaly>.
2021-02-22 17:27:13 +05:30
NOTE:
2020-03-13 15:44:24 +05:30
With the introduction of auto-deploy for Gitaly, the format of
`GITALY_SERVER_VERSION` was aligned with Omnibus syntax.
2021-02-22 17:27:13 +05:30
It no longer supports `=revision`, it evaluates the file content as a Git
reference (branch or SHA). Only if it matches a semver does it prepend a `v`.
2020-03-13 15:44:24 +05:30
If you want to run tests locally against a modified version of Gitaly you
2018-10-15 14:42:47 +05:30
can replace `tmp/tests/gitaly` with a symlink. This is much faster
2021-02-22 17:27:13 +05:30
because it avoids a Gitaly re-install each time you run `rspec`.
Make sure this directory contains the files `config.toml` and `praefect.config.toml`.
You can copy them from `config.toml.example` and `config.praefect.toml.example` respectively.
After copying, make sure to edit them so everything points to the correct paths.
2018-03-17 18:26:18 +05:30
```shell
rm -rf tmp/tests/gitaly
ln -s /path/to/gitaly tmp/tests/gitaly
```
Make sure you run `make` in your local Gitaly directory before running
2021-02-22 17:27:13 +05:30
tests. Otherwise, Gitaly fails to boot.
2018-03-17 18:26:18 +05:30
If you make changes to your local Gitaly in between test runs you need
to manually run `make` again.
2021-02-22 17:27:13 +05:30
Note that CI tests do not use your locally modified version of
2018-03-17 18:26:18 +05:30
Gitaly. To use a custom Gitaly version in CI you need to update
2021-03-08 18:12:59 +05:30
GITALY_SERVER_VERSION as described at the beginning of this section.
2018-03-17 18:26:18 +05:30
2019-02-02 18:00:53 +05:30
To use a different Gitaly repository, e.g., if your changes are present
on a fork, you can specify a `GITALY_REPO_URL` environment variable when
running tests:
```shell
GITALY_REPO_URL=https://gitlab.com/nick.thomas/gitaly bundle exec rspec spec/lib/gitlab/git/repository_spec.rb
```
If your fork of Gitaly is private, you can generate a [Deploy Token](../user/project/deploy_tokens/index.md)
and specify it in the URL:
```shell
GITALY_REPO_URL=https://gitlab+deploy-token-1000:token-here@gitlab.com/nick.thomas/gitaly bundle exec rspec spec/lib/gitlab/git/repository_spec.rb
```
To use a custom Gitaly repository in CI, for instance if you want your
GitLab fork to always use your own Gitaly fork, set `GITALY_REPO_URL`
2019-07-31 22:56:46 +05:30
as a [CI environment variable](../ci/variables/README.md#gitlab-cicd-environment-variables).
2019-02-02 18:00:53 +05:30
2020-03-13 15:44:24 +05:30
### Use a locally modified version of Gitaly RPC client
If you are making changes to the RPC client, such as adding a new endpoint or adding a new
parameter to an existing endpoint, follow the guide for
[Gitaly proto](https://gitlab.com/gitlab-org/gitaly/blob/master/proto/README.md). After pushing
the branch with the changes (`new-feature-branch`, for example):
1. Change the `gitaly` line in the Rails' `Gemfile` to:
```ruby
gem 'gitaly', git: 'https://gitlab.com/gitlab-org/gitaly.git', branch: 'new-feature-branch'
```
1. Run `bundle install` to use the modified RPC client.
2021-03-08 18:12:59 +05:30
Re-run `bundle install` in the `gitlab` project each time the Gitaly branch
changes to embed a new SHA in the `Gemfile.lock` file.
2018-03-17 18:26:18 +05:30
---
[Return to Development documentation](README.md)
2019-07-07 11:18:12 +05:30
## Wrapping RPCs in Feature Flags
Here are the steps to gate a new feature in Gitaly behind a feature flag.
### Gitaly
1. Create a package scoped flag name:
2020-04-08 14:13:33 +05:30
```golang
2019-07-07 11:18:12 +05:30
var findAllTagsFeatureFlag = "go-find-all-tags"
```
1. Create a switch in the code using the `featureflag` package:
2020-04-08 14:13:33 +05:30
```golang
2019-07-07 11:18:12 +05:30
if featureflag.IsEnabled(ctx, findAllTagsFeatureFlag) {
// go implementation
} else {
// ruby implementation
}
```
2019-12-21 20:55:43 +05:30
1. Create Prometheus metrics:
2019-07-07 11:18:12 +05:30
2020-04-08 14:13:33 +05:30
```golang
2019-09-30 21:07:59 +05:30
var findAllTagsRequests = prometheus.NewCounterVec(
prometheus.CounterOpts{
Name: "gitaly_find_all_tags_requests_total",
Help: "Counter of go vs ruby implementation of FindAllTags",
},
[]string{"implementation"},
2019-07-07 11:18:12 +05:30
)
func init() {
2019-09-30 21:07:59 +05:30
prometheus.Register(findAllTagsRequests)
2019-07-07 11:18:12 +05:30
}
if featureflag.IsEnabled(ctx, findAllTagsFeatureFlag) {
2019-09-30 21:07:59 +05:30
findAllTagsRequests.WithLabelValues("go").Inc()
2019-07-07 11:18:12 +05:30
// go implementation
} else {
2019-09-30 21:07:59 +05:30
findAllTagsRequests.WithLabelValues("ruby").Inc()
2019-09-04 21:01:54 +05:30
// ruby implementation
2019-07-07 11:18:12 +05:30
}
```
1. Set headers in tests:
2020-04-08 14:13:33 +05:30
```golang
2019-07-07 11:18:12 +05:30
import (
"google.golang.org/grpc/metadata"
"gitlab.com/gitlab-org/gitaly/internal/featureflag"
)
//...
md := metadata.New(map[string]string{featureflag.HeaderKey(findAllTagsFeatureFlag): "true"})
ctx = metadata.NewOutgoingContext(context.Background(), md)
c, err = client.FindAllTags(ctx, rpcRequest)
require.NoError(t, err)
```
2019-12-04 20:38:33 +05:30
### GitLab Rails
2019-07-07 11:18:12 +05:30
2020-01-01 13:55:28 +05:30
1. Test in a Rails console by setting the feature flag:
2021-02-22 17:27:13 +05:30
NOTE:
2020-01-01 13:55:28 +05:30
Pay attention to the name of the flag and the one used in the Rails console.
There is a difference between them (dashes replaced by underscores and name
2020-03-13 15:44:24 +05:30
prefix is changed). Make sure to prefix all flags with `gitaly_`.
2019-07-07 11:18:12 +05:30
```ruby
2020-01-01 13:55:28 +05:30
Feature.enable('gitaly_go_find_all_tags')
2019-07-07 11:18:12 +05:30
```
2020-01-01 13:55:28 +05:30
### Testing with GDK
To be sure that the flag is set correctly and it goes into Gitaly, you can check
the integration by using GDK:
1. The state of the flag must be observable. To check it, you need to enable it
by fetching the Prometheus metrics:
1. Navigate to GDK's root directory.
1. Make sure you have the proper branch checked out for Gitaly.
1. Recompile it with `make gitaly-setup` and restart the service with `gdk restart gitaly`.
2020-06-23 00:09:42 +05:30
1. Make sure your setup is running: `gdk status | grep praefect`.
1. Check what configuration file is used: `cat ./services/praefect/run | grep praefect` value of the `-config` flag
2020-01-01 13:55:28 +05:30
1. Uncomment `prometheus_listen_addr` in the configuration file and run `gdk restart gitaly`.
1. Make sure that the flag is not enabled yet:
1. Perform whatever action is required to trigger your changes (project creation,
submitting commit, observing history, etc.).
1. Check that the list of current metrics has the new counter for the feature flag:
2020-03-13 15:44:24 +05:30
```shell
2021-02-22 17:27:13 +05:30
curl --silent "http://localhost:9236/metrics" | grep go_find_all_tags
2020-01-01 13:55:28 +05:30
```
1. Once you observe the metrics for the new feature flag and it increments, you
can enable the new feature:
1. Navigate to GDK's root directory.
1. Start a Rails console:
2020-03-13 15:44:24 +05:30
```shell
2020-01-01 13:55:28 +05:30
bundle install && bundle exec rails console
```
1. Check the list of feature flags:
```ruby
Feature::Gitaly.server_feature_flags
```
It should be disabled `"gitaly-feature-go-find-all-tags"=>"false"`.
1. Enable it:
```ruby
Feature.enable('gitaly_go_find_all_tags')
```
1. Exit the Rails console and perform whatever action is required to trigger
your changes (project creation, submitting commit, observing history, etc.).
1. Verify the feature is on by observing the metrics for it:
2020-03-13 15:44:24 +05:30
```shell
2021-02-22 17:27:13 +05:30
curl --silent "http://localhost:9236/metrics" | grep go_find_all_tags
2020-01-01 13:55:28 +05:30
```