debian-mirror-gitlab/doc/update/mysql_to_postgresql.md

309 lines
10 KiB
Markdown
Raw Normal View History

2021-01-29 00:20:46 +05:30
---
2021-03-08 18:12:59 +05:30
stage: Enablement
group: Database
2021-02-22 17:27:13 +05:30
info: To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#assignments
2021-01-29 00:20:46 +05:30
---
2021-06-08 01:23:25 +05:30
# Migrating from MySQL to PostgreSQL **(FREE SELF)**
2014-09-02 18:07:02 +05:30
2019-07-07 11:18:12 +05:30
This guide documents how to take a working GitLab instance that uses MySQL and
migrate it to a PostgreSQL database.
2014-09-02 18:07:02 +05:30
2019-07-07 11:18:12 +05:30
## Requirements
2014-09-02 18:07:02 +05:30
2021-02-22 17:27:13 +05:30
NOTE:
2019-09-30 21:07:59 +05:30
Support for MySQL was removed in GitLab 12.1. This procedure should be performed
**before** installing GitLab 12.1.
2021-03-11 19:13:27 +05:30
[pgLoader](https://pgloader.io/) 3.4.1+ is required, confirm with `pgloader -V`.
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
You can install it directly from your distribution, for example in
Debian/Ubuntu:
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
1. Search for the version:
2018-03-17 18:26:18 +05:30
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
apt-cache madison pgloader
```
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
1. If the version is 3.4.1+, install it with:
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
sudo apt-get install pgloader
```
If your distribution's version is too old, use PostgreSQL's repository:
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
# Add repository
sudo sh -c 'echo "deb http://apt.postgresql.org/pub/repos/apt/ $(lsb_release -cs)-pgdg main" > /etc/apt/sources.list.d/pgdg.list'
# Add key
sudo apt-get install wget ca-certificates
wget --quiet -O - https://www.postgresql.org/media/keys/ACCC4CF8.asc | sudo apt-key add -
# Install package
sudo apt-get update
sudo apt-get install pgloader
```
2019-07-31 22:56:46 +05:30
For other distributions, follow the instructions in PostgreSQL's
2019-07-07 11:18:12 +05:30
[download page](https://www.postgresql.org/download/) to add their repository
and then install `pgloader`.
2021-02-22 17:27:13 +05:30
If you are migrating to a Docker based installation, you must install
2021-10-27 15:23:28 +05:30
pgLoader in the container as it is not included in the container image.
2019-07-31 22:56:46 +05:30
1. Start a shell session in the context of the running container:
2020-04-08 14:13:33 +05:30
```shell
2019-07-31 22:56:46 +05:30
docker exec -it gitlab bash
```
2021-03-11 19:13:27 +05:30
1. Install pgLoader:
2019-09-30 21:07:59 +05:30
2020-04-08 14:13:33 +05:30
```shell
2019-07-31 22:56:46 +05:30
apt-get update
apt-get -y install pgloader
```
2019-07-07 11:18:12 +05:30
## Omnibus GitLab installations
2021-02-22 17:27:13 +05:30
For [Omnibus GitLab packages](https://about.gitlab.com/install/), you first
2021-10-27 15:23:28 +05:30
enable the bundled PostgreSQL:
2018-03-17 18:26:18 +05:30
1. Stop GitLab:
2020-03-13 15:44:24 +05:30
```shell
2019-10-12 21:52:04 +05:30
sudo gitlab-ctl stop
```
2018-03-17 18:26:18 +05:30
1. Edit `/etc/gitlab/gitlab.rb` to enable bundled PostgreSQL:
2020-04-08 14:13:33 +05:30
```ruby
2019-10-12 21:52:04 +05:30
postgresql['enable'] = true
```
2018-03-17 18:26:18 +05:30
2021-02-22 17:27:13 +05:30
1. Edit `/etc/gitlab/gitlab.rb` to use the bundled PostgreSQL. Review all of the
settings beginning with `db_` (such as `gitlab_rails['db_adapter']`). To use
the default values, you can comment all of them out.
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
1. [Reconfigure GitLab](../administration/restart_gitlab.md#omnibus-gitlab-reconfigure)
for the changes to take effect.
2021-02-22 17:27:13 +05:30
2021-09-04 01:27:46 +05:30
1. Start Puma and PostgreSQL so that we can prepare the schema:
2018-03-17 18:26:18 +05:30
2020-03-13 15:44:24 +05:30
```shell
2021-09-04 01:27:46 +05:30
sudo gitlab-ctl start puma
2019-10-12 21:52:04 +05:30
sudo gitlab-ctl start postgresql
```
2018-03-17 18:26:18 +05:30
1. Run the following commands to prepare the schema:
2020-03-13 15:44:24 +05:30
```shell
2019-10-12 21:52:04 +05:30
sudo gitlab-rake db:create db:migrate
```
2018-03-17 18:26:18 +05:30
2021-09-04 01:27:46 +05:30
1. Stop Puma to prevent other database access from interfering with the loading of data:
2018-03-17 18:26:18 +05:30
2020-03-13 15:44:24 +05:30
```shell
2021-09-04 01:27:46 +05:30
sudo gitlab-ctl stop puma
2019-10-12 21:52:04 +05:30
```
2018-03-17 18:26:18 +05:30
2021-02-22 17:27:13 +05:30
After these steps, you have a fresh PostgreSQL database with up-to-date schema.
2018-03-17 18:26:18 +05:30
2021-02-22 17:27:13 +05:30
Next, use `pgloader` to migrate the data from the old MySQL database to the
2019-07-07 11:18:12 +05:30
new PostgreSQL one:
2018-03-17 18:26:18 +05:30
1. Save the following snippet in a `commands.load` file, and edit with your
2019-07-07 11:18:12 +05:30
MySQL database `username`, `password` and `host`:
2018-03-17 18:26:18 +05:30
2020-04-08 14:13:33 +05:30
```sql
2019-10-12 21:52:04 +05:30
LOAD DATABASE
FROM mysql://username:password@host/gitlabhq_production
INTO postgresql://gitlab-psql@unix://var/opt/gitlab/postgresql:/gitlabhq_production
2018-03-17 18:26:18 +05:30
2019-10-12 21:52:04 +05:30
WITH include no drop, truncate, disable triggers, create no tables,
create no indexes, preserve index names, no foreign keys,
data only
2018-03-17 18:26:18 +05:30
2019-12-04 20:38:33 +05:30
SET MySQL PARAMETERS
net_read_timeout = '90',
net_write_timeout = '180'
2019-10-12 21:52:04 +05:30
ALTER SCHEMA 'gitlabhq_production' RENAME TO 'public'
2018-03-17 18:26:18 +05:30
2019-10-12 21:52:04 +05:30
;
```
2018-03-17 18:26:18 +05:30
1. Start the migration:
2020-03-13 15:44:24 +05:30
```shell
2019-10-12 21:52:04 +05:30
sudo -u gitlab-psql pgloader commands.load
```
2018-03-17 18:26:18 +05:30
2021-03-11 19:13:27 +05:30
1. After the migration finishes, you should see a summary table that looks like
2019-07-07 11:18:12 +05:30
the following:
2018-03-17 18:26:18 +05:30
2020-04-08 14:13:33 +05:30
```plaintext
2019-10-12 21:52:04 +05:30
table name read imported errors total time
----------------------------------------------- --------- --------- --------- --------------
fetch meta data 119 119 0 0.388s
Truncate 119 119 0 1.134s
----------------------------------------------- --------- --------- --------- --------------
public.abuse_reports 0 0 0 0.490s
public.appearances 0 0 0 0.488s
.
.
.
public.web_hook_logs 0 0 0 1.080s
----------------------------------------------- --------- --------- --------- --------------
COPY Threads Completion 4 4 0 2.008s
Reset Sequences 113 113 0 0.304s
Install Comments 0 0 0 0.000s
----------------------------------------------- --------- --------- --------- --------------
Total import time 1894 1894 0 12.497s
```
If there is no output for more than 30 minutes, it's possible `pgloader` encountered an error. See
the [troubleshooting guide](#troubleshooting) for more details.
2018-03-17 18:26:18 +05:30
1. Start GitLab:
2020-03-13 15:44:24 +05:30
```shell
2019-10-12 21:52:04 +05:30
sudo gitlab-ctl start
```
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
You can now verify that everything works as expected by visiting GitLab.
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
## Source installations
2014-09-02 18:07:02 +05:30
2021-02-22 17:27:13 +05:30
For installations from source that use MySQL, you must first
2019-07-07 11:18:12 +05:30
[install PostgreSQL and create a database](../install/installation.md#6-database).
2015-04-26 12:48:37 +05:30
2019-07-07 11:18:12 +05:30
After the database is created, go on with the following steps:
2018-03-17 18:26:18 +05:30
1. Stop GitLab:
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
sudo service gitlab stop
```
2018-03-17 18:26:18 +05:30
1. Switch database from MySQL to PostgreSQL
2014-09-02 18:07:02 +05:30
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
cd /home/git/gitlab
sudo -u git mv config/database.yml config/database.yml.bak
sudo -u git cp config/database.yml.postgresql config/database.yml
sudo -u git -H chmod o-rwx config/database.yml
```
2019-12-21 20:55:43 +05:30
1. Install Gems related to PostgreSQL
2018-11-18 11:00:15 +05:30
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
sudo -u git -H rm .bundle/config
sudo -u git -H bundle install --deployment --without development test mysql aws kerberos
```
2014-09-02 18:07:02 +05:30
2018-03-17 18:26:18 +05:30
1. Run the following commands to prepare the schema:
2014-09-02 18:07:02 +05:30
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
sudo -u git -H bundle exec rake db:create db:migrate RAILS_ENV=production
```
2014-09-02 18:07:02 +05:30
2021-02-22 17:27:13 +05:30
After these steps, you have a fresh PostgreSQL database with up-to-date schema.
2014-09-02 18:07:02 +05:30
2021-02-22 17:27:13 +05:30
Next, use `pgloader` to migrate the data from the old MySQL database to the
2019-07-07 11:18:12 +05:30
new PostgreSQL one:
2014-09-02 18:07:02 +05:30
2018-03-17 18:26:18 +05:30
1. Save the following snippet in a `commands.load` file, and edit with your
MySQL `username`, `password` and `host`:
2014-09-02 18:07:02 +05:30
2020-04-08 14:13:33 +05:30
```sql
2019-07-07 11:18:12 +05:30
LOAD DATABASE
FROM mysql://username:password@host/gitlabhq_production
INTO postgresql://postgres@unix://var/run/postgresql:/gitlabhq_production
2014-09-02 18:07:02 +05:30
2019-07-07 11:18:12 +05:30
WITH include no drop, truncate, disable triggers, create no tables,
create no indexes, preserve index names, no foreign keys,
data only
2014-09-02 18:07:02 +05:30
2019-12-04 20:38:33 +05:30
SET MySQL PARAMETERS
net_read_timeout = '90',
net_write_timeout = '180'
2019-07-07 11:18:12 +05:30
ALTER SCHEMA 'gitlabhq_production' RENAME TO 'public'
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
;
```
2018-03-17 18:26:18 +05:30
1. Start the migration:
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
sudo -u postgres pgloader commands.load
```
2018-03-17 18:26:18 +05:30
2021-03-11 19:13:27 +05:30
1. After the migration finishes, you should see a summary table that looks like
2019-07-07 11:18:12 +05:30
the following:
2020-04-08 14:13:33 +05:30
```plaintext
2019-07-07 11:18:12 +05:30
table name read imported errors total time
----------------------------------------------- --------- --------- --------- --------------
fetch meta data 119 119 0 0.388s
Truncate 119 119 0 1.134s
----------------------------------------------- --------- --------- --------- --------------
public.abuse_reports 0 0 0 0.490s
public.appearances 0 0 0 0.488s
.
.
.
public.web_hook_logs 0 0 0 1.080s
----------------------------------------------- --------- --------- --------- --------------
COPY Threads Completion 4 4 0 2.008s
Reset Sequences 113 113 0 0.304s
Install Comments 0 0 0 0.000s
----------------------------------------------- --------- --------- --------- --------------
Total import time 1894 1894 0 12.497s
```
If there is no output for more than 30 minutes, it's possible `pgloader` encountered an error. See
the [troubleshooting guide](#troubleshooting) for more details.
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
1. Start GitLab:
2018-03-17 18:26:18 +05:30
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
sudo service gitlab start
```
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
You can now verify that everything works as expected by visiting GitLab.
2018-03-17 18:26:18 +05:30
2019-07-07 11:18:12 +05:30
## Troubleshooting
2019-05-18 00:54:41 +05:30
2019-07-07 11:18:12 +05:30
Sometimes, you might encounter some errors during or after the migration.
### Database error permission denied
2018-03-17 18:26:18 +05:30
2021-10-27 15:23:28 +05:30
The PostgreSQL user that you use for the migration **must** have **superuser** privileges.
2019-07-07 11:18:12 +05:30
Otherwise, you may see a similar message to the following:
2018-03-17 18:26:18 +05:30
2020-04-08 14:13:33 +05:30
```plaintext
2019-07-07 11:18:12 +05:30
debugger invoked on a CL-POSTGRES-ERROR:INSUFFICIENT-PRIVILEGE in thread
#<THREAD "lparallel" RUNNING {10078A3513}>:
Database error 42501: permission denied: "RI_ConstraintTrigger_a_20937" is a system trigger
QUERY: ALTER TABLE ci_builds DISABLE TRIGGER ALL;
2017-08-23T00:36:56.782000Z ERROR Database error 42501: permission denied: "RI_ConstraintTrigger_c_20864" is a system trigger
QUERY: ALTER TABLE approver_groups DISABLE TRIGGER ALL;
```
2018-03-17 18:26:18 +05:30
2021-10-27 15:23:28 +05:30
### 500 errors after the migration
2018-03-17 18:26:18 +05:30
If you experience 500 errors after the migration, try to clear the cache:
2020-03-13 15:44:24 +05:30
```shell
2019-07-07 11:18:12 +05:30
# Omnibus GitLab
sudo gitlab-rake cache:clear
# Installations from source
2018-03-17 18:26:18 +05:30
sudo -u git -H bundle exec rake cache:clear RAILS_ENV=production
2014-09-02 18:07:02 +05:30
```