debian-mirror-gitlab/doc/operations/incident_management/alerts.md
2022-07-17 14:43:12 +02:00

10 KiB

stage group info
Monitor Respond To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#assignments

Alerts (FREE)

Alerts are a critical entity in your incident management workflow. They represent a notable event that might indicate a service outage or disruption. GitLab provides a list view for triage and detail view for deeper investigation of what happened.

Alert List

Users with at least the Developer role can access the Alert list at Monitor > Alerts in your project's sidebar. The Alert list displays alerts sorted by start time, but you can change the sort order by clicking the headers in the Alert list.

The alert list displays the following information:

Alert List

  • Search: The alert list supports a simple free text search on the title, description, monitoring tool, and service fields. (Introduced in GitLab 13.1.)
  • Severity: The current importance of a alert and how much attention it should receive. For a listing of all statuses, read Alert Management severity.
  • Start time: How long ago the alert fired. This field uses the standard GitLab pattern of X time ago, but is supported by a granular date/time tooltip depending on the user's locale.
  • Alert description: The description of the alert, which attempts to capture the most meaningful data.
  • Event count: The number of times that an alert has fired.
  • Issue: A link to the incident issue that has been created for the alert.
  • Status: The current status of the alert:
    • Triggered: Investigation has not started.
    • Acknowledged: Someone is actively investigating the problem.
    • Resolved: No further work is required.
    • Ignored: No action will be taken on the alert.

NOTE: Check out a live example available from the tanuki-inc project page in GitLab to examine alerts in action.

Alert severity

Each level of alert contains a uniquely shaped and color-coded icon to help you identify the severity of a particular alert. These severity icons help you immediately identify which alerts you should prioritize investigating:

Alert Management Severity System

Alerts contain one of the following icons:

Severity Icon Color (hexadecimal)
Critical {severity-critical} #8b2615
High {severity-high} #c0341d
Medium {severity-medium} #fca429
Low {severity-low} #fdbc60
Info {severity-info} #418cd8
Unknown {severity-unknown} #bababa

Alert details page

Navigate to the Alert details view by visiting the Alert list and selecting an alert from the list. You need at least the Developer role to access alerts.

NOTE: To review live examples of GitLab alerts, visit the alert list for this demo project. Select any alert in the list to examine its alert details page.

Alerts provide Overview and Alert details tabs to give you the right amount of information you need.

Alert details tab

The Alert details tab has two sections. The top section provides a short list of critical details such as the severity, start time, number of events, and originating monitoring tool. The second section displays the full alert payload.

Metrics tab

  • Introduced in GitLab 13.2.
  • Changed in GitLab 14.10. In GitLab 14.9 and earlier, this tab shows a metrics chart for alerts coming from Prometheus.

In many cases, alerts are associated to metrics. You can upload screenshots of metric charts in the Metrics tab.

To do so, either:

  • Select upload and then select an image from your file browser.
  • Drag a file from your file browser and drop it in the drop zone.

When you upload an image, you can add text to the image and link it to the original graph.

Text link modal

If you add a link, it is shown above the uploaded image.

View an alert's logs

Viewing logs from a metrics panel can be useful if you're triaging an application incident and need to explore logs from across your application. These logs help you understand what's affecting your application's performance and how to resolve any problems.

Prerequisite:

  • You must have at least the Developer role.

To view the logs for an alert:

  1. On the top bar, select Menu > Projects and find your project.
  2. On the left sidebar, select Monitor > Alerts.
  3. Select the alert you want to view.
  4. Below the title of the alert, select the Metrics tab.
  5. Select the menu of the metric chart to view options.
  6. Select View logs.

Activity feed tab

Introduced in GitLab 13.1.

The Activity feed tab is a log of activity on the alert. When you take action on an alert, this is logged as a system note. This gives you a linear timeline of the alert's investigation and assignment history.

The following actions result in a system note:

Alert Details Activity Feed

Alert actions

There are different actions available in GitLab to help triage and respond to alerts.

Update an alert's status

Triggered is the default status for new alerts. For users with the Developer role or higher, the alert status can be updated from these locations:

  • Alert list: select the status dropdown corresponding to an alert, then select an alternate status.
  • Alert details page: select Edit in the right-hand side bar, then select an alternate status.

To stop email notifications for alert reoccurrences in projects with email notifications enabled, change the alert's status away from Triggered.

In projects with GitLab Premium, on-call responders can respond to alert pages by changing the status. Setting the status to:

  • Resolved silences all on-call pages for the alert.
  • Acknowledged limits on-call pages based on the project's escalation policy.
  • Triggered from Resolved restarts the alert escalating from the beginning.

For alerts with an associated incident, updating the alert status also updates the incident status.

Create an incident from an alert

Introduced in GitLab 13.1.

The Alert detail view enables you to create an issue with a description populated from an alert. To create the issue, select the Create Issue button. You can then view the issue from the alert by selecting the View Issue button.

You can also create incidents for alerts automatically.

Closing a GitLab issue associated with an alert changes the alert's status to Resolved. See Alert List for more details about alert statuses.

Assign an alert

Introduced in GitLab 13.1.

In large teams, where there is shared ownership of an alert, it can be difficult to track who is investigating and working on it. Assigning alerts eases collaboration and delegation by indicating which user is owning the alert. GitLab supports only a single assignee per alert.

To assign an alert:

  1. Display the list of current alerts:

    1. On the top bar, select Menu > Projects and find your project.
    2. On the left sidebar, select Monitor > Alerts.
  2. Select your desired alert to display its details.

    Alert Details View Assignees

  3. If the right sidebar is not expanded, select Expand sidebar ({angle-double-right}) to expand it.

  4. On the right sidebar, locate the Assignee, and then select Edit. From the list, select each user you want to assign to the alert. GitLab creates a to-do item for each user.

After completing their portion of investigating or fixing the alert, users can unassign themselves from the alert. To remove an assignee, select Edit next to the Assignee dropdown menu and clear the user from the list of assignees, or select Unassigned.

Create a to-do item from an alert

Introduced in GitLab 13.1.

You can manually create To-Do list items for yourself from the Alert details screen, and view them later on your To-Do List. To add a to-do item:

  1. Display the list of current alerts:

    1. On the top bar, select Menu > Projects and find your project.
    2. On the left sidebar, select Monitor > Alerts.
  2. Select your desired alert to display its Alert Management Details View.

  3. On the right sidebar, select Add a to do:

    Alert Details Add a to do

To view your To-Do List, on the top bar, select To-Do List ({todo-done}).