Backup and restore Operate and Tasklist data
As of the Camunda 8.8 release, the /actuator
endpoints for backups have been moved to /actuator/backupHistory
. The previous /actuator/backups
endpoint is still active only if the applications are deployed standalone (each application is running in its own process).
This page refers to the components Operate and Tasklist as "web applications".
Optimize is not backed up as part of this process. Optimize is a dedicated application with its own backup system. Please see the documentation for Optimize to perform a backup.
The Camunda web applications store their data over multiple indices in Elasticsearch. A backup of web application data includes several Elasticsearch snapshots containing sets of different indices. Each backup is identified by a backupId
. For example, a backup with an ID of 123
may contain the following Elasticsearch snapshots:
camunda_webapps_123_8.8.0_part_1_of_6
camunda_webapps_123_8.8.0_part_2_of_6
camunda_webapps_123_8.8.0_part_3_of_6
camunda_webapps_123_8.8.0_part_4_of_6
camunda_webapps_123_8.8.0_part_5_of_6
camunda_webapps_123_8.8.0_part_6_of_6
All web applications provide the same API to perform a backup and manage backups (list, check state, delete). Restore a backup using the standard Elasticsearch API.
The backup API can be reached via the Actuator management port, which default defaults to port 9600.
Prerequisites
Snapshot repository
Before you can use the backup and restore feature:
- The Elasticsearch snapshot repository must be configured.
- All deployed web applications must be configured with the same repository name using one of the following configuration options.
- Web applications must have the right to take snapshots.
Configuring different web applications with different repository names will potentially create multiple backups in different repositories.
- YAML file
- Environment variables
Tasklist
- YAML file
- Environment variables
camunda:
tasklist:
backup:
repositoryName: <es snapshot repository name>
CAMUNDA_TASKLIST_BACKUP_REPOSITORYNAME=<es snapshot repository name>
Index prefix
As of Camunda 8.8, the indexPrefix
of all web applications must match. By default it is set to ""
. If overriden, it must set consistently across Operate and Tasklist.
Create backup API
During backup creation, web applications can continue running. To create the backup, call the following endpoint:
POST actuator/backupHistory
{
"backupId": <backupId>
}
For backward compatibility, the endpoint actuator/backups
is available if the component is running standalone.
Response:
Code | Description |
---|---|
200 OK | Backup was successfully started, snapshots will be created asynchronously. List of snapshots is returned in the response body (see example below). This list must be persisted together with the backup ID to be able to restore it later. |
400 Bad Request | In case something is wrong with backupId , e.g. the same backup ID already exists. |
500 Server Error | All other errors, e.g. ES returned error response when attempting to create a snapshot. |
502 Bad Gateway | Elasticsearch is not accessible, the request can be retried when it is back. |
Example request:
curl --request POST 'http://localhost:9600/actuator/backupHistory' \
-H 'Content-Type: application/json' \
-d '{ "backupId": 123 }'
Example response:
{
"scheduledSnapshots": [
"camunda_operate_123_8.8.0_part_1_of_6",
"camunda_operate_123_8.8.0_part_2_of_6",
"camunda_operate_123_8.8.0_part_3_of_6",
"camunda_operate_123_8.8.0_part_4_of_6",
"camunda_operate_123_8.8.0_part_5_of_6",
"camunda_operate_123_8.8.0_part_6_of_6"
]
}
Get backup state API
As a backup is created asynchronously, call the following endpoint to check the state of the backup:
GET actuator/backupHistory/{backupId}
For backward compatibility, the endpoint actuator/backups
is available if the component is running standalone.
Response:
Code | Description |
---|---|
200 OK | Backup state could be determined and is returned in the response body. |
404 Not Found | Backup with given ID does not exist. |
500 Server Error | All other errors, e.g. ES returned error response when attempting to execute the query. |
502 Bad Gateway | Elasticsearch is not accessible, the request can be retried when it is back. |
For example, the request could look like this:
curl 'http://localhost:9600/actuator/backupHistory/123'
Example response:
{
"backupId": 123,
"state": "COMPLETED",
"failureReason": null,
"details": [
//here goes the list of all Elasticsearch snapshots included in the backup
{
"snapshotName": "camunda_operate_123_8.8.0_part_1_of_6",
"state": "SUCCESS",
"startTime": "2023-01-01T10:10:10.100+0000",
"failures": []
},
<..>
]
}
Possible states of the backup:
COMPLETED
: Backup can be used for restoring the data.IN_PROGRESS
: Wait until the backup completes to use it for restore.FAILED
: Something went wrong when creating this backup. To find out the exact problem, use the Elasticsearch get snapshot status API for each of the snapshots included in the given backup.INCOMPATIBLE
: Backup is incompatible with the current Elasticsearch version.INCOMPLETE
: Backup is incomplete (e.g. when backup process was interrupted).
State of the individual snapshot is a copy of Elasticsearch state.
Get backups list API
To get the list of existing backups, the following endpoint can be used:
GET actuator/backupHistory
For backward compatibility, the endpoint actuator/backups
is available if the component is running standalone.
Response:
Code | Description |
---|---|
200 OK | Backup list could be determined and is returned in the response body. Can be an empty response in case no backups were created yet. |
404 Not Found | Backup repository is not configured. |
500 Server Error | All other errors, e.g. ES returned error response when attempting to execute the query. |
502 Bad Gateway | Elasticsearch is not accessible, the request can be retried when it is back. |
For example, the request could look like this:
curl 'http://localhost:9600/actuator/backupHistory'
Response will contain JSON with array of objects representing state of each backup (see get backup state API endpoint).
Delete backup API
To delete all the Elasticsearch snapshots associated with the specific backup id, the following endpoint may be used:
DELETE actuator/backupHistory/123
For backward compatibility, the endpoint actuator/backups
is available if the component is running standalone.
Response:
Code | Description |
---|---|
204 No Content | All commands to delete corresponding ELS snapshots were successfully sent to ELS. ELS will continue deletion asynchronously. |
404 Not Found | Not a single snapshot corresponding to given ID exist. |
500 Server Error | All other errors, e.g. ES returned error response when attempting to execute the query. |
502 Bad Gateway | Elasticsearch is not accessible, the request can be retried when it is back. |
Restore backup
There is no web application API to preform the backup restore. Instead, use the Elasticsearch restore snapshot API.
Operate and Tasklist must not be running while a backup restore is taking place.
To restore the backup with a known backup id, you must restore all the snapshots this backup contains (check the response of the create backup API).
Example of Elasticsearch query:
curl --request POST `http://localhost:9200/_snapshot/test/camunda_operate_123_8.1.0-snapshot_part_1_of_6/_restore?wait_for_completion=true`
To summarize, the process may look as follows:
- Stop all web applications.
- Ensure there are no web application indices present in Elasticsearch (otherwise the restore process will fail).
- Iterate over all Elasticsearch snapshots included in the desired backup and restore them using the Elasticsearch restore snapshot API.
- Start all web applications.