Version: 8.9 (unreleased)

Migrate from Bitnami subcharts to managed services

Migrate a Camunda 8 Helm installation from Bitnami-managed infrastructure to cloud-managed services, such as:

PostgreSQL: AWS RDS, Azure Database for PostgreSQL, Google Cloud SQL, or any managed PostgreSQL service
Elasticsearch: Elastic Cloud or any managed Elasticsearch service
Keycloak: This guide does not assume a managed Keycloak service. Keep Keycloak on the Keycloak Operator, or replace it with an external OIDC provider if that better fits your environment.

When to use this guide

Managed services are ideal when your organization:

Prefers minimal operational overhead for database and search infrastructure
Already uses a cloud provider's managed services catalog
Requires SLA-backed availability and automated patching from the cloud vendor
Does not want to manage Kubernetes operators for infrastructure components

Read the topic overview to learn why you should migrate.

Choose your migration strategy

Before starting, choose between the standard migration and the warm reindex strategy. This choice affects how Elasticsearch data is transferred and, therefore, how long the downtime window lasts.

Standard

The standard migration performs all Elasticsearch data transfer during the cutover (Phase 3). Downtime scales linearly with Elasticsearch data volume, typically 5–60 minutes.

This is the simplest option and is recommended when:

Your Elasticsearch data volume is small (< 1 GB) or moderate (< 10 GB).
You can tolerate a longer maintenance window.
You want the fewest moving parts.

Phase	Description	Downtime
Phase 1 – Deploy targets	Install operators and create target clusters alongside Bitnami	No
Phase 2 – Initial backup	Back up all data while the application is still running	No
Phase 3 – Cutover	Freeze → final backup → full ES reindex → Helm upgrade	Yes (5–60 minutes typical)
Phase 4 – Validate	Verify all components are healthy on the new infrastructure	No
Phase 5 – Cleanup Bitnami	Remove old Bitnami resources and re-verify	No

Reduced downtime (warm reindex)

The warm reindex strategy pre-copies Elasticsearch data to the target during Phase 2, while the application is still running. At cutover, only a fast delta reindex is needed to sync changes written since Phase 2, significantly reducing the downtime window.

Choose this option when:

Your Elasticsearch data volume is large (> 10 GB).
You need to minimize the maintenance window.
You can accept the additional complexity of a two-step ES migration.

To enable, set ES_WARM_REINDEX=true in env.sh.

Phase	Description	Downtime
Phase 1 – Deploy targets	Install operators and create target clusters alongside Bitnami	No
Phase 2 – Initial backup	Back up all data + full ES reindex to target (while app runs)	No
Phase 3 – Cutover	Freeze → final backup → delta ES reindex → Helm upgrade	Yes (~5 minutes)
Phase 4 – Validate	Verify all components are healthy on the new infrastructure	No
Phase 5 – Cleanup Bitnami	Remove old Bitnami resources and re-verify	No

Prerequisites

Before starting the migration, ensure you have the following general prerequisites:

A running Camunda 8 installation using the Helm chart with Bitnami subcharts enabled
kubectl configured and pointing to your cluster
helm with the camunda/camunda-platform repository added
Sufficient cluster resources to temporarily run both old and new infrastructure side-by-side
A tested backup of your current installation (see Precautions)

In addition to the general prerequisites:

Managed services already provisioned: PostgreSQL and Elasticsearch instances must be running and accessible before starting the migration. If you haven't provisioned them yet, see the provisioning reference in Step 1.
Ensure network connectivity between your Kubernetes cluster and the managed services.
Have credentials ready for each managed service.

Precautions

Review the general precautions that apply to all migration paths.

Before running in production

Review the operational readiness checklist, including the staging rehearsal and pre-migration checklist, before starting a production migration.

IRSA / IAM-based authentication not supported

AWS only

The migration jobs use password-based PostgreSQL authentication (PGPASSWORD) and the standard Elasticsearch HTTP API. Setups that use AWS IAM Roles for Service Accounts (IRSA) with jdbc:aws-wrapper, or Elasticsearch endpoints protected by cloud-specific IAM authentication, require a custom migration approach.

Identity authentication

You need to decide how Identity will authenticate before the cutover. For managed services, the infrastructure decision is separate from the authentication decision:

If you keep Keycloak, deploy it with the Keycloak Operator and set the hostname to the full public URL, for example https://your-domain.example.com/auth.
If you replace Keycloak with external OIDC, prepare the provider configuration and the corresponding Identity Helm values before running the migration.

Elasticsearch to OpenSearch not supported

This migration path does not support moving data from the Bitnami Elasticsearch subchart to Amazon OpenSearch Service or another OpenSearch target.

If your target architecture requires OpenSearch, treat that as a separate replatforming effort instead of a supported in-place migration from the Bitnami Elasticsearch subchart.

Clone the deployment references repository

The migration scripts are part of the Camunda deployment references repository. Clone the repository and navigate to the migration directory:

git clone https://github.com/camunda/camunda-deployment-references.git
cd camunda-deployment-references/generic/kubernetes/migration

The migration reuses the operator-based reference architecture scripts for deploying target infrastructure, ensuring consistency:

generic/kubernetes/
├── operator-based/                  # Reference architecture (reused by migration)
│   ├── postgresql/
│   │   ├── deploy.sh               #   CNPG operator + cluster deployment
│   │   ├── set-secrets.sh          #   PostgreSQL secret management
│   │   ├── postgresql-clusters.yml #   ★ CUSTOMIZE: PG cluster specs
│   │   ├── camunda-identity-values.yml
│   │   └── camunda-webmodeler-values.yml
│   ├── elasticsearch/
│   │   ├── deploy.sh               #   ECK operator + cluster deployment
│   │   ├── elasticsearch-cluster.yml #   ★ CUSTOMIZE: ES cluster specs
│   │   └── camunda-elastic-values.yml
│   └── keycloak/
│       ├── deploy.sh               #   Keycloak operator + CR deployment
│       ├── keycloak-instance-*.yml #   ★ CUSTOMIZE: Keycloak CR specs
│       ├── camunda-keycloak-domain-values.yml
│       └── camunda-keycloak-no-domain-values.yml
│
└── migration/                       # Migration scripts
    ├── env.sh                       # Configuration variables
    ├── lib.sh                       # Shared library (do not edit)
    ├── 1-deploy-targets.sh          # Phase 1: Deploy operators + clusters
    ├── 2-backup.sh                  # Phase 2: Initial backup
    ├── 3-cutover.sh                 # Phase 3: Freeze → Restore → Switch
    ├── 4-validate.sh                # Phase 4: Validate everything
    ├── 5-cleanup-bitnami.sh         # Phase 5: Remove old Bitnami resources
    ├── rollback.sh                  # Emergency rollback
    ├── .state/                      # Migration state tracking (auto-generated)
    ├── hooks/                       # Custom hook scripts (optional)
    ├── jobs/                        # Kubernetes Job templates
    │   ├── pg-backup.job.yml
    │   ├── pg-restore.job.yml
    │   ├── es-backup.job.yml        #   ES health verification
    │   └── es-restore.job.yml       #   ES reindex-from-remote restore
    └── manifests/
        ├── backup-pvc.yml           # Shared backup PVC
        └── eck-migration-patch.yml  # ES reindex.remote.whitelist patch

Step 1: Create Kubernetes Secrets for managed services

This step assumes your managed PostgreSQL and Elasticsearch services are already provisioned and accessible from the Kubernetes cluster. You need the endpoint, port, database names, usernames, and credentials for each service.

Provisioning reference

If you haven't provisioned your managed services yet, use your cloud provider's official documentation:

PostgreSQL:

Minimum requirements:

Create the identity, keycloak, and webmodeler databases.
Create or assign credentials for each database according to your provider's model.
Ensure private connectivity from the Kubernetes cluster to PostgreSQL.
Record the endpoint, port, database names, usernames, and secret material.

Elasticsearch:

Elastic Cloud — note the deployment endpoint, Cloud ID, and credentials.
Self-managed Elasticsearch — ensure it is accessible from the cluster, and note the endpoint and credentials.

Create Kubernetes Secrets

Store the managed service credentials as Kubernetes Secrets so both the migration scripts and Camunda Helm chart can use them:

# PostgreSQL secrets — one per component
kubectl create secret generic external-pg-identity \
  -n ${NAMESPACE} \
  --from-literal=password='<identity-pg-password>'

kubectl create secret generic external-pg-keycloak \
  -n ${NAMESPACE} \
  --from-literal=password='<keycloak-pg-password>'

kubectl create secret generic external-pg-webmodeler \
  -n ${NAMESPACE} \
  --from-literal=password='<webmodeler-pg-password>'

# Elasticsearch secret
kubectl create secret generic external-es \
  -n ${NAMESPACE} \
  --from-literal=elastic='<es-password>'
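The Secret key names matter as much as the Secret names, because the Helm values later reference them (password for PostgreSQL, elastic for Elasticsearch). The following is a minimal sketch of a check, not part of the migration scripts: check_secret_key is a hypothetical helper, and the KUBECTL variable is only there so that the command can be swapped out when testing.

```shell
# Hypothetical helper (not part of the migration scripts).
KUBECTL="${KUBECTL:-kubectl}"

# Print "<secret>/<key>: ok" if the key exists and is non-empty,
# otherwise "<secret>/<key>: missing".
check_secret_key() {  # usage: check_secret_key <secret> <key>
  value=$("$KUBECTL" get secret "$1" -n "${NAMESPACE}" \
    -o "jsonpath={.data.$2}" 2>/dev/null) || value=""
  if [ -n "$value" ]; then
    echo "$1/$2: ok"
  else
    echo "$1/$2: missing"
  fi
}

# Usage, with NAMESPACE set:
#   check_secret_key external-pg-identity password
#   check_secret_key external-pg-keycloak password
#   check_secret_key external-pg-webmodeler password
#   check_secret_key external-es elastic
```

The check only confirms that the key is present. It does not validate the password against the managed service.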

Step 2: Configure the migration for external targets

Terminology — "managed services" vs. "external targets"

The migration scripts use the term external targets (PG_TARGET_MODE=external, ES_TARGET_MODE=external) for any non-operator target. This includes cloud-managed services (AWS RDS, Elastic Cloud, etc.) but also self-hosted databases outside the Kubernetes cluster. This guide uses "managed services" as a shorthand, but the scripts themselves are not restricted to cloud-managed offerings.

Edit env.sh and set the target mode to external. The base configuration variables (NAMESPACE, CAMUNDA_RELEASE_NAME, MIGRATE_*, etc.) are the same as in the operator-based guide; only the target mode and external endpoint variables differ:

External target configuration example:

# Set target modes
export PG_TARGET_MODE="external"
export ES_TARGET_MODE="external"

# PostgreSQL external targets
export EXTERNAL_PG_IDENTITY_HOST="your-rds-endpoint.region.rds.amazonaws.com"
export EXTERNAL_PG_IDENTITY_PORT="5432"
export EXTERNAL_PG_IDENTITY_SECRET="external-pg-identity"

export EXTERNAL_PG_KEYCLOAK_HOST="your-rds-endpoint.region.rds.amazonaws.com"
export EXTERNAL_PG_KEYCLOAK_PORT="5432"
export EXTERNAL_PG_KEYCLOAK_SECRET="external-pg-keycloak"

export EXTERNAL_PG_WEBMODELER_HOST="your-rds-endpoint.region.rds.amazonaws.com"
export EXTERNAL_PG_WEBMODELER_PORT="5432"
export EXTERNAL_PG_WEBMODELER_SECRET="external-pg-webmodeler"

# Elasticsearch external target
export EXTERNAL_ES_HOST="your-elastic-endpoint.example.com"
export EXTERNAL_ES_PORT="443"
export EXTERNAL_ES_SECRET="external-es"

You can use the same managed PostgreSQL host for all components—each database is separate. This is common when using a single RDS instance with multiple databases.
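A typo in one of the variables above is easy to miss until a later phase fails. As a hedged sketch, the following pre-flight check reports any external-target variable that is unset or empty. check_external_vars is a hypothetical helper, not something the migration scripts provide, and it only covers the variables shown in the example above.

```shell
# Hypothetical pre-flight helper (not part of the migration scripts):
# report every external-target variable that is unset or empty.
check_external_vars() {
  missing=0
  for name in \
    EXTERNAL_PG_IDENTITY_HOST EXTERNAL_PG_IDENTITY_PORT EXTERNAL_PG_IDENTITY_SECRET \
    EXTERNAL_PG_KEYCLOAK_HOST EXTERNAL_PG_KEYCLOAK_PORT EXTERNAL_PG_KEYCLOAK_SECRET \
    EXTERNAL_PG_WEBMODELER_HOST EXTERNAL_PG_WEBMODELER_PORT EXTERNAL_PG_WEBMODELER_SECRET \
    EXTERNAL_ES_HOST EXTERNAL_ES_PORT EXTERNAL_ES_SECRET
  do
    # Indirect variable lookup that works in POSIX sh as well as bash.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "missing: $name" >&2
      missing=1
    fi
  done
  # Returns 0 when everything is set, 1 otherwise.
  return "$missing"
}

# Usage, after sourcing env.sh:
#   check_external_vars && echo "external target variables look complete"
```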

Create custom Helm values

When using external targets, you need a custom Helm values file that configures Camunda to connect to the managed services. Set CUSTOM_HELM_VALUES_FILE to point to this file:

export CUSTOM_HELM_VALUES_FILE="./my-external-values.yaml"

Example custom values file for AWS RDS + external Elasticsearch:

# Disable Bitnami subcharts
identityPostgresql:
  enabled: false
webModelerPostgresql:
  enabled: false
elasticsearch:
  enabled: false
identityKeycloak:
  enabled: false

# Configure Identity to use external PostgreSQL
identity:
  externalDatabase:
    host: "your-rds-endpoint.region.rds.amazonaws.com"
    port: 5432
    database: "identity"
    username: "identity"
    existingSecret: "external-pg-identity"
    existingSecretPasswordKey: "password"

# Configure Web Modeler to use external PostgreSQL
webModeler:
  restapi:
    externalDatabase:
      host: "your-rds-endpoint.region.rds.amazonaws.com"
      port: 5432
      database: "webmodeler"
      user: "webmodeler"
      existingSecret: "external-pg-webmodeler"
      existingSecretPasswordKey: "password"

# Configure external Elasticsearch using the per-component values schema
orchestration:
  data:
    secondaryStorage:
      type: elasticsearch
      elasticsearch:
        url: "https://your-elastic-endpoint.example.com:443"
        auth:
          username: "elastic"
          secret:
            existingSecret: "external-es"
            existingSecretKey: "elastic"

optimize:
  database:
    elasticsearch:
      enabled: true
      external: true
      url:
        protocol: "https"
        host: "your-elastic-endpoint.example.com"
        port: 443
      auth:
        username: "elastic"
        secret:
          existingSecret: "external-es"
          existingSecretKey: "elastic"


Helm values customization

The example above is a starting point. Adjust the values to match your specific managed service configuration, authentication method (IAM, username/password, etc.), and TLS requirements. Refer to the Camunda Helm chart parameters for all available options.

Source `env.sh`

Source the configuration:

source env.sh

Step 3: Run the migration

The migration follows the same five-phase approach described in the migration overview. Each phase is idempotent and can be safely rerun. The key difference with external targets is that operator installation is skipped for components using managed services.

Phase 1: Deploy targets (no downtime)

bash 1-deploy-targets.sh

What happens:

When PG_TARGET_MODE=external, the CloudNativePG (CNPG) operator is not installed; your managed PostgreSQL is used directly.
When ES_TARGET_MODE=external, the Elastic Cloud on Kubernetes (ECK) operator is not installed; your managed Elasticsearch target is used directly.
The Keycloak Operator is still deployed with a Custom Resource pointing to your managed PostgreSQL.
The script validates connectivity to each external endpoint before proceeding.

Phase 2: Initial backup (no downtime)

bash 2-backup.sh

What happens (standard):

PostgreSQL: A pg_dump Kubernetes Job is created for each component (Identity, Keycloak, and Web Modeler).
Elasticsearch: A verification job checks source Elasticsearch health and lists all Camunda indices to be migrated.
All backup data is stored on a shared Persistent Volume Claim (PVC).

What happens (reduced downtime, warm reindex):

PostgreSQL: A pg_dump Kubernetes Job is created for each component (Identity, Keycloak, and Web Modeler).
Elasticsearch: A verification job checks source Elasticsearch health and lists all Camunda indices to be migrated.
Elasticsearch warm reindex: A full reindex from the source Bitnami ES to the managed target is performed while the application is still running. This pre-populates the target with all existing data so Phase 3 only needs a fast delta reindex. The warm reindex may take a significant amount of time depending on your data volume, but it runs without any downtime.
All backup data is stored on a shared Persistent Volume Claim (PVC).

The target type does not affect backup operations — backups always run against the source Bitnami instances.

Phase 3: Cutover (downtime required)

Maintenance window required

This is the only phase that causes downtime. Schedule a maintenance window before proceeding.

With ES_WARM_REINDEX=true, downtime is reduced to ~5 minutes regardless of Elasticsearch data volume. Phase 3 only syncs the delta written since the warm reindex in Phase 2.

Measure downtime before the real cutover

You can run bash 3-cutover.sh --estimate to measure the actual cutover duration on your environment without causing any downtime. This runs the real data operations (PG backup/restore and ES reindex) against the target infrastructure but skips freezing the application and the Helm upgrade. See Measure with --estimate for details.

bash 3-cutover.sh

What happens (standard):

Save current Helm values for rollback.
Freeze all Camunda deployments and StatefulSets (scale to zero replicas).
Final backup: a consistent backup with no active connections.
Restore: pg_restore runs against the managed PostgreSQL endpoints instead of CNPG clusters.
Elasticsearch full reindex from remote: all indices are copied from the source Bitnami ES to the managed target using the _reindex API. This is the dominant factor in downtime duration.
Helm upgrade: reconfigures Camunda to use the new backends and restarts all components.

What happens (reduced downtime, warm reindex):

Save current Helm values for rollback.
Freeze all Camunda deployments and StatefulSets (scale to zero replicas).
Final backup: a consistent backup with no active connections.
Restore: pg_restore runs against the managed PostgreSQL endpoints.
Elasticsearch delta reindex: only documents written between Phase 2 (warm reindex) and the freeze are synced. This uses version_type=external with conflicts=proceed to skip documents already present on the target, making it dramatically faster than a full reindex.
Helm upgrade: reconfigures Camunda to use the new backends and restarts all components.
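To make the delta reindex step more concrete, the sketch below builds a _reindex request body that combines conflicts=proceed with version_type=external for a single index. It illustrates the Elasticsearch options named above and is not the request the migration scripts actually send. delta_reindex_body is a hypothetical helper, and the endpoint, index name, and credentials in the usage comment are placeholders.

```shell
# Hypothetical helper: print a _reindex request body for one index.
# Arguments: <source-url> <source-user> <source-password> <index>
# Note: values are inserted verbatim, so they must not contain
# characters that need JSON escaping (quotes, backslashes).
delta_reindex_body() {
  printf '{"conflicts":"proceed","source":{"remote":{"host":"%s","username":"%s","password":"%s"},"index":"%s"},"dest":{"index":"%s","version_type":"external"}}\n' \
    "$1" "$2" "$3" "$4" "$4"
}

# Usage (all endpoints and credentials are placeholders):
#   delta_reindex_body "http://source-es:9200" elastic "<password>" "<index>" \
#     | curl -sS -u "elastic:<target-password>" -X POST \
#         -H 'Content-Type: application/json' \
#         "https://target-endpoint/_reindex?wait_for_completion=true" -d @-
```

With version_type set to external, Elasticsearch keeps the source document version and rejects writes for documents whose version on the target is already the same or newer. With conflicts set to proceed, those rejections are counted instead of aborting the request.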

Elasticsearch data migration for managed services

When ES_WARM_REINDEX=true is set in env.sh, the migration scripts automatically handle Elasticsearch data transfer for external targets using the _reindex API. Phase 2 performs a full reindex from the source Bitnami ES to the managed target (no downtime), and Phase 3 runs a fast delta reindex to sync changes.

Prerequisite for automated ES migration

Your managed Elasticsearch target must have reindex.remote.whitelist configured to allow pulling data from the source Bitnami ES service. This is required for the _reindex API to work across clusters. Consult your managed Elasticsearch provider's documentation for how to configure this setting.
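The reindex.remote.whitelist setting takes host:port entries without a URL scheme. As a small sketch, the hypothetical helper below derives such an entry from a source URL. The URL used in the example is a placeholder, and how you apply the resulting value depends on your provider.

```shell
# Hypothetical helper: turn a source URL into a host:port entry
# suitable for reindex.remote.whitelist.
whitelist_entry() {  # usage: whitelist_entry <source-url>
  entry="${1#*://}"      # drop the scheme (http:// or https://)
  entry="${entry%%/*}"   # drop any path after host:port
  echo "$entry"
}

# Example with a placeholder source URL:
#   whitelist_entry "http://source-es:9200"
# On a self-managed target, the printed value would go into elasticsearch.yml as:
#   reindex.remote.whitelist: "source-es:9200"
```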

If you cannot configure reindex.remote.whitelist on the managed target, or prefer a manual approach, you can leave ES_WARM_REINDEX=false (default) and transfer data manually using one of the options below:

elasticdump
S3 snapshot repository
Reindex API

elasticdump

Use the elasticdump npm tool to transfer indices from source to target. For example:

# Install elasticdump
npm install -g elasticdump

# Get source ES password
SOURCE_ES_PWD=$(kubectl get secret ${CAMUNDA_RELEASE_NAME}-elasticsearch \
  -n ${NAMESPACE} -o jsonpath='{.data.elasticsearch-password}' | base64 -d)

# Port-forward source ES
kubectl port-forward svc/${CAMUNDA_RELEASE_NAME}-elasticsearch -n ${NAMESPACE} 9200:9200 &

# Dump and restore each index pattern
for pattern in zeebe operate tasklist optimize connectors camunda; do
  elasticdump \
    --input="http://elastic:${SOURCE_ES_PWD}@localhost:9200/${pattern}-*" \
    --output="https://elastic:<password>@your-elastic-endpoint.example.com:443/${pattern}-*" \
    --type=data \
    --limit=1000
done

S3 snapshot repository

If both the source and target Elasticsearch clusters support Amazon S3 snapshot repositories, you can use a shared S3 bucket. For example:

# Register an S3 snapshot repository on the source Bitnami Elasticsearch, and create a snapshot
curl -X PUT "localhost:9200/_snapshot/s3_backup" \
  -H 'Content-Type: application/json' \
  -d '{"type":"s3","settings":{"bucket":"my-migration-bucket","region":"us-east-1"}}'

curl -X PUT "localhost:9200/_snapshot/s3_backup/migration?wait_for_completion=true" \
  -H 'Content-Type: application/json' \
  -d '{"indices":"*","ignore_unavailable":true}'

# Register the same S3 repository on the target managed Elasticsearch, and restore the snapshot
curl -X PUT "https://target-endpoint/_snapshot/s3_backup" \
  -H 'Content-Type: application/json' \
  -d '{"type":"s3","settings":{"bucket":"my-migration-bucket","region":"us-east-1"}}'

curl -X POST "https://target-endpoint/_snapshot/s3_backup/migration/_restore" \
  -H 'Content-Type: application/json' \
  -d '{"indices":"*","ignore_unavailable":true}'

Reindex API

Use the Elasticsearch reindex API to copy data from the source to the target. This requires the target to allowlist the source as a remote.

Reindex each concrete Camunda index individually rather than using a single wildcard destination. To stay aligned with the migration scripts, include zeebe-*, operate-*, tasklist-*, optimize-*, connectors-*, and camunda-* indices. For example:

# On the target, add the source to reindex.remote.whitelist.
# Then iterate over every Camunda index you want to copy.
for idx in $(curl -s -u "elastic:<password>" \
  "http://source-es:9200/_cat/indices/zeebe-*,operate-*,tasklist-*,optimize-*,connectors-*,camunda-*?h=index"); do
  # Authenticate against the target if it requires it; <target-password> is a placeholder.
  curl -X POST "https://target-endpoint/_reindex?wait_for_completion=true" \
    -u "elastic:<target-password>" \
    -H 'Content-Type: application/json' \
    -d '{
      "source": {
        "remote": {
          "host": "http://source-es:9200",
          "username": "elastic",
          "password": "<password>"
        },
        "index": "'"${idx}"'"
      },
      "dest": {
        "index": "'"${idx}"'"
      }
    }'
done

Review the source index list before running the loop. If your deployment uses custom index prefixes, include those prefixes in the _cat/indices query.

Phase 4: Validate (no downtime)

bash 4-validate.sh

The validation script checks that all Camunda deployments and StatefulSets are ready, and that the Keycloak Custom Resource is healthy. For external PostgreSQL and Elasticsearch targets, it verifies connectivity to the managed service endpoints rather than checking CNPG/ECK cluster status. A migration report is generated at .state/migration-report.md.

Wait before cleanup

Do not move on to Phase 5 immediately after validation. Operate on the new infrastructure through at least one full business cycle (for example, a complete weekday with peak traffic) before cleanup. Once Bitnami resources are deleted, rollback is no longer possible without restoring from backup. If you need to fail back, run bash rollback.sh before starting Phase 5 (see Rollback).

Phase 5: Cleanup Bitnami resources (no downtime)

Destructive and irreversible

This phase permanently deletes old Bitnami StatefulSets, PVCs, and the migration backup PVC. After cleanup, rollback to Bitnami subcharts is no longer possible.

Before running this phase, strongly consider:

Taking a full backup of all databases (pg_dumpall or equivalent)
Taking PVC or storage volume snapshots (cloud provider snapshots)
Storing backups in cold storage—for example, S3 Glacier or GCS Archive
Keeping rollback artifacts in .state/ as a safety net

After confirming the migration is successful, remove old Bitnami StatefulSets, PVCs, services, and the migration backup PVC:

bash 5-cleanup-bitnami.sh

What happens:

The script requires Phase 4 to be completed and displays a destructive operation warning with a confirmation prompt.
Deletes old Bitnami PostgreSQL StatefulSets, their PVCs, and headless services (for each migrated component: Identity, Keycloak, and Web Modeler).
Deletes old Bitnami Elasticsearch StatefulSet, PVCs, and services.
Deletes old Bitnami Keycloak StatefulSet.
Deletes the migration backup PVC.
Reverifies that all Camunda components and operator-managed targets remain healthy after cleanup.
Suggests removing the reindex.remote.whitelist setting from the ECK Elasticsearch configuration as a post-cleanup step.

The script checks whether each resource exists before attempting deletion, so it can be safely rerun if interrupted.


Measure with `--estimate`

You can measure the actual cutover duration on your environment without causing any downtime:

# After completing Phases 1 and 2:
bash 3-cutover.sh --estimate

This runs the real PG backup/restore and ES reindex operations against the target infrastructure but skips freezing the application and the Helm upgrade. The application remains fully operational throughout.

Use this to:

Measure real timing with your actual data volumes before scheduling a maintenance window.
Validate ES reindex throughput on your cluster hardware (storage, network, CPU).
Compare standard vs. warm reindex — run Phase 2 once without ES_WARM_REINDEX, estimate with bash 3-cutover.sh --estimate, then enable ES_WARM_REINDEX=true, rerun Phase 2, and estimate again.

The estimate does not mark Phase 3 as complete, so you can run the real cutover afterwards with bash 3-cutover.sh.

The estimate restores data to the target backends (CNPG, ECK, or managed services). This is harmless — the real cutover overwrites with the final consistent backup taken after freezing the application.
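If the environment where you ran the estimate holds less Elasticsearch data than production, you can scale the measured duration by data volume for a rough projection. This is only a sketch that assumes downtime scales linearly with data volume. extrapolate_seconds is a hypothetical helper, and the numbers in the example are illustrative, not measurements.

```shell
# Hypothetical helper: scale a measured duration linearly by data volume.
# Arguments: <measured-seconds> <measured-gb> <production-gb>
# Uses integer arithmetic, so the result is rounded down.
extrapolate_seconds() {
  echo $(( $1 * $3 / $2 ))
}

# Illustrative numbers only: an estimate run that took 600 s on 20 GB
# projects to 1500 s (25 minutes) for 50 GB.
extrapolate_seconds 600 20 50
```

Treat the result as a lower bound for planning. Network latency to the managed service and the fixed cost of the Helm upgrade are not covered by this projection.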

Rollback

If the migration fails or produces unexpected results, you can roll back to the pre-cutover state:

bash rollback.sh

This restores the previous Helm values (re-enabling Bitnami subcharts) and restarts Camunda on the original infrastructure.

Rollback is available after Phase 3 (cutover) and before Phase 5 (cleanup). Before Phase 3, simply stop the migration; your Bitnami infrastructure is still active and untouched.

Operational readiness

Before running this migration in production, use the checklist below to reduce risk, especially where network policy and external service access add complexity.

If you are using the migration scripts, also consult the downtime estimation, migration hooks, and troubleshooting sections in the operator-based guide — they apply equally to external targets.

Staging rehearsal

Provision staging managed services that mirror your production setup—same cloud provider, same region, and same tier/SKU.
Run the full migration end to end in staging, including all five phases: deploy, backup, cutover, validate, and cleanup.
Measure actual timings: record how long each phase takes. Network latency to external services, such as RDS, Cloud SQL, and Elasticsearch, may increase backup and restore times compared to in-cluster operators.
Test rollback: after a successful staging migration, run bash rollback.sh to verify the Helm values revert correctly and Camunda reconnects to the Bitnami subcharts.

Tip

When staging with managed services, use the same authentication method (IAM, managed identity, workload identity) that you plan to use in production. Password-based staging does not catch permission issues.

Production dry-run

The migration scripts support a --dry-run flag that shows what would be executed without making changes:

bash 1-deploy-targets.sh --dry-run
bash 2-backup.sh --dry-run
bash 3-cutover.sh --dry-run

Pay special attention to:

External endpoints and port configurations.
Kubernetes Secret names and keys referenced in the Helm values.
Network connectivity from the cluster to the managed services (security groups, private endpoints, firewall rules).

Pre-migration checklist

Before starting the migration in production:

Verify managed service connectivity: from within the cluster, confirm you can connect to each managed service endpoint using kubectl run with a temporary client pod.
Notify stakeholders: announce the maintenance window at least 48 hours in advance. Include expected start time, duration (measured in staging), and impact on end users.
Verify independent backups: confirm a recent backup exists via both your cluster backup tool (Velero, snapshots) and the cloud provider's managed service backup (RDS snapshots, automated backups).
Check IAM permissions: ensure the Kubernetes service account has the correct role bindings for the managed services (IRSA for AWS, Workload Identity for GCP, Managed Identity for Azure).
Monitor readiness: have dashboards open for cluster health, managed service metrics (CPU, connections, storage), and pod status.
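The connectivity check from the list above can be sketched with temporary pods started by kubectl run. The helper names, pod names, and container images (postgres:16 and curlimages/curl) are assumptions chosen for illustration; any image that ships pg_isready or curl works. The KUBECTL variable only exists so that the command can be replaced when testing.

```shell
# Hypothetical helpers (not part of the migration scripts).
KUBECTL="${KUBECTL:-kubectl}"

# Probe PostgreSQL with pg_isready from a throwaway pod.
# pg_isready checks that the server accepts connections; it does not log in.
probe_pg() {  # usage: probe_pg <host> <port>
  "$KUBECTL" run pg-probe --rm -i --restart=Never \
    -n "${NAMESPACE}" --image=postgres:16 \
    --command -- pg_isready -h "$1" -p "$2"
}

# Probe Elasticsearch over HTTPS and print the HTTP status code.
# A 401 without credentials still shows that the endpoint is reachable.
probe_es() {  # usage: probe_es <host> <port>
  "$KUBECTL" run es-probe --rm -i --restart=Never \
    -n "${NAMESPACE}" --image=curlimages/curl \
    --command -- curl -sS -o /dev/null -w '%{http_code}\n' "https://$1:$2"
}

# Usage, after sourcing env.sh:
#   probe_pg "$EXTERNAL_PG_IDENTITY_HOST" "$EXTERNAL_PG_IDENTITY_PORT"
#   probe_es "$EXTERNAL_ES_HOST" "$EXTERNAL_ES_PORT"
```

Running the probes from inside the namespace exercises the same network policies, security groups, and private endpoints that the Camunda pods will use.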

Failback procedure

Immediate failback (Bitnami PVCs still exist): run bash rollback.sh to revert the Helm values.
Late failback (Bitnami PVCs deleted): restore from the backup taken during Phase 2. If your managed service has point-in-time recovery (PITR), you can also restore from a managed service snapshot, but note that Camunda would need to be reconfigured to point back to the Bitnami infrastructure.

Warning

Once you delete the old Bitnami PVCs (during post-migration cleanup), rollback is no longer trivial. Keep the old resources until your team has observed the system under production load through at least one full business cycle (for example, a complete weekday with peak traffic). Only proceed with cleanup once you are confident the new infrastructure is stable.

Data safety measures

All pg_dump backups are stored on a dedicated PVC that persists independently.
Managed services typically offer their own automated backups (RDS snapshots, Cloud SQL backups). Verify these are enabled and have adequate retention.
The migration scripts are idempotent and can be rerun safely.
No Bitnami resources are deleted during migration. They must be explicitly removed afterward.

Post-migration monitoring

After completing the migration, monitor the following for at least 48 hours:

Pod restarts: kubectl get pods -n ${NAMESPACE} --watch
Managed service metrics: check connection counts, latency, CPU, and storage usage in your cloud provider console.
Camunda component logs: look for connection timeouts, SSL/TLS handshake errors, or authentication failures.
Process instance completion: verify that in-flight process instances continue to execute correctly.
Zeebe export lag: confirm that Zeebe exporters are writing to the external Elasticsearch target without delays.
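When scanning component logs for the error types listed above, a simple filter can reduce the noise. The following is a sketch only: filter_backend_errors is a hypothetical helper, the pattern list is an assumption about typical log wording and not an exhaustive list of Camunda error messages, and the deployment name in the usage comment is a placeholder.

```shell
# Hypothetical filter: keep only log lines that hint at backend
# connectivity problems (timeouts, TLS handshakes, authentication).
filter_backend_errors() {
  grep -Ei 'timed? ?out|handshake|ssl|tls|authentication failed|unauthorized|connection refused'
}

# Usage (the deployment name is a placeholder):
#   kubectl logs deploy/<component> -n "${NAMESPACE}" --since=1h | filter_backend_errors
```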
