Project Name: Automated 40-Service DB Migrations With Zero Downtime
Industry: Logistics
Technology: Dagger (TypeScript SDK), Liquibase, Oracle Autonomous Database, OCI Object Storage, Backstage (Migration Plugin)

Overview

The client is a mid-to-large logistics and supply chain enterprise running a microservices platform of 40-plus services on OCI Kubernetes Engine, all sharing a single Oracle Autonomous Database as the core data store. The platform supports complex logistics workflows (order management, fleet tracking, fulfilment, and supply chain orchestration) and demands high availability with zero tolerance for planned downtime.

The engineering organisation had scaled to 40-plus independent service teams, each with dependent schema contracts, but database migration governance had not kept pace: every schema change required organisation-wide coordination and a quarterly maintenance window that disrupted live operations.

With no automated migration history, no compatibility verification, and rollbacks that frequently failed mid-execution, every migration cycle was a high-risk event that threatened platform stability.

Key Challenges

Four migration cycles a year, downtime windows of up to six hours, rollbacks that failed mid-execution, and 40-plus service teams to notify by email before a single changeset could run.

  • Quarterly 4 to 6 Hour Maintenance Windows: Schema migrations could only be applied during scheduled downtime windows of 4 to 6 hours each quarter. Each window halted the entire logistics platform, and the schedule limited schema evolution to four cycles per year, so any feature that required a data model change could not ship between windows.
  • Manual Rollbacks That Frequently Failed: When migrations encountered errors mid-execution, rollback procedures were manual, undocumented, and inconsistently executed, regularly failing and leaving the Oracle Autonomous Database in a partially migrated, inconsistent state that required emergency DBA intervention to resolve.
  • No Migration History or Audit Trail: No record existed of which schema changes had been applied, when, by whom, or in what order, making it impossible to reproduce schema state, diagnose regression sources, or satisfy compliance audit requirements that the logistics organisation was subject to.
  • No Compatibility Testing Against 40-Plus Dependent Services: Schema changes were applied without verifying compatibility with the API contracts and query patterns of the 40-plus dependent microservices; breaking changes were discovered post-migration in production, causing cascading service failures across the logistics platform.
  • Manual Coordination Across 40-Plus Service Owners: Every migration cycle required manual notification, review, and sign-off from all 40-plus service team leads: a coordination overhead consuming days of engineering management time per quarter before a single changeset could execute.
  • No Shadow-Mode Testing Before Production: No mechanism existed to validate migration behaviour on a representative database replica before applying to production. Every changeset was executed directly against the live Oracle Autonomous Database with no pre-validated safety guarantee and no confidence in the outcome.
Our Solution

Ksolves, an AI-first DevOps consulting services company, implemented a zero-downtime schema migration pipeline using Dagger and Liquibase, eliminating maintenance windows. Each migration is protected by a pre-migration backup, validated through compatibility checks and shadow execution, and rolled back automatically on failure. A Backstage plugin provides real-time migration status and rollback visibility.

  • Dagger TypeScript SDK Migration Pipeline: Built a fully automated migration pipeline with the Dagger TypeScript SDK that orchestrates the complete migration lifecycle: ADB backup snapshot, compatibility validation, OCI Resource Manager clone provisioning, shadow-mode Liquibase testing, progressive production rollout, and automatic rollback. A single automated pipeline run replaces every manual DBA step and the maintenance window.
  • Liquibase Schema Migration Engine: Liquibase manages all schema change definitions as versioned, auditable changesets checked into GitLab, providing a complete migration history, controlled execution order, and deterministic rollback scripts for every schema change, eliminating the zero-history state that had made regression diagnosis and compliance audits impossible.
  • Pre-Migration ADB Backup to OCI Object Storage: Before every migration run, the Dagger pipeline automatically triggers an Oracle Autonomous Database backup snapshot stored in OCI Object Storage, ensuring a recoverable baseline exists before any schema change is applied to production, with no manual DBA step required.
  • Shadow-Mode Testing on ADB Clone: OCI Resource Manager provisions an Oracle ADB clone restored from the latest backup; Liquibase applies all pending changesets against the clone first, with automated validation checks. No change is permitted to execute against production until it has passed shadow-mode testing on the clone.
  • Automated Service Compatibility Checks: The Dagger pipeline executes automated compatibility checks against the API contracts and query patterns of all 40-plus dependent microservices before migration proceeds, catching every breaking change before it reaches production and eliminating the post-migration cascading service failures that had characterised every previous migration cycle.
  • Backstage Custom Migration Status Plugin: A custom Backstage plugin surfaces real-time migration status, service compatibility matrix, rollback history per service, and migration trend metrics, replacing manual cross-team coordination emails with self-service visibility for all 40-plus service teams from a single portal.
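
The lifecycle described in the list above can be sketched as an ordered run of stages that stops at the first failure and triggers rollback. This is a minimal illustration in plain TypeScript, not the client's pipeline and not Dagger SDK code: the `Stage` and `RunReport` types, the `runMigration` helper, and the stage names are all hypothetical, and the stages are synchronous stubs where a real pipeline would make asynchronous calls to Dagger, Liquibase, and OCI.

```typescript
// Hypothetical sketch: none of these names come from the Dagger SDK or Liquibase.
type StageResult = { ok: boolean; detail?: string };

interface Stage {
  name: string;
  run: () => StageResult; // stub; a real stage would be asynchronous
}

interface RunReport {
  completed: string[];     // stages that succeeded, in execution order
  failedAt: string | null; // name of the first failing stage, if any
  rolledBack: boolean;
}

// Run stages in order; on the first failure (or thrown error) call the
// rollback handler and stop, so later stages never execute.
function runMigration(stages: Stage[], rollback: () => void): RunReport {
  const completed: string[] = [];
  for (const stage of stages) {
    let result: StageResult;
    try {
      result = stage.run();
    } catch (err) {
      result = { ok: false, detail: String(err) };
    }
    if (!result.ok) {
      rollback();
      return { completed, failedAt: stage.name, rolledBack: true };
    }
    completed.push(stage.name);
  }
  return { completed, failedAt: null, rolledBack: false };
}

// Example run: the shadow test fails, so production rollout is never reached.
const log: string[] = [];
const stages: Stage[] = [
  { name: "backup", run: () => ({ ok: true }) },
  { name: "compatibility-check", run: () => ({ ok: true }) },
  { name: "provision-clone", run: () => ({ ok: true }) },
  { name: "shadow-test", run: () => ({ ok: false, detail: "changeset failed on clone" }) },
  { name: "production-rollout", run: () => ({ ok: true }) },
];
const report = runMigration(stages, () => log.push("restore-from-backup"));
```

In this run `report.failedAt` is `"shadow-test"`, the first three stages are listed in `report.completed`, and the rollback handler is called exactly once. Keeping the stage order in a single array makes the gating explicit: production rollout cannot run unless every earlier stage has returned success.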

Technology Stack

Category            Technology
CI/CD Portability   Dagger (TypeScript SDK)
Database Migration  Liquibase
Database            Oracle Autonomous Database
Object Storage      OCI Object Storage
Developer Portal    Backstage (Migration Plugin)
Impact

From four migration cycles a year with downtime windows of up to six hours to continuous, automated, zero-downtime schema evolution, with every change shadow-tested, compatibility-checked, and automatically recoverable.

  • Quarterly 4 to 6 Hour Maintenance Windows Eliminated: The automated Dagger and Liquibase pipeline delivers continuous, zero-downtime schema migrations on every merge to main. The quarterly maintenance window is eliminated entirely, removing the four-times-per-year constraint that had blocked every feature requiring a data model change.
  • Rollback Reliability Transformed From Frequently Failing to Fully Automatic: Automatic rollback is triggered on any failure at any pipeline stage, reverting to the pre-migration ADB snapshot in OCI Object Storage with zero manual DBA recovery required, replacing a manual rollback process that regularly failed and left the database in a partially migrated state.
  • Migration Compatibility Coverage From Zero to 100% of 40-Plus Services: Automated compatibility checks against all 40-plus service contracts run on every migration before shadow-mode testing begins, with 100% of breaking changes caught pre-production. This replaces a model where breaking changes were discovered post-migration in production and caused cascading service failures.
  • Manual Cross-Team Coordination Eliminated Across 40-Plus Service Owners: The Backstage migration status plugin gives all 40-plus service teams real-time self-service visibility into compatibility status and migration progress, replacing the days of manual notification, review, and sign-off coordination that had consumed engineering management time before every migration cycle.
  • Complete Migration History and Audit Trail Established From Zero: Every Liquibase changeset is version-controlled in GitLab with full execution history, author attribution, timestamps, and rollback status: a complete, auditable migration trail available from day one, replacing a state where compliance audits were impossible and regression diagnosis was guesswork.
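
As an illustration of the compatibility checks described above, the sketch below flags every service whose contract reads a column that a pending changeset removes. The case study does not say how contracts are stored or which kinds of change are checked, so the `ServiceContract` and `SchemaChange` shapes, the `findBreakingChanges` function, and the sample service, column, and changeset names are all assumptions made for the example.

```typescript
// Hypothetical data shapes; the real contract format is not described in the source.
interface ServiceContract {
  service: string;
  reads: string[]; // fully qualified columns the service queries, e.g. "ORDERS.STATUS"
}

interface SchemaChange {
  changesetId: string;
  dropsColumns: string[]; // columns removed (or renamed away) by this changeset
}

interface Incompatibility {
  service: string;
  changesetId: string;
  column: string;
}

// Report every (service, changeset, column) triple where a service still
// reads a column that a pending changeset drops. Column names are compared
// case-insensitively.
function findBreakingChanges(
  changes: SchemaChange[],
  contracts: ServiceContract[],
): Incompatibility[] {
  const issues: Incompatibility[] = [];
  for (const change of changes) {
    const dropped = new Set(change.dropsColumns.map((c) => c.toUpperCase()));
    for (const contract of contracts) {
      for (const column of contract.reads) {
        if (dropped.has(column.toUpperCase())) {
          issues.push({ service: contract.service, changesetId: change.changesetId, column });
        }
      }
    }
  }
  return issues;
}

// Illustrative inputs only.
const contracts: ServiceContract[] = [
  { service: "order-management", reads: ["ORDERS.ID", "ORDERS.STATUS"] },
  { service: "fleet-tracking", reads: ["VEHICLES.ID", "ORDERS.ID"] },
];
const pending: SchemaChange[] = [
  { changesetId: "example-001", dropsColumns: ["orders.status"] },
];
const issues = findBreakingChanges(pending, contracts);
```

Here `issues` contains a single entry, for `order-management` and `ORDERS.STATUS`. A pipeline stage built on a check like this would fail whenever the list is non-empty, and the same list could feed a per-service compatibility matrix in a developer portal.
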
Solution Architecture
[Figure: solution architecture diagram]
Conclusion

Quarterly database migration windows were limiting feature delivery and increasing operational risk in the logistics platform. Ksolves removed this constraint by implementing a Dagger-based CI pipeline that runs schema migrations on every merge with automated backup, compatibility checks, shadow testing, and rollback. All changes are version-controlled, compliance-ready, and validated across all 40-plus services before they reach production, enabling continuous schema evolution aligned with feature delivery.
