Continue On Conditions

The continue_on configuration allows workflows to continue execution even when steps encounter failures, specific exit codes, or produce certain outputs. This powerful feature enables resilient workflows that can handle errors gracefully and implement sophisticated control flow patterns.

Overview

By default, Boltbase stops workflow execution when a step fails (returns a non-zero exit code). The continue_on configuration overrides this behavior, allowing you to:

Continue execution after failures
Handle specific exit codes differently
React to command output patterns
Mark steps as successful despite failures
Build fault-tolerant workflows

Syntax

The continue_on field supports two syntaxes:

Shorthand Syntax

For simple cases, use a string value:

yaml

steps:
  - command: rm -rf /tmp/cache/*
    continue_on: failed    # Continue if step fails

  - command: echo "Optional"
    continue_on: skipped   # Continue if step is skipped

Object Syntax

For advanced configuration with multiple options:

yaml

steps:
  - command: echo "Complex case"
    continue_on:
      failure: true
      exit_code: [0, 1, 2]
      output: ["WARNING", "re:^INFO:.*"]
      mark_success: true

Configuration Fields

The continue_on configuration supports the following fields:

Field	Type	Description	Default
`failure`	boolean	Continue execution when the step fails	`false`
`skipped`	boolean	Continue execution when the step is skipped	`false`
`exit_code`	array	Continue execution for specific exit codes	`[]`
`output`	array	Continue execution when output matches patterns	`[]`
`mark_success`	boolean	Mark the step as successful when conditions are met	`false`

Field Details

`failure`

When set to true, the workflow continues even if the step fails with any non-zero exit code.

yaml

steps:
  # Shorthand syntax
  - name: optional-cleanup
    command: rm -rf /tmp/cache/*
    continue_on: failed

  # Object syntax (equivalent)
  - name: optional-cleanup
    command: rm -rf /tmp/cache/*
    continue_on:
      failure: true

`skipped`

When set to true, the workflow continues when a step is skipped due to unmet preconditions.

yaml

steps:
  # Shorthand syntax
  - name: conditional-task
    command: echo "Processing"
    preconditions:
      - condition: "${ENABLE_FEATURE}"
        expected: "true"
    continue_on: skipped

  # Object syntax (equivalent)
  - name: conditional-task
    command: echo "Processing"
    preconditions:
      - condition: "${ENABLE_FEATURE}"
        expected: "true"
    continue_on:
      skipped: true

`exit_code`

An array of specific exit codes that should not stop the workflow. This is useful when dealing with commands that use non-zero exit codes for non-error conditions.

yaml

steps:
  - name: check-service
    command: echo "Health check OK"
    continue_on:
      exit_code: [0, 1, 2]  # 0=healthy, 1=warning, 2=maintenance

`output`

An array of patterns to match against the command's stdout output. Supports both literal strings and regular expressions (prefixed with re:).

yaml

steps:
  - name: validate-data
    command: echo "Validating"
    continue_on:
      output:
        - "WARNING"                    # Literal string match (substring)
        - "SKIP"                       # Another literal string
        - "re:^INFO:.*"                # Regex: lines starting with "INFO:"
        - "re:WARN-[0-9]+"             # Regex: WARN- followed by numbers

Pattern Matching Rules:

Literal patterns: Matched as substrings (e.g., "WARNING" matches "WARNING: Low memory")
Regex patterns: Must start with re: prefix (e.g., re:^ERROR.*)
Patterns are matched against each line of stdout only (stderr is not checked)
Matching is case-sensitive

`mark_success`

When set to true, the step is marked as successful if any of the continue conditions are met, even if it would normally be considered a failure.

yaml

steps:
  - name: best-effort-optimization
    command: echo "Optimizing"
    continue_on:
      failure: true
      mark_success: true  # Step shows as successful in UI/logs

Common Patterns

Optional Steps

For steps that are nice-to-have but not critical:

yaml

steps:
  - name: cache-warmup
    command: echo "Warming cache"
    continue_on: failed

  - name: main-process
    command: echo "Processing"

Handling Known Exit Codes

When working with tools that use exit codes for non-error states:

yaml

steps:
  - name: git-diff
    command: git diff --exit-code
    continue_on:
      exit_code: [0, 1]  # 0=no changes, 1=changes exist
      
  - name: process-changes
    command: echo "Handling changes"

Warning Detection

Continue execution but handle warnings differently:

yaml

steps:
  - name: lint-code
    command: eslint .
    continue_on:
      output: ["WARNING", "re:.*warning.*"]
      exit_code: [0, 1]  # 0=no issues, 1=warnings only
      
  - name: strict-lint
    command: eslint . --max-warnings 0
    continue_on:
      failure: false  # This one must pass

Graceful Degradation

Build workflows that degrade gracefully:

yaml

steps:
  - name: try-optimal-method
    command: echo "Processing with optimal settings"
    continue_on: failed

  - name: fallback-method
    command: echo "Processing with fallback settings"
    preconditions:
      - condition: "${TRY_OPTIMAL_METHOD_EXIT_CODE}"
        expected: "re:[1-9][0-9]*"  # Only run if previous failed

Complex Output Matching

React to specific output patterns:

yaml

steps:
  - name: deployment-check
    command: kubectl rollout status deployment/app
    continue_on:
      output:
        - "re:Waiting for.*replicas"
        - "re:deployment.*not found"
        - "Unable to connect"
      exit_code: [1]
      
  - name: handle-deployment-issue
    command: echo "Fixing deployment"

Interaction with Other Features

With Retry Policies

continue_on is evaluated after all retries are exhausted:

yaml

steps:
  - name: flaky-service
    command: echo "Calling service"
    retry_policy:
      limit: 3
      interval_sec: 5
    continue_on:
      exit_code: [503]  # Continue if still 503 after retries

With Lifecycle Handlers

When a step with continue_on fails but the DAG continues, the final status is partially_succeeded, which triggers the onSuccess handler (not onFailure):

yaml

handler_on:
  success:
    command: echo "DAG completed (status: ${DAG_RUN_STATUS})"  # partially_succeeded

steps:
  - name: optional-step
    command: exit 1
    continue_on: failed  # Continues, DAG ends as partially_succeeded

With Dependencies

Dependent steps see the actual status unless mark_success is used:

yaml

type: graph
steps:
  - name: step-a
    command: exit 1
    continue_on:
      failure: true
      mark_success: false  # Default
      
  - name: step-b
    command: echo "Step A status: failed"
    depends: [step-a]  # Runs because of continue_on
    
  - name: step-c
    command: exit 1
    continue_on:
      failure: true
      mark_success: true  # Override status
      
  - name: step-d
    command: echo "Step C status: success"
    depends: [step-c]  # Sees step-c as successful

Examples

Database Migration with Warnings

yaml

steps:
  - name: run-migration
    command: echo "Running migration"
    continue_on:
      output:
        - "re:WARNING:.*already exists"
        - "re:NOTICE:.*will be created"
      exit_code: [0, 1]  # 1 might indicate warnings
      
  - name: verify-migration
    command: echo "Verifying database"

Multi-Cloud Deployment

yaml

steps:
  - name: deploy-aws
    command: echo "Deploying to AWS"
    continue_on: failed  # Continue even if AWS fails

  - name: deploy-gcp
    command: echo "Deploying to GCP"
    continue_on: failed  # Continue even if GCP fails

  - name: verify-deployment
    command: echo "Verifying cloud deployment"
    # No continue_on - at least one cloud must be working

Service Health Check

yaml

steps:
  - name: check-primary
    command: curl -f https://primary.example.com/health
    continue_on:
      exit_code: [0, 22, 7]  # 22=HTTP error, 7=connection failed
      
  - name: check-secondary
    command: curl -f https://secondary.example.com/health
    preconditions:
      - condition: "${CHECK_PRIMARY_EXIT_CODE}"
        expected: "re:[1-9][0-9]*"  # Only if primary failed

Continue On Conditions ​

Overview ​

Syntax ​

Shorthand Syntax ​

Object Syntax ​

Configuration Fields ​

Field Details ​

failure ​

skipped ​

exit_code ​

output ​

mark_success ​

Common Patterns ​

Optional Steps ​

Handling Known Exit Codes ​

Warning Detection ​

Graceful Degradation ​

Complex Output Matching ​

Interaction with Other Features ​

With Retry Policies ​

With Lifecycle Handlers ​

With Dependencies ​

Examples ​

Database Migration with Warnings ​

Multi-Cloud Deployment ​

Service Health Check ​

Continue On Conditions

Overview

Syntax

Shorthand Syntax

Object Syntax

Configuration Fields

Field Details

`failure`

`skipped`

`exit_code`

`output`

`mark_success`

Common Patterns

Optional Steps

Handling Known Exit Codes

Warning Detection

Graceful Degradation

Complex Output Matching

Interaction with Other Features

With Retry Policies

With Lifecycle Handlers

With Dependencies

Examples

Database Migration with Warnings

Multi-Cloud Deployment

Service Health Check