Version: Next

State Storage and Recovery

Overview

SeaTunnel Engine (Zeta) persists several categories of state data during job execution. Understanding what each category stores, where it lives, and how to manage it is critical for running production-grade CDC or long-running streaming jobs.

Storage Category	Purpose	Default Location
Checkpoint	Fault-tolerant snapshots of pipeline operator state	`seatunnel.yaml` `checkpoint.storage`
Savepoint	User-triggered named checkpoint for planned stop/restart	Same path as checkpoint, different directory
IMap / MapStore	Distributed in-memory state (job metadata, running job data, metrics)	`hazelcast.yaml` MapStore base-dir
WAL (Write-Ahead Log)	Durability log for IMap persistence	Same MapStore base-dir
History Job Metadata	Completed/failed job records	MapStore base-dir

1. Checkpoint Storage

What is stored

A checkpoint captures a consistent snapshot of all pipeline operator states at a given point in time. For a CDC job this includes:

Binlog / WAL offset (MySQL binlog position, PostgreSQL LSN, Oracle SCN)
Split-level progress for parallel readers
2PC sink transaction state (Doris, StarRocks, Kafka transaction IDs)
SeaTunnelRow buffers in-flight through transforms

Storage path layout

<namespace>/                          # configured namespace, default /seatunnel/checkpoint/
  <job-id>/
    <pipeline-id>/
      <checkpoint-id>/
        <task-location>/state-data

Configuration reference

seatunnel:
  engine:
    checkpoint:
      interval: 10000              # milliseconds between checkpoints
      timeout: 60000               # checkpoint completion timeout (ms)
      max-concurrent: 1            # max concurrent in-flight checkpoints
      tolerable-failure: 2         # allowed consecutive checkpoint failures
      storage:
        type: hdfs                 # hdfs | localfile (deprecated)
        plugin-config:
          namespace: /seatunnel/checkpoint/    # must end with /
          # For S3:
          # fs.s3a.endpoint: https://s3.amazonaws.com
          # fs.s3a.access.key: <your-access-key>
          # fs.s3a.secret.key: <your-secret-key>

Checkpoint retention and cleanup

Checkpoints are not automatically deleted by history-job-expire-minutes. They must be cleaned up manually or via a configurable retention policy.
Only the latest N checkpoints are retained per pipeline (controlled by Hazelcast in-memory references). Old checkpoint directories may remain on disk if the job was killed unexpectedly.
Safe cleanup rule: A checkpoint directory for a job ID can be deleted only after the job has been cancelled and you have confirmed that you do not intend to restore it.

2. Savepoint

Savepoint vs Checkpoint

Aspect	Checkpoint	Savepoint
Trigger	Periodic / automatic	Manual (`seatunnel.sh -r <jobId> --savepoint`)
Purpose	Fault tolerance	Planned stop, upgrade, migration
Lifecycle	Managed by engine	Managed by operator
Retention	Auto-rotated	Kept until manually deleted

Triggering a savepoint

# Stop a running job and create a savepoint
$SEATUNNEL_HOME/bin/seatunnel.sh --stop-job <job-id> --savepoint

# Or via REST API v2
curl -X POST http://<master>:8080/stop-job \
  -H 'Content-Type: application/json' \
  -d '{
    "jobId": <job-id>,
    "isStopWithSavePoint": true,
    "force": false
  }'

Restoring from a savepoint

# Submit with --restore to resume from the latest savepoint
$SEATUNNEL_HOME/bin/seatunnel.sh --config job.conf --restore <savepoint-path>

Savepoint path layout

<namespace>/
  savepoint/
    <job-id>/
      <savepoint-timestamp>/
        <pipeline-id>/
          <task-location>/state-data

Safe cleanup

A savepoint can be deleted only if you are certain you will never restore that job from it. Deleting an active savepoint mid-restore causes the job to fail with a "state not found" error.

3. IMap and MapStore (Hazelcast Distributed State)

What IMap stores

SeaTunnel Engine uses Hazelcast IMap as its distributed in-memory key-value store. The following logical maps are persisted:

IMap Name	Content
`running-job-state`	Current state machine status of each running job
`running-job-metrics`	Real-time throughput, latency, and record count metrics
`running-pipeline-state`	Pipeline-level state for each logical pipeline
`finished-job-state`	Terminal state for completed, cancelled, or failed jobs
`finished-job-metrics`	Final metrics snapshot after job termination

MapStore (disk-backed persistence)

Hazelcast MapStore writes IMap entries to local disk so they survive process restarts. This is separate from checkpoint storage.

Default storage path: configured in hazelcast.yaml:

map:
  seatunnel:
    map-store:
      enabled: true
      initial-load-mode: EAGER
      properties:
        hazelcast.fs.base-dir: /tmp/seatunnel/imap   # absolute path
        hazelcast.fs.write-behind-delay-seconds: 1

MapStore directory layout:

<hazelcast.fs.base-dir>/
  maps/
    running-job-state/
    running-job-metrics/
    finished-job-state/
    finished-job-metrics/

Relationship between IMap, MapStore, and Checkpoint

Checkpoint storage  <──────────────────────────────────────────>  Operator state (offsets, splits)
IMap / MapStore     <──────────────────────────────────────────>  Job/pipeline lifecycle state

They are independent. Deleting checkpoint storage does not affect IMap, and vice versa. A job can be restarted from a checkpoint even if the IMap MapStore is wiped, but the job ID and pipeline mapping must be re-submitted because running-job-state is lost.

4. Write-Ahead Log (WAL)

Hazelcast uses a write-ahead log for MapStore durability. WAL files accumulate under:

<hazelcast.fs.base-dir>/
  wal/
    <imap-name>-<partition>.wal

WAL growth in long-running CDC jobs

Each CDC binlog event that updates running-job-metrics or running-pipeline-state generates a WAL write. Over time (days or weeks) WAL files can grow into several GB if:

write-behind-delay-seconds is set very low (high flush frequency)
The job processes millions of events per second

Mitigation:

hazelcast.fs.write-behind-delay-seconds: 5   # increase flush interval
hazelcast.fs.compaction-threshold: 1000       # trigger compaction after N entries

WAL files for finished jobs can be safely compacted or removed after the corresponding IMap entries have been flushed to MapStore files. Do not delete WAL files for running jobs.

5. History Job Expiration

What `history-job-expire-minutes` does and does NOT do

seatunnel:
  engine:
    history-job-expire-minutes: 1440   # 24 hours

Action	Covered by expire?
Remove from `finished-job-state` IMap	Yes
Remove from `finished-job-metrics` IMap	Yes
Delete MapStore persistence files for that job	Yes (after IMap eviction)
Delete checkpoint storage directories	No
Delete savepoint directories	No
Delete WAL files	No (only indirectly via compaction)

Key takeaway: history-job-expire-minutes only cleans up the job metadata stored in finished-job-state. Checkpoint and savepoint directories on HDFS / S3 / local are never touched by this setting. You must manage them separately.

6. Capacity Planning for Long-Running CDC Jobs

Checkpoint size estimation

Source	Typical per-checkpoint size
MySQL CDC (1 table, low volume)	1–10 KB (binlog offset + split state)
MySQL CDC (multi-table, 100 splits)	100 KB – 1 MB
MySQL CDC (full snapshot phase)	10–500 MB (snapshot split states)
PostgreSQL CDC (logical replication)	1–50 KB per table
Oracle CDC (LogMiner)	50 KB – 2 MB

Formula: checkpoint_size ≈ N_tables × avg_splits × state_per_split × max_concurrent_checkpoints

Storage sizing rule of thumb

Retain at least 3 checkpoints per job at all times. Allocate:

storage_needed = checkpoint_size × 3 × safety_factor(1.5)

IMap / WAL sizing

Each running CDC job stores approximately:

50–200 bytes per pipeline in running-job-state
1–2 KB per pipeline per metric flush in running-job-metrics

For a cluster running 100 CDC jobs:

imap_memory ≈ 100 × 200B ≈ 20 KB (state)
imap_memory ≈ 100 × 1KB × flush_rate ≈ manageable

WAL disk: plan 2–5 GB per node for a busy CDC cluster. Mount a dedicated disk partition and monitor with:

du -sh $SEATUNNEL_HOME/imap/wal/
watch -n 60 'du -sh /tmp/seatunnel/imap/'

7. Troubleshooting

Checkpoint directory growing unboundedly

Symptom: HDFS / S3 usage grows continuously even after jobs complete.

Diagnosis:

# List checkpoint directories by job ID
hadoop fs -ls /seatunnel/checkpoint/
# Or for local storage
ls -lh /tmp/seatunnel/checkpoint/

Root cause: SeaTunnel does not have a built-in TTL on checkpoint directories. Each new checkpoint adds directories; old ones are only removed if the job completes cleanly and its state is rotated out.

Fix:

Query finished job metadata through REST API v2:

curl "http://<master>:8080/finished-jobs/FINISHED?page=1&rows=100"
curl "http://<master>:8080/finished-jobs/CANCELED?page=1&rows=100"
curl "http://<master>:8080/finished-jobs/FAILED?page=1&rows=100"
curl "http://<master>:8080/finished-jobs/SAVEPOINT_DONE?page=1&rows=100"
curl "http://<master>:8080/finished-jobs/UNKNOWABLE?page=1&rows=100"

For job IDs not in the response, their checkpoint directories are orphaned and safe to delete.

IMap / WAL directory growing

Symptom: /tmp/seatunnel/imap/ fills the disk on master/worker nodes.

Diagnosis:

du -sh /tmp/seatunnel/imap/wal/
du -sh /tmp/seatunnel/imap/maps/

Root cause candidates:

running-job-metrics updates at every checkpoint for every running job
write-behind-delay-seconds is too low (default 1 s)
WAL compaction is not happening frequently enough

Fix:

# hazelcast.yaml
hazelcast.fs.write-behind-delay-seconds: 10
hazelcast.fs.compaction-threshold: 500

After restarting the engine, old WAL files from finished jobs will be compacted during startup.

"State not found" error on job restart

Symptom: Job fails immediately on restart with checkpoint state not found or restore pipeline state failed.

Cause: The checkpoint directory was deleted or the storage path was changed.

Fix:

Verify the checkpoint path in seatunnel.yaml matches the actual storage location.
If the checkpoint is gone, submit the job fresh (no --restore flag). For CDC jobs, decide whether to restart from startup.mode=initial (re-snapshot) or startup.mode=latest (skip missed data).

Safe cleanup checklist

Before deleting any state directory:

Confirm the job is in FINISHED, CANCELLED, or FAILED terminal state
Confirm you will not restore from checkpoint or savepoint
Confirm the job ID is not referenced in any monitoring or alerting rule
For checkpoint storage: delete <namespace>/<job-id>/ recursively
For savepoint: delete <namespace>/savepoint/<job-id>/ recursively
For MapStore/WAL: restart the engine after manual deletion to trigger reload

State Storage and Recovery

Overview​

1. Checkpoint Storage​

What is stored​

Storage path layout​

Configuration reference​

Checkpoint retention and cleanup​

2. Savepoint​

Savepoint vs Checkpoint​

Triggering a savepoint​

Restoring from a savepoint​

Savepoint path layout​

Safe cleanup​

3. IMap and MapStore (Hazelcast Distributed State)​

What IMap stores​

MapStore (disk-backed persistence)​

Relationship between IMap, MapStore, and Checkpoint​

4. Write-Ahead Log (WAL)​

WAL growth in long-running CDC jobs​

5. History Job Expiration​

What history-job-expire-minutes does and does NOT do​

6. Capacity Planning for Long-Running CDC Jobs​

Checkpoint size estimation​

Storage sizing rule of thumb​

IMap / WAL sizing​

7. Troubleshooting​

Checkpoint directory growing unboundedly​

IMap / WAL directory growing​

"State not found" error on job restart​

Safe cleanup checklist​

See Also​

Overview

1. Checkpoint Storage

What is stored

Storage path layout

Configuration reference

Checkpoint retention and cleanup

2. Savepoint

Savepoint vs Checkpoint

Triggering a savepoint

Restoring from a savepoint

Savepoint path layout

Safe cleanup

3. IMap and MapStore (Hazelcast Distributed State)

What IMap stores

MapStore (disk-backed persistence)

Relationship between IMap, MapStore, and Checkpoint

4. Write-Ahead Log (WAL)

WAL growth in long-running CDC jobs

5. History Job Expiration

What `history-job-expire-minutes` does and does NOT do

6. Capacity Planning for Long-Running CDC Jobs

Checkpoint size estimation

Storage sizing rule of thumb

IMap / WAL sizing

7. Troubleshooting

Checkpoint directory growing unboundedly

IMap / WAL directory growing

"State not found" error on job restart

Safe cleanup checklist

See Also