You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fixes the behavior where `fail_if_no_checkpoint` would also control if
the pipeline fails in case of network failures.
Now, the pipeline always fails in case of network failures.
Signed-off-by: Abhinav Gyawali <22275402+abhizer@users.noreply.github.com>
// On our final attempt to pull the checkpoint after activation, if we fail, we should error out and not activate with a potentially stale or missing checkpoint.
|`endpoint`|`string`|| The S3-compatible object store endpoint (e.g., `http://localhost:9000` for MinIO). |
43
43
|`bucket`\*|`string`|| The bucket name and optional prefix to store checkpoints (e.g., `mybucket/checkpoints`). |
44
44
|`region`|`string`|`us-east-1`| The region of the bucket. Leave empty for MinIO. If `provider` is AWS, and no region is specified, `us-east-1` is used. |
45
45
|`provider`\*|`string`|| The S3 provider identifier. Must match [rclone’s list](https://rclone.org/s3/#providers). Case-sensitive. Use `"Other"` if unsure. |
46
46
|`access_key`|`string`|| S3 access key. Not required if using environment-based auth (e.g., IRSA). |
47
47
|`secret_key`|`string`|| S3 secret key. Not required if using environment-based auth. |
48
48
|`start_from_checkpoint`|`string`|| Checkpoint UUID to resume from, or `latest` to restore from the latest checkpoint. |
49
-
|`fail_if_no_checkpoint`|`boolean`|`false`| When `true` the pipeline will fail to initialize if fetching the specified checkpoint fails. <p> When `false`, the pipeline will start from scratch instead. Ignored if `start_from_checkpoint` is not set. </p>|
49
+
|`fail_if_no_checkpoint`|`boolean`|`false`|Only applies when `start_from_checkpoint` is set to `latest`. <p> When `true`, the pipeline fails to start if no checkpoint exists in remote storage. When `false`, the pipeline starts from scratch instead. </p> |
50
50
|`standby`|`boolean`|`false`| When `true`, the pipeline starts in **standby** mode. <p> To start processing the data the pipeline must be activated (`POST /activate`). </p> <p> If a previously activated pipeline is restarted without clearing storage, it auto-activates. </p> `start_from_checkpoint` must be set to use standby mode. |
51
51
|`pull_interval`|`integer(u64)`|`10`| Interval (in seconds) between fetch attempts for the latest checkpoint while standby. |
52
-
|`push_interval`|`integer(u64)`|| Interval (in seconds) between [automatic sync](/pipelines/checkpoint-sync#automatic-checkpoint-synchronization) of a local checkpoint to object store measured from the completion of the previous sync attempt. Disabled by default. |
52
+
|`push_interval`|`integer(u64)`|| Interval (in seconds) between [automatic sync](/pipelines/checkpoint-sync#automatic-checkpoint-synchronization) of a local checkpoint to object store measured from the completion of the previous sync attempt. Disabled by default. |
53
53
|`transfers`|`integer (u8)`|`20`| Number of concurrent file transfers. |
54
54
|`checkers`|`integer (u8)`|`20`| Number of parallel checkers for verification. |
55
55
|`ignore_checksum`|`boolean`|`false`| Skip checksum verification after transfer and only check the file size. Might improve throughput. |
56
56
|`multi_thread_streams`|`integer (u8)`|`10`| Number of streams for multi-threaded downloads. |
|`upload_concurrency`|`integer (u8)`|`10`| Number of concurrent chunks to upload during multipart uploads. |
59
59
|`flags`|`array[string]`|| Extra flags to pass to `rclone`.<p> ⚠️ Incorrect or conflicting flags may break behavior. See [rclone flags](https://rclone.org/flags/) and [S3 flags](https://rclone.org/s3/). </p> |
60
-
|`retention_min_count`|`integer (u32)`|`10`| The minimum number of checkpoints to retain in object store. No checkpoints will be deleted if the total count is below this threshold. |
61
-
|`retention_min_age`|`integer (u32)`|`30`| The minimum age (in days) a checkpoint must reach before it becomes eligible for deletion. All younger checkpoints will be preserved. |
60
+
|`retention_min_count`|`integer (u32)`|`10`| The minimum number of checkpoints to retain in object store. No checkpoints will be deleted if the total count is below this threshold. |
61
+
|`retention_min_age`|`integer (u32)`|`30`| The minimum age (in days) a checkpoint must reach before it becomes eligible for deletion. All younger checkpoints will be preserved. |
0 commit comments