Hello,
Question: Is it possible to update the checkpoint and/or savepoint timeout of a running job without restarting it? If not, is this something that would be a welcomed contribution (not sure how easy this would be)?
Context: sometimes we have jobs who are making progress but get into a state where checkpoints are timing out, though we believe they would be successful if we could increase the checkpoint timeout. Unfortunately we currently need to restart the job to change this, and we would like to avoid this if possible. Ideally we could make this change temporarily, allow a checkpoint or savepoint to succeed, and then change the settings back.
Best,
Aaron Levin