(DEPRECATED) Apache Flink User Mailing List archive.

Flink CheckPoint/Savepoint Behavior Question

Jason Liu

Flink CheckPoint/Savepoint Behavior Question

We currently have some logic that loads data from S3 into memory in our Flink/Kinesis Analytics app. This happens before RichFunction.open() is called.

We have two questions and could not find much information on the apache.org website:

1. (More of a clarification) When Flink takes a checkpoint/savepoint, only the state and the stream positions are saved, right? The memory content won't be saved and restored.
2. When Flink restores from a checkpoint/savepoint, does it still go through the application initialization phase? Basically, will the code before RichFunction.open() be run? If not, will the operators' open() functions run when Flink restores from a checkpoint/savepoint?

Thanks,

Jason

raghav280392

Re: Flink CheckPoint/Savepoint Behavior Question

Flink is aware of all the tasks running in the cluster. If any task fails, it is restored from the latest checkpoint (only if the task uses Flink operator state). This scenario does not use savepoints. Savepoints are much like checkpoints; the difference is that savepoints are created manually, or when we manually cancel/stop a job. We can then start the same job again by pointing it to the savepoint. If we start a job without a savepoint, the job starts with empty operator state.
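
To make the last point concrete, here is a small toy model in plain Python. It is only a sketch and does not use any Flink API: the class ToyCounterJob and its methods are hypothetical names invented for illustration. It shows a job resumed from a manually taken snapshot picking up the saved state, while a job started without one begins with empty state.

```python
from typing import Optional


class ToyCounterJob:
    """Hypothetical stand-in for a stateful job; not part of Flink."""

    def __init__(self, restored_state: Optional[dict] = None):
        # Starting without a snapshot means starting with empty operator state.
        self.state = dict(restored_state) if restored_state else {"count": 0}

    def process(self, n_events: int) -> None:
        self.state["count"] += n_events

    def snapshot(self) -> dict:
        # Copy, so later processing does not mutate the snapshot.
        return dict(self.state)


job = ToyCounterJob()
job.process(5)
savepoint = job.snapshot()  # taken manually, e.g. before stopping the job
job.process(2)              # progress made after the snapshot is not in it

resumed = ToyCounterJob(restored_state=savepoint)  # started from the savepoint
fresh = ToyCounterJob()                            # started without one

print(resumed.state["count"], fresh.state["count"])  # prints "5 0"
```

The two events processed after the snapshot are absent from the resumed job, which is the behaviour one would expect from any snapshot-based restore.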

Correct me if I am wrong.

Other references:

https://stackoverflow.com/questions/62935269/apache-flink-how-checkpoint-savepoint-works-if-we-run-duplicate-jobs-multi-te

https://stackoverflow.com/questions/64605940/apache-flink-fsstatebackend-how-state-is-recovered-in-case-of-task-manager-f

https://stackoverflow.com/questions/55613112/is-it-possible-to-recover-after-losing-the-checkpoint-coordinator/55615858#55615858

https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html#retained-checkpoints

Thank you

Raghavendar T S

www.teknosrc.com

Arvid Heise-4

Re: Flink CheckPoint/Savepoint Behavior Question

Hi Jason,

You got it perfectly right. Everything that is not in explicit state (or checkpointed in CheckpointedFunction#snapshotState) is lost on recovery. However, Flink applications always go through the complete life-cycle, which includes open().

Note that I'd look into CheckpointedFunction if the side-information that you fetch from S3 is not changing and rather small.
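
As a rough illustration of the two answers above, the following plain-Python toy model mimics the life-cycle of a rich function. It is a sketch under stated assumptions, not Flink code: ToyEnrichFunction, fake_s3_load, snapshot_state and restore_state are hypothetical names, and the S3 fetch is replaced by a local function. It shows that plain in-memory data is gone after a restart and is rebuilt in open(), while explicitly snapshotted state is restored.

```python
class ToyEnrichFunction:
    """Hypothetical model of a rich function's life-cycle; not Flink API."""

    def __init__(self, load_side_data):
        self._load_side_data = load_side_data  # stands in for the S3 fetch
        self.cache = None    # plain memory: never part of a snapshot
        self.seen = 0        # explicit state: included in snapshots
        self.open_calls = 0

    def open(self):
        # Runs on every (re)start, so the in-memory data is rebuilt here.
        self.open_calls += 1
        self.cache = self._load_side_data()

    def snapshot_state(self):
        return {"seen": self.seen}

    def restore_state(self, snapshot):
        self.seen = snapshot["seen"]

    def map(self, key):
        self.seen += 1
        return self.cache.get(key, "unknown")


def fake_s3_load():
    # Assumption: small, unchanging side information.
    return {"a": "alpha", "b": "beta"}


# First run: open, process two records, take a checkpoint.
fn = ToyEnrichFunction(fake_s3_load)
fn.open()
fn.map("a")
fn.map("b")
checkpoint = fn.snapshot_state()

# Recovery: a new instance is created, so its memory starts out empty.
recovered = ToyEnrichFunction(fake_s3_load)
cache_before_open = recovered.cache   # None: memory content was not saved
recovered.restore_state(checkpoint)   # explicit state comes back
recovered.open()                      # life-cycle runs again, cache reloaded
result = recovered.map("a")

print(cache_before_open, recovered.seen, result)  # prints "None 3 alpha"
```

In the toy, state is restored before open() is called, so open() can rely on the restored counter; the cache itself is simply reloaded rather than checkpointed.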

Best,

Arvid
