checkpoint interval and hdfs file capacity

checkpoint interval and hdfs file capacity

lec ssmi
Hi, if I set the checkpoint interval very small, say 5 seconds, will there be a lot of state files on HDFS? In theory, no matter what interval is set, every checkpoint deletes the old files and writes new ones, right?
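For context, the interval in question is a job-level setting; a minimal sketch of how it might appear in flink-conf.yaml, assuming Flink's documented `execution.checkpointing.interval` key (the value is illustrative):

```yaml
# Trigger a checkpoint every 5 seconds (the "very small" interval asked about).
execution.checkpointing.interval: 5s
```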
Re: checkpoint interval and hdfs file capacity

Congxian Qiu
Hi
    No matter what interval you set, Flink will take care of the checkpoints (it removes obsolete checkpoints when it can). But with a very small checkpoint interval, there can be very high pressure on the storage system (here, RPC pressure on the HDFS NameNode).

Best,
Congxian
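The retention behavior described in this reply is governed by configuration; a sketch, assuming Flink's documented keys (the path is illustrative):

```yaml
# Where completed checkpoints are written on HDFS.
state.checkpoints.dir: hdfs:///flink/checkpoints
# How many completed checkpoints Flink retains; older ones are discarded.
state.checkpoints.num-retained: 1
```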


lec ssmi <[hidden email]> wrote on Tue, Nov 10, 2020 at 1:19 PM:
Hi, if I set the checkpoint interval very small, say 5 seconds, will there be a lot of state files on HDFS? In theory, no matter what interval is set, every checkpoint deletes the old files and writes new ones, right?
Re: checkpoint interval and hdfs file capacity

lec ssmi
Thanks.
   I have some jobs with a checkpoint interval of 1000 ms, and the checkpoint files on HDFS grow so numerous that the jobs can no longer work normally.
What I am curious about is: are writing and deleting performed synchronously? Is it possible that new files are added faster than the old files can be deleted?

Congxian Qiu <[hidden email]> wrote on Tue, Nov 10, 2020 at 2:16 PM:
Hi
    No matter what interval you set, Flink will take care of the checkpoints (it removes obsolete checkpoints when it can). But with a very small checkpoint interval, there can be very high pressure on the storage system (here, RPC pressure on the HDFS NameNode).

Best,
Congxian


Re: checkpoint interval and hdfs file capacity

Congxian Qiu
Hi
    Currently, the checkpoint discard logic is executed in an Executor [1], so old checkpoints may not be deleted that quickly.


Best,
Congxian
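The asynchrony described in this reply can be illustrated with a small sketch in plain Python (not Flink's actual code; the durations and names are invented): a single background worker handles discards while the main loop keeps "writing" checkpoints, so when each delete takes longer than the interval, deletions queue up and old files linger.

```python
import time
from concurrent.futures import ThreadPoolExecutor

DISCARD_SECONDS = 0.2    # assumed time for one checkpoint deletion (slow HDFS RPCs)
INTERVAL_SECONDS = 0.01  # assumed checkpoint interval, much smaller than a delete

def discard_checkpoint(cp_id):
    """Simulate the slow, RPC-heavy delete of one superseded checkpoint."""
    time.sleep(DISCARD_SECONDS)
    return cp_id

# A single background worker handles discards, while the "job" keeps
# writing new checkpoints on the main thread.
executor = ThreadPoolExecutor(max_workers=1)
futures = []
for cp_id in range(5):
    futures.append(executor.submit(discard_checkpoint, cp_id))
    time.sleep(INTERVAL_SECONDS)  # next checkpoint triggers before deletes catch up

# Right after the last "write", most deletions are still queued.
pending = sum(not f.done() for f in futures)
print("pending discards right after the last write:", pending)

# Every discard does complete eventually, just later than the writes.
done = [f.result() for f in futures]
print("discard order:", done)
executor.shutdown()
```

This mirrors the questioner's scenario: the writes never block on the deletes, so a 1000 ms interval can outpace cleanup even though nothing is leaked in the end.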


lec ssmi <[hidden email]> wrote on Tue, Nov 10, 2020 at 2:25 PM:
Thanks.
   I have some jobs with a checkpoint interval of 1000 ms, and the checkpoint files on HDFS grow so numerous that the jobs can no longer work normally.
What I am curious about is: are writing and deleting performed synchronously? Is it possible that new files are added faster than the old files can be deleted?
