(DEPRECATED) Apache Flink User Mailing List archive.

high-availability.storageDir clean up?

Classic

List

Threaded

3 messages Options

Elias Levy

high-availability.storageDir clean up?

I noticed in one of our cluster that they are relatively old submittedJobGraph* and completedCheckpoint* files. I was wondering at what point it is save to clean some of these up.

Fabian Hueske-2

Re: high-availability.storageDir clean up?

Hi Elias,

Till (in CC) is familiar with Flink's HA implementation.

He might be able to answer your question.

Thanks,

Fabian

2018-06-25 23:24 GMT+02:00 Elias Levy <[hidden email]>:

I noticed in one of our cluster that they are relatively old submittedJobGraph* and completedCheckpoint* files. I was wondering at what point it is save to clean some of these up.

Till Rohrmann

Re: high-availability.storageDir clean up?

Hi Elias,

Flink will remove these files if the job reached a globally terminal state (FINISHED, FAILED, CANCELLED). The files should only remain if the cluster crashed. This should give you the opportunity to restart the cluster which can then recover the jobs which have not yet reached a globally terminal state. If you don't intend to recover these jobs, then it should be safe to delete the files.

Cheers,

Till

On Wed, Jun 27, 2018 at 10:14 AM Fabian Hueske <[hidden email]> wrote:

Hi Elias,

Till (in CC) is familiar with Flink's HA implementation.
He might be able to answer your question.

Thanks,
Fabian

2018-06-25 23:24 GMT+02:00 Elias Levy <[hidden email]>:
I noticed in one of our cluster that they are relatively old submittedJobGraph* and completedCheckpoint* files. I was wondering at what point it is save to clean some of these up.