Re: Question about the checkpoint mechanism in Flink.
Posted by
Renjie Liu on
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Question-about-the-checkpoint-mechanism-in-Flink-tp9792p9800.html
Hi, Till:
By operator with multiple inputs, do you mean inputs from multiple subtasks?
Hi Li,
the statement refers to operators with multiple inputs (two in this case). With the current implementation you will indeed block one of the inputs after receiving a checkpoint barrier n until you've received the corresponding checkpoint barrier n on the other input as well. This is what we call checkpoint barrier alignment. If the processing time on both input paths is similar and thus there is no back pressure on any of the inputs, the alignment should not take too long. In case where one of the inputs is considerably slower than the other, you should an additional delay.
For single input operators, you don't have to align the checkpoint barriers.
The checkpoint barrier alignment is not strictly necessary, but it allows us to not having to store all in flight records from the second input which arrive between the checkpoint barrier on the first input and the corresponding barrier on the second input. We might change this implementation in the future, though.
Cheers,
Till
--
Liu, Renjie
Software Engineer, MVAD