Re: With the checkpoint interval of the same size, the Flink 1.12 version of the job checkpoint time-consuming increase and production failure, the Flink1.9 job is running normally

Posted by Haihang Jing on
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/With-the-checkpoint-interval-of-the-same-size-the-Flink-1-12-version-of-the-job-checkpoint-time-consy-tp42471p42534.html

Hi Congxian, thanks for your reply.
Job running on Flink 1.9 (checkpoint interval 3 min):
<http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t3050/6.png>
Job running on Flink 1.12 (checkpoint interval 10 min):
<http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t3050/7.png>
Job running on Flink 1.12 (checkpoint interval 3 min):
Pic1: The time needed to complete the checkpoint in 1.12 is longer (5m32s):
<http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t3050/2.png>
Pic2: Start delay (4m27s):
<http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t3050/1.png>
Pic3: The next checkpoint failed (task 141 ack is n/a):
<http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t3050/3.png>
Pic4: I did not see any back pressure or data skew:
<http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t3050/4.png>
Pic5: Subtasks process the same amount of data, and their checkpoint end-to-end duration is short:
<http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t3050/5.png>
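To put the numbers from the 3 min run on Flink 1.12 side by side, here is a rough arithmetic sketch. It only uses the figures from Pic1 and Pic2, and it assumes that the start delay in Pic2 belongs to the same checkpoint as Pic1 and counts toward its end-to-end duration. The variable names are just for illustration.

```python
from datetime import timedelta

# Figures from the Flink 1.12 run with a 3 min checkpoint interval.
checkpoint_interval = timedelta(minutes=3)
end_to_end_duration = timedelta(minutes=5, seconds=32)  # Pic1
start_delay = timedelta(minutes=4, seconds=27)          # Pic2

# Assumption: the start delay is part of the end-to-end duration,
# so this is the share of the checkpoint spent waiting to start.
start_delay_share = start_delay / end_to_end_duration

# Time left for the rest of the checkpoint once it has started.
time_after_start = end_to_end_duration - start_delay

# How far a single checkpoint runs past the configured interval.
overrun = end_to_end_duration - checkpoint_interval

print(f"start delay share: {start_delay_share:.0%}")
print(f"time after start delay: {time_after_start}")
print(f"overrun past interval: {overrun}")
```

If that assumption holds, most of the checkpoint time is spent in the start delay, and a single checkpoint takes longer than the 3 min interval.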
Best,
Haihang


