Flink Checkpoint runs slow for low load stream
Posted by
Chakravarthy varaga on
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Flink-Checkpoint-runs-slow-for-low-load-stream-tp9147.html
Hi Aljoscha & Fabian,
I have a stream application that has 2 stream source as below.
KeyedStream<String, String> ks1 = ds1.keyBy("*") ;
KeyedStream<Tuple2<String, V>, String> ks2 = ds2.flatMap(split T into k-v pairs).keyBy(0);
ks1.connect(ks2).flatMap(X);
//X is a CoFlatMapFunction that inserts and removes elements from ks2 into
a key-value state member. Elements from ks1 are matched against that
state. the CoFlatMapFunction operator maintains ValueState<Tuple2<Long, Long>>;
//ks1 is streaming about 100K events/sec from kafka topic
//ks2 is streaming about 1 event every 10 minutes... Precisely when the
1st event is consumed from this stream, checkpoint takes 2 minutes
straight away.
The version of flink is 1.1.2.
I tried to use checkpoint every 10 Secs using a
FsStateBackend... What I notice is that the checkpoint duration is
almost 2 minutes for many cases, while for the other cases it varies
from 100 ms to 1.5 minutes frequently. I'm attaching the snapshot of the dashboard for your reference.
Is this an issue with flink checkpointing?
Best Regards
CVP