Re: Streaming API has a long delay at the beginning of the process.
Posted by Fabian Hueske-2
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/heap-dump-shows-StoppableSourceStreamTask-retained-by-java-lang-finalizer-tp15535p15577.html
Hi,
If I understand you correctly, the problem only affects the first events that are processed.
AFAIK, Flink instantiates its operators lazily, which means that a source task starts consuming records from Kafka before the subsequent tasks have been started.
The first records therefore have to wait until the downstream tasks are running, which is why their latency is higher.
I'm not sure whether anything can be done about this behavior, or what.
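To make the effect concrete, here is a toy model in plain Python. It is not Flink code and makes no claim about how Flink schedules tasks internally; the function name, the parameters, and all the numbers are made up for illustration. It only assumes that records emitted before the downstream task is ready wait in a buffer until it starts.

```python
def first_record_latencies(arrival_times_ms, downstream_ready_ms, process_ms=1):
    """Toy model (hypothetical, not Flink's API).

    arrival_times_ms:    times at which the source emits each record
    downstream_ready_ms: time at which the downstream task has started
    process_ms:          assumed fixed processing cost per record
    Returns the end-to-end latency of each record in milliseconds.
    """
    latencies = []
    # The downstream task cannot process anything before it has started.
    free_at = downstream_ready_ms
    for arrival in arrival_times_ms:
        # A record waits if it arrives before the task is ready or while
        # the task is still working through the backlog.
        start = max(arrival, free_at)
        finish = start + process_ms
        latencies.append(finish - arrival)
        free_at = finish
    return latencies


# Source emits a record every 100 ms; downstream is only ready at 250 ms.
arrivals = [0, 100, 200, 300, 400, 500]
print(first_record_latencies(arrivals, downstream_ready_ms=250))
# prints [251, 152, 53, 1, 1, 1]
```

In this model only the records emitted before the downstream task is ready see the extra delay; once the backlog is drained, latency drops to the per-record processing cost.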
I'll loop in Till, who knows more about the lifecycle of tasks.
Best, Fabian