Hi,
I am observing a behaviour in the task statistics that I don't fully understand.
Essentially I have create a partitioner that assigns all the edges to a single partition.
I see imbalance (in terms of records sent/received) in the task statistics of different instances of the same operator for the second and third stages.
But from fourth stage onwards, all operator instances are executing pretty much the same number of records. I would have expected that the imbalance would exist in those stages as well.
Details of the my code and task statistics are in this stackoverflow question:
https://stackoverflow.com/questions/55138553/behaviour-of-custom-partitioner-in-apache-flinkThanks.
- Bilal