Can I ask why some operations run only one slot? I understand that file writes should happen only one one slot but GroupByKey operation could be distributed across all slots. I am having around 20k distinct keys every minute. Is there any way to break this operator chain? I noticed that CombinePerKey operations that don't have IO related transformation are scheduled across all 32 slots. My cluster has 32 slots across 2 task managers. Running Beam 2.2. and Flink 1.3.2
Thanks, Pawel |
Hi Pawel, This question might be better suited for the Beam user list.2018-01-18 16:02 GMT+01:00 Pawel Bartoszek <[hidden email]>:
|
Hi,
What are the other stages in that program? Best, Aljoscha
|
Free forum by Nabble | Edit this page |