Hey,
The thread you are referring to is about a DataStream API job with a long checkpointing issue, whereas from your message it seems you are using the Table API (SQL) to process batch data. Or what exactly do you mean by:
> i notice that there are one or two subtasks that take too long to finish
Aside from that, could this simply be a data skew problem, where some subset of keys is used much more heavily than others?
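One rough way to check for skew outside of Flink is to count how often each key occurs in a sample of the input. The sketch below assumes you can export a sample of the key column into a plain Python list. The helper name `key_skew` and the sample data are made up for illustration and are not part of any Flink API.

```python
from collections import Counter


def key_skew(keys, top_n=3):
    """Return the top_n heaviest keys and the ratio of the heaviest
    key's count to the mean count per distinct key."""
    counts = Counter(keys)
    if not counts:
        return [], 0.0
    mean = sum(counts.values()) / len(counts)
    heaviest = counts.most_common(top_n)
    return heaviest, heaviest[0][1] / mean


# Hypothetical sample: key "a" dominates the input.
sample_keys = ["a"] * 8 + ["b", "c"]
heaviest, ratio = key_skew(sample_keys)
print(heaviest[0], round(ratio, 2))
```

A ratio far above 1 would suggest that the subtasks handling the heaviest keys get much more work than the rest, which would be consistent with one or two subtasks finishing late.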
Piotrek
Hi,
Any idea on how to debug this?
Thanks
Fanbin