I think I found the reason for what happened. The way I used the QueryableStateClient is that I wrapped
scala.concurrent.Future in a
FlinkFuture and then called
FlinkFuture.thenAccept. It turns out
thenAccept doesn't throw exceptions and when an exception happens (which likely happened once I inreased the parallelism) the job simply doesn't finish. I solved the problem by using
resultFuture.get()which araised the appropriate exceptions when they happens and failed the job.
Best,
Yassine