When I submit a job to flink session with parallelism higher than 128, the job is submitted and renders in the UI but when I view the job itself the UI starts to rapidly emit errors in the upper right:
Server Response: Unable to load requested file /bad-request. Is this a known issue? Is there a fix? Does this indicate underlying stability issues? |
We also encountered a similar issue internally. cc +huweihua.ckl Richard Moorhead <[hidden email]> 于2020年2月13日周四 上午9:40写道:
|
In reply to this post by Richard Moorhead
Hi, Richard This is most likely that the Rest Api has timed out, you can try to find some evidence in the jobmanager log. You can provide the full log to help us find the root cause. Best Weihua Hu
|
On Wed, Feb 12, 2020 at 11:30 PM HuWeihua <[hidden email]> wrote:
|
These logs prove that it is indeed a timeout issue, In our scenario, it was due to the task deploy took a lot of time. You can check if the time from Task from SCHEDULED to DEPLOYING in the log is greater than 10s. This step are processed in mainThread and will block the processing of requests from the UI. By now, you can increase the ‘akka.ask.timeout’ to avoid this. I have created a jira issue to improve this. https://issues.apache.org/jira/browse/FLINK-16069 . Best Weihua Hu
|
Free forum by Nabble | Edit this page |