Re: Task manager not able to rejoin job manager after network hicup
Posted by
Ashish Pokharel on
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Task-manager-not-able-to-rejoin-job-manager-after-network-hicup-tp18525p18547.html
We see the same in 1.4. I dont think we could see this in 1.3. I had started a thread a while back on this. Till asked for more details. I havent had a chance to get back to him on this. If you can repro this easily perhaps you can get to it faster. I will find the thread and resend.
Thanks,
-- Ashish
On Fri, Feb 23, 2018 at 9:56 AM, jelmer
We found out there's a taskmanager.exit-on-fatal-akka-error property that will restart flink in this situation but it is not enabled by default and that feels like a rather blunt tool. I expect systems like this to be more resilient to this