Flink resource manager unable to connect to mesos after restart

Renjie Liu
Hi, all:

I'm testing Flink 1.5.0 and find that the Flink Mesos resource manager is unable to connect to Mesos after a restart. Have you seen this happen?
--
Liu, Renjie
Software Engineer, MVAD
Re: Flink resource manager unable to connect to mesos after restart

Gary Yao-2
Hi,

If you are able to reproduce this reliably, can you post the jobmanager logs?

Best,
Gary

On Wed, Jul 18, 2018 at 10:33 AM, Renjie Liu <[hidden email]> wrote:
Hi, all:

I'm testing Flink 1.5.0 and find that the Flink Mesos resource manager is unable to connect to Mesos after a restart. Have you seen this happen?
--
Liu, Renjie
Software Engineer, MVAD

Re: Flink resource manager unable to connect to mesos after restart

Renjie Liu
Hi, Gary:

It can be reproduced reliably: just kill the job manager and restart it.

Attached is the jobmanager's log, but I didn't find anything valuable in it since it just keeps reporting that it is unable to connect to the Mesos master.

On Thu, Jul 19, 2018 at 4:55 AM Gary Yao <[hidden email]> wrote:
Hi,

If you are able to reproduce this reliably, can you post the jobmanager logs?

Best,
Gary
--
Liu, Renjie
Software Engineer, MVAD
Re: Flink resource manager unable to connect to mesos after restart

Renjie Liu
Attached is the job manager's log.


On Thu, Jul 19, 2018 at 4:38 PM Renjie Liu <[hidden email]> wrote:
Hi, Gary:

It can be reproduced reliably: just kill the job manager and restart it.

Attached is the jobmanager's log, but I didn't find anything valuable in it since it just keeps reporting that it is unable to connect to the Mesos master.

--
Liu, Renjie
Software Engineer, MVAD

Attachment: jobmanager.txt (3M)
Re: Flink resource manager unable to connect to mesos after restart

Gary Yao-2
Hi,

Sorry for the late reply. I have seen that you debugged this already and
created FLINK-9936. Thank you for looking into the issue. I think your
conclusions are correct. I just wanted to note that there is an even older
ticket describing the same problem:
   
    https://issues.apache.org/jira/browse/FLINK-7470

One of them should be closed.

Best,
Gary

On Thu, Jul 19, 2018 at 10:45 AM, Renjie Liu <[hidden email]> wrote:
Attached is the job manager's log.



Re: Flink resource manager unable to connect to mesos after restart

Renjie Liu
OK, I'll close FLINK-7470.

On Thu, Jul 26, 2018 at 11:25 PM Gary Yao <[hidden email]> wrote:
Hi,

Sorry for the late reply. I have seen that you debugged this already and
created FLINK-9936. Thank you for looking into the issue. I think your
conclusions are correct. I just wanted to note that there is an even older
ticket describing the same problem:
   
    https://issues.apache.org/jira/browse/FLINK-7470

One of them should be closed.

Best,
Gary



--
Liu, Renjie
Software Engineer, MVAD