Flink 1.7.2 UI : Jobs removed from Completed Jobs section

Jins George

Hi,

 

I am facing a weird problem in which jobs disappear from the ‘Completed Jobs’ section of the Flink 1.7.2 UI. Looking at the JobManager logs, I see that the job failed and was restarted ‘restart-strategy.fixed-delay.attempts’ times, after which the JobMaster was stopped.

I was able to see the job in the Completed Jobs section with the status FAILED, but after some time it was no longer listed. The JobManager was never restarted, so I expected failed or completed jobs to remain in the Completed Jobs section.

Any idea what might be happening?

JobManager.log:

 

2019-04-06 18:21:10,638 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Could not restart the job dwellalert-ubuntu-0403174608-698009a0 (b274377e6a223078d6f40b9c0620ee0d) because the restart strategy prevented it.

2019-04-06 18:21:10,662 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Stopping the JobMaster for job dwellalert-ubuntu-0403174608-698009a0(b274377e6a223078d6f40b9c0620ee0d).

 

Restart Strategy Conf:

restart-strategy: fixed-delay
restart-strategy.fixed-delay.attempts: 10
restart-strategy.fixed-delay.delay: 10 s

Thanks

Jins George


Re: Flink 1.7.2 UI : Jobs removed from Completed Jobs section

Timothy Victor
I face the same issue in Flink 1.7.1.

Would be good to know a solution. 

Tim

(In reply to Jins George's message of Mon, Apr 8, 2019, 12:45 PM.)


Re: Flink 1.7.2 UI : Jobs removed from Completed Jobs section

Jins George

Any input on this UI behavior?

 

Thanks,

Jins

 

(In reply to Timothy Victor's message of Monday, April 8, 2019 at 10:47 AM.)


Re: Flink 1.7.2 UI : Jobs removed from Completed Jobs section

Guowei Ma
I am not sure about this problem, but you could try increasing jobstore.expiration-time in the config.
Best,
Guowei


(In reply to Jins George's message of Wed, Apr 10, 2019, 1:01 PM.)


Re: Flink 1.7.2 UI : Jobs removed from Completed Jobs section

Jins George

Thank you Guowei. That was the trick!

 

By default, jobs in the Completed Jobs section expire and are removed after 1 hour. I have increased jobstore.expiration-time, and completed jobs are now retained.
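
For anyone hitting the same behavior, the change can be sketched as the following flink-conf.yaml fragment. The value 86400 is only an illustrative assumption (24 hours), not the value used in this thread. The option is expressed in seconds, and its default of 3600 corresponds to the 1 hour mentioned above.

```yaml
# flink-conf.yaml (sketch; the value is an illustrative assumption)
# Time in seconds after which a completed job expires and is purged
# from the job store. The default is 3600 (1 hour).
jobstore.expiration-time: 86400   # keep completed/failed jobs for about 24 hours
```

The setting is part of the JobManager configuration, so it presumably takes effect only after the JobManager is restarted with the updated file.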

 

Thanks,

Jins

 

(In reply to Guowei Ma's message of Wednesday, April 10, 2019 at 3:29 AM.)