ERROR while creating save points..

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

ERROR while creating save points..

Sathi Chowdhury

Hi Flink Dev,

I am running flink on yarn from EMR and I was running this command to test an external savepoint

flink savepoint 8c4c885c5899544de556c5caa984502d  /mnt

 

 

The program finished with the following exception:

 

org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not retrieve the leader gateway

            at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:127)

            at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:645)

            at org.apache.flink.client.CliFrontend.getJobManagerGateway(CliFrontend.java:868)

            at org.apache.flink.client.CliFrontend.triggerSavepoint(CliFrontend.java:653)

            at org.apache.flink.client.CliFrontend.savepoint(CliFrontend.java:643)

            at org.apache.flink.client.CliFrontend.parseParameters(CliFrontend.java:1016)

            at org.apache.flink.client.CliFrontend.main(CliFrontend.java:1048)

Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000 milliseconds]

            at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:219)

            at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223)

            at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:107)

            at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)

            at scala.concurrent.Await$.result(package.scala:107)

            at scala.concurrent.Await.result(package.scala)

            at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:125)

            ... 6 more

2017-05-22 19:50:10,330 INFO  org.apache.flink.yarn.YarnClusterClient                       - Shutting down YarnClusterClient from the client shutdown hook

2017-05-22 19:50:10,330 INFO  org.apache.flink.yarn.YarnClusterClient                       - Disconnecting YarnClusterClient from ApplicationMaster

[hadoop@ip-10-202-4-24 flink]$ flink savepoint 8c4c885c5899544de556c5caa984502d  /mnt

 

Any clue what’s wrong?

=============Notice to Recipient: This e-mail transmission, and any documents, files or previous e-mail messages attached to it may contain information that is confidential or legally privileged, and intended for the use of the individual or entity named above. If you are not the intended recipient, or a person responsible for delivering it to the intended recipient, you are hereby notified that you must not read this transmission and that any disclosure, copying, printing, distribution or use of any of the information contained in or attached to this transmission is STRICTLY PROHIBITED. If you have received this transmission in error, please immediately notify the sender by telephone or return e-mail and delete the original transmission and its attachments without reading or saving in any manner. Thank you. =============
Reply | Threaded
Open this post in threaded view
|

Re: ERROR while creating save points..

Sathi Chowdhury

I was able to bypass that one ..by running it from bin/flink …

Now encountering

by: java.lang.NullPointerException: Checkpoint properties say that the checkpoint should have been persisted, but missing external path.

 

From: Sathi Chowdhury <[hidden email]>
Date: Monday, May 22, 2017 at 12:55 PM
To: "[hidden email]" <[hidden email]>
Subject: ERROR while creating save points..

 

Hi Flink Dev,

I am running flink on yarn from EMR and I was running this command to test an external savepoint

flink savepoint 8c4c885c5899544de556c5caa984502d  /mnt

 

 

The program finished with the following exception:

 

org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not retrieve the leader gateway

            at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:127)

            at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:645)

            at org.apache.flink.client.CliFrontend.getJobManagerGateway(CliFrontend.java:868)

            at org.apache.flink.client.CliFrontend.triggerSavepoint(CliFrontend.java:653)

            at org.apache.flink.client.CliFrontend.savepoint(CliFrontend.java:643)

            at org.apache.flink.client.CliFrontend.parseParameters(CliFrontend.java:1016)

            at org.apache.flink.client.CliFrontend.main(CliFrontend.java:1048)

Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000 milliseconds]

            at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:219)

            at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223)

            at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:107)

            at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)

            at scala.concurrent.Await$.result(package.scala:107)

            at scala.concurrent.Await.result(package.scala)

            at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:125)

            ... 6 more

2017-05-22 19:50:10,330 INFO  org.apache.flink.yarn.YarnClusterClient                       - Shutting down YarnClusterClient from the client shutdown hook

2017-05-22 19:50:10,330 INFO  org.apache.flink.yarn.YarnClusterClient                       - Disconnecting YarnClusterClient from ApplicationMaster

[hadoop@ip-10-202-4-24 flink]$ flink savepoint 8c4c885c5899544de556c5caa984502d  /mnt

 

Any clue what’s wrong?

=============Notice to Recipient: This e-mail transmission, and any documents, files or previous e-mail messages attached to it may contain information that is confidential or legally privileged, and intended for the use of the individual or entity named above. If you are not the intended recipient, or a person responsible for delivering it to the intended recipient, you are hereby notified that you must not read this transmission and that any disclosure, copying, printing, distribution or use of any of the information contained in or attached to this transmission is STRICTLY PROHIBITED. If you have received this transmission in error, please immediately notify the sender by telephone or return e-mail and delete the original transmission and its attachments without reading or saving in any manner. Thank you. =============

=============Notice to Recipient: This e-mail transmission, and any documents, files or previous e-mail messages attached to it may contain information that is confidential or legally privileged, and intended for the use of the individual or entity named above. If you are not the intended recipient, or a person responsible for delivering it to the intended recipient, you are hereby notified that you must not read this transmission and that any disclosure, copying, printing, distribution or use of any of the information contained in or attached to this transmission is STRICTLY PROHIBITED. If you have received this transmission in error, please immediately notify the sender by telephone or return e-mail and delete the original transmission and its attachments without reading or saving in any manner. Thank you. =============
Reply | Threaded
Open this post in threaded view
|

Re: ERROR while creating save points..

Till Rohrmann
Hi Sathi,

this looks not right on the first glance. Could you please share the exact command you've used to trigger a savepoint and also the full stack trace of the exception?

Cheers,
Till

On Mon, May 22, 2017 at 10:55 PM, Sathi Chowdhury <[hidden email]> wrote:

I was able to bypass that one ..by running it from bin/flink …

Now encountering

by: java.lang.NullPointerException: Checkpoint properties say that the checkpoint should have been persisted, but missing external path.

 

From: Sathi Chowdhury <[hidden email]>
Date: Monday, May 22, 2017 at 12:55 PM
To: "[hidden email]" <[hidden email]>
Subject: ERROR while creating save points..

 

Hi Flink Dev,

I am running flink on yarn from EMR and I was running this command to test an external savepoint

flink savepoint 8c4c885c5899544de556c5caa984502d  /mnt

 

 

The program finished with the following exception:

 

org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not retrieve the leader gateway

            at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:127)

            at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:645)

            at org.apache.flink.client.CliFrontend.getJobManagerGateway(CliFrontend.java:868)

            at org.apache.flink.client.CliFrontend.triggerSavepoint(CliFrontend.java:653)

            at org.apache.flink.client.CliFrontend.savepoint(CliFrontend.java:643)

            at org.apache.flink.client.CliFrontend.parseParameters(CliFrontend.java:1016)

            at org.apache.flink.client.CliFrontend.main(CliFrontend.java:1048)

Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000 milliseconds]

            at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:219)

            at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223)

            at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:107)

            at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)

            at scala.concurrent.Await$.result(package.scala:107)

            at scala.concurrent.Await.result(package.scala)

            at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:125)

            ... 6 more

2017-05-22 19:50:10,330 INFO  org.apache.flink.yarn.YarnClusterClient                       - Shutting down YarnClusterClient from the client shutdown hook

2017-05-22 19:50:10,330 INFO  org.apache.flink.yarn.YarnClusterClient                       - Disconnecting YarnClusterClient from ApplicationMaster

[hadoop@ip-10-202-4-24 flink]$ flink savepoint 8c4c885c5899544de556c5caa984502d  /mnt

 

Any clue what’s wrong?

=============Notice to Recipient: This e-mail transmission, and any documents, files or previous e-mail messages attached to it may contain information that is confidential or legally privileged, and intended for the use of the individual or entity named above. If you are not the intended recipient, or a person responsible for delivering it to the intended recipient, you are hereby notified that you must not read this transmission and that any disclosure, copying, printing, distribution or use of any of the information contained in or attached to this transmission is STRICTLY PROHIBITED. If you have received this transmission in error, please immediately notify the sender by telephone or return e-mail and delete the original transmission and its attachments without reading or saving in any manner. Thank you. =============

=============Notice to Recipient: This e-mail transmission, and any documents, files or previous e-mail messages attached to it may contain information that is confidential or legally privileged, and intended for the use of the individual or entity named above. If you are not the intended recipient, or a person responsible for delivering it to the intended recipient, you are hereby notified that you must not read this transmission and that any disclosure, copying, printing, distribution or use of any of the information contained in or attached to this transmission is STRICTLY PROHIBITED. If you have received this transmission in error, please immediately notify the sender by telephone or return e-mail and delete the original transmission and its attachments without reading or saving in any manner. Thank you. =============