Flink 1.1.3 | Shutting down YarnClusterClient from the client shutdown hook | happening frequently

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

Flink 1.1.3 | Shutting down YarnClusterClient from the client shutdown hook | happening frequently

Anchit Jatana
This post was updated on .
flink-a12345-client-mydns.log

Hi All,

I'm running my flink application on YARN. It's frequently getting suspended, though gracefully. Below is the snippet of the error, attaching full jobmanager log to help debug. Please help me identify the cause and resolve the issue.

Thank you

Regards,
Anchit

Error snippet:

2016-11-09 03:15:01,238 INFO  org.apache.flink.yarn.YarnClusterClient                       - Shutting down YarnClusterClient from the client shutdown hook
2016-11-09 03:15:01,242 INFO  org.apache.flink.yarn.YarnClusterClient                       - Sending shutdown request to the Application Master
2016-11-09 03:15:01,248 INFO  org.apache.flink.yarn.YarnClusterClient                       - Start application client.
2016-11-09 03:15:01,260 INFO  org.apache.flink.yarn.ApplicationClient                       - Notification about new leader address akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,263 INFO  org.apache.flink.yarn.ApplicationClient                       - Sending StopCluster request to JobManager.
2016-11-09 03:15:01,264 INFO  org.apache.flink.yarn.ApplicationClient                       - Received address of new leader akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,265 INFO  org.apache.flink.yarn.ApplicationClient                       - Disconnect from JobManager null.
2016-11-09 03:15:01,269 INFO  org.apache.flink.yarn.ApplicationClient                       - Trying to register at JobManager akka.tcp://flink@10.60.200.106:36465/user/jobmanager.
2016-11-09 03:15:01,279 INFO  org.apache.flink.yarn.ApplicationClient                       - Successfully registered at the ResourceManager using JobManager Actor[akka.tcp://flink@10.60.200.106:36465/user/jobmanager#918758350]
2016-11-09 03:15:02,282 INFO  org.apache.flink.yarn.ApplicationClient                       - Sending StopCluster request to JobManager.
2016-11-09 03:15:02,295 INFO  org.apache.flink.yarn.YarnClusterClient                       - Deleting files in hdfs://ldnsns/user/a12345/.flink/application_1478099802210_11790
2016-11-09 03:15:02,319 INFO  org.apache.flink.runtime.client.JobClientActor                - 11/09/2016 03:15:02 Job execution switched to status SUSPENDED.

Reply | Threaded
Open this post in threaded view
|

Re: Flink 1.1.3 | Shutting down YarnClusterClient from the client shutdown hook | happening frequently

Till Rohrmann
HI Anchit,

the ML does not support attachments. To understand your problem, the JM logs would be really helpful.

Cheers,
Till

On Thu, Nov 10, 2016 at 2:01 AM, Anchit Jatana <[hidden email]> wrote:
Hi All,

I'm running my flink application on YARN. It's frequently getting suspended,
though gracefully. Below is the snippet of the error, attaching full
jobmanager log to help debug. Please help me identify the cause and resolve
the issue.

Thank you

Regards,
Anchit

Error snippet:

2016-11-09 03:15:01,238 INFO  org.apache.flink.yarn.YarnClusterClient
- Shutting down YarnClusterClient from the client shutdown hook
2016-11-09 03:15:01,242 INFO  org.apache.flink.yarn.YarnClusterClient
- Sending shutdown request to the Application Master
2016-11-09 03:15:01,248 INFO  org.apache.flink.yarn.YarnClusterClient
- Start application client.
2016-11-09 03:15:01,260 INFO  org.apache.flink.yarn.ApplicationClient
- Notification about new leader address
akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,263 INFO  org.apache.flink.yarn.ApplicationClient
- Sending StopCluster request to JobManager.
2016-11-09 03:15:01,264 INFO  org.apache.flink.yarn.ApplicationClient
- Received address of new leader
akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,265 INFO  org.apache.flink.yarn.ApplicationClient
- Disconnect from JobManager null.
2016-11-09 03:15:01,269 INFO  org.apache.flink.yarn.ApplicationClient
- Trying to register at JobManager
akka.tcp://flink@10.60.200.106:36465/user/jobmanager.
2016-11-09 03:15:01,279 INFO  org.apache.flink.yarn.ApplicationClient
- Successfully registered at the ResourceManager using JobManager
Actor[akka.tcp://flink@10.60.200.106:36465/user/jobmanager#918758350]
2016-11-09 03:15:02,282 INFO  org.apache.flink.yarn.ApplicationClient
- Sending StopCluster request to JobManager.
2016-11-09 03:15:02,295 INFO  org.apache.flink.yarn.YarnClusterClient
- Deleting files in
hdfs://ldnsns/user/a12345/.flink/application_1478099802210_11790
2016-11-09 03:15:02,319 INFO  org.apache.flink.runtime.client.JobClientActor
- 11/09/2016 03:15:02   Job execution switched to status SUSPENDED.





--
View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Flink-1-1-3-Shutting-down-YarnClusterClient-from-the-client-shutdown-hook-happening-frequently-tp10019.html
Sent from the Apache Flink User Mailing List archive. mailing list archive at Nabble.com.