Confusion in Heartbeat configurations

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

Confusion in Heartbeat configurations

sohimankotia
Hi,

In
https://ci.apache.org/projects/flink/flink-docs-release-1.7/ops/config.html
link there are two heartbeat config are mentioned .

akka.watch.heartbeat.interval
akka.watch.heartbeat.pause

Vs

heartbeat.interval
heartbeat.timeout


Can u guys pls explain what exactly is difference between them and which
component of job execution graph they impact .

Thanks




--
Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/
Reply | Threaded
Open this post in threaded view
|

Re: Confusion in Heartbeat configurations

Zhijiang(wangzhijiang999)
Hi sohimankotia,

In order not to strongly rely on the akka implementation, flink implements the heartbeat mechanism for health monitor for the components of TaskExecutor, JobMaster and ResourceManager from FLIP6. So you can see two sets of heartbeat setting, one is for akka internal implementation prefix with `akka` and the other is flink internal implementation.

Best,
Zhijiang
------------------------------------------------------------------
From:sohimankotia <[hidden email]>
Send Time:2019年2月18日(星期一) 14:40
To:user <[hidden email]>
Subject:Confusion in Heartbeat configurations

Hi, 

In
https://ci.apache.org/projects/flink/flink-docs-release-1.7/ops/config.html
link there are two heartbeat config are mentioned . 

akka.watch.heartbeat.interval
akka.watch.heartbeat.pause

Vs

heartbeat.interval
heartbeat.timeout


Can u guys pls explain what exactly is difference between them and which
component of job execution graph they impact . 

Thanks




--
Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Reply | Threaded
Open this post in threaded view
|

Re: Confusion in Heartbeat configurations

sohimankotia
Thanks Zhijiang .

Sorry to ask again . So both set of heartbeats are implementing same feature
.

If Yes , which one has highest priority to detect failure .
If no , can you explain little more or point to some references to
understand difference .

Thanks
Sohi



--
Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/