Hello flinkers,
We will activate namenode HDFS high availability in our cluster, and I want to know if there is additional configuration for flink ? We actually use YARN for launching our flink application, and hdfs filesystem to store the state backend Thanks Thomas |
I think the only keys of interest for your needs (highly available with HDFS state backend) are state.backend: filesystem state.backend.fs.checkpointdir: hdfs:///path/to/checkpoints # fill in according to your needs recovery.zookeeper.storageDir: /path/to/znode # again, fill in according to your needs recovery.mode: zookeeper recovery.zookeeper.quorum: zk-ensemble-1:2181,zk-ensemble-2:2181,zk-ensemble-3:2181 # put your zk ensemble here If these keys are set you should be good to go. I hope I've been of some help. :) On Mon, May 23, 2016 at 12:37 PM, <[hidden email]> wrote: Hello flinkers, BR, Stefano Baghino |
One last quick note: if you're going to run individual jobs on YARN instead of a long running session, make sure you provide each job with a separate set of directories for (surely) ZK storage and (possibly*) state backend, otherwise the state of the jobs will end up entangled and you may experience some undefined behavior. * I'm not really sure about this last one, perhaps some more experienced ML user can help me out on this. On Mon, May 23, 2016 at 12:54 PM, Stefano Baghino <[hidden email]> wrote:
BR, Stefano Baghino |
Ok, we have all this configuration set up, so it will be fine :-) Thanks for getting response ! Thomas
One last quick note: if you're going to run individual jobs on YARN instead of a long running session, make sure you provide each job with a separate set of directories for (surely) ZK storage and (possibly*) state backend, otherwise the state of the jobs will end up entangled and you may experience some undefined behavior. * I'm not really sure about this last one, perhaps some more experienced ML user can help me out on this. On Mon, May 23, 2016 at 12:54 PM, Stefano Baghino <[hidden email]> wrote:
BR, Stefano Baghino |
Hi Thomas, if you want to run multiple Flink cluster in HA mode, you should configure for every cluster a specific Cheers, On Tue, May 24, 2016 at 7:38 AM, <[hidden email]> wrote:
|
Free forum by Nabble | Edit this page |