What is the recommended way to run flink with high availability on AWS?

Pei HE
Hi there,
According to the Flink documentation [1], EMR is the recommended way to run Flink on AWS. However, Flink is currently not supported on an EMR cluster with multiple master nodes [2].

For example, I can create an EMR HA cluster with ZooKeeper, Hive, etc. But if I add Flink to the application list when creating the cluster, I get the following error from the AWS CLI:
"An error occurred (ValidationException) when calling the RunJobFlow operation: HA clusters do not support the specified applications: Flink.  Revise the applications and resubmit.".
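
A minimal sketch of such a request, expressed as the parameters for boto3's `run_job_flow` rather than the AWS CLI command used above. The cluster name, release label, instance types, core node count, and IAM role names are assumptions for illustration, not taken from the original command. The sketch only builds the request; submitting it needs boto3 and AWS credentials.

```python
# Sketch of a RunJobFlow request for a multi-master (HA) EMR cluster that
# includes Flink. All concrete values below are illustrative assumptions.

def build_run_job_flow_request(applications, master_count=3):
    """Build keyword arguments for boto3's EMR run_job_flow call."""
    return {
        "Name": "flink-ha-test",        # hypothetical cluster name
        "ReleaseLabel": "emr-5.26.0",   # assumed release label
        "Applications": [{"Name": app} for app in applications],
        "Instances": {
            "InstanceGroups": [
                {
                    "InstanceRole": "MASTER",
                    "InstanceType": "m5.xlarge",    # assumed instance type
                    "InstanceCount": master_count,  # 3 masters = HA cluster
                },
                {
                    "InstanceRole": "CORE",
                    "InstanceType": "m5.xlarge",    # assumed instance type
                    "InstanceCount": 2,             # assumed core node count
                },
            ],
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        "JobFlowRole": "EMR_EC2_DefaultRole",  # assumed default role names
        "ServiceRole": "EMR_DefaultRole",
    }


# Adding Flink to the application list is what triggers the error above.
request = build_run_job_flow_request(["ZooKeeper", "Hive", "Flink"])

# To actually submit the request (requires boto3 and AWS credentials):
#   import boto3
#   boto3.client("emr").run_job_flow(**request)
```

Dropping "Flink" from the application list, or setting `master_count=1`, gives the two variants that the post says are accepted.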

My question is: what is the recommended way to run Flink with high availability on AWS?

Thanks
--
Pei

Re: What is the recommended way to run flink with high availability on AWS?

sri hari kali charan Tummala
Check out the AWS-managed Flink offering in Amazon Kinesis: try creating a Kinesis Data Analytics application that uses Flink instead of SQL.


--
Thanks & Regards
Sri Tummala


Re: What is the recommended way to run flink with high availability on AWS?

Pei HE
Thanks, Kali, for the information. However, it doesn't work for me, because I need features from Flink 1.7.x or later and I use managed Amazon MSK.
--
Pei



Re: What is the recommended way to run flink with high availability on AWS?

sri hari kali charan Tummala
Ok, no problem. 




--
Thanks & Regards
Sri Tummala