Flink batch processing fault tolerance

Flink batch processing fault tolerance

Renjie Liu
Hi, all:
I'm reading Flink's documentation and am curious about the fault tolerance of batch jobs. It seems that when a single task execution fails, the whole job is restarted. Is that true? If so, isn't it impractical to deploy large Flink batch jobs?
--
Liu, Renjie
Software Engineer, MVAD

Re: Flink batch processing fault tolerance

Aljoscha Krettek
Hi,
Yes, this is indeed true. We had some plans for how to resolve this, but they never materialised because of the focus on stream processing. We might unite the two in the future, and then you will get fault-tolerant batch/stream processing in the same API.

Best,
Aljoscha

On Wed, 15 Feb 2017 at 09:28 Renjie Liu <[hidden email]> wrote:
Hi, all:
I'm reading Flink's documentation and am curious about the fault tolerance of batch jobs. It seems that when a single task execution fails, the whole job is restarted. Is that true? If so, isn't it impractical to deploy large Flink batch jobs?
--
Liu, Renjie
Software Engineer, MVAD

RE: Flink batch processing fault tolerance

Anton Solovev

Hi Aljoscha,

Could you share your plans for resolving it?

Best,
Anton

From: Aljoscha Krettek [mailto:[hidden email]]
Sent: Thursday, February 16, 2017 2:48 PM
To: [hidden email]
Subject: Re: Flink batch processing fault tolerance

Hi,
Yes, this is indeed true. We had some plans for how to resolve this, but they never materialised because of the focus on stream processing. We might unite the two in the future, and then you will get fault-tolerant batch/stream processing in the same API.

Best,
Aljoscha


Re: Flink batch processing fault tolerance

Renjie Liu

On Thu, Feb 16, 2017 at 7:34 PM Anton Solovev <[hidden email]> wrote:

Hi Aljoscha,

Could you share your plans for resolving it?

Best,
Anton

--
Liu, Renjie
Software Engineer, MVAD

Re: Flink batch processing fault tolerance

Si-li Liu
Hi, 

This is the reason I gave up on using Flink for my current project and went back to the traditional Hadoop framework.

2017-02-17 10:56 GMT+08:00 Renjie Liu <[hidden email]>:

--
Liu, Renjie
Software Engineer, MVAD



--
Best regards

Sili Liu

Re: Flink batch processing fault tolerance

Zhijiang(wangzhijiang999)
In reply to this post by Renjie Liu
Yes, this is really a critical problem for large batch jobs, because unexpected failures are common.
We are already working on implementing the ideas described in FLIP-1 and hope to contribute them to Flink in the coming months.

Best,

Zhijiang
------------------------------------------------------------------
From: Si-li Liu <[hidden email]>
Sent: Friday, February 17, 2017, 11:22
To: user <[hidden email]>
Subject: Re: Flink batch processing fault tolerance

Hi,

This is the reason I gave up on using Flink for my current project and went back to the traditional Hadoop framework.


Re: Flink batch processing fault tolerance

Aljoscha Krettek
@Anton, the ideas in the FLIP are the ones I was referring to, and I'm afraid I have nothing more to add.

On Fri, 17 Feb 2017 at 06:26 wangzhijiang999 <[hidden email]> wrote:
Yes, this is really a critical problem for large batch jobs, because unexpected failures are common.
We are already working on implementing the ideas described in FLIP-1 and hope to contribute them to Flink in the coming months.

Best,

Zhijiang