Can rocksDBBackend handle rescaling?

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|

Can rocksDBBackend handle rescaling?

bupt_ljy

Hi, all

   I’m using Flink 1.6 and I’m goint to use RocksDBBackend as the checkpoint state backend. Before that, I need to make sure that the checkpoint can handle rescale. From the docs on Apache Flink https://ci.apache.org/projects/flink/flink-docs-release-1.6/ops/state/checkpoints.html, it says that the checkpoint do not support Flink’s rescaling, however, from the source code, the rescaling logic seems not to be related to the state backend if the substask’s states are restored from files successfully.

   Could anyone explain this? Or tell me if I miss something?

   

Best, Jiayi Liao



 

Reply | Threaded
Open this post in threaded view
|

Re: Can rocksDBBackend handle rescaling?

杨力
A checkpoint cannot be used for rescaling. You should take a savepoint, stop the application, and resume from the savepoint with a new parallelism.



On Fri, Sep 14, 2018 at 1:50 AM 廖嘉逸 <[hidden email]> wrote:

Hi, all

   I’m using Flink 1.6 and I’m goint to use RocksDBBackend as the checkpoint state backend. Before that, I need to make sure that the checkpoint can handle rescale. From the docs on Apache Flink https://ci.apache.org/projects/flink/flink-docs-release-1.6/ops/state/checkpoints.html, it says that the checkpoint do not support Flink’s rescaling, however, from the source code, the rescaling logic seems not to be related to the state backend if the substask’s states are restored from files successfully.

   Could anyone explain this? Or tell me if I miss something?

   

Best, Jiayi Liao



 

Reply | Threaded
Open this post in threaded view
|

Re: Can rocksDBBackend handle rescaling?

bupt_ljy
In reply to this post by bupt_ljy

Thanks for your answer, but I still have some questions.

Could you tell me why the checkpoint cannot be used for rescaling? From my perspective, the difference between checkpoint and savepoint is nullable externalSavepointLocation. And from this doc: https://flink.apache.org/features/2017/07/04/flink-rescalable-state.html, it talked about rescaling based on the checkpointing. Do I confuse these two things?


 Original Message 
Sender: 杨力<[hidden email]>
Recipient: 廖嘉逸<[hidden email]>
Cc: user<[hidden email]>
Date: Friday, Sep 14, 2018 02:03
Subject: Re: Can rocksDBBackend handle rescaling?

A checkpoint cannot be used for rescaling. You should take a savepoint, stop the application, and resume from the savepoint with a new parallelism.



On Fri, Sep 14, 2018 at 1:50 AM 廖嘉逸 <[hidden email]> wrote:

Hi, all

   I’m using Flink 1.6 and I’m goint to use RocksDBBackend as the checkpoint state backend. Before that, I need to make sure that the checkpoint can handle rescale. From the docs on Apache Flink https://ci.apache.org/projects/flink/flink-docs-release-1.6/ops/state/checkpoints.html, it says that the checkpoint do not support Flink’s rescaling, however, from the source code, the rescaling logic seems not to be related to the state backend if the substask’s states are restored from files successfully.

   Could anyone explain this? Or tell me if I miss something?

   

Best, Jiayi Liao



 

Reply | Threaded
Open this post in threaded view
|

Re: Can rocksDBBackend handle rescaling?

Andrey Zagrebin
Hi Jiayi Liao,

I have answered the similar question here:

This link might be also useful:

I would recommend to use explicit savepoints at the moment for rescaling.

Best,
Andrey

On 13 Sep 2018, at 20:29, bupt_ljy <[hidden email]> wrote:

Thanks for your answer, but I still have some questions.
Could you tell me why the checkpoint cannot be used for rescaling? From my perspective, the difference between checkpoint and savepoint is nullable externalSavepointLocation. And from this doc: https://flink.apache.org/features/2017/07/04/flink-rescalable-state.html, it talked about rescaling based on the checkpointing. Do I confuse these two things?

 Original Message 
Sender: 杨力<[hidden email]>
Recipient: 廖嘉逸<[hidden email]>
Cc: user<[hidden email]>
Date: Friday, Sep 14, 2018 02:03
Subject: Re: Can rocksDBBackend handle rescaling?

A checkpoint cannot be used for rescaling. You should take a savepoint, stop the application, and resume from the savepoint with a new parallelism.



On Fri, Sep 14, 2018 at 1:50 AM 廖嘉逸 <[hidden email]> wrote:
Hi, all
   I’m using Flink 1.6 and I’m goint to use RocksDBBackend as the checkpoint state backend. Before that, I need to make sure that the checkpoint can handle rescale. From the docs on Apache Flink https://ci.apache.org/projects/flink/flink-docs-release-1.6/ops/state/checkpoints.html, it says that the checkpoint do not support Flink’s rescaling, however, from the source code, the rescaling logic seems not to be related to the state backend if the substask’s states are restored from files successfully.
   Could anyone explain this? Or tell me if I miss something?

   

Best, Jiayi Liao