Hi all,
I am working on a flink archiver application. In a gist this application tries to reads a bunch of schematized messages from kafka and archives them to s3. Due to the nature of the naming of the files, I had to go towards a custom sink implementation. As of the current progress the application in general is able to archive files to s3 ok. I am having some issues during the recovery phase. A sample of the code can be found on link. My issue is on recovery when initializeState is called, it is not able to get(recover) the last checkpointed ListState( i.e. checkpointedMessages is 0). I think this might be because of the way I am retrieving the checkpointed messages. Could someone please point me to what is wrong? or direct me to some examples which do a similar thing( Please note Message class is our own implementation) Thanks, Vipul |
Hi,
How are you testing the recovery behaviour? Are you taking a savepoint ,then shutting down, and then restarting the Job from the savepoint? Best, Aljoscha
|
Hi Aljoscha, I am running the application till a few checkpoints are complete. I am stopping the application between two checkpoints, so there will be messages in the list state, which should be checkpointed when snapshot is called. I am able to see a checkpoint file on S3( I am saving the checkpoints on s3 using rockstatedb). On restarting the application, I add a debug point here, to see if there are any messages in checkpointedMessages, but as shown below, the list is empty. Do you think there might be an error in the way I am trying to retrieve messages?
On Mon, Aug 28, 2017 at 2:52 AM, Aljoscha Krettek <[hidden email]> wrote:
Thanks, Vipul |
Hi,
If you are not manually doing a savepoint and then restoring from that savepoint you will not restore state. Simply stopping a job and then restarting will not restore state. The regular checkpoints are only used for recovery if a job fails, not for a user-induced shutdown. Best, Aljoscha
|
Free forum by Nabble | Edit this page |