Flink sql task failure recovery does not work.

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

Flink sql task failure recovery does not work.

1095193290@qq.com
Hi team,
   Following Task Failure Recovery document  https://ci.apache.org/projects/flink/flink-docs-release-1.13/docs/dev/execution/task_failure_recovery/ ,  I have  enabled state.checkpoints.dir parameter in flink-conf.yaml.
 
    state.checkpoints.dir: hdfs://172.16.1.192:9000/flink-checkpoints
    state.savepoints.dir: hdfs://172.16.1.192:9000/flink-savepoints
    
    However, Flink sql Task still throw Exception org.apache.flink.runtime.JobException: Recovery is suppressed by NoRestartBackoffTimeStrategy.  Apparently, no restart strategy enabled. Do we have enabled other configuration rather than state.checkpoints.dir.  Thanks for any suggestions.


Reply | Threaded
Open this post in threaded view
|

Re: Flink sql task failure recovery does not work.

Guowei Ma
Hi 
I think you could configure some restart strategy[1] likes 
restart-strategy: fixed-delay


On Thu, May 13, 2021 at 12:02 PM [hidden email] <[hidden email]> wrote:
Hi team,
   Following Task Failure Recovery document  https://ci.apache.org/projects/flink/flink-docs-release-1.13/docs/dev/execution/task_failure_recovery/ ,  I have  enabled state.checkpoints.dir parameter in flink-conf.yaml.
 
    state.checkpoints.dir: hdfs://172.16.1.192:9000/flink-checkpoints
    state.savepoints.dir: hdfs://172.16.1.192:9000/flink-savepoints
    
    However, Flink sql Task still throw Exception org.apache.flink.runtime.JobException: Recovery is suppressed by NoRestartBackoffTimeStrategy.  Apparently, no restart strategy enabled. Do we have enabled other configuration rather than state.checkpoints.dir.  Thanks for any suggestions.