Re: Large number of sources in Flink Job
Posted by
Fabian Hueske-2 on
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Large-number-of-sources-in-Flink-Job-tp20360p20364.html
Hi Chirag,
There have been some issue with very large execution graphs.
You might need to adjust the default configuration and configure larger Akka buffers and/or timeouts.
Also, 2000 sources means that you run at least 2000 threads at once.
The FileInputFormat (and most of its sub-classes) in Flink 1.5.0 can be configured to accept multiple directories.
This would be a preferred approach to creating one source per directory.
Best, Fabian