Re: Distribute DataSet to subset of nodes
Posted by Fabian Hueske-2
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Distribute-DataSet-to-subset-of-nodes-tp2814p2832.html
Hi Stefan,
forcing tasks to be scheduled on specific nodes and reading files from the local file system in a multi-node setup is actually quite tricky and requires some understanding of the internals.
It is possible, and I can help you with that, but I would recommend using a shared filesystem such as HDFS if that is an option.
Best, Fabian