Hi Sylvain,
What you could do, for example, is load a static data set (e.g. from HDFS) in the open method of your comparator and cache it there. The open method is called once per task, when the task is created. The comparator could then be implemented as a RichMapFunction. If you make the field that stores the small data set static, you can even share the data among all tasks that run on the same TaskManager.
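
Here is a rough sketch of the static-field part, written in plain Java so it does not depend on a particular Flink version. SharedDataSet, getOrLoad and the Map<String, String> type are made-up names for illustration, and the loader stands in for whatever code reads your file from HDFS. The idea would be to call getOrLoad from the open method of your rich function and keep the returned map in an instance field.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

/** Hypothetical helper: holds the small data set once per JVM, i.e. once per TaskManager. */
final class SharedDataSet {

    // Static, so every task running in this JVM sees the same instance.
    // Volatile, so a fully built map is visible to the other task threads.
    private static volatile Map<String, String> cache;

    private SharedDataSet() {
    }

    /**
     * Returns the cached data set, loading it on the first call only.
     * The loader is an assumption: it represents the code that reads the data, e.g. from HDFS.
     */
    static Map<String, String> getOrLoad(Supplier<Map<String, String>> loader) {
        Map<String, String> local = cache;
        if (local == null) {
            synchronized (SharedDataSet.class) {
                local = cache;
                if (local == null) {
                    // Read-only copy, so no task can modify what the others see.
                    local = Collections.unmodifiableMap(new HashMap<>(loader.get()));
                    cache = local;
                }
            }
        }
        return local;
    }
}
```

The synchronized block is there because several tasks on the same TaskManager run as threads in one JVM and may reach open at the same time. Without it, the data set could be loaded more than once.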
Cheers, Till