Hi Stefania!
I think there is no hook for that right now. If I understand you correctly, assuming you run YARN or so, you want to give the sources a set of hostnames, and when scheduling, the sources have preferences for those nodes.
Within a dataflow program (job), Flink will attempt to co-locate operations to minimize network traffic.
Greetings,
Stephan