Does flink require all the map tasks to finish before the reducers can proceed like Spark, or can the reducer operations start before all the mappers have finished like the older Hadoop mapreduce.
Also my understanding is that flink manages it's own heap, do you/we have a sense of the performance impact of this as compared to say …. Spark where it's all in the JVM.
Regards,
Bill.
--
Jonathan (Bill) Sparks
Software Architecture
Cray Inc.