Hi Wepngong,
This is an interesting proposal. There are indeed many streaming optimisations out there but, as Gyula said, we should focus on a few and engineer them in a nice way. Perhaps for the time being it makes sense to focus on a streaming job graph optimiser
that applies optimisations by statically analysing the graph before submitting it, i.e. query rewriting, operator reordering, operator sharing, intermediate result sharing, etc. A runtime optimiser that can do things like load balancing and online
reconfiguration would certainly be the next step.
cheers
Paris
Hey,
Of course, the aim of the project would not be to implement all possible optimizations, because that would be impossible in such a short time :)
It would be nice if one could carefully select the optimizations that would have the most impact on performance and implement those.
Regards,
Gyula