How to implement different Join strategies for a Flink stream application?

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

How to implement different Join strategies for a Flink stream application?

Felipe Gutierrez
Hi,
First I was trying to mimic the implementation of the IntervalJoinOperator which uses a KeyedStream > connect > keyBy > and transform. It was a little confusing to me because I was not sure if I had to use a window transformation after transform() or not.

Then I found this answer (https://stackoverflow.com/a/46052661/2096986) which uses keyBy > connect(keyBy()) > process(CoProcessFunction()) with some example here (https://github.com/ververica/flink-training-exercises/blob/master/src/main/java/com/dataartisans/flinktraining/exercises/datastream_java/process/EventTimeJoinExercise.java)

If I want to decide whether I use BradCastJoin or HashJoin or any other Join algorithm, which way do you think it is better? is there any other example code that I could borrow?

Thanks,
Felipe
--
-- Felipe Gutierrez
-- skype: felipe.o.gutierrez