Re: streaming join implementation
Posted by
Balaji Rajagopalan on
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/streaming-join-implementation-tp6095p6096.html
You can implement join in flink (which is a inner join) the below mentioned pseudo code . The below join is for a 5 minute interval, yes will be some corners cases when the data coming after 5 minutes will be missed out in the join window, I actually had solved this problem but storing some data in redis and wrote correlation logic to take care of the corner cases that were missed out in the join window.
val output: DataStream[(OutputData)] = stream1.join(stream2).where(_.key1).equalTo(_.key2).
window(TumblingEventTimeWindows.of(Time.of(5, TimeUnit.MINUTE))).apply(new SomeJoinFunction)