Hi, If I have to aggregate a value in a stream of records, which one of the below approaches will be the most/least efficient?
Please rate the three approaches according to their efficiency. Regards, Teena |
Hi Teena If you are proceeding with point 3, no doubt it will add some overhead but major significance is that you are persisting the state as per some key. so there will not be data loss in case of the job failure. On Thu, Apr 19, 2018 at 11:45 AM, Teena Kappen // BPRISE <[hidden email]> wrote:
|
Hi Teena, I'd go with approach 2. The performance difference shouldn't be significant compared to 1. but it is much easier to implement, IMO.Flink's checkpointing mechanism (as used by 1. and 2. ) gives you better consistency and protection against failures than what you can achieve with 3. 2018-04-19 8:42 GMT+02:00 Puneet Kinra <[hidden email]>:
|
Free forum by Nabble | Edit this page |