Hi Jiewen,
Since a DataStream can have infinite number of elements, you can't globally sorted all the elements.
If the number of element is finite, you can use the DataSet API, which will look smth like this
DataSet<List<comparable_pojo>> a;
DataSet<comparable_pojo> aFlatten = a.flatMap(..);
DataSet<comparable_pojo> aSorted = aFlatten.partitionByRange(...).sortPartition(...);
Best regards.
Kien
In Flink, I have DataStream<List<comparable_pojo>>, each list is individually pre-sorted, what I need to do is persist everything in one shot with global sort order. any ides the best to do this? Hope it makes sense.
Thanks in advance!
Free forum by Nabble | Edit this page |