Hi everybody
In a DeltaIteration I have a DataSet<Tuple3<K, V, HashSet<K>>> where, at a certain point of the iteration, i need to count the total number of tuples and the total number of elements in the HashSet of each tuple, and then send both value to the ConvergenceCriterion function.
Example:
this is the content of my DataSet:
(1,2,[2,3])
(2,1,[3,4])
(3,2,[4,5])
i should have:
first count: 3 (1,2,3)
second count: 4 (2,3,4,5)
i tried to iterate the dataset through a flatMap and exploit so an aggregator, putting an HashSet into it(Aggregator), but it didn’t work!
Do you have any suggestion??
thanks
Riccardo