Hello all,
1st question:
Is there any way to get the count or the content of a Flink DataSet
without using count() or collect()? The problem is that I have a loop
whose number of iterations depends on the count of a DataSet. Calling
count() may force the whole pipeline to be executed again. I would
prefer not to use delta or bulk iterations.
2nd question:
When I apply the "Aggregations.Max" aggregation to the second field of
a DataSet of Tuple2<String, Integer>, the second field of the result is
the true maximum of the whole dataset, but the first field is not the
one that belongs to the same record as that maximum.
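
To make the second question concrete, here is a minimal plain-Java
sketch (no Flink involved) of the difference between the two results.
The class and method names (MaxSketch, fieldMax, recordMax) are
hypothetical, and carrying the first field over from the last record
seen is only an assumption made for this illustration, not a claim about
how Flink is implemented.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Arrays;
import java.util.List;
import java.util.Map.Entry;

public class MaxSketch {

    // Field-wise maximum: only the integer field is aggregated. The string
    // field is carried over from the last record seen (an assumption made
    // for this illustration), so it need not belong to the maximum.
    public static Entry<String, Integer> fieldMax(List<Entry<String, Integer>> rows) {
        String carried = null;
        int max = Integer.MIN_VALUE;
        for (Entry<String, Integer> row : rows) {
            carried = row.getKey();
            max = Math.max(max, row.getValue());
        }
        return new SimpleEntry<String, Integer>(carried, max);
    }

    // Record-wise maximum: the whole record holding the largest integer
    // field is returned, so both fields come from the same record.
    public static Entry<String, Integer> recordMax(List<Entry<String, Integer>> rows) {
        Entry<String, Integer> best = null;
        for (Entry<String, Integer> row : rows) {
            if (best == null || row.getValue() > best.getValue()) {
                best = row;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<Entry<String, Integer>> rows = Arrays.asList(
                new SimpleEntry<String, Integer>("a", 1),
                new SimpleEntry<String, Integer>("b", 7),
                new SimpleEntry<String, Integer>("c", 3));

        System.out.println(fieldMax(rows));  // prints c=7
        System.out.println(recordMax(rows)); // prints b=7
    }
}
```

The first output looks like what I observe (correct maximum, unrelated
first field); the second is what I expected to get.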
Best,
Alieh