Hi,
I'm trying to implement a custom FileInputFormat (to read the MNIST Dataset).
The creation of Flink DataSet (DataSet<byte[]> matrices) seems to be OK, but when I try to print it using either
matrices.print();
or
matrices.collect();
It finishes with exit code -17.
(Before, I compiled using Java 11 and aside from a reflection warning, this approach caused the program to run indefinitely. Now I use JDK 8)
The total number of elements is 60 000. Now the strange thing is that when I run
matrices.first(60000).print();
it does print the elements just fine. But my understanding is that these two approaches should work the same way, if there are exactly 60 000 records.
Is this a bug? Or something that can be explained by my extension of FileInputFormat (I might very well not use it correctly)?
Best regards,
Hynek