Dear all,
I'm trying to run Flink ALS on the Yahoo-R2 data set [1] stored on HDFS. The program runs without reporting any errors, but it never finishes. The operators that keep running indefinitely are:
CoGroup (CoGroup at org.apache.flink.ml.recommendation.ALS$.updateFactors(ALS.scala:606))(11/240)
Join(Join at org.apache.flink.ml.recommendation.ALS$.updateFactors(ALS.scala:576))(15/240)
I ran the job with the following parameters:
val als = ALS()
  .setIterations(10)
  .setNumFactors(10)
  .setBlocks(100)
I did not set the HDFS temporary path. Can someone tell me which parameters I should set to run ALS on a data set this large? And why do these operators keep running indefinitely?
Best,
Ziyad