gelly scatter/gather

Alieh
Hi all,

I have an iterative algorithm implemented with Gelly scatter/gather.
When I run it on 8 workers of a cluster, the job fails with
"akka.pattern.AskTimeoutException", which I suspect is caused by the heap
size. Surprisingly, with 4 workers of the same cluster the program runs
to completion. With 4 workers there seems to be some I/O, but why can't
I run the job with 8 workers?

Best,

Alieh

Re: gelly scatter/gather

Kaepke, Marc
Hi Alieh,

I can't solve your problem yet, but I also work with Gelly and scatter/gather, and later with GSA too. Could you publish or show me your algorithm?

Best from Hamburg
Marc

> On 28. Apr 2017, at 13:58, Alieh <[hidden email]> wrote:
>
> Hi all
>
> I have an iterative algorithm implemented with Gelly scatter/gather. When I run it on 8 workers of a cluster, the job fails with "akka.pattern.AskTimeoutException", which I suspect is caused by the heap size. Surprisingly, with 4 workers of the same cluster the program runs to completion. With 4 workers there seems to be some I/O, but why can't I run the job with 8 workers?
>
> Best,
>
> Alieh
>

Re: gelly scatter/gather

Till Rohrmann

Hi Alieh,

Where exactly do you see the AskTimeoutException? Could you share the complete stack trace and the logs with us? Also, which version of Flink are you running?

Cheers,
Till


On Fri, Apr 28, 2017 at 2:13 PM, Kaepke, Marc <[hidden email]> wrote:
Hi Alieh,

I can't solve your problem yet, but I also work with Gelly and scatter/gather, and later with GSA too. Could you publish or show me your algorithm?

Best from Hamburg
Marc
