Hi All,
We are trying to run our job on a cluster with the following specs:

1. # of machines: 16
2. Memory: 128 GB
3. # of cores: 48

However, when we try to run it we get this exception:

"insufficient number of network buffers. 48 required but only 10 available. the total number of network buffers is currently set to 2048"

After looking at the documentation we set the configuration based on the docs:

taskmanager.network.numberOfBuffers: # cores ^ 2 * # machines * 4

However, we then face another error from the JVM:

java.io.IOException: Cannot allocate network buffer pool: Could not allocate enough memory segments for NetworkBufferPool (required (Mb): 2304, allocated (Mb): 698, missing (Mb): 1606). Cause: Java heap space

We fiddled with taskmanager.heap.mb: 4096 and finally the cluster is running.

However, I'm still not sure the configuration and the fiddling with the task manager heap are really fine-tuned. So my questions are:

Am I doing it right for numberOfBuffers?
How much should we allocate to taskmanager.heap.mb given the information above?
Any suggestion which configuration we need to set to make it optimal for the cluster?
Is there any chance that this will get automatically resolved by the memory/network buffer manager?

Thanks a lot for the help.

Cheers
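(A quick back-of-the-envelope check, using the 32 KB buffer size Fabian mentions further down: the default of 2048 network buffers amounts to 2048 * 32 KB = 64 MB of network memory per task manager, which is easily exhausted once many parallel tasks exchange data at the same time.)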
Hi Welly,

sorry for the late response.

The number of network buffers primarily depends on the maximum parallelism of your job. The given formula assumes a specific cluster configuration (1 task manager per machine, one parallel task per CPU). The formula can be translated to:

taskmanager.network.numberOfBuffers: p ^ 2 * t * 4

where p is the maximum parallelism of the job and t is the number of task managers.

You can process more than one parallel task per TM if you configure more than one processing slot per machine (taskmanager.numberOfTaskSlots). The TM will divide its memory among all its slots. So it would be possible to start one TM for each machine with 100GB+ memory and 48 slots each.

We can compute the number of network buffers if you give a few more details about your setup:
- How many task managers do you start? I assume more than one TM per machine given that you assign only 4GB of memory out of 128GB to each TM.
- What is the maximum parallelism of your program?
- How many processing slots do you configure for each TM?

In general, pipelined shuffles with a high parallelism require a lot of memory. If you configure batch instead of pipelined transfer, the memory requirement goes down (ExecutionConfig.setExecutionMode(ExecutionMode.BATCH)).

Eventually, we want to merge the network buffer and the managed memory pools, so the "taskmanager.network.numberOfBuffers" configuration will hopefully disappear at some point in the future.

Best, Fabian

2016-02-19 9:34 GMT+01:00 Welly Tambunan <[hidden email]>:
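To make Fabian's pipelined-vs-batch remark concrete, here is a minimal sketch of where that call goes in a DataSet program. The class name, paths, and job logic are made up for illustration; only the setExecutionMode call is the point.

import org.apache.flink.api.common.ExecutionMode;
import org.apache.flink.api.java.DataSet;
import org.apache.flink.api.java.ExecutionEnvironment;

public class BatchTransferExample {

    public static void main(String[] args) throws Exception {
        ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();

        // Use blocking (batch) data exchange instead of the default pipelined
        // transfers, which reduces how many network buffers are needed at once.
        env.getConfig().setExecutionMode(ExecutionMode.BATCH);

        // Placeholder job with a shuffle (distinct); replace with the real program.
        DataSet<String> lines = env.readTextFile("hdfs:///path/to/input");
        lines.distinct().writeAsText("hdfs:///path/to/output");

        env.execute("batch transfer example");
    }
}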
Hi Welly,

I have to correct the formula I posted before:

taskmanager.network.numberOfBuffers: p ^ 2 * t * 4

p is NOT the parallelism of the job, BUT the number of slots of a task manager.

So if you configure one TM for each machine with 48 slots, you get: 48^2 * 16 * 4 = 147,456 buffers. With 32 KB per buffer you need 4.5 GB of network memory for each TM, i.e. 4.5 GB per machine.

If you configure 48 TMs for each machine with 1 slot each, you get: 1^2 * (48*16) * 4 = 3,072 buffers. With 32 KB per buffer that is 96 MB per TM and 4.5 GB per machine (with 48 TMs per machine).

Batch transfers are only possible for DataSet (batch) programs.

Hope this helps,
Fabian

2016-02-22 11:26 GMT+01:00 Fabian Hueske <[hidden email]>:
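For the first option (one TM per machine with 48 slots), the corresponding flink-conf.yaml entries would look roughly like this; the buffer count follows Fabian's calculation above, while the heap value is only a placeholder:

# One TaskManager per machine with 48 slots
taskmanager.numberOfTaskSlots: 48
taskmanager.network.numberOfBuffers: 147456   # 48^2 * 16 * 4, about 4.5 GB at 32 KB per buffer
taskmanager.heap.mb: 4096                     # placeholder; Ufuk later recommends a bigger heap on these 128 GB machines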
Hi Fabian,

Thanks a lot for your response. Currently we start 1 TM per machine with 48 task slots.

- What is the maximum parallelism of your program?
Parallelism is around 30 to 40.

- How many processing slots do you configure for each TM?
We configure 48 (# of cores) for each TM, one TM per machine.

But I would like to ask another question: is it better to start 48 task managers on one machine with 1 task slot each? Are there any trade-offs we should know about?

On Mon, Feb 22, 2016 at 5:26 PM, Fabian Hueske <[hidden email]> wrote:
Hi Fabian,

Previously, with Flink 0.9-0.10, we started the cluster in either streaming mode or batch mode. I see that this distinction is gone in the Flink 1.0 snapshot? So is this now taken care of and optimized by the Flink runtime?

On Mon, Feb 22, 2016 at 5:26 PM, Fabian Hueske <[hidden email]> wrote:
The new default is equivalent to the previous "streaming mode". The community decided to get rid of this distinction, because it was confusing to users.

The difference between "streaming mode" and "batch mode" was how Flink's managed memory was allocated: either lazily when required ("streaming mode") or eagerly on task manager start-up ("batch mode"). Now it's lazy by default.

This is not something you need to worry about, but if you are mostly using the DataSet API, where pre-allocation has benefits, you can get the "batch mode" behaviour with the following configuration key:

taskmanager.memory.preallocate: true

But you are using the DataStream API anyway, right?

– Ufuk

On Tue, Feb 23, 2016 at 6:36 AM, Welly Tambunan <[hidden email]> wrote:
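For DataSet-heavy setups where the old eager behaviour is preferable, the change Ufuk describes is a single flink-conf.yaml line (shown here only for completeness; the lazy default needs no entry at all):

# Pre-allocate Flink's managed memory at task manager start-up (the old "batch mode" behaviour)
taskmanager.memory.preallocate: true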
Hi Ufuk,

Thanks for the explanation. Yes, our jobs are all streaming jobs.

Cheers

On Tue, Feb 23, 2016 at 2:48 PM, Ufuk Celebi <[hidden email]> wrote:
Hi Ufuk and Fabian,

Is it better to start 48 task managers (one slot each) on one machine than to have a single task manager with 48 slots? Are there any trade-offs we should know about?

Cheers

On Tue, Feb 23, 2016 at 3:03 PM, Welly Tambunan <[hidden email]> wrote:
I would go with one task manager with 48 slots per machine. This reduces the communication overhead between task managers.

Regarding memory configuration: given that the machines have plenty of memory, I would configure a bigger heap than the 4 GB you had previously. Furthermore, you can also consider adding more network buffers, which should improve job throughput.

– Ufuk

On Tue, Feb 23, 2016 at 11:57 AM, Welly Tambunan <[hidden email]> wrote:
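As a rough sanity check of the memory budget implied by this recommendation (assuming, as the "Java heap space" error above suggests, that the network buffers are allocated from the task manager heap): 147,456 buffers * 32 KB ≈ 4.5 GB of network buffers per TM, so the heap has to cover those 4.5 GB plus managed memory and user code. With 128 GB per machine and a single TM, there is ample room to size taskmanager.heap.mb well above the earlier 4 GB.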
Hi Ufuk,

Thanks for this. Really appreciated.

Cheers

On Tue, Feb 23, 2016 at 8:04 PM, Ufuk Celebi <[hidden email]> wrote: