(DEPRECATED) Apache Flink User Mailing List archive.

Gelly forward

Classic

List

Threaded

8 messages Options

Flavio Pompermaier

Gelly forward

Hi to all,

is there a way in gelly to forward received messages (and modify their content before sending)?

Best,

Flavio

Vasiliki Kalavri

Re: Gelly forward

Hi Flavio!

Are you talking about vertex-centric iterations in gelly?

If yes, you can send messages to a particular vertex with "sendMessageTo(vertexId, msg)" and

to all neighbors with "sendMessageToAllNeighbors(msg)". These methods are available inside the MessagingFunction.

Accessing received messages happens inside the VertexUpdateFunction.

So, the usual way of writing these programs is to:

(1) go through received messages in the VertexUpdateFunction and compute the new vertex value based them

(2) compute and send messages in the MessagingFunction.

Would that work in you case?

Cheers,

Vasia.

On 8 July 2015 at 08:47, Flavio Pompermaier <[hidden email]> wrote:

Hi to all,
is there a way in gelly to forward received messages (and modify their content before sending)?

Best,
Flavio

Flavio Pompermaier

Re: Gelly forward

The problem is that my nodes have to gather data coming from some path of interest along the graph (depending on the type of the node), otherwise they just have to forward the received message adding their id to the message path (more or less). It's like a postal tracking system.

The problem is that I have to reset the "outbox" of each vertex once the messages have been sent..

Do you think that it makes sense in this case to have an outbox of messages (destination, message) in each vertex and reset it in the postSuperstep() of the VertexUpdateFunction?

On Wed, Jul 8, 2015 at 9:38 AM, Vasiliki Kalavri <[hidden email]> wrote:

Hi Flavio!

Are you talking about vertex-centric iterations in gelly?

If yes, you can send messages to a particular vertex with "sendMessageTo(vertexId, msg)" and
to all neighbors with "sendMessageToAllNeighbors(msg)". These methods are available inside the MessagingFunction.
Accessing received messages happens inside the VertexUpdateFunction.
So, the usual way of writing these programs is to:
(1) go through received messages in the VertexUpdateFunction and compute the new vertex value based them
(2) compute and send messages in the MessagingFunction.

Would that work in you case?

Cheers,
Vasia.

On 8 July 2015 at 08:47, Flavio Pompermaier <[hidden email]> wrote:

Hi to all,
is there a way in gelly to forward received messages (and modify their content before sending)?

Best,
Flavio

Vasiliki Kalavri

Re: Gelly forward

Is it the same message that you propagate or is it different for each vertex / neighbor? If you have to store a <neighborID, msg> pair for each neighbor, then you will quickly run out of memory. If it's the same message you need to send, or you simply need to add the current vertex Id, then you can probably get rid of the neighborID.

By "outbox" I believe you mean storing them in the vertex value, correct?
I don't think you will have to explicitly reset it, as in each superstep vertices only receive messages sent in the previous superstep, i.e. "old" messages don't get re-sent.

On Jul 8, 2015 9:48 AM, "Flavio Pompermaier" <[hidden email]> wrote:

The problem is that my nodes have to gather data coming from some path of interest along the graph (depending on the type of the node), otherwise they just have to forward the received message adding their id to the message path (more or less). It's like a postal tracking system.

The problem is that I have to reset the "outbox" of each vertex once the messages have been sent..
Do you think that it makes sense in this case to have an outbox of messages (destination, message) in each vertex and reset it in the postSuperstep() of the VertexUpdateFunction?

On Wed, Jul 8, 2015 at 9:38 AM, Vasiliki Kalavri <[hidden email]> wrote:
Hi Flavio!

Are you talking about vertex-centric iterations in gelly?

If yes, you can send messages to a particular vertex with "sendMessageTo(vertexId, msg)" and
to all neighbors with "sendMessageToAllNeighbors(msg)". These methods are available inside the MessagingFunction.
Accessing received messages happens inside the VertexUpdateFunction.
So, the usual way of writing these programs is to:
(1) go through received messages in the VertexUpdateFunction and compute the new vertex value based them
(2) compute and send messages in the MessagingFunction.

Would that work in you case?

Cheers,
Vasia.

On 8 July 2015 at 08:47, Flavio Pompermaier <[hidden email]> wrote:

Hi to all,
is there a way in gelly to forward received messages (and modify their content before sending)?

Best,
Flavio

Flavio Pompermaier

Re: Gelly forward

Let's say I have some nodes of some type of interest (type1 and type2).

My source data looks like <sourceNodeId, type, edgeName, destNodeId>.

For example, I could be interested in sourceNodes having type == Person to gather the value obtained from the expansion of some paths (for example livesIn.name and marriedTo.name). Notice that I could define other paths of interest for different types.. (For nodes of type Place I could be interested in gathering containedIn.name).

If my source data looks like:

(1, Person, livesIn, 2)

(1, Person, livesIn, 3)

(2, Place, name, "Rome")

(2, Place, lat, 101.3)

(2, Place, long, 101.3)

(3, Place, name, "Berlin")

(3, Place, containedIn, 4)

(4, Country, name, "Germany")

I'd like that node 1 (in the end) collect the following paths:

livesIn.name : "Rome" (from node 2)

livesIn.name : "Berlin" (from node 3)

livesIn.containedIn.name: "Germany" (from node 4)

marriedTo.name : null because not married :)

So, in my vertexCentricIteration each Vertex knows its neighbors (e.g. node 1 knows that 2 and 3 should be queried for their "name" attribute).

If they receive a message asking for "name" from node 1 they have to reply to node 1 with the value of that property.

So in my implementation, I check whether my node has to send some query to neighbors and wait for the response. The problem is that node 3 for example, once queried for property containedIn.name from node 1 it just have to forward this path to node 4 and thell to 4 to reply to 1.

Is that possible?

On Wed, Jul 8, 2015 at 10:19 AM, Vasiliki Kalavri <[hidden email]> wrote:

Is it the same message that you propagate or is it different for each vertex / neighbor? If you have to store a <neighborID, msg> pair for each neighbor, then you will quickly run out of memory. If it's the same message you need to send, or you simply need to add the current vertex Id, then you can probably get rid of the neighborID.

By "outbox" I believe you mean storing them in the vertex value, correct?
I don't think you will have to explicitly reset it, as in each superstep vertices only receive messages sent in the previous superstep, i.e. "old" messages don't get re-sent.

On Jul 8, 2015 9:48 AM, "Flavio Pompermaier" <[hidden email]> wrote:
The problem is that my nodes have to gather data coming from some path of interest along the graph (depending on the type of the node), otherwise they just have to forward the received message adding their id to the message path (more or less). It's like a postal tracking system.

The problem is that I have to reset the "outbox" of each vertex once the messages have been sent..
Do you think that it makes sense in this case to have an outbox of messages (destination, message) in each vertex and reset it in the postSuperstep() of the VertexUpdateFunction?

On Wed, Jul 8, 2015 at 9:38 AM, Vasiliki Kalavri <[hidden email]> wrote:
Hi Flavio!

Are you talking about vertex-centric iterations in gelly?

If yes, you can send messages to a particular vertex with "sendMessageTo(vertexId, msg)" and
to all neighbors with "sendMessageToAllNeighbors(msg)". These methods are available inside the MessagingFunction.
Accessing received messages happens inside the VertexUpdateFunction.
So, the usual way of writing these programs is to:
(1) go through received messages in the VertexUpdateFunction and compute the new vertex value based them
(2) compute and send messages in the MessagingFunction.

Would that work in you case?

Cheers,
Vasia.

On 8 July 2015 at 08:47, Flavio Pompermaier <[hidden email]> wrote:

Hi to all,
is there a way in gelly to forward received messages (and modify their content before sending)?

Best,
Flavio

Vasiliki Kalavri

Re: Gelly forward

Hi Flavio,

yes, I think it's possible. I have one question before I try to explain how:

do you model "Rome", "Berlin", "101.3" etc. in your above example as separate vertices or as properties of vertices?

On 8 July 2015 at 10:43, Flavio Pompermaier <[hidden email]> wrote:

Let's say I have some nodes of some type of interest (type1 and type2).
My source data looks like <sourceNodeId, type, edgeName, destNodeId>.
For example, I could be interested in sourceNodes having type == Person to gather the value obtained from the expansion of some paths (for example livesIn.name and marriedTo.name). Notice that I could define other paths of interest for different types.. (For nodes of type Place I could be interested in gathering containedIn.name).
If my source data looks like:

(1, Person, livesIn, 2)
(1, Person, livesIn, 3)
(2, Place, name, "Rome")
(2, Place, lat, 101.3)
(2, Place, long, 101.3)
(3, Place, name, "Berlin")
(3, Place, containedIn, 4)
(4, Country, name, "Germany")

I'd like that node 1 (in the end) collect the following paths:

livesIn.name : "Rome" (from node 2)
livesIn.name : "Berlin" (from node 3)
livesIn.containedIn.name: "Germany" (from node 4)
marriedTo.name : null because not married :)

So, in my vertexCentricIteration each Vertex knows its neighbors (e.g. node 1 knows that 2 and 3 should be queried for their "name" attribute).
If they receive a message asking for "name" from node 1 they have to reply to node 1 with the value of that property.

So in my implementation, I check whether my node has to send some query to neighbors and wait for the response. The problem is that node 3 for example, once queried for property containedIn.name from node 1 it just have to forward this path to node 4 and thell to 4 to reply to 1.

Is that possible?

On Wed, Jul 8, 2015 at 10:19 AM, Vasiliki Kalavri <[hidden email]> wrote:
Is it the same message that you propagate or is it different for each vertex / neighbor? If you have to store a <neighborID, msg> pair for each neighbor, then you will quickly run out of memory. If it's the same message you need to send, or you simply need to add the current vertex Id, then you can probably get rid of the neighborID.

By "outbox" I believe you mean storing them in the vertex value, correct?
I don't think you will have to explicitly reset it, as in each superstep vertices only receive messages sent in the previous superstep, i.e. "old" messages don't get re-sent.

On Jul 8, 2015 9:48 AM, "Flavio Pompermaier" <[hidden email]> wrote:
The problem is that my nodes have to gather data coming from some path of interest along the graph (depending on the type of the node), otherwise they just have to forward the received message adding their id to the message path (more or less). It's like a postal tracking system.

The problem is that I have to reset the "outbox" of each vertex once the messages have been sent..
Do you think that it makes sense in this case to have an outbox of messages (destination, message) in each vertex and reset it in the postSuperstep() of the VertexUpdateFunction?

On Wed, Jul 8, 2015 at 9:38 AM, Vasiliki Kalavri <[hidden email]> wrote:
Hi Flavio!

Are you talking about vertex-centric iterations in gelly?

If yes, you can send messages to a particular vertex with "sendMessageTo(vertexId, msg)" and
to all neighbors with "sendMessageToAllNeighbors(msg)". These methods are available inside the MessagingFunction.
Accessing received messages happens inside the VertexUpdateFunction.
So, the usual way of writing these programs is to:
(1) go through received messages in the VertexUpdateFunction and compute the new vertex value based them
(2) compute and send messages in the MessagingFunction.

Would that work in you case?

Cheers,
Vasia.

On 8 July 2015 at 08:47, Flavio Pompermaier <[hidden email]> wrote:

Hi to all,
is there a way in gelly to forward received messages (and modify their content before sending)?

Best,
Flavio

Flavio Pompermaier

Re: Gelly forward

In my Implementation every vertex has its own bags of "knowledge" (i.e. it knows all tuples belonging to it..).

So in Vertex 1 I have a field (an HashMap) containing the following info:

type=Person
livesIn=2 (and I know also that 2 is a vertexId)

In Vertex 3 I know:

type=Place
name=Berlin
containedIn=4 (and I know also that 4 is a vertexId)

and so on..

On Wed, Jul 8, 2015 at 2:29 PM, Vasiliki Kalavri <[hidden email]> wrote:

Hi Flavio,

yes, I think it's possible. I have one question before I try to explain how:
do you model "Rome", "Berlin", "101.3" etc. in your above example as separate vertices or as properties of vertices?

On 8 July 2015 at 10:43, Flavio Pompermaier <[hidden email]> wrote:
Let's say I have some nodes of some type of interest (type1 and type2).
My source data looks like <sourceNodeId, type, edgeName, destNodeId>.
For example, I could be interested in sourceNodes having type == Person to gather the value obtained from the expansion of some paths (for example livesIn.name and marriedTo.name). Notice that I could define other paths of interest for different types.. (For nodes of type Place I could be interested in gathering containedIn.name).
If my source data looks like:

(1, Person, livesIn, 2)
(1, Person, livesIn, 3)
(2, Place, name, "Rome")
(2, Place, lat, 101.3)
(2, Place, long, 101.3)
(3, Place, name, "Berlin")
(3, Place, containedIn, 4)
(4, Country, name, "Germany")

I'd like that node 1 (in the end) collect the following paths:

livesIn.name : "Rome" (from node 2)
livesIn.name : "Berlin" (from node 3)
livesIn.containedIn.name: "Germany" (from node 4)
marriedTo.name : null because not married :)

So, in my vertexCentricIteration each Vertex knows its neighbors (e.g. node 1 knows that 2 and 3 should be queried for their "name" attribute).
If they receive a message asking for "name" from node 1 they have to reply to node 1 with the value of that property.

So in my implementation, I check whether my node has to send some query to neighbors and wait for the response. The problem is that node 3 for example, once queried for property containedIn.name from node 1 it just have to forward this path to node 4 and thell to 4 to reply to 1.

Is that possible?

On Wed, Jul 8, 2015 at 10:19 AM, Vasiliki Kalavri <[hidden email]> wrote:
Is it the same message that you propagate or is it different for each vertex / neighbor? If you have to store a <neighborID, msg> pair for each neighbor, then you will quickly run out of memory. If it's the same message you need to send, or you simply need to add the current vertex Id, then you can probably get rid of the neighborID.

By "outbox" I believe you mean storing them in the vertex value, correct?
I don't think you will have to explicitly reset it, as in each superstep vertices only receive messages sent in the previous superstep, i.e. "old" messages don't get re-sent.

On Jul 8, 2015 9:48 AM, "Flavio Pompermaier" <[hidden email]> wrote:
The problem is that my nodes have to gather data coming from some path of interest along the graph (depending on the type of the node), otherwise they just have to forward the received message adding their id to the message path (more or less). It's like a postal tracking system.

The problem is that I have to reset the "outbox" of each vertex once the messages have been sent..
Do you think that it makes sense in this case to have an outbox of messages (destination, message) in each vertex and reset it in the postSuperstep() of the VertexUpdateFunction?

On Wed, Jul 8, 2015 at 9:38 AM, Vasiliki Kalavri <[hidden email]> wrote:
Hi Flavio!

Are you talking about vertex-centric iterations in gelly?

If yes, you can send messages to a particular vertex with "sendMessageTo(vertexId, msg)" and
to all neighbors with "sendMessageToAllNeighbors(msg)". These methods are available inside the MessagingFunction.
Accessing received messages happens inside the VertexUpdateFunction.
So, the usual way of writing these programs is to:
(1) go through received messages in the VertexUpdateFunction and compute the new vertex value based them
(2) compute and send messages in the MessagingFunction.

Would that work in you case?

Cheers,
Vasia.

On 8 July 2015 at 08:47, Flavio Pompermaier <[hidden email]> wrote:

Hi to all,
is there a way in gelly to forward received messages (and modify their content before sending)?

Best,
Flavio

Vasiliki Kalavri

Re: Gelly forward

Thanks Flavio.

Here's my suggestion. If your paths are short and don't vary much, it might make sense to do this using the DataSet API. The iteration will probably be too much overhead.

For example, if you want to find the paths for "livesIn.containedIn.name", then you can create 3 DataSets, one for each path hop, by simply filtering by edgeName. Then, you can easily create the final "messages", joining the first 2 DataSets (get the "livesIn.containedIn" path) and then joining the output with the third.

If your paths are long and you think that the above solution doesn't work, then you could use iterations.

First, I think you will need to have a way to check whether an attribute exists in a path and in which position. Using the same example, if you have the path "livesIn.containedIn.name", then you need a method that would get a path hop, e.g. "containedIn" and return the hop number, e.g. 1 for "livesIn", 2 for "containedIn", 3 for "name" (maybe -1 if it's not there).

Once you have this, then you can use it to check whether to send a message over an edge or not.

Your Messaging function could look like this:

for (edge: edges) {

if (getPathHop(edge.getValue()) == getSuperstep) { // forward the msg

sendMessageTo(edge.getTarget(), msg);

}

Like this, in superstep 1 you will send a msg over edges labeled as "livesIn", in superstep 2 you will send a message over the edges labeled as "containedIn" and in superstep 3 you will find the ones with "name".

Would this work for you?

Cheers,

Vasia.

On 8 July 2015 at 14:52, Flavio Pompermaier <[hidden email]> wrote:

In my Implementation every vertex has its own bags of "knowledge" (i.e. it knows all tuples belonging to it..).
So in Vertex 1 I have a field (an HashMap) containing the following info:
type=Person
livesIn=2 (and I know also that 2 is a vertexId)
In Vertex 3 I know:
type=Place
name=Berlin
containedIn=4 (and I know also that 4 is a vertexId)
and so on..

On Wed, Jul 8, 2015 at 2:29 PM, Vasiliki Kalavri <[hidden email]> wrote:
Hi Flavio,

yes, I think it's possible. I have one question before I try to explain how:
do you model "Rome", "Berlin", "101.3" etc. in your above example as separate vertices or as properties of vertices?

On 8 July 2015 at 10:43, Flavio Pompermaier <[hidden email]> wrote:
Let's say I have some nodes of some type of interest (type1 and type2).
My source data looks like <sourceNodeId, type, edgeName, destNodeId>.
For example, I could be interested in sourceNodes having type == Person to gather the value obtained from the expansion of some paths (for example livesIn.name and marriedTo.name). Notice that I could define other paths of interest for different types.. (For nodes of type Place I could be interested in gathering containedIn.name).
If my source data looks like:

(1, Person, livesIn, 2)
(1, Person, livesIn, 3)
(2, Place, name, "Rome")
(2, Place, lat, 101.3)
(2, Place, long, 101.3)
(3, Place, name, "Berlin")
(3, Place, containedIn, 4)
(4, Country, name, "Germany")

I'd like that node 1 (in the end) collect the following paths:

livesIn.name : "Rome" (from node 2)
livesIn.name : "Berlin" (from node 3)
livesIn.containedIn.name: "Germany" (from node 4)
marriedTo.name : null because not married :)

So, in my vertexCentricIteration each Vertex knows its neighbors (e.g. node 1 knows that 2 and 3 should be queried for their "name" attribute).
If they receive a message asking for "name" from node 1 they have to reply to node 1 with the value of that property.

So in my implementation, I check whether my node has to send some query to neighbors and wait for the response. The problem is that node 3 for example, once queried for property containedIn.name from node 1 it just have to forward this path to node 4 and thell to 4 to reply to 1.

Is that possible?

On Wed, Jul 8, 2015 at 10:19 AM, Vasiliki Kalavri <[hidden email]> wrote:
Is it the same message that you propagate or is it different for each vertex / neighbor? If you have to store a <neighborID, msg> pair for each neighbor, then you will quickly run out of memory. If it's the same message you need to send, or you simply need to add the current vertex Id, then you can probably get rid of the neighborID.

By "outbox" I believe you mean storing them in the vertex value, correct?
I don't think you will have to explicitly reset it, as in each superstep vertices only receive messages sent in the previous superstep, i.e. "old" messages don't get re-sent.

On Jul 8, 2015 9:48 AM, "Flavio Pompermaier" <[hidden email]> wrote:
The problem is that my nodes have to gather data coming from some path of interest along the graph (depending on the type of the node), otherwise they just have to forward the received message adding their id to the message path (more or less). It's like a postal tracking system.

The problem is that I have to reset the "outbox" of each vertex once the messages have been sent..
Do you think that it makes sense in this case to have an outbox of messages (destination, message) in each vertex and reset it in the postSuperstep() of the VertexUpdateFunction?

On Wed, Jul 8, 2015 at 9:38 AM, Vasiliki Kalavri <[hidden email]> wrote:
Hi Flavio!

Are you talking about vertex-centric iterations in gelly?

If yes, you can send messages to a particular vertex with "sendMessageTo(vertexId, msg)" and
to all neighbors with "sendMessageToAllNeighbors(msg)". These methods are available inside the MessagingFunction.
Accessing received messages happens inside the VertexUpdateFunction.
So, the usual way of writing these programs is to:
(1) go through received messages in the VertexUpdateFunction and compute the new vertex value based them
(2) compute and send messages in the MessagingFunction.

Would that work in you case?

Cheers,
Vasia.

On 8 July 2015 at 08:47, Flavio Pompermaier <[hidden email]> wrote:

Hi to all,
is there a way in gelly to forward received messages (and modify their content before sending)?

Best,
Flavio