"myuid" in snapshot.readingstate

classic Classic list List threaded Threaded
6 messages Options
Reply | Threaded
Open this post in threaded view
|

"myuid" in snapshot.readingstate

Abdullah bin Omar
Hi,

when we readstate of of savepooint, we use, "myuid" as a argument of the function. For example,

DataSet<KeyedState> keyedState = savepoint.readKeyedState("my-uid", new ReaderFunction());

Question 1:

In [1] (line no 79), we get the "uid" with datastream. Then in [2] (line no 45), how can we use the "uid" that we have got from in [1]? Because in [2], there is no declaration of using the "uid" from [1]. 

Question 2:

what does it mean by "uid" in the datastream of [1]? is it something a unique user identification for each stream or for each state of the datastream?



Thank you


Reply | Threaded
Open this post in threaded view
|

Re: "myuid" in snapshot.readingstate

David Anderson-4
You can read about assigning unique IDs to stateful operators in the docs [1][2]. What the uid() method does is to establish a stable and unique identifier for a stateful operator. Then as you evolve your application, this helps ensure that future versions of your job will be able to restore savepoints taken by earlier versions and find the state they need, despite changes to the topology.

The two uids specified in those two independent jobs (the streaming RidesAndFaresSolution job and the batch ReadRidesAndFaresSnapshot job) must match -- in other words, the strings must be the same. But there's nothing in that example that ensures this will be the case.

Regards,
David

On Fri, Apr 30, 2021 at 4:52 PM Abdullah bin Omar <[hidden email]> wrote:
Hi,

when we readstate of of savepooint, we use, "myuid" as a argument of the function. For example,

DataSet<KeyedState> keyedState = savepoint.readKeyedState("my-uid", new ReaderFunction());

Question 1:

In [1] (line no 79), we get the "uid" with datastream. Then in [2] (line no 45), how can we use the "uid" that we have got from in [1]? Because in [2], there is no declaration of using the "uid" from [1]. 

Question 2:

what does it mean by "uid" in the datastream of [1]? is it something a unique user identification for each stream or for each state of the datastream?



Thank you


Reply | Threaded
Open this post in threaded view
|

Re: "myuid" in snapshot.readingstate

Abu Bakar Siddiqur Rahman Rocky
Hi David,

A quick question more 

I am trying to import org.apache.flink.training.exercises.common.sources.TaxiFareGenerator; However, it can not resolve.

What is dependency (in pom.xml) for the org.apache.flink.training?


Thank you

On Fri, Apr 30, 2021 at 10:12 AM David Anderson <[hidden email]> wrote:
You can read about assigning unique IDs to stateful operators in the docs [1][2]. What the uid() method does is to establish a stable and unique identifier for a stateful operator. Then as you evolve your application, this helps ensure that future versions of your job will be able to restore savepoints taken by earlier versions and find the state they need, despite changes to the topology.

The two uids specified in those two independent jobs (the streaming RidesAndFaresSolution job and the batch ReadRidesAndFaresSnapshot job) must match -- in other words, the strings must be the same. But there's nothing in that example that ensures this will be the case.

Regards,
David

On Fri, Apr 30, 2021 at 4:52 PM Abdullah bin Omar <[hidden email]> wrote:
Hi,

when we readstate of of savepooint, we use, "myuid" as a argument of the function. For example,

DataSet<KeyedState> keyedState = savepoint.readKeyedState("my-uid", new ReaderFunction());

Question 1:

In [1] (line no 79), we get the "uid" with datastream. Then in [2] (line no 45), how can we use the "uid" that we have got from in [1]? Because in [2], there is no declaration of using the "uid" from [1]. 

Question 2:

what does it mean by "uid" in the datastream of [1]? is it something a unique user identification for each stream or for each state of the datastream?



Thank you




--
Regards,
Abu Bakar Siddiqur Rahman

Reply | Threaded
Open this post in threaded view
|

Re: "myuid" in snapshot.readingstate

Abu Bakar Siddiqur Rahman Rocky
I am asking on behalf of my colleagues the previous question. Though we have a couple of questions that were in another email

On Fri, Apr 30, 2021 at 10:18 AM Abu Bakar Siddiqur Rahman Rocky <[hidden email]> wrote:
Hi David,

A quick question more 

I am trying to import org.apache.flink.training.exercises.common.sources.TaxiFareGenerator; However, it can not resolve.

What is dependency (in pom.xml) for the org.apache.flink.training?


Thank you

On Fri, Apr 30, 2021 at 10:12 AM David Anderson <[hidden email]> wrote:
You can read about assigning unique IDs to stateful operators in the docs [1][2]. What the uid() method does is to establish a stable and unique identifier for a stateful operator. Then as you evolve your application, this helps ensure that future versions of your job will be able to restore savepoints taken by earlier versions and find the state they need, despite changes to the topology.

The two uids specified in those two independent jobs (the streaming RidesAndFaresSolution job and the batch ReadRidesAndFaresSnapshot job) must match -- in other words, the strings must be the same. But there's nothing in that example that ensures this will be the case.

Regards,
David

On Fri, Apr 30, 2021 at 4:52 PM Abdullah bin Omar <[hidden email]> wrote:
Hi,

when we readstate of of savepooint, we use, "myuid" as a argument of the function. For example,

DataSet<KeyedState> keyedState = savepoint.readKeyedState("my-uid", new ReaderFunction());

Question 1:

In [1] (line no 79), we get the "uid" with datastream. Then in [2] (line no 45), how can we use the "uid" that we have got from in [1]? Because in [2], there is no declaration of using the "uid" from [1]. 

Question 2:

what does it mean by "uid" in the datastream of [1]? is it something a unique user identification for each stream or for each state of the datastream?



Thank you




--
Regards,
Abu Bakar Siddiqur Rahman



--
Regards,
Abu Bakar Siddiqur Rahman

Reply | Threaded
Open this post in threaded view
|

Re: "myuid" in snapshot.readingstate

David Anderson-4
In reply to this post by Abu Bakar Siddiqur Rahman Rocky
What is dependency (in pom.xml) for the org.apache.flink.training?

We don't publish artifacts for this repository. 

David

On Fri, Apr 30, 2021 at 5:19 PM Abu Bakar Siddiqur Rahman Rocky <[hidden email]> wrote:
Hi David,

A quick question more 

I am trying to import org.apache.flink.training.exercises.common.sources.TaxiFareGenerator; However, it can not resolve.

What is dependency (in pom.xml) for the org.apache.flink.training?


Thank you

On Fri, Apr 30, 2021 at 10:12 AM David Anderson <[hidden email]> wrote:
You can read about assigning unique IDs to stateful operators in the docs [1][2]. What the uid() method does is to establish a stable and unique identifier for a stateful operator. Then as you evolve your application, this helps ensure that future versions of your job will be able to restore savepoints taken by earlier versions and find the state they need, despite changes to the topology.

The two uids specified in those two independent jobs (the streaming RidesAndFaresSolution job and the batch ReadRidesAndFaresSnapshot job) must match -- in other words, the strings must be the same. But there's nothing in that example that ensures this will be the case.

Regards,
David

On Fri, Apr 30, 2021 at 4:52 PM Abdullah bin Omar <[hidden email]> wrote:
Hi,

when we readstate of of savepooint, we use, "myuid" as a argument of the function. For example,

DataSet<KeyedState> keyedState = savepoint.readKeyedState("my-uid", new ReaderFunction());

Question 1:

In [1] (line no 79), we get the "uid" with datastream. Then in [2] (line no 45), how can we use the "uid" that we have got from in [1]? Because in [2], there is no declaration of using the "uid" from [1]. 

Question 2:

what does it mean by "uid" in the datastream of [1]? is it something a unique user identification for each stream or for each state of the datastream?



Thank you




--
Regards,
Abu Bakar Siddiqur Rahman

Reply | Threaded
Open this post in threaded view
|

Re: "myuid" in snapshot.readingstate

Abu Bakar Siddiqur Rahman Rocky
Thank you, David. 


On Fri, Apr 30, 2021 at 10:31 AM David Anderson <[hidden email]> wrote:
What is dependency (in pom.xml) for the org.apache.flink.training?

We don't publish artifacts for this repository. 

David

On Fri, Apr 30, 2021 at 5:19 PM Abu Bakar Siddiqur Rahman Rocky <[hidden email]> wrote:
Hi David,

A quick question more 

I am trying to import org.apache.flink.training.exercises.common.sources.TaxiFareGenerator; However, it can not resolve.

What is dependency (in pom.xml) for the org.apache.flink.training?


Thank you

On Fri, Apr 30, 2021 at 10:12 AM David Anderson <[hidden email]> wrote:
You can read about assigning unique IDs to stateful operators in the docs [1][2]. What the uid() method does is to establish a stable and unique identifier for a stateful operator. Then as you evolve your application, this helps ensure that future versions of your job will be able to restore savepoints taken by earlier versions and find the state they need, despite changes to the topology.

The two uids specified in those two independent jobs (the streaming RidesAndFaresSolution job and the batch ReadRidesAndFaresSnapshot job) must match -- in other words, the strings must be the same. But there's nothing in that example that ensures this will be the case.

Regards,
David

On Fri, Apr 30, 2021 at 4:52 PM Abdullah bin Omar <[hidden email]> wrote:
Hi,

when we readstate of of savepooint, we use, "myuid" as a argument of the function. For example,

DataSet<KeyedState> keyedState = savepoint.readKeyedState("my-uid", new ReaderFunction());

Question 1:

In [1] (line no 79), we get the "uid" with datastream. Then in [2] (line no 45), how can we use the "uid" that we have got from in [1]? Because in [2], there is no declaration of using the "uid" from [1]. 

Question 2:

what does it mean by "uid" in the datastream of [1]? is it something a unique user identification for each stream or for each state of the datastream?



Thank you




--
Regards,
Abu Bakar Siddiqur Rahman



--
Regards,
Abu Bakar Siddiqur Rahman