DataStream with one DataSource and two different Sinks with diff. schema.

Marke Builder
Hi,

what is the recommended way to implement the following use case with the DataStream API:
one data source, the same map() functions for parsing and normalization, then a different map() function for formatting each output, and two different sinks?

The same data must be stored in both sinks.
I would prefer a single job, since the source and the parsing/normalization map functions are shared.

How can/should I use the split() function for this use case?

Thanks!

Attachment: use-case.png (18K)

Re: DataStream with one DataSource and two different Sinks with diff. schema.

Hequn Cheng
Hi Marke,

You can use split() and select(), as shown here [1].
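
To make the shape of the job concrete, here is a minimal plain-Java sketch of the topology in the question: one source, one shared parse/normalize step, two format steps, two sinks. It deliberately does not use the Flink API. The class name, the two formats and the list-backed sinks are all hypothetical and only there for illustration. As far as I understand, in a Flink job the same fan-out is usually obtained by calling map() twice on the one normalized DataStream and attaching a sink to each result. With split(), the OutputSelector would have to return both output names for every record so that both select() branches see the same data.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.function.Consumer;

public class FanOutSketch {

    // Shared step: parsing and normalization, applied once per record.
    static String normalize(String raw) {
        return raw.trim().toLowerCase(Locale.ROOT);
    }

    // Hypothetical schema for the first sink: a prefixed, comma-separated line.
    static String formatForSinkA(String record) {
        return "A," + record;
    }

    // Hypothetical schema for the second sink: a tiny JSON object.
    static String formatForSinkB(String record) {
        return "{\"value\":\"" + record + "\"}";
    }

    // One pass over the source: every normalized record goes to BOTH sinks,
    // each with its own format step in front of it.
    static void run(List<String> source, Consumer<String> sinkA, Consumer<String> sinkB) {
        for (String raw : source) {
            String normalized = normalize(raw);
            sinkA.accept(formatForSinkA(normalized));
            sinkB.accept(formatForSinkB(normalized));
        }
    }

    public static void main(String[] args) {
        // In-memory lists stand in for the two real sinks.
        List<String> sinkA = new ArrayList<>();
        List<String> sinkB = new ArrayList<>();
        run(Arrays.asList("  Foo ", "BAR"), sinkA::add, sinkB::add);
        System.out.println(sinkA);
        System.out.println(sinkB);
    }
}
```

The point of the sketch is that normalization runs once per record and only the format step differs per sink, which is why a single job is enough.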

Best, Hequn


(In reply to the message from Marke Builder <[hidden email]> of Sat, Nov 10, 2018 at 12:23 AM.)