Re: Calling external services/databases from DataStream API
Posted by Stephan Ewen
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Calling-external-services-databases-from-DataStream-API-tp11366p11370.html
Hi!
The distributed cache would indeed be a nice addition to the DataStream API. Since the runtime parts for it are all in place, the work would be mainly on the "client" side, which sets up the JobGraph to be submitted and executed.
For the problem of scaling this, there are two solutions that I can see:
(2) More elaborate: Convert the lookup database into a "changelog stream" and make the enrichment operation a "stream-to-stream" join.
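To make option (2) a bit more concrete, here is a minimal sketch in plain Python. It is not Flink code; it only simulates the idea on a single, already merged sequence of records. The names `Change`, `Event`, and `enrich` are hypothetical and chosen for illustration. The assumption is that every update to the lookup database arrives as a changelog record (an upsert, or a delete when the value is `None`), and that each main-stream event is enriched from a local copy of the table that is built only from those records, so no external call is made per event.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, Optional, Tuple, Union


@dataclass(frozen=True)
class Change:
    """Hypothetical changelog record: upsert of a lookup row, or delete if value is None."""
    key: str
    value: Optional[str]


@dataclass(frozen=True)
class Event:
    """Hypothetical main-stream record that should be enriched."""
    key: str
    payload: str


def enrich(
    stream: Iterable[Union[Change, Event]],
) -> Iterator[Tuple[str, str, Optional[str]]]:
    """Join events against the latest lookup value seen so far for their key."""
    # Local copy of the lookup table, rebuilt purely from the changelog records.
    table: Dict[str, str] = {}
    for record in stream:
        if isinstance(record, Change):
            if record.value is None:
                # Delete: drop the row if it exists.
                table.pop(record.key, None)
            else:
                # Upsert: insert or overwrite the row.
                table[record.key] = record.value
        else:
            # Event: enrich from local state; None if no row is known for the key.
            yield (record.key, record.payload, table.get(record.key))


# Both inputs interleaved in arrival order (assumed ordering, for illustration only).
merged = [
    Change("u1", "Berlin"),
    Event("u1", "click"),
    Change("u1", "Paris"),
    Event("u1", "view"),
    Event("u2", "click"),
    Change("u1", None),
    Event("u1", "logout"),
]
results = list(enrich(merged))
```

In an actual job, I would expect both streams to be partitioned by the same key and the table to live in keyed operator state, so that each parallel instance only holds its share of the lookup data. That partitioning is what addresses the scaling problem; the sketch above leaves it out.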
Greetings,
Stephan