Re: Yahoo! Streaming Benchmark with Flink
Posted by Till Rohrmann
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Yahoo-Streaming-Benchmark-with-Flink-tp9768p9771.html
Hi Eric,
Concerning your first question: I think that AdvertisingTopologyFlinkStateHighKeyCard models a different scenario, one where you count the number of ads per campaign for a large number of campaigns. In this scenario, the input data already contains the campaign id for each ad. I think this is the job that belongs to the paragraph "Winning Twitter Hack Week: Eliminating the key-value store bottleneck".
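To illustrate the scenario, here is a minimal sketch in plain Python (not the Flink job itself). The event layout and the names `events` and `count_ads_per_campaign` are made up for illustration; the only point is that each event already carries its campaign id, so counting needs no extra lookup.

```python
from collections import Counter

# Hypothetical input: every ad event already contains its campaign id,
# so no join against an external key-value store is required.
events = [
    {"ad_id": "ad-1", "campaign_id": "campaign-a"},
    {"ad_id": "ad-2", "campaign_id": "campaign-b"},
    {"ad_id": "ad-3", "campaign_id": "campaign-a"},
]


def count_ads_per_campaign(events):
    """Count how many ad events belong to each campaign."""
    return Counter(event["campaign_id"] for event in events)


counts = count_ads_per_campaign(events)
```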
Concerning your second question: the response actor is registered at the registration service. The registration service exposes the akka URL of this actor under the index of the running task. When you run AkkaStateQuery, the registration service is queried to retrieve the akka URL, and then a query state request is sent to the response actor via the QueryActor. That is how the actor comes into play.
At the moment, the registration service is implemented using ZooKeeper. This means that the akka URL is written to ZooKeeper, from where it can be retrieved.
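The lookup flow can be sketched roughly as follows. This is a simplified Python model, not the actual benchmark code: the in-memory `RegistrationService` stands in for ZooKeeper, the `actors_by_url` dictionary stands in for akka's address resolution, and all class names, function names and the URL are assumptions made up for this example.

```python
class RegistrationService:
    """In-memory stand-in for the ZooKeeper-backed registration service."""

    def __init__(self):
        self._urls = {}  # task index -> akka URL of the response actor

    def register(self, task_index, akka_url):
        self._urls[task_index] = akka_url

    def lookup(self, task_index):
        return self._urls[task_index]


class ResponseActor:
    """Hypothetical response actor that answers queries from local state."""

    def __init__(self, state):
        self._state = state

    def query(self, key):
        return self._state.get(key)


def query_state(registry, actors_by_url, task_index, key):
    """Resolve the actor URL via the registry, then send it the query."""
    akka_url = registry.lookup(task_index)
    return actors_by_url[akka_url].query(key)


# Made-up URL for illustration only.
url = "akka.tcp://flink@host-0:6123/user/response-actor-0"
registry = RegistrationService()
registry.register(0, url)  # the task with index 0 registers its actor
actors_by_url = {url: ResponseActor({"campaign-42": 7})}

result = query_state(registry, actors_by_url, 0, "campaign-42")
```

The client only needs to know the task index; the actor's address is discovered through the registry at query time.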
I hope this answers your questions.
Cheers,
Till