|
Hi Bill,
Flink's built-in aggregation functions are implemented against the same interface as UDAGGs and are applied in parallel. The performance depends of course on the implementation of the UDAGG. For example, you should try to keep the size of the accumulator as small as possible because it will be stored in the state backend. If you are using the RocksDBStatebackend, this means that the accumulator is de/serialized for every records. Best, Fabian
|