Dataset statistics

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

Dataset statistics

Flavio Pompermaier
Hi to all,
is there any effort to standardize descriptive statistics in Apache Flink?
Is there any suggested way to achieve this?

Best,
Flavio
Reply | Threaded
Open this post in threaded view
|

Re: Dataset statistics

Flavio Pompermaier
No effort in this direction, then?
I had a try using SQL on Table API but I fear that the generated plan is not the optimal one..I'm looking for an efficient way to implement describe() method on a table or dataset/datasource

On Fri, Feb 8, 2019 at 10:35 AM Flavio Pompermaier <[hidden email]> wrote:
Hi to all,
is there any effort to standardize descriptive statistics in Apache Flink?
Is there any suggested way to achieve this?

Best,
Flavio

Reply | Threaded
Open this post in threaded view
|

Re: Dataset statistics

Flavio Pompermaier
We've just published a first attempt (on Flink 1.6.2) that extract some descriptive statistics from a batch dataset[1].
Any feedback is welcome.

Best,

On Thu, Feb 14, 2019 at 11:19 AM Flavio Pompermaier <[hidden email]> wrote:
No effort in this direction, then?
I had a try using SQL on Table API but I fear that the generated plan is not the optimal one..I'm looking for an efficient way to implement describe() method on a table or dataset/datasource

On Fri, Feb 8, 2019 at 10:35 AM Flavio Pompermaier <[hidden email]> wrote:
Hi to all,
is there any effort to standardize descriptive statistics in Apache Flink?
Is there any suggested way to achieve this?

Best,
Flavio



--
Flavio Pompermaier
Development Department

OKKAM S.r.l.
Tel. +(39) 0461 041809