[ANNOUNCE] Weekly Community Update 2019/27

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[ANNOUNCE] Weekly Community Update 2019/27

Konstantin Knauf-2
Dear community

this week's community digest with news on Flink 1.9.0 and Flink 1.8.1, our Travis setup, Flink on PyPi, and a couple of new initiatives around the DataStream API.

As always, please feel free to add additional updates and news to this thread!

Flink Development
===============

* [releases] The feature freeze for Flink 1.9.0 has been announced on Friday. The release branch will only be cut next Thursday though to allow for some finishing strokes mainly on the planned features for the Table API. [1]

* [releases] Flink 1.8.1 has been released. The release blog post [2] contains a list of all resolved issues.

* [development process] Over the last weeks there has been an ongoing discussion on how to deal with the lack of resources available for Flink's CI (Travis). This week Chesnay has started a voting thread to move away from ASF's Travis account to a Travis account sponsored by Ververica. This got a lot of approvals, but the actual implementation is not trivial as we can not link an external Travis account to the apache/flink repository. The approach currently poc'ed by Chensay involves us fetching every PR to a ververica-controlled repository and publishing the results back to apache/flink. [3]

* [python] Flink 1.9.0 will contain a first version of a Python Table API. Consequently, Jincheng started a discussion to publish it to PyPi. The name will simply be "apache-flink". Currently, the main question is whether the python package should bundle the binary distribution of Apache Flink (Java/Scala) it depends on. It seems more feedback is needed to move this forward. [4]

* [datastream-api] With FLIP-34 the Flink community introduced a the "stop" command for graceful shutdown of a Flink Job. UDFs currently only have a close() method, which is called for any type of shutdown. Now, Klou started a discussion about adding an optional interface, which methods would only be called during gracelful shutdown scenarios (but e.g. not in the case of failure). [5,6]

* [datastream-api] Xingcan has started a VOTE on how to proceed with the split/select API. Voting Time ended one ago. More opinions wouldn't hurt to move this forward. [7]

* [state management] After an offline sync between Vino and Kurt the discussion on "local keyed state" and "local aggregations" covered last week will now first focus on the proposal of "local keyed state" and "localKeyBy()" as this was the core of Vino's original proposal.  [8]

* [logging] An old discussion thread [9] by Vino on separating the logs between Flink Jobs within the same cluster was revived a bit this week in the context of a new ticket [10]. A lot of the (interesting) discussion has moved to this ticket for now.


Notable Bugs
===========

* [FLINK-13063] [1.8.1] [1.7.2] [1.6.4] In an operator chain containing an AsyncOperator records can be lost or duplicated due to a bug in the AsyncOperator. Until the mailbox model has been fully implemented, the fix ensures that the AsyncOperator can only be the head of an operator. Fixed in 1.7.3 and 1.8.2. [11]

* [FLINK-12889] [1.8.1] [1.7.2] In the case where a TM fails with an OOME during checkpointing, its Tasks might not reach a proper failed state, because the failure handling mechanism is affected by the OOME, too. Fixed in 1.73 and 1.8.2. [12]

* [FLINK-13059] The CassandraSink might become deadlocked during shutdown, when an exception happens during asynchronously sending requests to Cassandra. A PR is available but needs a review. [13]

* [FLINK-12122] [1.6.4] [ 1.7.2] [ 1.8.1] Not a new bug, but a continuous source of questions: Since FLIP-6 tasks are not spread out evenly among all TaskManagers as before, which can result in very uneven loads if the number of available TaskSlots exceeds the parallelism of the submitted job. Right now it looks like a fix is targeted for 1.9.1 and 1.8.2 [14]


Events, Blog Posts, Misc
====================

* Flink Forward Europe early-bird ends on the 15th of July. [15]
* Upcoming Meetups
    * On 18th of July Christos Hadjinikolis is speaking at the "Big Data LDN Meetup" on "How real-time data processing is used for application in customer experience?" [16]


--

Konstantin Knauf | Solutions Architect

+49 160 91394525


--

Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany

--

Ververica GmbH
Registered at Amtsgericht Charlottenburg: HRB 158244 B
Managing Directors: Dr. Kostas Tzoumas, Dr. Stephan Ewen