Hi all, We are facing an issue when trying to integrate PostgreSQL with Flink JDBC. When you
establish a connection to the PostgreSQL database, it is in auto-commit mode. It means that each SQL statement is treated as a transaction and is automatically committed, but this functionality results in
unexpected behavior (e.g., out-of-memory errors) when executed for large result sets. In order to bypass such issues, we must disable the auto-commit mode. To do this, in a simple Java application, we call the
So, my question is: How can we achieve this by using JDBCInputFormat of Flink? Thanks in advance, Konstantinos |
Hi Konstantinos, Seems like setting for auto commit is not directly possible in the current JDBCInputFormatBuilder. However there's a way to specify the fetch size [1] for your DB round-trip, doesn't that resolve your issue? Similarly in JDBCOutputFormat, a batching mode was also used to stash upload rows before flushing to DB. -- Rong On Fri, Apr 12, 2019 at 6:23 AM Papadopoulos, Konstantinos <[hidden email]> wrote:
|
Hi Rong, We have already tried to set the fetch size with no success. According to PG documentation we have to set both configuration parameters (i.e., auto-commit to false and limit fetch) to
achieve our purpose. Thanks, Konstantinos From: Rong Rong <[hidden email]>
Hi Konstantinos, Seems like setting for auto commit is not directly possible in the current JDBCInputFormatBuilder. However there's a way to specify the fetch size [1] for your DB round-trip, doesn't that resolve your issue? Similarly in JDBCOutputFormat, a batching mode was also used to stash upload rows before flushing to DB. -- Rong On Fri, Apr 12, 2019 at 6:23 AM Papadopoulos, Konstantinos <[hidden email]> wrote:
|
Hi Konstantinos, This sounds like a useful extension to me. Would you like to create a Jira issue and contribute the improvement? In the meantime, you can just fork the code of JDBCInputFormat and adjust it to your needs. Best, Fabian Am Mo., 15. Apr. 2019 um 08:53 Uhr schrieb Papadopoulos, Konstantinos <[hidden email]>:
|
Hi Fabian, Glad to hear that you agree for such an improvement. Of course, I can handle it. Best,
Konstantinos From: Fabian Hueske <[hidden email]>
Hi Konstantinos, This sounds like a useful extension to me. Would you like to create a Jira issue and contribute the improvement? In the meantime, you can just fork the code of JDBCInputFormat and adjust it to your needs. Best, Fabian Am Mo., 15. Apr. 2019 um 08:53 Uhr schrieb Papadopoulos, Konstantinos <[hidden email]>:
|
Hi Fabian, I opened the following issue to track the improvement proposed: https://issues.apache.org/jira/browse/FLINK-12198 Best, Konstantinos From: Papadopoulos, Konstantinos <[hidden email]>
Hi Fabian, Glad to hear that you agree for such an improvement. Of course, I can handle it. Best,
Konstantinos From: Fabian Hueske <[hidden email]>
Hi Konstantinos, This sounds like a useful extension to me. Would you like to create a Jira issue and contribute the improvement? In the meantime, you can just fork the code of JDBCInputFormat and adjust it to your needs. Best, Fabian Am Mo., 15. Apr. 2019 um 08:53 Uhr schrieb Papadopoulos, Konstantinos <[hidden email]>:
|
Great, thank you! Am Mo., 15. Apr. 2019 um 16:28 Uhr schrieb Papadopoulos, Konstantinos <[hidden email]>:
|
+1, Thanks Konstantinos for opening the ticket. This would definitely be a useful feature. -- Rong On Mon, Apr 15, 2019 at 7:34 AM Fabian Hueske <[hidden email]> wrote:
|
Free forum by Nabble | Edit this page |