Re: DataStream API in Batch Execution mode
Posted by Guowei Ma
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/DataStream-API-in-Batch-Execution-mode-tp44264p44271.html
Hi Marco,
I think you could try `FileSource`; there is an example in [1]. `FileSource` scans the files under the given directory recursively, so a nested directory layout should work without flattening it first.
Would you mind opening an issue about the missing documentation?
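To make the suggestion a bit more concrete, here is a minimal sketch of a `FileSource` reading a directory tree in batch execution mode. It assumes a Flink 1.13-era setup with `flink-connector-files` on the classpath, an S3 filesystem plugin installed, and plain text files; the bucket name, path, and class name are hypothetical placeholders. `TextLineFormat` is the line reader from that release line; newer releases provide `TextLineInputFormat` as its replacement.

```java
import org.apache.flink.api.common.RuntimeExecutionMode;
import org.apache.flink.api.common.eventtime.WatermarkStrategy;
import org.apache.flink.connector.file.src.FileSource;
import org.apache.flink.connector.file.src.reader.TextLineFormat;
import org.apache.flink.core.fs.Path;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

class HierarchicalFileSourceJob {

    // Builds a bounded source over a root directory. FileSource enumerates
    // nested directories under the root, so year/half/quarter/month folders
    // do not need to be flattened beforehand.
    static FileSource<String> buildSource(String rootDirectory) {
        return FileSource
                .forRecordStreamFormat(new TextLineFormat(), new Path(rootDirectory))
                .build();
    }

    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env =
                StreamExecutionEnvironment.getExecutionEnvironment();
        // Run the DataStream program with batch execution semantics.
        env.setRuntimeMode(RuntimeExecutionMode.BATCH);

        // Hypothetical bucket and prefix; replace with the real location.
        FileSource<String> source = buildSource("s3://my-bucket/data/2021");

        DataStream<String> lines =
                env.fromSource(source, WatermarkStrategy.noWatermarks(), "file-source");

        lines.print();
        env.execute("hierarchical-file-source-sketch");
    }
}
```

Since `forRecordStreamFormat` accepts several paths, passing only the subdirectories for the period of interest (for example one quarter) may be a way to restrict the job to the right set of files.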
On Tue, Jun 8, 2021 at 5:59 AM Marco Villalobos <[hidden email]> wrote:
How do I use a hierarchical directory structure as a file source in S3 when using the DataStream API in Batch Execution mode?
I have been trying to find out whether the API supports that. Currently our data is organized by years, halves, quarters, and months, but before I launch the job, I flatten the file structure just to process the right set of files.