Need Help/Code Examples with reading/writing Parquet File with Flink ?

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

Need Help/Code Examples with reading/writing Parquet File with Flink ?

sohimankotia
Hi ..

I have file in hdfs in format file.snappy.parquet . Can someone please
point/help with code example of reading parquet files .


-Sohi



--
Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/
Reply | Threaded
Open this post in threaded view
|

Re: Need Help/Code Examples with reading/writing Parquet File with Flink ?

Shuyi Chen
AFA I remember, there is no ParquetInputFormat in Flink. But there is a JIRA logged and an attempt in this PR, but was never merged. We do have an internal implementation that is being used in our company, and we can contribute back to the community. 

+Peter, could we help check how we could contribute the ParquetInputFormat implementation back?

Also, FYI, if you want to read parquet with Flink SQL, here is the JIRA for adding ParquetTableSource.

On Tue, Apr 17, 2018 at 10:23 PM, sohimankotia <[hidden email]> wrote:
Hi ..

I have file in hdfs in format file.snappy.parquet . Can someone please
point/help with code example of reading parquet files .


-Sohi



--
Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/



--
"So you have to trust that the dots will somehow connect in your future."
Reply | Threaded
Open this post in threaded view
|

Re: Need Help/Code Examples with reading/writing Parquet File with Flink ?

Jörn Franke
In reply to this post by sohimankotia
You can use the corresponding HadoopInputformat within Flink

> On 18. Apr 2018, at 07:23, sohimankotia <[hidden email]> wrote:
>
> Hi ..
>
> I have file in hdfs in format file.snappy.parquet . Can someone please
> point/help with code example of reading parquet files .
>
>
> -Sohi
>
>
>
> --
> Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/