flink use hdfs DistributedCache


flink use hdfs DistributedCache

何春平
hi everyone!
Can Flink submit a job that reads a custom file distributed via the HDFS DistributedCache? Spark can do that with the following command:
    bin/spark-submit --master yarn --deploy-mode cluster --files /opt/its007-datacollection-conf.properties#its007-datacollection-conf.properties ...
The Spark driver can then read the `its007-datacollection-conf.properties` file in its working directory.

thanks!
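For reference, the working-directory read described above can be sketched locally like this (the property names and values are invented, and the `cat` step merely simulates the file that YARN would localize into the driver's working directory):

```shell
# Simulate the file that YARN localizes into the driver's working directory
# (the filename comes from the question; the contents here are invented).
cat > its007-datacollection-conf.properties <<'EOF'
collector.host=localhost
collector.port=9090
EOF

# Read a property by bare filename, as the driver would after --files shipping:
grep '^collector.port=' its007-datacollection-conf.properties | cut -d= -f2
```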

Re: flink use hdfs DistributedCache

Rong Rong
I am not sure if this suits your use case, but the Flink YARN CLI does support transferring local resources to all YARN nodes.
Simply using [1]:
bin/flink run -m yarn-cluster -yt <local_resource>
or
bin/flink run -m yarn-cluster --yarnship <local_resource>
should do the trick.

It may not be using the HDFS DistributedCache API, though.

Thanks,
Rong
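To illustrate the effect of `-yt`/`--yarnship` without a YARN cluster, here is a minimal local simulation (the directory and file names are invented; the `cp` step stands in for YARN localizing the shipped files into each container's working directory):

```shell
# Hypothetical shipped directory and container working directory.
mkdir -p shipdir container_workdir
echo 'app.name=its007' > shipdir/app.properties   # invented config file

# YARN would localize everything passed via -yt into the container's
# working directory; cp simulates that step here.
cp shipdir/* container_workdir/

# Inside the container, the job can open the file by its bare name:
(cd container_workdir && grep '^app.name=' app.properties | cut -d= -f2)
```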


On Sun, Sep 2, 2018 at 2:07 AM 何春平 <[hidden email]> wrote:

Re: flink use hdfs DistributedCache

何春平
Rong, thanks for your reply!
This is exactly what I need!


------------------ Original Message ------------------
From: "Rong Rong" <[hidden email]>
Date: Monday, September 3, 2018, 0:02 AM
To: "何春平" <[hidden email]>
Cc: "user" <[hidden email]>
Subject: Re: flink use hdfs DistributedCache
