Hi, Sagar.
Ans: Yea, Make the program exists and in that time if a checkpoint does not finished will lead the status keeps in .pending state then. Under the normal circumstances, the programs that running in the production env will never be stoped or existed if everything is fine.
Ans: Yes. Writer should work with default settings correct. Yes. We do not have to explicitly set these parameters to make it work. Yes. Assumption correct indeed. However, you know, flink is a real time streaming framework, so under normal circumstances,you don't really go to use the default settings when it comes to a specific business. Especially together work with offline end(Like hadoop mapreduce). In this case, you need to tell the offline end when time a bucket is close and when time the data for the specify bucket is ready. So, you can take a look on https://issues.apache.org/jira/browse/FLINK-9609. Cheers Zhangminglei
|
For #1, the word exist should be exit, right ?
Thanks -------- Original message -------- From: zhangminglei <[hidden email]> Date: 6/23/18 10:12 AM (GMT+08:00) To: sagar loke <[hidden email]> Cc: dev <[hidden email]>, user <[hidden email]> Subject: Re: [Flink-9407] Question about proposed ORC Sink ! Hi, Sagar.
Ans: Yea, Make the program exists and in that time if a checkpoint does not finished will lead the status keeps in .pending state then. Under the normal circumstances, the programs that running in the production env will never be stoped or existed if everything is fine.
Ans: Yes. Writer should work with default settings correct. Yes. We do not have to explicitly set these parameters to make it work. Yes. Assumption correct indeed. However, you know, flink is a real time streaming framework, so under normal circumstances,you don't really go to use the default settings when it comes to a specific business. Especially together work with offline end(Like hadoop mapreduce). In this case, you need to tell the offline end when time a bucket is close and when time the data for the specify bucket is ready. So, you can take a look on https://issues.apache.org/jira/browse/FLINK-9609. Cheers Zhangminglei
|
Yes, it should be exit. Thanks to Ted Yu. Very exactly! Cheers Zhangminglei
|
@zhangminglei, Question about the schema for ORC format: 1. Does it always need to be of complex type "<Struct>" ? 2. Or can it be created with individual data types directly ? eg. "name:string, age:int" ? Thanks, Sagar On Fri, Jun 22, 2018 at 11:56 PM, zhangminglei <[hidden email]> wrote:
Regards,
SAGAR. |
Hi Sagar, That's more a question for the ORC community, but AFAIK, the top-level type is always a struct because it needs to wrap the fields, e.g., struct(name:string, age:int) Best, Fabian 2018-06-26 22:38 GMT+02:00 sagar loke <[hidden email]>:
|
Thanks @zhangminglei and @Fabian for confirming. Even I looked at the ORC parsing code and it seems that using <struct> type is mandatory for now. Thanks, Sagar On Wed, Jun 27, 2018 at 12:59 AM, Fabian Hueske <[hidden email]> wrote:
Regards,
SAGAR. |
Free forum by Nabble | Edit this page |