Strange output paths in Flink 0.8.1

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|

Strange output paths in Flink 0.8.1

Sebastian Schelter
Hi I'm getting strange output paths for this piece of code:

computeDistribution(
   "/home/ssc/Desktop/trackthetrackers/sampleSeg.tsv",
   "/home/ssc/Entwicklung/datasets/hyperlink2012/pld-index",
   "/home/ssc/Desktop/trackthetrackers/out/trackerDistribution/")

def computeDistribution(trackingGraphFile: String,
     domainIndexFile: String, outputPath: String) = {

   ...

   topTrackerDomains.writeAsCsv(outputPath, fieldDelimiter = "\t",
                                writeMode = WriteMode.OVERWRITE)

}

For some reasons, the last directory name is repeated twice in the final
output:

/home/ssc/Desktop/trackthetrackers/out/trackerDistribution/trackerDistribution/

How does this come?

Best,
Sebastian
Reply | Threaded
Open this post in threaded view
|

Re: Strange output paths in Flink 0.8.1

Fabian Hueske-2
I reproduced the bug and will look into that.

Cheers, Fabian

2015-03-03 14:08 GMT+01:00 Sebastian <[hidden email]>:
Hi I'm getting strange output paths for this piece of code:     

computeDistribution(
  "/home/ssc/Desktop/trackthetrackers/sampleSeg.tsv",
  "/home/ssc/Entwicklung/datasets/hyperlink2012/pld-index",
  "/home/ssc/Desktop/trackthetrackers/out/trackerDistribution/")

def computeDistribution(trackingGraphFile: String,
    domainIndexFile: String, outputPath: String) = {

  ...

  topTrackerDomains.writeAsCsv(outputPath, fieldDelimiter = "\t",
                               writeMode = WriteMode.OVERWRITE)

}

For some reasons, the last directory name is repeated twice in the final output:

/home/ssc/Desktop/trackthetrackers/out/trackerDistribution/trackerDistribution/

How does this come?

Best,
Sebastian

Reply | Threaded
Open this post in threaded view
|

Re: Strange output paths in Flink 0.8.1

Fabian Hueske-2
A quick workaround is to remove the last '/' from the output path. 

I created https://issues.apache.org/jira/browse/FLINK-1640 and will provide a fix tomorrow.

2015-03-03 21:59 GMT+01:00 Fabian Hueske <[hidden email]>:
I reproduced the bug and will look into that.

Cheers, Fabian

2015-03-03 14:08 GMT+01:00 Sebastian <[hidden email]>:
Hi I'm getting strange output paths for this piece of code:     

computeDistribution(
  "/home/ssc/Desktop/trackthetrackers/sampleSeg.tsv",
  "/home/ssc/Entwicklung/datasets/hyperlink2012/pld-index",
  "/home/ssc/Desktop/trackthetrackers/out/trackerDistribution/")

def computeDistribution(trackingGraphFile: String,
    domainIndexFile: String, outputPath: String) = {

  ...

  topTrackerDomains.writeAsCsv(outputPath, fieldDelimiter = "\t",
                               writeMode = WriteMode.OVERWRITE)

}

For some reasons, the last directory name is repeated twice in the final output:

/home/ssc/Desktop/trackthetrackers/out/trackerDistribution/trackerDistribution/

How does this come?

Best,
Sebastian


Reply | Threaded
Open this post in threaded view
|

Re: Strange output paths in Flink 0.8.1

Sebastian Schelter
Thank you!

On 03.03.2015 23:09, Fabian Hueske wrote:

> A quick workaround is to remove the last '/' from the output path.
>
> I created https://issues.apache.org/jira/browse/FLINK-1640 and will
> provide a fix tomorrow.
>
> 2015-03-03 21:59 GMT+01:00 Fabian Hueske <[hidden email]
> <mailto:[hidden email]>>:
>
>     I reproduced the bug and will look into that.
>
>     Cheers, Fabian
>
>     2015-03-03 14:08 GMT+01:00 Sebastian <[hidden email]
>     <mailto:[hidden email]>>:
>
>         Hi I'm getting strange output paths for this piece of code:
>
>         computeDistribution(
>            "/home/ssc/Desktop/__trackthetrackers/sampleSeg.__tsv",
>            "/home/ssc/Entwicklung/__datasets/hyperlink2012/pld-__index",
>
>         "/home/ssc/Desktop/__trackthetrackers/out/__trackerDistribution/")
>
>         def computeDistribution(__trackingGraphFile: String,
>              domainIndexFile: String, outputPath: String) = {
>
>            ...
>
>            topTrackerDomains.writeAsCsv(__outputPath, fieldDelimiter = "\t",
>                                         writeMode = WriteMode.OVERWRITE)
>
>         }
>
>         For some reasons, the last directory name is repeated twice in
>         the final output:
>
>         /home/ssc/Desktop/__trackthetrackers/out/__trackerDistribution/__trackerDistribution/
>
>         How does this come?
>
>         Best,
>         Sebastian
>
>
>