ClassNotFoundException upon Savepoint Disposal

classic Classic list List threaded Threaded
6 messages Options
Reply | Threaded
Open this post in threaded view
|

ClassNotFoundException upon Savepoint Disposal

Konstantin Gregor
Hey everyone,

we are experiencing an issue in the disposal of savepoints in
Flink-1.1.3. We have a streaming job that has custom state (user objects
are part of the state). We create a savepoint:

$ flink savepoint <JOBID>
[...]
Savepoint completed. Path:
hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50
[...]

Then we want to simply dispose of that savepoint where we also provide
the jar to the job from which the savepoint was made:
$ flink savepoint -d
hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50 -j
/path/to/jar/application.jar

This gives us a ClassNotFoundException of our custom objects [1].

Adding our jar to the flink/lib directory is not an option for us,
things will break because of this.
Does anyone have an idea on how to proceed here?

Thanks and best regards,

Konstantin

[1]
java.io.IOException: Failed to dispose savepoint hdfs:///bigdata/flink/savepoints/savepoint-20f064fb9f50.
        at org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
        at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
        at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
        at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
        at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
        at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
        at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
        at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
        at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
Caused by: java.lang.ClassNotFoundException: our.company.application.eventdata.EventDataRecord
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:270)
        at org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
        at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
        at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
        at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
        at sun.reflect.GeneratedMethodAccessor33.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
        at sun.reflect.GeneratedMethodAccessor32.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at java.util.HashMap.readObject(HashMap.java:1184)
        at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
        at org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
        at org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
        at org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
        at org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
        at org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)

-- 
Konstantin Gregor * [hidden email]
TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
Sitz: Unterföhring * Amtsgericht München * HRB 135082

signature.asc (836 bytes) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: ClassNotFoundException upon Savepoint Disposal

Stefan Richter
Hi, 

could you provide us with the log from the job client, with logging on debug level for package org.apache.flink.client? Also, did you check if this problem also exists in the latest bugfix release for your version (1.1.5) ? 

Best,
Stefan

Am 27.03.2017 um 11:41 schrieb Konstantin Gregor <[hidden email]>:

Hey everyone,

we are experiencing an issue in the disposal of savepoints in
Flink-1.1.3. We have a streaming job that has custom state (user objects
are part of the state). We create a savepoint:

$ flink savepoint <JOBID>
[...]
Savepoint completed. Path:
hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50
[...]

Then we want to simply dispose of that savepoint where we also provide
the jar to the job from which the savepoint was made:
$ flink savepoint -d
hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50 -j
/path/to/jar/application.jar

This gives us a ClassNotFoundException of our custom objects [1].

Adding our jar to the flink/lib directory is not an option for us,
things will break because of this.
Does anyone have an idea on how to proceed here?

Thanks and best regards,

Konstantin

[1]
java.io.IOException: Failed to dispose savepoint <a href="hdfs:///bigdata/flink/savepoints/savepoint-20f064fb9f50" class="">hdfs:///bigdata/flink/savepoints/savepoint-20f064fb9f50.
        at org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
        at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
        at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
        at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
        at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
        at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
        at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
        at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
        at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
Caused by: java.lang.ClassNotFoundException: our.company.application.eventdata.EventDataRecord
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:270)
        at org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
        at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
        at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
        at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
        at sun.reflect.GeneratedMethodAccessor33.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
        at sun.reflect.GeneratedMethodAccessor32.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at java.util.HashMap.readObject(HashMap.java:1184)
        at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
        at org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
        at org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
        at org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
        at org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
        at org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)

-- 
Konstantin Gregor * [hidden email]
TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
Sitz: Unterföhring * Amtsgericht München * HRB 135082

Reply | Threaded
Open this post in threaded view
|

Re: ClassNotFoundException upon Savepoint Disposal

Ufuk Celebi
What kind of state backend where you using for the checkpoints?

If there is a bug that prevents us from deleting the savepoint files
automatically, we can do a manual workaround and delete the
checkpoints files manually. With Flink 1.3 this becomes very straight
forward as savepoint data all go to a self contained directory that
can be deleted manually.

On Mon, Mar 27, 2017 at 12:46 PM, Stefan Richter
<[hidden email]> wrote:

> Hi,
>
> could you provide us with the log from the job client, with logging on debug
> level for package org.apache.flink.client? Also, did you check if this
> problem also exists in the latest bugfix release for your version (1.1.5) ?
>
> Best,
> Stefan
>
>
> Am 27.03.2017 um 11:41 schrieb Konstantin Gregor
> <[hidden email]>:
>
> Hey everyone,
>
> we are experiencing an issue in the disposal of savepoints in
> Flink-1.1.3. We have a streaming job that has custom state (user objects
> are part of the state). We create a savepoint:
>
> $ flink savepoint <JOBID>
> [...]
> Savepoint completed. Path:
> hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50
> [...]
>
> Then we want to simply dispose of that savepoint where we also provide
> the jar to the job from which the savepoint was made:
> $ flink savepoint -d
> hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50 -j
> /path/to/jar/application.jar
>
> This gives us a ClassNotFoundException of our custom objects [1].
>
> Adding our jar to the flink/lib directory is not an option for us,
> things will break because of this.
> Does anyone have an idea on how to proceed here?
>
> Thanks and best regards,
>
> Konstantin
>
> [1]
> java.io.IOException: Failed to dispose savepoint
> hdfs:///bigdata/flink/savepoints/savepoint-20f064fb9f50.
>         at
> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>         at
> scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
>         at
> scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
>         at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
>         at
> akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
>         at
> scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>         at
> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
>         at
> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
>         at
> scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>         at
> scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> Caused by: java.lang.ClassNotFoundException:
> our.company.application.eventdata.EventDataRecord
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
>         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
>         at java.lang.Class.forName0(Native Method)
>         at java.lang.Class.forName(Class.java:270)
>         at
> org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
>         at
> java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
>         at
> java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
>         at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>         at
> org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
>         at sun.reflect.GeneratedMethodAccessor33.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>         at
> org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
>         at sun.reflect.GeneratedMethodAccessor32.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>         at java.util.HashMap.readObject(HashMap.java:1184)
>         at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>         at
> org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
>         at
> org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
>         at
> org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
>         at
> org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
>         at
> org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
>         at
> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)
>
>
> --
> Konstantin Gregor * [hidden email]
> TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
> Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
> Sitz: Unterföhring * Amtsgericht München * HRB 135082
>
>
Reply | Threaded
Open this post in threaded view
|

Re: ClassNotFoundException upon Savepoint Disposal

Konstantin Gregor

Hi Ufuk, hi Stefan,


thanks a lot for your replies.

Ufuk, we are using the HDFS state backend.

Stefan, I installed 1.1.5 on our machines and built our software with the Flink 1.1.5 dependency, but the problem remains. Below are the logs for savepoint creation [1] and savepoint disposal [2] as well as the logs from the start of the job [3]. There were not many more log lines when I set org.apache.flink.client to DEBUG, so I set the whole package org.apache.flink to DEBUG in the hope of some findings. But I couldn't really find anything suspicious.

Again, thanks a lot for your help!

Best regards


Konstantin


[1]

2017-03-28 12:21:32,033 INFO  org.apache.flink.client.CliFrontend                           - --------------------------------------------------------------------------------
2017-03-28 12:21:32,034 INFO  org.apache.flink.client.CliFrontend                           -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810, Date:10.11.2016 @ 13:25:34 CET)
2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend                           -  Current user: our_user
2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend                           -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation - 1.7/24.51-b03
2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend                           -  Maximum heap size: 1749 MiBytes
2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend                           -  JAVA_HOME: /usr/java/default
2017-03-28 12:21:32,037 INFO  org.apache.flink.client.CliFrontend                           -  Hadoop version: 2.3.0
2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend                           -  JVM Options:
2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend                           -     -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend                           -     -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend                           -     -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend                           -  Program Arguments:
2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend                           -     savepoint
2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend                           -     7e865198e220bea8a2203ebdb0827b6f
2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend                           -     -j
2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend                           -     /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar
2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend                           -  Classpath: /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend                           - --------------------------------------------------------------------------------
2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend                           - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend                           - Trying to load configuration file
2017-03-28 12:21:32,050 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: env.java.opts, -Djavax.net.ssl.trustStore=/path/to/our/cacerts -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
2017-03-28 12:21:32,050 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2017-03-28 12:21:32,050 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2017-03-28 12:21:32,051 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.mb, 256
2017-03-28 12:21:32,051 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.mb, 512
2017-03-28 12:21:32,051 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 4
2017-03-28 12:21:32,051 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.memory.preallocate, false
2017-03-28 12:21:32,051 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2017-03-28 12:21:32,051 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.port, 8081
2017-03-28 12:21:32,051 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.submit.enable, false
2017-03-28 12:21:32,052 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend, filesystem
2017-03-28 12:21:32,052 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend.fs.checkpointdir, hdfs://ourserver:8020/our_user/flink/state
2017-03-28 12:21:32,052 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.network.numberOfBuffers, 4096
2017-03-28 12:21:32,052 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
2017-03-28 12:21:32,052 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.mode, zookeeper
2017-03-28 12:21:32,052 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.quorum, ourserver:2181,ourserver2:2181,ourserver3:2181
2017-03-28 12:21:32,053 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.storageDir, hdfs:///our_user/flink/recovery
2017-03-28 12:21:32,053 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.root, flink
2017-03-28 12:21:32,053 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.namespace, yarn_session
2017-03-28 12:21:32,053 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: reocvery.zookeeper.client.connection-timeout, 30000
2017-03-28 12:21:32,053 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.session-timeout, 120000
2017-03-28 12:21:32,053 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.retry-wait, 5000
2017-03-28 12:21:32,053 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.max-retry-attempts, 5
2017-03-28 12:21:32,054 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2017-03-28 12:21:32,054 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.maximum-failed-containers, 80
2017-03-28 12:21:32,054 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.watch.heartbeat.interval, 50s
2017-03-28 12:21:32,055 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.log.lifecycle.events, true
2017-03-28 12:21:32,055 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.ask.timeout, 20s
2017-03-28 12:21:32,055 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend, filesystem
2017-03-28 12:21:32,055 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend.fs.dir, hdfs:///our_user/flink/savepoints
2017-03-28 12:21:32,281 INFO  org.apache.flink.client.CliFrontend                           - Running 'savepoint' command.
2017-03-28 12:21:32,287 INFO  org.apache.flink.client.CliFrontend                           - Retrieving JobManager.
2017-03-28 12:21:32,288 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Found YARN properties file /tmp/.yarn-properties-our_user
2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Using Yarn application id from YARN properties application_1488884688139_2648
2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - YARN properties set default parallelism to 12
2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Found YARN properties file /tmp/.yarn-properties-our_user
2017-03-28 12:21:32,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Using Yarn application id from YARN properties application_1488884688139_2648
2017-03-28 12:21:32,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - YARN properties set default parallelism to 12
2017-03-28 12:21:32,440 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: env.java.opts, -Djavax.net.ssl.trustStore=/path/to/our/cacerts -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
2017-03-28 12:21:32,440 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2017-03-28 12:21:32,440 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.mb, 256
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.mb, 512
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 4
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.memory.preallocate, false
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.port, 8081
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.submit.enable, false
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend, filesystem
2017-03-28 12:21:32,441 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend.fs.checkpointdir, hdfs://ourserver:8020/our_user/flink/state
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.network.numberOfBuffers, 4096
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.mode, zookeeper
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.quorum, ourserver:2181,ourserver:2181,ourserver:2181
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.storageDir, hdfs:///our_user/flink/recovery
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.root, flink
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.namespace, yarn_session
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: reocvery.zookeeper.client.connection-timeout, 30000
2017-03-28 12:21:32,442 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.session-timeout, 120000
2017-03-28 12:21:32,443 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.retry-wait, 5000
2017-03-28 12:21:32,443 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.max-retry-attempts, 5
2017-03-28 12:21:32,443 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2017-03-28 12:21:32,443 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.maximum-failed-containers, 80
2017-03-28 12:21:32,443 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.watch.heartbeat.interval, 50s
2017-03-28 12:21:32,444 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.log.lifecycle.events, true
2017-03-28 12:21:32,444 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.ask.timeout, 20s
2017-03-28 12:21:32,444 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend, filesystem
2017-03-28 12:21:32,444 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend.fs.dir, hdfs:///our_user/flink/savepoints
2017-03-28 12:21:32,541 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at ourserver/ourserver_ip:8050
2017-03-28 12:21:32,718 INFO  org.apache.flink.yarn.YarnClusterDescriptor                   - Found application JobManager host name 'ourserver' and port '36901' from supplied application id 'application_1488884688139_2648'
2017-03-28 12:21:32,732 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using 'flink/yarn_session' as zookeeper namespace.
2017-03-28 12:21:32,831 INFO  org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Starting
2017-03-28 12:21:32,832 DEBUG org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Starting
2017-03-28 12:21:32,832 DEBUG org.apache.flink.shaded.org.apache.curator.ConnectionState    - Starting
2017-03-28 12:21:32,833 DEBUG org.apache.flink.shaded.org.apache.curator.ConnectionState    - reset
2017-03-28 12:21:32,874 INFO  org.apache.flink.shaded.org.apache.curator.framework.state.ConnectionStateManager  - State change: CONNECTED
2017-03-28 12:21:33,891 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService.
2017-03-28 12:21:33,906 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Leader node has changed.
2017-03-28 12:21:33,912 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - New leader information: Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
2017-03-28 12:21:33,914 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Stopping ZooKeeperLeaderRetrievalService.
2017-03-28 12:21:33,914 DEBUG org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Closing
2017-03-28 12:21:33,915 DEBUG org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Closing
2017-03-28 12:21:33,915 DEBUG org.apache.flink.shaded.org.apache.curator.ConnectionState    - Closing
2017-03-28 12:21:33,920 INFO  org.apache.flink.client.CliFrontend                           - Using address /ourserver_ip:36901 to connect to JobManager.
2017-03-28 12:21:33,926 INFO  org.apache.flink.yarn.YarnClusterClient                       - Starting client actor system.
2017-03-28 12:21:33,928 DEBUG org.apache.flink.runtime.net.ConnectionUtils                  - Trying to connect to (ourserver/ourserver_ip:36901) from local address ourserver/ourserver_ip with timeout 200
2017-03-28 12:21:33,931 DEBUG org.apache.flink.runtime.net.ConnectionUtils                  - Using InetAddress.getLocalHost() immediately for the connecting address
2017-03-28 12:21:34,673 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService.
2017-03-28 12:21:34,677 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Leader node has changed.
2017-03-28 12:21:34,677 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - New leader information: Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
2017-03-28 12:21:34,823 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Stopping ZooKeeperLeaderRetrievalService.
2017-03-28 12:21:34,826 INFO  org.apache.flink.client.CliFrontend                           - Triggering savepoint for job 7e865198e220bea8a2203ebdb0827b6f.
2017-03-28 12:21:34,828 INFO  org.apache.flink.client.CliFrontend                           - Waiting for response...
2017-03-28 12:21:34,993 INFO  org.apache.flink.client.CliFrontend                           - Savepoint completed. Path: hdfs:/our_user/flink/savepoints/savepoint-77214a0f9902
2017-03-28 12:21:34,994 INFO  org.apache.flink.client.CliFrontend                           - You can resume your program from this savepoint with the run command.
2017-03-28 12:21:34,994 INFO  org.apache.flink.yarn.YarnClusterClient                       - Shutting down YarnClusterClient from the client shutdown hook
2017-03-28 12:21:34,994 INFO  org.apache.flink.yarn.YarnClusterClient                       - Disconnecting YarnClusterClient from ApplicationMaster


[2]

2017-03-28 12:19:58,063 INFO  org.apache.flink.client.CliFrontend                           - --------------------------------------------------------------------------------
2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend                           -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810, Date:10.11.2016 @ 13:25:34 CET)
2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend                           -  Current user: our_user
2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend                           -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation - 1.7/24.51-b03
2017-03-28 12:19:58,065 INFO  org.apache.flink.client.CliFrontend                           -  Maximum heap size: 1749 MiBytes
2017-03-28 12:19:58,065 INFO  org.apache.flink.client.CliFrontend                           -  JAVA_HOME: /usr/java/default
2017-03-28 12:19:58,067 INFO  org.apache.flink.client.CliFrontend                           -  Hadoop version: 2.3.0
2017-03-28 12:19:58,067 INFO  org.apache.flink.client.CliFrontend                           -  JVM Options:
2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend                           -     -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend                           -     -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend                           -     -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend                           -  Program Arguments:
2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend                           -     savepoint
2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend                           -     -d
2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend                           -     hdfs:/our_user/flink/savepoints/savepoint-d16441420a87
2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend                           -     -j
2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend                           -     /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar
2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend                           -  Classpath: /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend                           - --------------------------------------------------------------------------------
2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend                           - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend                           - Trying to load configuration file
2017-03-28 12:19:58,085 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: env.java.opts, -Djavax.net.ssl.trustStore=/path/to/our/cacerts -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
2017-03-28 12:19:58,085 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2017-03-28 12:19:58,086 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2017-03-28 12:19:58,086 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.mb, 256
2017-03-28 12:19:58,086 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.mb, 512
2017-03-28 12:19:58,086 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 4
2017-03-28 12:19:58,086 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.memory.preallocate, false
2017-03-28 12:19:58,086 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2017-03-28 12:19:58,087 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.port, 8081
2017-03-28 12:19:58,087 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.submit.enable, false
2017-03-28 12:19:58,087 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend, filesystem
2017-03-28 12:19:58,087 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend.fs.checkpointdir, hdfs://ourserver:8020/our_user/flink/state
2017-03-28 12:19:58,087 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.network.numberOfBuffers, 4096
2017-03-28 12:19:58,087 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
2017-03-28 12:19:58,088 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.mode, zookeeper
2017-03-28 12:19:58,088 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.quorum, ourserver:2181,ourserver2:2181,ourserver3:2181
2017-03-28 12:19:58,088 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.storageDir, hdfs:///our_user/flink/recovery
2017-03-28 12:19:58,088 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.root, flink
2017-03-28 12:19:58,088 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.namespace, yarn_session
2017-03-28 12:19:58,088 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: reocvery.zookeeper.client.connection-timeout, 30000
2017-03-28 12:19:58,089 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.session-timeout, 120000
2017-03-28 12:19:58,089 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.retry-wait, 5000
2017-03-28 12:19:58,089 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.max-retry-attempts, 5
2017-03-28 12:19:58,089 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2017-03-28 12:19:58,089 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.maximum-failed-containers, 80
2017-03-28 12:19:58,090 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.watch.heartbeat.interval, 50s
2017-03-28 12:19:58,090 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.log.lifecycle.events, true
2017-03-28 12:19:58,090 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.ask.timeout, 20s
2017-03-28 12:19:58,090 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend, filesystem
2017-03-28 12:19:58,090 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend.fs.dir, hdfs:///our_user/flink/savepoints
2017-03-28 12:19:58,367 INFO  org.apache.flink.client.CliFrontend                           - Running 'savepoint' command.
2017-03-28 12:19:58,372 INFO  org.apache.flink.client.CliFrontend                           - Retrieving JobManager.
2017-03-28 12:19:58,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Found YARN properties file /tmp/.yarn-properties-our_user
2017-03-28 12:19:58,484 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Using Yarn application id from YARN properties application_1488884688139_2648
2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - YARN properties set default parallelism to 12
2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Found YARN properties file /tmp/.yarn-properties-our_user
2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Using Yarn application id from YARN properties application_1488884688139_2648
2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - YARN properties set default parallelism to 12
2017-03-28 12:19:58,604 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: env.java.opts, -Djavax.net.ssl.trustStore=/path/to/our/cacerts -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
2017-03-28 12:19:58,604 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2017-03-28 12:19:58,604 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2017-03-28 12:19:58,604 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.mb, 256
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.mb, 512
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 4
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.memory.preallocate, false
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.port, 8081
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.web.submit.enable, false
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend, filesystem
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: state.backend.fs.checkpointdir, hdfs://ourserver:8020/our_user/flink/state
2017-03-28 12:19:58,605 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.network.numberOfBuffers, 4096
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.mode, zookeeper
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.quorum, ourserver:2181,ourserver:2181,ourserver:2181
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.storageDir, hdfs:///our_user/flink/recovery
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.root, flink
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.path.namespace, yarn_session
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: reocvery.zookeeper.client.connection-timeout, 30000
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.session-timeout, 120000
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.retry-wait, 5000
2017-03-28 12:19:58,606 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: recovery.zookeeper.client.max-retry-attempts, 5
2017-03-28 12:19:58,607 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2017-03-28 12:19:58,607 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.maximum-failed-containers, 80
2017-03-28 12:19:58,607 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.watch.heartbeat.interval, 50s
2017-03-28 12:19:58,607 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.log.lifecycle.events, true
2017-03-28 12:19:58,607 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: akka.ask.timeout, 20s
2017-03-28 12:19:58,608 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend, filesystem
2017-03-28 12:19:58,608 DEBUG org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: savepoints.state.backend.fs.dir, hdfs:///our_user/flink/savepoints
2017-03-28 12:19:58,685 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at ourserver/ourserver_ip:8050
2017-03-28 12:19:58,969 INFO  org.apache.flink.yarn.YarnClusterDescriptor                   - Found application JobManager host name 'ourserver' and port '36901' from supplied application id 'application_1488884688139_2648'
2017-03-28 12:19:58,989 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using 'flink/yarn_session' as zookeeper namespace.
2017-03-28 12:19:59,114 INFO  org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Starting
2017-03-28 12:19:59,115 DEBUG org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Starting
2017-03-28 12:19:59,115 DEBUG org.apache.flink.shaded.org.apache.curator.ConnectionState    - Starting
2017-03-28 12:19:59,115 DEBUG org.apache.flink.shaded.org.apache.curator.ConnectionState    - reset
2017-03-28 12:19:59,172 INFO  org.apache.flink.shaded.org.apache.curator.framework.state.ConnectionStateManager  - State change: CONNECTED
2017-03-28 12:20:00,212 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService.
2017-03-28 12:20:00,229 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Leader node has changed.
2017-03-28 12:20:00,235 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - New leader information: Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
2017-03-28 12:20:00,238 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Stopping ZooKeeperLeaderRetrievalService.
2017-03-28 12:20:00,238 DEBUG org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Closing
2017-03-28 12:20:00,238 DEBUG org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Closing
2017-03-28 12:20:00,239 DEBUG org.apache.flink.shaded.org.apache.curator.ConnectionState    - Closing
2017-03-28 12:20:00,245 INFO  org.apache.flink.client.CliFrontend                           - Using address /ourserver_ip:36901 to connect to JobManager.
2017-03-28 12:20:00,245 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using 'flink/yarn_session' as zookeeper namespace.
2017-03-28 12:20:00,252 INFO  org.apache.flink.yarn.YarnClusterClient                       - Starting client actor system.
2017-03-28 12:20:00,254 DEBUG org.apache.flink.runtime.net.ConnectionUtils                  - Trying to connect to (ourserver/ourserver_ip:36901) from local address ourserver/ourserver_ip with timeout 200
2017-03-28 12:20:00,259 DEBUG org.apache.flink.runtime.net.ConnectionUtils                  - Using InetAddress.getLocalHost() immediately for the connecting address
2017-03-28 12:20:01,209 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService.
2017-03-28 12:20:01,213 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Leader node has changed.
2017-03-28 12:20:01,213 DEBUG org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - New leader information: Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
2017-03-28 12:20:01,442 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Stopping ZooKeeperLeaderRetrievalService.
2017-03-28 12:20:01,446 INFO  org.apache.flink.client.CliFrontend                           - Disposing savepoint 'hdfs:/our_user/flink/savepoints/savepoint-d16441420a87' with JAR /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar.
2017-03-28 12:20:01,590 INFO  org.apache.flink.client.CliFrontend                           - Waiting for response...
2017-03-28 12:20:01,636 ERROR org.apache.flink.client.CliFrontend                           - Error while running the command.
java.io.IOException: Failed to dispose savepoint hdfs:/our_user/flink/savepoints/savepoint-d16441420a87.
        at org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
        at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
        at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
        at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
        at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
        at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
        at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
        at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
        at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
Caused by: java.lang.ClassNotFoundException: our.company.eventdata.EventDataRecord
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:270)
        at org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
        at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
        at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
        at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
        at sun.reflect.GeneratedMethodAccessor51.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
        at sun.reflect.GeneratedMethodAccessor29.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at java.util.HashMap.readObject(HashMap.java:1184)
        at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
        at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
        at org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
        at org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
        at org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
        at org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
        at org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)
        ... 12 more
2017-03-28 12:20:01,652 INFO  org.apache.flink.yarn.YarnClusterClient                       - Shutting down YarnClusterClient from the client shutdown hook
2017-03-28 12:20:01,653 INFO  org.apache.flink.yarn.YarnClusterClient                       - Disconnecting YarnClusterClient from ApplicationMaster


[3]

2017-03-28 10:43:57,361 INFO  org.apache.flink.client.CliFrontend                           - --------------------------------------------------------------------------------
2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend                           -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810, Date:10.11.2016 @ 13:25:34 CET)
2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend                           -  Current user: our_user
2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend                           -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation - 1.7/24.51-b03
2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend                           -  Maximum heap size: 1749 MiBytes
2017-03-28 10:43:57,363 INFO  org.apache.flink.client.CliFrontend                           -  JAVA_HOME: /usr/java/default
2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend                           -  Hadoop version: 2.3.0
2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend                           -  JVM Options:
2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend                           -     -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend                           -     -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend                           -     -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend                           -  Program Arguments:
2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend                           -     run
2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend                           -     -p
2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend                           -     5
2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend                           -     -c
2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend                           -     our.company.package.OurProgramClass
2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend                           -     /path/to/our/lib/our_program/lib/our_program.jar
2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend                           -     /path/to/our/lib/our_program/conf/our_program.properties
2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend                           -  Classpath: /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend                           - --------------------------------------------------------------------------------
2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend                           - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend                           - Trying to load configuration file
2017-03-28 10:43:57,664 INFO  org.apache.flink.client.CliFrontend                           - Running 'run' command.
2017-03-28 10:43:57,671 INFO  org.apache.flink.client.CliFrontend                           - Building program from JAR file
2017-03-28 10:43:57,827 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Found YARN properties file /tmp/.yarn-properties-our_user
2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Using Yarn application id from YARN properties application_1488884688139_2648
2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - YARN properties set default parallelism to 12
2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Found YARN properties file /tmp/.yarn-properties-our_user
2017-03-28 10:43:57,922 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Using Yarn application id from YARN properties application_1488884688139_2648
2017-03-28 10:43:57,922 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - YARN properties set default parallelism to 12
2017-03-28 10:43:58,046 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at ourserver/ourserver_ip:8050
2017-03-28 10:43:58,237 INFO  org.apache.flink.yarn.YarnClusterDescriptor                   - Found application JobManager host name 'ourserver' and port '36901' from supplied application id 'application_1488884688139_2648'
2017-03-28 10:43:58,246 INFO  org.apache.flink.client.CliFrontend                           - Cluster configuration: Yarn cluster with application id application_1488884688139_2648
2017-03-28 10:43:59,439 INFO  org.apache.flink.client.CliFrontend                           - Using address ourserver_ip:36901 to connect to JobManager.
2017-03-28 10:43:59,439 INFO  org.apache.flink.client.CliFrontend                           - JobManager web interface address http://ourserver:8088/proxy/application_1488884688139_2648/
2017-03-28 10:43:59,439 DEBUG org.apache.flink.client.CliFrontend                           - Client slots is set to -1
2017-03-28 10:43:59,440 DEBUG org.apache.flink.client.CliFrontend                           - Savepoint path is set to null
2017-03-28 10:43:59,440 DEBUG org.apache.flink.client.CliFrontend                           - User parallelism is set to 5
2017-03-28 10:43:59,440 INFO  org.apache.flink.client.CliFrontend                           - Starting execution of program
2017-03-28 10:43:59,440 INFO  org.apache.flink.yarn.YarnClusterClient                       - Starting program in interactive mode
2017-03-28 10:44:00,593 WARN  org.apache.hadoop.hdfs.BlockReaderLocal                       - The short-circuit local reads feature cannot be used because libhadoop cannot be loaded.
2017-03-28 10:44:01,672 INFO  org.apache.flink.yarn.YarnClusterClient                       - Waiting until all TaskManagers have connected
2017-03-28 10:44:02,702 INFO  org.apache.flink.yarn.YarnClusterClient                       - Starting client actor system.
2017-03-28 10:44:03,717 INFO  org.apache.flink.yarn.YarnClusterClient                       - TaskManager status (3/1)
2017-03-28 10:44:03,720 INFO  org.apache.flink.yarn.YarnClusterClient                       - All TaskManagers are connected
2017-03-28 10:44:04,736 INFO  org.apache.flink.yarn.YarnClusterClient                       - Submitting job with JobID: d33a2835c9c25881a0765c250bbceb7e. Waiting for job completion.
Connected to JobManager at Actor[akka.tcp://flink@ourserver_ip:36901/user/jobmanager#-429328340]
03/28/2017 10:44:06     Job execution switched to status RUNNING.


On 27.03.2017 15:24, Ufuk Celebi wrote:
What kind of state backend where you using for the checkpoints?

If there is a bug that prevents us from deleting the savepoint files
automatically, we can do a manual workaround and delete the
checkpoints files manually. With Flink 1.3 this becomes very straight
forward as savepoint data all go to a self contained directory that
can be deleted manually.

On Mon, Mar 27, 2017 at 12:46 PM, Stefan Richter
[hidden email] wrote:
Hi,

could you provide us with the log from the job client, with logging on debug
level for package org.apache.flink.client? Also, did you check if this
problem also exists in the latest bugfix release for your version (1.1.5) ?

Best,
Stefan


Am 27.03.2017 um 11:41 schrieb Konstantin Gregor
[hidden email]:

Hey everyone,

we are experiencing an issue in the disposal of savepoints in
Flink-1.1.3. We have a streaming job that has custom state (user objects
are part of the state). We create a savepoint:

$ flink savepoint <JOBID>
[...]
Savepoint completed. Path:
hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50
[...]

Then we want to simply dispose of that savepoint where we also provide
the jar to the job from which the savepoint was made:
$ flink savepoint -d
hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50 -j
/path/to/jar/application.jar

This gives us a ClassNotFoundException of our custom objects [1].

Adding our jar to the flink/lib directory is not an option for us,
things will break because of this.
Does anyone have an idea on how to proceed here?

Thanks and best regards,

Konstantin

[1]
java.io.IOException: Failed to dispose savepoint
hdfs:///bigdata/flink/savepoints/savepoint-20f064fb9f50.
        at
org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
        at
org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
        at
org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at
org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
        at
scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
        at
scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
        at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
        at
akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
        at
scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
        at
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
        at
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
        at
scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
        at
scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
Caused by: java.lang.ClassNotFoundException:
our.company.application.eventdata.EventDataRecord
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:270)
        at
org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
        at
java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
        at
java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
        at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
        at
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at
java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at
org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
        at sun.reflect.GeneratedMethodAccessor33.invoke(Unknown Source)
        at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at
java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at
java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at
java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
        at
org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
        at sun.reflect.GeneratedMethodAccessor32.invoke(Unknown Source)
        at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at
java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at
java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at
java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at java.util.HashMap.readObject(HashMap.java:1184)
        at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
        at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at
java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
        at
java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
        at
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at
java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
        at
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
        at
java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
        at
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
        at
java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
        at
org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
        at
org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
        at
org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
        at
org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
        at
org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
        at
org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)


--
Konstantin Gregor * [hidden email]
TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
Sitz: Unterföhring * Amtsgericht München * HRB 135082



-- 
Konstantin Gregor * [hidden email]
TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
Sitz: Unterföhring * Amtsgericht München * HRB 135082
Reply | Threaded
Open this post in threaded view
|

Re: ClassNotFoundException upon Savepoint Disposal

Ufuk Celebi
Thanks for providing the logs. It looks like the JARs are uploaded
(modulo any bugs ;-))... could you double check that the class is
actually part of the JAR and has not been moved around via

jar -tf <jarfile> | grep EventDataRecord


If everything looks good in the JAR, I could write a short tool that
prints the pointed file paths and you could manually delete the files
as a work around. Sorry for all the trouble with this.

In version >1.2 we don't need the user code any more to dispose savepoints.

– Ufuk


On Wed, Mar 29, 2017 at 11:50 AM, Konstantin Gregor
<[hidden email]> wrote:

> Hi Ufuk, hi Stefan,
>
>
> thanks a lot for your replies.
>
> Ufuk, we are using the HDFS state backend.
>
> Stefan, I installed 1.1.5 on our machines and built our software with the
> Flink 1.1.5 dependency, but the problem remains. Below are the logs for
> savepoint creation [1] and savepoint disposal [2] as well as the logs from
> the start of the job [3]. There were not many more log lines when I set
> org.apache.flink.client to DEBUG, so I set the whole package
> org.apache.flink to DEBUG in the hope of some findings. But I couldn't
> really find anything suspicious.
>
> Again, thanks a lot for your help!
>
> Best regards
>
>
> Konstantin
>
>
> [1]
>
> 2017-03-28 12:21:32,033 INFO  org.apache.flink.client.CliFrontend
> -
> --------------------------------------------------------------------------------
> 2017-03-28 12:21:32,034 INFO  org.apache.flink.client.CliFrontend
> -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810,
> Date:10.11.2016 @ 13:25:34 CET)
> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
> -  Current user: our_user
> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
> -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation -
> 1.7/24.51-b03
> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
> -  Maximum heap size: 1749 MiBytes
> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
> -  JAVA_HOME: /usr/java/default
> 2017-03-28 12:21:32,037 INFO  org.apache.flink.client.CliFrontend
> -  Hadoop version: 2.3.0
> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
> -  JVM Options:
> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
> -  Program Arguments:
> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
> -     savepoint
> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
> -     7e865198e220bea8a2203ebdb0827b6f
> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
> -     -j
> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
> -     /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar
> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
> -  Classpath:
> /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
> -
> --------------------------------------------------------------------------------
> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
> - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
> - Trying to load configuration file
> 2017-03-28 12:21:32,050 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: env.java.opts,
> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
> 2017-03-28 12:21:32,050 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.address, localhost
> 2017-03-28 12:21:32,050 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.port, 6123
> 2017-03-28 12:21:32,051 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.heap.mb, 256
> 2017-03-28 12:21:32,051 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.heap.mb, 512
> 2017-03-28 12:21:32,051 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.numberOfTaskSlots, 4
> 2017-03-28 12:21:32,051 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.memory.preallocate, false
> 2017-03-28 12:21:32,051 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: parallelism.default, 1
> 2017-03-28 12:21:32,051 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.port, 8081
> 2017-03-28 12:21:32,051 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.submit.enable, false
> 2017-03-28 12:21:32,052 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend, filesystem
> 2017-03-28 12:21:32,052 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend.fs.checkpointdir,
> hdfs://ourserver:8020/our_user/flink/state
> 2017-03-28 12:21:32,052 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.network.numberOfBuffers, 4096
> 2017-03-28 12:21:32,052 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
> 2017-03-28 12:21:32,052 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.mode, zookeeper
> 2017-03-28 12:21:32,052 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.quorum,
> ourserver:2181,ourserver2:2181,ourserver3:2181
> 2017-03-28 12:21:32,053 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.storageDir,
> hdfs:///our_user/flink/recovery
> 2017-03-28 12:21:32,053 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.root, flink
> 2017-03-28 12:21:32,053 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.namespace, yarn_session
> 2017-03-28 12:21:32,053 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
> 2017-03-28 12:21:32,053 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.session-timeout, 120000
> 2017-03-28 12:21:32,053 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.retry-wait, 5000
> 2017-03-28 12:21:32,053 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
> 2017-03-28 12:21:32,054 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.application-attempts, 10
> 2017-03-28 12:21:32,054 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.maximum-failed-containers, 80
> 2017-03-28 12:21:32,054 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.watch.heartbeat.interval, 50s
> 2017-03-28 12:21:32,055 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.log.lifecycle.events, true
> 2017-03-28 12:21:32,055 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.ask.timeout, 20s
> 2017-03-28 12:21:32,055 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend, filesystem
> 2017-03-28 12:21:32,055 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend.fs.dir,
> hdfs:///our_user/flink/savepoints
> 2017-03-28 12:21:32,281 INFO  org.apache.flink.client.CliFrontend
> - Running 'savepoint' command.
> 2017-03-28 12:21:32,287 INFO  org.apache.flink.client.CliFrontend
> - Retrieving JobManager.
> 2017-03-28 12:21:32,288 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Found YARN properties file /tmp/.yarn-properties-our_user
> 2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Using Yarn application id from YARN properties
> application_1488884688139_2648
> 2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - YARN properties set default parallelism to 12
> 2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Found YARN properties file /tmp/.yarn-properties-our_user
> 2017-03-28 12:21:32,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Using Yarn application id from YARN properties
> application_1488884688139_2648
> 2017-03-28 12:21:32,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - YARN properties set default parallelism to 12
> 2017-03-28 12:21:32,440 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: env.java.opts,
> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
> 2017-03-28 12:21:32,440 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.address, localhost
> 2017-03-28 12:21:32,440 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.port, 6123
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.heap.mb, 256
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.heap.mb, 512
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.numberOfTaskSlots, 4
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.memory.preallocate, false
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: parallelism.default, 1
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.port, 8081
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.submit.enable, false
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend, filesystem
> 2017-03-28 12:21:32,441 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend.fs.checkpointdir,
> hdfs://ourserver:8020/our_user/flink/state
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.network.numberOfBuffers, 4096
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.mode, zookeeper
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.quorum,
> ourserver:2181,ourserver:2181,ourserver:2181
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.storageDir,
> hdfs:///our_user/flink/recovery
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.root, flink
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.namespace, yarn_session
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
> 2017-03-28 12:21:32,442 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.session-timeout, 120000
> 2017-03-28 12:21:32,443 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.retry-wait, 5000
> 2017-03-28 12:21:32,443 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
> 2017-03-28 12:21:32,443 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.application-attempts, 10
> 2017-03-28 12:21:32,443 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.maximum-failed-containers, 80
> 2017-03-28 12:21:32,443 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.watch.heartbeat.interval, 50s
> 2017-03-28 12:21:32,444 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.log.lifecycle.events, true
> 2017-03-28 12:21:32,444 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.ask.timeout, 20s
> 2017-03-28 12:21:32,444 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend, filesystem
> 2017-03-28 12:21:32,444 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend.fs.dir,
> hdfs:///our_user/flink/savepoints
> 2017-03-28 12:21:32,541 INFO  org.apache.hadoop.yarn.client.RMProxy
> - Connecting to ResourceManager at ourserver/ourserver_ip:8050
> 2017-03-28 12:21:32,718 INFO  org.apache.flink.yarn.YarnClusterDescriptor
> - Found application JobManager host name 'ourserver' and port '36901' from
> supplied application id 'application_1488884688139_2648'
> 2017-03-28 12:21:32,732 INFO  org.apache.flink.runtime.util.ZooKeeperUtils
> - Using 'flink/yarn_session' as zookeeper namespace.
> 2017-03-28 12:21:32,831 INFO
> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
> - Starting
> 2017-03-28 12:21:32,832 DEBUG
> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  -
> Starting
> 2017-03-28 12:21:32,832 DEBUG
> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Starting
> 2017-03-28 12:21:32,833 DEBUG
> org.apache.flink.shaded.org.apache.curator.ConnectionState    - reset
> 2017-03-28 12:21:32,874 INFO
> org.apache.flink.shaded.org.apache.curator.framework.state.ConnectionStateManager
> - State change: CONNECTED
> 2017-03-28 12:21:33,891 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:21:33,906 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Leader node has changed.
> 2017-03-28 12:21:33,912 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> New leader information:
> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
> 2017-03-28 12:21:33,914 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Stopping ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:21:33,914 DEBUG
> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
> - Closing
> 2017-03-28 12:21:33,915 DEBUG
> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Closing
> 2017-03-28 12:21:33,915 DEBUG
> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Closing
> 2017-03-28 12:21:33,920 INFO  org.apache.flink.client.CliFrontend
> - Using address /ourserver_ip:36901 to connect to JobManager.
> 2017-03-28 12:21:33,926 INFO  org.apache.flink.yarn.YarnClusterClient
> - Starting client actor system.
> 2017-03-28 12:21:33,928 DEBUG org.apache.flink.runtime.net.ConnectionUtils
> - Trying to connect to (ourserver/ourserver_ip:36901) from local address
> ourserver/ourserver_ip with timeout 200
> 2017-03-28 12:21:33,931 DEBUG org.apache.flink.runtime.net.ConnectionUtils
> - Using InetAddress.getLocalHost() immediately for the connecting address
> 2017-03-28 12:21:34,673 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:21:34,677 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Leader node has changed.
> 2017-03-28 12:21:34,677 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> New leader information:
> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
> 2017-03-28 12:21:34,823 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Stopping ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:21:34,826 INFO  org.apache.flink.client.CliFrontend
> - Triggering savepoint for job 7e865198e220bea8a2203ebdb0827b6f.
> 2017-03-28 12:21:34,828 INFO  org.apache.flink.client.CliFrontend
> - Waiting for response...
> 2017-03-28 12:21:34,993 INFO  org.apache.flink.client.CliFrontend
> - Savepoint completed. Path:
> hdfs:/our_user/flink/savepoints/savepoint-77214a0f9902
> 2017-03-28 12:21:34,994 INFO  org.apache.flink.client.CliFrontend
> - You can resume your program from this savepoint with the run command.
> 2017-03-28 12:21:34,994 INFO  org.apache.flink.yarn.YarnClusterClient
> - Shutting down YarnClusterClient from the client shutdown hook
> 2017-03-28 12:21:34,994 INFO  org.apache.flink.yarn.YarnClusterClient
> - Disconnecting YarnClusterClient from ApplicationMaster
>
>
> [2]
>
> 2017-03-28 12:19:58,063 INFO  org.apache.flink.client.CliFrontend
> -
> --------------------------------------------------------------------------------
> 2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend
> -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810,
> Date:10.11.2016 @ 13:25:34 CET)
> 2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend
> -  Current user: our_user
> 2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend
> -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation -
> 1.7/24.51-b03
> 2017-03-28 12:19:58,065 INFO  org.apache.flink.client.CliFrontend
> -  Maximum heap size: 1749 MiBytes
> 2017-03-28 12:19:58,065 INFO  org.apache.flink.client.CliFrontend
> -  JAVA_HOME: /usr/java/default
> 2017-03-28 12:19:58,067 INFO  org.apache.flink.client.CliFrontend
> -  Hadoop version: 2.3.0
> 2017-03-28 12:19:58,067 INFO  org.apache.flink.client.CliFrontend
> -  JVM Options:
> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
> -  Program Arguments:
> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
> -     savepoint
> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
> -     -d
> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
> -     hdfs:/our_user/flink/savepoints/savepoint-d16441420a87
> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
> -     -j
> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
> -     /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar
> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
> -  Classpath:
> /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
> 2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend
> -
> --------------------------------------------------------------------------------
> 2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend
> - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
> 2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend
> - Trying to load configuration file
> 2017-03-28 12:19:58,085 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: env.java.opts,
> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
> 2017-03-28 12:19:58,085 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.address, localhost
> 2017-03-28 12:19:58,086 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.port, 6123
> 2017-03-28 12:19:58,086 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.heap.mb, 256
> 2017-03-28 12:19:58,086 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.heap.mb, 512
> 2017-03-28 12:19:58,086 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.numberOfTaskSlots, 4
> 2017-03-28 12:19:58,086 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.memory.preallocate, false
> 2017-03-28 12:19:58,086 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: parallelism.default, 1
> 2017-03-28 12:19:58,087 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.port, 8081
> 2017-03-28 12:19:58,087 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.submit.enable, false
> 2017-03-28 12:19:58,087 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend, filesystem
> 2017-03-28 12:19:58,087 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend.fs.checkpointdir,
> hdfs://ourserver:8020/our_user/flink/state
> 2017-03-28 12:19:58,087 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.network.numberOfBuffers, 4096
> 2017-03-28 12:19:58,087 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
> 2017-03-28 12:19:58,088 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.mode, zookeeper
> 2017-03-28 12:19:58,088 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.quorum,
> ourserver:2181,ourserver2:2181,ourserver3:2181
> 2017-03-28 12:19:58,088 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.storageDir,
> hdfs:///our_user/flink/recovery
> 2017-03-28 12:19:58,088 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.root, flink
> 2017-03-28 12:19:58,088 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.namespace, yarn_session
> 2017-03-28 12:19:58,088 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
> 2017-03-28 12:19:58,089 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.session-timeout, 120000
> 2017-03-28 12:19:58,089 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.retry-wait, 5000
> 2017-03-28 12:19:58,089 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
> 2017-03-28 12:19:58,089 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.application-attempts, 10
> 2017-03-28 12:19:58,089 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.maximum-failed-containers, 80
> 2017-03-28 12:19:58,090 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.watch.heartbeat.interval, 50s
> 2017-03-28 12:19:58,090 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.log.lifecycle.events, true
> 2017-03-28 12:19:58,090 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.ask.timeout, 20s
> 2017-03-28 12:19:58,090 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend, filesystem
> 2017-03-28 12:19:58,090 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend.fs.dir,
> hdfs:///our_user/flink/savepoints
> 2017-03-28 12:19:58,367 INFO  org.apache.flink.client.CliFrontend
> - Running 'savepoint' command.
> 2017-03-28 12:19:58,372 INFO  org.apache.flink.client.CliFrontend
> - Retrieving JobManager.
> 2017-03-28 12:19:58,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Found YARN properties file /tmp/.yarn-properties-our_user
> 2017-03-28 12:19:58,484 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Using Yarn application id from YARN properties
> application_1488884688139_2648
> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - YARN properties set default parallelism to 12
> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Found YARN properties file /tmp/.yarn-properties-our_user
> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Using Yarn application id from YARN properties
> application_1488884688139_2648
> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - YARN properties set default parallelism to 12
> 2017-03-28 12:19:58,604 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: env.java.opts,
> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
> 2017-03-28 12:19:58,604 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.address, localhost
> 2017-03-28 12:19:58,604 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.port, 6123
> 2017-03-28 12:19:58,604 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.heap.mb, 256
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.heap.mb, 512
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.numberOfTaskSlots, 4
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.memory.preallocate, false
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: parallelism.default, 1
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.port, 8081
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.web.submit.enable, false
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend, filesystem
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: state.backend.fs.checkpointdir,
> hdfs://ourserver:8020/our_user/flink/state
> 2017-03-28 12:19:58,605 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.network.numberOfBuffers, 4096
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.mode, zookeeper
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.quorum,
> ourserver:2181,ourserver:2181,ourserver:2181
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.storageDir,
> hdfs:///our_user/flink/recovery
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.root, flink
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.path.namespace, yarn_session
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.session-timeout, 120000
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.retry-wait, 5000
> 2017-03-28 12:19:58,606 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
> 2017-03-28 12:19:58,607 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.application-attempts, 10
> 2017-03-28 12:19:58,607 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.maximum-failed-containers, 80
> 2017-03-28 12:19:58,607 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.watch.heartbeat.interval, 50s
> 2017-03-28 12:19:58,607 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.log.lifecycle.events, true
> 2017-03-28 12:19:58,607 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: akka.ask.timeout, 20s
> 2017-03-28 12:19:58,608 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend, filesystem
> 2017-03-28 12:19:58,608 DEBUG
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: savepoints.state.backend.fs.dir,
> hdfs:///our_user/flink/savepoints
> 2017-03-28 12:19:58,685 INFO  org.apache.hadoop.yarn.client.RMProxy
> - Connecting to ResourceManager at ourserver/ourserver_ip:8050
> 2017-03-28 12:19:58,969 INFO  org.apache.flink.yarn.YarnClusterDescriptor
> - Found application JobManager host name 'ourserver' and port '36901' from
> supplied application id 'application_1488884688139_2648'
> 2017-03-28 12:19:58,989 INFO  org.apache.flink.runtime.util.ZooKeeperUtils
> - Using 'flink/yarn_session' as zookeeper namespace.
> 2017-03-28 12:19:59,114 INFO
> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
> - Starting
> 2017-03-28 12:19:59,115 DEBUG
> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  -
> Starting
> 2017-03-28 12:19:59,115 DEBUG
> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Starting
> 2017-03-28 12:19:59,115 DEBUG
> org.apache.flink.shaded.org.apache.curator.ConnectionState    - reset
> 2017-03-28 12:19:59,172 INFO
> org.apache.flink.shaded.org.apache.curator.framework.state.ConnectionStateManager
> - State change: CONNECTED
> 2017-03-28 12:20:00,212 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:20:00,229 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Leader node has changed.
> 2017-03-28 12:20:00,235 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> New leader information:
> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
> 2017-03-28 12:20:00,238 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Stopping ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:20:00,238 DEBUG
> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
> - Closing
> 2017-03-28 12:20:00,238 DEBUG
> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Closing
> 2017-03-28 12:20:00,239 DEBUG
> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Closing
> 2017-03-28 12:20:00,245 INFO  org.apache.flink.client.CliFrontend
> - Using address /ourserver_ip:36901 to connect to JobManager.
> 2017-03-28 12:20:00,245 INFO  org.apache.flink.runtime.util.ZooKeeperUtils
> - Using 'flink/yarn_session' as zookeeper namespace.
> 2017-03-28 12:20:00,252 INFO  org.apache.flink.yarn.YarnClusterClient
> - Starting client actor system.
> 2017-03-28 12:20:00,254 DEBUG org.apache.flink.runtime.net.ConnectionUtils
> - Trying to connect to (ourserver/ourserver_ip:36901) from local address
> ourserver/ourserver_ip with timeout 200
> 2017-03-28 12:20:00,259 DEBUG org.apache.flink.runtime.net.ConnectionUtils
> - Using InetAddress.getLocalHost() immediately for the connecting address
> 2017-03-28 12:20:01,209 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:20:01,213 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Leader node has changed.
> 2017-03-28 12:20:01,213 DEBUG
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> New leader information:
> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
> 2017-03-28 12:20:01,442 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Stopping ZooKeeperLeaderRetrievalService.
> 2017-03-28 12:20:01,446 INFO  org.apache.flink.client.CliFrontend
> - Disposing savepoint
> 'hdfs:/our_user/flink/savepoints/savepoint-d16441420a87' with JAR
> /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar.
> 2017-03-28 12:20:01,590 INFO  org.apache.flink.client.CliFrontend
> - Waiting for response...
> 2017-03-28 12:20:01,636 ERROR org.apache.flink.client.CliFrontend
> - Error while running the command.
> java.io.IOException: Failed to dispose savepoint
> hdfs:/our_user/flink/savepoints/savepoint-d16441420a87.
>         at
> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>         at
> scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
>         at
> scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
>         at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
>         at
> akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
>         at
> scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>         at
> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
>         at
> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
>         at
> scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>         at
> scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> Caused by: java.lang.ClassNotFoundException:
> our.company.eventdata.EventDataRecord
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
>         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
>         at java.lang.Class.forName0(Native Method)
>         at java.lang.Class.forName(Class.java:270)
>         at
> org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
>         at
> java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
>         at
> java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
>         at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>         at
> org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
>         at sun.reflect.GeneratedMethodAccessor51.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>         at
> org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
>         at sun.reflect.GeneratedMethodAccessor29.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>         at java.util.HashMap.readObject(HashMap.java:1184)
>         at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>         at
> org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
>         at
> org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
>         at
> org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
>         at
> org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
>         at
> org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
>         at
> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)
>         ... 12 more
> 2017-03-28 12:20:01,652 INFO  org.apache.flink.yarn.YarnClusterClient
> - Shutting down YarnClusterClient from the client shutdown hook
> 2017-03-28 12:20:01,653 INFO  org.apache.flink.yarn.YarnClusterClient
> - Disconnecting YarnClusterClient from ApplicationMaster
>
>
> [3]
>
> 2017-03-28 10:43:57,361 INFO  org.apache.flink.client.CliFrontend
> -
> --------------------------------------------------------------------------------
> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
> -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810,
> Date:10.11.2016 @ 13:25:34 CET)
> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
> -  Current user: our_user
> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
> -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation -
> 1.7/24.51-b03
> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
> -  Maximum heap size: 1749 MiBytes
> 2017-03-28 10:43:57,363 INFO  org.apache.flink.client.CliFrontend
> -  JAVA_HOME: /usr/java/default
> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
> -  Hadoop version: 2.3.0
> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
> -  JVM Options:
> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
> -
> -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
> -  Program Arguments:
> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
> -     run
> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
> -     -p
> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
> -     5
> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
> -     -c
> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
> -     our.company.package.OurProgramClass
> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
> -     /path/to/our/lib/our_program/lib/our_program.jar
> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
> -     /path/to/our/lib/our_program/conf/our_program.properties
> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
> -  Classpath:
> /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
> -
> --------------------------------------------------------------------------------
> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
> - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
> - Trying to load configuration file
> 2017-03-28 10:43:57,664 INFO  org.apache.flink.client.CliFrontend
> - Running 'run' command.
> 2017-03-28 10:43:57,671 INFO  org.apache.flink.client.CliFrontend
> - Building program from JAR file
> 2017-03-28 10:43:57,827 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Found YARN properties file /tmp/.yarn-properties-our_user
> 2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Using Yarn application id from YARN properties
> application_1488884688139_2648
> 2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - YARN properties set default parallelism to 12
> 2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Found YARN properties file /tmp/.yarn-properties-our_user
> 2017-03-28 10:43:57,922 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - Using Yarn application id from YARN properties
> application_1488884688139_2648
> 2017-03-28 10:43:57,922 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
> - YARN properties set default parallelism to 12
> 2017-03-28 10:43:58,046 INFO  org.apache.hadoop.yarn.client.RMProxy
> - Connecting to ResourceManager at ourserver/ourserver_ip:8050
> 2017-03-28 10:43:58,237 INFO  org.apache.flink.yarn.YarnClusterDescriptor
> - Found application JobManager host name 'ourserver' and port '36901' from
> supplied application id 'application_1488884688139_2648'
> 2017-03-28 10:43:58,246 INFO  org.apache.flink.client.CliFrontend
> - Cluster configuration: Yarn cluster with application id
> application_1488884688139_2648
> 2017-03-28 10:43:59,439 INFO  org.apache.flink.client.CliFrontend
> - Using address ourserver_ip:36901 to connect to JobManager.
> 2017-03-28 10:43:59,439 INFO  org.apache.flink.client.CliFrontend
> - JobManager web interface address
> http://ourserver:8088/proxy/application_1488884688139_2648/
> 2017-03-28 10:43:59,439 DEBUG org.apache.flink.client.CliFrontend
> - Client slots is set to -1
> 2017-03-28 10:43:59,440 DEBUG org.apache.flink.client.CliFrontend
> - Savepoint path is set to null
> 2017-03-28 10:43:59,440 DEBUG org.apache.flink.client.CliFrontend
> - User parallelism is set to 5
> 2017-03-28 10:43:59,440 INFO  org.apache.flink.client.CliFrontend
> - Starting execution of program
> 2017-03-28 10:43:59,440 INFO  org.apache.flink.yarn.YarnClusterClient
> - Starting program in interactive mode
> 2017-03-28 10:44:00,593 WARN  org.apache.hadoop.hdfs.BlockReaderLocal
> - The short-circuit local reads feature cannot be used because libhadoop
> cannot be loaded.
> 2017-03-28 10:44:01,672 INFO  org.apache.flink.yarn.YarnClusterClient
> - Waiting until all TaskManagers have connected
> 2017-03-28 10:44:02,702 INFO  org.apache.flink.yarn.YarnClusterClient
> - Starting client actor system.
> 2017-03-28 10:44:03,717 INFO  org.apache.flink.yarn.YarnClusterClient
> - TaskManager status (3/1)
> 2017-03-28 10:44:03,720 INFO  org.apache.flink.yarn.YarnClusterClient
> - All TaskManagers are connected
> 2017-03-28 10:44:04,736 INFO  org.apache.flink.yarn.YarnClusterClient
> - Submitting job with JobID: d33a2835c9c25881a0765c250bbceb7e. Waiting for
> job completion.
> Connected to JobManager at
> Actor[akka.tcp://flink@ourserver_ip:36901/user/jobmanager#-429328340]
> 03/28/2017 10:44:06     Job execution switched to status RUNNING.
>
>
> On 27.03.2017 15:24, Ufuk Celebi wrote:
>
> What kind of state backend where you using for the checkpoints?
>
> If there is a bug that prevents us from deleting the savepoint files
> automatically, we can do a manual workaround and delete the
> checkpoints files manually. With Flink 1.3 this becomes very straight
> forward as savepoint data all go to a self contained directory that
> can be deleted manually.
>
> On Mon, Mar 27, 2017 at 12:46 PM, Stefan Richter
> <[hidden email]> wrote:
>
> Hi,
>
> could you provide us with the log from the job client, with logging on debug
> level for package org.apache.flink.client? Also, did you check if this
> problem also exists in the latest bugfix release for your version (1.1.5) ?
>
> Best,
> Stefan
>
>
> Am 27.03.2017 um 11:41 schrieb Konstantin Gregor
> <[hidden email]>:
>
> Hey everyone,
>
> we are experiencing an issue in the disposal of savepoints in
> Flink-1.1.3. We have a streaming job that has custom state (user objects
> are part of the state). We create a savepoint:
>
> $ flink savepoint <JOBID>
> [...]
> Savepoint completed. Path:
> hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50
> [...]
>
> Then we want to simply dispose of that savepoint where we also provide
> the jar to the job from which the savepoint was made:
> $ flink savepoint -d
> hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50 -j
> /path/to/jar/application.jar
>
> This gives us a ClassNotFoundException of our custom objects [1].
>
> Adding our jar to the flink/lib directory is not an option for us,
> things will break because of this.
> Does anyone have an idea on how to proceed here?
>
> Thanks and best regards,
>
> Konstantin
>
> [1]
> java.io.IOException: Failed to dispose savepoint
> hdfs:///bigdata/flink/savepoints/savepoint-20f064fb9f50.
>         at
> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>         at
> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>         at
> scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
>         at
> scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
>         at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
>         at
> akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
>         at
> scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>         at
> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
>         at
> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
>         at
> scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>         at
> scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> Caused by: java.lang.ClassNotFoundException:
> our.company.application.eventdata.EventDataRecord
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
>         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
>         at java.lang.Class.forName0(Native Method)
>         at java.lang.Class.forName(Class.java:270)
>         at
> org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
>         at
> java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
>         at
> java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
>         at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>         at
> org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
>         at sun.reflect.GeneratedMethodAccessor33.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>         at
> org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
>         at sun.reflect.GeneratedMethodAccessor32.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>         at java.util.HashMap.readObject(HashMap.java:1184)
>         at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at
> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
>         at
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>         at
> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>         at
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>         at
> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>         at
> org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
>         at
> org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
>         at
> org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
>         at
> org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
>         at
> org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
>         at
> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)
>
>
> --
> Konstantin Gregor * [hidden email]
> TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
> Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
> Sitz: Unterföhring * Amtsgericht München * HRB 135082
>
>
>
> --
> Konstantin Gregor * [hidden email]
> TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
> Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
> Sitz: Unterföhring * Amtsgericht München * HRB 135082
Reply | Threaded
Open this post in threaded view
|

Re: ClassNotFoundException upon Savepoint Disposal

Konstantin Gregor
Hi Ufuk,

yes the class is part of the jar.
Actually, don't worry about this issue too much, we were just wondering
if we broke something. Since that does not seem to be the case, we will
deal with this until we have 1.2. So no need to write a tool, but thanks
for the offer :)

We want to upgrade to 1.2 in the near future anyway, until then this is
a problem we can deal with :)

So thanks a lot for your support!

Best regards

Konstantin

On 29.03.2017 16:05, Ufuk Celebi wrote:

> Thanks for providing the logs. It looks like the JARs are uploaded
> (modulo any bugs ;-))... could you double check that the class is
> actually part of the JAR and has not been moved around via
>
> jar -tf <jarfile> | grep EventDataRecord
>
>
> If everything looks good in the JAR, I could write a short tool that
> prints the pointed file paths and you could manually delete the files
> as a work around. Sorry for all the trouble with this.
>
> In version >1.2 we don't need the user code any more to dispose savepoints.
>
> – Ufuk
>
>
> On Wed, Mar 29, 2017 at 11:50 AM, Konstantin Gregor
> <[hidden email]> wrote:
>> Hi Ufuk, hi Stefan,
>>
>>
>> thanks a lot for your replies.
>>
>> Ufuk, we are using the HDFS state backend.
>>
>> Stefan, I installed 1.1.5 on our machines and built our software with the
>> Flink 1.1.5 dependency, but the problem remains. Below are the logs for
>> savepoint creation [1] and savepoint disposal [2] as well as the logs from
>> the start of the job [3]. There were not many more log lines when I set
>> org.apache.flink.client to DEBUG, so I set the whole package
>> org.apache.flink to DEBUG in the hope of some findings. But I couldn't
>> really find anything suspicious.
>>
>> Again, thanks a lot for your help!
>>
>> Best regards
>>
>>
>> Konstantin
>>
>>
>> [1]
>>
>> 2017-03-28 12:21:32,033 INFO  org.apache.flink.client.CliFrontend
>> -
>> --------------------------------------------------------------------------------
>> 2017-03-28 12:21:32,034 INFO  org.apache.flink.client.CliFrontend
>> -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810,
>> Date:10.11.2016 @ 13:25:34 CET)
>> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
>> -  Current user: our_user
>> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
>> -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation -
>> 1.7/24.51-b03
>> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
>> -  Maximum heap size: 1749 MiBytes
>> 2017-03-28 12:21:32,035 INFO  org.apache.flink.client.CliFrontend
>> -  JAVA_HOME: /usr/java/default
>> 2017-03-28 12:21:32,037 INFO  org.apache.flink.client.CliFrontend
>> -  Hadoop version: 2.3.0
>> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
>> -  JVM Options:
>> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
>> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
>> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
>> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
>> -  Program Arguments:
>> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
>> -     savepoint
>> 2017-03-28 12:21:32,038 INFO  org.apache.flink.client.CliFrontend
>> -     7e865198e220bea8a2203ebdb0827b6f
>> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
>> -     -j
>> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
>> -     /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar
>> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
>> -  Classpath:
>> /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
>> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
>> -
>> --------------------------------------------------------------------------------
>> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
>> - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
>> 2017-03-28 12:21:32,039 INFO  org.apache.flink.client.CliFrontend
>> - Trying to load configuration file
>> 2017-03-28 12:21:32,050 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: env.java.opts,
>> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
>> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
>> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
>> 2017-03-28 12:21:32,050 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.address, localhost
>> 2017-03-28 12:21:32,050 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.port, 6123
>> 2017-03-28 12:21:32,051 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.heap.mb, 256
>> 2017-03-28 12:21:32,051 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.heap.mb, 512
>> 2017-03-28 12:21:32,051 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.numberOfTaskSlots, 4
>> 2017-03-28 12:21:32,051 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.memory.preallocate, false
>> 2017-03-28 12:21:32,051 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: parallelism.default, 1
>> 2017-03-28 12:21:32,051 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.port, 8081
>> 2017-03-28 12:21:32,051 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.submit.enable, false
>> 2017-03-28 12:21:32,052 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend, filesystem
>> 2017-03-28 12:21:32,052 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend.fs.checkpointdir,
>> hdfs://ourserver:8020/our_user/flink/state
>> 2017-03-28 12:21:32,052 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.network.numberOfBuffers, 4096
>> 2017-03-28 12:21:32,052 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
>> 2017-03-28 12:21:32,052 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.mode, zookeeper
>> 2017-03-28 12:21:32,052 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.quorum,
>> ourserver:2181,ourserver2:2181,ourserver3:2181
>> 2017-03-28 12:21:32,053 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.storageDir,
>> hdfs:///our_user/flink/recovery
>> 2017-03-28 12:21:32,053 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.root, flink
>> 2017-03-28 12:21:32,053 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.namespace, yarn_session
>> 2017-03-28 12:21:32,053 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
>> 2017-03-28 12:21:32,053 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.session-timeout, 120000
>> 2017-03-28 12:21:32,053 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.retry-wait, 5000
>> 2017-03-28 12:21:32,053 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
>> 2017-03-28 12:21:32,054 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.application-attempts, 10
>> 2017-03-28 12:21:32,054 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.maximum-failed-containers, 80
>> 2017-03-28 12:21:32,054 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.watch.heartbeat.interval, 50s
>> 2017-03-28 12:21:32,055 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.log.lifecycle.events, true
>> 2017-03-28 12:21:32,055 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.ask.timeout, 20s
>> 2017-03-28 12:21:32,055 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend, filesystem
>> 2017-03-28 12:21:32,055 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend.fs.dir,
>> hdfs:///our_user/flink/savepoints
>> 2017-03-28 12:21:32,281 INFO  org.apache.flink.client.CliFrontend
>> - Running 'savepoint' command.
>> 2017-03-28 12:21:32,287 INFO  org.apache.flink.client.CliFrontend
>> - Retrieving JobManager.
>> 2017-03-28 12:21:32,288 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Found YARN properties file /tmp/.yarn-properties-our_user
>> 2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Using Yarn application id from YARN properties
>> application_1488884688139_2648
>> 2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - YARN properties set default parallelism to 12
>> 2017-03-28 12:21:32,372 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Found YARN properties file /tmp/.yarn-properties-our_user
>> 2017-03-28 12:21:32,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Using Yarn application id from YARN properties
>> application_1488884688139_2648
>> 2017-03-28 12:21:32,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - YARN properties set default parallelism to 12
>> 2017-03-28 12:21:32,440 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: env.java.opts,
>> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
>> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
>> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
>> 2017-03-28 12:21:32,440 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.address, localhost
>> 2017-03-28 12:21:32,440 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.port, 6123
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.heap.mb, 256
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.heap.mb, 512
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.numberOfTaskSlots, 4
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.memory.preallocate, false
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: parallelism.default, 1
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.port, 8081
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.submit.enable, false
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend, filesystem
>> 2017-03-28 12:21:32,441 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend.fs.checkpointdir,
>> hdfs://ourserver:8020/our_user/flink/state
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.network.numberOfBuffers, 4096
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.mode, zookeeper
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.quorum,
>> ourserver:2181,ourserver:2181,ourserver:2181
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.storageDir,
>> hdfs:///our_user/flink/recovery
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.root, flink
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.namespace, yarn_session
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
>> 2017-03-28 12:21:32,442 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.session-timeout, 120000
>> 2017-03-28 12:21:32,443 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.retry-wait, 5000
>> 2017-03-28 12:21:32,443 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
>> 2017-03-28 12:21:32,443 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.application-attempts, 10
>> 2017-03-28 12:21:32,443 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.maximum-failed-containers, 80
>> 2017-03-28 12:21:32,443 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.watch.heartbeat.interval, 50s
>> 2017-03-28 12:21:32,444 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.log.lifecycle.events, true
>> 2017-03-28 12:21:32,444 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.ask.timeout, 20s
>> 2017-03-28 12:21:32,444 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend, filesystem
>> 2017-03-28 12:21:32,444 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend.fs.dir,
>> hdfs:///our_user/flink/savepoints
>> 2017-03-28 12:21:32,541 INFO  org.apache.hadoop.yarn.client.RMProxy
>> - Connecting to ResourceManager at ourserver/ourserver_ip:8050
>> 2017-03-28 12:21:32,718 INFO  org.apache.flink.yarn.YarnClusterDescriptor
>> - Found application JobManager host name 'ourserver' and port '36901' from
>> supplied application id 'application_1488884688139_2648'
>> 2017-03-28 12:21:32,732 INFO  org.apache.flink.runtime.util.ZooKeeperUtils
>> - Using 'flink/yarn_session' as zookeeper namespace.
>> 2017-03-28 12:21:32,831 INFO
>> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
>> - Starting
>> 2017-03-28 12:21:32,832 DEBUG
>> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  -
>> Starting
>> 2017-03-28 12:21:32,832 DEBUG
>> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Starting
>> 2017-03-28 12:21:32,833 DEBUG
>> org.apache.flink.shaded.org.apache.curator.ConnectionState    - reset
>> 2017-03-28 12:21:32,874 INFO
>> org.apache.flink.shaded.org.apache.curator.framework.state.ConnectionStateManager
>> - State change: CONNECTED
>> 2017-03-28 12:21:33,891 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:21:33,906 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Leader node has changed.
>> 2017-03-28 12:21:33,912 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> New leader information:
>> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
>> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
>> 2017-03-28 12:21:33,914 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Stopping ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:21:33,914 DEBUG
>> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
>> - Closing
>> 2017-03-28 12:21:33,915 DEBUG
>> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Closing
>> 2017-03-28 12:21:33,915 DEBUG
>> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Closing
>> 2017-03-28 12:21:33,920 INFO  org.apache.flink.client.CliFrontend
>> - Using address /ourserver_ip:36901 to connect to JobManager.
>> 2017-03-28 12:21:33,926 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Starting client actor system.
>> 2017-03-28 12:21:33,928 DEBUG org.apache.flink.runtime.net.ConnectionUtils
>> - Trying to connect to (ourserver/ourserver_ip:36901) from local address
>> ourserver/ourserver_ip with timeout 200
>> 2017-03-28 12:21:33,931 DEBUG org.apache.flink.runtime.net.ConnectionUtils
>> - Using InetAddress.getLocalHost() immediately for the connecting address
>> 2017-03-28 12:21:34,673 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:21:34,677 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Leader node has changed.
>> 2017-03-28 12:21:34,677 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> New leader information:
>> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
>> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
>> 2017-03-28 12:21:34,823 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Stopping ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:21:34,826 INFO  org.apache.flink.client.CliFrontend
>> - Triggering savepoint for job 7e865198e220bea8a2203ebdb0827b6f.
>> 2017-03-28 12:21:34,828 INFO  org.apache.flink.client.CliFrontend
>> - Waiting for response...
>> 2017-03-28 12:21:34,993 INFO  org.apache.flink.client.CliFrontend
>> - Savepoint completed. Path:
>> hdfs:/our_user/flink/savepoints/savepoint-77214a0f9902
>> 2017-03-28 12:21:34,994 INFO  org.apache.flink.client.CliFrontend
>> - You can resume your program from this savepoint with the run command.
>> 2017-03-28 12:21:34,994 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Shutting down YarnClusterClient from the client shutdown hook
>> 2017-03-28 12:21:34,994 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Disconnecting YarnClusterClient from ApplicationMaster
>>
>>
>> [2]
>>
>> 2017-03-28 12:19:58,063 INFO  org.apache.flink.client.CliFrontend
>> -
>> --------------------------------------------------------------------------------
>> 2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend
>> -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810,
>> Date:10.11.2016 @ 13:25:34 CET)
>> 2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend
>> -  Current user: our_user
>> 2017-03-28 12:19:58,064 INFO  org.apache.flink.client.CliFrontend
>> -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation -
>> 1.7/24.51-b03
>> 2017-03-28 12:19:58,065 INFO  org.apache.flink.client.CliFrontend
>> -  Maximum heap size: 1749 MiBytes
>> 2017-03-28 12:19:58,065 INFO  org.apache.flink.client.CliFrontend
>> -  JAVA_HOME: /usr/java/default
>> 2017-03-28 12:19:58,067 INFO  org.apache.flink.client.CliFrontend
>> -  Hadoop version: 2.3.0
>> 2017-03-28 12:19:58,067 INFO  org.apache.flink.client.CliFrontend
>> -  JVM Options:
>> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
>> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
>> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
>> 2017-03-28 12:19:58,068 INFO  org.apache.flink.client.CliFrontend
>> -  Program Arguments:
>> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
>> -     savepoint
>> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
>> -     -d
>> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
>> -     hdfs:/our_user/flink/savepoints/savepoint-d16441420a87
>> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
>> -     -j
>> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
>> -     /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar
>> 2017-03-28 12:19:58,069 INFO  org.apache.flink.client.CliFrontend
>> -  Classpath:
>> /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
>> 2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend
>> -
>> --------------------------------------------------------------------------------
>> 2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend
>> - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
>> 2017-03-28 12:19:58,070 INFO  org.apache.flink.client.CliFrontend
>> - Trying to load configuration file
>> 2017-03-28 12:19:58,085 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: env.java.opts,
>> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
>> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
>> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
>> 2017-03-28 12:19:58,085 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.address, localhost
>> 2017-03-28 12:19:58,086 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.port, 6123
>> 2017-03-28 12:19:58,086 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.heap.mb, 256
>> 2017-03-28 12:19:58,086 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.heap.mb, 512
>> 2017-03-28 12:19:58,086 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.numberOfTaskSlots, 4
>> 2017-03-28 12:19:58,086 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.memory.preallocate, false
>> 2017-03-28 12:19:58,086 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: parallelism.default, 1
>> 2017-03-28 12:19:58,087 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.port, 8081
>> 2017-03-28 12:19:58,087 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.submit.enable, false
>> 2017-03-28 12:19:58,087 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend, filesystem
>> 2017-03-28 12:19:58,087 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend.fs.checkpointdir,
>> hdfs://ourserver:8020/our_user/flink/state
>> 2017-03-28 12:19:58,087 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.network.numberOfBuffers, 4096
>> 2017-03-28 12:19:58,087 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
>> 2017-03-28 12:19:58,088 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.mode, zookeeper
>> 2017-03-28 12:19:58,088 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.quorum,
>> ourserver:2181,ourserver2:2181,ourserver3:2181
>> 2017-03-28 12:19:58,088 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.storageDir,
>> hdfs:///our_user/flink/recovery
>> 2017-03-28 12:19:58,088 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.root, flink
>> 2017-03-28 12:19:58,088 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.namespace, yarn_session
>> 2017-03-28 12:19:58,088 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
>> 2017-03-28 12:19:58,089 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.session-timeout, 120000
>> 2017-03-28 12:19:58,089 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.retry-wait, 5000
>> 2017-03-28 12:19:58,089 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
>> 2017-03-28 12:19:58,089 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.application-attempts, 10
>> 2017-03-28 12:19:58,089 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.maximum-failed-containers, 80
>> 2017-03-28 12:19:58,090 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.watch.heartbeat.interval, 50s
>> 2017-03-28 12:19:58,090 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.log.lifecycle.events, true
>> 2017-03-28 12:19:58,090 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.ask.timeout, 20s
>> 2017-03-28 12:19:58,090 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend, filesystem
>> 2017-03-28 12:19:58,090 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend.fs.dir,
>> hdfs:///our_user/flink/savepoints
>> 2017-03-28 12:19:58,367 INFO  org.apache.flink.client.CliFrontend
>> - Running 'savepoint' command.
>> 2017-03-28 12:19:58,372 INFO  org.apache.flink.client.CliFrontend
>> - Retrieving JobManager.
>> 2017-03-28 12:19:58,373 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Found YARN properties file /tmp/.yarn-properties-our_user
>> 2017-03-28 12:19:58,484 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Using Yarn application id from YARN properties
>> application_1488884688139_2648
>> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - YARN properties set default parallelism to 12
>> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Found YARN properties file /tmp/.yarn-properties-our_user
>> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Using Yarn application id from YARN properties
>> application_1488884688139_2648
>> 2017-03-28 12:19:58,485 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - YARN properties set default parallelism to 12
>> 2017-03-28 12:19:58,604 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: env.java.opts,
>> -Djavax.net.ssl.trustStore=/path/to/our/cacerts
>> -XX:HeapDumpPath=/path/to/our/hadoop/yarn/log
>> -XX:+HeapDumpOnOutOfMemoryError -XX:MaxPermSize=192m
>> 2017-03-28 12:19:58,604 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.address, localhost
>> 2017-03-28 12:19:58,604 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.port, 6123
>> 2017-03-28 12:19:58,604 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.heap.mb, 256
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.heap.mb, 512
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.numberOfTaskSlots, 4
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.memory.preallocate, false
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: parallelism.default, 1
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.port, 8081
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.web.submit.enable, false
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend, filesystem
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: state.backend.fs.checkpointdir,
>> hdfs://ourserver:8020/our_user/flink/state
>> 2017-03-28 12:19:58,605 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.network.numberOfBuffers, 4096
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: fs.hdfs.hadoopconf, /etc/hadoop/conf/
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.mode, zookeeper
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.quorum,
>> ourserver:2181,ourserver:2181,ourserver:2181
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.storageDir,
>> hdfs:///our_user/flink/recovery
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.root, flink
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.path.namespace, yarn_session
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: reocvery.zookeeper.client.connection-timeout, 30000
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.session-timeout, 120000
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.retry-wait, 5000
>> 2017-03-28 12:19:58,606 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: recovery.zookeeper.client.max-retry-attempts, 5
>> 2017-03-28 12:19:58,607 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.application-attempts, 10
>> 2017-03-28 12:19:58,607 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.maximum-failed-containers, 80
>> 2017-03-28 12:19:58,607 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.watch.heartbeat.interval, 50s
>> 2017-03-28 12:19:58,607 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.log.lifecycle.events, true
>> 2017-03-28 12:19:58,607 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: akka.ask.timeout, 20s
>> 2017-03-28 12:19:58,608 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend, filesystem
>> 2017-03-28 12:19:58,608 DEBUG
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: savepoints.state.backend.fs.dir,
>> hdfs:///our_user/flink/savepoints
>> 2017-03-28 12:19:58,685 INFO  org.apache.hadoop.yarn.client.RMProxy
>> - Connecting to ResourceManager at ourserver/ourserver_ip:8050
>> 2017-03-28 12:19:58,969 INFO  org.apache.flink.yarn.YarnClusterDescriptor
>> - Found application JobManager host name 'ourserver' and port '36901' from
>> supplied application id 'application_1488884688139_2648'
>> 2017-03-28 12:19:58,989 INFO  org.apache.flink.runtime.util.ZooKeeperUtils
>> - Using 'flink/yarn_session' as zookeeper namespace.
>> 2017-03-28 12:19:59,114 INFO
>> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
>> - Starting
>> 2017-03-28 12:19:59,115 DEBUG
>> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  -
>> Starting
>> 2017-03-28 12:19:59,115 DEBUG
>> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Starting
>> 2017-03-28 12:19:59,115 DEBUG
>> org.apache.flink.shaded.org.apache.curator.ConnectionState    - reset
>> 2017-03-28 12:19:59,172 INFO
>> org.apache.flink.shaded.org.apache.curator.framework.state.ConnectionStateManager
>> - State change: CONNECTED
>> 2017-03-28 12:20:00,212 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:20:00,229 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Leader node has changed.
>> 2017-03-28 12:20:00,235 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> New leader information:
>> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
>> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
>> 2017-03-28 12:20:00,238 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Stopping ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:20:00,238 DEBUG
>> org.apache.flink.shaded.org.apache.curator.framework.imps.CuratorFrameworkImpl
>> - Closing
>> 2017-03-28 12:20:00,238 DEBUG
>> org.apache.flink.shaded.org.apache.curator.CuratorZookeeperClient  - Closing
>> 2017-03-28 12:20:00,239 DEBUG
>> org.apache.flink.shaded.org.apache.curator.ConnectionState    - Closing
>> 2017-03-28 12:20:00,245 INFO  org.apache.flink.client.CliFrontend
>> - Using address /ourserver_ip:36901 to connect to JobManager.
>> 2017-03-28 12:20:00,245 INFO  org.apache.flink.runtime.util.ZooKeeperUtils
>> - Using 'flink/yarn_session' as zookeeper namespace.
>> 2017-03-28 12:20:00,252 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Starting client actor system.
>> 2017-03-28 12:20:00,254 DEBUG org.apache.flink.runtime.net.ConnectionUtils
>> - Trying to connect to (ourserver/ourserver_ip:36901) from local address
>> ourserver/ourserver_ip with timeout 200
>> 2017-03-28 12:20:00,259 DEBUG org.apache.flink.runtime.net.ConnectionUtils
>> - Using InetAddress.getLocalHost() immediately for the connecting address
>> 2017-03-28 12:20:01,209 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:20:01,213 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Leader node has changed.
>> 2017-03-28 12:20:01,213 DEBUG
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> New leader information:
>> Leader=akka.tcp://flink@ourserver_ip:36901/user/jobmanager, session
>> ID=a3c337e5-1749-4c42-9949-0203bbae58d5.
>> 2017-03-28 12:20:01,442 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Stopping ZooKeeperLeaderRetrievalService.
>> 2017-03-28 12:20:01,446 INFO  org.apache.flink.client.CliFrontend
>> - Disposing savepoint
>> 'hdfs:/our_user/flink/savepoints/savepoint-d16441420a87' with JAR
>> /path/to/our/lib/our_program/lib/our_program-6.2.6-SNAPSHOT-all.jar.
>> 2017-03-28 12:20:01,590 INFO  org.apache.flink.client.CliFrontend
>> - Waiting for response...
>> 2017-03-28 12:20:01,636 ERROR org.apache.flink.client.CliFrontend
>> - Error while running the command.
>> java.io.IOException: Failed to dispose savepoint
>> hdfs:/our_user/flink/savepoints/savepoint-d16441420a87.
>>         at
>> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
>>         at
>> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
>>         at
>> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>>         at
>> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>>         at
>> scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
>>         at
>> scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
>>         at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
>>         at
>> akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
>>         at
>> scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>>         at
>> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
>>         at
>> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
>>         at
>> scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>>         at
>> scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
>> Caused by: java.lang.ClassNotFoundException:
>> our.company.eventdata.EventDataRecord
>>         at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
>>         at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
>>         at java.security.AccessController.doPrivileged(Native Method)
>>         at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
>>         at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
>>         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
>>         at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
>>         at java.lang.Class.forName0(Native Method)
>>         at java.lang.Class.forName(Class.java:270)
>>         at
>> org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
>>         at
>> java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
>>         at
>> java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
>>         at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>>         at
>> org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
>>         at sun.reflect.GeneratedMethodAccessor51.invoke(Unknown Source)
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>         at java.lang.reflect.Method.invoke(Method.java:606)
>>         at
>> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>>         at
>> org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
>>         at sun.reflect.GeneratedMethodAccessor29.invoke(Unknown Source)
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>         at java.lang.reflect.Method.invoke(Method.java:606)
>>         at
>> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>>         at java.util.HashMap.readObject(HashMap.java:1184)
>>         at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>         at java.lang.reflect.Method.invoke(Method.java:606)
>>         at
>> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>>         at
>> org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
>>         at
>> org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
>>         at
>> org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
>>         at
>> org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
>>         at
>> org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
>>         at
>> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)
>>         ... 12 more
>> 2017-03-28 12:20:01,652 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Shutting down YarnClusterClient from the client shutdown hook
>> 2017-03-28 12:20:01,653 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Disconnecting YarnClusterClient from ApplicationMaster
>>
>>
>> [3]
>>
>> 2017-03-28 10:43:57,361 INFO  org.apache.flink.client.CliFrontend
>> -
>> --------------------------------------------------------------------------------
>> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
>> -  Starting Command Line Client (Version: 1.1.3, Rev:a56d810,
>> Date:10.11.2016 @ 13:25:34 CET)
>> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
>> -  Current user: our_user
>> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
>> -  JVM: Java HotSpot(TM) 64-Bit Server VM - Oracle Corporation -
>> 1.7/24.51-b03
>> 2017-03-28 10:43:57,362 INFO  org.apache.flink.client.CliFrontend
>> -  Maximum heap size: 1749 MiBytes
>> 2017-03-28 10:43:57,363 INFO  org.apache.flink.client.CliFrontend
>> -  JAVA_HOME: /usr/java/default
>> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
>> -  Hadoop version: 2.3.0
>> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
>> -  JVM Options:
>> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlog.file=/path/to/our/lib/flink-1.1.3/log/flink-our_user-client-ourserver.log
>> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlog4j.configuration=file:/path/to/our/lib/flink-1.1.3/conf/log4j-cli.properties
>> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
>> -
>> -Dlogback.configurationFile=file:/path/to/our/lib/flink-1.1.3/conf/logback.xml
>> 2017-03-28 10:43:57,365 INFO  org.apache.flink.client.CliFrontend
>> -  Program Arguments:
>> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
>> -     run
>> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
>> -     -p
>> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
>> -     5
>> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
>> -     -c
>> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
>> -     our.company.package.OurProgramClass
>> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
>> -     /path/to/our/lib/our_program/lib/our_program.jar
>> 2017-03-28 10:43:57,366 INFO  org.apache.flink.client.CliFrontend
>> -     /path/to/our/lib/our_program/conf/our_program.properties
>> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
>> -  Classpath:
>> /path/to/our/lib/flink-1.1.3/lib/flink-dist_2.10-1.1.3.1.jar:/path/to/our/lib/flink-1.1.3/lib/flink-python_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/flink-reporter-1.0.2-20161206.140111-118.jar:/path/to/our/lib/flink-1.1.3/lib/flink-table_2.10-1.1.3.jar:/path/to/our/lib/flink-1.1.3/lib/log4j-1.2.17.jar:/path/to/our/lib/flink-1.1.3/lib/ojdbc6-11.2.0.3.jar:/path/to/our/lib/flink-1.1.3/lib/slf4j-log4j12-1.7.7.jar::/etc/hadoop/conf:
>> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
>> -
>> --------------------------------------------------------------------------------
>> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
>> - Using configuration directory /path/to/our/lib/flink-1.1.3/conf
>> 2017-03-28 10:43:57,367 INFO  org.apache.flink.client.CliFrontend
>> - Trying to load configuration file
>> 2017-03-28 10:43:57,664 INFO  org.apache.flink.client.CliFrontend
>> - Running 'run' command.
>> 2017-03-28 10:43:57,671 INFO  org.apache.flink.client.CliFrontend
>> - Building program from JAR file
>> 2017-03-28 10:43:57,827 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Found YARN properties file /tmp/.yarn-properties-our_user
>> 2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Using Yarn application id from YARN properties
>> application_1488884688139_2648
>> 2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - YARN properties set default parallelism to 12
>> 2017-03-28 10:43:57,921 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Found YARN properties file /tmp/.yarn-properties-our_user
>> 2017-03-28 10:43:57,922 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - Using Yarn application id from YARN properties
>> application_1488884688139_2648
>> 2017-03-28 10:43:57,922 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli
>> - YARN properties set default parallelism to 12
>> 2017-03-28 10:43:58,046 INFO  org.apache.hadoop.yarn.client.RMProxy
>> - Connecting to ResourceManager at ourserver/ourserver_ip:8050
>> 2017-03-28 10:43:58,237 INFO  org.apache.flink.yarn.YarnClusterDescriptor
>> - Found application JobManager host name 'ourserver' and port '36901' from
>> supplied application id 'application_1488884688139_2648'
>> 2017-03-28 10:43:58,246 INFO  org.apache.flink.client.CliFrontend
>> - Cluster configuration: Yarn cluster with application id
>> application_1488884688139_2648
>> 2017-03-28 10:43:59,439 INFO  org.apache.flink.client.CliFrontend
>> - Using address ourserver_ip:36901 to connect to JobManager.
>> 2017-03-28 10:43:59,439 INFO  org.apache.flink.client.CliFrontend
>> - JobManager web interface address
>> http://ourserver:8088/proxy/application_1488884688139_2648/
>> 2017-03-28 10:43:59,439 DEBUG org.apache.flink.client.CliFrontend
>> - Client slots is set to -1
>> 2017-03-28 10:43:59,440 DEBUG org.apache.flink.client.CliFrontend
>> - Savepoint path is set to null
>> 2017-03-28 10:43:59,440 DEBUG org.apache.flink.client.CliFrontend
>> - User parallelism is set to 5
>> 2017-03-28 10:43:59,440 INFO  org.apache.flink.client.CliFrontend
>> - Starting execution of program
>> 2017-03-28 10:43:59,440 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Starting program in interactive mode
>> 2017-03-28 10:44:00,593 WARN  org.apache.hadoop.hdfs.BlockReaderLocal
>> - The short-circuit local reads feature cannot be used because libhadoop
>> cannot be loaded.
>> 2017-03-28 10:44:01,672 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Waiting until all TaskManagers have connected
>> 2017-03-28 10:44:02,702 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Starting client actor system.
>> 2017-03-28 10:44:03,717 INFO  org.apache.flink.yarn.YarnClusterClient
>> - TaskManager status (3/1)
>> 2017-03-28 10:44:03,720 INFO  org.apache.flink.yarn.YarnClusterClient
>> - All TaskManagers are connected
>> 2017-03-28 10:44:04,736 INFO  org.apache.flink.yarn.YarnClusterClient
>> - Submitting job with JobID: d33a2835c9c25881a0765c250bbceb7e. Waiting for
>> job completion.
>> Connected to JobManager at
>> Actor[akka.tcp://flink@ourserver_ip:36901/user/jobmanager#-429328340]
>> 03/28/2017 10:44:06     Job execution switched to status RUNNING.
>>
>>
>> On 27.03.2017 15:24, Ufuk Celebi wrote:
>>
>> What kind of state backend where you using for the checkpoints?
>>
>> If there is a bug that prevents us from deleting the savepoint files
>> automatically, we can do a manual workaround and delete the
>> checkpoints files manually. With Flink 1.3 this becomes very straight
>> forward as savepoint data all go to a self contained directory that
>> can be deleted manually.
>>
>> On Mon, Mar 27, 2017 at 12:46 PM, Stefan Richter
>> <[hidden email]> wrote:
>>
>> Hi,
>>
>> could you provide us with the log from the job client, with logging on debug
>> level for package org.apache.flink.client? Also, did you check if this
>> problem also exists in the latest bugfix release for your version (1.1.5) ?
>>
>> Best,
>> Stefan
>>
>>
>> Am 27.03.2017 um 11:41 schrieb Konstantin Gregor
>> <[hidden email]>:
>>
>> Hey everyone,
>>
>> we are experiencing an issue in the disposal of savepoints in
>> Flink-1.1.3. We have a streaming job that has custom state (user objects
>> are part of the state). We create a savepoint:
>>
>> $ flink savepoint <JOBID>
>> [...]
>> Savepoint completed. Path:
>> hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50
>> [...]
>>
>> Then we want to simply dispose of that savepoint where we also provide
>> the jar to the job from which the savepoint was made:
>> $ flink savepoint -d
>> hdfs:/bigdata/flink/savepoints/savepoint-20f064fb9f50 -j
>> /path/to/jar/application.jar
>>
>> This gives us a ClassNotFoundException of our custom objects [1].
>>
>> Adding our jar to the flink/lib directory is not an option for us,
>> things will break because of this.
>> Does anyone have an idea on how to proceed here?
>>
>> Thanks and best regards,
>>
>> Konstantin
>>
>> [1]
>> java.io.IOException: Failed to dispose savepoint
>> hdfs:///bigdata/flink/savepoints/savepoint-20f064fb9f50.
>>         at
>> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:163)
>>         at
>> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply$mcV$sp(JobManager.scala:745)
>>         at
>> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>>         at
>> org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anonfun$applyOrElse$7.apply(JobManager.scala:727)
>>         at
>> scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
>>         at
>> scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
>>         at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)
>>         at
>> akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401)
>>         at
>> scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>>         at
>> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
>>         at
>> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
>>         at
>> scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>>         at
>> scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
>> Caused by: java.lang.ClassNotFoundException:
>> our.company.application.eventdata.EventDataRecord
>>         at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
>>         at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
>>         at java.security.AccessController.doPrivileged(Native Method)
>>         at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
>>         at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
>>         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
>>         at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
>>         at java.lang.Class.forName0(Native Method)
>>         at java.lang.Class.forName(Class.java:270)
>>         at
>> org.apache.flink.util.InstantiationUtil$ClassLoaderObjectInputStream.resolveClass(InstantiationUtil.java:65)
>>         at
>> java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1612)
>>         at
>> java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1517)
>>         at java.io.ObjectInputStream.readClass(ObjectInputStream.java:1483)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1333)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>>         at
>> org.apache.flink.api.java.typeutils.runtime.PojoSerializer.readObject(PojoSerializer.java:131)
>>         at sun.reflect.GeneratedMethodAccessor33.invoke(Unknown Source)
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>         at java.lang.reflect.Method.invoke(Method.java:606)
>>         at
>> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.defaultReadObject(ObjectInputStream.java:500)
>>         at
>> org.apache.flink.api.common.state.StateDescriptor.readObject(StateDescriptor.java:268)
>>         at sun.reflect.GeneratedMethodAccessor32.invoke(Unknown Source)
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>         at java.lang.reflect.Method.invoke(Method.java:606)
>>         at
>> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>>         at java.util.HashMap.readObject(HashMap.java:1184)
>>         at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>         at java.lang.reflect.Method.invoke(Method.java:606)
>>         at
>> java.io.ObjectStreamClass.invokeReadObject(ObjectStreamClass.java:1017)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1893)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1706)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1344)
>>         at
>> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1990)
>>         at
>> java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1915)
>>         at
>> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1798)
>>         at
>> java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1350)
>>         at java.io.ObjectInputStream.readObject(ObjectInputStream.java:370)
>>         at
>> org.apache.flink.util.InstantiationUtil.deserializeObject(InstantiationUtil.java:291)
>>         at
>> org.apache.flink.util.SerializedValue.deserializeValue(SerializedValue.java:58)
>>         at
>> org.apache.flink.runtime.checkpoint.SubtaskState.discard(SubtaskState.java:85)
>>         at
>> org.apache.flink.runtime.checkpoint.TaskState.discard(TaskState.java:147)
>>         at
>> org.apache.flink.runtime.checkpoint.savepoint.SavepointV0.dispose(SavepointV0.java:66)
>>         at
>> org.apache.flink.runtime.checkpoint.savepoint.FsSavepointStore.disposeSavepoint(FsSavepointStore.java:151)
>>
>>
>> --
>> Konstantin Gregor * [hidden email]
>> TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
>> Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
>> Sitz: Unterföhring * Amtsgericht München * HRB 135082
>>
>>
>>
>> --
>> Konstantin Gregor * [hidden email]
>> TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
>> Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
>> Sitz: Unterföhring * Amtsgericht München * HRB 135082

--
Konstantin Gregor * [hidden email] * +49-174-3410311
TNG Technology Consulting GmbH, Betastr. 13a, 85774 Unterföhring
Geschäftsführer: Henrik Klagges, Christoph Stock, Dr. Robert Dahlke
Sitz: Unterföhring * Amtsgericht München * HRB 135082