Hi all,
I'm reading a large number of small files from HDFS in batch mode (about 20 directories, each containing about 3000 files, using recursive.file.enumeration=true). Each time, after about 200 GB of received data, my job fails with the following exception:

java.io.IOException: Error opening the Input Split hdfs:///filepath/filename.csv.gz [0,-1]: Could not obtain block: BP-812793611-127.0.0.1-1455882335652:blk_1075977174_2237313 file=/filepath/filename.csv.gz
    at org.apache.flink.api.common.io.FileInputFormat.open(FileInputFormat.java:693)
    at org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:424)
    at org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:47)
    at org.apache.flink.runtime.operators.DataSourceTask.invoke(DataSourceTask.java:140)
    at org.apache.flink.runtime.taskmanager.Task.run(Task.java:584)
    at java.lang.Thread.run(Unknown Source)
Caused by: org.apache.hadoop.hdfs.BlockMissingException: Could not obtain block: BP-812793611-127.0.0.1-1455882335652:blk_1075977174_2237313 file=/filepath/filename.csv.gz
    at org.apache.hadoop.hdfs.DFSInputStream.chooseDataNode(DFSInputStream.java:984)
    at org.apache.hadoop.hdfs.DFSInputStream.blockSeekTo(DFSInputStream.java:642)
    at org.apache.hadoop.hdfs.DFSInputStream.readWithStrategy(DFSInputStream.java:882)
    at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:934)
    at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:735)
    at java.io.FilterInputStream.read(Unknown Source)
    at org.apache.flink.runtime.fs.hdfs.HadoopDataInputStream.read(HadoopDataInputStream.java:59)
    at java.util.zip.CheckedInputStream.read(Unknown Source)
    at java.util.zip.GZIPInputStream.readUByte(Unknown Source)
    at java.util.zip.GZIPInputStream.readUShort(Unknown Source)
    at java.util.zip.GZIPInputStream.readHeader(Unknown Source)
    at java.util.zip.GZIPInputStream.<init>(Unknown Source)
    at java.util.zip.GZIPInputStream.<init>(Unknown Source)
    at org.apache.flink.api.common.io.compression.GzipInflaterInputStreamFactory.create(GzipInflaterInputStreamFactory.java:44)
    at org.apache.flink.api.common.io.compression.GzipInflaterInputStreamFactory.create(GzipInflaterInputStreamFactory.java:31)
    at org.apache.flink.api.common.io.FileInputFormat.decorateInputStream(FileInputFormat.java:717)
    at org.apache.flink.api.common.io.FileInputFormat.open(FileInputFormat.java:689)
    ... 5 more

I checked the file each time, and it exists and is healthy. Looking at the taskmanager logs, I found the following exceptions, which suggest it is running out of connections:

2016-10-15 18:20:27,034 WARN org.apache.hadoop.hdfs.BlockReaderFactory - I/O error constructing remote block reader.
java.net.SocketException: No buffer space available (maximum connections reached?): connect
    at sun.nio.ch.Net.connect0(Native Method)
    at sun.nio.ch.Net.connect(Unknown Source)
    at sun.nio.ch.Net.connect(Unknown Source)
    at sun.nio.ch.SocketChannelImpl.connect(Unknown Source)
    at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:192)
    at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:531)
    at org.apache.hadoop.hdfs.DFSClient.newConnectedPeer(DFSClient.java:3436)
    at org.apache.hadoop.hdfs.BlockReaderFactory.nextTcpPeer(BlockReaderFactory.java:777)
    at org.apache.hadoop.hdfs.BlockReaderFactory.getRemoteBlockReaderFromTcp(BlockReaderFactory.java:694)
    at org.apache.hadoop.hdfs.BlockReaderFactory.build(BlockReaderFactory.java:355)
    at org.apache.hadoop.hdfs.DFSInputStream.blockSeekTo(DFSInputStream.java:673)
    at org.apache.hadoop.hdfs.DFSInputStream.readWithStrategy(DFSInputStream.java:882)
    at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:934)
    at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:735)
    at java.io.FilterInputStream.read(Unknown Source)
    at org.apache.flink.runtime.fs.hdfs.HadoopDataInputStream.read(HadoopDataInputStream.java:59)
    at java.util.zip.CheckedInputStream.read(Unknown Source)
    at java.util.zip.GZIPInputStream.readUByte(Unknown Source)
    at java.util.zip.GZIPInputStream.readUShort(Unknown Source)
    at java.util.zip.GZIPInputStream.readHeader(Unknown Source)
    at java.util.zip.GZIPInputStream.<init>(Unknown Source)
    at java.util.zip.GZIPInputStream.<init>(Unknown Source)
    at org.apache.flink.api.common.io.compression.GzipInflaterInputStreamFactory.create(GzipInflaterInputStreamFactory.java:44)
    at org.apache.flink.api.common.io.compression.GzipInflaterInputStreamFactory.create(GzipInflaterInputStreamFactory.java:31)
    at org.apache.flink.api.common.io.FileInputFormat.decorateInputStream(FileInputFormat.java:717)
    at org.apache.flink.api.common.io.FileInputFormat.open(FileInputFormat.java:689)
    at org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:424)
    at org.myorg.quickstart.MyTextInputFormat.open(MyTextInputFormat.java:99)
    at org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:47)
    at org.apache.flink.runtime.operators.DataSourceTask.invoke(DataSourceTask.java:140)
    at org.apache.flink.runtime.taskmanager.Task.run(Task.java:584)
    at java.lang.Thread.run(Unknown Source)

2016-10-15 18:20:27,034 WARN org.apache.hadoop.hdfs.DFSClient - Failed to connect to /x.x.x.x:50010 for block, add to deadNodes and continue.
java.net.SocketException: No buffer space available (maximum connections reached?): connect
    at sun.nio.ch.Net.connect0(Native Method)
    at sun.nio.ch.Net.connect(Unknown Source)
    at sun.nio.ch.Net.connect(Unknown Source)
    at sun.nio.ch.SocketChannelImpl.connect(Unknown Source)
    at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:192)
    at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:531)
    at org.apache.hadoop.hdfs.DFSClient.newConnectedPeer(DFSClient.java:3436)
    at org.apache.hadoop.hdfs.BlockReaderFactory.nextTcpPeer(BlockReaderFactory.java:777)
    at org.apache.hadoop.hdfs.BlockReaderFactory.getRemoteBlockReaderFromTcp(BlockReaderFactory.java:694)
    at org.apache.hadoop.hdfs.BlockReaderFactory.build(BlockReaderFactory.java:355)
    at org.apache.hadoop.hdfs.DFSInputStream.blockSeekTo(DFSInputStream.java:673)
    at org.apache.hadoop.hdfs.DFSInputStream.readWithStrategy(DFSInputStream.java:882)
    at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:934)
    at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:735)
    at java.io.FilterInputStream.read(Unknown Source)
    at org.apache.flink.runtime.fs.hdfs.HadoopDataInputStream.read(HadoopDataInputStream.java:59)
    at java.util.zip.CheckedInputStream.read(Unknown Source)
    at java.util.zip.GZIPInputStream.readUByte(Unknown Source)
    at java.util.zip.GZIPInputStream.readUShort(Unknown Source)
    at java.util.zip.GZIPInputStream.readHeader(Unknown Source)
    at java.util.zip.GZIPInputStream.<init>(Unknown Source)
    at java.util.zip.GZIPInputStream.<init>(Unknown Source)
    at org.apache.flink.api.common.io.compression.GzipInflaterInputStreamFactory.create(GzipInflaterInputStreamFactory.java:44)
    at org.apache.flink.api.common.io.compression.GzipInflaterInputStreamFactory.create(GzipInflaterInputStreamFactory.java:31)
    at org.apache.flink.api.common.io.FileInputFormat.decorateInputStream(FileInputFormat.java:717)
    at org.apache.flink.api.common.io.FileInputFormat.open(FileInputFormat.java:689)
    at org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:424)
    at org.myorg.quickstart.MyTextInputFormat.open(MyTextInputFormat.java:99)
    at org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:47)
    at org.apache.flink.runtime.operators.DataSourceTask.invoke(DataSourceTask.java:140)
    at org.apache.flink.runtime.taskmanager.Task.run(Task.java:584)
    at java.lang.Thread.run(Unknown Source)

I inspected the open connections and found that the job opens a very large number of connections that remain stuck in the CLOSE_WAIT state, which I guess exhausts the ephemeral port space after some time.

I'm running Flink 1.1.2 on Windows 10 (1 node with 1 TaskManager) with a parallelism of 8. I got the same exception even with the job parallelism set to 1, and it also happened after upgrading to Flink 1.1.3.

Any idea what the root cause of the problem could be and how to solve it?

Thank you.

Best,
Yassine
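For context, a batch source with recursive file enumeration of the kind described above is typically declared as in the following minimal sketch. This is a hypothetical example with a placeholder path; the actual job code was not posted in the thread.

import org.apache.flink.api.java.DataSet;
import org.apache.flink.api.java.ExecutionEnvironment;
import org.apache.flink.configuration.Configuration;

public class RecursiveGzipReadJob {

    public static void main(String[] args) throws Exception {
        ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();

        // recursive.file.enumeration makes the input format descend into
        // subdirectories instead of reading only the top-level directory.
        Configuration parameters = new Configuration();
        parameters.setBoolean("recursive.file.enumeration", true);

        // Flink selects the gzip decompressor from the .gz file extension.
        // "hdfs:///filepath" is a placeholder for the parent directory.
        DataSet<String> lines = env
                .readTextFile("hdfs:///filepath")
                .withParameters(parameters);

        lines.first(10).print();
    }
}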
Hi!
This looks to me like the following problem: the decompression streams did not properly forward the "close()" calls. The fix is in this pull request: https://github.com/apache/flink/pull/2581. It is in the latest 1.2-SNAPSHOT, but did not make it into version 1.1.3.

I have pushed the fix into the latest 1.1-SNAPSHOT branch. If you get the code via "git clone -b release-1.1 https://github.com/apache/flink.git", you will get the code that is the same as the 1.1.3 release, plus the patch for this problem.

Greetings,
Stephan

On Sat, Oct 15, 2016 at 10:11 PM, Yassine MARZOUGUI <[hidden email]> wrote:
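For readers hitting this later, the leak pattern behind the fix looks roughly like the sketch below: a decompression wrapper that does not forward close() keeps the underlying HDFS connection open. This is only an illustrative example with made-up class and field names, not the actual Flink code from the pull request.

import java.io.IOException;
import java.io.InputStream;

// Illustrative only -- not the Flink implementation. A decompression wrapper
// must forward close() to the stream it decorates; otherwise every finished
// split leaves its DataNode socket in CLOSE_WAIT until the ephemeral port
// space is exhausted.
class ClosingDecompressionStream extends InputStream {

    private final InputStream hdfsStream;  // the raw HDFS input stream
    private final InputStream inflater;    // e.g. a GZIPInputStream wrapping it

    ClosingDecompressionStream(InputStream hdfsStream, InputStream inflater) {
        this.hdfsStream = hdfsStream;
        this.inflater = inflater;
    }

    @Override
    public int read() throws IOException {
        return inflater.read();
    }

    @Override
    public void close() throws IOException {
        try {
            inflater.close();
        } finally {
            hdfsStream.close();  // the essential part: release the socket
        }
    }
}

With the patched release-1.1 branch checked out, a standard Maven build (for example "mvn clean install -DskipTests") should produce a build equivalent to 1.1.3 plus the fix.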
That solved my problem, thank you!

Best,
Yassine

2016-10-16 19:18 GMT+02:00 Stephan Ewen <[hidden email]>:
Happy to hear it!

On Mon, Oct 17, 2016 at 9:31 AM, Yassine MARZOUGUI <[hidden email]> wrote: