Re: Graph Analytics on HBase With HGraphDB and Apache Flink Gelly
Posted by
Jörn Franke on
URL: http://deprecated-apache-flink-user-mailing-list-archive.369.s1.nabble.com/Graph-Analytics-on-HBase-With-HGraphDB-and-Apache-Flink-Gelly-tp14500p19351.html
Have you checked janusgraph source code , it used also hbase as a storage backend:
It combines it with elasticsearch for indexing. Maybe you can inspire from the architecture there.
Generally, hbase it depends a lot on how the data is written to regions, the order of data and the right key (-> this has then impact on how it is read, also in flink to use locality). There is of course more detail on that and depends on the use case. Generally the hbase documentation is rather good.
Restarting this thread since it is relevant to us. We are thinking of usingHBase/Cassandra to store graph data and then load the data from here intoFlink/Gelly. One of the issues we are concerned about is the readperformance. So far we tried our tests with data residing on HDFS and thatworked fine. Is there any guidance on reading from HBase for batch jobs ? Wondering ifany experience with this approach. Do's/Don'ts etc..Thanks--Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/