Note that there are some explanatory texts on larger screens.

plurals
  1. POHBase Key-Value Compression?
    primarykey
    data
    text
    <p>Thanks for taking interest in my question. Before I begin, I'd like to let you know that I'm very new to Hadoop &amp; HBase. So far, I find Hadoop very interesting and would like to contribute more in the future.</p> <p>I'm primarily interested in improving performance of HBase. To do so, I had modified <code>Writer</code> methods in HBase's <code>/io/hfile/Hfile.java</code> in a way that it does high-speed buffered data assembly and then directly write to Hadoop so that it can later be loaded by HBase.</p> <p>Now, I'm trying to come up with a way to compress key-value pairs so that bandwidth could be saved. I've done a lot of research to figure out how; and then realized that HBase has built-in compression libraries.</p> <p>I'm currently looking at SequenceFile <strong>(1)</strong>; setCompressMapOutput <strong>(2)</strong> (deprecated); and Class Compression <strong>(3)</strong>. I also found a <a href="http://hadoop.apache.org/common/docs/current/mapred_tutorial.html#Data+Compression" rel="nofollow">tutorial</a> on Apache's MapReduce. </p> <p>Could someone explain what "SequenceFile" is, and how I can implement those compression libraries and algorithms? These different classes and documents are so confusing to me.</p> <p>I'd sincerely appreciate your help.</p> <p>--</p> <p>Hyperlinks:</p> <p>(1): hadoop.apache.org/common/docs/current/api/org/apache/hadoop/io/SequenceFile.html</p> <p>(2): hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapred/JobConf.html#setCompressMapOutput%28boolean%29</p> <p>(3): www.apache.org/dist/hbase/docs/apidocs/org/apache/hadoop/hbase/io/hfile/Compression.html</p>
    singulars
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload