The compactions model is changing drastically with CDH 5/HBase 0.96. Here’s what you need to know. Apache HBase is a distributed data store based upon a log-structured merge tree, so optimal read performance would come from having only one file per store (Column Family). However, that ideal isn’t possible during periods of heavy incoming writes. […]