Author Archives: Sunil Sitaula

Apache Hadoop for Archiving Email – Part 2

Categories: General Hadoop HDFS Use Case

Part 1 of this post covered how to convert and store email messages for archival purposes using Apache Hadoop, and outlined how to perform a rudimentary search through those archives. But, let’s face it: for search to be of any real value, you need robust features and a fast response time. To accomplish this we use Solr/Lucene-type indexing capabilities on top of HDFS and MapReduce.

Before getting into indexing within Hadoop,

Read More