Author Archives: Hari Shreedharan

About Hari Shreedharan

Hari is a Software Engineer at Cloudera. He is also a committer on Apache Flume.

Streaming Data into Apache HBase using Apache Flume

Categories: Flume Hadoop HBase

The following post was originally published via; we are re-publishing it here.

Apache Flume was conceived as a fault-tolerant ingest system for the Apache Hadoop ecosystem. Flume comes packaged with an HDFS Sink which can be used to write events into HDFS, and two different implementations of HBase sinks to write events into Apache HBase. You can read about the basic architecture of Apache Flume 1.x in this blog post.

Read More

Apache Flume Development Status Update

Categories: Avro Data Ingestion Flume General Hadoop HBase

Apache Flume is a scalable, reliable, fault-tolerant, distributed system designed to collect, transfer, and store massive amounts of event data into HDFS. Apache Flume recently graduated from the Apache Incubator as a Top Level Project at Apache. Flume is designed to send data over multiple hops from the initial source(s) to the final destination(s). Click here for details of the basic architecture of Flume. In this article, we will discuss in detail some new components in Flume 1.x (also known as Flume NG),

Read More