Category Archives: HBase

Apache HBase is Everywhere

Categories: Community Events HBase

For Cloudera, Apache HBase has grown into a stable, scalable, mature, and critical component of the Apache Hadoop stack.  

HBase adds the ability to do low-latency random reads and writes across your big data. While it is a key piece of the Apache Hadoop ecosystem, HBase itself has an ecosystem of projects and products that use it as a storage engine for systems such as time-series databases (OpenTSDB) or SQL-style databases (Apache Phoenix,

Read More

Inside Santander’s Near Real-Time Data Ingest Architecture (Part 2)

Categories: HBase Kafka Use Case

Thanks to Pedro Boado and Abel Fernandez Alfonso from Santander’s engineering team for their collaboration on this post about how Santander UK is using Apache HBase as a near real-time serving engine to power its innovative Spendlytics app.

The Spendlytics iOS app is designed to help Santander’s personal debit and credit-card customers keep on top of their spending, including payments made via Apple Pay. It uses real-time transaction data to enable customers to analyze their card spend across time periods (weekly,

Read More

How-to: Improve Apache HBase Performance via Data Serialization with Apache Avro

Categories: Avro HBase Performance

Taking a thoughtful approach to data serialization can achieve significant performance improvements for HBase deployments.

Whether to use tall or wide tables is a commonly discussed design question in Apache HBase (see reference here and here). However, there is more to consider than that simple choice. Because HBase stores each column value of a table as an independent key-value entry in the underlying HFiles, with each entry carrying its own copy of the row key, column family, qualifier, and timestamp, significant storage overhead can occur when storing small pieces of information.
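To make that overhead concrete, here is a rough back-of-the-envelope sketch in Python. It assumes the basic uncompressed KeyValue layout (4-byte key length, 4-byte value length, 2-byte row length, 1-byte family length, 8-byte timestamp, 1-byte type) and ignores tags, MVCC metadata, compression, and data block encoding, all of which change the real on-disk numbers. The function names are hypothetical, and the "packed" case simply concatenates raw values as a stand-in for a serialized record such as an Avro one; it is not how Avro actually encodes data.

```python
# Fixed bytes per KeyValue under the assumed layout:
# key length (4) + value length (4) + row length (2)
# + family length (1) + timestamp (8) + type (1)
FIXED_OVERHEAD = 4 + 4 + 2 + 1 + 8 + 1


def keyvalue_size(row: bytes, family: bytes, qualifier: bytes, value: bytes) -> int:
    """Approximate size in bytes of one cell stored as a KeyValue."""
    return FIXED_OVERHEAD + len(row) + len(family) + len(qualifier) + len(value)


def wide_row_size(row: bytes, family: bytes, columns: dict) -> int:
    """Size when every column value is stored as its own cell."""
    return sum(keyvalue_size(row, family, q, v) for q, v in columns.items())


def packed_row_size(row: bytes, family: bytes, qualifier: bytes, columns: dict) -> int:
    """Size when all values are packed into a single cell.

    The payload here is just the concatenated raw values, a simplification
    standing in for a real serialization format.
    """
    payload = b"".join(columns.values())
    return keyvalue_size(row, family, qualifier, payload)


if __name__ == "__main__":
    row_key = b"k" * 16  # hypothetical 16-byte row key
    family = b"d"
    # Ten small columns: 4-byte qualifiers, 4-byte values
    columns = {f"c{i:03d}".encode(): i.to_bytes(4, "big") for i in range(10)}

    print("one cell per column:", wide_row_size(row_key, family, columns))
    print("single packed cell: ", packed_row_size(row_key, family, b"r", columns))
```

Under these assumptions, the ten 4-byte values cost 450 bytes when stored as separate cells and 78 bytes when packed into one cell, because the row key and fixed fields are repeated only once.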

Read More

HBaseCon 2016 Speaker Lineup Announced

Categories: Community Events HBase

The speaker lineup for the fifth annual edition of HBaseCon reflects an amazing diversity of production deployments.

The organizers of HBaseCon, the conference for the Apache HBase community, have published the agenda for the conference (May 24, 2016, in San Francisco), and once again the impressive geographical and use-case diversity of HBase is on full display.

Keynotes include:

  • “State of Apache HBase” – Apache HBase PMC
  • “Facebook’s Return to (Real) Open Source” – 

Read More

HBaseCon 2016 in Full Effect: Call for Papers and Early Registration

Categories: Community Events HBase

HBaseCon 2016 will occur on May 24, 2016, at The Village in San Francisco.

HBaseCon is back, and CfP and Early Bird registration are both open for business.

Now in its fifth year, HBaseCon is the premier community event for Apache HBase contributors, developers, admins, and users of all skill levels. The event is hosted and organized by Cloudera, with a Program Committee reflecting a cross-section of the HBase community (including employees of Bloomberg LP,

Read More