Tag Archives: Careers

The Value of Certification

Categories: Careers Training

Each year in early November, my inbox fills up with people asking advice about certification. Some are reflecting on their careers and looking to move on or move up; others have given themselves or their managers the goal of getting certified this year. They awake one morning in early November and realize the clock is ticking.

The first thing they ask for is a discount, of course. Beyond that, they want to know what a certification is going to do for them more generally,

Read more

Caching in Apache HBase: SlabCache

Categories: Careers Community General HBase

This was my summer internship project at Cloudera, and I’m very thankful for the level of support and mentorship I’ve received from the Apache HBase community. I started off in June with a very limited knowledge of both HBase and distributed systems in general, and by September, managed to get this patch committed to HBase trunk. I couldn’t have done this without a phenomenal amount of help from Cloudera and the greater HBase community.

Background

The amount of memory available on a commodity server has increased drastically in tune with Moore’s law.

Read more

My Internship at Cloudera

Categories: Careers Community

David joined us as part of our intern program, and built the prototype for the distributed log search functionality that’s available as part of Cloudera Manager 3.7. He did an awesome job, and wrote the following blog post which, now that CM3.7 has been released, we’re pleased to publish.

The project

My intern project was to build a log searching tool, specialized for Apache Hadoop. My mini-app allows Hadoop cluster admins and operators to search their error logs across many machines,

Read more

Avoiding Full GCs in Apache HBase with MemStore-Local Allocation Buffers: Part 3

Categories: General HBase

This is the third and final post in a series detailing a recent improvement in Apache HBase that helps to reduce the frequency of garbage collection pauses. Be sure you’ve read part 1 and part 2 before continuing on to this post.

Recap

It’s been a few days since the first two posts, so let’s start with a quick refresher. In the first post we discussed Java garbage collection algorithms in general and explained that the problem of lengthy pauses in HBase has only gotten worse over time as heap sizes have grown.

Read more