YuniKorn: a universal resource scheduler

Categories: Cloud Hadoop YARN

Hello world, it’s been a while!

We are super excited today to announce the open-sourcing of one of the exciting new projects we’ve been working behind the scenes at the intersection of big-data and computation platforms – YuniKorn!

Yunikorn is a new standalone universal resource-scheduler responsible for allocating/managing resources for big-data workloads including batch jobs and long-running services.

Let’s dive right in!

Introduction

YuniKorn is a light-weight,

Read more

Diagnostic Data Processing on Cloudera Altus

Categories: Altus Cloud

Introduction

Many of Cloudera’s customers set up Cloudera Manager to collect their clusters’ diagnostic data on a regular schedule and automatically send that data to Cloudera. Cloudera analyzes this data, identifies potential problems in the customer’s environment, and alerts customers, requiring fewer back-and-forths with our customers when they file a support case and provides Cloudera with critical information to improve future versions of all of Cloudera’s software. If Cloudera discovers a serious issue, Cloudera searches this diagnostic data and proactively notifies Cloudera customers who might encounter problems due to the issue.

Read more

Best Practices Guide for Systems Security Services Daemon Configuration and Installation – Part 1

Categories: Platform Security & Cybersecurity

Background

Authentication is a basic security requirement for any computing environment. In simple terms, users and services must prove their identity (authenticate) to the system before they can use system features. Kerberos provides strong authentication which is used in the exchange between requesting user or process and service during authentication. When a user authenticates to a particular Hadoop component, the user’s Kerberos principal is presented. The principal is presented in the form user@REALM. The Kerberos principal is mapped [1] to a short name after authentication.

Read more

Cloudera Fast Forward Labs Quarterly Updates – July 2019

Categories: Fast Forward Labs

Cloudera Fast Forward Labs is an applied machine learning research and consulting services group within Cloudera, which helps enterprises accelerate data value creation through the adoption of emerging ML techniques, cutting-edge technical architectures and industry leading ML best practices. We focus on expert knowledge transfer and skills development, empowering organizations to continually evolve, differentiate themselves and ultimately own the future of their business by leveraging open technologies and data. Enabling ethical and responsible ML outcomes to our customers,

Read more

Cloudera at ACM SIGMOD/PODS 2019

Categories: Events Hive

Sigmod conf 2019

The annual ACM SIGMOD/PODS Conference is a leading international forum for database researchers, practitioners, developers, and users to explore cutting-edge ideas and results, and to exchange techniques, tools, and experiences. This year ACM SIGMOD/PODS will be held in Amsterdam, The Netherlands on June 30th – July 5th, 2019, and Cloudera will be present in the conference, contributing to and learning from the broader research community.

Last year,

Read more