Data Engineering Archives

February 12, 2024 | Technical

DNS Zone Setup Best Practices on Azure

Deep dive into using DNS with Cloudera Data Services on Azure

by Dongkai Yu 7 min read

November 7, 2023 | Technical

Apache Ozone – A Multi-Protocol Aware Storage System

Bucket Layouts in Apache Ozone

by Saketa Chandra Chalamchala, Ethan Rose 5 min read

CDP Private Cloud Cloudera Data Platform (CDP) Data Engineering Data Warehouse Machine Learning Data Ingestion

April 20, 2023 | Technical

Using Dead Letter Queues with SQL Stream Builder

What is a dead letter queue (DLQ)? Cloudera SQL Stream Builder gives non-technical users the power of a unified stream processing engine so they can integrate, aggregate, query, and analyze both streaming and batch data sources in a single SQL interface. This allows business users to define events of interest for which they need to […]

by Cloudera 4 min read

CDP Private Cloud CDP Public Cloud Cloudera Data Science Workbench Data Engineering Data Science Streaming

March 27, 2023 | Business

Trusted Data: Alchemy For Misinformation

CDO Spotlight

by Shayde Christian 2 min read

Data Engineering Customer Analytics Governance

March 23, 2023 | Technical

Materialized Views in SQL Stream Builder

What are materialized views and how to configure them

by Cloudera 7 min read

CDP Private Cloud CDP Public Cloud Cloudera Data Platform (CDP) Cloudera Data Science Workbench Data Engineering Data Hub Data Science Streaming

February 22, 2023 | Business

Implementing and Using UDFs in Cloudera SQL Stream Builder

Developing and using custom User Defined Functions on SSB

by Cloudera 5 min read

CDP Private Cloud CDP Public Cloud Cloudera Data Science Workbench Data Engineering Data Hub Data Science Streaming

February 9, 2023 | Technical

Job Notifications in SQL Stream Builder

Special co-author credits: Adam Andras Toth, Software Engineer Intern. With enterprises’ needs for data analytics and processing getting more complex by the day, Cloudera aims to keep up with these needs, offering constantly evolving, cutting-edge solutions to all your data-related problems. Cloudera Stream Processing aims to take real-time data analytics to the next level. […]

by Botond Kismoni 5 min read

Cloudera Data Platform (CDP) Cloudera Data Science Workbench Data Engineering Data Warehouse DataFlow

February 8, 2023 | Technical

Spark Technical Debt Deep Dive

A study of the impact of suboptimal Spark code on performance

by François Reynald 10 min read

Apache Spark Data Engineering Performance

December 20, 2022 | Business

Optimizing the Energy Sector with Data Analytics

The move toward renewable energy has a distinct and significant impact on energy generation and distribution that needs to be carefully managed. Efficient use of data will therefore be critical to improving the competitiveness and productivity of assets, both traditional and renewable generation.

by Pablo Boixeda 6 min read

Data Engineering Energy & Utilities Customer Analytics IoT/Connected Products Machine Learning

December 16, 2022 | Business

Cloudera Named a Leader in the 2022 Gartner® Magic Quadrant™ for Cloud Database Management Systems (DBMS)

We are pleased to announce that Cloudera has been named a Leader in the 2022 Gartner® Magic Quadrant™ for Cloud Database Management Systems. Cloudera has been recognized in this cloud DBMS report since its inception in 2020. This year we’ve been named a Leader. This validates our significant momentum in global enterprises. And together, with […]

by David Dichmann, Navita Sood 4 min read

CDP Public Cloud Cloudera Data Platform (CDP) Data Engineering Data Warehouse Machine Learning SDX Technologies Governance Machine Learning Modernize Architecture Performance Security, Risk, & Compliance
