Performance Archives - Page 3 of 9

August 8, 2022 | Technical

How to Use Apache Iceberg in CDP’s Open Lakehouse

In June 2022, Cloudera announced the general availability of Apache Iceberg in the Cloudera Data Platform (CDP). Iceberg is a 100% open table format, developed through the Apache Software Foundation, that helps users avoid vendor lock-in and implement an open lakehouse. The general availability covers Iceberg running within some of the key data services in CDP, […]

by Bill Zhang, Peter Ableda, Shaun Ahmadian, Manish Maheshwari | 7 min read

CDP Public Cloud Cloudera Data Platform (CDP) Data Engineering Data Warehouse Machine Learning SDX Technologies Governance Machine Learning Modernize Architecture Performance Security, Risk, & Compliance

June 30, 2022 | Technical

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Cloudera Technology Spotlight

by Bill Zhang, Shaun Ahmadian, Cloudera Contributors | 5 min read

CDP Public Cloud Cloudera Data Platform (CDP) Data Engineering Data Warehouse Machine Learning SDX Technologies Governance Machine Learning Modernize Architecture Performance Security, Risk, & Compliance

June 17, 2022 | Business

The Future of the Data Lakehouse – Open

Cloudera customers run some of the biggest data lakes on earth. These lakes power mission-critical, large-scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. In recent years, the term “data lakehouse” was coined to describe this architectural pattern of tabular analytics over data in the data lake. […]

by Ram Venkatesh, Priyank Patel | 4 min read

CDP Public Cloud Cloudera Data Platform (CDP) Data Engineering Data Warehouse Machine Learning SDX Technologies Governance Machine Learning Modernize Architecture Performance Security, Risk, & Compliance

April 19, 2022 | Business

From the Ground Up: The Truth About Data Innovation

Data holds incredible untapped potential for Australian organisations across industries, regardless of individual business goals. All organisations are at different points in their data transformation journey, with some achieving success faster than others. To be successful, the use of data insights must become a central lifeforce throughout an organisation and not just reside within […]

by Renee Dvir | 3 min read

Cloudera Data Platform (CDP) Data Engineering Data Warehouse SDX Technologies Customer Analytics Governance IoT/ Connected Products Machine Learning Modernize Architecture Performance Search Security, Risk, & Compliance

April 4, 2022 | Business

Why Can’t we Advance Healthcare and Life Sciences this Fast all the Time?

Embedding Data and Analytics into the DNA of Life Sciences Organizations

by Cindy Maike | 2 min read

CDP Private Cloud CDP Public Cloud Cloudera Data Platform (CDP) Data Warehouse DataFlow SDX Technologies Healthcare & Life Sciences IoT/ Connected Products Machine Learning Modernize Architecture Performance Security, Risk, & Compliance

March 23, 2022 | Technical

5 Reasons to Use Apache Iceberg on Cloudera Data Platform (CDP)

Please join us on March 24 for the Future of Data meetup, where we do a deep dive into Iceberg with CDP. What is Apache Iceberg? Apache Iceberg is a high-performance, open table format, born in the cloud, that scales to petabytes independent of the underlying storage layer and the access engine layer. By being a truly open […]

by Shaun Ahmadian, Luiz Carrossoni Neto | 7 min read

Cloudera Data Platform (CDP) Data Engineering Data Warehouse Data Ingestion Modernize Architecture Ops and DevOps Performance

March 2, 2022 | Technical

Memory Optimizations for Analytic Queries in Cloudera Data Warehouse

Apache Impala is used today by over 1,000 customers to power their analytics in on-premises as well as cloud-based deployments. Large user communities of analysts and developers benefit from Impala’s fast query execution, helping them get their work done more effectively. For these users, performance and concurrency are always top of mind. An important […]

by Justin Hayes | 8 min read

Cloudera Data Platform (CDP) Data Warehouse Customer Analytics Performance

February 22, 2022 | Technical

Introducing Apache Iceberg in Cloudera Data Platform

Over the past decade, the successful deployment of large-scale data platforms at our customers has acted as a big data flywheel, driving demand to bring in even more data, apply more sophisticated analytics, and on-board many new data practitioners, from business analysts to data scientists. This unprecedented level of big data workloads hasn’t come […]

by Bill Zhang, Shaun Ahmadian, Peter Vary, Marton Bod, Wing Yew Poon | 6 min read

CDP Public Cloud Cloudera Data Platform (CDP) Data Engineering Data Warehouse Governance Modernize Architecture Performance Security, Risk, & Compliance

December 21, 2021 | Technical

Cloudera Data Engineering 2021 Year End Review

Since the release of Cloudera Data Engineering (CDE) more than a year ago, our number one goal has been operationalizing Spark pipelines at scale with first-class tooling designed to streamline automation and observability. In working with thousands of customers deploying Spark applications, we saw significant challenges with managing Spark as well as automating, delivering, […]

by Shaun Ahmadian | 6 min read

CDP Public Cloud Cloudera Data Platform (CDP) Data Engineering Ops and DevOps Performance

December 8, 2021 | Technical

Delivering High Performance for Cloudera Data Platform Operational Database (HBase) When Using S3

CDP Operational Database allows developers to use Amazon Simple Storage Service (S3) as its main persistence layer for saving table data.

by Ankit Singhal, Surbhi Kochhar | 7 min read

CDP Public Cloud Operational DB Performance
