Apache Iceberg Archives | Cloudera Blog

December 3, 2024 | Business

Secure Data Sharing and Interoperability Powered by Iceberg REST Catalog

Many enterprises have heterogeneous data platforms and technology stacks across different business units or data domains. For decades, they have been struggling with scale, speed, and correctness required to derive timely, meaningful, and actionable insights from vast and diverse big data environments. Despite various architectural patterns and paradigms, they still end up with perpetual “data […]

by Navita Sood , Vincent Kulandaisamy , Jonathan Ingalls , Tamara Astakhova , Naveen Gangam , Bill Zhang , Ramesh Mani 7 min read

Apache Iceberg Data Lakehouse

November 15, 2024 | Technical

Empower Your Cyber Defenders with Real-Time Analytics

Today, cyber defenders face an unprecedented set of challenges as they work to secure and protect their organizations. In fact, according to the Identity Theft Resource Center (ITRC) Annual Data Breach Report, there were 2,365 cyber attacks in 2023 with more than 300 million victims, and a 72% increase in data breaches since 2021. The […]

by Carolyn Duby 4 min read

Apache Iceberg AI DataFlow Streaming Public Sector Data Lakehouse Security, Risk, & Compliance

October 10, 2024 | Technical

Cloudera Lakehouse Optimizer Makes it Easier Than Ever to Deliver High-Performance Iceberg Tables

The open data lakehouse is quickly becoming the standard architecture for unified multifunction analytics on large volumes of data. It combines the flexibility and scalability of data lake storage with the data analytics, data governance, and data management functionality of the data warehouse. Open table formats are a key component of this architecture, as they […]

by Bill Zhang 3 min read

Apache Iceberg Data Lakehouse

June 7, 2024 | Business

Databricks Follows Cloudera by Adopting Iceberg, While Snowflake Mulls Open Source Approach

A constant flow of breaking news from the data lakehouse space is making notable tech headlines this week. On Tuesday, Databricks announced that it will acquire Tabular, a data management company founded by the creators of Apache Iceberg, Ryan Blue, Daniel Weeks, and Jason Reidfor. The deal was for an unconfirmed sum, but some reports […]

by Venkat Rajaji 3 min read

Apache Iceberg Data Lakehouse

October 16, 2023 | Technical

Getting Started With Cloudera Open Data Lakehouse on Private Cloud

Part 1: Streaming Data Ingestion

by Bill Zhang , Pierre Villard , Jonathan Ingalls 4 min read

Apache Iceberg Private Cloud

July 14, 2023 | Technical

From Hive Tables to Iceberg Tables: Hassle-Free

Introduction For more than a decade now, the Hive table format has been a ubiquitous presence in the big data ecosystem, managing petabytes of data with remarkable efficiency and scale. But as the data volumes, data variety, and data usage grows, users face many challenges when using Hive tables because of its antiquated directory-based table […]

by Srinivas Rishindra Pothireddi 9 min read

Apache Iceberg

July 13, 2023 | Technical

12 Times Faster Query Planning With Iceberg Manifest Caching in Impala

Iceberg is an emerging open-table format designed for large analytic workloads. The Apache Iceberg project continues developing an implementation of Iceberg specification in the form of Java Library. Several compute engines such as Impala, Hive, Spark, and Trino have supported querying data in Iceberg table format by adopting this Java Library provided by the Apache […]

by Riza Suminto 7 min read

Apache Iceberg Apache Impala Data Warehouse Public Cloud Performance

April 3, 2023 | Technical

Open Data Lakehouse powered by Iceberg for all your Data Warehouse needs

Since we announced the general availability of Apache Iceberg in Cloudera Data Platform (CDP), we are excited to see customers testing their analytic workloads on Iceberg. We are also receiving several requests to share more details on how key data services in CDP, such as Cloudera Data Warehousing (CDW), Cloudera Data Engineering (CDE), Cloudera Machine […]

by Zoltán Borók-Nagy , Ayush Saxena , Tamas Mate , Simhadri Govindappa 8 min read

Apache Iceberg Data Warehouse Data Ingestion

September 7, 2022 | Business

Large Scale Industrialization Key to Open Source Innovation

Cloudera’s open source licensing policies have evolved with the changing dynamics in open source innovation. For more information on Cloudera’s current policy, please contact OSSQuestions@cloudera.com. We are now well into 2022 and the megatrends that drove the last decade in data—The Apache Software Foundation as a primary innovation vehicle for big data, the arrival of […]

by Cloudera 5 min read

Apache Atlas Apache Hadoop Apache Iceberg Apache Kafka Apache Ozone Apache Ranger Apache Spark Apache Yunikorn

Filter By