Cloudera customers run some of the biggest data lakes on earth. These lakes power mission-critical, large-scale data analytics and AI use cases—including enterprise data warehouses. Nearly two years ago, Cloudera announced the general availability of Apache Iceberg in the Cloudera platform, which helps users avoid vendor lock-in and implement an open lakehouse. With an open […]
The global pandemic has ushered in a new wave of economics. Driven by the rapid convergence of changing circumstances, data, automation and Artificial Intelligence (AI), businesses today have to contend with a whirlwind of internal and external pressures. Companies are experiencing pressure from growing customer demands, and amidst a global talent shortage, a workforce no […]
This blog post was written by Pedro Pereira as a guest author for Cloudera. Right now, someone somewhere is writing the next fake news story or editing a deepfake video. An authoritarian regime is manipulating an artificial intelligence (AI) system to spy on technology users. No matter how good the intentions behind the development of […]
Recommendation systems have become a cornerstone of modern life, spanning sectors that include online retail, music and video streaming, and even content publishing. These systems help us navigate the sheer volume of content on the internet, allowing us to discover what’s interesting or important to us. The classic modeling approaches to recommendation systems can be […]
Since the release of Cloudera Data Visualization (DV) back in Oct 2020, our primary mission has been to expand access to data analytics and predictive insights across enterprise businesses. Since that launch, we’ve worked tirelessly to deliver best-in-class data visualization, dashboarding, and predictive applications capabilities across our cloud and on-premises infrastructures through Cloudera’s machine learning […]
Introduction In our previous blog post in this series, we explored the benefits of using GPUs for data science workflows, and demonstrated how to set up sessions in Cloudera Machine Learning (CML) to access NVIDIA GPUs for accelerating Machine Learning Projects. While the time-saving potential of using GPUs for complex and large tasks is massive, […]
You’ve probably heard it more than once: Machine learning (ML) can take your digital transformation to another level. It’s a pie-in-the-sky statement that sounds great, right? And while you’d be forgiven for thinking that it might sound too good to be true, operational ML is, in fact, achievable and sustainable. You can get the very […]
When working on complex, or rigorous enterprise machine learning projects, Data Scientists and Machine Learning Engineers experience various degrees of processing lag training models at scale. While model training on small data can typically take minutes, doing the same on large volumes of data can take hours or even weeks. To overcome this, practitioners often […]
This is part 4 in this blog series. You can read part 1 here and part 2 here, and watch part 3 here. This blog series follows the manufacturing and operations data lifecycle stages of an electric car manufacturer – typically experienced in large, data-driven manufacturing companies. The first blog introduced a mock vehicle manufacturing […]