Shashank Naik, Author at Cloudera Blog

May 9, 2019 | Technical

Small Files, Big Foils: Addressing the Associated Metadata and Application Challenges

Small files are a common challenge in the Apache Hadoop world, and when not handled with care they can lead to a number of complications. The Apache Hadoop Distributed File System (HDFS) was developed to store and process large data sets in the range of terabytes to petabytes. However, HDFS stores small files inefficiently, leading […]

by Shashank Naik, Bhagya Gummalla | 11 min read

Apache Hadoop | Apache HDFS
