Apache Hadoop logo

Apache Hadoop

Enables distributed processing of large data sets across clusters of computers.

Made by Apache Software Foundation

  • Developer Tools

  • big-data

  • Web Development

  • distributed-file-system

What is Apache Hadoop?

Apache Hadoop is a versatile open-source software framework that enables reliable, scalable, and distributed computing across large-scale data sets. Designed to handle massive amounts of information, Hadoop empowers applications to process data-intensive workloads efficiently on clusters of commodity hardware

Highlights

  • Distributed Processing: Hadoop allows for the distributed processing of large data sets across clusters of computers, leveraging simple programming models to harness the collective computing power
  • Scalability: The framework can scale up from single servers to thousands of machines, with each node providing local computation and storage capabilities
  • Open-Source: Hadoop is an open-source tool, licensed under the Apache v2 license, with an active community and substantial contributions, as evidenced by its 13.7K GitHub stars and 8.5K GitHub forks.

Platforms

  • Mac
  • Windows
  • Linux

Languages

  • English

Social

Features

    • Distributed Computing