2024

Distributed MapReduce Framework

Distributed Systems, Data Processing

A full Python implementation of a distributed MapReduce system with socket-based job coordination, configurable job submission, and parallel worker execution.
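
As a rough illustration of what socket-based job coordination with JSON messages could look like, here is a minimal sketch using only the standard library. The message fields (`message_type`, `input_directory`, `num_mappers`, `num_reducers`) and the helper names are hypothetical and are not taken from the project itself. A `socket.socketpair()` stands in for a real client-to-manager TCP connection so the example runs in a single process.

```python
import json
import socket
import threading


def send_message(sock, message):
    """Serialize a dict as JSON, send it, then close the write side."""
    sock.sendall(json.dumps(message).encode("utf-8"))
    sock.shutdown(socket.SHUT_WR)  # signals end-of-message to the reader


def recv_message(sock):
    """Read until the peer closes its write side, then decode the JSON."""
    chunks = []
    while True:
        data = sock.recv(4096)
        if not data:
            break
        chunks.append(data)
    return json.loads(b"".join(chunks).decode("utf-8"))


def demo():
    # Two connected sockets in one process; a real deployment would use
    # a listening TCP socket on the manager and connect() on the client.
    client, manager = socket.socketpair()
    received = {}

    def manager_side():
        # The "manager" thread receives and parses one job submission.
        received.update(recv_message(manager))

    worker = threading.Thread(target=manager_side)
    worker.start()

    # Hypothetical job-submission payload (field names are assumptions).
    job = {
        "message_type": "new_job",
        "input_directory": "input/",
        "num_mappers": 2,
        "num_reducers": 1,
    }
    send_message(client, job)

    worker.join()
    client.close()
    manager.close()
    return received
```

Calling `demo()` returns the dictionary as parsed on the receiving side. Closing the write side marks the end of a message here only because each connection carries a single message; a long-lived connection would need explicit framing, such as a length prefix or newline-delimited JSON.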

Language:

  • Python

Libraries:

  • Click

  • socket

  • json

  • threading

Platforms:

  • AWS EC2

  • Hadoop

README.md