In this project, we want to find a good way of dividing our managers into a specific number of team (parameter k
) and manage them to have responsibility for different areas in an optimized way.
We want to use one of the most reliable clustering algorithms called Kmeans
. Actually, We need to find a way to distribute each team to the proper cluster, in order to make the cost of transportation lower and lower.
But in this case, our main goal is to implement the algorithm from scratch (using scikit-learn
prepared modules are forbidden)
We forced to use just these libraries:
- Pandas - pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language
- Matplotlib - Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python
- Numpy - NumPy brings the computational power of languages like C and Fortran to Python, a language much easier to learn and use
As mentioned in the previous section, you need to install some modules for this project. The modules are referenced to install in the requirements.txt
. After cloning the repository, go to the directory of the repo and use the command below to install the packages:
$ pip install -r requirements.txt
check this for the details of the packages