These are the problem statements that were solved with MapReduce jobs in Hadoop, written in Java.
Create a data folder inside the part b folder, download the dataset, and put the downloaded folder inside the data folder. Then move into the q4 and q5 folders and follow the instructions file in each.
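
The folder setup above can also be checked with a small Java helper before running the jobs. This is only a sketch: the class name DataFolderSetup and its methods are hypothetical and not part of this repository, and it assumes the path to the part b folder is passed as the first argument (defaulting to the current directory). It does not download the dataset; it only creates the data folder and reports whether anything has been placed in it.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

// Hypothetical helper, not part of the repository.
public class DataFolderSetup {

    // Creates <partBDir>/data if it does not exist yet and returns its path.
    public static Path ensureDataDir(Path partBDir) throws IOException {
        Path dataDir = partBDir.resolve("data");
        return Files.createDirectories(dataDir);
    }

    // Returns true if the data folder contains at least one entry,
    // for example the downloaded dataset folder.
    public static boolean hasDataset(Path dataDir) throws IOException {
        if (!Files.isDirectory(dataDir)) {
            return false;
        }
        try (Stream<Path> entries = Files.list(dataDir)) {
            return entries.findAny().isPresent();
        }
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the first argument is the path to the part b folder.
        Path partB = Paths.get(args.length > 0 ? args[0] : ".");
        Path dataDir = ensureDataDir(partB);
        System.out.println("data folder: " + dataDir.toAbsolutePath());
        if (hasDataset(dataDir)) {
            System.out.println("dataset folder found");
        } else {
            System.out.println("data folder is empty; put the downloaded dataset folder here");
        }
    }
}
```

Compile it with javac DataFolderSetup.java and run it with java DataFolderSetup followed by the path to the part b folder.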