We should figure out why MPI is slow here and fix it. https://github.com/intel-ai/timedf_benchmarks/blob/b092ea0d490eb630224fc4ffdbc2f62630f57e49/timedf_benchmarks/hm_fashion_recs/fe.py#L159