This can lead to OOM error and inefficiencies as reloading model checkpoints takes a significant amount of time. Currently a quick workaround is increasing the chunk size to the dataset size, but I wanted to raise this issue in case anyone else also faces this.