How many GPUs are needed to train Qwen-2.5-7B, and how long does training take?