May I ask the author, after the training of a2c algorithm, shouldn't the randomness be eliminated when testing?