I would like to ask how I can control the genre of the generated music.
I noticed that the paper says the instrument and genre can be controlled by adding condition tokens before the input sequence. The training and inference code also uses initial tokens.
However, the genre in the initial tokens is always "5", which maps to "pop". I tried modifying the genre and instrument tokens during training, but it did not work well.
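
To make the question concrete, here is a minimal sketch of what I mean by changing the initial tokens. It is not the repository's actual code: the function names, the instrument ids, and every genre id other than 5 ("pop") are hypothetical placeholders I made up for illustration.

```python
from typing import List, Sequence

# Only 5 -> "pop" is taken from the existing initial tokens.
# The "rock" entry is a hypothetical id used purely for illustration.
GENRE_TO_ID = {"pop": 5, "rock": 6}


def build_initial_tokens(genre: str, instrument_ids: Sequence[int]) -> List[int]:
    """Return the condition prefix: genre token first, then instrument tokens."""
    if genre not in GENRE_TO_ID:
        raise KeyError(f"unknown genre: {genre}")
    return [GENRE_TO_ID[genre], *instrument_ids]


def prepend_conditions(
    sequence: Sequence[int], genre: str, instrument_ids: Sequence[int]
) -> List[int]:
    """Place the condition tokens before the music token sequence."""
    return build_initial_tokens(genre, instrument_ids) + list(sequence)


# Example: swap the fixed "pop" prefix for another genre.
conditioned = prepend_conditions([10, 11, 12], "rock", [1, 2])
```

If the model only ever saw genre token 5 during training, I would expect other genre tokens to have essentially untrained embeddings, which might explain why changing them has little effect. Is that the case here?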