I'd find it very helpful to be able to display the calculated spectrogram in the background. It would definitely help with adjusting the pitches when the automated pitch extraction fails.
As an alternative, I'd propose being able to set my own spectrogram as the background (e.g. I separate the vocals from a song with Spleeter and then plot a spectrogram with Python's scipy/matplotlib).
Of course, in this case we would have to agree on a fixed format, i.e. how many Hz and how many ms each pixel of the image represents.
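
To make the format question more concrete, here is a minimal sketch of how such a pixel scale could be described. It is only an illustration, not a proposed final format: the class `SpectrogramScale` and its fields are hypothetical, and it assumes a linear frequency axis, one image column per STFT frame, one image row per FFT bin, and row 0 at the top of the image.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class SpectrogramScale:
    """Hypothetical description of how image pixels map to time and frequency."""

    sample_rate: int  # audio sample rate in Hz
    n_fft: int        # FFT window size in samples
    hop_length: int   # samples between successive STFT frames

    @property
    def ms_per_pixel(self) -> float:
        # Assumption: one image column per STFT frame.
        return 1000.0 * self.hop_length / self.sample_rate

    @property
    def hz_per_pixel(self) -> float:
        # Assumption: one image row per linearly spaced FFT bin.
        return self.sample_rate / self.n_fft

    @property
    def height(self) -> int:
        # Number of FFT bins from 0 Hz up to and including Nyquist.
        return self.n_fft // 2 + 1

    def pixel_to_time_freq(self, x: int, y: int) -> tuple[float, float]:
        """Map pixel (x, y), with y = 0 at the top row, to (ms, Hz)."""
        bin_index = self.height - 1 - y  # flip so the bottom row is 0 Hz
        return x * self.ms_per_pixel, bin_index * self.hz_per_pixel


# Example values only: 44.1 kHz audio, 2048-sample window, 10 ms hop.
scale = SpectrogramScale(sample_rate=44100, n_fft=2048, hop_length=441)
print(scale.ms_per_pixel, scale.hz_per_pixel)
print(scale.pixel_to_time_freq(100, 1024))
```

With something like this, a user-supplied image would only need to come with three numbers (sample rate, window size, hop length), and the program could work out where each pixel belongs on the time and frequency axes. A logarithmic or mel frequency axis would need a different mapping.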