Just in case you're not aware, audio-to-midi convertors are very useful for making new VSQxs. I use SynthV's built in 'extract notes from audio' tool, but there are other options available (fun fact: Melodyne also has this). You need only to plug the singing audio file into the software, and it will extract the correct notes and timing as a file you can start with. However, it will struggle if you have voices singing different notes in the background.
If no vocal track is available, there are stem separators you can use to extract the vocals from the instrumental as a separate mp3. There are also tools in DAWs and online to identify the BPM of a track, so you know where to start with your project settings.
I don't have the musical ear to transcribe songs by listening, so that's the method I use.

If it's helpful, I can try this method for you, and get a rough base you can compare with your current file.