I recently started working on a book review for school. The book was only available from my university's library as an eBook, which is unfortunate because I prefer reading physical copies. Nevertheless, I read it last week, and yesterday I typed in all the points I wanted to discuss in my review.
This morning I began researching the author to learn more about their qualifications. However, while doing so, I discovered why the book was only available electronically: the publisher had recalled and destroyed all unsold physical copies. The reason for this was because large swaths had been plagiarized from other sources.
Ugh. I may have to read another book for my review.
Moral of the story: Remember to check your sources!
And as is usual in plagiarism cases, this guy had a history. I wonder if his PhD was revoked?
I just discovered a cure for SynthV Studio’s harshness. (Or at least a cure for Saki AI, but I suspect that this works for all SynthV Studio voicebanks, including those running in the original SynthV, which also exhibited harshness.)
Normally I use Z-Noise to try and reduce some of SynthV’s harshness, but today I got thinking. Does Z-Noise work better at higher sample rates? So instead of exporting Saki AI at 44.1 kHz, I decided to try 88.2 kHz—except there was no such option. All I could choose from was 44.1, 48, and 96 kHz. So I gave 96 kHz a spin.
But after doing so, and even before I engaged Z-Noise, I noticed how much cleaner Saki AI had become. The harshness had disappeared without the need for extra post-processing!
Owing to the lack of an 88.2 kHz option, I then wondered if SynthV preferred multiples of 48 kHz, so I tried exporting at that sample rate. But when doing so, I noticed that SynthV did not re-render the audio before saving. In fact, once the vocal track had been internally rendered, it could be saved as 44.1, 48, and 96 kHz in rapid succession without any extra CPU grinding. This led me to wonder if SynthV is always running internally at 96 kHz, and if SynthV’s harshness is caused by poor sample-rate conversion.
A quick listen to the 48 kHz export option seemed to confirm this, as it sounded as nearly harsh as the 44.1 kHz export. So I tried down-sampling the 96 kHz export to 44.1 kHz using Audacity and compared it to the SynthV native 44.1 kHz export. The difference was not subtle, with the 44.1 kHz Audacity output sounding essentially identical to the 96 kHz source.
If you are a SynthV user, I recommend exporting at 96 kHz and then down-converting to 44.1 kHz in either Audacity, your DAW, or whatever. Just avoid exporting natively at 44.1 or 48 kHz and you’ll have much less harsh sounding vocals. The difference is very much audible.
And yes, sample-rate conversion (SRC) does vary from software to software, as evidenced by the data presented at this website:
Welp, here's the opening two minutes of my first Saki AI song:
This was a lyrics-first song, and りくりくりさん give me more lyrics than I could handle. It's not that there's a lot of lines, but each line is very lengthy and there's no pattern to the syllable count. Oh well, I did what I could to fit the music to his lyrics, and there's even some word painting.
But the whole thing is a bit like a sumo match—long moments of inactivity followed by a few brief moments of action. The full song is nearly five minutes long, and only the last 20 seconds are interesting. (Not included in the above excerpt.)
Does anyone know if it's possible to keep Synth V Studio Pro's transport controls visible at all times? It's really annoying how it keeps disappearing when doing certain functions, despite the available screen space remaining essentially the same.
Well... This doesn't really belong on Vocaverse, but here's my latest video:
For the record, it did begin life as an Eleanor Forte song, but I decided to swap SynthV for one of those oLd SkOoL singers that require food and water. Sorry...
Unlike TDR's Kotelnikov, which aims for transparent compression, Molotok goes for colouration. I haven't had much chance to play with it, but when it's compressing really heavily the resultant distortion is super nice.
I stumbled across something that may prove useful to all you vocal-synth users: Reaper has a JS Plugin called "Transient Controller," which is explained in this video:
However, while experimenting with "Transient Controller" on drums, I also discovered that it is useful for reducing vocal-synth plosives. All you have to do is reduce the "Attack (%)" slider to where you like it and all T, K, etc. sounds become less annoying.
If you don't use Reaper, I believe it is also available in the ReaJS plugin that is part of the ReaPlugs VST package: