I wish they'd stop using the tension and breathiness parameters... They don't sound good, lol.
OOOOOOOH! a chance for me to rant about one of my few niche issues with SV!
i think it's crazy how there are so many features which rely on variations in the vb's actual vocal data... but then things like breathiness and tension, which realistically CAN be enhanced through using the actual tension and breath data from the dataset, just aren't. obviously i don't know the technical side of things, but this is actually how diffsinger handles those parameters and, when it works well, it's MUCH better - say, extracting the actual breathy or powerful tones from each individual VM for a more realistic tension effect.
i say 'when it works well' because... the reason they may not have added it is because i know diffsinger's tension can actually also cause a lot of problems, and require a bit of trial and error for the best output - and i know that's a bit of a staple with AI in general, but probably wouldn't be such a great look for a commercial product such as SV. still, i think something along these lines would go a long way if implemented well, because i've always found the current parameters to be a bit lacking in how they clearly still appear to be simulated. of course a VM could be used to the same effect, but when used in tandem it would be much, much better.