I've been using her for a week and I can say with certainty that all these problems are very real and apparent, on top of these issues there seems to be dictionary issues? when singing do re mi fa so la ti do she sings do re me sa so la ti do? I think 200 for her is a bit... questionable. She's not versatile enough to justify it in my opinion.Anyone else kind of disappointed hearing that CeVIO AI has problems with Yukari? It makes me worried the other voice banks will have the same problems.
I read this blog by kM4osM (pronounced kurosu) where he explained common problems and (hopefully) how to fix them:
CeVIO AIの出音がなんか変だなーって思ったときに取る手段
CeVIO AI発売から約1週間ですね。みなさん麗さん使ってますか?このごろ、私のTwitterTLでは「麗さんはじゃじゃ馬」っていう話を聞くようになってきました。VOCALOIDや無印CeVIOを使ってた人の中にもCeVIO AIには手こkm4osm.com
Apparently long notes have pronunciation issues (singing turning muffled, becoming mispronounced, going off pitch, becoming quiet and loud again), which reminded me how we talked about KAFU having weird pronunciation/having to look at the lyrics, so i'm suspicious this will be a "CeVIO AI" problem. And I heard people saying that Yukari sounded best singing slow songs compared to fast songs, so uhh, sounds like Yukari working best with long notes will definitely make these problems crop up.
I've heard from 3 popular people in the fandom who own many synths from every engine that I trust say that Yukari goes off of pitch, which makes me worried because I do not have that good of an ear and having to manually fix a vocal synth which should be able to sing on better pitch than a human does not instill confidence. kM4osM said that if a note goes sharp you can literally move notes up/down to the neighboring note to compensate, change note length, try stuff like a "shi" note followed by "i" (same vowel), stick in breaths or slurs, try to put the funky notes on their own track, try using -, change location of consonants, use a っ, stick on ※ after a vowel to turn it into falsetto, add a bunch of the same vowel (ex: "aaaa" turns it into "a---"), pick a consonant + vowel that's sort of similar sounding instead, break apart contracted sounds (ex: "nya" into "ni + ya"), change TMG or ALP.
What do you all think? Personally, at the $200ish price tag, it seems troublesome to have those problems.
which reminded me how we talked about KAFU having weird pronunciation/having to look at the lyrics, so i'm suspicious this will be a "CeVIO AI" problem.
It really reminds me of that one earlier Kafu demo we've heard where we noted she sounds extremely pitchy.I've heard from 3 popular people in the fandom who own many synths from every engine that I trust say that Yukari goes off of pitch,
The problem is that Techno-Speech themselves do the programming for the AI, you only need to provide them with hours of singing data and they do all the work themselves (even the phoneme labeling), so technically any "bad voice banks" would be mostly Techno-Speech's fault other than maybe blaming the voice provider for singing off key or something that was used for deep learning data. S:Hopefully it's just a thing Cevio AI is prone to and only bad Cevio AI vbs take on these characteristics
I initially looked at it as like "editor quirk" just like how Vocaloid tends to make voices more nasally and depending on a vb the nasally-ness is more or less severe but you're correct, it's just bad programming.so technically any "bad voice banks" would be mostly Techno-Speech's fault
I was gonna mention the new Police Piccadilly song here - very impressed with how Kafu sounds in that song, even though she sounds a bit muffled on the higher notes. (Though at this point, considering the issues surrounding Yukari Rei, I'm not sure if it's the VB or CeVIO AI to blame for the muffled sound.) I still also have that nagging feeling in my mind that the song could sound even better if Kafu was recorded at Kaf's natural pitch, but I'm starting to get over it now lol.Speaking of CeVIO though, KAFU's new demo actually sounds really good. Sounds like Police Piccadilly got to tune her too.
Techno-Speech required (if I remember correctly, don't want to dig for my old post with the time) 5 hours total of singing data. So I don't think it's a lack of singing data. If you compare it to NEUTRINO, I think that only had 1 hour's worth (at least for Kiritan when they released all that deep learning data for anyone to use)? And Synth V's Saki AI only was taught with 5 songs. So I feel like... maybe it's something Techno-Speech is doing differently than NEUTRINO and Synth V since the results are so drastically different?Part of me thinks it might be the fault of a lack of data but we can't be too sure without knowing exactly what they put in to her...
Seems like it's just VOCALOMARKETS messing up labelling then... Considering how the other CeVIO AI voices sound I don't think much blame can be put on Techno-Speech for Yukari considering she also has dictionary problems. With regards to Saki AI, I've been reliably informed the 5 songs is for her pitch model (the auto-tuning function) and she was trained on more than that for her actual database, though still much less than CeVIO AI. Just hoping they can do something to fix Yukari because I don't think anyone wants to pay $200 to use a standalone library like that.Techno-Speech required (if I remember correctly, don't want to dig for my old post with the time) 5 hours total of singing data. So I don't think it's a lack of singing data. If you compare it to NEUTRINO, I think that only had 1 hour's worth (at least for Kiritan when they released all that deep learning data for anyone to use)? And Synth V's Saki AI only was taught with 5 songs. So I feel like... maybe it's something Techno-Speech is doing differently than NEUTRINO and Synth V since the results are so drastically different?
Is the person on the left of IA is that "Project H"?View attachment 4194
And who are those furry Lion and Deer guy behind them? Please tell me they are just back up dancers/Characters like that Pheonix and Cat human thing during the Aria Musical.