i want more web/mobile based synths. The only tunable ones that exist are saki web, voca mobile and ace. And even so, Saki does not work on mobile, voca mobile is hard to get and ace isn't fun to tune on mobile
Oh, I do agree that the noisiness has diminished overtime. It's just something that continues to bug me about CeVIO and stops me loving it as much as Vocaloid and SynthV.Eh, honestly, that’s an incredibly common opinion regarding CeVIO. I don’t think anyone’s denying it’s noisy, it’s popularity simply comes from people finding that they like certain things about it more than they dislike the noisiness. (Unpopular opinion of my own: I thing CeVIO’s noise has gotten a lot less bothersome the further along it’s come. Compare Yukari Rei to Sasara, for example; Sasara and everyone that’s come after her gave noise, but it’s not grating and invasive like Rei’s is.)
I'd be down for this if more companies offered support for browsers other than Chrome. It feels like we're back to the bad old days of Internet Explorer dominating website coding and design, where some sites would treat you like a proper weirdo for using an alternative browser.i want more web/mobile based synths. The only tunable ones that exist are saki web, voca mobile and ace. And even so, Saki does not work on mobile, voca mobile is hard to get and ace isn't fun to tune on mobile
The context it exists in. Ultimately they're both just neutral tools, but AI art generators and the culture around it had malicious intent towards artists (to the point that there are people purposely targeting "No AI" artists). Diff-svc on the other hand, it's still too early to tell, but currently the culture around diff-svc is just people toying around with it and putting their own data into it. In the context of vocal synth fandom, its just another medium to put their own characters in like NNSVS or UTAU. If however people start using diff-svc for malicious reasons, the you can expect people to quickly turn against it like back when people turned against AI art generators when it stopped being a fun thing to toy around with and started becoming malicious towards artists.So it's been about a month since English-speaking vocal synth fans blew up on Mitchie M for posting a silly AI-generated image of Miku playing a saxotrumpet on Twitter, and now English-speaking vocal synth fans are going crazy over Diff-SVC, which is based on the exact same technology as NovelAI (diffusion model machine learning). I'd just like to ask: what exactly makes one of these techniques inherently unethical and bad for human artists everywhere, and what makes the other one just a neat new thing to mess around with?
While yes, they are the same act, there is a vital factor that changes everything about its ethics: consent. AI art generators are automatically trained on all publically available art, regardless of whether the artist wants to participate. All the user has to do is type in the name of an artist, and the technology will replicate their work. This violates just about every principle of ethical labour: the person whose work is being used must give permission and be compensated for their contribution. The vocal version needs that data to be added manually and by choice, which most people are doing with their own voices. Consent changes it from being fun and interesting to violating.So it's been about a month since English-speaking vocal synth fans blew up on Mitchie M for posting a silly AI-generated image of Miku playing a saxotrumpet on Twitter, and now English-speaking vocal synth fans are going crazy over Diff-SVC, which is based on the exact same technology as NovelAI (diffusion model machine learning). I'd just like to ask: what exactly makes one of these techniques inherently unethical and bad for human artists everywhere, and what makes the other one just a neat new thing to mess around with?
For the record, my opinion on both of these tools is that they're ultimately morally neutral by themselves, and it's entirely on the user whether they're utilized for good (shitposting, supplementing "traditional" art, or giving unskilled artists/non-artists a chance to "create" something are all perfectly acceptable use cases to me) or for bad (art theft, stealing job opportunities from human artists, honestly you've all probably heard the arguments against it at this point). And honestly, for all the bad press "AI art" has gotten about its supposed potential harms to society, I feel like giving randos the ability to clone people's voices with relatively little effort is way more risky on its own. Art theft is bad and hurting artists' livelihoods is horrible, but there are so many awful things one could do with voice cloning that I'm struggling to narrow it down to one or two examples for this sentence. Imagine someone using your voice for like, hate speech or something and that permanently affecting your career or social life.
You don't even really have to wait to see this is the thing, you just have to step outside the vocal synth fandom for a bit. I'm pretty sure I've talked before in this very thread about the incredibly negative reaction social media had to Disney using AI to recreate a young Mark Hamill's voice in (I believe?) The Mandalorian. And more recently, I read news about some video game developers using (or just considering using) virtual voice actors for their NPCs, and the reaction I saw to that was even more incensed. Like, everything from a (to be fair, not entirely unfounded) fear of real human voice actors becoming obsolete in the near future, to an almost spiritual revulsion to the very concept of synthesized voice acting; I'm talking Hayao Miyazaki "I strongly feel this is an insult to life itself" kind of stuff....If however people start using diff-svc for malicious reasons, the you can expect people to quickly turn against it like back when people turned against AI art generators when it stopped being a fun thing to toy around with and started becoming malicious towards artists.
i agree on like every point that u've made dwdffdsfxdsfxdsfdxs the people on the blue bird app do criticize cevio way too much when compared to synthv, and tbh there are a lot of reoccurring statements they make that do kind of hint at the fact that they haven't really listened to cevio that much (aka the way people are saying that there are kafu clones on cevio which is like? the only bank that sounds like kafu is rime, and she was defenitly supposed to be like kafu but more mature, which in my opinion she does really well with)I probably sound like a biased Cevio liker at this point but after the amount of takes I’ve read on the engine & its voices (these mostly being on the cursed blue bird app so I shouldn’t even read that deeply into it) I keep wondering if half the ppl who are extremely critical about it even listened to more than half a demo of its voices??? Claims about the Kamitsubaki voices being appends of each other, every other vocal being Sasara appends.. very specifically I never understood the Kafu and Sekai comparisons, or actually I do get it a bit cuz the higher pitch from the base alpha parameter already sounding formant shifted mixed with the current engine noise can make it seem a bit like that in Cevio BUT they still sing in completely different styles… they honestly emulate their respective VPs’ singing style pretty well (and Kaf & Isekaijoucho have pretty distinct voices from each other as well). This aside the Alpha parameter also exists and it’s rly only nerfed for Kafu and increasing it for Sekai makes her sound a hell of a lot closer to Ojou. I also find it a bit funny that a lot of this also comes from people who eat up every single SynthV letter of the week who also have a very apparent same vocal type syndrome going on despite the engine noise being different but anyways I don’t wanna start on this fkdkslfkg
Going back to the singing style point, I think it is the biggest plus Cevio AI has over other current AI singing synths: it does a pretty damn good job at retaining singing styles and quirks from the voices their banks are based on. IA AI may have been clowned a bit on this end of the fandom and her initial showings didn’t do her justice but I still find it impressive how close she can sound to Lia just out of the box… she may be one of the trickiest banks to work with cause her AI tuning can be pretty wild but I also commend how it commits to keeping it close to Lia’s singing style. The other Cevio AI banks I’ve tried aren’t as much of a challenge to work with but I find it really fun to work with these singing styles and tuning them to fit into different songs! The forced auto-tuning may not be for everyone and I understand that, but ever since I’ve read Cevio banks being described as “giving a singer a script while you act as their vocal director” as opposed to being more of instruments you’re in control of like Vocaloid/SynthV with auto tuning off/Utau I kind of get the vision for Cevio AI a lot better now lol. I can see why this can turn ppl off who prefer a more neutral base that other synths can give you to work with, but I honestly find it makes using Cevio AI really fun to use to me like I said earlier and the AI part of it doesn’t really eliminate the user input when it comes to tuning.
Another really weird take I’ve seen is people claiming SynthV not being popular in Japan is due to… xenophobia??? Which I honestly find baffling, please correct me if I’m wrong but these seem like baseless assumptions and I’ll retract this if there’s anything pointing to that. It’s true something like this may have happened with SeeU back in 2011 but I really don’t see any indication that anything similar is happening here. This reads a bit like denial coming from some people about SynthV just not being as popular as engines that they deem as inferior lol. Fact is that marketing is minimal and not that great outside of their Chinese market & there just isn’t much to write home about when it comes to their Japanese output… maybe up until Mai’s release which seems to have attracted quite a bit of attention for once! Which is proof to me that their lack of success so far isn’t derived from prejudice but from there not really being much of an attractive offering or incentives to buy the program up till now.
As impressive as SV can be the sales aren’t gonna form out of thin air if it has no worthwhile exposure Mai is a standout among the alphabet soup and a free voice for SynthV Pro buyers who had a pretty impressive sounding demo that got quite a decent bit of views and the result of that is SV Pro being on top of DLSite sale rankings rn. See how that works!!!!! It’s finally a step in the right direction and hopefully one that they’ll keep up. It doesn’t help that the whole 1.7 fiasco kind of fumbled the bag a bit when it comes to 3rd party and I wonder how sustainable the feature creep every single update can be to get older VBs up to date but hopefully something like the HDVM situation doesn’t happen again…
anyways sorry I went on a bit of a rant here but I also had some of this stuff on my mind for a bit and I wanted to get some of it out
based blackoutI probably sound like a biased Cevio liker at this point but after the amount of takes I’ve read on the engine & its voices (these mostly being on the cursed blue bird app so I shouldn’t even read that deeply into it) I keep wondering if half the ppl who are extremely critical about it even listened to more than half a demo of its voices??? Claims about the Kamitsubaki voices being appends of each other, every other vocal being Sasara appends.. very specifically I never understood the Kafu and Sekai comparisons, or actually I do get it a bit cuz the higher pitch from the base alpha parameter already sounding formant shifted mixed with the current engine noise can make it seem a bit like that in Cevio BUT they still sing in completely different styles… they honestly emulate their respective VPs’ singing style pretty well (and Kaf & Isekaijoucho have pretty distinct voices from each other as well). This aside the Alpha parameter also exists and it’s rly only nerfed for Kafu and increasing it for Sekai makes her sound a hell of a lot closer to Ojou. I also find it a bit funny that a lot of this also comes from people who eat up every single SynthV letter of the week who also have a very apparent same vocal type syndrome going on despite the engine noise being different but anyways I don’t wanna start on this fkdkslfkg
Going back to the singing style point, I think it is the biggest plus Cevio AI has over other current AI singing synths: it does a pretty damn good job at retaining singing styles and quirks from the voices their banks are based on. IA AI may have been clowned a bit on this end of the fandom and her initial showings didn’t do her justice but I still find it impressive how close she can sound to Lia just out of the box… she may be one of the trickiest banks to work with cause her AI tuning can be pretty wild but I also commend how it commits to keeping it close to Lia’s singing style. The other Cevio AI banks I’ve tried aren’t as much of a challenge to work with but I find it really fun to work with these singing styles and tuning them to fit into different songs! The forced auto-tuning may not be for everyone and I understand that, but ever since I’ve read Cevio banks being described as “giving a singer a script while you act as their vocal director” as opposed to being more of instruments you’re in control of like Vocaloid/SynthV with auto tuning off/Utau I kind of get the vision for Cevio AI a lot better now lol. I can see why this can turn ppl off who prefer a more neutral base that other synths can give you to work with, but I honestly find it makes using Cevio AI really fun to use to me like I said earlier and the AI part of it doesn’t really eliminate the user input when it comes to tuning.
Another really weird take I’ve seen is people claiming SynthV not being popular in Japan is due to… xenophobia??? Which I honestly find baffling, please correct me if I’m wrong but these seem like baseless assumptions and I’ll retract this if there’s anything pointing to that. It’s true something like this may have happened with SeeU back in 2011 but I really don’t see any indication that anything similar is happening here. This reads a bit like denial coming from some people about SynthV just not being as popular as engines that they deem as inferior lol. Fact is that marketing is minimal and not that great outside of their Chinese market & there just isn’t much to write home about when it comes to their Japanese output… maybe up until Mai’s release which seems to have attracted quite a bit of attention for once! Which is proof to me that their lack of success so far isn’t derived from prejudice but from there not really being much of an attractive offering or incentives to buy the program up till now.
As impressive as SV can be the sales aren’t gonna form out of thin air if it has no worthwhile exposure Mai is a standout among the alphabet soup and a free voice for SynthV Pro buyers who had a pretty impressive sounding demo that got quite a decent bit of views and the result of that is SV Pro being on top of DLSite sale rankings rn. See how that works!!!!! It’s finally a step in the right direction and hopefully one that they’ll keep up. It doesn’t help that the whole 1.7 fiasco kind of fumbled the bag a bit when it comes to 3rd party and I wonder how sustainable the feature creep every single update can be to get older VBs up to date but hopefully something like the HDVM situation doesn’t happen again…
anyways sorry I went on a bit of a rant here but I also had some of this stuff on my mind for a bit and I wanted to get some of it out
All good! :)I'm gonna steal this from @pico as I've seen them mention it on twitter, forgive me
I also find it a bit funny that a lot of this also comes from people who eat up every single SynthV letter of the week who also have a very apparent same vocal type syndrome going on despite the engine noise being different but anyways I don’t wanna start on this fkdkslfkg
...
It doesn’t help that the whole 1.7 fiasco kind of fumbled the bag a bit when it comes to 3rd party and I wonder how sustainable the feature creep every single update can be to get older VBs up to date but hopefully something like the HDVM situation doesn’t happen again…
anyways sorry I went on a bit of a rant here but I also had some of this stuff on my mind for a bit and I wanted to get some of it out
This may be completely anectodal to the specific people I follow and see and social media - but I find this opinion isn't unpopular at all. People who love SynthV are generally sick of the alphabet soup and unhappy with how Dreamtonics deals with third party voicebanks. The vast majority of SynthV Twitter was clowning on Dreamtonics for their HDVM update. We like how SynthV sounds, which is why Dreamtonics' choices hurt so much.On the other end, you had the whole HDVM fiasco, where it was pretty much revealed that 3rd party developers are left in the dark, all 3rd party developers were first to apologize and act, leaving DT to be one of the last ones to react, the development on the matter's been very messy and sluggish too, and I don't believe the beta releases, which are basically are way to prevent disasters like this to happen in the future, would've even happened if DT wouldn't have gotten slap on the wrist from Tokyo6 ( which is honestly embarrassing ), and even then, miscommunications still appear to be an issue considering what's happened with Chifuyu's Lite vb and Eleanor's beta vb, and the feature creep is still an issue, leaving the future of vbs on the engine uncertain.
It's just all really really bad look when you contrast it next to techno speech, and when put side by side it's no wonder Cevio AI is more favored in terms of development.
I've seen excuses for it made already that it's just their "1st little oopsie " but that's in my opinion grossly underestimating just how bad all mentioned above really is...and that HDVM on launch was a result of an issue that's been going on this whole time and only became most apparent with HDVM.
Personally, I think the argument is less “People act like SynthV can do no wrong,” which, as you’ve pointed out, isn’t quite the case these days, and more “Even at its most criticized, people are still willing to overlook SynthV’s flaws in favor of its strengths, something CeVIO isn’t afforded in this side of the fandom.” Each engine has strengths and weaknesses and different appeals, but there’s still such a huge attitude of negativity towards anything that’s not SynthV. Someone made a comment on VocaTwitter in response to SeeU and Uni’s possible returns saying they personally hoped they were on CeVIO rather than SynthV, and someone replied “When you hate quality and love engine noise.” It had about three times the amount of likes as the original comment.This may be completely anectodal to the specific people I follow and see and social media - but I find this opinion isn't unpopular at all. People who love SynthV are generally sick of the alphabet soup and unhappy with how Dreamtonics deals with third party voicebanks. The vast majority of SynthV Twitter was clowning on Dreamtonics for their HDVM update. We like how SynthV sounds, which is why Dreamtonics' choices hurt so much.
To Dreamtonics' credit, they seemed to have finally hired people to do their marketing, and SynthV is starting to reach out their intended audience.
But just as it was mentioned, I wish Dreamtonics allowed AHSoftware and other third party companies to branch out into the Japanese doujin community more. It is possible, with multiple SynthV voicebanks that have already been appreciated by Japanese users. I completely agree that some SynthV users are way too harsh on CeVIO, when it's catering to the same fanbase that love(d) Vocaloid.
I imagine high-pitched, formant-shifted voices like Kafu and Sekai are not going to appeal to a lot of those who like the very natural sound of SynthV (but of course there are multiple people who like both kinds of voices). Kafu also has an issue of being really muffled. But voices like Coko, Poppy and Rose show that CeVIO is equally capable of capturing voices that feel real.
I think most people who think SynthV is objectively superior just don't like the artificial sound of CeVIO's engine. It's not a problem when the voice is strong and belting, but it really jumps out in softer voices. Some people actually like the engine noise, though, and that's cool. Hearing how good Coko and Rose's belting was made me really excited for c flower.
But inversely to SynthesizerV, CeVIO doesn't cater to countries outside of Japan at all. I would be more excited for Rose and c flower if they had proper cross-lingual support. Hopefully, one day! I also realise this post doesn't really contain any unpopular opinions either, so I'm not telling anyone off lol. It's just nice to talk about this, because the bird app does not allow much nuance in discussions.
I didn't share these critiques in isolation, but as contrast to Cevio's strengths, strengths that get overlooked, it's to highlight the refusal of the community to expand onto another editor ( highlight on the word expand, not move to ) despite it doing things that are desired for in the community ( focus on characters and preserving the doujin-y feel ), or not even expand onto but to be even given recognition for it.This may be completely anectodal to the specific people I follow and see and social media - but I find this opinion isn't unpopular at all. People who love SynthV are generally sick of the alphabet soup and unhappy with how Dreamtonics deals with third party voicebanks. The vast majority of SynthV Twitter was clowning on Dreamtonics for their HDVM update. We like how SynthV sounds, which is why Dreamtonics' choices hurt so much.
And with anecdotal evidence on my end, even when criticized, I find that the community is a lot softer with it's criticism towards SVS, I've seen the community be at it's most polite when directing their criticism at it, which says a lot in my book considering they're capable of borderline harassment for a lot less, I've seen people get more worked up over V5 going on sale than the status of Eleanor's boxes.Personally, I think the argument is less “People act like SynthV can do no wrong,” which, as you’ve pointed out, isn’t quite the case these days, and more “Even at its most criticized, people are still willing to overlook SynthV’s flaws in favor of its strengths, something CeVIO isn’t afforded in this side of the fandom.
Apologies for double posting but since I mentioned Vocaloid I got reminded of another 🧅 of mine:I've seen people get more worked up over V5 going on sale than the status of Eleanor's boxes.
Wait but didn't the SynthV community WANT niche voicebanks? Even then, from his vocal mode demo he seems almost as versatile as Solaria.On a completely different topic, I saw some folks comment on Twitter that Asterian's voice is too niche. Nothing could be the furthest thing from the truth, but a lot of people just don't know how to bring out colour and expression in baritone and bass voices. They're so rare. I hope Asterian will grow in popularity despite this.