What's new

Latest resources

English patch for XStudio/StudioVoICE haru0l
0.00 star(s) 0 ratings
Updated
0.00 star(s) 0 ratings
Updated
How to VSQx despite being tone-deaf (or bad at pitch detection).
0.00 star(s) 0 ratings
Updated
Resource Kontakt
What Kontakt Is and Information about Its Use
0.00 star(s) 0 ratings
Updated
MikuMikuDance 9.32 ThreeR
MMD 932 version hosted on Geocities.WS
0.00 star(s) 0 ratings
Updated
0.00 star(s) 0 ratings
Updated
Corrected Iori Yuzuru emotion images + ability to mix and match faces/bodies uncreepy
The official images have inconsistent gradients & shoelaces, so I fixed them. Mix and match emotions
0.00 star(s) 0 ratings
Downloads
0
Updated
How to make Japanese Vocaloids roll their R's
0.00 star(s) 0 ratings
Updated
Activate Non-Crypton Vocaloids In Piapro uncreepy
Video demonstration of how to activate non-Crypton Vocaloids in Piapro
0.00 star(s) 0 ratings
Updated

Latest profile posts

Every once in a while I get the urge to make eng utau vb of my own ujhygtf its that time of the month again.
patuk
patuk
I went through huge ass dilema as to what reclist to use, I just knew I didn't wanna deal with VCCV, I settled on Arpasing u_u I've heard it's not the best for non-native accents but I work the best off of reclists based on actual words, not gibberish like many eng utau reclists are :/ plus I am far from a good oto-er so moresampler is of a big help hbgvfcdx
I'm beginning to suspect the "harshness" in Saki AI, which is still audible in version 104, is aliasing distortion.
Ronny777
Ronny777
I don't really know what that means but I really hope it means its issue you can fix or at least create a workaround for!
parallax_fifths
parallax_fifths
The problem lies with SynthV itself. I barely know anything about SynthV or signal processing, but according to Jordi Bonada's 2008 PhD thesis, the original Vocaloid engine did most of its heavy lifting in the frequency domain. This means that an inverse fast Fourier transform was used to push the data into the time domain so that it can be turned into PCM data. Perhaps SynthV also does its heavy lifting in the frequency domain.

A quick google for "inverse fast Fourier transform" and "aliasing" together does return many results for papers on the subject of aliasing and FFTs, but it's all Greek to me.

I also downloaded Bonada's original thesis and, and a search for "alias" brings up four results. It seems that he was worried about aliasing, but not too much.

His thesis is here:

Here's the Wikipedia article on FFTs:

And here's the Wikipedia article on aliasing:

In that Wikipedia aliasing article there is a sound sample containing a sawtooth wave played at 440 Hz (A4), 880 Hz (A5), and 1760 Hz (A6). Each pitch is played twice, first with bandlimiting (anti-aliasing), and then without bandlimiting (aliased). The 440 Hz non-bandlimited sawtooth is closer in range to Saki's higher usable notes, and its aliasing sounds suspiciously similar to the harshness I hear in Saki. (Or SynthV in general.) Of course, the overtones of a sawtooth are inherently harsher than a voice, so the Wikipedia example exaggerates the effect, but that's probably why they used a sawtooth.

Note that I'm not a math person, but the concepts discussed aren't entirely alien to me.

And while we're on the topic of aliasing, here's an awesome Dan Worrall video about oversampling and aliasing that you may find practical when mixing. (And it might clear up some of the nonsense I wrote above):
People say vocaloid music is immature and bad when anything utsu p has ever made outdoes the entirety of the beatles' discography
went to a ghost town todaaay. it was way too windy to really do anything (we've been getting pretty bad wind storms lately) but i enjoyed the roadtrip and got a few pictures out of the adventure ~