General Discussion Thread

aru ii · Jan 27, 2023

Update on Galaco Talk situation. Don’t fully understand it

Announcement of “galacoTalk” distribution and support end date - VOCALOID - the modern singing synthesizer -

We would like to inform you of the distribution and sup

www.vocaloid.com

aru ii · Jan 27, 2023

aru ii said:
Update on Galaco Talk situation. Don’t fully understand it

Announcement of “galacoTalk” distribution and support end date - VOCALOID - the modern singing synthesizer -

We would like to inform you of the distribution and sup

www.vocaloid.com

Ok so, Galaco Talk won’t be included with NEO after 15/2/23, and the end of support for her authorisation and deactivation will be on 31/3/24… this is like Galaco prize all over again

AddictiveCUL (Add) · Jan 27, 2023

aru ii said:
Ok so, Galaco Talk won’t be included with NEO after 15/2/23, and the end of support for her authorisation and deactivation will be on 31/3/24… this is like Galaco prize all over again

Too bad, at least we still have her Vocaloid library.

WyndReed · Jan 28, 2023

“Note 2: After February 16, 2023, customers who registered a VOCALOID ID and purchased "VOCALOID3 Library galaco" can download "galacoTalk" from the VOCALOID SHOP download code confirmation link included in the e-mail you receive when you purchased the product.”
So you can still get it afterwards??

Prism · Jan 28, 2023

I don't think that's 100% the case. It says you can have the installer I don't know if you can activate it tho

aru ii · Jan 28, 2023

Yeah, after the 16tg you would be able to download the installer, but no new codes will be given out

aru ii · Jan 30, 2023

Haruka Nana got a new vb and design for her 14th anniversary

pico · Jan 30, 2023

I am unreasonably excited about this

Rylitah · Jan 30, 2023

Crowdfund for UNI's fifth anniversary has been announced! It's for an album, merch (acrylic stand+clear file+pin badge) of the main visual (of the logo for the pin badge), and a plush (including SeeU)!

(Note: this doesn't imply any sort of synth update, just like how SeeU's anniversary crowdfund went. It's just an anniversary celebration.)

aru ii · Jan 31, 2023

New Coeiroink vb from Tsuina Project. Her name is Namidane Koron

aru ii · Feb 1, 2023

Just discovered this vsynth/tts called poino. It looks like it can talk and sing

poino official site

GitHubからダウンロード

t.co

AddictiveCUL (Add) · Feb 1, 2023

aru ii said:
Just discovered this vsynth/tts called poino. It looks like it can talk and sing

poino official site

GitHubからダウンロード

t.co

Won't lie, it doesn't sound like a person at all, but it's still cool to know this exists

Rylitah · Feb 1, 2023

AddictiveCUL said:
Won't lie, it doesn't sound like a person at all, but it's still cool to know this exists

Going by the poino site, that's because there are no voice actors involved. Similar to Adachi Rei, who also has no voice provider. The site says this is done by editing the formants from envelopes of Fourier transforms (or it uses Fourier transforms to synthesize the sounds? I don't really understand the technical aspect of this, haha) - either way, it'll probably never sound perfectly clear or human, but it's a valiant effort.

... Though I wonder if the first character's name being Reichii (funnily enough, the name of the character on the github download is spelled "Reinii" or "Rainy") is invoking Adachi Rei on purpose, considering they're more or less the same thing. They sound pretty similar too, also because of the source of their voices, I suppose. Both licenses are also pretty free for personal and commercial use (Adachi Rei only requires a commercial license for big businesses/people expecting to make a huge profit off her character/voice, though small-scale paid doujin works are fine to make and distribute without that).

Really, the big difference between them is Adachi Rei's singing voicebank is for UTAU (and A.I.VOICE for speech being a paid product) while poino seems to just be fully it's own free thing. That's pretty neat.

kozet · Feb 1, 2023

Rylitah said:
... Though I wonder if the first character's name being Reichii (funnily enough, the name of the character on the github download is spelled "Reinii" or "Rainy")

There are 2 characters: (by the file names in the repo) Laychie and Layney.

pico · Feb 2, 2023

A SIGNAL PROCESSING MOMENT?! ON MY FORUM? IT'S MORE LIKELY THAN YOU'D THINK! :piko_ani_lili:

The inverse fourier transform generates the output signal in real time. Fourier transforms are basically the most common way of interacting with signals in signal processing. On the most basic level, a fourier transform converts a signal from the time domain to the frequency domain. It's most easy to understand by looking at a picture:

Before the fourier transform is performed, we see a constant signal being generated. But it's kind of hard to understand and modify in this form.
After we perform the fourier transform and convert the signal into the frequency domain, we see an impulse at the frequency the signal is at. In this case, it's about ~3 hertz (Hz). Super obvious!
If you have an input signal that is changing frequency (pitch) over time, like a human singing voice, we can see the individual frequencies with the fourier transform!

What an artificial voice like poino does is filter a signal at a given pitch with the fourier transform to make it resemble a human voice.

For example, we can see what frequencies make up the vowel "a" like this:

Article explaining it: Identifying sounds in spectrograms
When the spectrogram is red, there's a higher density of sound there at a certain frequency. So we want to amplify those red parts of our signal to create an "a" sound, and filter out the rest.

The fourier transform is our tool for accomplishing this. After filtering and amplifying different parts of that simple sine wave we started with, we can end up with a sound that sounds more like a human voice after inverse fourier transforming it back to the original time domain, which you then play out of your speakers as sound.

It works by taking the Fourier transform of the signal, then attenuating or amplifying specific frequencies, and finally inverse transforming the result.

A good article on how filtering with the inverse fourier transform works:
Intro. to Signal Processing:Fourier filter

It can be a lot to get your head around at first!

I think the more poignant difference between Adachi Rei and this software is that Missile created Rei's voice samples by hand by manually shifting the source sin wave around in Audacity and playing with every sound manually, which he then exported. On the other hand, this software is generating a voice completely algorithmically. Rei's voice is going through a lot of different layers of processing by the time you export it from UTAU, while poino generates the voice from scratch in real time. I like Rei's voice for sitting squarely in the middle between completely algorithmically generated sound and being lovingly crafted by a person by hand.

peaches2217 · Feb 2, 2023

pico said:
A SIGNAL PROCESSING MOMENT?! ON MY FORUM? IT'S MORE LIKELY THAN YOU'D THINK!

The inverse fourier transform generates the output signal in real time. Fourier transforms are basically the most common way of interacting with signals in signal processing. On the most basic level, a fourier transform converts a signal from the time domain to the frequency domain. It's most easy to understand by looking at a picture:

Before the fourier transform is performed, we see a constant signal being generated. But it's kind of hard to understand and modify in this form.
After we perform the fourier transform and convert the signal into the frequency domain, we see an impulse at the frequency the signal is at. In this case, it's about ~3 hertz (Hz). Super obvious!
If you have an input signal that is changing frequency (pitch) over time, like a human singing voice, we can see the individual frequencies with the fourier transform!
View attachment 7276

What an artificial voice like poino does is filter a signal at a given pitch with the fourier transform to make it resemble a human voice.

We can tell what frequencies certain sounds make up like this:
View attachment 7277
Article explaining it: Identifying sounds in spectrograms
When the spectrogram is red, there's a higher density of sound there at a certain frequency. So we want to amplify those red parts of our signal to create an "a" sound, and filter out the rest.

The fourier transform is our tool for accomplishing this. After filtering and amplifying different parts of that simple sine wave we started with, we can end up with a sound that sounds more like a human voice after inverse fourier transforming it back to the original time domain, which you then play out of your speakers as sound.

A good article on how filtering with the inverse fourier transform works:
Intro. to Signal Processing:Fourier filter

It can be a lot to get your head around at first!

I think the more poignant difference between Adachi Rei and this software is that Missile created Rei's voice samples by hand by manually shifting the source sin wave around in Audacity and playing with every sound manually, which he then exported. On the other hand, this software is generating a voice completely algorithmically. Rei's voice is going through a lot of different layers of processing by the time you export it from UTAU, while poino generates the voice from scratch in real time. I like Rei's voice for sitting squarely in the middle between completely algorithmically generated sound and being lovingly crafted by a person by hand.

Damn, this is amazing! I don’t understand most of it, but I get the basic differences, and I’m amazed at how much work goes into these things.

pico · Feb 2, 2023

It gets even more complex once you get into the applications orz but it's all fascinating and you can do an incredible amount with it! It wouldn't be an exaggeration to say that modern society is practically built off of fourier transforms! lol

AddictiveCUL (Add) · Feb 2, 2023

pico said:
A SIGNAL PROCESSING MOMENT?! ON MY FORUM? IT'S MORE LIKELY THAN YOU'D THINK!

The inverse fourier transform generates the output signal in real time. Fourier transforms are basically the most common way of interacting with signals in signal processing. On the most basic level, a fourier transform converts a signal from the time domain to the frequency domain. It's most easy to understand by looking at a picture:

Before the fourier transform is performed, we see a constant signal being generated. But it's kind of hard to understand and modify in this form.
After we perform the fourier transform and convert the signal into the frequency domain, we see an impulse at the frequency the signal is at. In this case, it's about ~3 hertz (Hz). Super obvious!
If you have an input signal that is changing frequency (pitch) over time, like a human singing voice, we can see the individual frequencies with the fourier transform!
View attachment 7276

What an artificial voice like poino does is filter a signal at a given pitch with the fourier transform to make it resemble a human voice.

For example, we can see what frequencies make up the vowel "a" like this:
View attachment 7277
Article explaining it: Identifying sounds in spectrograms
When the spectrogram is red, there's a higher density of sound there at a certain frequency. So we want to amplify those red parts of our signal to create an "a" sound, and filter out the rest.

The fourier transform is our tool for accomplishing this. After filtering and amplifying different parts of that simple sine wave we started with, we can end up with a sound that sounds more like a human voice after inverse fourier transforming it back to the original time domain, which you then play out of your speakers as sound.

A good article on how filtering with the inverse fourier transform works:
Intro. to Signal Processing:Fourier filter

It can be a lot to get your head around at first!

I think the more poignant difference between Adachi Rei and this software is that Missile created Rei's voice samples by hand by manually shifting the source sin wave around in Audacity and playing with every sound manually, which he then exported. On the other hand, this software is generating a voice completely algorithmically. Rei's voice is going through a lot of different layers of processing by the time you export it from UTAU, while poino generates the voice from scratch in real time. I like Rei's voice for sitting squarely in the middle between completely algorithmically generated sound and being lovingly crafted by a person by hand.

Nerd! Kkkkkkkkk <3

aru ii · Feb 11, 2023

pico · Feb 11, 2023

I was not expecting that look for the character at all but I’m so glad we got something super unique. Right now people are photoshopping other characters onto the sword (like they were stabbed amidst some kind of hijinks). All good stuff.

General Discussion Thread

Your Neighborhood Tianyi Enthusiast!

Your Neighborhood Tianyi Enthusiast!

CUL addicted!

Dareka tasukete!

Enthusiast

Your Neighborhood Tianyi Enthusiast!

Your Neighborhood Tianyi Enthusiast!

robot enjoyer

kiyoteru enthusiast

Your Neighborhood Tianyi Enthusiast!

Your Neighborhood Tianyi Enthusiast!

CUL addicted!

kiyoteru enthusiast

Conlanger

robot enjoyer

Attachments

Give me Gackpoid AI or give me DEATH

robot enjoyer

CUL addicted!

Your Neighborhood Tianyi Enthusiast!

robot enjoyer

Users Who Are Viewing This Thread (Users: 0, Guests: 1)