New Chinese ai voice synthesiser ByteSing is in development. ByteSing uses duration allocated encoder-decoder acoustic models and WaveRNN vocoders. It has along way to go but the results are amazing.
paper
arxiv.org
demos
paper
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders
This paper presents ByteSing, a Chinese singing voice synthesis (SVS) system based on duration allocated Tacotron-like acoustic models and WaveRNN neural vocoders. Different from the conventional SVS models, the proposed ByteSing employs Tacotron-like encoder-decoder structures as the acoustic...

demos
TTS demos
bytesings.github.io