Decent TTS (scroll down to test models)

VALL-E: Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

https://valle-demo.github.io/
