Decent TTS (scroll down to test models)
VALL-E Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Decent TTS (scroll down to test models)
VALL-E Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
No replies yet.