What are some good text to audio AI models out there?
Discussion
Elevenlabs is the best I’ve found. I use their upgraded pro voice plan when in writing edit mode. I use it to instantly audiobook the current draft of my writings and listen to them in my own voice while golfing. I notate edit plans between putts as I listen. A fantastic workflow, I might add. That have a good project UI and features too. I’m enjoying their constant progress.
I’ve heard good things about Udio
I suggested Elevenlabs, but if you find a better one, I’d enjoy hearing where you landed.
I’ve heard of this but haven’t played around. Can you do sound effects and such things through prompts?
nostr:npub1jk9h2jsa8hjmtm9qlcca942473gnyhuynz5rmgve0dlu6hpeazxqc3lqz7 always has a leading edge on things happening in this space. Curious what he’s seeing 🤔
From my understanding, Elevenlabs is purely multilingual, multi-accent, or custom voice speech from text. It gets inflections wrong every once in a while, but in project mode, it’s pretty easy to regenerate isolated portions that needed correction without regenerating the whole thing. Pretty robust. For my use, I don’t need it perfect, since I’m just reviewing my own work and don’t need full production mode.
My purpose for this is, I have been crafting an idea to start a podcast to tell a fictional story about American manufacturing. I can read and record myself but would like to add some production value if I could using ai, or have it read in my voice. Just looking at options now.
AudioCraft does really good sound effects n stuff like that and it’s open source, it’s also really easy to install n run