What are some good text-to-audio AI models out there?

Discussion

ElevenLabs is the best I’ve found. I use their upgraded pro voice plan when I’m in editing mode: I instantly turn the current draft of my writing into an audiobook in my own voice and listen to it while golfing, noting edits between putts. A fantastic workflow, I might add. They have a good project UI and features too, and I’m enjoying their constant progress.

I’ve heard good things about Udio

I suggested ElevenLabs, but if you find a better one, I’d enjoy hearing where you landed.

I’ve heard of this but haven’t played around with it. Can you do sound effects and that sort of thing through prompts?

nostr:npub1jk9h2jsa8hjmtm9qlcca942473gnyhuynz5rmgve0dlu6hpeazxqc3lqz7 always has a leading edge on things happening in this space. Curious what he’s seeing 🤔

From my understanding, ElevenLabs is purely text-to-speech: multilingual, multi-accent, or custom-voice. It gets inflections wrong every once in a while, but in project mode it’s pretty easy to regenerate just the portions that need correction without regenerating the whole thing. Pretty robust. For my use, I don’t need it perfect, since I’m just reviewing my own work and don’t need full production quality.

My reason for asking: I’ve been working on an idea for a podcast that tells a fictional story about American manufacturing. I can read and record it myself, but I’d like to add some production value with AI if I can, or have it read in my voice. Just looking at options for now.

I digress. In my opinion, your own voice beats ElevenLabs for final production value, always. I misread your text-to-audio question and answered as if it were about text-to-speech.

I was thinking it might be good for other characters or narration.

Ah, yes. I can see exploring it for that.

AudioCraft does really good sound effects and stuff like that, and it’s open source. It’s also really easy to install and run.

https://github.com/facebookresearch/audiocraft
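
For anyone who wants to try prompt-driven sound effects, here is a rough sketch of what that could look like with AudioCraft’s AudioGen model, going by the usage shown in the project README. The `slugify` and `generate_effects` helpers and the example prompts are made up for illustration, and it assumes `audiocraft` is installed and the `facebook/audiogen-medium` checkpoint can be downloaded. Treat it as a starting point, not a tested production script.

```python
import re


def slugify(prompt: str) -> str:
    """Turn a text prompt into a safe file stem (hypothetical helper)."""
    return re.sub(r"[^a-z0-9]+", "_", prompt.lower()).strip("_")


def generate_effects(prompts, duration=5.0):
    """Generate one clip per prompt with AudioGen and save each as a WAV file.

    Hypothetical wrapper; assumes the audiocraft package is installed.
    """
    # Imported lazily so slugify() still works without audiocraft installed.
    from audiocraft.models import AudioGen
    from audiocraft.data.audio import audio_write

    model = AudioGen.get_pretrained("facebook/audiogen-medium")
    model.set_generation_params(duration=duration)  # seconds per clip
    wavs = model.generate(prompts)  # one waveform per prompt

    paths = []
    for prompt, wav in zip(prompts, wavs):
        stem = slugify(prompt)
        # audio_write adds the file extension itself (WAV by default).
        audio_write(stem, wav.cpu(), model.sample_rate, strategy="loudness")
        paths.append(stem + ".wav")
    return paths
```

Calling something like `generate_effects(["factory floor with machinery humming", "steam whistle blowing"])` should leave one WAV file per prompt in the current directory. The first run downloads the model weights, and a GPU makes generation much faster.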