I would love to transfer all my written books in PDF format into audio books, read by Guy Swann's voice. is that something this DVM could do as well?

Reply to this note

Please Login to reply.

Discussion

The DVM currently only does Speech to Text for Nostr events, but I can update it to work with urls if the PDF is available online

Full disclosure though, the cost is $0.36/1000 characters (not words) so for a full length book it could be more than $100 depending on the length

I see. Is it that expensive to do inference on the GPU?

It's a API wrapped as a DVM and that's the service cost of the API

I'm not running my own model or hardware

I can make a much cheaper DVM, but the largest part of the expense is the voice cloning for the service I'm using