Not sure if this exists, but I'm approaching a need for it.

Creating a VTT transcript from a video and its existing text.

Audiobooks and songs already have the text; it just needs to be timestamped to when each part is heard in the media file. Logically, this should take less CPU and be more accurate than current blind transcription services, since the words are already known and only the timing has to be found.
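For what it's worth, matching known text to audio is usually called forced alignment. Once an aligner has produced start and end times for each line, writing the VTT file is the easy part. Here is a minimal Python sketch of that last step only. The `aligned_lines` data is made up for illustration, and `format_timestamp` and `build_vtt` are hypothetical helper names, not part of any existing tool.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def build_vtt(cues) -> str:
    """Build WebVTT text from an iterable of (start_seconds, end_seconds, text)."""
    lines = ["WEBVTT", ""]  # required header, then a blank line
    for start, end, text in cues:
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text)
        lines.append("")  # a blank line ends each cue
    return "\n".join(lines)


# Assumed output of some alignment step (illustrative values only).
aligned_lines = [
    (0.0, 2.5, "First line of the known text."),
    (2.5, 6.04, "Second line of the known text."),
]

print(build_vtt(aligned_lines))
```

The hard part, finding the timings themselves, is not shown here and would still need an alignment tool.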
