Viewing a single comment thread. View all comments

vivehelpme t1_j7vfol7 wrote

Instead of trying to salvage the original recording why not recreate it by putting the text transcript into a text-to-speech model?

As you have it transcribed you don't even need to do any advanced speech recognition that filters the noise, just paste the text into something a bit more advanced than Microsoft Sam

2

CeFurkan OP t1_j7volvh wrote

but what about synching? how to solve synching problem?

i haven't found any way to re-voice with proper synchronization

i can prepare a perfect .vtt file but how to sync it with video?

1

express_mode_420 t1_j7w3mrm wrote

Could you speech-to-text your lecture, collecting timestamps, do the same with TTS and automagically sync that way?

2

CeFurkan OP t1_j7whf7d wrote

i have vtt file you know the subtitles we use for movies

but i haven't found and text to speech that can generate speech with that timing

do you know any?

​

about your suggested approach, any way to automatically do it? i mean we generate speech then we sync but how?

1

express_mode_420 t1_j7wizoa wrote

I'm not sure how I'd go about syncing it, but would this be an adequate workaround:

  • break apart your script in small chunks by time stamp
  • generate different tts recordings off of each time stamp
  • generate an audio file that inserts each of the produced recordings at their respective time-stamped location
  • replace the audio of the recording with your newly produced recording
2

CeFurkan OP t1_j7wsy5f wrote

so it is a logical layout

any software that can do it?

1

express_mode_420 t1_j7wya6a wrote

I think this is more likely a task for Python. I haven't done anything like this myself, it's just the approach I would start with.

2

CeFurkan OP t1_j7yjgw6 wrote

if only i were not a c# programmer but a python programmer :/

1

express_mode_420 t1_j7z394g wrote

Check out murf.ai, that service works similarly to what i described

2

CeFurkan OP t1_j81neng wrote

tested looks awesome but i have to purchase yearly plan which is 3500$ lol :D

1