Viewing a single comment thread. View all comments

TheMrZZ0 t1_iyou3a8 wrote

I'm curious - what's the source of the data? Being able to add 1400h of new content every day sound great.

1

t0mkaka OP t1_iypifl5 wrote

Audio files I download from the links in the RSS feed. Then I am generating the transcripts using whisper. Not always great but it works most of the time.

1