NWCoffeenut t1_jdsgb83 wrote on March 26, 2023 at 8:54 PM

I think a good part of the latency was with the TTS system. The actual text response for the most part came back reasonably quickly.

illathon t1_jdsoud8 wrote on March 26, 2023 at 9:55 PM

No most implementations of whisper are slow.

Whisper is the speech recognition component.
I don't think he said what he's using for TTS, might be MacOS' builtin thingy.

They're using elevenlabs, which isn't local and hence a slow API call

If we eventually get open source Elevenlabs quality models running locally it's gonna be insane.

!remind me 1 month

I will be messaging you in 1 month on 2023-04-27 04:55:12 UTC to remind you of this link

3 OTHERS CLICKED THIS LINK to send a PM to also be reminded and to reduce spam.

^(Info)	^(Custom)	^(Your Reminders)	^(Feedback)

There's also Tortoise TTS which can be run locally but idk how fast it is.