Comments

Taenk t1_jc02fzb wrote

Excellent demo on your page. I just used it on a YT video featuring a non-native English speaker, and there was only a slight punctuation error, caused by an ambiguously long pause in the speech.

Is this a purely commercial product or will there be an open source release?

4

boyetosekuji t1_jc04pvl wrote

What is the difference between Standard at $1.25/hr and Enhanced at $1.90/hr?

4

HyoTwelve t1_jc0u4jo wrote

Is there any way to get the encoded speech features?

2

Traditional_Yard_725 t1_jc0xvd2 wrote

Can confirm it is better than Whisper, and it doesn't randomly go off the rails either, but I don't wanna have to pay 😅

4

HotRecognition0121 t1_jc10odk wrote

Are there WER figures for other languages, like on the GitHub page for Whisper? I want to compare the performance in other languages.

1

nucLeaRStarcraft t1_jc1334g wrote

Why is this tagged [R]? This is a commercial project at best. Where's the paper, where's the code? Can we use it today on our PC like Whisper? This really isn't 'research'.

37

CashyJohn t1_jc184r4 wrote

Wav2vec2 is still SOTA. As long as this isn’t open source, it’s kinda useless lmao

6

Deep-Station-1746 t1_jc196cg wrote

>25% improvement over Whisper
>Not open source
>doubt.jpeg

18

filisterr t1_jc1c8lp wrote

So is this post kind of a hidden advertisement or what?

3

Guitargamer57 t1_jc1hhjz wrote

I tested it on Japanese and it misses punctuation for the most part. Overall, though, it seems to be doing a good job getting the words.

1

dojoteef t1_jc2uum4 wrote

Removed after LOTS of reports. See rules #3 and #8 in the sidebar.

1