babua
babua t1_isvs90p wrote
Reply to comment by Cheap_Meeting in [D] How frustrating are the ML interviews these days!!! TOP 3% interview joke by Mogady
If your interview is designed to fail 97% of interviewees, it's completely truthful to claim that you only hire the top 3% ... ^of^the^people^who^apply^to^you
babua t1_j6khgfr wrote
Reply to comment by psma in [D] What's stopping you from working on speech and voice? by jiamengial
I don't think it stops there either: a streaming architecture probably breaks core assumptions of some speech models. For STT, when do you "try" to infer the word? For TTS, how do you intonate the sentence correctly if you don't know the second half? You'd have to retrain your entire model for the streaming case and create new data augmentations, and you'd probably sacrifice some performance even in the best case because the model simply has to deal with more uncertainty.