the_javi_himself OP t1_it1s5tu wrote on October 20, 2022 at 9:27 AM

Reply to comment by iori42 in [R] State of the art audio classification by the_javi_himself

>https://paperswithcode.com/sota/audio-classification-on-audioset

Great resource, thanks a lot!

By the way, do you know if we can access a pretrained version of any of these models? I've found many for computer vision but none for audio.

iori42 t1_it1t572 wrote on October 20, 2022 at 9:41 AM

I don't know of any for general audio classification, but there are some pretrained models for speech applications (ASR, speaker verification, speech detection). This repo might be useful for you, though: https://github.com/qiuqiangkong/audioset_tagging_cnn

firejak308 t1_it2esjt wrote on October 20, 2022 at 1:22 PM

If you go on PapersWithCode and click the GitHub icon for any of the papers, it'll take you to the paper's public repo, which in many cases includes pretrained models.