Submitted by the_javi_himself t3_y8eun3 in MachineLearning
the_javi_himself OP t1_it1s5tu wrote
Reply to comment by iori42 in [R] State of the art audio classification by the_javi_himself
>https://paperswithcode.com/sota/audio-classification-on-audioset
Great resource, thanks a lot!
​
By the way, do you know if we can access a pretrained version of any of these models? I've found many for computer vision but none for audio.
iori42 t1_it1t572 wrote
I don't know about general audio classification, but there are some pretrained models for speech applications (ASR, speaker verification, speech detection). But maybe this is useful for you: https://github.com/qiuqiangkong/audioset_tagging_cnn
firejak308 t1_it2esjt wrote
If you go on PapersWithCode and click the GitHub icon for any of the papers, it'll link you to their public repo, which in many cases will have pretrained models available.
Viewing a single comment thread. View all comments