Viewing a single comment thread. View all comments

nmfisher t1_j3l4ipq wrote

Echoing this, KD is also very useful for taking a heavyweight GPU model and training a student model that's light enough to run on mobile. Small sacrifice in quality for huge performance gains.

3

fredlafrite OP t1_j3l65ju wrote

Interesting! Echoing this, do you know which kind of companies one could work on this in an applied setting?

1