Submitted by Lugi t3_xt01bk in MachineLearning
Lugi OP t1_iqoahos wrote
Reply to comment by VenerableSpace_ in [D] Focal loss - why it scales down the loss of minority class? by Lugi
Yes, but I am specifically using the alpha-balanced version, which they used in a counterproductive way.
VenerableSpace_ t1_iqocr2s wrote
The alpha term uses inverse class frequency to down-weight the loss. So if there is a 3:1 ratio of majority:minority, alpha_majority = 0.25 and alpha_minority = 0.75.
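A minimal sketch of that weighting, assuming the "inverse class freq" rule means alpha = 1 - frequency (the paper itself just treats alpha as a tunable hyperparameter):

```python
# Illustrative only: inverse-class-frequency weights for a 3:1 split.
labels = [0, 0, 0, 1]  # class 0 is the majority, class 1 the minority
freq = {c: labels.count(c) / len(labels) for c in set(labels)}
alpha = {c: 1.0 - f for c, f in freq.items()}
print(alpha)  # {0: 0.25, 1: 0.75} -> minority class gets the larger weight
```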
Lugi OP t1_iqodhe1 wrote
Yes, but the problem is that while they mention that in the paper, in the end they use an alpha of 0.25, which down-weights the minority (foreground) class, while the background (majority) class gets a scaling of 0.75. This is what I'm concerned about.
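For reference, a sketch of the alpha-balanced focal loss as the RetinaNet paper writes it, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t), with alpha_t = alpha for the foreground (y = 1) and 1 - alpha for the background (y = 0); the function name and tensor layout here are my own illustration, not code from the paper:

```python
import torch

def focal_loss(p, y, alpha=0.25, gamma=2.0):
    """p: predicted foreground probability, y: binary target (1 = foreground).

    With the paper's alpha = 0.25, foreground examples are scaled by 0.25
    and background examples by 0.75, which is the point raised above.
    """
    p_t = torch.where(y == 1, p, 1 - p)
    alpha_t = torch.where(y == 1,
                          torch.full_like(p, alpha),
                          torch.full_like(p, 1 - alpha))
    return -alpha_t * (1 - p_t) ** gamma * torch.log(p_t)

# Example: a foreground and a background prediction.
p = torch.tensor([0.9, 0.9])
y = torch.tensor([1, 0])
print(focal_loss(p, y))  # the foreground term carries the smaller alpha
```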
VenerableSpace_ t1_iqorbnu wrote
Ahh, I see now; it's been a while since I read that paper. They chalk it up to the interaction between alpha and the focal term. You can see how they need to use a non-intuitive value for alpha when they introduce the focal term in Table 1b, especially when gamma > 0.5.
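A quick numeric check of that interaction (the probabilities below are illustrative, not from the paper): with gamma > 0 the focal term alone already crushes the loss on easy, well-classified background examples, so alpha no longer needs to favor the minority foreground:

```python
import math

p_easy_bg = 0.95   # p_t for a confidently classified background example
p_hard_fg = 0.30   # p_t for a poorly classified foreground example
for gamma in (0.0, 0.5, 2.0):
    bg = (1 - p_easy_bg) ** gamma * -math.log(p_easy_bg)
    fg = (1 - p_hard_fg) ** gamma * -math.log(p_hard_fg)
    print(f"gamma={gamma}: easy-bg loss {bg:.5f}, hard-fg loss {fg:.5f}")
# As gamma grows, the easy-background loss collapses much faster than the
# hard-foreground loss, which is why a sub-0.5 alpha can still work.
```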