
seba07 t1_j0lg760 wrote

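In my experience the loss function also plays an important part. Cross entropy pushes the model to be very certain about its decision, because the loss keeps shrinking as the predicted probability of the true class approaches 1. Focal loss down-weights examples the model already classifies well, so it can produce a smoother output distribution.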
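
To make the difference concrete, here is a minimal sketch comparing the two losses on a single example. It assumes the usual focal loss form, cross entropy multiplied by (1 - p) ** gamma, where p is the predicted probability of the true class. The function names and gamma = 2.0 are illustrative choices of mine, not anything taken from a specific library.

```python
import math


def cross_entropy(p_true: float) -> float:
    """Cross entropy for one example, given the predicted probability of the true class."""
    return -math.log(p_true)


def focal_loss(p_true: float, gamma: float = 2.0) -> float:
    """Cross entropy scaled by (1 - p_true) ** gamma (gamma is an assumed value here)."""
    return (1.0 - p_true) ** gamma * -math.log(p_true)


# Compare both losses as the prediction gets more confident.
for p in (0.6, 0.9, 0.99):
    print(f"p={p:.2f}  CE={cross_entropy(p):.4f}  focal={focal_loss(p):.4f}")
```

With gamma = 0 the two are identical. With gamma > 0 the focal term shrinks much faster for confident predictions, so there is less pressure to push an already correct prediction from 0.9 toward 1.0.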
