Viewing a single comment thread. View all comments

No_Cryptographer9806 t1_j6nfqhq wrote

I am curious why do you want to do that? You can always post process the logits but forcing the Network to learn it will cause harm to the underlying representation imo

1

neuralbeans OP t1_j6nmccc wrote

It's for reinforcement learning to keep the model exploring possibilities.

1