Submitted by neuralbeans t3_10puvih in deeplearning
No_Cryptographer9806 t1_j6nfqhq wrote
I'm curious: why do you want to do that? You can always post-process the logits, but forcing the network to learn it will harm the underlying representation, imo.
neuralbeans OP t1_j6nmccc wrote
It's for reinforcement learning, to keep the model exploring possibilities.
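For anyone reading later, here is a minimal sketch of the post-processing route suggested above, assuming the goal is to keep every action's probability above some floor so the agent keeps exploring. The thread doesn't spell out the exact constraint, so the names `softmax`, `explore_probs`, `epsilon`, and `temperature` are hypothetical and only for illustration. It mixes the softmax output with a uniform distribution, which leaves the network's logits untouched:

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature > 1 flattens the distribution, < 1 sharpens it.
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def explore_probs(logits, epsilon=0.1, temperature=1.0):
    # Hypothetical helper: blend the policy with a uniform distribution
    # so each action keeps at least epsilon / n probability.
    probs = softmax(logits, temperature)
    uniform = 1.0 / len(probs)
    return [(1.0 - epsilon) * p + epsilon * uniform for p in probs]

# Even with very confident logits, no action drops below epsilon / 3.
probs = explore_probs([10.0, 0.0, -10.0], epsilon=0.1)
```

Because the mixing happens outside the network, the floor can be annealed (for example, by shrinking `epsilon` over training) without changing what the model has to learn.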