I am a bit confused. So overall, we want to make the generated response to be as close as possible to the ground truth. The paper adds a selection loss that distinguishes the generated response from the ground truth, which would make the generated response as different as possible from the ground truth. How could this help the main task of making these two responses as close as possible?

radi-cho OP t1_j99eb0v wrote on February 20, 2023 at 6:43 AM

#1,880,870

Replying to Cheap_Meeting (#1,873,650)

Thanks for the interest! You can follow me on Twitter: https://twitter.com/radi_cho

impossiblefork t1_j99edtf wrote on February 20, 2023 at 6:44 AM

#1,880,881

Replying to currentscurrents (#1,875,866)

That it's Bulgaria is probably why it's possible at all. Notice 'high school of mathematics'.

Some ex-Soviet/ex-Warsaw pact countries have functioning maths education.

radi-cho OP t1_j99fh5s wrote on February 20, 2023 at 6:57 AM

#1,880,985

Replying to walkingsparrow (#1,877,273)

About the intuition that it would produce responses further from the human ones (in fact, we see that for this variant, the BLEU is lower) - in a way, it could work as a regularization to produce more diverse responses and prevent some overfitting. That loss mostly affects the additional head's weights which are removed during inference, but we also multiply it by an optimal constant to be sure it doesn't affect the whole architecture too much. I've sent you a PM if you wish to receive some more details or empirical insights.

[deleted] t1_j9a56tc wrote on February 20, 2023 at 12:39 PM

#1,882,663

[removed]

walkingsparrow t1_j9b7j3d wrote on February 20, 2023 at 5:29 PM

#1,886,775

Replying to radi-cho (#1,880,985)

I think I understand now. Thanks for the explanation.

[R] [N] In this paper, we show how a conversational model, 3.5x smaller than SOTA, can be optimized to outperform the baselines through Auxiliary Learning. Published in the ACL Anthology: "Efficient Task-Oriented Dialogue Systems with Response Selection as an Auxiliary Task."

Comments

__lawless t1_j96ycry wrote on February 19, 2023 at 6:54 PM

radi-cho OP t1_j96ydyf wrote on February 19, 2023 at 6:55 PM

radi-cho OP t1_j96yj4l wrote on February 19, 2023 at 6:56 PM

Cheap_Meeting t1_j972cc6 wrote on February 19, 2023 at 7:22 PM

currentscurrents t1_j97v09x wrote on February 19, 2023 at 10:43 PM

walkingsparrow t1_j98c2qw wrote on February 20, 2023 at 12:53 AM

radi-cho OP t1_j99eb0v wrote on February 20, 2023 at 6:43 AM

impossiblefork t1_j99edtf wrote on February 20, 2023 at 6:44 AM

radi-cho OP t1_j99fh5s wrote on February 20, 2023 at 6:57 AM

[deleted] t1_j9a56tc wrote on February 20, 2023 at 12:39 PM

walkingsparrow t1_j9b7j3d wrote on February 20, 2023 at 5:29 PM