Submitted by Longjumping_Essay498 t3_1003d7w in MachineLearning
Can we use attention weights from causal models as explanations or causal attributions for next-word predictions?
fawkesdotbe t1_j2f9bcz wrote
Have fun: https://aclanthology.org/2022.acl-long.269/
> Adrien Bibal, Rémi Cardon, David Alfter, Rodrigo Wilkens, Xiaoou Wang, Thomas François, and Patrick Watrin. 2022. Is Attention Explanation? An Introduction to the Debate. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3889–3900, Dublin, Ireland. Association for Computational Linguistics.