TY - GEN
T1 - Are Shortest Rationales the Best Explanations for Human Understanding?
AU - Shen, Hua
AU - Wu, Tongshuang
AU - Guo, Wenbo
AU - Huang, Ting-Hao
N1 - Publisher Copyright:
© 2022 Association for Computational Linguistics.
PY - 2022
Y1 - 2022
AB - Existing self-explaining models typically favor extracting the shortest possible rationales (snippets of an input text “responsible for” the corresponding output) to explain the model prediction, on the assumption that shorter rationales are more intuitive to humans. However, this assumption has yet to be validated. Is the shortest rationale indeed the most human-understandable? To answer this question, we design a self-explaining model, LimitedInk, which allows users to extract rationales at any target length. Compared to existing baselines, LimitedInk achieves comparable end-task performance and human-annotated rationale agreement, making it a suitable representative of the recent class of self-explaining models. We use LimitedInk to conduct a user study on the impact of rationale length, in which we ask human judges to predict the sentiment labels of documents based only on LimitedInk-generated rationales of different lengths. We show that rationales that are too short do not help humans predict labels any better than randomly masked text, suggesting the need for more careful design of rationales for human understanding.
UR - http://www.scopus.com/inward/record.url?scp=85146120941&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85146120941&partnerID=8YFLogxK
U2 - 10.18653/v1/2022.acl-short.2
DO - 10.18653/v1/2022.acl-short.2
M3 - Conference contribution
AN - SCOPUS:85146120941
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 10
EP - 19
BT - ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Short Papers)
A2 - Muresan, Smaranda
A2 - Nakov, Preslav
A2 - Villavicencio, Aline
PB - Association for Computational Linguistics (ACL)
T2 - 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022
Y2 - 22 May 2022 through 27 May 2022
ER -