TY - JOUR
T1 - MACSUM
T2 - Controllable Summarization with Mixed Attributes
AU - Zhang, Yusen
AU - Liu, Yang
AU - Yang, Ziyi
AU - Fang, Yuwei
AU - Chen, Yulong
AU - Radev, Dragomir
AU - Zhu, Chenguang
AU - Zeng, Michael
AU - Zhang, Rui
N1 - Publisher Copyright:
© 2023 Association for Computational Linguistics.
PY - 2023
Y1 - 2023
N2 - Controllable summarization allows users to generate customized summaries with specified attributes. However, due to the lack of designated annotations of controlled summaries, existing work has to craft pseudo datasets by adapting generic summarization benchmarks. Furthermore, most research focuses on con¬trolling single attributes individually (e.g., a short summary or a highly abstractive summary) rather than controlling a mix of attributes together (e.g., a short and highly abstractive summary). In this paper, we propose MAC-SUM, the first human-annotated summariza¬tion dataset for controlling mixed attributes. It contains source texts from two domains, news articles and dialogues, with human-annotated summaries controlled by five designed at-tributes (Length, Extractiveness, Specificity, Topic, and Speaker). We propose two simple and effective parameter-efficient approaches for the new task of mixed controllable sum-marization based on hard prompt tuning and soft prefix tuning. Results and analysis demon¬strate that hard prompt models yield the best performance on most metrics and human eval¬uations. However, mixed-attribute control is still challenging for summarization tasks. Our dataset and code are available at https://github.com/psunlpgroup/MACSum.
AB - Controllable summarization allows users to generate customized summaries with specified attributes. However, due to the lack of designated annotations of controlled summaries, existing work has to craft pseudo datasets by adapting generic summarization benchmarks. Furthermore, most research focuses on con¬trolling single attributes individually (e.g., a short summary or a highly abstractive summary) rather than controlling a mix of attributes together (e.g., a short and highly abstractive summary). In this paper, we propose MAC-SUM, the first human-annotated summariza¬tion dataset for controlling mixed attributes. It contains source texts from two domains, news articles and dialogues, with human-annotated summaries controlled by five designed at-tributes (Length, Extractiveness, Specificity, Topic, and Speaker). We propose two simple and effective parameter-efficient approaches for the new task of mixed controllable sum-marization based on hard prompt tuning and soft prefix tuning. Results and analysis demon¬strate that hard prompt models yield the best performance on most metrics and human eval¬uations. However, mixed-attribute control is still challenging for summarization tasks. Our dataset and code are available at https://github.com/psunlpgroup/MACSum.
UR - http://www.scopus.com/inward/record.url?scp=85168929411&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85168929411&partnerID=8YFLogxK
U2 - 10.1162/tacl_a_00575
DO - 10.1162/tacl_a_00575
M3 - Article
AN - SCOPUS:85168929411
SN - 2307-387X
VL - 11
SP - 787
EP - 803
JO - Transactions of the Association for Computational Linguistics
JF - Transactions of the Association for Computational Linguistics
ER -