Abstract
People from different social and demographic groups express diverse perspectives and conflicting opinions on a broad set of topics such as product reviews, healthcare, law, and politics. A fair summary should provide a comprehensive coverage of diverse perspectives without underrepresenting certain groups. However, current work in summarization metrics and Large Language Models (LLMs) evaluation has not explored fair abstractive summarization. In this paper, we systematically investigate fair abstractive summarization for user-generated data. We first formally define fairness in abstractive summarization as not underrepresenting perspectives of any groups of people, and we propose four reference-free automatic metrics by measuring the differences between target and source perspectives. We evaluate nine LLMs, including three GPT models, four LLaMA models, PaLM 2, and Claude, on six datasets collected from social media, online reviews, and recorded transcripts. Experiments show that both the model-generated and the human-written reference summaries suffer from low fairness. We conduct a comprehensive analysis of the common factors influencing fairness and propose three simple but effective methods to alleviate unfair summarization.
| Original language | English (US) |
|---|---|
| Title of host publication | Long Papers |
| Editors | Kevin Duh, Helena Gomez, Steven Bethard |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 3404-3426 |
| Number of pages | 23 |
| ISBN (Electronic) | 9798891761148 |
| DOIs | |
| State | Published - 2024 |
| Event | 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 - Hybrid, Mexico City, Mexico Duration: Jun 16 2024 → Jun 21 2024 |
Publication series
| Name | Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 |
|---|---|
| Volume | 1 |
Conference
| Conference | 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 |
|---|---|
| Country/Territory | Mexico |
| City | Hybrid, Mexico City |
| Period | 6/16/24 → 6/21/24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
All Science Journal Classification (ASJC) codes
- Computer Networks and Communications
- Hardware and Architecture
- Information Systems
- Software
Fingerprint
Dive into the research topics of 'Fair Abstractive Summarization of Diverse Perspectives'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver