Abstract
Current summarization systems yield generic summaries that are disconnected from users' preferences and expectations. To address this limitation, we present CTRLSUM, a generic framework to control generated summaries through a set of keywords. During training keywords are extracted automatically without requiring additional human annotations. At test time CTRLSUM features a control function to map control signal to keywords; through engineering the control function, the same trained model is able to be applied to control summaries on various dimensions, while neither affecting the model training process nor the pretrained models. We additionally explore the combination of keywords and text prompts for more control tasks. Experiments demonstrate the effectiveness of CTRLSUM on three domains of summarization datasets and five control tasks: (1) entity-centric and (2) length-controllable summarization, (3) contribution summarization on scientific papers, (4) invention purpose summarization on patent filings, and (5) question-guided summarization on news articles. Moreover, when used in a standard, unconstrained summarization setting, CTRLSUM is comparable or better than strong pretrained systems.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 |
| Editors | Yoav Goldberg, Zornitsa Kozareva, Yue Zhang |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 5879-5915 |
| Number of pages | 37 |
| ISBN (Electronic) | 9781959429401 |
| DOIs | |
| Publication status | Published - 2022 |
| Externally published | Yes |
| Event | 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 - Hybrid, Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 11 Dec 2022 |
Publication series
| Name | Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 |
|---|
Conference
| Conference | 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 |
|---|---|
| Country/Territory | United Arab Emirates |
| City | Hybrid, Abu Dhabi |
| Period | 7/12/22 → 11/12/22 |
Bibliographical note
Publisher Copyright:© 2022 Association for Computational Linguistics.