Abstract
While reinforcement learning can effectively improve language generation models, it often suffers from generating incoherent and repetitive phrases [1]. In this paper, we propose a novel repetition normalized adversarial reward to mitigate these problems. Our repetition penalized reward can greatly reduce the repetition rate and adversarial training mitigates generating incoherent phrases. Our model significantly outperforms the baseline model on ROUGE-1 (+3.24), ROUGE-L (+2.25), and a decreased repetition-rate (-4.98%).
| Original language | English |
|---|---|
| Title of host publication | 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 7325-7329 |
| Number of pages | 5 |
| ISBN (Electronic) | 9781479981311 |
| DOIs | |
| Publication status | Published - May 2019 |
| Event | 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Brighton, United Kingdom Duration: 12 May 2019 → 17 May 2019 |
Publication series
| Name | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
|---|---|
| Volume | 2019-May |
| ISSN (Print) | 1520-6149 |
Conference
| Conference | 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 |
|---|---|
| Country/Territory | United Kingdom |
| City | Brighton |
| Period | 12/05/19 → 17/05/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- adversarial training
- headline generation
- reinforcement learning
- summarization
Fingerprint
Dive into the research topics of 'A Novel Repetition Normalized Adversarial Reward for Headline Generation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver