Abstract
Distant supervision is an efficient way to generate large-scale training data for relation extraction without human efforts. However, a coin has two sides. The automatically annotated labels for training data are problematic, which can be summarized as multi-instance multi-label problem and coarse-grained (bag-level) supervised signal. To address these problems, we propose two reasonable assumptions and craft reinforcement learning to capture the expressive sentence for each relation mentioned in a bag. More specifically, we extend the original expressed-at-least-once assumption to multi-label level, and introduce a novel express-at-most-one assumption. Besides, we design a fine-grained reward function, and model the sentence selection process as an auction where different relations for a bag need to compete together to achieve the possession of a specific sentence based on its expressiveness. In this way, our model can be dynamically self-adapted, and eventually implements the accurate one-to-one mapping from a relation label to its chosen expressive sentence, which serves as training instances for the extractor. The experimental results on a public dataset demonstrate that our model constantly and substantially outperforms current state-of-the-art methods for relation extraction.
| Original language | English |
|---|---|
| Title of host publication | CIKM 2019 - Proceedings of the 28th ACM International Conference on Information and Knowledge Management |
| Publisher | Association for Computing Machinery |
| Pages | 659-668 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781450369763 |
| DOIs | |
| Publication status | Published - 3 Nov 2019 |
| Externally published | Yes |
| Event | 28th ACM International Conference on Information and Knowledge Management, CIKM 2019 - Beijing, China Duration: 3 Nov 2019 → 7 Nov 2019 |
Publication series
| Name | International Conference on Information and Knowledge Management, Proceedings |
|---|
Conference
| Conference | 28th ACM International Conference on Information and Knowledge Management, CIKM 2019 |
|---|---|
| Country/Territory | China |
| City | Beijing |
| Period | 3/11/19 → 7/11/19 |
Bibliographical note
Publisher Copyright:© 2019 Association for Computing Machinery.
Keywords
- Coarse-grained supervised signal
- Distant supervision
- Multi-instance multi-label
- Reinforcement learning
- Relation extraction