Abstract
An important aspect of natural language understanding involves recognizing and categorizing events and the relations among them. However, these tasks are quite subtle and annotating training data for machine learning based approaches is an expensive task, resulting in supervised systems that attempt to learn complex models from small amounts of data, which they over-fit. This paper addresses this challenge by developing an event detection and co-reference system with minimal supervision, in the form of a few event examples. We view these tasks as semantic similarity problems between event mentions or event mentions and an ontology of types, thus facilitating the use of large amounts of out of domain text data. Notably, our semantic relatedness function exploits the structure of the text by making use of a semantic-role-labeling based representation of an event. We show that our approach to event detection is competitive with the top supervised methods. More significantly, we outperform state-of-the-art supervised methods for event coreference on benchmark data sets, and support significantly better transfer across domains.
| Original language | English |
|---|---|
| Title of host publication | EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 392-402 |
| Number of pages | 11 |
| ISBN (Electronic) | 9781945626258 |
| DOIs | |
| Publication status | Published - 2016 |
| Event | 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016 - Austin, United States Duration: 1 Nov 2016 → 5 Nov 2016 |
Publication series
| Name | EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings |
|---|
Conference
| Conference | 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016 |
|---|---|
| Country/Territory | United States |
| City | Austin |
| Period | 1/11/16 → 5/11/16 |
Bibliographical note
Publisher Copyright:© 2016 Association for Computational Linguistics