Abstract
We tackle the challenge of learning part-of-speech classified translations as part of an inversion transduction grammar, by learning translations for English words with known part-of-speech tags, both from existing translation lexica and from parallel corpora. When translating from a low resource language into English, we can expect to have rich resources for English, such as treebanks, and small amounts of bilingual resources, such as translation lexica and parallel corpora. We solve the problem of integrating these heterogeneous resources into a single model using stochastic Inversion Transduction Grammars, which we augment with wildcards to handle unknown translations.
| Original language | English |
|---|---|
| Pages | 55-64 |
| DOIs | |
| Publication status | Published - 2016 |
| Event | Proceedings of the Workshop on Multilingual and Cross-lingual Methods in NLP - Duration: 1 Jan 2016 → 1 Jan 2016 |
Conference
| Conference | Proceedings of the Workshop on Multilingual and Cross-lingual Methods in NLP |
|---|---|
| Period | 1/01/16 → 1/01/16 |
ISBNs
['9781941643877']Fingerprint
Dive into the research topics of 'Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver