Skip to main navigation Skip to search Skip to main content

Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages

Research output: Contribution to conferenceConference Paperpeer-review

Abstract

We tackle the challenge of learning part-of-speech classified translations as part of an inversion transduction grammar, by learning translations for English words with known part-of-speech tags, both from existing translation lexica and from parallel corpora. When translating from a low resource language into English, we can expect to have rich resources for English, such as treebanks, and small amounts of bilingual resources, such as translation lexica and parallel corpora. We solve the problem of integrating these heterogeneous resources into a single model using stochastic Inversion Transduction Grammars, which we augment with wildcards to handle unknown translations.
Original languageEnglish
Pages55-64
DOIs
Publication statusPublished - 2016
EventProceedings of the Workshop on Multilingual and Cross­-lingual Methods in NLP -
Duration: 1 Jan 20161 Jan 2016

Conference

ConferenceProceedings of the Workshop on Multilingual and Cross­-lingual Methods in NLP
Period1/01/161/01/16

ISBNs

['9781941643877']

Fingerprint

Dive into the research topics of 'Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages'. Together they form a unique fingerprint.

Cite this