Aligning a parallel English-Chinese corpus statistically with lexical criteria

Dekai Wu*

*Corresponding author for this work

Research output: Contribution to journal › Conference article published in journal › peer-review

122 Citations (Scopus)

Abstract

We describe our experience with automatic alignment of sentences in parallel English-Chinese texts. Our report concerns three related topics: (1) progress on the HKUST English-Chinese Parallel Bilingual Corpus; (2) experiments addressing the applicability of Gale & Church's (1991) length-based statistical method to the task of alignment involving a non-Indo-European language; and (3) an improved statistical method that also incorporates domain-specific lexical cues.

Original language: English
Pages (from-to): 80-87
Number of pages: 8
Journal: Proceedings of the Annual Meeting of the Association for Computational Linguistics
Volume: 1994-June
Publication status: Published - 1994
Event: 32nd Annual Meeting of the Association for Computational Linguistics, ACL 1994 - Las Cruces, United States
Duration: 27 Jun 1994 → 30 Jun 1994

Bibliographical note

Publisher Copyright: © 1994, Association for Computational Linguistics.
