Efficient Language Independent Generation from Lexical Conceptual Structures

Loading...
Thumbnail Image

Files

CS-TR-4262.ps (3.97 MB)
No. of downloads: 215
CS-TR-4262.pdf (423.85 KB)
No. of downloads: 668

Publication or External Link

Date

2001-09-05

Advisor

Citation

DRUM DOI

Abstract

This paper describes a system for generating natural-language sentences from an interlingual representation, Lexical Conceptual
Structure (LCS). The system has been developed as part of a Chinese-English Machine Translation system; however, it is designed to be used for many other MT language pairs and Natural Language applications. The contributions of this work include: (1) Development of a language-independent generation system that maximizes efficiency through the use of a hybrid rule-based/statistical module; (2)
Enhancements to an interlingual representation and associated algorithms for interpretation of multiply ambiguous input sentences;(3) Development of an efficient reusable language-independent linearization module with a grammar description language that can be used with other systems; (4) Improvements to an earlier algorithm for hierarchically mapping thematic roles to surface positions; (5) Development of a diagnostic tool for lexicon coverage and correctness and use of the tool for verification of English, Spanish, and Chinese lexicons. An evaluation of translation quality shows comparable performance with a commercial translation system. The generation system can also be straightforwardly extended to other languages and this is demonstrated and evaluated for Spanish. Cross-referenced as UMIACS-TR-2001-43

Notes

Rights