Efficient Language Independent Generation from Lexical Conceptual Structures
dc.contributor.author | Habash, Nizar | en_US |
dc.contributor.author | Dorr, Bonnie | en_US |
dc.contributor.author | Traum, David | en_US |
dc.date.accessioned | 2004-05-31T23:11:26Z | |
dc.date.available | 2004-05-31T23:11:26Z | |
dc.date.created | 2001-09 | en_US |
dc.date.issued | 2001-09-05 | en_US |
dc.description.abstract | This paper describes a system for generating natural-language sentences from an interlingual representation, Lexical Conceptual Structure (LCS). The system has been developed as part of a Chinese-English Machine Translation system; however, it is designed to be used for many other MT language pairs and Natural Language applications. The contributions of this work include: (1) Development of a language-independent generation system that maximizes efficiency through the use of a hybrid rule-based/statistical module; (2) Enhancements to an interlingual representation and associated algorithms for interpretation of multiply ambiguous input sentences;(3) Development of an efficient reusable language-independent linearization module with a grammar description language that can be used with other systems; (4) Improvements to an earlier algorithm for hierarchically mapping thematic roles to surface positions; (5) Development of a diagnostic tool for lexicon coverage and correctness and use of the tool for verification of English, Spanish, and Chinese lexicons. An evaluation of translation quality shows comparable performance with a commercial translation system. The generation system can also be straightforwardly extended to other languages and this is demonstrated and evaluated for Spanish. Cross-referenced as UMIACS-TR-2001-43 | en_US |
dc.format.extent | 4164578 bytes | |
dc.format.mimetype | application/postscript | |
dc.identifier.uri | http://hdl.handle.net/1903/1140 | |
dc.language.iso | en_US | |
dc.relation.isAvailableAt | Digital Repository at the University of Maryland | en_US |
dc.relation.isAvailableAt | University of Maryland (College Park, Md.) | en_US |
dc.relation.isAvailableAt | Tech Reports in Computer Science and Engineering | en_US |
dc.relation.isAvailableAt | UMIACS Technical Reports | en_US |
dc.relation.ispartofseries | UM Computer Science Department; CS-TR-4262 | en_US |
dc.relation.ispartofseries | UMIACS; UMIACS-TR-2001-43 | en_US |
dc.title | Efficient Language Independent Generation from Lexical Conceptual Structures | en_US |
dc.type | Technical Report | en_US |