Efficient Language Independent Generation from Lexical Conceptual Structures

dc.contributor.authorHabash, Nizaren_US
dc.contributor.authorDorr, Bonnieen_US
dc.contributor.authorTraum, Daviden_US
dc.date.accessioned2004-05-31T23:11:26Z
dc.date.available2004-05-31T23:11:26Z
dc.date.created2001-09en_US
dc.date.issued2001-09-05en_US
dc.description.abstractThis paper describes a system for generating natural-language sentences from an interlingual representation, Lexical Conceptual Structure (LCS). The system has been developed as part of a Chinese-English Machine Translation system; however, it is designed to be used for many other MT language pairs and Natural Language applications. The contributions of this work include: (1) Development of a language-independent generation system that maximizes efficiency through the use of a hybrid rule-based/statistical module; (2) Enhancements to an interlingual representation and associated algorithms for interpretation of multiply ambiguous input sentences;(3) Development of an efficient reusable language-independent linearization module with a grammar description language that can be used with other systems; (4) Improvements to an earlier algorithm for hierarchically mapping thematic roles to surface positions; (5) Development of a diagnostic tool for lexicon coverage and correctness and use of the tool for verification of English, Spanish, and Chinese lexicons. An evaluation of translation quality shows comparable performance with a commercial translation system. The generation system can also be straightforwardly extended to other languages and this is demonstrated and evaluated for Spanish. Cross-referenced as UMIACS-TR-2001-43en_US
dc.format.extent4164578 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/1140
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-4262en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-2001-43en_US
dc.titleEfficient Language Independent Generation from Lexical Conceptual Structuresen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-4262.ps
Size:
3.97 MB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-4262.pdf
Size:
423.85 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-4262.ps