Domain Tuning of Bilingual Lexicons for MT
dc.contributor.author | Ayan, Necip Fazil | en_US |
dc.contributor.author | Dorr, Bonnie | en_US |
dc.contributor.author | Kolak, Okan | en_US |
dc.date.accessioned | 2004-05-31T23:25:57Z | |
dc.date.available | 2004-05-31T23:25:57Z | |
dc.date.created | 2003-02 | en_US |
dc.date.issued | 2003-02-27 | en_US |
dc.description.abstract | Our overall objective is to translate a domain-specific document in a foreign language (in this case, Chinese) to English. Using automatically induced domain-specific, comparable documents and language-independent clustering, we apply domain-tuning techniques to a bilingual lexicon for downstream translation of the input document to English. We will describe our domain-tuning technique and demonstrate its effectiveness by comparing our results to manually constructed domain-specific vocabulary. Our coverage/accuracy experiments indicate that domain-tuned lexicons achieve 88% precision and 66% recall. We also ran a Bleu experiment to compare our domain-tuned version to its un-tuned counterpart in an IBM-style MT system. Our domain-tuned lexicons brought about an improvement in the Bleu scores: 9.4% higher than a system trained on a uniformly-weighted dictionary and 275% higher than a system trained on no dictionary at all. UMIACS-TR-2003-19 LAMP-TR-096 | en_US |
dc.format.extent | 105161 bytes | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/1903/1262 | |
dc.language.iso | en_US | |
dc.relation.isAvailableAt | Digital Repository at the University of Maryland | en_US |
dc.relation.isAvailableAt | University of Maryland (College Park, Md.) | en_US |
dc.relation.isAvailableAt | Tech Reports in Computer Science and Engineering | en_US |
dc.relation.isAvailableAt | UMIACS Technical Reports | en_US |
dc.relation.ispartofseries | UM Computer Science Department; CS-TR-4449 | en_US |
dc.relation.ispartofseries | UMIACS; UMIACS-TR-2003-19 | en_US |
dc.relation.ispartofseries | LAMP-TR-096 | en_US |
dc.title | Domain Tuning of Bilingual Lexicons for MT | en_US |
dc.type | Technical Report | en_US |
Files
Original bundle
1 - 1 of 1