A Statistical Word-Level Translation Model for Comparable Corpora
dc.contributor.author | Diab, Mona | en_US |
dc.contributor.author | Finch, Steve | en_US |
dc.date.accessioned | 2004-05-31T23:04:55Z | |
dc.date.available | 2004-05-31T23:04:55Z | |
dc.date.created | 2000-06 | en_US |
dc.date.issued | 2000-06-17 | en_US |
dc.description.abstract | In this paper, we present a model of statistical word-level mapping for comparable corpora. The approach is based on the assumption that if two terms have close distributional profiles, their corresponding translations' distributional profiles should be close in a comparable corpus. The proposed model is described. A preliminary investigation on intralanguage comparable corpora is laid out. The preliminary results are >92% accurate, suggesting the feasibility of the model. The model needs to undergo some improvements and should be tested cross-linguistically before assessing its significance. (Also cross-referenced as UMIACS-TR-2000-41, LAMP-TR-048) | en_US |
dc.format.extent | 686807 bytes | |
dc.format.mimetype | application/postscript | |
dc.identifier.uri | http://hdl.handle.net/1903/1081 | |
dc.language.iso | en_US | |
dc.relation.isAvailableAt | Digital Repository at the University of Maryland | en_US |
dc.relation.isAvailableAt | University of Maryland (College Park, Md.) | en_US |
dc.relation.isAvailableAt | Tech Reports in Computer Science and Engineering | en_US |
dc.relation.isAvailableAt | UMIACS Technical Reports | en_US |
dc.relation.ispartofseries | UM Computer Science Department; CS-TR-4150 | en_US |
dc.relation.ispartofseries | UMIACS; UMIACS-TR-2000-41 | en_US |
dc.relation.ispartofseries | LAMP-TR-048 | en_US |
dc.title | A Statistical Word-Level Translation Model for Comparable Corpora | en_US |
dc.type | Technical Report | en_US |