Please use this identifier to cite or link to this item:
http://hdl.handle.net/1903/1081
| Title: | A Statistical Word-Level Translation Model for Comparable Corpora |
| Authors: | Diab, Mona; Finch, Steve |
| Type: | Technical Report |
| Issue Date: | 17-Jun-2000 |
| Series/Report no.: | UM Computer Science Department: CS-TR-4150; UMIACS: UMIACS-TR-2000-41; LAMP-TR-048 |
| Abstract: | In this paper, we present a model of statistical word-level mapping for comparable corpora. The approach is based on the assumption that if two terms have close distributional profiles, the distributional profiles of their corresponding translations should also be close in a comparable corpus. We describe the proposed model and lay out a preliminary investigation on intralanguage comparable corpora. The preliminary results are >92% accurate, suggesting that the model is feasible. The model still needs some improvements and should be tested cross-linguistically before its significance can be assessed.
(Also cross-referenced as UMIACS-TR-2000-41, LAMP-TR-048) |
| URI: | http://hdl.handle.net/1903/1081 |
| Appears in Collections: | Technical Reports of the Computer Science Department; Technical Reports from UMIACS
|
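The abstract's core assumption concerns "distributional profiles" of terms. The abstract does not say how the report computes or compares these profiles, so the following is only a minimal sketch under stated assumptions: a profile is taken to be a count of words co-occurring within a fixed window, and closeness is measured with cosine similarity. The function names `profile` and `cosine`, the window size, and the toy corpus are all hypothetical and are not taken from the report.

```python
from collections import Counter
from math import sqrt


def profile(tokens, term, window=2):
    """Count the words that occur within `window` positions of `term`.

    Assumption: a window-based co-occurrence count stands in for the
    report's (unspecified) notion of a distributional profile.
    """
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != term:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        # Add every neighbor in the window except the term occurrence itself.
        counts.update(t for j, t in enumerate(tokens[lo:hi], start=lo) if j != i)
    return counts


def cosine(p, q):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(p[w] * q[w] for w in p.keys() & q.keys())
    norm = sqrt(sum(v * v for v in p.values())) * sqrt(sum(v * v for v in q.values()))
    return dot / norm if norm else 0.0


# Toy corpus, invented purely for illustration.
corpus = "the doctor treated the patient and the nurse treated the patient".split()
sim = cosine(profile(corpus, "doctor"), profile(corpus, "nurse"))
```

Under the abstract's assumption, two terms that score as close in one corpus would be expected to have translations that also score as close when the same measure is applied to a comparable corpus in the other language.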
All items in DRUM are protected by copyright, with all rights reserved.