A Statistical Word-Level Translation Model for Comparable Corpora

dc.contributor.authorDiab, Monaen_US
dc.contributor.authorFinch, Steveen_US
dc.date.accessioned2004-05-31T23:04:55Z
dc.date.available2004-05-31T23:04:55Z
dc.date.created2000-06en_US
dc.date.issued2000-06-17en_US
dc.description.abstractIn this paper, we present a model of statistical word-level mapping for comparable corpora. The approach is based on the assumption that if two terms have close distributional profiles, their corresponding translations' distributional profiles should be close in a comparable corpus. The proposed model is described. A preliminary investigation on intralanguage comparable corpora is laid out. The preliminary results are >92% accurate, suggesting the feasibility of the model. The model needs to undergo some improvements and should be tested cross linguistically before assessing its significance. (Also cross-referenced as UMIACS-TR-2000-41, LAMP-TR-048)en_US
dc.format.extent686807 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/1081
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-4150en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-2000-41en_US
dc.relation.ispartofseriesLAMP-TR-048en_US
dc.titleA Statistical Word-Level Translation Model for Comparable Corporaen_US
dc.typeTechnical Reporten_US

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
CS-TR-4150.ps
Size:
670.71 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-4150.pdf
Size:
89.43 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-4150.ps