University of Maryland DRUM  
University of Maryland Digital Repository at the University of Maryland

DRUM >
College of Computer, Mathematical & Natural Sciences >
Computer Science >
Technical Reports from UMIACS >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1903/1081

Title: A Statistical Word-Level Translation Model for Comparable Corpora
Authors: Diab, Mona
Finch, Steve
Type: Technical Report
Issue Date: 17-Jun-2000
Series/Report no.: UM Computer Science Department; CS-TR-4150
UMIACS; UMIACS-TR-2000-41
LAMP-TR-048
Abstract: In this paper, we present a model of statistical word-level mapping for comparable corpora. The approach is based on the assumption that if two terms have close distributional profiles, their corresponding translations' distributional profiles should be close in a comparable corpus. The proposed model is described. A preliminary investigation on intralanguage comparable corpora is laid out. The preliminary results are >92% accurate, suggesting the feasibility of the model. The model needs to undergo some improvements and should be tested cross linguistically before assessing its significance. (Also cross-referenced as UMIACS-TR-2000-41, LAMP-TR-048)
URI: http://hdl.handle.net/1903/1081
Appears in Collections:Technical Reports of the Computer Science Department
Technical Reports from UMIACS

Files in This Item:

File Description SizeFormatNo. of Downloads
CS-TR-4150.pdfAuto-generated copy of CS-TR-4150.ps89.43 kBAdobe PDF411View/Open
CS-TR-4150.ps670.71 kBPostscript155View/Open

All items in DRUM are protected by copyright, with all rights reserved.

 

DRUM is brought to you by the University of Maryland Libraries
University of Maryland, College Park, MD 20742-7011 (301)314-1328.
Please send us your comments. -
All Contents