A Comparative Study of Knowledge-Based Approaches for Cross-Language Information Retrieval

dc.contributor.author: Oard, Douglas W. (en_US)
dc.contributor.author: Dorr, Bonnie J. (en_US)
dc.contributor.author: Hackett, Paul G. (en_US)
dc.contributor.author: Katsova, Maria (en_US)
dc.date.accessioned: 2004-05-31T22:51:24Z
dc.date.available: 2004-05-31T22:51:24Z
dc.date.created: 1998-04 (en_US)
dc.date.issued: 1998-10-15 (en_US)
dc.description.abstract: Cross-language retrieval systems seek to use queries in one natural language to guide the retrieval of documents that might be written in another. Acquisition and representation of translation knowledge plays a central role in this process. This paper explores the utility of two sources of manually encoded translation knowledge, bilingual dictionaries and translation lexicons, for cross-language retrieval. We have implemented six query translation techniques that use bilingual dictionaries, one based on lexical-semantic analysis, and one based on direct use of the translation output from an existing machine translation system; these are compared with a document translation technique that uses output from the same existing translation system. Average precision measures on portions of the TREC collection suggest that arbitrarily selecting a single translation from a bilingual dictionary is typically no less effective than using every translation in the dictionary, that query translation using an existing machine translation system can achieve somewhat better effectiveness than simple dictionary-based techniques, and that performing document translation rather than query translation may result in further improvements in retrieval effectiveness under some conditions. (Also cross-referenced as UMIACS-TR-98-27) (en_US)
dc.format.extent: 389185 bytes
dc.format.mimetype: application/postscript
dc.identifier.uri: http://hdl.handle.net/1903/952
dc.language.iso: en_US
dc.relation.isAvailableAt: Digital Repository at the University of Maryland (en_US)
dc.relation.isAvailableAt: University of Maryland (College Park, Md.) (en_US)
dc.relation.isAvailableAt: Tech Reports in Computer Science and Engineering (en_US)
dc.relation.isAvailableAt: UMIACS Technical Reports (en_US)
dc.relation.ispartofseries: UM Computer Science Department; CS-TR-3897 (en_US)
dc.relation.ispartofseries: UMIACS; UMIACS-TR-98-27 (en_US)
dc.title: A Comparative Study of Knowledge-Based Approaches for Cross-Language Information Retrieval (en_US)
dc.type: Technical Report (en_US)

Files

Original bundle

Name: CS-TR-3897.ps
Size: 380.06 KB
Format: Postscript Files

Name: CS-TR-3897.pdf
Size: 195.42 KB
Format: Adobe Portable Document Format
Description: Auto-generated copy of CS-TR-3897.ps