CLIR Experiments at Maryland for TREC-2002: Evidence combination for Arabic-English retrieval

Loading...
Thumbnail Image

Files

CS-TR-4456.pdf (82.91 KB)
No. of downloads: 661

Publication or External Link

Date

2003-04-04

Advisor

Citation

DRUM DOI

Abstract

The focus of the experiments reported in this paper was techniques for combining evidence for cross-language retrieval, searching Arabic documents using English queries. Evidence from multiple sources of translation knowledge was combined to estimate translation probabilities, and four techniques for estimating query-language term weights from document-language evidence were tried. A new technique that exploits translation probability information was found to outperform a comparable technique in which that information was not used. Comparative results for three variants of Arabic ^\light^] stemming are also presented. A simple variant of an existing stemming algorithm was found to result in significantly better retrieval effectiveness. UMIACS-TR-2003-26 LAMP-TR-101

Notes

Rights