Probabilistic Structured Query Methods

Thumbnail Image
Files
CS-TR-4457.pdf(155.75 KB)
No. of downloads: 627
Publication or External Link
Date
2003-04-04
Authors
Darwish, Kareem
Oard, Douglas W.
Advisor
Citation
DRUM DOI
Abstract
Structured methods for query term replacement rely on separate estimates of term frequency and document frequency to compute the weight for each query term. This paper reviews prior work on structured query techniques and introduces three new variants that leverage estimates of replacement probabilities. Statistically significant improvements in retrieval effectiveness are demonstrated for cross-language retrieval and for retrieval based on optical character recognition when replacement probabilities are used to estimate both term frequency and document frequency. UMIACS-TR-2003-27 LAMP-TR-102
Notes
Rights