Matching Meaning for Cross-Language Information Retrieval

dc.contributor.advisorOard, Douglas Wen_US
dc.contributor.authorWang, Jianqiangen_US
dc.contributor.departmentLibrary & Information Servicesen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2006-02-04T07:39:08Z
dc.date.available2006-02-04T07:39:08Z
dc.date.issued2005-12-06en_US
dc.description.abstractCross-language information retrieval concerns the problem of finding information in one language in response to search requests expressed in another language. The explosive growth of the World Wide Web, with access to information in many languages, has provided a substantial impetus for research on this important problem. In recent years, significant advances in cross-language retrieval effectiveness have resulted from the application of statistical techniques to estimate accurate translation probabilities for individual terms from automated analysis of human-prepared translations. With few exceptions, however, those results have been obtained by applying evidence about the meaning of terms to translation in one direction at a time (e.g., by translating the queries into the document language). This dissertation introduces a more general framework for the use of translation probability in cross-language information retrieval based on the notion that information retrieval is dependent fundamentally upon matching what the searcher means with what the document author meant. The perspective yields a simple computational formulation that provides a natural way of combining what have been known traditionally as query and document translation. When combined with the use of synonym sets as a computational model of meaning, cross-language search results are obtained using English queries that approximate a strong monolingual baseline for both French and Chinese documents. Two well-known techniques (structured queries and probabilistic structured queries) are also shown to be a special case of this model under restrictive assumptions.en_US
dc.format.extent778559 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/1903/3212
dc.language.isoen_US
dc.subject.pqcontrolledInformation Scienceen_US
dc.subject.pqcontrolledComputer Scienceen_US
dc.subject.pqcontrolledLibrary Scienceen_US
dc.subject.pquncontrolledinformation retrievalen_US
dc.subject.pquncontrolledcross-language information retrievalen_US
dc.subject.pquncontrolledinformation retrieval evaluationen_US
dc.titleMatching Meaning for Cross-Language Information Retrievalen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
umi-umd-3035.pdf
Size:
760.31 KB
Format:
Adobe Portable Document Format