Structured Translation for Cross-Language Information Retrieval

dc.contributor.authorSperer, Ruthen_US
dc.contributor.authorOard, Douglas W.en_US
dc.date.accessioned2004-05-31T23:05:24Z
dc.date.available2004-05-31T23:05:24Z
dc.date.created2000-06en_US
dc.date.issued2000-06-21en_US
dc.description.abstractThe paper introduces a query translation model that reflects the structure of the cross-language information retrieval task. The model is based on a structured bilingual dictionary in which the translations of each term are clustered into groups with distinct meanings. Query translation is modeled as a two-stage process, with the system first determining the intended meaning of a query term and then selecting translations appropriate to that meaning that might appear in the document collection. An implementation of structured translation based on automatic dictionary clustering is described and evaluated by using Chinese queries to retrieve English documents. Structured translation achieved an average precision that was statistically indistinguishable from Pirkola's technique for very short queries, but Pirkola's technique outperformed structured translation on long queries. The paper concludes with some observations on future work to improve retrieval effectiveness and on other potential uses of structured translation in interactive cross-language retrieval applications. (Also cross-referenced as UMIACS-TR-2000-45, LAMP-TR-052)en_US
dc.format.extent9142936 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/1085
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-4154en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-2000-45en_US
dc.relation.ispartofseriesLAMP-TR-052en_US
dc.titleStructured Translation for Cross-Language Information Retrievalen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-4154.ps
Size:
8.72 MB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-4154.pdf
Size:
153.72 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-4154.ps