Improving Information Retrieval Systems using Part of Speech Tagging

dc.contributor.author: Chowdhury, Abdur [en_US]
dc.contributor.author: McCabe, M. Catherine [en_US]
dc.contributor.department: ISR [en_US]
dc.date.accessioned: 2007-05-23T10:05:55Z
dc.date.available: 2007-05-23T10:05:55Z
dc.date.issued: 1998 [en_US]
dc.description.abstract: The object of Information Retrieval is to retrieve all relevant documents for a user query and only those relevant documents. Much research has focused on achieving this objective with little regard for storage overhead or performance. In this paper we evaluate the use of part-of-speech tagging to improve the index storage overhead and general speed of the system with only a minimal reduction in precision/recall measurements. We tagged 500 MB of the Los Angeles Times 1990 and 1989 document collection provided by TREC for parts of speech. We then experimented to find the most relevant part of speech to index. We show that 90 percent of precision/recall is achieved with 40 percent of the document collection's terms. We also show that this is an improvement in overhead with only a 1 percent reduction in precision/recall. [en_US]
dc.format.extent: 108887 bytes
dc.format.mimetype: application/pdf
dc.identifier.uri: http://hdl.handle.net/1903/5958
dc.language.iso: en_US [en_US]
dc.relation.ispartofseries: ISR; TR 1998-48 [en_US]
dc.subject: information retrieval [en_US]
dc.subject: parts of speech tagging [en_US]
dc.subject: improved IR storage [en_US]
dc.title: Improving Information Retrieval Systems using Part of Speech Tagging [en_US]
dc.type: Technical Report [en_US]

Files

Original bundle
Name: TR_98-48.pdf
Size: 106.33 KB
Format: Adobe Portable Document Format