Mapping Lexical Entries in a Verbs Database to WordNet Senses

dc.contributor.authorGreen, Rebeccaen_US
dc.contributor.authorPearl, Lisaen_US
dc.contributor.authorDorr, Bonnie J.en_US
dc.contributor.authorResnik, Philipen_US
dc.date.accessioned2004-05-31T23:09:53Z
dc.date.available2004-05-31T23:09:53Z
dc.date.created2001-04en_US
dc.date.issued2001-05-10en_US
dc.description.abstractThis paper describes automatic techniques for mapping 9611 entries in a database of English verbs to WordNet senses. The verbs were initially grouped into 491 classes based on syntactic categories. Mapping these classified verbs into WordNet senses provides a resource that may be used for disambiguation in multilingual applications such as machine translation and cross-language information retrieval. Our techniques make use of (1) a training set of 1791 disambiguated entries, representing 1442 verb entries from 167 of the categories; (2) word sense probabilities based on frequency counts in a previously tagged corpus; (3) semantic similarity of WordNet senses for verbs within the same class; (4) probabilistic correlations between WordNet data and attributes of the verb classes. The best results achieved 72% precision and 58% recall, versus a lower bound of 62% precision and 38% recall for assigning the most frequently occurring WordNet sense, and an upper bound of 87% precision and 75% recall for human judgment. (Cross-referenced as UMIACS-TR-2001-18) (Cross-referenced as LAMP-TR-068)en_US
dc.format.extent148843 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/1126
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-4230en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-2001-18en_US
dc.relation.ispartofseriesLAMP-TR-068en_US
dc.titleMapping Lexical Entries in a Verbs Database to WordNet Sensesen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-4230.ps
Size:
145.35 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-4230.pdf
Size:
173.59 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-4230.ps