Mapping Lexical Entries in a Verbs Database to WordNet Senses
Files
Publication or External Link
External Link to Data Files
Date
Advisor
Citation
DRUM DOI
Abstract
This paper describes automatic techniques for mapping 9611
entries in a database of English verbs to WordNet senses.
The verbs were initially grouped into 491 classes based on
syntactic categories. Mapping these classified verbs into
WordNet senses provides a resource that may be used for
disambiguation in multilingual applications such as machine
translation and cross-language information retrieval. Our
techniques make use of (1) a training set of 1791 disambiguated
entries, representing 1442 verb entries from 167 of the
categories; (2) word sense probabilities based on frequency
counts in a previously tagged corpus; (3) semantic similarity
of WordNet senses for verbs within the same class; (4)
probabilistic correlations between WordNet data and attributes
of the verb classes. The best results achieved 72% precision
and 58% recall, versus a lower bound of 62% precision and
38% recall for assigning the most frequently occurring WordNet
sense, and an upper bound of 87% precision and 75% recall for
human judgment.
(Cross-referenced as UMIACS-TR-2001-18)
(Cross-referenced as LAMP-TR-068)