Mapping Lexical Entries in a Verbs Database to WordNet Senses

Green, Rebecca; Pearl, Lisa; Dorr, Bonnie J.; Resnik, Philip

Files

CS-TR-4230.ps (145.35 KB)

CS-TR-4230.pdf (173.59 KB)

Date

2001-05-10

Abstract

This paper describes automatic techniques for mapping 9611 entries in a database of English verbs to WordNet senses. The verbs were initially grouped into 491 classes based on syntactic categories. Mapping these classified verbs into WordNet senses provides a resource that may be used for disambiguation in multilingual applications such as machine translation and cross-language information retrieval. Our techniques make use of (1) a training set of 1791 disambiguated entries, representing 1442 verb entries from 167 of the categories; (2) word sense probabilities based on frequency counts in a previously tagged corpus; (3) semantic similarity of WordNet senses for verbs within the same class; and (4) probabilistic correlations between WordNet data and attributes of the verb classes. The best results achieved 72% precision and 58% recall, versus a lower bound of 62% precision and 38% recall for assigning the most frequently occurring WordNet sense, and an upper bound of 87% precision and 75% recall for human judgment. (Cross-referenced as UMIACS-TR-2001-18) (Cross-referenced as LAMP-TR-068)

URI (handle)

http://hdl.handle.net/1903/1126

Collections

Technical Reports from UMIACS
Technical Reports of the Computer Science Department
