Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing
dc.contributor.author | Prabhakaran, Vinodkumar | |
dc.contributor.author | Bloodgood, Michael | |
dc.contributor.author | Diab, Mona | |
dc.contributor.author | Dorr, Bonnie | |
dc.contributor.author | Levin, Lori | |
dc.contributor.author | Piatko, Christine | |
dc.contributor.author | Rambow, Owen | |
dc.contributor.author | Van Durme, Benjamin | |
dc.date.accessioned | 2014-07-17T17:29:32Z | |
dc.date.available | 2014-07-17T17:29:32Z | |
dc.date.issued | 2012-07-13 | |
dc.description.abstract | We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatically training a modality tagger where we first gathered sentences based on a high-recall simple rule-based modality tagger and then provided these sentences to Mechanical Turk annotators for further annotation. We used the resulting set of training data to train a precise modality tagger using a multi-class SVM that delivers good performance. | en_US |
dc.identifier.citation | Vinodkumar Prabhakaran, Michael Bloodgood, Mona Diab, Bonnie Dorr, Lori Levin, Christine D. Piatko, Owen Rambow, and Benjamin Van Durme. 2012. Statistical modality tagging from rule-based annotations and crowdsourcing. In Proceedings of the ACL-2012 Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics (ExProM-2012), pages 57-64, Jeju, Republic of Korea, July. Association for Computational Linguistics. | en_US |
dc.identifier.uri | http://hdl.handle.net/1903/15543 | |
dc.language.iso | en_US | en_US |
dc.publisher | Association for Computational Linguistics | en_US |
dc.relation.isAvailableAt | Center for Advanced Study of Language | |
dc.relation.isAvailableAt | Digitial Repository at the University of Maryland | |
dc.relation.isAvailableAt | University of Maryland (College Park, Md) | |
dc.subject | computer science | en_US |
dc.subject | artificial intelligence | en_US |
dc.subject | statistical methods | en_US |
dc.subject | machine learning | en_US |
dc.subject | computational linguistics | en_US |
dc.subject | natural language processing | en_US |
dc.subject | human language technology | en_US |
dc.subject | semantics | en_US |
dc.subject | modality | en_US |
dc.subject | crowdsourcing | en_US |
dc.subject | Mechanical Turk | en_US |
dc.subject | Support Vector Machines | en_US |
dc.subject | cost-weighted Support Vector Machines | en_US |
dc.subject | annotation confidence | en_US |
dc.subject | statistical modality tagging | en_US |
dc.subject | automatic modality tagging | en_US |
dc.title | Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- statisticalModalityTaggingExProM2012.pdf
- Size:
- 125.48 KB
- Format:
- Adobe Portable Document Format