Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation
dc.contributor.author | Bloodgood, Michael | |
dc.contributor.author | Callison-Burch, Chris | |
dc.date.accessioned | 2014-07-31T11:46:32Z | |
dc.date.available | 2014-07-31T11:46:32Z | |
dc.date.issued | 2010-07 | |
dc.description.abstract | We explore how to improve machine translation systems by adding more translation data in situations where we already have substantial resources. The main challenge is how to buck the trend of diminishing returns that is commonly encountered. We present an active learning-style data solicitation algorithm to meet this challenge. We test it, gathering annotations via Amazon Mechanical Turk, and find that we get an order of magnitude increase in performance rates of improvement. | en_US |
dc.description.sponsorship | Johns Hopkins University Human Language Technology Center of Excellence | en_US |
dc.identifier.citation | Michael Bloodgood and Chris Callison-Burch. 2010. Bucking the trend: cost-focused active learning for statistical machine translation. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 854-864, Uppsala, Sweden, July. Association for Computational Linguistics. | en_US |
dc.identifier.uri | http://hdl.handle.net/1903/15549 | |
dc.language.iso | en_US | en_US |
dc.publisher | Association for Computational Linguistics | en_US |
dc.relation.isAvailableAt | Center for Advanced Study of Language | |
dc.relation.isAvailableAt | Digitial Repository at the University of Maryland | |
dc.relation.isAvailableAt | University of Maryland (College Park, Md) | |
dc.subject | computer science | en_US |
dc.subject | statistical methods | en_US |
dc.subject | artificial intelligence | en_US |
dc.subject | machine learning | en_US |
dc.subject | computational linguistics | en_US |
dc.subject | natural language processing | en_US |
dc.subject | human language technology | en_US |
dc.subject | translation technology | en_US |
dc.subject | machine translation | en_US |
dc.subject | statistical machine translation | en_US |
dc.subject | active learning | en_US |
dc.subject | selective sampling | en_US |
dc.subject | query learning | en_US |
dc.subject | stopping criteria | en_US |
dc.subject | stopping methods | en_US |
dc.subject | crowdsourcing | en_US |
dc.subject | cost-focused active learning | en_US |
dc.subject | cost-efficient annotation | en_US |
dc.subject | annotation costs | en_US |
dc.subject | annotation bottleneck | en_US |
dc.subject | annotation cost metrics | en_US |
dc.subject | Urdu-English translation | en_US |
dc.subject | uncertainty-based active learning | en_US |
dc.subject | uncertainty-based sampling | en_US |
dc.subject | Amazon Mechanical Turk | en_US |
dc.subject | Highlighted N-Gram Method | en_US |
dc.title | Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- buckingTheTrendActiveLearningMachineTranslationACL2010.pdf
- Size:
- 509.94 KB
- Format:
- Adobe Portable Document Format