An Approach to Reducing Annotation Costs for BioNLP
dc.contributor.author | Bloodgood, Michael | |
dc.contributor.author | Vijay-Shanker, K | |
dc.date.accessioned | 2014-08-26T19:27:04Z | |
dc.date.available | 2014-08-26T19:27:04Z | |
dc.date.issued | 2008-06 | |
dc.description.abstract | There is a broad range of BioNLP tasks for which active learning (AL) can significantly reduce annotation costs and a specific AL algorithm we have developed is particularly effective in reducing annotation costs for these tasks. We have previously developed an AL algorithm called ClosestInitPA that works best with tasks that have the following characteristics: redundancy in training material, burdensome annotation costs, Support Vector Machines (SVMs) work well for the task, and imbalanced datasets (i.e. when set up as a binary classification problem, one class is substantially rarer than the other). Many BioNLP tasks have these characteristics and thus our AL algorithm is a natural approach to apply to BioNLP tasks. | en_US |
dc.identifier | https://doi.org/10.13016/M2VC7V | |
dc.identifier.citation | Michael Bloodgood and K. Vijay-Shanker. 2008. An approach to reducing annotation costs for BioNLP. In Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, pages 104-105, Columbus, Ohio, June. Association for Computational Linguistics. | en_US |
dc.identifier.uri | http://hdl.handle.net/1903/15584 | |
dc.language.iso | en_US | en_US |
dc.publisher | Association for Computational Linguistics | en_US |
dc.relation.isAvailableAt | Center for Advanced Study of Language | |
dc.relation.isAvailableAt | Digitial Repository at the University of Maryland | |
dc.relation.isAvailableAt | University of Maryland (College Park, Md) | |
dc.subject | computer science | en_US |
dc.subject | statistical methods | en_US |
dc.subject | artificial intelligence | en_US |
dc.subject | machine learning | en_US |
dc.subject | computational linguistics | en_US |
dc.subject | natural language processing | en_US |
dc.subject | human language technology | en_US |
dc.subject | text processing | en_US |
dc.subject | active learning | en_US |
dc.subject | selective sampling | en_US |
dc.subject | query learning | en_US |
dc.subject | annotation bottleneck | en_US |
dc.subject | annotation costs | en_US |
dc.subject | support vector machines | en_US |
dc.subject | SVMs | en_US |
dc.subject | cost-weighted support vector machines | en_US |
dc.subject | cost-weighted SVMs | en_US |
dc.subject | imbalanced data | en_US |
dc.subject | imbalanced datasets | en_US |
dc.subject | asymmetric cost factors | en_US |
dc.subject | asymmetric cost weights | en_US |
dc.subject | cost-sensitive learning | en_US |
dc.subject | cost-sensitive active learning | en_US |
dc.subject | imbalanced learning | en_US |
dc.subject | BioNLP | en_US |
dc.subject | biomedical natural language processing | en_US |
dc.subject | biomedical text processing | en_US |
dc.subject | protein-protein interaction extraction | en_US |
dc.subject | Medline text classification | en_US |
dc.subject | biomedical named entity recognition | en_US |
dc.subject | biomedical NER | en_US |
dc.subject | biomedical named entity classification | en_US |
dc.title | An Approach to Reducing Annotation Costs for BioNLP | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- reducingAnnotationCostsBioNLP2008.pdf
- Size:
- 60.85 KB
- Format:
- Adobe Portable Document Format