Rapid Adaptation of POS Tagging for Domain Specific Uses

dc.contributor.authorMiller, John
dc.contributor.authorBloodgood, Michael
dc.contributor.authorTorii, Manabu
dc.contributor.authorVijay-Shanker, K
dc.date.accessioned2014-08-25T21:15:47Z
dc.date.available2014-08-25T21:15:47Z
dc.date.issued2006-06
dc.description.abstractPart-of-speech (POS) tagging is a fundamental component for performing natural language tasks such as parsing, information extraction, and question answering. When POS taggers are trained in one domain and applied in significantly different domains, their performance can degrade dramatically. We present a methodology for rapid adaptation of POS taggers to new domains. Our technique is unsupervised in that a manually annotated corpus for the new domain is not necessary. We use suffix information gathered from large amounts of raw text as well as orthographic information to increase the lexical coverage. We present an experiment in the Biological domain where our POS tagger achieves results comparable to POS taggers specifically trained to this domain.en_US
dc.identifierhttps://doi.org/10.13016/M2059S
dc.identifier.citationJohn E. Miller, Michael Bloodgood, Manabu Torii, and K. Vijay-Shanker. 2006. Rapid adaptation of POS tagging for domain specific uses. In Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology, pages 118-119, New York, New York, June. Association for Computational Linguistics.en_US
dc.identifier.urihttp://hdl.handle.net/1903/15583
dc.language.isoen_USen_US
dc.publisherAssociation for Computational Linguisticsen_US
dc.relation.isAvailableAtCenter for Advanced Study of Language
dc.relation.isAvailableAtDigitial Repository at the University of Maryland
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md)
dc.subjectcomputer scienceen_US
dc.subjectstatistical methodsen_US
dc.subjectartificial intelligenceen_US
dc.subjectmachine learningen_US
dc.subjectcomputational linguisticsen_US
dc.subjectnatural language processingen_US
dc.subjecthuman language technologyen_US
dc.subjecttext processingen_US
dc.subjectTransformation Based Learningen_US
dc.subjectpart-of-speech taggingen_US
dc.subjectPOS taggingen_US
dc.subjectdomain-specific POS taggingen_US
dc.subjectdomain-specific part-of-speech taggingen_US
dc.subjectdomain adaptationen_US
dc.subjectrapid adaptationen_US
dc.subjectrapid domain adaptationen_US
dc.subjectunsupervised domain adaptationen_US
dc.subjectBioNLPen_US
dc.subjectbiomedical natural language processingen_US
dc.subjectbiomedical text processingen_US
dc.subjectbiomedical POS taggingen_US
dc.subjectbiomedical part-of-speech taggingen_US
dc.subjectsuffix-based part-of-speech taggingen_US
dc.subjectsuffix-based POS taggingen_US
dc.titleRapid Adaptation of POS Tagging for Domain Specific Usesen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
rapidAdaptationOf_POS_TaggingBioNLP2006.pdf
Size:
149.24 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.57 KB
Format:
Item-specific license agreed upon to submission
Description: