Annotating Cognates and Etymological Origin in Turkic Languages

dc.contributor.authorMericli, Benjamin
dc.contributor.authorBloodgood, Michael
dc.date.accessioned2014-08-12T18:29:41Z
dc.date.available2014-08-12T18:29:41Z
dc.date.issued2012-05
dc.description.abstractTurkic languages exhibit extensive and diverse etymological relationships among lexical items. These relationships make the Turkic languages promising for exploring automated translation lexicon induction by leveraging cognate and other etymological information. However, due to the extent and diversity of the types of relationships between words, it is not clear how to annotate such information. In this paper, we present a methodology for annotating cognates and etymological origin in Turkic languages. Our method strives to balance the amount of research effort the annotator expends with the utility of the annotations for supporting research on improving automated translation lexicon induction.en_US
dc.identifier.citationBenjamin S. Mericli and Michael Bloodgood. 2012. Annotating cognates and etymological origin in Turkic languages. In Proceedings of the First Workshop on Language Resources and Technologies for Turkic Languages at the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 47-51, Istanbul, Turkey, May. European Language Resources Association.en_US
dc.identifier.urihttp://hdl.handle.net/1903/15564
dc.language.isoen_USen_US
dc.publisherEuropean Language Resources Associationen_US
dc.relation.isAvailableAtCenter for Advanced Study of Language
dc.relation.isAvailableAtDigitial Repository at the University of Maryland
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md)
dc.rights.licensePublished with the permission of ELRA. This paper was published within the proceedings of the LREC 2012 Conference. © 1998-2012 ELRA - European Language Resources Association. All rights reserved.
dc.subjectcomputational linguisticsen_US
dc.subjectnatural language processingen_US
dc.subjecthuman language technologyen_US
dc.subjecttranslation lexicon inductionen_US
dc.subjectcognates detectionen_US
dc.subjectetymologyen_US
dc.subjectdata annotationen_US
dc.subjectannotation guidelinesen_US
dc.subjectannotation methodologyen_US
dc.subjectannotation manualen_US
dc.subjectannotation guideen_US
dc.subjectinter-annotator agreementen_US
dc.subjectcognates annotationen_US
dc.subjectetymology annotationen_US
dc.subjectTurkic languagesen_US
dc.subjectTurkic cognatesen_US
dc.subjectTurkic etymologyen_US
dc.subjectloanwordsen_US
dc.subjectborrowed wordsen_US
dc.subjecthistorical linguisticsen_US
dc.titleAnnotating Cognates and Etymological Origin in Turkic Languagesen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
annotatingTurkicCognates_LREC_TurkicWorkshop2012.pdf
Size:
536.03 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.57 KB
Format:
Item-specific license agreed upon to submission
Description: